AI Models Directory

Browse and discover AI models from leading companies in the industry.

Gen 4

Gemini 3.1 Flash Image

By Google

🖼️Image generation 📝Content creation

NewImage

Released 1d ago
Gen 4 Reve

Reve 1.5

By Reve AI

🖼️Image generation 📽️Image to video 🖌️Image editing 🎨Custom image generation

NewImage

Released 3d ago
Gen 3

Qwen 3.5 27B

By Alibaba

Qwen3.5-27B is a dense Qwen3.5 model size aimed at strong general performance in a smaller footprint than the flagship MoE variants.

🖼️Image generation 📷Images

NewMultimodal

Released 3d ago
Gen 3

Qwen 3.5 122B A10B

By Alibaba

Qwen3.5-122B-A10B is a larger Mixture-of-Experts Qwen3.5 model with 122B total parameters and 10B active per token, targeting higher peak capability while staying compute-efficient at inference.

📷Images 🎥Videos 📚Stories

NewMultimodal

Released 3d ago
Gen 3

Qwen 3.5 35B A3B

By Alibaba

wen3.5-35B-A3B is a Mixture-of-Experts Qwen3.5 model with 35B total parameters and 3B active per token, built for strong capability with much lower active compute.

🖼️Image generation 💻Coding 📝Writing 📷Images

NewMultimodal

Released 3d ago
Gen 7

Mercury 2

By Inception Labs

Mercury 2 is Inception Labs’ diffusion-based reasoning LLM designed for real-time latency, with tunable reasoning, long context, native tool use, and schema-aligned JSON output.

🎯Goals 📝Writing 📊Infographics 🔧Code optimization

NewText

Released 3d ago
Gen 3

GPT Realtime 1.5

By OpenAI

gpt-realtime-1.5 is OpenAI’s flagship real-time voice model for audio-in, audio-out use cases like voice agents and customer support, with support for text, audio, and image inputs and text and audio outputs.

🎙Voice chatting 🔊Text to speech 🎤Voice assistants

NewMultimodal

Released 4d ago
Gen 3

Qwen 3.5 Flash

By Alibaba

Qwen3.5-Flash is the speed and cost optimized Qwen3.5 variant designed for high-throughput chat and multimodal prompting with a very long context window.

💬Chatting 📷Images 🎥Videos

NewMultimodal

Released 4d ago
Gen 7

Embed 4

By Cohere

Multimodal embedding model aimed at enterprise search and retrieval over multimodal data.

🔍Multimodal search

NewText

Released 7d ago
Gen 2

CSM

By Sesame AI Labs

Conversational speech generation model that generates audio codes from text and audio inputs for dialogue style speech output.

🔊Text to speech

NewCoding

Released 7d ago
Gen 3 Gemini

Gemini 3.1 Pro

By Google

Gemini 3.1 Pro is Google’s upgraded Gemini model built for complex tasks, with improved core reasoning, and it is available across the Gemini API, Vertex AI, the Gemini app, and NotebookLM.

🖥️Code editing 🤯Zizek debates 🎨Logo design

NewText

Released 8d ago
Gen 2 Qwen

Qwen 2.5 Coder 32B

By Alibaba

Code-focused Qwen model family aimed at code generation, reasoning, and fixing across multiple parameter sizes.

💻Coding

NewCoding

Released 8d ago
Gen 3

Phoenix 4

By Tavus

Phoenix-4 is Tavus’ real-time human rendering model that generates continuous facial behavior with controllable emotional expression and active listening cues.

💬Conversational avatars 💑Virtual girlfriend 🔄Message rephrasing

NewMultimodal

Released 9d ago
Gen 4

GigaBrain 0.5M

By GigaBrain

World-model-conditioned VLA trained via world-model-based reinforcement learning for robot manipulation and self-improvement.

🔒Private conversations 🔄Workflows 🚀Tech futurism conversations 🚰Sanitary engineering research

NewText

Released 9d ago
Gen 7

Tiny Aya

By Cohere

Small multilingual model family aimed at running offline/at-the-edge while supporting a large number of languages.

🌐Text translation 🔍SEO content 🌎Language learning

NewText

Released 9d ago
Gen 3

Lyria 3

By Google

Lyria 3 is Google DeepMind’s music generation model that lets Gemini app users create 30-second tracks from text prompts or from uploaded photos and video.

🎵Music 🎵Music lyrics

NewAudio

Released 9d ago
Gen 3 Claude

Claude Sonnet 4.6

By Anthropic

Anthropic’s Sonnet 4.6 model for scaled production and complex tasks across coding, agents, and professional workflows.

💻Coding 🤖Agents 🖍Coloring books

NewText

Released 10d ago
Gen 3 Grok

Grok 4.2

By xAI

Grok 4.2 is referenced as a Grok 4-series update for xAI’s Grok assistant, focused on general-purpose help across reasoning, coding, and real-time info retrieval inside the Grok experience.

📷Images 💻Coding 📚Learning 👗Fashion

NewMultimodal

Released 10d ago
Gen 3 Qwen

Qwen3.5 397B A17B

By Alibaba

Qwen3.5-397B-A17B is a native vision-language model from Alibaba's Qwen team with 397B parameters (17B active per token). In the Qwen3.5-Plus hosted variant it supports up to 1M-token multimodal context, with strong reasoning, coding, and agentic tool use for large-scale deployment.

🖼️Image generation 💻Coding 📷Images 💬Chatting

NewMultimodal

Released 10d ago
Gen 3

moondream 2b

By Moondream

Moondream 2B is a compact vision-language model variant designed for efficient image understanding and instruction-following with reduced memory usage.

🖼️Image generation 🔍Code reviews 🔍Image recognition

NewMultimodal

Released 13d ago
Gen 3 Seed

Seed 2.0

By ByteDance

Seed 2.0 is described publicly only as a new ByteDance Seed language model for Doubao, but there is not yet any reliable, detailed public technical description of its architecture, context length, or training data that I can quote.

🎬Video editing 🔊Text to speech 📰News analysis

NewMultimodal

Released 13d ago

No models found

Try adjusting your search or filters.

...

Search

AI Models Directory

No models found

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: