AI Models Directory
Browse and discover AI models from leading companies in the industry.
-
New | Image | Released 1d ago
-
New | Image | Released 3d ago
-
By Alibaba | Qwen3.5-27B is a dense Qwen3.5 model aimed at strong general performance in a smaller footprint than the flagship MoE variants. | New | Multimodal | Released 3d ago
-
By Alibaba | Qwen3.5-122B-A10B is a larger Mixture-of-Experts Qwen3.5 model with 122B total parameters and 10B active per token, targeting higher peak capability while staying compute-efficient at inference. | New | Multimodal | Released 3d ago
-
By Alibaba | Qwen3.5-35B-A3B is a Mixture-of-Experts Qwen3.5 model with 35B total parameters and 3B active per token, built for strong capability with much lower active compute. | New | Multimodal | Released 3d ago
-
Mercury 2 is Inception Labs' diffusion-based reasoning LLM designed for real-time latency, with tunable reasoning, long context, native tool use, and schema-aligned JSON output. | New | Text | Released 3d ago
-
By OpenAI | gpt-realtime-1.5 is OpenAI's flagship real-time voice model for audio-in, audio-out use cases like voice agents and customer support, with support for text, audio, and image inputs and text and audio outputs. | New | Multimodal | Released 4d ago
-
By Alibaba | Qwen3.5-Flash is the speed- and cost-optimized Qwen3.5 variant designed for high-throughput chat and multimodal prompting with a very long context window. | New | Multimodal | Released 4d ago
-
By Cohere | Multimodal embedding model aimed at enterprise search and retrieval over multimodal data. | New | Text | Released 7d ago
-
Conversational speech generation model that generates audio codes from text and audio inputs for dialogue-style speech output. | New | Coding | Released 7d ago
-
By Google | Gemini 3.1 Pro is Google's upgraded Gemini model built for complex tasks, with improved core reasoning. It is available across the Gemini API, Vertex AI, the Gemini app, and NotebookLM. | New | Text | Released 8d ago
-
By Alibaba | Code-focused Qwen model family aimed at code generation, reasoning, and bug fixing, offered in multiple parameter sizes. | New | Coding | Released 8d ago
-
By Tavus | Phoenix-4 is Tavus' real-time human rendering model that generates continuous facial behavior with controllable emotional expression and active listening cues. | New | Multimodal | Released 9d ago
-
By GigaBrain | World-model-conditioned VLA trained via world-model-based reinforcement learning for robot manipulation and self-improvement. | New | Text | Released 9d ago
-
By Cohere | Small multilingual model family aimed at running offline or at the edge while supporting a large number of languages. | New | Text | Released 9d ago
-
By Google | Lyria 3 is Google DeepMind's music generation model that lets Gemini app users create 30-second tracks from text prompts or from uploaded photos and video. | New | Audio | Released 9d ago
-
By Anthropic | Anthropic's Sonnet 4.6 model for scaled production and complex tasks across coding, agents, and professional workflows. | New | Text | Released 10d ago
-
By xAI | Grok 4.2 is referenced as a Grok 4-series update for xAI's Grok assistant, focused on general-purpose help across reasoning, coding, and real-time info retrieval inside the Grok experience. | New | Multimodal | Released 10d ago
-
By Alibaba | Qwen3.5-397B-A17B is a native vision-language model from Alibaba's Qwen team with 397B total parameters (17B active per token). In the Qwen3.5-Plus hosted variant it supports up to 1M-token multimodal context, with strong reasoning, coding, and agentic tool use for large-scale deployment. | New | Multimodal | Released 10d ago
-
By Moondream | Moondream 2B is a compact vision-language model variant designed for efficient image understanding and instruction-following with reduced memory usage. | New | Multimodal | Released 13d ago
-
By ByteDance | Seed 2.0 is described publicly only as a new ByteDance Seed language model for Doubao; no reliable, detailed public technical description of its architecture, context length, or training data is available yet. | New | Multimodal | Released 13d ago
