Moondream 3 Preview
Overview
Moondream 3 Preview is a compact frontier-oriented vision-language model built for fast visual reasoning, grounding, OCR, object detection, pointing, and structured output. It uses a 9B MoE architecture with 2B active parameters and extends context length to 32K, aiming to deliver strong real-world vision performance while staying efficient and inexpensive to run.
About Moondream
State-of-the-art visual understanding at speeds that make continuous processing possible. Point, detect, count, and reason—without compromise.
Tools using Moondream 3 Preview
No tools found for this model yet.
