TAAFT
Free mode
100% free
Freemium
Free Trial
Create tool

ERNIE 4.5-21B-A3B

By Baidu
New Text Gen
Released: June 30, 2025

Overview

ERNIE 4.5-21B-A3B is Baidu’s efficient MoE variant of ERNIE 4.5—about 21B total parameters with ~3B active per token—built to balance strong reasoning and coding accuracy with low latency. It supports long context, tool/function calling, structured JSON output, and streaming via ERNIE Bot and the Qianfan API.

Description

ERNIE 4.5-21B-A3B uses a Mixture-of-Experts design to route each token through a small subset of the model, keeping responses quick while preserving the step-by-step analysis ERNIE is known for. The model is instruction-tuned for careful, controllable behavior; it follows schemas for JSON, calls tools reliably for agent workflows, and maintains coherence on extended prompts for document and code understanding. In production it slots neatly into retrieval-augmented apps, customer and employee copilots, analytics explainers, and bilingual Chinese/English assistants where predictable latency and cost matter. Teams typically choose 21B-A3B when they want more capability than lightweight models without the heavier footprint of the largest ERNIE configurations, while retaining the same API surfaces, streaming, guardrails, and deployment options available through ERNIE Bot and Baidu’s Qianfan platform.

About Baidu

Baidu is a Chinese multinational technology company specializing in internet-related services, products, and artificial intelligence.

Industry: Internet
Company Size: 10001+
Location: Beijing, CN
View Company Profile

Related Models

Last updated: September 22, 2025