Llama 3.3

Llama 3.3

Model family: LLaMA

Llama 3.3 70B refines Meta’s widely used 70B checkpoint to deliver clearer step-by-step reasoning, better code understanding, and more faithful instruction following while keeping latency predictable for production. It’s a text-in/text-out, instruction-tuned model with solid multilingual capability and clean integration points for agents: stable streaming, schema-guided function calls, and deterministic JSON formatting that plays well with RAG and orchestration frameworks. The weights are released under the Llama community license and run across common inference stacks on-prem or in the cloud; quantized builds make single-node serving practical, and standard runtimes handle high-throughput deployments. If you’re coming from earlier 3.x releases, it slots in with minimal prompt or API changes while yielding noticeably stronger analysis and coding quality.

Overview

Llama 3.3 70B is Meta’s updated 70B model in the Llama 3.x family, tuned for stronger reasoning, coding, and instruction following. It supports long-context prompting, tool/function calling, and reliable JSON outputs, with open weights under the Llama license.

💻Coding 📝Writing 🚀Productivity 🙋Text humanization

Pricing

Groq source Together AI source

Compare Llama 3.3 with other models listed in the same vendor pricing tiers and context lengths.

Standard

Model	Input	Cached input	Output	Unit
GPT OSS Safeguard OpenAI	$0.075	-	$0.3	per 1M tokens
Llama 4 Scout Meta Platforms	$0.11	-	$0.34	per 1M tokens
Qwen 3 Alibaba	$0.29	-	$0.59	per 1M tokens
Llama 3.3 This model Meta Platforms	$0.59	-	$0.79	per 1M tokens
Kimi K2 Moonshot AI	$1	$0.5	$3	per 1M tokens

Tier

Standard

Model	Input	Cached input	Output	Unit
Deepseek V4 Pro DeepSeek	$1.74	$0.2	$3.48	per 1M tokens
Gemma 4 31B IT NVFP4 NVIDIA	$0.28	-	$0.86	per 1M tokens
GLM 5 Z.ai	$1	-	$3.2	per 1M tokens
GLM 5.1 Z.ai	$1.4	$0.26	$4.4	per 1M tokens
Kimi K2.6 Moonshot AI	$1.2	$0.2	$4.5	per 1M tokens
Kimi K2.7 Code Moonshot AI	$0.95	$0.19	$4	per 1M tokens
LFM2 24B A2B Liquid AI	$0.03	-	$0.12	per 1M tokens
Llama 3.3 This model Meta Platforms	$1.04	-	$1.04	per 1M tokens
MiniMax M2.5 MiniMax	$0.3	$0.06	$1.2	per 1M tokens
MiniMax M2.7 MiniMax	$0.3	$0.06	$1.2	per 1M tokens
MiniMax M3 MiniMax	$0.3	$0.06	$1.2	per 1M tokens
Nemotron 3 Ultra 550B A55B NVFP4 NVIDIA	$0.6	$0.2	$3.6	per 1M tokens
Qwen 3.5 9B Alibaba	$0.17	-	$0.25	per 1M tokens
Qwen 3.6 Plus Alibaba	$0.5	-	$3	per 1M tokens
Qwen 3.7 Max Alibaba	$1.25	$0.13	$3.75	per 1M tokens
Qwen3 235B A22B Alibaba	$0.2	-	$0.6	per 1M tokens
Qwen3.5 397B A17B Alibaba	$0.6	$0.35	$3.6	per 1M tokens
Qwen3.7-Plus Alibaba	$0.32	-	$1.28	per 1M tokens

Batch

Model	Input	Cached input	Output	Unit
Deepseek V4 Pro DeepSeek	$0.87	$0.2	$1.74	per 1M tokens
Gemma 4 31B IT NVFP4 NVIDIA	$0.28	-	$0.86	per 1M tokens
GLM 5 Z.ai	$1	-	$3.2	per 1M tokens
GLM 5.1 Z.ai	$0.7	$0.26	$2.2	per 1M tokens
Kimi K2.6 Moonshot AI	$1.2	$0.2	$4.5	per 1M tokens
Kimi K2.7 Code Moonshot AI	$0.95	$0.19	$4	per 1M tokens
LFM2 24B A2B Liquid AI	$0.01	-	$0.06	per 1M tokens
Llama 3.3 This model Meta Platforms	$0.52	-	$0.52	per 1M tokens
MiniMax M2.5 MiniMax	$0.3	$0.06	$1.2	per 1M tokens
MiniMax M2.7 MiniMax	$0.15	$0.06	$0.6	per 1M tokens
MiniMax M3 MiniMax	$0.3	$0.06	$1.2	per 1M tokens
Nemotron 3 Ultra 550B A55B NVFP4 NVIDIA	$0.6	$0.2	$3.6	per 1M tokens
Qwen 3.5 35B A3B Alibaba	$0.6	$0.35	$3.6	per 1M tokens
Qwen 3.5 9B Alibaba	$0.17	-	$0.25	per 1M tokens
Qwen 3.6 Plus Alibaba	$0.5	-	$3	per 1M tokens
Qwen3 235B A22B Alibaba	$0.1	-	$0.3	per 1M tokens
Qwen3.7-Plus Alibaba	$1.25	$0.13	$3.75	per 1M tokens

About Meta Platforms

We're connecting people to what they care about, powering new, meaningful experiences, and advancing the state-of-the-art through open research and accessible tooling.

Industry: Technology, Information and Media

Company Size: 78865

Location: Menlo Park, California, US

Website: ai.meta.com

View Company Profile