TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

Apertus 8B Instruct 2509

Model family: LLaMA
Apertus 8B Instruct 2509 is the instruction-tuned variant of the Apertus 8B base model, a decoder-only transformer pretrained on 15T tokens using a staged curriculum of web, code, and curated data. It natively supports 1,811 languages, positioning it among the most massively multilingual open models available. The model is fully open: weights, training data, and training recipes are all publicly released. It is built with compliance in mind, respecting data owner opt-outs and avoiding sources with restrictive terms of service. The context window extends to 65,536 tokens. The model supports tool use for agentic workflows. It was trained on 4,096 GH200 GPUs using Megatron-LM and is deployable via Transformers, vLLM, SGLang, and MLX. Performance on general language understanding benchmarks is competitive with Llama 3.1 8B. Licensed under Apache 2.0.
Text Gen 7
Released: September 17, 2025

Overview

Apertus 8B Instruct 2509 is an 8B parameter open-source instruction-tuned language model pretrained on 15T tokens. It natively supports 1,811 languages, uses fully open data and weights with disclosed training recipes, and is designed to respect data owner opt-outs. It features a 65,536-token context window and supports tool use for agentic tasks.

About Swiss AI Initiative

The Swiss AI Initiative is the world's largest open science/open source effort for AI foundation models, started in December 2023. Seeded with over 10M GPU hours on the Alps supercomputer and a 20M CHF grant from the ETH Domain, it is the first initiative of the Swiss National AI Institute—a partnership between the ETH AI Center and the EPFL AI Center—leveraging 800+ researchers (70 AI-focused professors) from 10+ Swiss academic institutions.

Industry: Research
Location: Zürich, CH
View Company Profile
Last updated: June 30, 2026
0 AIs selected
Clear selection
#
Name
Task