Open Language Model: OLMo
A highly performant, truly open LLM and framework, intentionally designed to provide access to the data, training code, models, and evaluation code needed to advance AI and study language models collectively.
What OLMo provides for researchers and developers
More transparency
With full insight into the training data behind the model, researchers can work more efficiently and avoid relying on qualitative assumptions about how the model performs.
Less carbon
By opening the full training and evaluation ecosystem, we can radically reduce redundant development effort, a critical step in decarbonizing AI.
Lasting impact
By keeping models and their datasets in the open rather than hidden behind APIs, we enable researchers to learn from and build on previous models and work.
Now is the time for truly open AI research
Data - Dolma
To support the study of the relationship between the data and any model trained on it, we release Dolma, the pretraining dataset powering OLMo. Dolma is an open dataset built from a diverse mix of web content, academic publications, code, books, and encyclopedic materials. To date, we have released multiple versions of Dolma, each improving on the previous one with more diverse and higher-quality data. All versions of Dolma are openly available for download from the Hugging Face Hub; a minimal loading sketch follows the links below.
Read the Dolma paper to learn more.
Explore our open-source tools to create and refine Dolma.
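As an illustration only (not Dolma's official tooling), documents can be streamed with the Hugging Face datasets library. The repository name allenai/dolma and the per-document text field are assumptions here; check the Dolma dataset card for the exact identifier and schema.

```python
from datasets import load_dataset

# Stream documents instead of downloading the full corpus to disk.
# Depending on your datasets version, script-based datasets may also
# require trust_remote_code=True.
dolma = load_dataset("allenai/dolma", split="train", streaming=True)

for i, doc in enumerate(dolma):
    # "text" is the assumed field name for the document body.
    print(doc.get("text", "")[:200])
    if i >= 2:
        break
```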
Training - OLMo
OLMo is our series of open language models, which includes full model weights, training code, training logs, training metrics in the form of Weights & Biases logs, and inference code. To date, we have released multiple models at the 1B and 7B scales, trained on 2-3 trillion tokens. For every OLMo model, we have released all code, weights, and 500+ intermediate checkpoints, each supported by tooling that traces back to the exact data used at that point during training. All OLMo weights and code are released under the Apache 2.0 License and are available for download from the Hugging Face Hub; a minimal loading sketch follows the link below.
Read the OLMo paper to learn more.
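As a minimal sketch (not the official OLMo training or inference code), the released weights can be loaded through Hugging Face transformers. The repository name allenai/OLMo-7B and the example revision string are assumptions; see the model cards for the exact identifiers of the intermediate checkpoints.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "allenai/OLMo-7B"  # assumed repository name; check the model card
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

# Intermediate checkpoints are exposed as Hub revisions, e.g. (assumed name):
# model = AutoModelForCausalLM.from_pretrained(model_id, revision="step1000-tokens4B")

inputs = tokenizer("Language modeling is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```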
Adaptation - Tulu
Tulu is a suite of models and datasets for fine-tuning state-of-the-art language models. Drawing on the latest open datasets, Tulu models and recipes strengthen instruction following, reasoning, and coding abilities. The Tulu suite includes models of many sizes, from 7B to 70B parameters, trained with everything from Direct Preference Optimization (DPO) to Proximal Policy Optimization (PPO). We apply the lessons from the Tulu models to OLMo to produce OLMo Instruct, which is available for download on the Hugging Face Hub; a minimal prompting sketch follows the links below.
We perform adaptation on our post-training datasets, including the Tulu SFT mixture and our cleaned version of UltraFeedback.
Learn more from the original Tulu paper, the Tulu 2 paper, or our latest work unpacking DPO vs. PPO.
Fine-tune your own models with Open Instruct on GitHub.
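As an illustration only, an adapted OLMo Instruct model can be prompted through its chat template with Hugging Face transformers. The repository name allenai/OLMo-7B-Instruct is an assumption; consult the model card for the exact identifier and template.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "allenai/OLMo-7B-Instruct"  # assumed repository name
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

messages = [{"role": "user", "content": "Summarize what DPO does in one sentence."}]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=False)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```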
Evaluation - Paloma
Paloma is a benchmark for evaluating open language models across many different domains, ranging from niche artist communities to Reddit forums on mental health. We have already evaluated a number of models, including six 1B baselines that we trained on different popular corpora (such as Dolma), to understand how language model performance varies across 585 domains. We encourage you to run our standardized inference code on additional models and submit the results to extend our benchmark; a simplified perplexity sketch follows the link below.
Read the Paloma paper to learn more.
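The core measurement Paloma standardizes is perplexity per domain. The sketch below shows the general idea on placeholder documents with an assumed allenai/OLMo-1B checkpoint; Paloma's own inference code fixes details such as document splitting, tokenization, and aggregation that this sketch glosses over.

```python
import math

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "allenai/OLMo-1B"  # assumed repository name
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
model.eval()

documents = ["Example document from one Paloma domain."]  # placeholder data

total_nll, total_tokens = 0.0, 0
for text in documents:
    enc = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        out = model(**enc, labels=enc["input_ids"])
    n_tokens = enc["input_ids"].shape[1] - 1  # loss covers next-token predictions
    total_nll += out.loss.item() * n_tokens
    total_tokens += n_tokens

print("perplexity:", math.exp(total_nll / total_tokens))
```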
Evaluation - OLMES
OLMES is a standard for reproducible language model evaluations that is open, practical, completely documented, and applicable to current leaderboards and evaluation code bases. We identify and review the varying factors in evaluation practices adopted by the community and provide recommendations guided by results from the existing literature and by new experiments investigating open questions. OLMES is designed to facilitate robust comparisons of model performance, both during model development and when comparing final, powerful models, and can be used across a range of model sizes, e.g., from 1B to 70B parameters.
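One of the factors OLMES pins down is how multiple-choice questions are scored. The sketch below illustrates a common formulation, ranking answer choices by the log-likelihood the model assigns to each continuation, with an assumed allenai/OLMo-1B checkpoint. It is an example of the kind of choice OLMES standardizes, not the OLMES implementation itself.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "allenai/OLMo-1B"  # assumed repository name
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
model.eval()

question = "Question: What gas do plants absorb during photosynthesis?\nAnswer:"
choices = [" Oxygen", " Carbon dioxide", " Nitrogen"]

def continuation_logprob(prefix: str, continuation: str) -> float:
    """Sum log-probabilities of the continuation tokens given the prefix.

    Assumes the prefix tokenization is a prefix of the full tokenization,
    which usually holds when the continuation starts with a space.
    """
    prefix_ids = tokenizer(prefix, return_tensors="pt")["input_ids"]
    full_ids = tokenizer(prefix + continuation, return_tensors="pt")["input_ids"]
    with torch.no_grad():
        logits = model(full_ids).logits
    # Position i of log_probs predicts the token at position i + 1.
    log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
    cont_positions = range(prefix_ids.shape[1] - 1, full_ids.shape[1] - 1)
    return sum(log_probs[i, full_ids[0, i + 1]].item() for i in cont_positions)

scores = [continuation_logprob(question, c) for c in choices]
print("model picks:", choices[scores.index(max(scores))])
```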
Get in touch
For questions or feedback, you can reach us at olmo@allenai.org or open an issue on GitHub.
This work was made possible by our partners
AMD, CSC (ICT Solutions for Brilliant Minds), Databricks, the Kempner Institute for the Study of Natural & Artificial Intelligence, and the University of Washington. Additional thanks to EleutherAI, Meta, Stanford CRFM, Together AI, and Hugging Face.