Ai2 Newsletter

May 2025

Top story - Meet OLMo 2 1B

We're excited to round out the OLMo 2 family with its smallest member, OLMo 2 1B, which surpasses peer models such as Gemma 3 1B and Llama 3.2 1B. The 1B model size enables rapid iteration for researchers, more local development, and a more complete picture of how our recipe scales.

Atlantes: the first AI-powered GPS model for real-time global scale maritime intelligence

To effectively conserve and sustainably manage our critical ocean resources, we must maintain a watchful eye on maritime activity. We're open-sourcing Atlantes, a powerful new suite of AI models that use GPS data to analyze vessel behavior in real time. We turned massive, unstructured, noisy, and irregular GPS data into real-time information to help understand vessel behaviors like fishing, transiting, and anchoring.

This model is one component in a complex and global effort to monitor the planet's oceans for sustainability and conservation. We hope that by open-sourcing Atlantes, we can enable other researchers and environmentalists to better understand the strengths and limitations of this approach and also foster more widespread adoption of maritime transparency.

DataDecide: How to predict best pretraining data with small experiments

Ever wonder how LLM developers choose their pretraining data? It’s not guesswork: all AI labs create small-scale models as experiments, but the models and their data are rarely shared.

To empower open exploration of these questions, we released DataDecide, a suite of 1,050 models, 30k checkpoints, 25 datasets, and 10 benchmarks. We evaluate all models across a suite of 10 downstream tasks and calculate how accurately we can use small models to predict that one pretraining corpus will lead to better performance than another for our largest models.
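
One way to picture the prediction question above is as a pairwise comparison: for every pair of pretraining corpora, check whether the small models pick the same winner as the largest models. The sketch below is only an illustration of that idea, not the DataDecide code itself; the function name `decision_accuracy`, the corpus names, and all scores are hypothetical.

```python
from itertools import combinations


def decision_accuracy(small_scores, large_scores):
    """Fraction of corpus pairs where small and large models agree on the winner.

    Both arguments map a corpus name to a benchmark score (higher is better).
    The names and scores are illustrative assumptions, not DataDecide results.
    """
    if set(small_scores) != set(large_scores):
        raise ValueError("both score tables must cover the same corpora")

    agree = 0
    total = 0
    for a, b in combinations(sorted(small_scores), 2):
        small_diff = small_scores[a] - small_scores[b]
        large_diff = large_scores[a] - large_scores[b]
        if large_diff == 0:
            continue  # no winner at the large scale, so nothing to predict
        total += 1
        if small_diff * large_diff > 0:
            agree += 1  # same corpus wins at both scales

    if total == 0:
        raise ValueError("no corpus pair has a clear winner at the large scale")
    return agree / total


# Hypothetical scores for three made-up corpora.
small = {"corpus_a": 0.31, "corpus_b": 0.35, "corpus_c": 0.33}
large = {"corpus_a": 0.52, "corpus_b": 0.61, "corpus_c": 0.50}
print(decision_accuracy(small, large))
```

In this made-up example the small models rank two of the three corpus pairs the same way the large models do, so the function returns two thirds.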

Which benchmarks should we be evaluating on? Are scaling laws better? Can we use a better metric than accuracy? Read the blog to see our recommendations for model developers.

OLMoTrace increases transparency in language models

"With OLMoTrace, we’re actually bringing accessibility to openness, enabling everybody to start looking into the inner workings of the relationships between the input and output of these models." – Ali Farhadi, Ai2 CEO
