Latest research
June 2025 - Revisiting critical batch size for large-batch OLMo pretraining
We introduce a more reliable method to measure the critical batch size (CBS), analyze how CBS changes over…
April 2025 - Introducing Atlantes: the first AI-powered GPS model for real-time global scale maritime intelligence
Atlantes: a system of transformers for real-time GPS modeling.
April 2025 - DataDecide: How to predict best pretraining data with small experiments
Explore the secrets of how language model developers make decisions with DataDecide.
April 2025 - Going beyond open data – increasing transparency and trust in language models with OLMoTrace
OLMoTrace lets you trace the outputs of language models back to their full, multi-trillion-token training data in…
March 2025 - Introducing CodeScientist: A step toward automated scientific discovery
Will there be a system that automatically identifies gaps in scientific knowledge and runs experiments?
March 2025 - Introducing Ai2 Paper Finder
Ai2 Paper Finder is an LLM-powered literature search system that mimics the iterative paper-finding process.
March 2025 - OLMo 2 32B: First fully open model to outperform GPT 3.5 and GPT 4o mini
Introducing OLMo 2 32B, the most capable and largest model in the OLMo 2 family.
February 2025 - olmOCR: Efficient PDF text extraction with vision language models
We introduce olmOCR, a high-performance toolkit designed to convert PDFs and document images into clean,…
February 2025 - OLMoE, meet iOS
Our mixture-of-experts model is available on the Apple app store! The OLMoE app allows anyone to test the model…