Ai2 blog
July 2025 - SciArena: A New Platform for Evaluating Foundation Models in Scientific Literature Tasks
Discover how SciArena is being used to evaluate foundation models’ capabilities in scientific literature tasks through community-driven, literature-grounded, and multi-disciplinary reasoning.
April 2025 - Going beyond open data – increasing transparency and trust in language models with OLMoTrace
OLMoTrace lets you trace the outputs of language models back to their full, multi-trillion-token training data in real time.
April 2025 - Introducing Atlantes: the first AI-powered GPS model for real-time global scale maritime intelligence
Atlantes: a system of transformers for real-time GPS modeling.
June 2025 - OMEGA: Can LLMs Reason Outside the Box in Math?
Discover how OMEGA is being used to evaluate large language models' ability to generalize in math through…
June 2025 - New applications of the Ai2 Climate Emulator (ACE) by the international climate modeling community
Learn how ACE is being used for seasonal forecasts and understanding decadal variations in global warming.
June 2025 - Revisiting critical batch size for large-batch OLMo pretraining
We introduce a more reliable method to measure the critical batch size (CBS), analyze how CBS changes over…
April 2025 - Highlights from Ai2 at Google Cloud Next
Key moments from Google Cloud Next, including our partnership with Google Cloud, OLMoTrace, and more.
April 2025 - DataDecide: How to predict best pretraining data with small experiments
Explore the secrets of how language model developers make decisions with DataDecide.
April 2025 - Ai2 and Google Cloud commit $20M to advance AI-powered research for the Cancer AI Alliance
We announce partnership with the Cancer AI Alliance along with Google Cloud.
March 2025 - Introducing CodeScientist: A step toward automated scientific discovery
Will there be a system that automatically identifies gaps in scientific knowledge and runs experiments?
March 2025 - Introducing Ai2 Paper Finder
Ai2 Paper Finder is an LLM-powered literature search system that mimics the iterative paper-finding process.
March 2025 - Ai2’s Recommendations to OSTP to enable open-source innovation with the U.S. AI Action Plan
Ai2's recommendation in response to the White House’s Request for Information on an AI Action Plan.