Skip to main content ->
Ai2

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Filter papers

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Luca SoldainiRodney KinneyAkshita BhagiaKyle Lo
2024
ACL 2024

Information about pretraining corpora used to train the current best-performing language models is seldom discussed: commercial models rarely detail their data, and even open models are often… 

Neural Network Parameterization of Subgrid‐Scale Physics From a Realistic Geography Global Storm‐Resolving Simulation

Oliver Watt‐MeyerNoah D. BrenowitzS. K. ClarkChristopher S. Bretherton
2024
Journal of Advances in Modeling Earth Systems

Parameterization of subgrid‐scale processes is a major source of uncertainty in global atmospheric model simulations. Global storm‐resolving simulations use a finer grid (less than 5 km) to reduce… 

Selective Visual Representations Improve Convergence and Generalization for Embodied-AI

Ainaz EftekharKuo-Hao ZengJiafei DuanRanjay Krishna
2024
ICLR • Proceedings

Embodied AI models often employ off the shelf vision backbones like CLIP to encode their visual observations. Although such general purpose representations encode rich syntactic and semantic… 

MARG: Multi-Agent Review Generation for Scientific Papers

Mike D'ArcyTom HopeLarry BirnbaumDoug Downey
2024
arXiv.org

We study the ability of LLMs to generate feedback for scientific papers and develop MARG, a feedback generation approach using multiple LLM instances that engage in internal discussion. By… 

Tropical Cirrus Are Highly Sensitive to Ice Microphysics Within a Nudged Global Storm‐Resolving Model

R. AtlasC. BrethertonA. SokolM. F. Khairoutdinov
2024
Geophysical Research Letters

Cirrus dominate the longwave radiative budget of the tropics. For the first time, the variability in cirrus properties and longwave cloud radiative effects (CREs) that arises from using different… 

Catwalk: A Unified Language Model Evaluation Framework for Many Datasets

Dirk GroeneveldAnas AwadallaIz BeltagyJesse Dodge
2023
arXiv.org

The success of large language models has shifted the evaluation paradigms in natural language processing (NLP). The community's interest has drifted towards comparing NLP models across many tasks,… 

Kilometer-scale global warming simulations and active sensors reveal changes in tropical deep convection

Maximilien BolotLucas M. HarrisKai-Yuan ChengLinjiong Zhou & Stephan Fueglistaler
2023
NPJ Climate and Atmospheric Science

Changes in tropical deep convection with global warming are a leading source of uncertainty for future climate projections. A comparison of the responses of active sensor measurements of cloud ice… 

ACE: A fast, skillful learned global atmospheric model for climate prediction

Oliver Watt‐MeyerGideon DresdnerJ. McGibbonChristopher S. Bretherton
2023
NeurIPS • Tackling Climate Change with Machine Learning

Existing ML-based atmospheric models are not suitable for climate prediction, which requires long-term stability and physical consistency. We present ACE (AI2 Climate Emulator), a 200M-parameter,… 

Harmonic Mobile Manipulation

Ruihan YangYejin KimAniruddha KembhaviKiana Ehsani
2023
IROS

Recent advancements in robotics have enabled robots to navigate complex scenes or manipulate diverse objects independently. However, robots are still impotent in many household tasks requiring… 

IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions

Wenhao YuMeng JiangPeter ClarkAshish Sabharwal
2023
EMNLP

Although counterfactual reasoning is a fundamental aspect of intelligence, the lack of large-scale counterfactual open-domain question-answering (QA) benchmarks makes it difficult to evaluate and…