Skip to main content ->
Ai2

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Filter papers

WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation

Alisa LiuSwabha SwayamdiptaNoah A. SmithYejin Choi
2022
Findings of EMNLP

A recurring challenge of crowdsourcing NLP datasets at scale is that human writers often rely on repetitive patterns when crafting examples, leading to a lack of linguistic diversity. We introduce a… 

What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment

Matthew FinlaysonKyle RichardsonAshish SabharwalPeter Clark
2022
EMNLP

The instruction learning paradigm—where a model learns to perform new tasks from task descriptions alone—has become popular in general-purpose model research. The capabilities of large transformer… 

Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection

Suchin GururanganDallas CardSarah K. DrierNoah A. Smith
2022
EMNLP

Language models increasingly rely on massive web dumps for diverse text data. However, these sources are rife with undesirable content. As such, resources like Wikipedia, books, and news often… 

Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering

Pan LuSwaroop MishraTony XiaA. Kalyan
2022
NeurIPS 2022

When answering a question, humans utilize the information available across different modalities to synthesize a consistent and complete chain of thought (CoT). This process is normally a black box… 

Ask4Help: Learning to Leverage an Expert for Embodied Tasks

Kunal Pratap SinghLuca WeihsAlvaro HerrastiRoozbeh Mottaghi
2022
arXiv

Embodied AI agents continue to become more capable every year with the advent of new models, environments, and benchmarks, but are still far away from being performant and reliable enough to be… 

Emulating Fast Processes in Climate Models

Noah BrenowitzW. PerkinsJ. M. NugentC. Bretherton
2022
NeurIPS•Machine Learning and Physical Sciences

Cloud microphysical parameterizations in atmospheric models describe the formation and evolution of clouds and precipitation, a central weather and climate process. Cloud-associated latent heating… 

Improving the predictions of ML-corrected climate models with novelty detection

Clayton SanfordAnna KwaOliver Watt‐MeyerC. Bretherton
2022
NeurIPS•Climate Change AI

While previous works have shown that machine learning (ML) can improve the prediction accuracy of coarse-grid climate models, these ML-augmented methods are more vulnerable to irregular inputs than… 

Machine-learned climate model corrections from a global storm-resolving model

Anna KwaS. ClarkB. HennC. Bretherton
2022
NeurIPS•Machine Learning and Physical Sciences

Due to computational constraints, running global climate models (GCMs) for many years requires a lower spatial grid resolution ( (cid:38) 50 km) than is optimal for accurately resolving important… 

Modeling the Machine Learning Multiverse

Samuel J BellOnno P. KampmanJesse DodgeNeil D. Lawrence
2022
NeurIPS

Amid mounting concern about the reliability and credibility of machine learning research, we present a principled framework for making robust and generalizable claims: the multiverse analysis . Our… 

ProcTHOR: Large-Scale Embodied AI Using Procedural Generation

Matt DeitkeEli VanderBiltAlvaro HerrastiRoozbeh Mottaghi
2022
NeurIPS

Massive datasets and high-capacity models have driven many recent advancements in computer vision and natural language understanding. This work presents a platform to enable similar success stories…