Skip to main content ->
Ai2

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Filter papers

DREAM: Improving Situational QA by First Elaborating the Situation

Yuling GuBhavana Dalvi MishraPeter Clark
2021
NAACL

When people answer questions about a specific situation, e.g., "I cheated on my mid-term exam last week. Was that wrong?", cognitive science suggests that they form a mental picture of that… 

Inherently Explainable Reinforcement Learning in Natural Language

Xiangyu PengMark O. RiedlPrithviraj Ammanabrolu
2021
arXiv

We focus on the task of creating a reinforcement learning agent that is inherently explainable—with the ability to produce immediate local explanations by thinking out loud while performing a task… 

CommonsenseQA 2.0: Exposing the Limits of AI through Gamification

Alon TalmorOri YoranRonan Le BrasJonathan Berant
2021
NeurIPS

Constructing benchmarks that test the abilities of modern natural language un1 derstanding models is difficult – pre-trained language models exploit artifacts in 2 benchmarks to achieve human… 

FLEX: Unifying Evaluation for Few-Shot NLP

Jonathan BraggArman CohanKyle LoIz Beltagy
2021
NeurIPS

Few-shot NLP research is highly active, yet conducted in disjoint research threads with evaluation suites that lack challenging-yet-realistic testing setups and fail to employ careful experimental… 

MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers

Krishna PillutlaSwabha SwayamdiptaRowan ZellersZ. Harchaoui
2021
NeurIPS

As major progress is made in open-ended text generation, measuring how close machine-generated text is to human language remains a critical open problem. We introduce MAUVE , a comparison measure… 

MERLOT: Multimodal Neural Script Knowledge Models

Rowan ZellersXiming LuJack HesselYejin Choi
2021
NeurIPS

As humans, we understand events in the visual world contextually, performing multimodal reasoning across time to make inferences about the past, present, and future. We introduce MERLOT, a model… 

Natural Adversarial Objects

Felix LauNishant SubramaniSasha HarrisonRosanne Liu
2021
NeurIPS 2021 Data Centric AI Workshop

Although state-of-the-art object detection methods have shown compelling performance, models often are not robust to adversarial attacks and out-of-distribution data. We introduce a new dataset,… 

NaturalProofs: Mathematical Theorem Proving in Natural Language

S. WelleckJiachen LiuRonan Le BrasKyunghyun Cho
2021
NeurIPS

Understanding and creating mathematics using natural mathematical language – the mixture of symbolic and natural language used by humans – is a challenging and important problem for driving progress… 

One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval

Akari AsaiXinyan YuJungo KasaiHanna Hajishirzi
2021
NeurIPS

We present CORA, a Cross-lingual Open-Retrieval Answer Generation model that can answer questions across many languages even when language-specific annotated data or knowledge sources are… 

Teach Me to Explain: A Review of Datasets for Explainable Natural Language Processing

Sarah Wiegreffe and Ana Marasović
2021
NeurIPS Datasets & Benchmarks

Explainable NLP (ExNLP) has increasingly focused on collecting human-annotated explanations. These explanations are used downstream in three ways: as data augmentation to improve performance on a…