An abstract illustration of swirling shapes, meant to denote a futuristic feeling.

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

NaturalProofs: Mathematical Theorem Proving in Natural Language

S. WelleckJiachen LiuRonan Le BrasKyunghyun Cho

2021

NeurIPS

Understanding and creating mathematics using natural mathematical language – the mixture of symbolic and natural language used by humans – is a challenging and important problem for driving progress…

One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval

Akari AsaiXinyan YuJungo KasaiHanna Hajishirzi

2021

NeurIPS

We present CORA, a Cross-lingual Open-Retrieval Answer Generation model that can answer questions across many languages even when language-specific annotated data or knowledge sources are…

Teach Me to Explain: A Review of Datasets for Explainable Natural Language Processing

Sarah Wiegreffe and Ana Marasović

2021

NeurIPS Datasets & Benchmarks

Explainable NLP (ExNLP) has increasingly focused on collecting human-annotated explanations. These explanations are used downstream in three ways: as data augmentation to improve performance on a…

Bridging the Imitation Gap by Adaptive Insubordination

Luca WeihsUnnat JainJordi SalvadorA. Schwing

2021

arXiv

Why do agents often obtain better reinforcement learning policies when imitating a worse expert? We show that privileged information used by the expert is marginalized in the learned agent policy,…

Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text

Christopher ClarkJordi SalvadorDustin SchwenkAli Farhadi

2021

arXiv

Communicating with humans is challenging for AIs because it requires a shared understanding of the world, complex semantics (e.g., metaphors or analogies), and at times multimodal gestures (e.g.,…

Specializing Multilingual Language Models: An Empirical Study

Ethan C. ChauNoah A. Smith

2021

EMNLP • Workshop on Multilingual Representation Learning

Pretrained multilingual language models have become a common tool in transferring NLP capabilities to low-resource languages, often with adaptations. In this work, we study the performance,…

Towards Personalized Descriptions of Scientific Concepts

Sonia K. MurthyDaniel KingTom HopeDoug Downey

2021

EMNLP 2021 • WiNLP

A single scientific concept can be described in many different ways, and the most informative description depends on the audience. In this paper, we propose generating personalized scientific…

Achieving Model Robustness through Discrete Adversarial Training

Maor IvgiJonathan Berant

2021

EMNLP

Discrete adversarial attacks are symbolic perturbations to a language input that preserve the output label but lead to a prediction error. While such attacks have been extensively explored for the…

Back to Square One: Bias Detection, Training and Commonsense Disentanglement in the Winograd Schema

Yanai ElazarHongming ZhangYoav GoldbergDan Roth

2021

EMNLP

The Winograd Schema (WS) has been proposed as a test for measuring commonsense capabilities of models. Recently, pre-trained language model-based approaches have boosted performance on some WS…

BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief

Nora KassnerOyvind TafjordH. SchutzeP. Clark

2021

EMNLP

Although pretrained language models (PTLMs) have been shown to contain significant amounts of world knowledge, they can still produce inconsistent answers to questions when probed, even after using…

Previous502-511Next