An abstract illustration of swirling shapes, meant to denote a futuristic feeling.

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Is GPT-3 Text Indistinguishable from Human Text? SCARECROW: A Framework for Scrutinizing Machine Text

Yao DouMaxwell ForbesRik Koncel-KedziorskiYejin Choi

2022

ACL

Modern neural text generation systems can produce remarkably ﬂuent and grammatical texts. While earlier language models suffered from repetition and syntactic errors, the errors made by contemporary…

Large Scale Substitution-based Word Sense Induction

Matan EyalShoval SaddeHillel Taub-TabibYoav Goldberg

2022

ACL

We present a word-sense induction method based on pre-trained masked language models (MLMs), which can cheaply scale to large vocabularies and large corpora. The result is a corpus which is…

NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks

Swaroop MishraArindam MitraNeeraj VarshneyA. Kalyan

2022

ACL

Given the ubiquitous nature of numbers in text, reasoning with numbers to perform simple calculations is an important skill of AI systems. While many datasets and models have been developed to this…

Productive Performance Engineering for Weather and Climate Modeling with Python

Tal Ben-NunLinus GronerFlorian DeconinckTorsten Hoefler

2022

arXiv

Earth system models are developed with a tight coupling to target hardware, often containing highly-specialized code predicated on processor characteristics. This coupling stems from using…

Reframing Instructional Prompts to GPTk's Language

Swaroop MishraDaniel KhashabiChitta BaralHanna Hajishirzi

2022

Findings of ACL

How can model designers turn task instructions into effective prompts for language models? Backed by extensive empirical analysis on GPT3, we observe important features for successful instructional…

Situated Dialogue Learning through Procedural Environment Generation

Prithviraj AmmanabroluRenee JiaMark O. Riedl

2022

ACL

We teach goal-driven agents to interactively act and speak in situated environments by training on generated curriculums. Our agents operate in LIGHT (Urbanek et al. 2019)—a large-scale…

ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts

Sonia K. MurthyKyle LoDaniel KingDoug Downey

2022

arXiv

Systems that can automatically deﬁne unfamiliar terms hold the promise of improving the accessibility of scientiﬁc texts, especially for readers who may lack prerequisite background knowledge.…

Understanding Dataset Difficulty with 𝒱-Usable Information

Kawin EthayarajhYejin Choiand Swabha Swayamdipta

2022

ICML

Estimating the difficulty of a dataset typically involves comparing state-of-the-art models to humans; the bigger the performance gap, the harder the dataset is said to be. However, this comparison…

PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization

Wen XiaoIz BeltagyG. CareniniArman Cohan

2022

ACL

We introduce PRIMERA, a pre-trained model for multi-document representation with a focus on summarization that reduces the need for dataset-specific architectures and large amounts of fine-tuning…

Better Retrieval May Not Lead to Better Question Answering

Zhengzhong LiangTushar KhotSteven BethardAshish Sabharwal

2022

arXiv

Considerable progress has been made recently in open-domain question answering (QA) problems, which require Information Retrieval (IR) and Reading Comprehension (RC). A popular approach to improve…

Previous442-451Next