Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Mixture Content Selection for Diverse Sequence Generation

Jaemin Cho, Minjoon Seo, Hannaneh Hajishirzi

Generating diverse sequences is important in many NLP applications such as question generation or summarization that exhibit semantically one-to-many relationships between source and the target… 

BERT for Coreference Resolution: Baselines and Analysis

Mandar Joshi, Omer Levy, Daniel S. Weld, Luke Zettlemoyer

We apply BERT to coreference resolution, achieving strong improvements on the OntoNotes (+3.9 F1) and GAP (+11.5 F1) benchmarks. A qualitative analysis of model predictions indicates that, compared… 

“Going on a vacation” takes longer than “Going for a walk”: A Study of Temporal Commonsense Understanding

Ben Zhou, Daniel Khashabi, Qiang Ning, Dan Roth

Understanding time is crucial for understanding events expressed in natural language. Because people rarely say the obvious, it is often necessary to have commonsense knowledge about various… 

Everything Happens for a Reason: Discovering the Purpose of Actions in Procedural Text

Bhavana Dalvi Mishra, Niket Tandon, Antoine Bosselut, Peter Clark

Our goal is to better comprehend procedural text, e.g., a paragraph about photosynthesis, by not only predicting what happens, but why some actions need to happen before others. Our approach builds… 

Transfer Learning Between Related Tasks Using Expected Label Proportions

Matan Ben Noach, Yoav Goldberg

Deep learning systems thrive on abundance of labeled training data but such data is not always available, calling for alternative methods of supervision. One such method is expectation… 

Language Modeling for Code-Switching: Evaluation, Integration of Monolingual Data, and Discriminative Training

Hila Gonen, Yoav Goldberg

We focus on the problem of language modeling for code-switched language, in the context of automatic speech recognition (ASR). Language modeling for code-switched language is challenging for (at… 

SpanBERT: Improving Pre-training by Representing and Predicting Spans

Mandar Joshi, Danqi Chen, Yinhan Liu, Omer Levy

We present SpanBERT, a pre-training method that is designed to better represent and predict spans of text. Our approach extends BERT by (1) masking contiguous random spans, rather than random… 

SciBERT: A Pretrained Language Model for Scientific Text

Iz Beltagy, Kyle Lo, Arman Cohan

Obtaining large-scale annotated data for NLP tasks in the scientific domain is challenging and expensive. We release SciBERT, a pretrained language model based on BERT (Devlin et al., 2018) to… 

Pretrained Language Models for Sequential Sentence Classification

Arman Cohan, Iz Beltagy, Daniel King, Daniel S. Weld

As a step toward better document-level understanding, we explore classification of a sequence of sentences into their corresponding categories, a task that requires understanding sentences in… 

Don't paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing

Jonathan Herzig, Jonathan Berant

A major hurdle on the road to conversational interfaces is the difficulty in collecting data that maps language utterances to logical forms. One prominent approach for data collection has been to…