An abstract illustration of swirling shapes, meant to denote a futuristic feeling.

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Shallow Syntax in Deep Water

Swabha SwayamdiptaMatthew E. PetersBrendan RoofNoah A. Smith

2019

arXiv

Shallow syntax provides an approximation of phrase-syntactic structure of sentences; it can be produced with high accuracy, and is computationally cheap to obtain. We investigate the role of shallow…

Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets

Mor GevaYoav GoldbergJonathan Berant

2019

arXiv

Crowdsourcing has been the prevalent paradigm for creating natural language understanding datasets in recent years. A common crowdsourcing practice is to recruit a small number of high-quality…

Do Neural Language Representations Learn Physical Commonsense?

Maxwell ForbesAri HoltzmanYejin Choi

2019

CogSci

Humans understand language based on the rich background knowledge about how the physical world works, which in turn allows us to reason about the physical world through language. In addition to the…

To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks

Matthew E. PetersSebastian RuderNoah A. Smith

2019

ACL • RepL4NLP

While most previous work has focused on different pretraining objectives and architectures for transfer learning, we ask how to best adapt the pretrained model to a given target task. We focus on…

Evaluating Gender Bias in Machine Translation

Gabriel StanovskyNoah A. SmithLuke Zettlemoyer

2019

ACL

We present the first challenge set and evaluation protocol for the analysis of gender bias in machine translation (MT). Our approach uses two recent coreference resolution datasets composed of…

MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension

Alon TalmorJonathan Berant

2019

ACL

A large number of reading comprehension (RC) datasets has been created recently, but little analysis has been done on whether they generalize to one another, and the extent to which existing…

Representing Schema Structure with Graph Neural Networks for Text-to-SQL Parsing

Ben BoginJonathan BerantMatt Gardner

2019

ACL

Research on parsing language to SQL has largely ignored the structure of the database (DB) schema, either because the DB was very simple, or because it was observed at both training and test time.…

ScispaCy: Fast and Robust Models for Biomedical Natural Language Processing

Mark NeumannDaniel KingIz BeltagyWaleed Ammar

2019

ACL • BioNLP Workshop

Despite recent advances in natural language processing, many statistical models for processing text perform extremely poorly under domain shift. Processing biomedical and clinical text is a…

Adaptive Hashing for Model Counting

Jonathan KuckTri DaoYuanrun ZhengStefano Ermon

2019

UAI

Randomized hashing algorithms have seen recent success in providing bounds on the model count of a propositional formula. These methods repeatedly check the satisfiability of a formula subject to…

CEDR: Contextualized Embeddings for Document Ranking

Sean MacAvaneyAndrew YatesArman CohanNazli Goharian

2019

SIGIR

Although considerable attention has been given to neural ranking architectures recently, far less attention has been paid to the term representations that are used as input to these models. In this…

Previous872-881Next