Skip to main content ->
Ai2

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Filter papers

Global Reasoning over Database Structures for Text-to-SQL Parsing

Ben BoginMatt GardnerJonathan Berant
2019
EMNLP

State-of-the-art semantic parsers rely on auto-regressive decoding, emitting one symbol at a time. When tested against complex databases that are unobserved at training time (zero-shot), the parser… 

Language Modeling for Code-Switching: Evaluation, Integration of Monolingual Data, and Discriminative Training

Hila GonenYoav Goldberg
2019
EMNLP

We focus on the problem of language modeling for code-switched language, in the context of automatic speech recognition (ASR). Language modeling for code-switched language is challenging for (at… 

On the Limits of Learning to Actively Learn Semantic Representations

Omri KoshorekGabriel StanovskyYichu ZhouVivek Srikumar and Jonathan Berant
2019
CoNLL

One of the goals of natural language understanding is to develop models that map sentences into meaning representations. However, training such models requires expensive annotation of complex… 

Transfer Learning Between Related Tasks Using Expected Label Proportions

Matan Ben NoachYoav Goldberg
2019
EMNLP

Deep learning systems thrive on abundance of labeled training data but such data is not always available, calling for alternative methods of supervision. One such method is expectation… 

Question Answering is a Format; When is it Useful?

Matt GardnerJonathan BerantHannaneh HajishirziSewon Min
2019
arXiv

Recent years have seen a dramatic expansion of tasks and datasets posed as question answering, from reading comprehension, semantic role labeling, and even machine translation, to image and video… 

Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets

Mor GevaYoav GoldbergJonathan Berant
2019
arXiv

Crowdsourcing has been the prevalent paradigm for creating natural language understanding datasets in recent years. A common crowdsourcing practice is to recruit a small number of high-quality… 

MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension

Alon TalmorJonathan Berant
2019
ACL

A large number of reading comprehension (RC) datasets has been created recently, but little analysis has been done on whether they generalize to one another, and the extent to which existing… 

Representing Schema Structure with Graph Neural Networks for Text-to-SQL Parsing

Ben BoginJonathan BerantMatt Gardner
2019
ACL

Research on parsing language to SQL has largely ignored the structure of the database (DB) schema, either because the DB was very simple, or because it was observed at both training and test time.… 

Aligning Vector-spaces with Noisy Supervised Lexicons

Noa YehezkelJacob GoldbergerYoav Goldberg
2019
NAACL

The problem of learning to translate between two vector spaces given a set of aligned points arises in several application areas of NLP. Current solutions assume that the lexicon which defines the… 

CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge

Alon TalmorJonathan HerzigNicholas LourieJonathan Berant
2019
NAACL

When answering a question, people often draw upon their rich world knowledge in addition to the particular context. Recent work has focused primarily on answering questions given some relevant…