Skip to main content ->
Ai2

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Filter papers

SoPa: Bridging CNNs, RNNs, and Weighted Finite-State Machines

Roy SchwartzSam Thomson and Noah A. Smith
2018
ACL

Recurrent and convolutional neural networks comprise two distinct families of models that have proven to be useful for encoding natural language utterances. In this paper we present SoPa, a new… 

A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications

Dongyeop KangWaleed AmmarBhavana Dalvi MishraRoy Schwartz
2018
NAACL-HLT

Peer reviewing is a central component in the scientific publishing process. We present the first public dataset of scientific peer reviews available for research pur- poses (PeerRead v1), providing… 

What Happened? Leveraging VerbNet to Predict the Effects of Actions in Procedural Text

Peter ClarkBhavana DalviNiket Tandon
2018
arXiv

Our goal is to answer questions about paragraphs describing processes (e.g., photosynthesis). Texts of this genre are challenging because the effects of actions are often implicit (unstated),… 

Tracking State Changes in Procedural Text: A Challenge Dataset and Models for Process Paragraph Comprehension

Bhavana DalviLifu HuangNiket TandonPeter Clark
2018
NAACL

We present a new dataset and models for comprehending paragraphs about processes (e.g., photosynthesis), an important genre of text describing a dynamic world. The new dataset, ProPara, is the first… 

Annotation Artifacts in Natural Language Inference Data

Suchin GururanganSwabha SwayamdiptaOmer LevySam Bowman and Noah A. Smith
2018
NAACL

Large-scale datasets for natural language inference are created by presenting crowd workers with a sentence (premise), and asking them to generate three new sentences (hypotheses) that it entails,… 

Content-Based Citation Recommendation

Chandra BhagavatulaSergey FeldmanRussell PowerWaleed Ammar
2018
NAACL-HLT

We present a content-based method for recommending citations in an academic paper draft. We embed a given query document into a vector space, then use its nearest neighbors as candidates, and rerank… 

Deep Contextualized Word Representations

Matthew E. PetersMark NeumannMohit IyyerLuke Zettlemoyer
2018
NAACL

We introduce a new type of deep contextualized word representation that models both (1) complex characteristics of word use (e.g., syntax and semantics), and (2) how these uses vary across… 

Ontology Alignment in the Biomedical Domain Using Entity Definitions and Context

Lucy L. WangChandra BhagavatulaM. NeumannWaleed Ammar
2018
ACL • Proceedings of the BioNLP 2018 Workshop

Ontology alignment is the task of identifying semantically equivalent entities from two given ontologies. Different ontologies have different representations of the same entity, resulting in a need… 

The Web as a Knowledge-base for Answering Complex Questions

Alon TalmorJonathan Berant
2018
NAACL

Answering complex questions is a time-consuming activity for humans that requires reasoning and integration of information. Recent work on reading comprehension made headway in answering simple… 

Sounding Board: A User-Centric and Content-Driven Social Chatbot

Hao FangHao ChengMaarten Sapand Mari Ostendorf
2018
NAACL-HTL

We present Sounding Board, a social chatbot that won the 2017 Amazon Alexa Prize. The system architecture consists of several components including spoken language processing, dialogue management,…