Ai2

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Structural Scaffolds for Citation Intent Classification in Scientific Publications

Arman Cohan, Waleed Ammar, Madeleine van Zuylen, Field Cady
2019
NAACL

Identifying the intent of a citation in scientific papers (e.g., background information, use of methods, comparing results) is critical for machine reading of individual publications and automated… 

Citation Count Analysis for Papers with Preprints

Sergey Feldman, Kyle Lo, Waleed Ammar
2018
ArXiv

We explore the degree to which papers prepublished on arXiv garner more citations, in an attempt to paint a sharper picture of fairness issues related to prepublishing. A paper’s citation count is… 

Construction of the Literature Graph in Semantic Scholar

Waleed Ammar, Dirk Groeneveld, Chandra Bhagavatula, et al.
2018
NAACL-HLT

We describe a deployed scalable system for organizing published scientific literature into a heterogeneous graph to facilitate algorithmic manipulation and discovery. The resulting literature graph… 

A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications

Dongyeop Kang, Waleed Ammar, Bhavana Dalvi Mishra, Roy Schwartz
2018
NAACL-HLT

Peer reviewing is a central component in the scientific publishing process. We present the first public dataset of scientific peer reviews available for research purposes (PeerRead v1), providing…

Content-Based Citation Recommendation

Chandra Bhagavatula, Sergey Feldman, Russell Power, Waleed Ammar
2018
NAACL-HLT

We present a content-based method for recommending citations in an academic paper draft. We embed a given query document into a vector space, then use its nearest neighbors as candidates, and rerank… 

Extracting Scientific Figures with Distantly Supervised Neural Networks

Noah Siegel, Nicholas Lourie, Russell Power, and Waleed Ammar
2018
JCDL

Non-textual components such as charts, diagrams and tables provide key information in many scientific documents, but the lack of large labeled datasets has impeded the development of data-driven… 

Ontology Alignment in the Biomedical Domain Using Entity Definitions and Context

Lucy L. Wang, Chandra Bhagavatula, M. Neumann, Waleed Ammar
2018
ACL • Proceedings of the BioNLP 2018 Workshop

Ontology alignment is the task of identifying semantically equivalent entities from two given ontologies. Different ontologies have different representations of the same entity, resulting in a need… 

Semi-supervised sequence tagging with bidirectional language models

Matthew E. Peters, Waleed Ammar, Chandra Bhagavatula, and Russell Power
2017
ACL

Pre-trained word embeddings learned from unlabeled text have become a standard component of neural network architectures for NLP tasks. However, in most cases, the recurrent network that operates… 

AI zooms in on highly influential citations

Oren Etzioni
2017
Nature

The number of times a paper is cited is a poor proxy for its impact (see P. Stephan et al. Nature 544, 411–412; 2017). I suggest relying instead on a new metric that uses artificial intelligence… 

End-to-End Neural Ad-hoc Ranking with Kernel Pooling

Chenyan Xiong, Zhuyun Dai, Jamie Callan, and Russell Power
2017
SIGIR

This paper proposes K-NRM, a kernel based neural model for document ranking. Given a query and a set of documents, K-NRM uses a translation matrix that models word-level similarities via word…