Papers

  • A Lightweight and High Performance Monolingual Word Aligner

    Xuchen Yao, Benjamin Van Durme, Chris Callison-Burch, and Peter Clark. ACL, 2013. Fast alignment is essential for many natural language tasks. But in the setting of monolingual alignment, previous work has not been able to align more than one sentence pair per second. We describe a discriminatively trained monolingual word aligner that…
  • Automatic Coupling of Answer Extraction and Information Retrieval

    Xuchen Yao, Benjamin Van Durme, and Peter Clark. ACL, 2013. Information Retrieval (IR) and Answer Extraction are often designed as isolated or loosely connected components in Question Answering (QA), with repeated overengineering on IR, and not necessarily performance gain for QA. We propose to tightly integrate them…
  • Answer Extraction as Sequence Tagging with Tree Edit Distance

    Xuchen Yao, Benjamin Van Durme, Chris Callison-Burch, and Peter Clark. NAACL, 2013. Our goal is to extract answers from preretrieved sentences for Question Answering (QA). We construct a linear-chain Conditional Random Field based on pairs of questions and their possible answer sentences, learning the association between questions and answer…
  • A Study of the Knowledge Base Requirements for Passing an Elementary Science Test

    Peter Clark, Phil Harrison, and Niranjan Balasubramanian. CIKM • AKBC, 2013. Our long-term interest is in machines that contain large amounts of general and scientific knowledge, stored in a "computable" form that supports reasoning and explanation. As a medium-term focus for this, our goal is to have the computer pass a fourth-grade…
  • Extracting Meronyms for a Biology Knowledge Base Using Distant Supervision

    Xiao Ling, Dan Weld, and Peter Clark. AKBC, 2013. Knowledge of objects and their parts, meronym relations, are at the heart of many question-answering systems, but manually encoding these facts is impractical. Past researchers have tried hand-written patterns, supervised learning, and bootstrapped methods…
  • Learning Biological Processes with Global Constraints

    Aju Thalappillil Scaria, Jonathan Berant, Mengqiu Wang, Christopher D. Manning, Justin Lewis, Brittany Harding, and Peter Clark. EMNLP, 2013. Biological processes are complex phenomena involving a series of events that are related to one another through various relationships. Systems that can understand and reason over biological processes would dramatically improve the performance of semantic…
  • Semi-Markov Phrase-based Monolingual Alignment

    Xuchen Yao, Benjamin Van Durme, Chris Callison-Burch, and Peter Clark. EMNLP, 2013. We introduce a novel discriminative model for phrase-based monolingual alignment using a semi-Markov CRF. Our model achieves state-of-the-art alignment accuracy on two phrase-based alignment datasets (RTE and paraphrase), while doing significantly better than…
  • Probabilistic coherence, logical consistency, and Bayesian learning: Neural language models as epistemic agents

    Gregor Betz and Kyle Richardson. PLoS ONE, 2013. It is argued that suitably trained neural language models exhibit key properties of epistemic agency: they hold probabilistically coherent and logically consistent degrees of belief, which they can rationally revise in the face of novel evidence. To this…
  • Constructing a Textual KB from a Biology TextBook

    Peter Clark, Phil Harrison, Niranjan Balasubramanian, and Oren Etzioni. NAACL-HLT • AKBC Workshop, 2012. As part of our work on building a "knowledgeable textbook" about biology, we are developing a textual question-answering (QA) system that can answer certain classes of biology questions posed by users. In support of that, we are building a "textual KB" - an…
  • Finding Deceptive Opinion Spam by Any Stretch of the Imagination

    Myle Ott, Yejin Choi, Claire Cardie, and J. Hancock. ACL, 2011. Consumers increasingly rate, review and research products online (Jansen, 2010; Litvin et al., 2008). Consequently, websites containing consumer reviews are becoming targets of opinion spam. While recent work has focused primarily on manually identifiable…