Papers

See AI2's Award Winning Papers

Learn more about AI2's Lasting Impact Award

Viewing 111-120 of 164 papers

Optimizing AI for Teamwork
Gagan Bansal, Besmira Nushi, Ece Kamar, E. Horvitz, Daniel S. WeldAAAI • 2021 In many high-stakes domains such as criminal justice, finance, and healthcare, AI systems may recommend actions to a human expert responsible for final decisions, a context known as AI-advised decision making. When AI practitioners deploy the most accurate…
GENIE: A Leaderboard for Human-in-the-Loop Evaluation of Text Generation
Daniel Khashabi, Gabriel Stanovsky, Jonathan Bragg, Nicholas Lourie, Jungo Kasai, Yejin Choi, Noah A. Smith, Daniel S. WeldarXiv • 2021 Leaderboards have eased model development for many NLP datasets by standardizing their evaluation and delegating it to an independent external repository. Their adoption, however, is so far limited to tasks which can be reliably evaluated in an automatic…
Text mining approaches for dealing with the rapidly expanding literature on COVID-19
Lucy Lu Wang, Kyle LoBriefings in Bioinformatics • 2020 More than 50 000 papers have been published about COVID-19 since the beginning of 2020 and several hundred new papers continue to be published every day. This incredible rate of scientific productivity leads to information overload, making it difficult for…
Mitigating Biases in CORD-19 for Analyzing COVID-19 Literature
Anshul Kanakia, Kuansan Wang, Yuxiao Dong, Boya Xie, Kyle Lo, Zhihong Shen, Lucy Lu Wang, Chiyuan Huang, Darrin Eide, Sebastian Kohlmeier, Chieh-Han WuFrontiers in Research Metrics and Analytics • 2020 On the behest of the Office of Science and Technology Policy in the White House, six institutions, including ours, have created an open research dataset called COVID-19 Research Dataset (CORD-19) to facilitate the development of question-answering systems…
Document-Level Definition Detection in Scholarly Documents: Existing Models, Error Analyses, and Future Directions
Dongyeop Kang, Andrew Head, Risham Sidhu, Kyle Lo, Daniel S. Weld, Marti A. HearstEMNLP • SDP workshop • 2020 The task of definition detection is important for scholarly papers, because papers often make use of technical terminology that may be unfamiliar to readers. Despite prior work on definition detection, current approaches are far from being accurate enough to…
PySBD: Pragmatic Sentence Boundary Disambiguation
Nipun Sadvilkar, M. NeumannEMNLP • NLP-OSS Workshop • 2020 In this paper, we present a rule-based sentence boundary disambiguation Python package that works out-of-the-box for 22 languages. We aim to provide a realistic segmenter which can provide logical sentences even when the format and domain of the input text is…
Fact or Fiction: Verifying Scientific Claims
David Wadden, Kyle Lo, Lucy Lu Wang, Shanchuan Lin, Madeleine van Zuylen, Arman Cohan, Hannaneh HajishirziEMNLP • 2020 We introduce the task of scientific fact-checking. Given a corpus of scientific articles and a claim about a scientific finding, a fact-checking model must identify abstracts that support or refute the claim. In addition, it must provide rationales for its…
MedICaT: A Dataset of Medical Images, Captions, and Textual References
Sanjay Subramanian, Lucy Lu Wang, Sachin Mehta, Ben Bogin, Madeleine van Zuylen, Sravanthi Parasa, Sameer Singh, Matt Gardner, Hannaneh HajishirziFindings of EMNLP • 2020 Understanding the relationship between figures and text is key to scientific document understanding. Medical figures in particular are quite complex, often consisting of several subfigures (75% of figures in our dataset), with detailed text describing their…
SciSight: Combining faceted navigation and research group detection for COVID-19 exploratory scientific search
Tom Hope, Jason Portenoy, Kishore Vasan, Jonathan Borchardt, Eric Horvitz, Daniel S. Weld, Marti A. Hearst, Jevin D. WestEMNLP • Demo • 2020 The COVID-19 pandemic has sparked unprecedented mobilization of scientists, already generating thousands of new papers that join a litany of previous biomedical work in related areas. This deluge of information makes it hard for researchers to keep track of…
SLEDGE-Z: A Zero-Shot Baseline for COVID-19 Literature Search
S. MacAvaney, Arman Cohan, N. GoharianEMNLP • 2020 With worldwide concerns surrounding the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), there is a rapidly growing body of literature on the virus. Clinicians, researchers, and policy-makers need a way to effectively search these articles. In…

1
•••
11
12
13
•••
17

Natural Language Processing

Computer Vision

AI for the Environment

Experimentation and Communication

Research

Research

Papers

Optimizing AI for Teamwork

GENIE: A Leaderboard for Human-in-the-Loop Evaluation of Text Generation

Text mining approaches for dealing with the rapidly expanding literature on COVID-19

Mitigating Biases in CORD-19 for Analyzing COVID-19 Literature

Document-Level Definition Detection in Scholarly Documents: Existing Models, Error Analyses, and Future Directions

PySBD: Pragmatic Sentence Boundary Disambiguation

Fact or Fiction: Verifying Scientific Claims

MedICaT: A Dataset of Medical Images, Captions, and Textual References

SciSight: Combining faceted navigation and research group detection for COVID-19 exploratory scientific search

SLEDGE-Z: A Zero-Shot Baseline for COVID-19 Literature Search