Ai2

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents

Kyle Lo, Zejiang Shen, Benjamin Newman, Luca Soldaini
2023
EMNLP

Despite growing interest in applying natural language processing (NLP) and computer vision (CV) models to the scholarly domain, scientific documents remain challenging to work with. They’re often in… 

RCT Rejection Sampling for Causal Estimation Evaluation

Katherine A. Keith, Sergey Feldman, David Jurgens, Rohit Bhattacharya
2023
Transactions on Machine Learning Research

Confounding is a significant obstacle to unbiased estimation of causal effects from observational data. For settings with high-dimensional covariates -- such as text data, genomics, or the… 

CHAMP: Efficient Annotation and Consolidation of Cluster Hierarchies

Arie Cattan, Tom Hope, Doug Downey, Ido Dagan
2023
Conference on Empirical Methods in Natural Language Processing

Various NLP tasks require a complex hierarchical structure over nodes, where each node is a cluster of items. Examples include generating entailment graphs, hierarchical cross-document coreference… 

CARE: Extracting Experimental Findings From Clinical Literature

Aakanksha Naik, Bailey Kuehl, Erin Bransom, Tom Hope
2023
arXiv.org

Extracting fine-grained experimental findings from literature can provide massive utility for scientific applications. Prior work has focused on developing annotation schemas and datasets for… 

LongBoX: Evaluating Transformers on Long-Sequence Clinical Tasks

Mihir Parmar, Aakanksha Naik, Himanshu Gupta, Chitta Baral
2023
arXiv.org

Many large language models (LLMs) for medicine have largely been evaluated on short texts, and their ability to handle longer sequences such as a complete electronic health record (EHR) has not been… 

Papeos: Augmenting Research Papers with Talk Videos

Tae Soo Kim, Matt Latzke, Jonathan Bragg, Joseph Chee Chang
2023
UIST

Research consumption has been traditionally limited to the reading of academic papers—a static, dense, and formally written format. Alternatively, pre-recorded conference presentation videos, which… 

Synergi: A Mixed-Initiative System for Scholarly Synthesis and Sensemaking

Hyeonsu B Kang, Sherry Wu, Joseph Chee Chang, A. Kittur
2023
UIST

Efficiently reviewing scholarly literature and synthesizing prior art are crucial for scientific progress. Yet, the growing scale of publications and the burden of knowledge make synthesis of… 

The Surveillance AI Pipeline

Pratyusha Ria Kalluri, William Agnew, M. Cheng, A. Birhane
2023
arXiv

A rapidly growing number of voices have argued that AI research, and computer vision in particular, is closely tied to mass surveillance. Yet the direct path from computer vision research to… 

When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets

Orion Weller, Kyle Lo, David Wadden, Luca Soldaini
2023
arXiv

Using large language models (LMs) for query or document expansion can improve generalization in information retrieval. However, it is unknown whether these techniques are universally beneficial or… 

Bound by the Bounty: Collaboratively Shaping Evaluation Processes for Queer AI Harms

Organizer of Queer In AI, Nathaniel Dennler, Anaelia Ovalle, Jessica de Jesus de Pinho Pinhal
2023
AIES

Bias evaluation benchmarks and dataset and model documentation have emerged as central processes for assessing the biases and harms of artificial intelligence (AI) systems. However, these auditing…