Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

RCT Rejection Sampling for Causal Estimation Evaluation

Katherine A. Keith, Sergey Feldman, David Jurgens, Rohit Bhattacharya

2023

Transactions on Machine Learning Research

Confounding is a significant obstacle to unbiased estimation of causal effects from observational data. For settings with high-dimensional covariates -- such as text data, genomics, or the…

CHAMP: Efficient Annotation and Consolidation of Cluster Hierarchies

Arie Cattan, Tom Hope, Doug Downey, Ido Dagan

2023

Conference on Empirical Methods in Natural Language Processing

Various NLP tasks require a complex hierarchical structure over nodes, where each node is a cluster of items. Examples include generating entailment graphs, hierarchical cross-document coreference…

CARE: Extracting Experimental Findings From Clinical Literature

Aakanksha Naik, Bailey Kuehl, Erin Bransom, Tom Hope

2023

arXiv.org

Extracting fine-grained experimental findings from literature can provide massive utility for scientific applications. Prior work has focused on developing annotation schemas and datasets for…

LongBoX: Evaluating Transformers on Long-Sequence Clinical Tasks

Mihir Parmar, Aakanksha Naik, Himanshu Gupta, Chitta Baral

2023

arXiv.org

Many large language models (LLMs) for medicine have largely been evaluated on short texts, and their ability to handle longer sequences such as a complete electronic health record (EHR) has not been…

Papeos: Augmenting Research Papers with Talk Videos

Tae Soo Kim, Matt Latzke, Jonathan Bragg, Joseph Chee Chang

2023

UIST

Research consumption has been traditionally limited to the reading of academic papers—a static, dense, and formally written format. Alternatively, pre-recorded conference presentation videos, which…

Synergi: A Mixed-Initiative System for Scholarly Synthesis and Sensemaking

Hyeonsu B Kang, Sherry Wu, Joseph Chee Chang, A. Kittur

2023

UIST

Efficiently reviewing scholarly literature and synthesizing prior art are crucial for scientific progress. Yet, the growing scale of publications and the burden of knowledge make synthesis of…

The Surveillance AI Pipeline

Pratyusha Ria Kalluri, William Agnew, M. Cheng, A. Birhane

2023

arXiv

A rapidly growing number of voices have argued that AI research, and computer vision in particular, is closely tied to mass surveillance. Yet the direct path from computer vision research to…

When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets

Orion Weller, Kyle Lo, David Wadden, Luca Soldaini

2023

arXiv

Using large language models (LMs) for query or document expansion can improve generalization in information retrieval. However, it is unknown whether these techniques are universally beneficial or…

Bound by the Bounty: Collaboratively Shaping Evaluation Processes for Queer AI Harms

Organizer of Queer In AI, Nathaniel Dennler, Anaelia Ovalle, Jessica de Jesus de Pinho Pinhal

2023

AIES

Bias evaluation benchmarks and dataset and model documentation have emerged as central processes for assessing the biases and harms of artificial intelligence (AI) systems. However, these auditing…

Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents

Catherine Chen, Zejiang Shen, Dan Klein, Kyle Lo

2023

Findings of ACL

Recent work has shown that infusing layout features into language models (LMs) improves processing of visually-rich documents such as scientific papers. Layout-infused LMs are often evaluated on…
