Ai2

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

On Generating Extended Summaries of Long Documents

Sajad Sotudeh, Arman Cohan, Nazli Goharian
2021
AAAI • Scientific Document Understanding Workshop

Prior work in document summarization has mainly focused on generating short summaries of a document. While this type of summary helps get a high-level view of a given document, it is desirable in… 

Learning to Rationalize for Nonmonotonic Reasoning with Distant Supervision

Faeze Brahman, Vered Shwartz, Rachel Rudinger, and Yejin Choi
2021
AAAI

The black-box nature of neural models has motivated a line of research that aims to generate natural language rationales to explain why a model made certain predictions. Such rationale generation… 

UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark

Nicholas Lourie, Ronan Le Bras, Chandra Bhagavatula, Yejin Choi
2021
AAAI

Commonsense AI has long been seen as a near impossible goal—until recently. Now, research interest has sharply increased with an influx of new benchmarks and models. We propose two new ways to… 

MultiTalk: A Highly-Branching Dialog Testbed for Diverse Conversations

Yao Dou, Maxwell Forbes, Ari Holtzman, Yejin Choi
2021
AAAI

We study conversational dialog in which there are many possible responses to a given history. We present the MultiTalk Dataset, a corpus of over 320,000 sentences of written conversational dialog… 

Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes

Nicholas Lourie, Ronan Le Bras, Yejin Choi
2021
AAAI

As AI systems become an increasing part of people's everyday lives, it becomes ever more important that they understand people's ethical norms. Motivated by descriptive ethics, a field of study that… 

GENIE: A Leaderboard for Human-in-the-Loop Evaluation of Text Generation

Daniel Khashabi, Gabriel Stanovsky, Jonathan Bragg, Daniel S. Weld
2021
arXiv

Leaderboards have eased model development for many NLP datasets by standardizing their evaluation and delegating it to an independent external repository. Their adoption, however, is so far limited… 

On-the-Fly Attention Modularization for Neural Generation

Yue Dong, Chandra Bhagavatula, Ximing Lu, Yejin Choi
2021
arXiv

Despite considerable advancements with deep neural language models (LMs), neural text generation still suffers from degeneration: generated text is repetitive, generic, self-inconsistent, and lacking… 

VinVL: Revisiting Visual Representations in Vision-Language Models

Pengchuan Zhang, Xiujun Li, Xiaowei Hu, Jianfeng Gao
2021
CVPR

This paper presents a detailed study of improving visual representations for vision language (VL) tasks and develops an improved object detection model to provide object-centric representations of… 

CLUE: A Chinese Language Understanding Evaluation Benchmark

L. Xu, X. Zhang, L. Li, et al.
2020
COLING

We introduce CLUE, a Chinese Language Understanding Evaluation benchmark. It contains eight different tasks, including single-sentence classification, sentence pair classification, and machine… 

Edited Media Understanding: Reasoning About Implications of Manipulated Images

Jeff Da, Maxwell Forbes, Rowan Zellers, Yejin Choi
2020
arXiv

Multimodal disinformation, from 'deepfakes' to simple edits that deceive, is an important societal problem. Yet at the same time, the vast majority of media edits are harmless -- such as a filtered…