Ai2

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.


Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection

Suchin Gururangan, Dallas Card, Sarah K. Dreier, Noah A. Smith
2022
EMNLP

Language models increasingly rely on massive web dumps for diverse text data. However, these sources are rife with undesirable content. As such, resources like Wikipedia, books, and news often… 

UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models

Tianbao Xie, Chen Henry Wu, Peng Shi, Tao Yu
2022
EMNLP

Structured knowledge grounding (SKG) leverages structured knowledge to complete user requests, such as semantic parsing over databases and question answering over knowledge bases. Since the inputs… 

Twist Decoding: Diverse Generators Guide Each Other

Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Noah A. Smith
2022
EMNLP

Natural language generation technology has recently seen remarkable progress with large-scale training, and many natural language applications are now built upon a wide range of generation models.… 

Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Daniel Khashabi
2022
EMNLP

How well can NLP models generalize to a variety of unseen tasks when provided with task instructions? To address this question, we first introduce SUPER-NATURALINSTRUCTIONS, a benchmark of 1,616… 

GENIE: Toward Reproducible and Standardized Human Evaluation for Text Generation

Daniel Khashabi, Gabriel Stanovsky, Jonathan Bragg, Daniel S. Weld
2022
EMNLP

While often assumed a gold standard, effective human evaluation of text generation remains an important, open area for research. We revisit this problem with a focus on producing consistent…

How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained Transformers

Michael Hassid, Hao Peng, Daniel Rotem, Roy Schwartz
2022
EMNLP Findings

The attention mechanism is considered the backbone of the widely-used Transformer architecture. It contextualizes the input by computing input-specific attention matrices. We find that this… 

In-Context Learning for Few-Shot Dialogue State Tracking

Yushi Hu, Chia-Hsuan Lee, Tianbao Xie, Mari Ostendorf
2022
EMNLP Findings

Collecting and annotating task-oriented dialogues is time-consuming and costly. Thus, zero and few shot learning for dialogue tasks presents an exciting opportunity. In this work, we propose an… 

Unsupervised Learning of Hierarchical Conversation Structure

Bo-Ru Lu, Yushi Hu, Hao Cheng, Mari Ostendorf
2022
EMNLP Findings

Human conversations can evolve in many different ways, creating challenges for automatic understanding and summarization. Goal-oriented conversations often have meaningful sub-dialogue structure,… 

Knowledge Transfer from Answer Ranking to Answer Generation

Matteo Gabburo, Rik Koncel-Kedziorski, Siddhant Garg, Alessandro Moschitti
2022
EMNLP

Recent studies show that Question Answering (QA) based on Answer Sentence Selection (AS2) can be improved by generating an improved answer from the top-k ranked answer sentences (termed GenQA). This… 

Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection

Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti
2022
EMNLP

An important task for designing QA systems is answer sentence selection (AS2): selecting the sentence containing (or constituting) the answer to a question from a set of retrieved relevant…