An abstract illustration of swirling shapes, meant to denote a futuristic feeling.

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics

Ximing LuS. WelleckPeter WestYejin Choi

2022

NAACL

The dominant paradigm for neural text generation is left-to-right decoding from autoregressive language models. Constrained or controllable generation under complex lexical constraints, however,…

Paragraph-based Transformer Pre-training for Multi-Sentence Inference

Luca Di LielloSiddhant GargLuca SoldainiAlessandro Moschitti

2022

NAACL

Inference tasks such as answer sentence selection (AS2) or fact verification are typically solved by fine-tuning transformer-based models as individual sentence-pair classifiers. Recent studies show…

Reframing Human-AI Collaboration for Generating Free-Text Explanations

Sarah WiegreffeJack HesselSwabha SwayamdiptaYejin Choi

2022

NAACL

Large language models are increasingly capa-ble of generating ﬂuent-appearing text with relatively little task-speciﬁc supervision. But can these models accurately explain classiﬁcation decisions?…

Symbolic Knowledge Distillation: from General Language Models to Commonsense Models

Peter WestChandrasekhar BhagavatulaJack HesselYejin Choi

2022

NAACL

The common practice for training commonsense models has gone from–human–to– corpus–to–machine: humans author commonsense knowledge graphs in order to train commonsense models. In this work, we…

Time Waits for No One! Analysis and Challenges of Temporal Misalignment

Kelvin LuuDaniel KhashabiSuchin GururanganNoah A. Smith

2022

NAACL

When an NLP model is trained on text data from one time period and tested or deployed on data from another, the resulting temporal misalignment can degrade end-task performance. In this work, we…

Transparent Human Evaluation for Image Captioning

Jungo KasaiKeisuke SakaguchiLavinia DunaganNoah A. Smith

2022

NAACL

We establish a rubric-based human evaluation protocol for image captioning models. Our scoring rubrics and their definitions are carefully developed based on machineand humangenerated captions on…

Weakly Supervised Text-to-SQL Parsing through Question Decomposition

Tomer WolfsonDaniel DeutchJonathan Berant

2022

Findings of NAACL

Text-to-SQL parsers are crucial in enabling non-experts to effortlessly query relational data. Training such parsers, by contrast, generally requires expertise in annotating natural language (NL)…

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Aarohi SrivastavaAbhinav RastogiAbhishek B RaoUri Shaham

2022

arXiv

Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet…

Multi-LexSum: Real-World Summaries of Civil Rights Lawsuits at Multiple Granularities

Zejiang ShenKyle LoLauren YuDoug Downey

2022

arXiv

With the advent of large language models, methods for abstractive summarization have made great strides, creating potential for use in applications to aid knowledge workers processing unwieldy…

Data Governance in the Age of Large-Scale Data-Driven Language Technology

Yacine JerniteHuu NguyenStella Rose BidermanMargaret Mitchell

2022

FAccT

The recent emergence and adoption of Machine Learning technology, and specifically of Large Language Models, has drawn attention to the need for systematic and transparent management of language…

Previous412-421Next