Ai2

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Exploring the Challenges of Open Domain Multi-Document Summarization

John Giorgi, Luca Soldaini, Bo Wang, Arman Cohan
2022
arXiv

Multi-document summarization (MDS) has traditionally been studied assuming a set of ground-truth topic-related input documents is provided. In practice, the input document set is unlikely to be… 

I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation

Chandra Bhagavatula, Jena D. Hwang, Doug Downey, Yejin Choi
2022
ACL

Pre-trained language models, despite their rapid advancements powered by scale, still fall short of robust commonsense capabilities. And yet, scale appears to be the winning recipe; after all, the… 

Reproducible scaling laws for contrastive language-image learning

Mehdi Cherti, Romain Beaumont, Ross Wightman, J. Jitsev
2022
arXiv

Scaling up neural networks has led to remarkable performance across a wide range of tasks. Moreover, performance often follows reliable scaling laws as a function of training set size, model size,… 

Hyperdecoders: Instance-specific decoders for multi-task NLP

Hamish Ivison, Matthew E. Peters
2022
Findings of EMNLP

We investigate input-conditioned hypernetworks for multi-tasking in NLP, generating parameter-efficient adaptations for a decoder using a hypernetwork conditioned on the output of an encoder. This… 

Continued Pretraining for Better Zero- and Few-Shot Promptability

Zhaofeng Wu, Robert L. Logan IV, Pete Walsh, Iz Beltagy
2022
EMNLP

Recently introduced language model prompting methods can achieve high accuracy in zero- and few-shot settings while requiring few to no learned task-specific parameters. Nevertheless, these methods… 

Exploring The Landscape of Distributional Robustness for Question Answering Models

Anas Awadalla, Mitchell Wortsman, Gabriel Ilharco, Ludwig Schmidt
2022
Findings of EMNLP

We conduct a large empirical evaluation to investigate the landscape of distributional robustness in question answering. Our investigation spans over 350 models and 16 question answering datasets,… 

Lila: A Unified Benchmark for Mathematical Reasoning

Swaroop Mishra, Matthew Finlayson, Pan Lu, Ashwin Kalyan
2022
EMNLP

Mathematical reasoning skills are essential for general-purpose intelligent systems to perform tasks from grocery shopping to climate modeling. Towards evaluating and improving AI systems in this… 

Statistical and Computational Guarantees for Influence Diagnostics

Jillian Fisher, Lang Liu, Krishna Pillutla, Zaid Harchaoui
2022
arXiv

Influence diagnostics such as influence functions and approximate maximum influence perturbations are popular in machine learning and in AI domain applications. Influence diagnostics are powerful… 

What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment

Matthew Finlayson, Kyle Richardson, Ashish Sabharwal, Peter Clark
2022
EMNLP

The instruction learning paradigm—where a model learns to perform new tasks from task descriptions alone—has become popular in general-purpose model research. The capabilities of large transformer… 

Teaching Broad Reasoning Skills via Decomposition-Guided Contexts

Harsh Trivedi, Niranjan Balasubramanian, Tushar Khot, Ashish Sabharwal
2022
EMNLP

Question-answering datasets require a broad set of reasoning skills. We show how to use question decompositions to teach language models these broad reasoning skills in a robust fashion.…