Skip to main content ->
Ai2

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Filter papers

Complexity-Based Prompting for Multi-Step Reasoning

Yao FuHao PengAshish SabharwalTushar Khot
2023
ICLR

We study the task of prompting large-scale language models to perform multi-step reasoning. Existing work shows that when prompted with a chain of thoughts (CoT), sequences of short sentences… 

Decomposed Prompting: A Modular Approach for Solving Complex Tasks

Tushar KhotHarsh TrivediMatthew FinlaysonAshish Sabharwal
2023
ICLR

Few-shot prompting is a surprisingly powerful way to use Large Language Models (LLMs) to solve various tasks. However, this approach struggles as the task complexity increases or when the individual… 

Toxicity in ChatGPT: Analyzing Persona-assigned Language Models

A. DeshpandeVishvak MurahariTanmay RajpurohitKarthik Narasimhan
2023
arXiv.org

Large language models (LLMs) have shown incredible capabilities and transcended the natural language processing (NLP) community, with adoption throughout many services like healthcare, therapy,… 

The Parallelism Tradeoff: Limitations of Log-Precision Transformers

William MerrillAshish Sabharwal
2023
TACL • ACL

Abstract Despite their omnipresence in modern NLP, characterizing the computational power of transformer neural nets remains an interesting open question. We prove that transformers whose arithmetic… 

Specializing Smaller Language Models towards Multi-Step Reasoning

Yao FuHao PengLitu OuTushar Khot
2023
ICML

The surprising ability of Large Language Models (LLMs) to perform well on complex reasoning with only few-shot chain-of-thought prompts is believed to emerge only in very large-scale models (100+… 

ProKnow: Process knowledge for safety constrained and explainable question generation for mental health diagnostic assistance

Kaushik RoyManas GaurMisagh SoltaniAmit P. Sheth
2023
Frontiers in Big Data

Virtual Mental Health Assistants (VMHAs) are utilized in health care to provide patient services such as counseling and suggestive care. They are not used for patient diagnostic assistance because… 

I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons

Pei ZhouAndrew ZhuJennifer HuPrithviraj Ammanabrolu
2022
arXiv

We propose a novel task, G4C, to study teacher-student natural language interactions in a goal-driven and grounded environment. Dungeons and Dragons (D&D), a role-playing game, provides an ideal… 

Lila: A Unified Benchmark for Mathematical Reasoning

Swaroop MishraMatthew FinlaysonPan LuAshwin Kalyan
2022
EMNLP

Mathematical reasoning skills are essential for general-purpose intelligent systems to perform tasks from grocery shopping to climate modeling. Towards evaluating and improving AI systems in this… 

Entailer: Answering Questions with Faithful and Truthful Chains of Reasoning

Oyvind TafjordBhavana Dalvi MishraPeter Clark
2022
EMNLP

Our goal is a question-answering (QA) system that can show how its answers are implied by its own internal beliefs via a systematic chain of reasoning . Such a capability would allow better… 

Teaching Broad Reasoning Skills via Decomposition-Guided Contexts

Harsh TrivediNiranjan BalasubramanianTushar KhotAshish Sabharwal
2022
EMNLP

Question-answering datasets require a broad set of reasoning skills. We show how to use question decompositions to teach language models these broad reasoning skills in a robust fashion.…