Skip to main content ->
Ai2

Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Filter papers

Let Me Teach You: Pedagogical Foundations of Feedback for Language Models

Beatriz BorgesNiket TandonTanja KaserAntoine Bosselut
2023
arXiv

Natural Language Feedback (NLF) is an increasingly popular avenue to align Large Language Models (LLMs) to human preferences. Despite the richness and diversity of the information it can convey, NLF… 

Chain-of-Thought Hub: A Continuous Effort to Measure Large Language Models' Reasoning Performance

Yao FuLitu OuMingyu ChenTushar Khot
2023
ICML 2023, the Challenges in Deployable Generative AI workshop

As large language models (LLMs) are continuously being developed, their evaluation becomes increasingly important yet challenging. This work proposes Chain-of-Thought Hub, an open-source evaluation… 

The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks

Nikil SelvamSunipa DevDaniel KhashabiKai-Wei Chang
2023
ACL

How reliably can we trust the scores obtained from social bias benchmarks as faithful indicators of problematic social biases in a given language model? In this work, we study this question by… 

Aligning Language Models to User Opinions

EunJeong HwangBodhisattwa Prasad MajumderNiket Tandon
2023
arXiv

An important aspect of developing LLMs that interact with humans is to align models' behavior to their users. It is possible to prompt an LLM into behaving as a certain persona, especially a user… 

Anthropomorphization of AI: Opportunities and Risks

A. DeshpandeTanmay RajpurohitKarthik NarasimhanA. Kalyan
2023
arXiv.org

Anthropomorphization is the tendency to attribute human-like traits to non-human entities. It is prevalent in many social contexts -- children anthropomorphize toys, adults do so with brands, and it… 

CSTS: Conditional Semantic Textual Similarity

A. DeshpandeCarlos E. JimenezHoward ChenKarthik Narasimhan
2023
arXiv.org

Semantic textual similarity (STS) has been a cornerstone task in NLP that measures the degree of similarity between a pair of sentences, with applications in information retrieval, question… 

OpenPI2.0: An Improved Dataset for Entity Tracking in Texts

Li ZhangHai XuAbhinav KommulaChris Callison-Burch
2023
arXiv

Representing texts as information about entities has long been deemed effective in event reasoning. We propose OpenPI2.0, an improved dataset for tracking entity states in procedural texts.… 

Improving Language Models via Plug-and-Play Retrieval Feedback

Wenhao YuZhihan ZhangZhenwen LiangAshish Sabharwal
2023
arXiv

Large language models (LLMs) exhibit remarkable performance across various NLP tasks. However, they often generate incorrect or hallucinated information, which hinders their practical applicability… 

Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback

Yao FuHao PengTushar KhotMirella Lapata
2023
arXiv.org

We study whether multiple large language models (LLMs) can autonomously improve each other in a negotiation game by playing, reflecting, and criticizing. We are interested in this question because… 

Can AI language models replace human participants?

Danica DillionNiket TandonYuling GuKurt Gray
2023
Trends in Cognitive Sciences

Recent work suggests that language models such as GPT can make human-like judgments across a number of domains. We explore whether and when language models might replace human participants in…