Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Narrative Scaffolding: A Narrative-First Framework for Data-Driven Sensemaking

Oliver HuangMuhammad FatirTian LuoCarolina Nobre

2026

International Conference on Intelligent User Interfaces (IUI)

When exploring data, analysts construct narratives about what the data means by asking questions, generating visualizations, reflecting on patterns, and revising their interpretations as new…

AIMIP Phase 1: systematic evaluations of AI weather and climate models

Brian HennChristopher S. BrethertonNikolay KodunovIgnacio Lopez-Gomez

2026

arXiv

We present the AI weather and climate model intercomparison project (AIMIP), phase 1. Drawing from the rich tradition of intercomparisons in climate model development, we specify a common…

Cocoa: Co-Planning and Co-Execution with AI Agents

K. FengKevin PuMatt LatzkeJoseph Chee Chang

2026

Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems

As AI agents take on increasingly long-running tasks involving sophisticated planning and execution, there is a corresponding need for novel interaction designs that enable deeper human-agent…

Perspectra: Choosing Your Experts Enhances Critical Thinking in Multi-Agent Research Ideation

Yiren LiuViraj ShahSangho SuhYun Huang

2026

Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems

Early-stage interdisciplinary research ideation is often challenged by limited expert access, uncertainty about what to ask, and the cognitive burden of synthesizing unfamiliar domain perspectives.…

FloeNet: A mass-conserving global sea ice emulator that generalizes across climates

William GregoryM. BushukJames P. C. DuncanL. Zanna

2026

arXiv

We introduce FloeNet, a machine-learning emulator trained on the Geophysical Fluid Dynamics Laboratory global sea ice model, SIS2. FloeNet is a mass-conserving model, emulating 6-hour mass and area…

Examining Fast Radiative Feedbacks Using Machine-Learning Weather Emulators

Ankur MaheshWilliam D. CollinsTravis A. O'BrienDa Yang

2026

arXiv

The response of the climate system to increased greenhouse gases and other radiative perturbations is governed by a combination of fast and slow feedbacks. Slow feedbacks are typically activated in…

HiRO-ACE: Fast and skillful AI emulation and downscaling trained on a 3 km global storm-resolving model

Andre PerkinsAnna KwaJeremy McGibbonLucas Harris

2025

arXiv

Kilometer-scale simulations of the atmosphere are an important tool for assessing local weather extremes and climate impacts, but computational expense limits their use to small regions, short…

Leveraging In-Context Learning for Language Model Agents

Shivanshu GuptaSameer SinghAshish SabharwalBen Bogin

2025

NeurIPS • Workshop on Multi-Turn Interactions in LLMs

In-context learning (ICL) with dynamically selected demonstrations combines the flexibility of prompting large language models (LLMs) with the ability to leverage training data to improve…

A Little Depth Goes a Long Way: The Expressive Power of Log-Depth Transformers

William MerrillAshish Sabharwal

2025

NeurIPS

Recent theoretical results show transformers cannot express sequential reasoning problems over long inputs, intuitively because their computational *depth* is bounded. However, prior work treats the…

Exact Expressive Power of Transformers with Padding

William MerrillAshish Sabharwal

2025

NeurIPS

Chain of thought is a natural inference-time method for increasing the computational power of transformer-based large language models (LLMs), but comes at the cost of sequential decoding. Are there…

1-10Next