Research - Papers
Explore a selection of our published work on a variety of key research challenges in AI.
Interactron: Embodied Adaptive Object Detection
Over the years various methods have been proposed for the problem of object detection. Recently, we have wit-nessed great strides in this domain owing to the emergence of powerful deep neural…
MuSiQue: Multihop Questions via Single-hop Question Composition
Multihop reasoning remains an elusive goal as existing multihop benchmarks are known to be largely solvable via shortcuts. Can we create a question answering (QA) dataset that, by construction,…
Correcting Coarse-Grid Weather and Climate Models by Machine Learning From Global Storm-Resolving Simulations
Global atmospheric `storm-resolving' models with horizontal grid spacing of less than 5~km resolve deep cumulus convection and flow in complex terrain. They promise to be reference models that could…
SCROLLS: Standardized CompaRison Over Long Language Sequences
NLP benchmarks have largely focused on short texts, such as sentences and paragraphs, even though long texts comprise a considerable amount of natural language in the wild. We introduce SCROLLS, a…
Computational Lens on Cognition: Study Of Autobiographical Versus Imagined Stories With Large-Scale Language Models
Lifelong experiences and learned knowledge lead to shared expectations about how common situations tend to unfold. Such knowledge enables people to interpret story narratives and identify salient…
Imagined versus Remembered Stories: Quantifying Differences in Narrative Flow
Lifelong experiences and learned knowledge lead to shared expectations about how common situations tend to unfold. Such knowledge of narrative event flow enables people to weave together a story.…
PROMPT WAYWARDNESS: The Curious Case of Discretized Interpretation of Continuous Prompts
Fine-tuning continuous prompts for target tasks has recently emerged as a compact alternative to full model fine-tuning. Motivated by these promising results, we investigate the feasibility of…
UnifiedQA-v2: Stronger Generalization via Broader Cross-Format Training
We present UNIFIEDQA-v2, a QA model built with the same process as UNIFIEDQA, except that it utilizes more supervision – roughly 3× the number of datasets used for UNIFIEDQA. This generally leads to…
Vessel Detection in Sentinel-1 Imagery
In this document, we detail the approach in our xView3 submission. The xView3 dataset presents the challenge of detecting vessels and other maritime objects in synthetic aperture radar (SAR) images…
Tropical Cirrus in Global Storm‐Resolving Models: 2. Cirrus Life Cycle and Top‐of‐Atmosphere Radiative Fluxes
Cirrus clouds of various thicknesses and radiative characteristics extend over much of the tropics, especially around deep convection. They are difficult to observe due to their high altitude and…