Research - Papers

Explore a selection of our published work on a variety of key research challenges in AI.

Estimating the Causal Effect of Early ArXiving on Paper Acceptance

Yanai ElazarJiayao ZhangDavid WaddenNoah A. Smith

2024

CLearR

What is the effect of releasing a preprint of a paper before it is submitted for peer review? No randomized controlled trial has been conducted, so we turn to observational data to answer this…

The precipitation response to warming and CO2 increase: A comparison of a global storm resolving model and CMIP6 models.

Ilai GuendelmanTimothy M. MerlisKai-Yuan ChengStephan Fueglistaler

2024

Geophysical Research Letters

Global storm-resolving models (GSRMs) can explicitly resolve some of deep convection are now being integrated for climate timescales. GSRMs are able to simulate more realistic precipitation…

FigurA11y: AI Assistance for Writing Scientific Alt Text

Nikhil SinghLucy Lu WangJonathan Bragg

2024

IUI

High-quality alt text is crucial for making scientific figures accessible to blind and low-vision readers. Crafting complete, accurate alt text is challenging even for domain experts, as published…

Emulation of cloud microphysics in a climate model

W. Andre PerkinsNoah D. BrenowitzChristopher S. BrethertonJacqueline M. Nugent

2024

JAMES

We present a machine learning based emulator of a microphysics scheme for condensation and precipitation processes (Zhao-Carr) used operationally in a global atmospheric forecast model (FV3GFS). Our…

Closing the Curious Case of Neural Text Degeneration

Matthew FinlaysonJohn HewittAlexander KollerAshish Sabharwal

2024

ICLR

Despite their ubiquity in language generation, it remains unknown why truncation sampling heuristics like nucleus sampling are so effective. We provide a theoretical explanation for the…

A machine learning parameterization of clouds in a coarse-resolution climate model for unbiased radiation

Brian HennYakelyn R. JaureguiSpencer K. ClarkChristopher S. Bretherton

2024

JAMES

Coarse-grid weather and climate models rely particularly on parameterizations of cloud fields, and coarse-grained cloud fields from a fine-grid reference model are a natural target for a…

Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic

Nathaniel WeirKate SandersOrion WellerBenjamin Van Durme

2024

arXiv.org

Contemporary language models enable new opportunities for structured reasoning with text, such as the construction and evaluation of intuitive, proof-like textual entailment trees without relying on…

A Survey on Data Selection for Language Models

Alon AlbalakYanai ElazarSang Michael XieWilliam Yang Wang

2024

arXiv

A major factor in the recent success of large language models is the use of enormous and ever-growing text datasets for unsupervised pre-training. However, naively training a model on all available…

Calibrating Large Language Models with Sample Consistency

Qing LyuKumar ShridharChaitanya MalaviyaChris Callison-Burch

2024

arXiv

Accurately gauging the confidence level of Large Language Models' (LLMs) predictions is pivotal for their reliable application. However, LLMs are often uncalibrated inherently and elude conventional…

Improving Stratocumulus Cloud Amounts in a 200‐m Resolution Multi‐Scale Modeling Framework Through Tuning of Its Interior Physics

Liran PengP. BlosseyW. HannahM. Pritchard

2024

Journal of Advances in Modeling Earth Systems

High‐Resolution Multi‐scale Modeling Frameworks (HR)—global climate models that embed separate, convection‐resolving models with high enough resolution to resolve boundary layer eddies—have exciting…

Previous141-150Next