HellaSwag

Mosaic • 2019

HellaSWAG is a dataset for studying grounded commonsense inference. It consists of 70k multiple choice questions about grounded situations: each question comes from one of two domains -- activitynet or wikihow -- with four answer choices about what might happen next in the scene. The correct answer is the (real) sentence for the next event; the three incorrect answers are adversarially generated and human verified, so as to fool machines but not humans.

Download Read Paper

License: MIT

Leaderboard

Top Public Submissions

Details	Created	Accuracy
1 CompassMTL Microsoft & SJTU	5/11/2022	96%
2 DeBERTa Large DeCLaRe Lab, SUTD	5/20/2022	96%
3 CreAT Hongqiu Wu	5/3/2022	95%
4 DeBERTa MCQ EMNLP Paper 3842 Authors	6/3/2022	95%
5 DeBERTa Large Anonymous	4/14/2022	94%

View Leaderboard

Natural Language Processing

Computer Vision

AI for the Environment

Experimentation and Communication

Research

Research

HellaSwag

Leaderboard