Multihop Questions via Single-hop Question Composition

Aristo • 2022

MuSiQue is a multihop reading comprehension dataset with 2-4 hop questions, built by composing seed questions from 5 existing single-hop datasets. The dataset is constructed with a bottom-up approach that systematically selects composable pairs of single-hop questions that are connected, i.e., where one reasoning step requires information from the other. This approach allows greater control over the properties of the resulting k-hop questions, allowing us to create a dataset that is substantially less cheatable (e.g. by shortcut-based or singlehop reasoning) and more challenging than prior similar datasets. MuSiQue comes in two variations -- MuSiQue-Answerable, which contains only answerable questions, and MuSiQue-Full, which contains both answerable and unanswerable questions. In the latter, each answerable question from MuSiQue-Answerable is paired with closely similar unanswerable question. In MuSiQue-Answerable, the task is to identify the answer and the supporting paragraphs, given a question and a context of up to 20 paragraphs. In MuSiQue-Full, the task is to first determine whether the question is answerable from the given context, and if it is, identify the answer and the supporting paragraphs.

Download Read Paper View Website View Repo

License: CC BY

Leaderboard

Top Public Submissions

Details	Created	Support+Sufficiency F1
1 OFF (Online Finetuned Flow) Paul Mineiro from Microsoft Research	6/11/2024	89%
2 Select+Answer (SA) Model Harsh Trivedi,Niranjan Balasubramanian, Tushar Khot, Ashish Sabharwal	7/4/2022	42%
3 Step Execution by End2End (EX(EE)) Model Harsh Trivedi,Niranjan Balasubramanian, Tushar Khot, Ashish Sabharwal	5/6/2022	44%
3 Step Execution by Select+Answer (EX(SA)) Model Harsh Trivedi,Niranjan Balasubramanian, Tushar Khot, Ashish Sabharwal	7/4/2022	44%
5 End2End (EE) Model Harsh Trivedi,Niranjan Balasubramanian, Tushar Khot, Ashish Sabharwal	5/6/2022	26%

View Leaderboard

Authors

Harsh Trivedi, Niranjan Balasubramanian, Tushar Khot, Ashish Sabharwal

Natural Language Processing

Computer Vision

AI for the Environment

Experimentation and Communication

Research

Research

Multihop Questions via Single-hop Question Composition

Leaderboard

Authors