AI2 has the following science exam question datasets listed below available for download. We use this data internally for project Aristo, a system that aims to acquire and apply computable knowledge to answer a variety of science questions from standardized exams for students in multiple grade levels.
To evaluate your models, we have also built Aristo mini, a light-weight question answering system that can quickly evaluate science questions with an evaluation web server and provided baseline solvers. You can extend the provided solvers with your own implementation to try out new approaches and compare results.
The AI2 Science Questions dataset consists of questions used in student assessments in the United States across elementary and middle school grade levels. Each question is 4-way multiple choice format and may or may not include a diagram element. This dataset contains the following files:
The AI2 Science Questions Mercury dataset consists of questions used in student assessments across elementary and middle school grade levels, provided under license by an AI2 research partner. Each question is 4-way multiple choice format and may or may not include a diagram element. This dataset contains the following files:
These 4th grade science exam questions provide an important benchmark for measuring Aristo’s progress in our research into multiple choice question answering at the elementary science level. This dataset contains the following files:
If you’d like to be notified of future releases of datasets from AI2, please subscribe:
If you have any other questions or feedback for us about this data, please contact us at firstname.lastname@example.org.