Datasets

Viewing 41-44 of 44 datasets
  • AI2 Biology How/Why Corpus

    378 biology questionsAristo • 2014This dataset consists of 185 "how" and 193 "why" biology questions authored by a domain expert, with one or more gold answer passages identified in an undergraduate textbook.
  • AI2 Geometry Questions

    100 geometry questions2014These questions guide our research into Question Answering for geometry exams. Focus is on the high school level.
  • AI2 Meaningful Citations Data Set

    630 paper annotationsSemantic Scholar • 2014This dataset is comprised of annotations for 465 computer science papers. The annotations indicate whether a citation is important (i.e., refers to ongoing or continued work on the relevant topic) or not and then assigns the citation one of four importance rankings.
  • AI2 ProcessBank Data

    200 annotated paragraphs about biological processesAristo • 2014The dataset consists of 200 paragraphs that describe biological processes. Each paragraph is annotated with its process structure, and accompanied by a few multiple-choice questions about the process. Each question has two possible answers of which exactly one is correct. This dataset was used to train a system to automatically extract process models from paragraphs that describe processes.