Click Download to get a link to the training and development sets of the latest version of the dataset.
We discovered that the start indices for a small number of answers (less than 2%) in the training, development, and test sets of v0.1 of the dataset were slightly off because of Unicode processing issues. We fixed those issues in v0.2. The leaderboard results should not be affected by this fix, since the metrics are computed over answer strings, not over span offsets.
If you need v0.1 of the dataset for any reason, you can get it here.
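To illustrate why an offset error leaves string-based metrics unchanged, here is a minimal sketch. It assumes SQuAD-style answer normalization (lowercasing, removing punctuation and English articles); the official evaluation script may normalize differently. The function names and the sample passage are hypothetical and are not taken from the dataset.

```python
import re
import string


def normalize_answer(text: str) -> str:
    """Lowercase, drop ASCII punctuation and English articles, collapse whitespace.

    Assumption: this mirrors common SQuAD-style normalization, not necessarily
    the exact rules of the official evaluation script.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold: str) -> bool:
    """Compare answers as normalized strings; span offsets play no role."""
    return normalize_answer(prediction) == normalize_answer(gold)


def span_is_aligned(context: str, answer_text: str, answer_start: int) -> bool:
    """Check that the stored start index really points at the answer text."""
    return context[answer_start:answer_start + len(answer_text)] == answer_text


# Hypothetical passage containing a non-ASCII character before the answer.
context = "Zoë met Ana in Paris. She later moved to Lyon."
gold_text = "Ana"
good_start = context.index(gold_text)
bad_start = good_start + 1  # simulates an off-by-one offset error

print(span_is_aligned(context, gold_text, good_start))
print(span_is_aligned(context, gold_text, bad_start))
print(exact_match("ana.", gold_text))
```

A check like `span_is_aligned` is one way to spot misaligned offsets in a local copy of the data, while `exact_match` shows that the score depends only on the answer strings.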
Rank | Model | Submitter | Created | Exact Match |
---|---|---|---|---|
1 | anonymous | anonymous | 11/25/2022 | 83% |
2 | SpanQualifier (CorefRoBERTa Large) | Nanjing University (Zixian Huang, Jiaying Zhou, Chenxu Niu, Gong Cheng) | 10/12/2022 | 81% |
3 | SpanQualifier (Roberta Large) | Nanjing University (Zixian Huang, Jiaying Zhou, Chenxu Niu, Gong Cheng) | 10/12/2022 | 81% |
4 | TASE - CorefRoBERTa | Tsinghua University (Deming Ye, Yankai Lin, Jiaju Du, Zhenghao Liu, Maosong Sun, Zhiyuan Liu) | 5/15/2020 | 81% |
5 | TASE-CoNLL-joint-qgen | Mingzhu Wu, Nafise Sadat Moosavi, Dan Roth, Iryna Gurevych | 12/14/2020 | 80% |
Pradeep Dasigi, Nelson F. Liu, Ana Marasović, Noah A. Smith, Matt Gardner