Click Download to get a link to the training and development sets of the latest version of the dataset.
We discovered that the start indices for a small number of answers (fewer than 2%) in the training, development, and test sets of v0.1 of the dataset were slightly off due to Unicode processing issues. We fixed those issues in v0.2. Note that the leaderboard results should not be affected by this fix, since the metrics are computed over answer strings, not spans.
If you need v0.1 of the dataset for any reason, you can get it here.
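To illustrate why a shifted start index does not change a string-based score, here is a minimal sketch. It assumes a SQuAD-style record with `text` and `answer_start` fields and a SQuAD-style normalization (lowercasing, stripping punctuation and articles); those field names, the normalization, and the sample passage are illustrative assumptions, not taken from the official evaluation script.

```python
import re
import string
from typing import List


def normalize(text: str) -> str:
    """Assumed SQuAD-style normalization: lowercase, drop punctuation and articles."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold_answers: List[str]) -> bool:
    """Compare answer strings only; start indices are never consulted."""
    return any(normalize(prediction) == normalize(g) for g in gold_answers)


def start_index_is_consistent(context: str, answer_text: str, answer_start: int) -> bool:
    """Check that the span beginning at answer_start spells out the answer text."""
    return context[answer_start:answer_start + len(answer_text)] == answer_text


# Hypothetical record whose start index is off by one (the correct value is 8).
context = "Zoë met Ana in Seattle."
gold = {"text": "Ana", "answer_start": 9}

print(exact_match("Ana", [gold["text"]]))  # string metric ignores the bad index
print(start_index_is_consistent(context, gold["text"], gold["answer_start"]))
print(context.find(gold["text"]))  # one way to re-locate the span
```

A check like `start_index_is_consistent` is only relevant if your model trains on spans; if you still use v0.1, re-locating the answer with `str.find` is one possible workaround, though it picks the first occurrence, which may not be the annotated one.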
Rank | Details | Created | Exact Match |
---|---|---|---|
1 | anonymous anonymous | 11/25/2022 | 83% |
2 | SpanQualifier (CorefRoBERTa Large) Anonymous | 10/12/2022 | 81% |
3 | SpanQualifier (Roberta Large) Anonymous | 10/12/2022 | 81% |
4 | TASE - CorefRoBERTa Tsinghua University (Deming Ye, Yankai Lin, Jiaju Du, Zhenghao Liu, Maosong Sun, Zhiyuan Liu) | 5/15/2020 | 81% |
5 | TASE-CoNLL-joint-qgen Mingzhu Wu, Nafise Sadat Moosavi, Dan Roth, Iryna Gurevych | 12/14/2020 | 80% |
Pradeep Dasigi, Nelson F. Liu, Ana Marasović, Noah A. Smith, Matt Gardner