The dataset contains three files:
- bioprocess-bank-questions.tar.gz: There is an xml file for each paragraph containing the paragraph ID, the questions and answers.
- process-bank-structures-train.tar.gz: These are the structure annotations used for training our structure predictor. Each paragraph has two files - one containing the text and one containing the annotation. This is standard BRAT format.
- process-bank-structures-test.tar.gz: These are structure annotations used for testing. They are also in BRAT format.