Tài liệu tham khảo:
Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:
ds = tfds.load('huggingface:onestop_qa')
- Sự miêu tả :
OneStopQA is a multiple choice reading comprehension dataset annotated according to the STARC (Structured Annotations for Reading Comprehension) scheme. The reading materials are Guardian articles taken from the [OneStopEnglish corpus](https://github.com/nishkalavallabhi/OneStopEnglishCorpus). Each article comes in three difficulty levels, Elementary, Intermediate and Advanced. Each paragraph is annotated with three multiple choice reading comprehension questions. The reading comprehension questions can be answered based on any of the three paragraph levels.
- Giấy phép : Giấy phép quốc tế Creative Commons Ghi công-Chia sẻ tương tự 4.0
- Phiên bản : 1.1.0
- Chia tách :
Tách ra | Ví dụ |
---|---|
'train' | 1458 |
- Đặc trưng :
{
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"paragraph": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"level": {
"num_classes": 3,
"names": [
"Adv",
"Int",
"Ele"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"paragraph_index": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"answers": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": 4,
"id": null,
"_type": "Sequence"
},
"a_span": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"d_span": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}