To load this dataset in TFDS, use the following code:
import tensorflow_datasets as tfds
ds = tfds.load('huggingface:onestop_qa')
- Description:
OneStopQA is a multiple choice reading comprehension dataset annotated according to the STARC (Structured Annotations for Reading Comprehension) scheme. The reading materials are Guardian articles taken from the [OneStopEnglish corpus](https://github.com/nishkalavallabhi/OneStopEnglishCorpus). Each article comes in three difficulty levels, Elementary, Intermediate and Advanced. Each paragraph is annotated with three multiple choice reading comprehension questions. The reading comprehension questions can be answered based on any of the three paragraph levels.
- License: Creative Commons Attribution-ShareAlike 4.0 International License
- Version: 1.1.0
- Splits:
Split | Examples |
---|---|
'train' | 1458 |
- Features:
{
    "title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "paragraph": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "level": {
        "num_classes": 3,
        "names": [
            "Adv",
            "Int",
            "Ele"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    },
    "question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "paragraph_index": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "answers": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": 4,
        "id": null,
        "_type": "Sequence"
    },
    "a_span": {
        "feature": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "d_span": {
        "feature": {
            "dtype": "int32",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}
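
The feature schema above can be mirrored in plain Python to show how a single record might be checked and how the integer `level` id maps back to its name. This is a minimal sketch, not part of TFDS or the dataset tooling: `decode_level`, `check_record`, and the `sample` record are hypothetical names and data invented for illustration, and the code assumes the record has already been converted to plain Python values (strings, ints, lists) rather than tensors or bytes.

```python
from typing import Any, Dict, List

# Label order copied from the "names" list of the "level" ClassLabel feature.
LEVEL_NAMES: List[str] = ["Adv", "Int", "Ele"]
# "length": 4 in the "answers" Sequence feature.
NUM_ANSWERS = 4


def decode_level(level_id: int) -> str:
    """Map an integer class id to its level name."""
    if not 0 <= level_id < len(LEVEL_NAMES):
        raise ValueError(f"level id out of range: {level_id}")
    return LEVEL_NAMES[level_id]


def check_record(record: Dict[str, Any]) -> None:
    """Raise ValueError if the record does not match the schema's shape."""
    for key in ("title", "paragraph", "question"):
        if not isinstance(record[key], str):
            raise ValueError(f"{key} must be a string")
    if not isinstance(record["paragraph_index"], int):
        raise ValueError("paragraph_index must be an int")
    decode_level(record["level"])  # raises if the id is not 0, 1, or 2
    answers = record["answers"]
    if len(answers) != NUM_ANSWERS or not all(isinstance(a, str) for a in answers):
        raise ValueError(f"answers must hold exactly {NUM_ANSWERS} strings")
    for key in ("a_span", "d_span"):
        # "length": -1 means variable length, so only element types are checked.
        if not all(isinstance(i, int) for i in record[key]):
            raise ValueError(f"{key} must contain only ints")


# Hypothetical record, invented for illustration; not taken from the dataset.
sample: Dict[str, Any] = {
    "title": "Example article",
    "paragraph": "A short made-up paragraph about reading.",
    "level": 2,
    "question": "What is the paragraph about?",
    "paragraph_index": 0,
    "answers": ["Reading", "Writing", "Cooking", "Sailing"],
    "a_span": [0, 6],
    "d_span": [],
}

check_record(sample)
print(decode_level(sample["level"]))
```

The check deliberately stops at types and lengths. The schema does not say what the integers in `a_span` and `d_span` index into, so the sketch makes no attempt to interpret them.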