参考文献:
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:onestop_qa')
- 説明:
OneStopQA is a multiple choice reading comprehension dataset annotated according to the STARC (Structured Annotations for Reading Comprehension) scheme. The reading materials are Guardian articles taken from the [OneStopEnglish corpus](https://github.com/nishkalavallabhi/OneStopEnglishCorpus). Each article comes in three difficulty levels, Elementary, Intermediate and Advanced. Each paragraph is annotated with three multiple choice reading comprehension questions. The reading comprehension questions can be answered based on any of the three paragraph levels.
- ライセンス: クリエイティブ・コモンズ 表示 - 継承 4.0 国際ライセンス
- バージョン: 1.1.0
- 分割:
スプリット | 例 |
---|---|
'train' | 1458年 |
- 特徴:
{
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"paragraph": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"level": {
"num_classes": 3,
"names": [
"Adv",
"Int",
"Ele"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"paragraph_index": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"answers": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": 4,
"id": null,
"_type": "Sequence"
},
"a_span": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"d_span": {
"feature": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}