참고자료:
높은
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:race/high')
- 설명 :
Race is a large-scale reading comprehension dataset with more than 28,000 passages and nearly 100,000 questions. The
dataset is collected from English examinations in China, which are designed for middle school and high school students.
The dataset can be served as the training and test sets for machine comprehension.
- 라이센스 : 알려진 라이센스 없음
- 버전 : 0.1.0
- 분할 :
나뉘다 | 예 |
---|---|
'test' | 3498 |
'train' | 62445 |
'validation' | 3451 |
- 특징 :
{
"example_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"answer": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"options": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
가운데
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:race/middle')
- 설명 :
Race is a large-scale reading comprehension dataset with more than 28,000 passages and nearly 100,000 questions. The
dataset is collected from English examinations in China, which are designed for middle school and high school students.
The dataset can be served as the training and test sets for machine comprehension.
- 라이센스 : 알려진 라이센스 없음
- 버전 : 0.1.0
- 분할 :
나뉘다 | 예 |
---|---|
'test' | 1436 |
'train' | 25421 |
'validation' | 1436 |
- 특징 :
{
"example_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"answer": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"options": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
모두
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:race/all')
- 설명 :
Race is a large-scale reading comprehension dataset with more than 28,000 passages and nearly 100,000 questions. The
dataset is collected from English examinations in China, which are designed for middle school and high school students.
The dataset can be served as the training and test sets for machine comprehension.
- 라이센스 : 알려진 라이센스 없음
- 버전 : 0.1.0
- 분할 :
나뉘다 | 예 |
---|---|
'test' | 4934 |
'train' | 87866 |
'validation' | 4887 |
- 특징 :
{
"example_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"article": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"answer": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"options": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}