References:
alignments
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/alignments')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'full' | 10834 |
- Features:
{
"source_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"target_id_list": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
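The `alignments` schema above pairs one `source_id` with a variable-length `target_id_list`. A minimal sketch of turning such rows into a lookup table follows, assuming each row arrives as a plain Python dict. The ids in `rows` are made up for illustration and are not real EXAMS ids, and `build_alignment_index` is a hypothetical helper, not part of TFDS.

```python
from collections import defaultdict

# Hypothetical rows shaped like the `alignments` features.
rows = [
    {"source_id": "q-001", "target_id_list": ["q-101", "q-102"]},
    {"source_id": "q-002", "target_id_list": []},
    {"source_id": "q-001", "target_id_list": ["q-103"]},
]


def build_alignment_index(rows):
    """Map each source_id to the concatenation of its target ids."""
    index = defaultdict(list)
    for row in rows:
        # Indexing the defaultdict creates the key even when the list is empty.
        index[row["source_id"]].extend(row["target_id_list"])
    return dict(index)


alignment_index = build_alignment_index(rows)
```

Merging repeated `source_id` rows is a design choice of this sketch; the source does not say whether ids repeat.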
multilingual
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/multilingual')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 13510 |
'train' | 7961 |
'validation' | 2672 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
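In the feature spec above, `question.choices` is a `Sequence` over a struct with `text`, `label` and `para`. A minimal sketch of resolving `answerKey` to the text of the correct choice follows. It assumes the Hugging Face convention in which a sequence of a dict feature is materialized as a dict of parallel lists. The `sample` record and the helper `correct_choice_text` are invented for illustration; the field values are placeholders, not real dataset content.

```python
# Hypothetical record shaped like the `multilingual` features.
sample = {
    "id": "example-0",
    "question": {
        "stem": "Which planet is closest to the Sun?",
        "choices": {
            "text": ["Venus", "Mercury", "Mars", "Earth"],
            "label": ["A", "B", "C", "D"],
            "para": ["", "", "", ""],
        },
    },
    "answerKey": "B",
    "info": {"grade": 9, "subject": "Geography", "language": "Italian"},
}


def correct_choice_text(example):
    """Return the text of the choice whose label equals answerKey, or None."""
    choices = example["question"]["choices"]
    for label, text in zip(choices["label"], choices["text"]):
        if label == example["answerKey"]:
            return text
    return None


answer = correct_choice_text(sample)
```

If the loader yields a list of per-choice dicts instead, the loop would iterate over that list directly.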
multilingual_with_para
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/multilingual_with_para')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 13510 |
'train' | 7961 |
'validation' | 2672 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
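Each choice in the schema above carries a `para` string next to its `text`. A rough sketch of building one model input per choice follows. It assumes, from the config name only, that `para` holds supporting paragraph text in the `_with_para` configs; the source does not state this. The separator, the `sample` record and the helper `choice_inputs` are all hypothetical.

```python
def choice_inputs(example, sep=" [SEP] "):
    """Return (label, joined_text) pairs, one per answer choice."""
    question = example["question"]
    choices = question["choices"]
    inputs = []
    for label, text, para in zip(choices["label"], choices["text"], choices["para"]):
        # Skip empty parts so a missing paragraph leaves no dangling separator.
        parts = [part for part in (para, question["stem"], text) if part]
        inputs.append((label, sep.join(parts)))
    return inputs


# Hypothetical record; choices are assumed to be a dict of parallel lists.
sample = {
    "question": {
        "stem": "What gas do plants absorb?",
        "choices": {
            "text": ["Oxygen", "Carbon dioxide"],
            "label": ["A", "B"],
            "para": ["", "Plants take in carbon dioxide during photosynthesis."],
        },
    },
    "answerKey": "B",
}

pairs = choice_inputs(sample)
```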
crosslingual_test
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_test')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 19736 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
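The `info` struct above (`grade`, `subject`, `language`) is what makes per-language or per-subject evaluation possible on a mixed test split. A small sketch of counting and filtering by those fields follows, assuming examples are plain dicts. The records and the helpers `count_by` and `select` are hypothetical, and the `language` values are placeholders because the source does not show how languages are encoded.

```python
from collections import Counter

# Hypothetical records carrying only the fields used here.
examples = [
    {"id": "a", "info": {"grade": 12, "subject": "Biology", "language": "lang-1"}},
    {"id": "b", "info": {"grade": 12, "subject": "History", "language": "lang-2"}},
    {"id": "c", "info": {"grade": 10, "subject": "Biology", "language": "lang-1"}},
]


def count_by(examples, field):
    """Count examples per value of one `info` field."""
    return Counter(example["info"][field] for example in examples)


def select(examples, **criteria):
    """Keep examples whose `info` matches every given field=value pair."""
    return [
        example
        for example in examples
        if all(example["info"].get(key) == value for key, value in criteria.items())
    ]


subject_counts = count_by(examples, "subject")
grade_12_lang_1 = select(examples, language="lang-1", grade=12)
```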
crosslingual_with_para_test
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_with_para_test')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 19736 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_bg
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_bg')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 2344 |
'validation' | 593 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_with_para_bg
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_with_para_bg')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 2344 |
'validation' | 593 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
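The per-language configs on this page follow one naming pattern: `crosslingual_<code>` and `crosslingual_with_para_<code>`. A minimal sketch of generating the names passed to `tfds.load` follows. The language codes are the ones that appear in the config names on this page; `config_name` is a hypothetical helper, and the sketch only builds strings without loading anything.

```python
# Language codes taken from the config names listed on this page.
LANGUAGE_CODES = ["bg", "hr", "hu", "it", "mk", "pl", "pt", "sq", "sr", "tr", "vi"]


def config_name(code, with_para=False):
    """Build the TFDS name for one per-language crosslingual config."""
    if code not in LANGUAGE_CODES:
        raise ValueError(f"unknown language code: {code!r}")
    prefix = "crosslingual_with_para" if with_para else "crosslingual"
    return f"huggingface:exams/{prefix}_{code}"


# Every per-language config, plain variant first.
all_names = [
    config_name(code, with_para)
    for code in LANGUAGE_CODES
    for with_para in (False, True)
]

# Each name can then be passed to the loader shown on this page, e.g.
# ds = tfds.load(config_name("bg"))
```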
crosslingual_hr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_hr')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 2341 |
'validation' | 538 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_with_para_hr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_with_para_hr')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 2341 |
'validation' | 538 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_hu
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_hu')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1731 |
'validation' | 536 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_with_para_hu
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_with_para_hu')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1731 |
'validation' | 536 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_it
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_it')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1010 |
'validation' | 246 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_with_para_it
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_with_para_it')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1010 |
'validation' | 246 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_mk
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_mk')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1665 |
'validation' | 410 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_with_para_mk
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_with_para_mk')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1665 |
'validation' | 410 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_pl
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_pl')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1577 |
'validation' | 394 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_with_para_pl
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_with_para_pl')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1577 |
'validation' | 394 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_pt
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_pt')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 740 |
'validation' | 184 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_with_para_pt
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_with_para_pt')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 740 |
'validation' | 184 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_sq
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_sq')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1194 |
'validation' | 311 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_with_para_sq
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_with_para_sq')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1194 |
'validation' | 311 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_sr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_sr')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1323 |
'validation' | 314 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_with_para_sr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_with_para_sr')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1323 |
'validation' | 314 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_tr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_tr')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1571 |
'validation' | 393 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_with_para_tr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_with_para_tr')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1571 |
'validation' | 393 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_vi
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_vi')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1955 |
'validation' | 488 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
crosslingual_with_para_vi
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:exams/crosslingual_with_para_vi')
- Description:
EXAMS is a benchmark dataset for multilingual and cross-lingual question answering from high school examinations.
It consists of more than 24,000 high-quality high school exam questions in 16 languages,
covering 8 language families and 24 school subjects from Natural Sciences and Social Sciences, among others.
- License: CC-BY-SA-4.0
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1955 |
'validation' | 488 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"stem": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"choices": {
"feature": {
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"para": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"answerKey": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"info": {
"grade": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"language": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
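The per-language split tables above can be collected into one mapping for quick sanity checks after loading. The sketch below copies the `train` and `validation` counts listed on this page for the `crosslingual_<code>` configs (the `_with_para` variants list the same counts) and sums them. `SPLIT_SIZES` and `total_examples` are hypothetical names introduced here.

```python
# Example counts copied from the split tables on this page.
SPLIT_SIZES = {
    "bg": {"train": 2344, "validation": 593},
    "hr": {"train": 2341, "validation": 538},
    "hu": {"train": 1731, "validation": 536},
    "it": {"train": 1010, "validation": 246},
    "mk": {"train": 1665, "validation": 410},
    "pl": {"train": 1577, "validation": 394},
    "pt": {"train": 740, "validation": 184},
    "sq": {"train": 1194, "validation": 311},
    "sr": {"train": 1323, "validation": 314},
    "tr": {"train": 1571, "validation": 393},
    "vi": {"train": 1955, "validation": 488},
}


def total_examples(split):
    """Sum one split's example count over all per-language configs."""
    return sum(sizes[split] for sizes in SPLIT_SIZES.values())


train_total = total_examples("train")
validation_total = total_examples("validation")

# Language with the largest training split in the tables above.
largest_train = max(SPLIT_SIZES, key=lambda code: SPLIT_SIZES[code]["train"])
```

Comparing these numbers against the length of a loaded split is one way to confirm that a config was downloaded completely.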