SearchQA
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/SearchQA')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 3163801 |
'validation' | 454836 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
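As a minimal sketch of how the load command above might be used in practice, the snippet below wraps it in a function and checks records against the feature schema listed here. The helper names (`matches_schema`, `peek`) are hypothetical, and the sketch assumes `tensorflow_datasets` and the Hugging Face `datasets` package are installed. The load itself only runs when `peek` is called.

```python
import numbers

# Expected Python types for the three features listed above.
SCHEMA = {
    "candidate_id": (str, bytes),        # TFDS may return strings as bytes
    "response_start": numbers.Integral,  # int32
    "response_end": numbers.Integral,    # int32
}

INT32_MIN, INT32_MAX = -2**31, 2**31 - 1


def matches_schema(record):
    """Return True if `record` has exactly the documented features and types."""
    if set(record) != set(SCHEMA):
        return False
    for name, expected in SCHEMA.items():
        value = record[name]
        # bool is a subclass of int, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, expected):
            return False
        if expected is numbers.Integral and not INT32_MIN <= value <= INT32_MAX:
            return False
    return True


def peek(config="SearchQA", split="validation", n=3):
    """Load one config and return its first `n` records as plain dicts."""
    import tensorflow_datasets as tfds  # imported lazily; needs network access

    ds = tfds.load(f"huggingface:multi_re_qa/{config}", split=split)
    return list(tfds.as_numpy(ds.take(n)))
```

Calling `all(matches_schema(r) for r in peek())` would then be one way to sanity-check a fresh download before building a pipeline on top of it.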
TriviaQA
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/TriviaQA')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1893674 |
'validation' | 238339 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
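The `response_start` and `response_end` features are integer offsets marking a sentence boundary. The sketch below shows one way such a pair could be applied to a context string. It assumes the offsets are character positions with an exclusive end, and that the caller has already looked up the matching context text in the original QA dataset using `candidate_id`. Neither assumption is stated on this page, and `extract_response` is a hypothetical helper.

```python
def extract_response(context, response_start, response_end):
    """Slice the annotated sentence out of `context`.

    Assumes character offsets with an exclusive end (an assumption,
    not something this page confirms).
    """
    if not 0 <= response_start <= response_end <= len(context):
        raise ValueError("offsets fall outside the context")
    return context[response_start:response_end]


# Toy context and record, invented for illustration only.
context = "Paris is the capital of France. It lies on the Seine."
record = {"candidate_id": "toy-0", "response_start": 0, "response_end": 31}
sentence = extract_response(
    context, record["response_start"], record["response_end"]
)
```

If the real offsets turn out to use an inclusive end or byte positions, only the slice expression would need to change.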
HotpotQA
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/HotpotQA')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 508879 |
'validation' | 52191 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
SQuAD
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/SQuAD')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 95659 |
'validation' | 10642 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
NaturalQuestions
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/NaturalQuestions')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 448355 |
'validation' | 22118 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
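The five configs above each ship a `train` and a `validation` split. As a quick worked example using the counts from their split tables, the sketch below computes each config's total size and the share held out for validation. The dictionary and function names are illustrative.

```python
# Example counts copied from the split tables above: (train, validation).
SPLIT_COUNTS = {
    "SearchQA": (3163801, 454836),
    "TriviaQA": (1893674, 238339),
    "HotpotQA": (508879, 52191),
    "SQuAD": (95659, 10642),
    "NaturalQuestions": (448355, 22118),
}


def validation_share(config):
    """Fraction of a config's examples that fall in the validation split."""
    train, validation = SPLIT_COUNTS[config]
    return validation / (train + validation)


for name, (train, validation) in SPLIT_COUNTS.items():
    total = train + validation
    print(f"{name}: {total} examples, {validation_share(name):.1%} validation")
```

A summary like this can help decide how much of a large config, such as SearchQA, to sample when mixing several configs in one training run.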
BioASQ
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/BioASQ')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 14158 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
RelationExtraction
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/RelationExtraction')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 3301 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
TextbookQA
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/TextbookQA')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 71147 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
DuoRC
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/DuoRC')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 5525 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
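To work with several configs at once, it can help to record which splits each one provides. The sketch below collects the split names from the tables on this page and builds the corresponding `tfds.load` arguments. `CONFIG_SPLITS` and `load_args` are hypothetical names, and nothing is downloaded until the arguments are actually passed to `tfds.load`.

```python
# Split names per config, as listed in the tables on this page.
CONFIG_SPLITS = {
    "SearchQA": ("train", "validation"),
    "TriviaQA": ("train", "validation"),
    "HotpotQA": ("train", "validation"),
    "SQuAD": ("train", "validation"),
    "NaturalQuestions": ("train", "validation"),
    "BioASQ": ("test",),
    "RelationExtraction": ("test",),
    "TextbookQA": ("test",),
    "DuoRC": ("test",),
}


def load_args(config, split):
    """Return (name, split) for tfds.load, rejecting splits the config lacks."""
    if split not in CONFIG_SPLITS[config]:
        raise ValueError(f"{config} has no '{split}' split")
    return f"huggingface:multi_re_qa/{config}", split


# Configs that only provide a test split.
test_only = [c for c, splits in CONFIG_SPLITS.items() if splits == ("test",)]

# Usage (requires tensorflow_datasets and network access):
#   import tensorflow_datasets as tfds
#   name, split = load_args("BioASQ", "test")
#   ds = tfds.load(name, split=split)
```

Validating the split name up front gives a clearer error than letting the loader fail after it has started preparing the dataset.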