References:
rc
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:trivia_qa/rc')
- Description:
TriviaQA is a reading comprehension dataset containing over 650K
question-answer-evidence triples. TriviaQA includes 95K question-answer
pairs authored by trivia enthusiasts and independently gathered evidence
documents, six per question on average, that provide high quality distant
supervision for answering the questions.
- License: No known license
- Version: 1.2.0
- Splits:
Split | Examples |
---|---|
'test' | 17210 |
'train' | 138384 |
'validation' | 17944 |
- Features:
{
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"entity_pages": {
"feature": {
"doc_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wiki_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"search_results": {
"feature": {
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rank": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"search_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"answer": {
"aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"normalized_aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
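The feature spec above nests `Value` leaves inside plain dictionaries and variable-length `Sequence` wrappers (`"length": -1`). As a rough illustration, the sketch below flattens a spec of this shape into dotted paths with their dtypes. The helper name `flatten_features`, the `[]` marker for sequences, and the abbreviated excerpt are assumptions made for this example, not part of TFDS or the Hugging Face `datasets` API.

```python
import json

# Abbreviated excerpt of the feature spec above (only a few fields kept).
FEATURES_JSON = """
{
  "question": {"dtype": "string", "id": null, "_type": "Value"},
  "search_results": {
    "feature": {
      "rank": {"dtype": "int32", "id": null, "_type": "Value"},
      "url": {"dtype": "string", "id": null, "_type": "Value"}
    },
    "length": -1, "id": null, "_type": "Sequence"
  },
  "answer": {
    "aliases": {
      "feature": {"dtype": "string", "id": null, "_type": "Value"},
      "length": -1, "id": null, "_type": "Sequence"
    },
    "value": {"dtype": "string", "id": null, "_type": "Value"}
  }
}
"""


def flatten_features(spec, prefix=""):
    """Return {dotted_path: dtype}; '[]' marks a variable-length Sequence."""
    kind = spec.get("_type")
    if kind == "Value":
        # Leaf: a scalar with a concrete dtype.
        return {prefix: spec["dtype"]}
    if kind == "Sequence":
        # Wrapper: descend into the element spec.
        return flatten_features(spec["feature"], prefix + "[]")
    # Plain dictionary of named sub-features.
    flat = {}
    for name, sub in spec.items():
        path = f"{prefix}.{name}" if prefix else name
        flat.update(flatten_features(sub, path))
    return flat


flat = flatten_features(json.loads(FEATURES_JSON))
for path, dtype in flat.items():
    print(path, dtype)
```

Running the same helper on the full spec should list every leaf, for example `entity_pages[].wiki_context` and `answer.normalized_aliases[]`, which can make it easier to see which fields are per-question and which repeat per evidence document.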
rc.nocontext
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:trivia_qa/rc.nocontext')
- Description:
TriviaQA is a reading comprehension dataset containing over 650K
question-answer-evidence triples. TriviaQA includes 95K question-answer
pairs authored by trivia enthusiasts and independently gathered evidence
documents, six per question on average, that provide high quality distant
supervision for answering the questions.
- License: No known license
- Version: 1.2.0
- Splits:
Split | Examples |
---|---|
'test' | 17210 |
'train' | 138384 |
'validation' | 17944 |
- Features:
{
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"entity_pages": {
"feature": {
"doc_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wiki_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"search_results": {
"feature": {
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rank": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"search_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"answer": {
"aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"normalized_aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
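To put the split table for this config in perspective, the following sketch computes each split's share of the total. The counts are copied from the table above; the helper name `split_fractions` is hypothetical and used only for illustration.

```python
# Example counts copied from the split table for this config.
SPLITS = {"test": 17210, "train": 138384, "validation": 17944}


def split_fractions(counts):
    """Return each split's share of the total number of examples."""
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()}


fractions = split_fractions(SPLITS)
for name, frac in fractions.items():
    print(f"{name}: {frac:.1%}")
```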
unfiltered
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:trivia_qa/unfiltered')
- Description:
TriviaQA is a reading comprehension dataset containing over 650K
question-answer-evidence triples. TriviaQA includes 95K question-answer
pairs authored by trivia enthusiasts and independently gathered evidence
documents, six per question on average, that provide high quality distant
supervision for answering the questions.
- License: No known license
- Version: 1.2.0
- Splits:
Split | Examples |
---|---|
'test' | 10832 |
'train' | 87622 |
'validation' | 11313 |
- Features:
{
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"entity_pages": {
"feature": {
"doc_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wiki_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"search_results": {
"feature": {
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rank": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"search_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"answer": {
"aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"normalized_aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
unfiltered.nocontext
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:trivia_qa/unfiltered.nocontext')
- Description:
TriviaQA is a reading comprehension dataset containing over 650K
question-answer-evidence triples. TriviaQA includes 95K question-answer
pairs authored by trivia enthusiasts and independently gathered evidence
documents, six per question on average, that provide high quality distant
supervision for answering the questions.
- License: No known license
- Version: 1.2.0
- Splits:
Split | Examples |
---|---|
'test' | 10832 |
'train' | 87622 |
'validation' | 11313 |
- Features:
{
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"entity_pages": {
"feature": {
"doc_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wiki_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"search_results": {
"feature": {
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rank": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"search_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"answer": {
"aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"normalized_aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
rc.web
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:trivia_qa/rc.web')
- Description:
TriviaQA is a reading comprehension dataset containing over 650K
question-answer-evidence triples. TriviaQA includes 95K question-answer
pairs authored by trivia enthusiasts and independently gathered evidence
documents, six per question on average, that provide high quality distant
supervision for answering the questions.
- License: No known license
- Version: 1.2.0
- Splits:
Split | Examples |
---|---|
'test' | 9509 |
'train' | 76496 |
'validation' | 9951 |
- Features:
{
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"entity_pages": {
"feature": {
"doc_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wiki_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"search_results": {
"feature": {
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rank": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"search_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"answer": {
"aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"normalized_aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
rc.web.nocontext
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:trivia_qa/rc.web.nocontext')
- Description:
TriviaQA is a reading comprehension dataset containing over 650K
question-answer-evidence triples. TriviaQA includes 95K question-answer
pairs authored by trivia enthusiasts and independently gathered evidence
documents, six per question on average, that provide high quality distant
supervision for answering the questions.
- License: No known license
- Version: 1.2.0
- Splits:
Split | Examples |
---|---|
'test' | 9509 |
'train' | 76496 |
'validation' | 9951 |
- Features:
{
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"entity_pages": {
"feature": {
"doc_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wiki_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"search_results": {
"feature": {
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rank": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"search_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"answer": {
"aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"normalized_aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
unfiltered.web
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:trivia_qa/unfiltered.web')
- Description:
TriviaQA is a reading comprehension dataset containing over 650K
question-answer-evidence triples. TriviaQA includes 95K question-answer
pairs authored by trivia enthusiasts and independently gathered evidence
documents, six per question on average, that provide high quality distant
supervision for answering the questions.
- License: No known license
- Version: 1.2.0
- Splits:
Split | Examples |
---|---|
'test' | 0 |
'train' | 0 |
'validation' | 0 |
- Features:
{
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"entity_pages": {
"feature": {
"doc_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wiki_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"search_results": {
"feature": {
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rank": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"search_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"answer": {
"aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"normalized_aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
unfiltered.web.nocontext
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:trivia_qa/unfiltered.web.nocontext')
- Description:
TriviaQA is a reading comprehension dataset containing over 650K
question-answer-evidence triples. TriviaQA includes 95K question-answer
pairs authored by trivia enthusiasts and independently gathered evidence
documents, six per question on average, that provide high quality distant
supervision for answering the questions.
- License: No known license
- Version: 1.2.0
- Splits:
Split | Examples |
---|---|
'test' | 0 |
'train' | 0 |
'validation' | 0 |
- Features:
{
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"entity_pages": {
"feature": {
"doc_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wiki_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"search_results": {
"feature": {
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rank": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"search_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"answer": {
"aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"normalized_aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
rc.wikipedia
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:trivia_qa/rc.wikipedia')
- Description:
TriviaQA is a reading comprehension dataset containing over 650K
question-answer-evidence triples. TriviaQA includes 95K question-answer
pairs authored by trivia enthusiasts and independently gathered evidence
documents, six per question on average, that provide high quality distant
supervision for answering the questions.
- License: No known license
- Version: 1.2.0
- Splits:
Split | Examples |
---|---|
'test' | 7701 |
'train' | 61888 |
'validation' | 7993 |
- Features:
{
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"entity_pages": {
"feature": {
"doc_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wiki_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"search_results": {
"feature": {
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rank": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"search_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"answer": {
"aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"normalized_aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
rc.wikipedia.nocontext
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:trivia_qa/rc.wikipedia.nocontext')
- Description:
TriviaQA is a reading comprehension dataset containing over 650K
question-answer-evidence triples. TriviaQA includes 95K question-answer
pairs authored by trivia enthusiasts and independently gathered evidence
documents, six per question on average, that provide high quality distant
supervision for answering the questions.
- License: No known license
- Version: 1.2.0
- Splits:
Split | Examples |
---|---|
'test' | 7701 |
'train' | 61888 |
'validation' | 7993 |
- Features:
{
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"entity_pages": {
"feature": {
"doc_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wiki_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"search_results": {
"feature": {
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rank": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"search_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"answer": {
"aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"normalized_aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
unfiltered.wikipedia
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:trivia_qa/unfiltered.wikipedia')
- Description:
TriviaQA is a reading comprehension dataset containing over 650K
question-answer-evidence triples. TriviaQA includes 95K question-answer
pairs authored by trivia enthusiasts and independently gathered evidence
documents, six per question on average, that provide high quality distant
supervision for answering the questions.
- License: No known license
- Version: 1.2.0
- Splits:
Split | Examples |
---|---|
'test' | 0 |
'train' | 0 |
'validation' | 0 |
- Features:
{
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"entity_pages": {
"feature": {
"doc_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wiki_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"search_results": {
"feature": {
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rank": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"search_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"answer": {
"aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"normalized_aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
unfiltered.wikipedia.nocontext
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:trivia_qa/unfiltered.wikipedia.nocontext')
- Description:
TriviaQA is a reading comprehension dataset containing over 650K
question-answer-evidence triples. TriviaQA includes 95K question-answer
pairs authored by trivia enthusiasts and independently gathered evidence
documents, six per question on average, that provide high quality distant
supervision for answering the questions.
- License: No known license
- Version: 1.2.0
- Splits:
Split | Examples |
---|---|
'test' | 0 |
'train' | 0 |
'validation' | 0 |
- Features:
{
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"entity_pages": {
"feature": {
"doc_source": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wiki_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"search_results": {
"feature": {
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"filename": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rank": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"search_context": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"answer": {
"aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"normalized_aliases": {
"feature": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_matched_wiki_entity_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"normalized_value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}
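Several configs on this page list 0 examples in every split. The sketch below gathers the train counts shown in each split table and builds the `tfds.load` names only for configs with a non-zero count. The dictionary `TRAIN_EXAMPLES` and the helper `tfds_name` are hypothetical names for this illustration, and the page does not state whether the zero counts reflect the underlying data.

```python
# Train split sizes as listed on this page, keyed by config name.
TRAIN_EXAMPLES = {
    "rc": 138384,
    "rc.nocontext": 138384,
    "unfiltered": 87622,
    "unfiltered.nocontext": 87622,
    "rc.web": 76496,
    "rc.web.nocontext": 76496,
    "unfiltered.web": 0,
    "unfiltered.web.nocontext": 0,
    "rc.wikipedia": 61888,
    "rc.wikipedia.nocontext": 61888,
    "unfiltered.wikipedia": 0,
    "unfiltered.wikipedia.nocontext": 0,
}


def tfds_name(config):
    """Build the dataset name string used in the load commands on this page."""
    return f"huggingface:trivia_qa/{config}"


# Keep only configs whose listed train split is non-empty.
non_empty = [tfds_name(c) for c, n in TRAIN_EXAMPLES.items() if n > 0]
for name in non_empty:
    print(name)
```

Each printed name can be passed to `tfds.load` in the same way as the commands shown in the sections above.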