SearchQA
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/SearchQA')
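The load commands on this page differ only in the config name after the slash. A minimal sketch of building those names programmatically is shown here. `CONFIGS`, `tfds_name`, and `load_config` are hypothetical helpers introduced for illustration, and `load_config` assumes `tensorflow_datasets` is installed with support for the `huggingface:` namespace.

```python
# Config names as they appear in the load commands on this page.
CONFIGS = (
    "SearchQA", "TriviaQA", "HotpotQA", "SQuAD", "NaturalQuestions",
    "BioASQ", "RelationExtraction", "TextbookQA", "DuoRC",
)


def tfds_name(config: str) -> str:
    """Build the TFDS name string for one multi_re_qa config."""
    if config not in CONFIGS:
        raise ValueError(f"unknown multi_re_qa config: {config!r}")
    return f"huggingface:multi_re_qa/{config}"


def load_config(config: str, split: str):
    """Load one split of one config (hypothetical convenience wrapper)."""
    # Imported lazily so that tfds_name() works without TFDS installed.
    import tensorflow_datasets as tfds

    return tfds.load(tfds_name(config), split=split)
```

For example, `load_config("SQuAD", "validation")` would request the `'validation'` split listed in the SQuAD table.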
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 3163801 |
'validation' | 454836 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
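Every config on this page shares the same three-field schema. As a rough sketch, a record can be modelled as a typed dictionary and the two offsets used to slice a response out of a candidate text. This rests on assumptions the schema does not confirm: that `response_start` and `response_end` are character offsets, that the end offset is exclusive, and that the candidate text is looked up elsewhere by `candidate_id`. `MultiReQARecord` and `response_span` are hypothetical names.

```python
from typing import TypedDict


class MultiReQARecord(TypedDict):
    """Mirrors the feature schema: one string and two int32 fields."""
    candidate_id: str
    response_start: int
    response_end: int


def response_span(record: MultiReQARecord, candidate_text: str) -> str:
    """Slice the response out of a candidate text.

    Assumption: offsets are character indices with an exclusive end.
    """
    start, end = record["response_start"], record["response_end"]
    if not 0 <= start <= end <= len(candidate_text):
        raise ValueError(f"span ({start}, {end}) outside candidate text")
    return candidate_text[start:end]


# Illustrative record and text, not taken from the dataset.
example: MultiReQARecord = {
    "candidate_id": "doc-0",
    "response_start": 4,
    "response_end": 9,
}
span = response_span(example, "The quick fox")
```

The bounds check is there because a mismatched candidate text would otherwise yield a silently truncated slice.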
TriviaQA
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/TriviaQA')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1893674 |
'validation' | 238339 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
HotpotQA
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/HotpotQA')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 508879 |
'validation' | 52191 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
SQuAD
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/SQuAD')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 95659 |
'validation' | 10642 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
NaturalQuestions
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/NaturalQuestions')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 448355 |
'validation' | 22118 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
BioASQ
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/BioASQ')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 14158 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
RelationExtraction
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/RelationExtraction')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 3301 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
TextbookQA
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/TextbookQA')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 71147 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
DuoRC
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:multi_re_qa/DuoRC')
- Description:
MultiReQA contains sentence boundary annotations from eight publicly available QA datasets: SearchQA, TriviaQA, HotpotQA, NaturalQuestions, SQuAD, BioASQ, RelationExtraction, and TextbookQA. Five of these datasets (SearchQA, TriviaQA, HotpotQA, NaturalQuestions, and SQuAD) contain both training and test data; the other three (BioASQ, RelationExtraction, and TextbookQA) contain only test data.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 5525 |
- Features:
{
"candidate_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"response_start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"response_end": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
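The split tables on this page can be gathered into a single mapping, which makes it easy to see which configs offer a given split before loading anything. The sketch below copies the counts from the tables; `SPLIT_SIZES`, `configs_with_split`, and `total_examples` are hypothetical names introduced here.

```python
# Example counts per config and split, copied from the tables on this page.
SPLIT_SIZES = {
    "SearchQA": {"train": 3163801, "validation": 454836},
    "TriviaQA": {"train": 1893674, "validation": 238339},
    "HotpotQA": {"train": 508879, "validation": 52191},
    "SQuAD": {"train": 95659, "validation": 10642},
    "NaturalQuestions": {"train": 448355, "validation": 22118},
    "BioASQ": {"test": 14158},
    "RelationExtraction": {"test": 3301},
    "TextbookQA": {"test": 71147},
    "DuoRC": {"test": 5525},
}


def configs_with_split(split: str) -> list[str]:
    """Return the configs that provide the given split, in page order."""
    return [name for name, splits in SPLIT_SIZES.items() if split in splits]


def total_examples(split: str) -> int:
    """Sum the example counts for one split across all configs."""
    return sum(splits.get(split, 0) for splits in SPLIT_SIZES.values())


if __name__ == "__main__":
    for split in ("train", "validation", "test"):
        names = ", ".join(configs_with_split(split))
        print(f"{split}: {total_examples(split)} examples ({names})")
```

Note that the configs with training data expose `'train'` and `'validation'` splits, while the remaining configs expose only `'test'`, so a loader should choose the split name per config rather than assume one name works everywhere.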