参考文献:
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:lc_quad')
- 説明:
LC-QuAD 2.0 is a Large Question Answering dataset with 30,000 pairs of question and its corresponding SPARQL query. The target knowledge base is Wikidata and DBpedia, specifically the 2018 version. Please see our paper for details about the dataset creation process and framework.
- ライセンス: 既知のライセンスはありません
- バージョン: 2.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 4781 |
'train' | 19293 |
- 特徴:
{
"NNQT_question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"uid": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"subgraph": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"template_index": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sparql_wikidata": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"sparql_dbpedia18": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"template": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"paraphrased_question": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}