তথ্যসূত্র:
2019-08-01
TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:
ds = tfds.load('huggingface:kilt_wikipedia/2019-08-01')
- বর্ণনা :
KILT-Wikipedia: Wikipedia pre-processed for KILT.
- লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
- সংস্করণ : 1.0.0
- বিভাজন :
বিভক্ত | উদাহরণ |
---|---|
'full' | 5903530 |
- বৈশিষ্ট্য :
{
"kilt_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wikipedia_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wikipedia_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"feature": {
"paragraph": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"anchors": {
"feature": {
"paragraph_id": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"start": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"end": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"href": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wikipedia_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wikipedia_id": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
},
"categories": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wikidata_info": {
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"enwikiquote_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wikidata_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wikidata_label": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"wikipedia_title": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"aliases": {
"feature": {
"alias": {
"dtype": "string",
"id": null,
"_type": "Value"
}
},
"length": -1,
"id": null,
"_type": "Sequence"
}
},
"history": {
"pageid": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"parentid": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"revid": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"pre_dump": {
"dtype": "bool",
"id": null,
"_type": "Value"
},
"timestamp": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"url": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
}