annotated
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:simple_questions_v2/annotated')
- Description:
SimpleQuestions is a dataset for simple QA, which consists
of a total of 108,442 questions written in natural language by human
English-speaking annotators each paired with a corresponding fact,
formatted as (subject, relationship, object), that provides the answer
but also a complete explanation. Facts have been extracted from the
Knowledge Base Freebase (freebase.com). We randomly shuffle these
questions and use 70% of them (75910) as training set, 10% as
validation set (10845), and the remaining 20% as test set.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 21687 |
'train' | 75910 |
'validation' | 10845 |
- Features:
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "subject_entity": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "relationship": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "object_entity": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}
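
The description for this config says the questions are shuffled and cut into a 70% training set (75910), a 10% validation set (10845), and a remaining test set. The sketch below checks that arithmetic against the stated total of 108,442. It is only an illustration: `split_indices` and the `seed` value are assumptions, and the original authors' shuffling procedure is not documented here.

```python
import random

TOTAL = 108_442      # total number of questions, from the description
TRAIN = 75_910       # training set size, from the description
VALIDATION = 10_845  # validation set size, from the description


def split_indices(total, n_train, n_valid, seed=0):
    """Shuffle the indices 0..total-1 and cut them into three disjoint parts.

    The seed is an assumption made for reproducibility of this sketch.
    """
    indices = list(range(total))
    random.Random(seed).shuffle(indices)
    train = indices[:n_train]
    valid = indices[n_train:n_train + n_valid]
    test = indices[n_train + n_valid:]  # everything that remains
    return train, valid, test


train, valid, test = split_indices(TOTAL, TRAIN, VALIDATION)
print(len(train), len(valid), len(test))  # 75910 10845 21687
```

The remainder, 21687 examples, is the size implied for the test set.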
freebase2m
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:simple_questions_v2/freebase2m')
- Description:
SimpleQuestions is a dataset for simple QA, which consists
of a total of 108,442 questions written in natural language by human
English-speaking annotators each paired with a corresponding fact,
formatted as (subject, relationship, object), that provides the answer
but also a complete explanation. Facts have been extracted from the
Knowledge Base Freebase (freebase.com). We randomly shuffle these
questions and use 70% of them (75910) as training set, 10% as
validation set (10845), and the remaining 20% as test set.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 10843106 |
- Features:
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "subject_entity": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "relationship": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "object_entities": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}
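
In this config `object_entities` is a variable-length sequence of strings (`"length": -1`), so one record can carry several objects for the same subject and relationship. A minimal sketch of expanding such a record into individual (subject, relationship, object) triples follows. `FreebaseRecord`, `to_triples`, and the placeholder values are hypothetical names introduced for illustration and do not come from the dataset.

```python
from typing import Iterator, List, Tuple, TypedDict


class FreebaseRecord(TypedDict):
    """Mirrors the feature schema listed above (all values are strings)."""
    id: str
    subject_entity: str
    relationship: str
    object_entities: List[str]


def to_triples(record: FreebaseRecord) -> Iterator[Tuple[str, str, str]]:
    """Yield one (subject, relationship, object) triple per object entity."""
    for obj in record["object_entities"]:
        yield (record["subject_entity"], record["relationship"], obj)


# Placeholder record; real rows use Freebase identifiers instead.
example: FreebaseRecord = {
    "id": "0",
    "subject_entity": "subject_a",
    "relationship": "relation_x",
    "object_entities": ["object_1", "object_2"],
}

triples = list(to_triples(example))
print(triples)
```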
freebase5m
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:simple_questions_v2/freebase5m')
- Description:
SimpleQuestions is a dataset for simple QA, which consists
of a total of 108,442 questions written in natural language by human
English-speaking annotators each paired with a corresponding fact,
formatted as (subject, relationship, object), that provides the answer
but also a complete explanation. Facts have been extracted from the
Knowledge Base Freebase (freebase.com). We randomly shuffle these
questions and use 70% of them (75910) as training set, 10% as
validation set (10845), and the remaining 20% as test set.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 12010500 |
- Features:
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "subject_entity": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "relationship": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "object_entities": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}
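
The Freebase configs share the `subject_entity` and `relationship` fields with the `annotated` config, so one plausible use is to index the facts by that pair and look up the candidate objects for a question. The sketch below assumes rows are plain dictionaries with the fields listed in the feature schemas. `build_fact_index`, `candidate_objects`, and all row values are hypothetical, and whether every annotated fact appears in a given Freebase subset is not confirmed here.

```python
from collections import defaultdict
from typing import Dict, Iterable, Mapping, Set, Tuple

FactKey = Tuple[str, str]  # (subject_entity, relationship)


def build_fact_index(records: Iterable[Mapping]) -> Dict[FactKey, Set[str]]:
    """Group object entities by (subject_entity, relationship)."""
    index: Dict[FactKey, Set[str]] = defaultdict(set)
    for record in records:
        key = (record["subject_entity"], record["relationship"])
        index[key].update(record["object_entities"])
    return index


def candidate_objects(index: Mapping[FactKey, Set[str]], question_row: Mapping) -> Set[str]:
    """Return the objects stored for the question's subject and relationship."""
    key = (question_row["subject_entity"], question_row["relationship"])
    return index.get(key, set())


# Placeholder rows; real data uses Freebase identifiers and real questions.
kb_rows = [
    {"id": "0", "subject_entity": "subject_a", "relationship": "relation_x",
     "object_entities": ["object_1", "object_2"]},
    {"id": "1", "subject_entity": "subject_a", "relationship": "relation_x",
     "object_entities": ["object_3"]},
    {"id": "2", "subject_entity": "subject_b", "relationship": "relation_y",
     "object_entities": ["object_4"]},
]
question_row = {
    "id": "0",
    "subject_entity": "subject_b",
    "relationship": "relation_y",
    "object_entity": "object_4",
    "question": "hypothetical question text",
}

index = build_fact_index(kb_rows)
found = candidate_objects(index, question_row)
print(question_row["object_entity"] in found)  # True
```

With roughly 12 million rows in the `train` split, an in-memory dictionary like this may be too large for some machines. The sketch is meant to show the join key, not a storage strategy.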