হিন্দি_কথা

তথ্যসূত্র:

TFDS এ এই ডেটাসেট লোড করতে নিম্নলিখিত কমান্ডটি ব্যবহার করুন:

ds = tfds.load('huggingface:hindi_discourse')
  • বর্ণনা :
The Hindi Discourse Analysis dataset is a corpus for analyzing discourse modes present in its sentences. 
It contains sentences from stories written by 11 famous authors from the 20th Century.
4-5 stories by each author have been selected which were available in the public domain resulting
in a collection of 53 stories. Most of these short stories were originally written in Hindi
but some of them were written
in other Indian languages and later translated to Hindi.
  • লাইসেন্স : কোনো পরিচিত লাইসেন্স নেই
  • সংস্করণ : 1.0.0
  • বিভাজন :
বিভক্ত উদাহরণ
'train' 9968
  • বৈশিষ্ট্য :
{
   
"Story_no": {
       
"dtype": "int32",
       
"id": null,
       
"_type": "Value"
   
},
   
"Sentence": {
       
"dtype": "string",
       
"id": null,
       
"_type": "Value"
   
},
   
"Discourse Mode": {
       
"num_classes": 6,
       
"names": [
           
"Argumentative",
           
"Descriptive",
           
"Dialogue",
           
"Informative",
           
"Narrative",
           
"Other"
       
],
       
"names_file": null,
       
"id": null,
       
"_type": "ClassLabel"
   
}
}