مراجع:
برای بارگذاری این مجموعه داده در TFDS از دستور زیر استفاده کنید:
ds = tfds.load('huggingface:medmcqa')
- توضیحات :
MedMCQA is a large-scale, Multiple-Choice Question Answering (MCQA) dataset designed to address real-world medical entrance exam questions.
MedMCQA has more than 194k high-quality AIIMS & NEET PG entrance exam MCQs covering 2.4k healthcare topics and 21 medical subjects are collected with an average token length of 12.77 and high topical diversity.
The dataset contains questions about the following topics: Anesthesia, Anatomy, Biochemistry, Dental, ENT, Forensic Medicine (FM)
Obstetrics and Gynecology (O&G), Medicine, Microbiology, Ophthalmology, Orthopedics Pathology, Pediatrics, Pharmacology, Physiology,
Psychiatry, Radiology Skin, Preventive & Social Medicine (PSM) and Surgery
- مجوز : Apache License 2.0
- نسخه : 1.1.0
- تقسیم ها :
تقسیم کنید | نمونه ها |
---|---|
'test' | 6150 |
'train' | 182822 |
'validation' | 4183 |
- ویژگی ها :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"opa": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"opb": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"opc": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"opd": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"cop": {
"num_classes": 4,
"names": [
"a",
"b",
"c",
"d"
],
"id": null,
"_type": "ClassLabel"
},
"choice_type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"exp": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"subject_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"topic_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}