Tài liệu tham khảo:
Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:
ds = tfds.load('huggingface:medmcqa')
- Sự miêu tả :
MedMCQA is a large-scale, Multiple-Choice Question Answering (MCQA) dataset designed to address real-world medical entrance exam questions.
MedMCQA has more than 194k high-quality AIIMS & NEET PG entrance exam MCQs covering 2.4k healthcare topics and 21 medical subjects are collected with an average token length of 12.77 and high topical diversity.
The dataset contains questions about the following topics: Anesthesia, Anatomy, Biochemistry, Dental, ENT, Forensic Medicine (FM)
Obstetrics and Gynecology (O&G), Medicine, Microbiology, Ophthalmology, Orthopedics Pathology, Pediatrics, Pharmacology, Physiology,
Psychiatry, Radiology Skin, Preventive & Social Medicine (PSM) and Surgery
- Giấy phép : Giấy phép Apache 2.0
- Phiên bản : 1.1.0
- Chia tách :
Tách ra | Ví dụ |
---|---|
'test' | 6150 |
'train' | 182822 |
'validation' | 4183 |
- Đặc trưng :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"question": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"opa": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"opb": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"opc": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"opd": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"cop": {
"num_classes": 4,
"names": [
"a",
"b",
"c",
"d"
],
"id": null,
"_type": "ClassLabel"
},
"choice_type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"exp": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"subject_name": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"topic_name": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}