- Description:
A dataset of 8.5K high-quality, linguistically diverse grade school math word problems.
Additional Documentation: Explore on Papers With Code
Source code:
tfds.text.gsm8k.Gsm8k
Versions:
1.0.0 (default): Initial release.
Download size:
10.77 MiB
Dataset size:
17.84 MiB
Auto-cached (documentation): Yes
Splits:
Split | Examples |
---|---|
'test' | 1,319 |
'test_socratic' | 1,319 |
'train' | 7,473 |
'train_socratic' | 7,473 |
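As a minimal sketch of how one of these splits might be loaded, the snippet below wraps `tfds.load` in a small helper. The dataset name `"gsm8k"` is inferred from the source path `tfds.text.gsm8k.Gsm8k`, and the helper name `load_split` and the `SPLIT_SIZES` table are illustrative assumptions, not part of the TFDS API.

```python
# Split sizes copied from the Splits table (used only for validation here).
SPLIT_SIZES = {
    "test": 1319,
    "test_socratic": 1319,
    "train": 7473,
    "train_socratic": 7473,
}


def load_split(name):
    """Load one GSM8K split as a tf.data.Dataset (hypothetical helper).

    Assumes the dataset is registered under the name "gsm8k" and that
    the tensorflow-datasets package is installed. The first call
    downloads and prepares the data.
    """
    if name not in SPLIT_SIZES:
        raise ValueError(
            f"Unknown split {name!r}; expected one of {sorted(SPLIT_SIZES)}"
        )
    # Imported lazily so the split check works without TensorFlow installed.
    import tensorflow_datasets as tfds

    return tfds.load("gsm8k", split=name)
```

For example, `load_split("train").take(1)` should yield a single feature dictionary, while `load_split("validation")` raises `ValueError` because no such split is listed.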
- Feature structure:
FeaturesDict({
'annotation': Text(shape=(), dtype=string),
'answer': Text(shape=(), dtype=string),
'question': Text(shape=(), dtype=string),
'short_answer': Text(shape=(), dtype=string),
})
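Because every feature is a scalar `Text` field, each value typically arrives as UTF-8 bytes (or a string tensor) rather than a Python `str`. The following sketch shows one way to decode an example; `decode_example` is a hypothetical helper, and the record in the usage note is made up for illustration rather than taken from the dataset.

```python
# Feature names taken from the FeaturesDict above.
FEATURE_KEYS = ("annotation", "answer", "question", "short_answer")


def decode_example(example):
    """Convert one example's Text features to Python strings.

    Accepts a dict whose values are bytes, str, or objects exposing a
    .numpy() method (such as eager tf.Tensor values).
    """
    decoded = {}
    for key in FEATURE_KEYS:
        value = example[key]
        if hasattr(value, "numpy"):
            # Eager tensors expose the underlying bytes via .numpy().
            value = value.numpy()
        if isinstance(value, bytes):
            value = value.decode("utf-8")
        decoded[key] = value
    return decoded
```

With an illustrative record such as `{"annotation": b"", "answer": b"2 + 2 = 4", "question": b"What is 2 + 2?", "short_answer": b"4"}`, the helper returns the same four keys with `str` values.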
- Feature documentation:
| Feature | Class | Shape | Dtype | Description |
|---|---|---|---|---|
| | FeaturesDict | | | |
| annotation | Text | | string | |
| answer | Text | | string | |
| question | Text | | string | |
| short_answer | Text | | string | |
Supervised keys (See as_supervised doc): None
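Since no supervised keys are defined, `(input, target)` pairs have to be built by hand from the feature dictionary. The sketch below shows one possible mapping; the function name `to_pair` and the choice of `question` as input and `short_answer` as target are assumptions for illustration, not something the dataset prescribes.

```python
def to_pair(example, input_key="question", target_key="short_answer"):
    """Build an (input, target) tuple from one feature dictionary.

    The default keys are an illustrative choice; pass target_key="answer"
    to use the 'answer' field as the target instead.
    """
    return example[input_key], example[target_key]
```

On a `tf.data.Dataset` of feature dictionaries, the same function can be applied with `ds.map(to_pair)`, since it only indexes into the dictionary.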
Figure (tfds.show_examples): Not supported.
- Citation:
@misc{cobbe2021training,
title={Training Verifiers to Solve Math Word Problems},
author={Karl Cobbe and Vineet Kosaraju and Mohammad Bavarian and Jacob Hilton and Reiichiro Nakano and Christopher Hesse and John Schulman},
year={2021},
eprint={2110.14168},
archivePrefix={arXiv},
primaryClass={cs.LG}
}