- Description:
The batting averages of 18 Major League Baseball players through their first 45 at-bats of the 1970 season, along with their batting averages for the remainder of the season.
The data has been modified from the table in the paper, in the form used for case studies with Stan and PyMC3, by adding columns that explicitly list the number of at-bats early in the season, as well as the at-bats and hits for the full season.
Homepage: https://www.tensorflow.org/datasets/catalog/efron_morris75
Source code: tfds.datasets.efron_morris75.Builder
Versions:
1.0.0 (default): Initial release.
Download size: 1008 bytes
Dataset size: 4.29 KiB
Auto-cached (documentation): Yes
Splits:
Split | Examples |
---|---|
'train' | 18 |
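A minimal sketch of loading the single 'train' split, assuming `tensorflow_datasets` is installed and the dataset can be downloaded. The helper names `load_train_split` and `count_examples` are hypothetical and only serve this illustration; the import is done lazily so the counting helper can be used without TensorFlow.

```python
def load_train_split():
    """Load the 'train' split as an iterable of dicts of NumPy values.

    Assumption: tensorflow_datasets is installed and the dataset
    can be downloaded or is already cached locally.
    """
    import tensorflow_datasets as tfds  # lazy import, only needed here

    ds = tfds.load("efron_morris75", split="train")
    return tfds.as_numpy(ds)


def count_examples(examples):
    """Count the items of any iterable of examples."""
    return sum(1 for _ in examples)
```

If the download matches the split table above, `count_examples(load_train_split())` should return 18.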
- Feature structure:
FeaturesDict({
'At-Bats': int32,
'BattingAverage': float32,
'FirstName': string,
'Hits': int32,
'LastName': string,
'RemainingAt-Bats': int32,
'RemainingAverage': float32,
'SeasonAt-Bats': int32,
'SeasonAverage': float32,
'SeasonHits': int32,
})
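The feature names above suggest some relationships between columns. The sketch below decodes one example into plain Python values and checks those relationships. It rests on assumptions that the source does not state outright: that `BattingAverage` is `Hits / At-Bats`, that `SeasonAverage` is `SeasonHits / SeasonAt-Bats`, and that `SeasonAt-Bats` is the sum of `At-Bats` and `RemainingAt-Bats`. The helper names and the sample record are hypothetical, not taken from the dataset.

```python
import math


def decode_record(example):
    """Convert bytes to str and NumPy scalars to plain Python values."""
    out = {}
    for key, value in example.items():
        if isinstance(value, bytes):
            out[key] = value.decode("utf-8")
        elif hasattr(value, "item"):
            out[key] = value.item()  # NumPy scalar -> Python scalar
        else:
            out[key] = value
    return out


def is_consistent(rec, tol=1e-3):
    """Check the assumed relationships between the columns of one record."""
    # Assumption: early-season average is hits over at-bats (up to rounding).
    early_ok = math.isclose(
        rec["BattingAverage"], rec["Hits"] / rec["At-Bats"], abs_tol=tol
    )
    # Assumption: full-season at-bats = early at-bats + remaining at-bats.
    total_ok = rec["SeasonAt-Bats"] == rec["At-Bats"] + rec["RemainingAt-Bats"]
    # Assumption: full-season average is season hits over season at-bats.
    season_ok = math.isclose(
        rec["SeasonAverage"], rec["SeasonHits"] / rec["SeasonAt-Bats"], abs_tol=tol
    )
    return early_ok and total_ok and season_ok


# Hypothetical record for illustration only; NOT a row from the dataset.
sample = {
    "FirstName": b"Example",
    "LastName": b"Player",
    "At-Bats": 45,
    "Hits": 15,
    "BattingAverage": 0.333,
    "RemainingAt-Bats": 355,
    "RemainingAverage": 0.296,
    "SeasonAt-Bats": 400,
    "SeasonHits": 120,
    "SeasonAverage": 0.300,
}
record = decode_record(sample)
```

Running `is_consistent` over every decoded example is one way to test whether these assumptions hold for the actual data before relying on them.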
- Feature documentation:
Feature | Class | Shape | Dtype | Description |
---|---|---|---|---|
FeaturesDict | | | | |
At-Bats | Tensor | | int32 | |
BattingAverage | Tensor | | float32 | |
FirstName | Tensor | | string | |
Hits | Tensor | | int32 | |
LastName | Tensor | | string | |
RemainingAt-Bats | Tensor | | int32 | |
RemainingAverage | Tensor | | float32 | |
SeasonAt-Bats | Tensor | | int32 | |
SeasonAverage | Tensor | | float32 | |
SeasonHits | Tensor | | int32 | |
Supervised keys (See as_supervised doc): None
Figure (tfds.show_examples): Not supported.
Examples (tfds.as_dataframe):
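A preview of this kind can be sketched locally as follows, assuming `tensorflow_datasets` and `pandas` are installed and the dataset can be downloaded. The names `preview_dataframe`, `missing_columns`, and `EXPECTED_COLUMNS` are hypothetical helpers for this illustration; the column list is copied from the feature structure above.

```python
# Column names copied from the feature structure listed above.
EXPECTED_COLUMNS = [
    "At-Bats",
    "BattingAverage",
    "FirstName",
    "Hits",
    "LastName",
    "RemainingAt-Bats",
    "RemainingAverage",
    "SeasonAt-Bats",
    "SeasonAverage",
    "SeasonHits",
]


def preview_dataframe(n=4):
    """Return the first n examples as a pandas-style DataFrame.

    Assumption: tensorflow_datasets is installed and the dataset
    can be downloaded or is already cached locally.
    """
    import tensorflow_datasets as tfds  # lazy import, only needed here

    ds, info = tfds.load("efron_morris75", split="train", with_info=True)
    return tfds.as_dataframe(ds.take(n), info)


def missing_columns(columns):
    """List expected feature names that are absent from `columns`."""
    present = set(columns)
    return [name for name in EXPECTED_COLUMNS if name not in present]
```

Calling `missing_columns(preview_dataframe().columns)` gives a quick check that the frame carries every feature listed in the feature structure.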
- Citation:
@article{efron1975data,
title={Data analysis using Stein's estimator and its generalizations},
author={Efron, Bradley and Morris, Carl},
journal={Journal of the American Statistical Association},
volume={70},
number={350},
pages={311--319},
year={1975},
publisher={Taylor \& Francis}
}