Use the following command to load this dataset in TFDS:
import tensorflow_datasets as tfds
ds = tfds.load('huggingface:telugu_news')
- Description:
This dataset contains Telugu-language news articles with their respective
topic labels (business, editorial, entertainment, nation, sports), extracted from
the daily Andhra Jyoti. It can be used to build classification and language models.
- License: Data files © Original Authors
- Version: 1.1.0
- Splits:

| Split | Examples |
|---|---|
| 'test' | 4329 |
| 'train' | 17312 |

- Features:
{
  "sno": {
    "dtype": "int32",
    "id": null,
    "_type": "Value"
  },
  "date": {
    "dtype": "string",
    "id": null,
    "_type": "Value"
  },
  "heading": {
    "dtype": "string",
    "id": null,
    "_type": "Value"
  },
  "body": {
    "dtype": "string",
    "id": null,
    "_type": "Value"
  },
  "topic": {
    "num_classes": 5,
    "names": [
      "business",
      "editorial",
      "entertainment",
      "nation",
      "sports"
    ],
    "names_file": null,
    "id": null,
    "_type": "ClassLabel"
  }
}
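
The `topic` feature is a `ClassLabel`, so each example stores the topic as an integer rather than a string. The following is a minimal sketch of converting between the two, assuming the integer ids follow the order of the `names` list above (0 = "business" through 4 = "sports"). The names `TOPIC_NAMES`, `topic_int2str`, and `topic_str2int` are hypothetical helpers written for illustration and are not part of TFDS.

```python
# Topic names copied from the "names" list in the feature spec above.
# Assumption: a ClassLabel id is the index of the name in this list.
TOPIC_NAMES = ["business", "editorial", "entertainment", "nation", "sports"]


def topic_int2str(label: int) -> str:
    """Return the topic name for an integer class id (hypothetical helper)."""
    if not 0 <= label < len(TOPIC_NAMES):
        raise ValueError(f"label must be in [0, {len(TOPIC_NAMES) - 1}], got {label}")
    return TOPIC_NAMES[label]


def topic_str2int(name: str) -> int:
    """Return the integer class id for a topic name (hypothetical helper)."""
    try:
        return TOPIC_NAMES.index(name)
    except ValueError:
        raise ValueError(f"unknown topic: {name!r}") from None


if __name__ == "__main__":
    for class_id in range(len(TOPIC_NAMES)):
        print(class_id, topic_int2str(class_id))
```

If the dataset is loaded through the Hugging Face `datasets` library instead, the `ClassLabel` feature object exposes its own `int2str` and `str2int` methods, which may be preferable to a hand-written mapping.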