References:
18828_alt.atheism
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_alt.atheism')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 799 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
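Every config name on this page follows the pattern `<variant>_<newsgroup>`, where the variant is `18828`, `19997`, or `bydate`. The sketch below builds the string passed to `tfds.load`; the helper `tfds_name` is a hypothetical convenience, not part of TFDS, and the load call itself is left as a comment because it needs `tensorflow_datasets` installed and network access.

```python
# Hypothetical helper (not part of TFDS): builds the dataset name
# used in the tfds.load calls listed on this page.
VARIANTS = ("18828", "19997", "bydate")


def tfds_name(variant: str, newsgroup: str) -> str:
    """Return the 'huggingface:newsgroup/<variant>_<newsgroup>' name."""
    if variant not in VARIANTS:
        raise ValueError(f"unknown variant: {variant!r}")
    return f"huggingface:newsgroup/{variant}_{newsgroup}"


name = tfds_name("18828", "alt.atheism")
print(name)

# To load the data (requires tensorflow_datasets and network access):
# import tensorflow_datasets as tfds
# ds = tfds.load(name)
```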
18828_comp.graphics
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_comp.graphics')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 973 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_comp.os.ms-windows.misc
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_comp.os.ms-windows.misc')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 985 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_comp.sys.ibm.pc.hardware
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_comp.sys.ibm.pc.hardware')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 982 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_comp.sys.mac.hardware
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_comp.sys.mac.hardware')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 961 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_comp.windows.x
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_comp.windows.x')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 980 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_misc.forsale
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_misc.forsale')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 972 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_rec.autos
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_rec.autos')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 990 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_rec.motorcycles
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_rec.motorcycles')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 994 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_rec.sport.baseball
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_rec.sport.baseball')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 994 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_rec.sport.hockey
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_rec.sport.hockey')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 999 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_sci.crypt
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_sci.crypt')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 991 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_sci.electronics
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_sci.electronics')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 981 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_sci.med
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_sci.med')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 990 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_sci.space
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_sci.space')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 987 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_soc.religion.christian
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_soc.religion.christian')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 997 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_talk.politics.guns
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_talk.politics.guns')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 910 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_talk.politics.mideast
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_talk.politics.mideast')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 940 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_talk.politics.misc
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_talk.politics.misc')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 775 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
18828_talk.religion.misc
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/18828_talk.religion.misc')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
does not include cross-posts and includes only the "From" and "Subject" headers.
- License: No known license.
- Version: 3.0.0
- Splits:
Split | Examples |
---|---|
'train' | 628 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
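The 'train' counts listed above for the twenty 18828_* configs can be cross-checked against the variant name. The following is a minimal sketch with the counts copied by hand from the tables above; the variable names are illustrative only.

```python
# 'train' example counts for the 18828_* configs, copied from the tables above.
counts_18828 = {
    "alt.atheism": 799,
    "comp.graphics": 973,
    "comp.os.ms-windows.misc": 985,
    "comp.sys.ibm.pc.hardware": 982,
    "comp.sys.mac.hardware": 961,
    "comp.windows.x": 980,
    "misc.forsale": 972,
    "rec.autos": 990,
    "rec.motorcycles": 994,
    "rec.sport.baseball": 994,
    "rec.sport.hockey": 999,
    "sci.crypt": 991,
    "sci.electronics": 981,
    "sci.med": 990,
    "sci.space": 987,
    "soc.religion.christian": 997,
    "talk.politics.guns": 910,
    "talk.politics.mideast": 940,
    "talk.politics.misc": 775,
    "talk.religion.misc": 628,
}

total = sum(counts_18828.values())
print(f"{len(counts_18828)} configs, {total} documents")
# prints "20 configs, 18828 documents"
```

The total matches the "18828" in the config names, which suggests the prefix is the document count of that variant.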
19997_alt.atheism
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_alt.atheism')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_comp.graphics
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_comp.graphics')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_comp.os.ms-windows.misc
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_comp.os.ms-windows.misc')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_comp.sys.ibm.pc.hardware
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_comp.sys.ibm.pc.hardware')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_comp.sys.mac.hardware
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_comp.sys.mac.hardware')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_comp.windows.x
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_comp.windows.x')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_misc.forsale
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_misc.forsale')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_rec.autos
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_rec.autos')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_rec.motorcycles
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_rec.motorcycles')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_rec.sport.baseball
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_rec.sport.baseball')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_rec.sport.hockey
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_rec.sport.hockey')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_sci.crypt
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_sci.crypt')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_sci.electronics
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_sci.electronics')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_sci.med
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_sci.med')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_sci.space
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_sci.space')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_soc.religion.christian
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_soc.religion.christian')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 997 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_talk.politics.guns
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_talk.politics.guns')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_talk.politics.mideast
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_talk.politics.mideast')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_talk.politics.misc
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_talk.politics.misc')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
19997_talk.religion.misc
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/19997_talk.religion.misc')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
the original, unmodified version.
- License: No known license.
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1000 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_alt.atheism
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_alt.atheism')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
sorted by date into training(60%) and test(40%) sets, does not include cross-posts (duplicates) and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date)
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 319 |
'train' | 480 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_comp.graphics
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_comp.graphics')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
sorted by date into training(60%) and test(40%) sets, does not include cross-posts (duplicates) and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date)
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 389 |
'train' | 584 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_comp.os.ms-windows.misc
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_comp.os.ms-windows.misc')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 394 |
'train' | 591 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_comp.sys.ibm.pc.hardware
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_comp.sys.ibm.pc.hardware')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 392 |
'train' | 590 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_comp.sys.mac.hardware
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_comp.sys.mac.hardware')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 385 |
'train' | 578 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_comp.windows.x
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_comp.windows.x')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 395 |
'train' | 593 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_misc.forsale
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_misc.forsale')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 390 |
'train' | 585 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_rec.autos
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_rec.autos')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 396 |
'train' | 594 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_rec.motorcycles
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_rec.motorcycles')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 398 |
'train' | 598 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_rec.sport.baseball
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_rec.sport.baseball')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 397 |
'train' | 597 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_rec.sport.hockey
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_rec.sport.hockey')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 399 |
'train' | 600 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_sci.crypt
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_sci.crypt')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 396 |
'train' | 595 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_sci.electronics
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_sci.electronics')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 393 |
'train' | 591 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_sci.med
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_sci.med')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 396 |
'train' | 594 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_sci.space
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_sci.space')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 394 |
'train' | 593 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_soc.religion.christian
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_soc.religion.christian')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 398 |
'train' | 599 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_talk.politics.guns
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_talk.politics.guns')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 364 |
'train' | 546 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_talk.politics.mideast
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_talk.politics.mideast')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 376 |
'train' | 564 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_talk.politics.misc
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_talk.politics.misc')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 310 |
'train' | 465 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
bydate_talk.religion.misc
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:newsgroup/bydate_talk.religion.misc')
- Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across
20 different newsgroups. The 20 newsgroups collection has become a popular data set for experiments in text applications of
machine learning techniques, such as text classification and text clustering.
This version is sorted by date into training (60%) and test (40%) sets, does not include cross-posts (duplicates), and does not include newsgroup-identifying headers (Xref, Newsgroups, Path, Followup-To, Date).
- License: No known license.
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'test' | 251 |
'train' | 377 |
- Features:
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
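The bydate descriptions state a 60%/40% train/test split, and the split tables can be checked against that figure with simple arithmetic. The following is an illustrative sketch only: the counts are copied from four of the tables above, and `SPLIT_SIZES` and `train_fraction` are hypothetical names introduced here.

```python
# (train, test) example counts copied from the split tables above.
SPLIT_SIZES = {
    "bydate_comp.graphics": (584, 389),
    "bydate_rec.sport.hockey": (600, 399),
    "bydate_talk.politics.misc": (465, 310),
    "bydate_talk.religion.misc": (377, 251),
}


def train_fraction(train: int, test: int) -> float:
    """Share of a config's examples that fall in the train split."""
    return train / (train + test)


fractions = {
    name: train_fraction(train, test)
    for name, (train, test) in SPLIT_SIZES.items()
}

for name, frac in fractions.items():
    # Each value should sit close to the stated 60% training share.
    print(f"{name}: train share = {frac:.3f}")
```

The same check can be extended to the remaining configs by adding their counts to `SPLIT_SIZES`.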