References:
all
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:big_patent/all')
- Description:
BIGPATENT consists of 1.3 million records of U.S. patent documents
along with human-written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lighting; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology).
There are two features:
- description: detailed description of the patent.
- abstract: patent abstract.
- License: Creative Commons Attribution 4.0 International
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 67072 |
'train' | 1207222 |
'validation' | 67068 |
- Features:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
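The load command above generalizes to every configuration listed on this page: `all` plus one configuration per CPC section. The following is a minimal sketch, assuming `tensorflow_datasets` is installed with support for Hugging Face community datasets. The names `CPC_SECTIONS`, `CONFIGS`, `tfds_name`, and `load_big_patent` are hypothetical helpers for illustration, not part of TFDS.

```python
# Hypothetical helpers; the only real TFDS call used here is tfds.load.
CPC_SECTIONS = {
    "a": "Human Necessities",
    "b": "Performing Operations; Transporting",
    "c": "Chemistry; Metallurgy",
    "d": "Textiles; Paper",
    "e": "Fixed Constructions",
    "f": "Mechanical Engineering; Lighting; Heating; Weapons; Blasting",
    "g": "Physics",
    "h": "Electricity",
    "y": "General tagging of new or cross-sectional technology",
}

# "all" plus the nine per-section configurations.
CONFIGS = ("all",) + tuple(CPC_SECTIONS)


def tfds_name(config: str = "all") -> str:
    """Build the TFDS name for a configuration, e.g. 'huggingface:big_patent/g'."""
    config = config.lower()
    if config not in CONFIGS:
        raise ValueError(f"unknown config {config!r}; expected one of {CONFIGS}")
    return f"huggingface:big_patent/{config}"


def load_big_patent(config: str = "all", split: str = "train"):
    """Load one split of one configuration (downloads data on first use)."""
    # Imported lazily so the name helper works without TFDS installed.
    import tensorflow_datasets as tfds

    return tfds.load(tfds_name(config), split=split)


print(tfds_name("G"))
```

Calling `load_big_patent("d", split="validation")` would fetch the smallest configuration, which is a reasonable choice for a first smoke test.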
a
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:big_patent/a')
- Description:
BIGPATENT consists of 1.3 million records of U.S. patent documents
along with human-written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lighting; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology).
There are two features:
- description: detailed description of the patent.
- abstract: patent abstract.
- License: Creative Commons Attribution 4.0 International
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 9675 |
'train' | 174134 |
'validation' | 9674 |
- Features:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
b
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:big_patent/b')
- Description:
BIGPATENT consists of 1.3 million records of U.S. patent documents
along with human-written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lighting; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology).
There are two features:
- description: detailed description of the patent.
- abstract: patent abstract.
- License: Creative Commons Attribution 4.0 International
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 8974 |
'train' | 161520 |
'validation' | 8973 |
- Features:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
c
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:big_patent/c')
- Description:
BIGPATENT consists of 1.3 million records of U.S. patent documents
along with human-written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lighting; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology).
There are two features:
- description: detailed description of the patent.
- abstract: patent abstract.
- License: Creative Commons Attribution 4.0 International
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 5614 |
'train' | 101042 |
'validation' | 5613 |
- Features:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
d
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:big_patent/d')
- Description:
BIGPATENT consists of 1.3 million records of U.S. patent documents
along with human-written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lighting; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology).
There are two features:
- description: detailed description of the patent.
- abstract: patent abstract.
- License: Creative Commons Attribution 4.0 International
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 565 |
'train' | 10164 |
'validation' | 565 |
- Features:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
e
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:big_patent/e')
- Description:
BIGPATENT consists of 1.3 million records of U.S. patent documents
along with human-written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lighting; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology).
There are two features:
- description: detailed description of the patent.
- abstract: patent abstract.
- License: Creative Commons Attribution 4.0 International
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1914 |
'train' | 34443 |
'validation' | 1914 |
- Features:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
f
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:big_patent/f')
- Description:
BIGPATENT consists of 1.3 million records of U.S. patent documents
along with human-written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lighting; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology).
There are two features:
- description: detailed description of the patent.
- abstract: patent abstract.
- License: Creative Commons Attribution 4.0 International
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 4754 |
'train' | 85568 |
'validation' | 4754 |
- Features:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
g
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:big_patent/g')
- Description:
BIGPATENT consists of 1.3 million records of U.S. patent documents
along with human-written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lighting; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology).
There are two features:
- description: detailed description of the patent.
- abstract: patent abstract.
- License: Creative Commons Attribution 4.0 International
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 14386 |
'train' | 258935 |
'validation' | 14385 |
- Features:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
h
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:big_patent/h')
- Description:
BIGPATENT consists of 1.3 million records of U.S. patent documents
along with human-written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lighting; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology).
There are two features:
- description: detailed description of the patent.
- abstract: patent abstract.
- License: Creative Commons Attribution 4.0 International
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 14279 |
'train' | 257019 |
'validation' | 14279 |
- Features:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
y
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:big_patent/y')
- Description:
BIGPATENT consists of 1.3 million records of U.S. patent documents
along with human-written abstractive summaries.
Each US patent application is filed under a Cooperative Patent Classification
(CPC) code. There are nine such classification categories:
A (Human Necessities), B (Performing Operations; Transporting),
C (Chemistry; Metallurgy), D (Textiles; Paper), E (Fixed Constructions),
F (Mechanical Engineering; Lighting; Heating; Weapons; Blasting),
G (Physics), H (Electricity), and
Y (General tagging of new or cross-sectional technology).
There are two features:
- description: detailed description of the patent.
- abstract: patent abstract.
- License: Creative Commons Attribution 4.0 International
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 6911 |
'train' | 124397 |
'validation' | 6911 |
- Features:
{
"description": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"abstract": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
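The split counts listed for each configuration can be cross-checked against one another: summing the nine per-category configurations reproduces the `all` configuration, and `all` is divided roughly 90/5/5 between train, validation, and test. The sketch below uses only the counts given on this page; the names `SPLITS`, `category_totals`, and `split_fractions` are illustrative.

```python
# Example counts copied from the split tables on this page.
SPLITS = {
    "all": {"train": 1207222, "validation": 67068, "test": 67072},
    "a": {"train": 174134, "validation": 9674, "test": 9675},
    "b": {"train": 161520, "validation": 8973, "test": 8974},
    "c": {"train": 101042, "validation": 5613, "test": 5614},
    "d": {"train": 10164, "validation": 565, "test": 565},
    "e": {"train": 34443, "validation": 1914, "test": 1914},
    "f": {"train": 85568, "validation": 4754, "test": 4754},
    "g": {"train": 258935, "validation": 14385, "test": 14386},
    "h": {"train": 257019, "validation": 14279, "test": 14279},
    "y": {"train": 124397, "validation": 6911, "test": 6911},
}


def category_totals(splits):
    """Sum each split over every configuration except 'all'."""
    totals = {"train": 0, "validation": 0, "test": 0}
    for config, counts in splits.items():
        if config == "all":
            continue
        for split, n in counts.items():
            totals[split] += n
    return totals


def split_fractions(counts):
    """Return each split's share of the configuration's total examples."""
    total = sum(counts.values())
    return {split: n / total for split, n in counts.items()}


print(category_totals(SPLITS) == SPLITS["all"])  # True
print({s: round(f, 3) for s, f in split_fractions(SPLITS["all"]).items()})
# {'train': 0.9, 'validation': 0.05, 'test': 0.05}
```

The totals add up to 1,341,362 examples, consistent with the "1.3 million records" quoted in the description.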