Referencias:
es
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/en')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 55000 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
da
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/da')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 55000 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
Delaware
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/de')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 55000 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
nl
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/nl')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 55000 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
sv
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/sv')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 42490 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
bg
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/bg')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 15986 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
cs
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/cs')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 23187 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
hora
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/hr')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 7944 |
'validation' | 2500 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
sustantivo, masculino, plural—
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/pl')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 23197 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
sk
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/sk')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 22971 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
SL
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/sl')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 23184 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
es
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/es')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 52785 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
fr
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/fr')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 55000 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
él
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/it')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 55000 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
pt
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/pt')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 52370 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
ro
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/ro')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 15921 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
y
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/et')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 23126 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
fi
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/fi')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 42497 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
eh
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/hu')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 22664 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
es
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/lt')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 23188 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
lv
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/lv')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 23208 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
el
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/el')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 55000 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
monte
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/mt')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 17521 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}
todos_idiomas
Utilice el siguiente comando para cargar este conjunto de datos en TFDS:
ds = tfds.load('huggingface:multi_eurlex/all_languages')
- Descripción :
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource).
Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU.
As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels);
this is multi-label classification task (given the text, predict multiple labels).
- Licencia : Ninguna licencia conocida
- Versión : 1.0.0
- Divisiones :
Dividir | Ejemplos |
---|---|
'test' | 5000 |
'train' | 55000 |
'validation' | 5000 |
- Características :
{
"celex_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"text": {
"languages": [
"en",
"da",
"de",
"nl",
"sv",
"bg",
"cs",
"hr",
"pl",
"sk",
"sl",
"es",
"fr",
"it",
"pt",
"ro",
"et",
"fi",
"hu",
"lt",
"lv",
"el",
"mt"
],
"id": null,
"_type": "Translation"
},
"labels": {
"feature": {
"num_classes": 21,
"names": [
"100149",
"100160",
"100148",
"100147",
"100152",
"100143",
"100156",
"100158",
"100154",
"100153",
"100142",
"100145",
"100150",
"100162",
"100159",
"100144",
"100151",
"100157",
"100161",
"100146",
"100155"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"length": -1,
"id": null,
"_type": "Sequence"
}
}