References:
ar_to_en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/ar_to_en')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"ar",
"en"
],
"id": null,
"_type": "Translation"
}
}
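The feature schema above can be read as: each record carries a string `id` and a `translation` field keyed by language code. A minimal sketch of unpacking one record follows. It assumes records arrive as plain Python dictionaries whose `translation` value maps each language code to a string; when loading through TFDS the values may instead be tensors or bytes that need decoding first. The helper name `to_pair` and the sample record are hypothetical and are not taken from the corpus.

```python
from typing import Mapping, Tuple


def to_pair(example: Mapping, src: str = "ar", tgt: str = "en") -> Tuple[str, str, str]:
    """Return (id, source_text, target_text) from one record.

    Assumes `example["translation"]` is a mapping from language code to text.
    """
    translation = example["translation"]
    return example["id"], translation[src], translation[tgt]


# Hypothetical record, invented for illustration only (not real corpus data).
record = {
    "id": "0",
    "translation": {"ar": "<Arabic text>", "en": "<English text>"},
}

print(to_pair(record))
```

For the other configurations, only the `src` and `tgt` arguments change, for example `to_pair(record, "es", "fr")` for `es_to_fr`.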
ar_to_es
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/ar_to_es')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"ar",
"es"
],
"id": null,
"_type": "Translation"
}
}
ar_to_fr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/ar_to_fr')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"ar",
"fr"
],
"id": null,
"_type": "Translation"
}
}
ar_to_ru
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/ar_to_ru')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"ar",
"ru"
],
"id": null,
"_type": "Translation"
}
}
ar_to_zh
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/ar_to_zh')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"ar",
"zh"
],
"id": null,
"_type": "Translation"
}
}
en_to_es
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/en_to_es')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"en",
"es"
],
"id": null,
"_type": "Translation"
}
}
en_to_fr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/en_to_fr')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"en",
"fr"
],
"id": null,
"_type": "Translation"
}
}
en_to_ru
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/en_to_ru')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"en",
"ru"
],
"id": null,
"_type": "Translation"
}
}
en_to_zh
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/en_to_zh')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"en",
"zh"
],
"id": null,
"_type": "Translation"
}
}
es_to_fr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/es_to_fr')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"es",
"fr"
],
"id": null,
"_type": "Translation"
}
}
es_to_ru
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/es_to_ru')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"es",
"ru"
],
"id": null,
"_type": "Translation"
}
}
es_to_zh
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/es_to_zh')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"es",
"zh"
],
"id": null,
"_type": "Translation"
}
}
fr_to_ru
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/fr_to_ru')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"fr",
"ru"
],
"id": null,
"_type": "Translation"
}
}
fr_to_zh
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/fr_to_zh')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"fr",
"zh"
],
"id": null,
"_type": "Translation"
}
}
ru_to_zh
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:un_ga/ru_to_zh')
- Description:
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- License: No known license
- Version: 2.0.0
- Splits:
Split | Examples |
---|---|
'train' | 74067 |
- Features:
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"ru",
"zh"
],
"id": null,
"_type": "Translation"
}
}
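The 15 configuration names on this page follow a regular pattern: every unordered pair of the six language codes, written in alphabetical order and joined with `_to_`. As a hedged sketch, the names and the matching `tfds.load` identifiers can be generated rather than typed by hand. The helpers `config_names` and `tfds_names` are hypothetical conveniences, not part of TFDS, and the code only builds strings; it does not download anything.

```python
from itertools import combinations

# The six language codes that appear in the configurations on this page.
LANGUAGES = ["ar", "en", "es", "fr", "ru", "zh"]


def config_names(languages=LANGUAGES):
    """Build names such as 'ar_to_en' for every alphabetically ordered pair."""
    return [f"{a}_to_{b}" for a, b in combinations(sorted(languages), 2)]


def tfds_names(languages=LANGUAGES):
    """Prefix each configuration name the way the load commands on this page do."""
    return [f"huggingface:un_ga/{name}" for name in config_names(languages)]


for name in tfds_names():
    print(name)
```

Each printed string can then be passed to `tfds.load`, exactly as in the per-configuration commands listed here.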