참고자료:
ar_to_en
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/ar_to_en')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"ar",
"en"
],
"id": null,
"_type": "Translation"
}
}
ar_to_es
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/ar_to_es')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"ar",
"es"
],
"id": null,
"_type": "Translation"
}
}
ar_to_fr
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/ar_to_fr')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"ar",
"fr"
],
"id": null,
"_type": "Translation"
}
}
ar_to_ru
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/ar_to_ru')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"ar",
"ru"
],
"id": null,
"_type": "Translation"
}
}
ar_to_zh
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/ar_to_zh')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"ar",
"zh"
],
"id": null,
"_type": "Translation"
}
}
en_to_es
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/en_to_es')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"en",
"es"
],
"id": null,
"_type": "Translation"
}
}
en_to_fr
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/en_to_fr')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"en",
"fr"
],
"id": null,
"_type": "Translation"
}
}
en_to_ru
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/en_to_ru')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"en",
"ru"
],
"id": null,
"_type": "Translation"
}
}
en_to_zh
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/en_to_zh')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"en",
"zh"
],
"id": null,
"_type": "Translation"
}
}
es_to_fr
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/es_to_fr')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"es",
"fr"
],
"id": null,
"_type": "Translation"
}
}
es_to_ru
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/es_to_ru')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"es",
"ru"
],
"id": null,
"_type": "Translation"
}
}
es_to_zh
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/es_to_zh')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"es",
"zh"
],
"id": null,
"_type": "Translation"
}
}
fr_to_ru
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/fr_to_ru')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"fr",
"ru"
],
"id": null,
"_type": "Translation"
}
}
fr_to_zh
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/fr_to_zh')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"fr",
"zh"
],
"id": null,
"_type": "Translation"
}
}
ru_to_zh
TFDS에 이 데이터세트를 로드하려면 다음 명령어를 사용하세요.
ds = tfds.load('huggingface:un_ga/ru_to_zh')
- 설명 :
United nations general assembly resolutions: A six-language parallel corpus.
This is a collection of translated documents from the United Nations originally compiled into a translation memory by Alexandre Rafalovitch, Robert Dale (see http://uncorpora.org).
6 languages, 15 bitexts
total number of files: 6
total number of tokens: 18.87M
total number of sentence fragments: 0.44M
- 라이센스 : 알려진 라이센스 없음
- 버전 : 2.0.0
- 분할 :
나뉘다 | 예 |
---|---|
'train' | 74067 |
- 특징 :
{
"id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"translation": {
"languages": [
"ru",
"zh"
],
"id": null,
"_type": "Translation"
}
}