参考文献:
en2bg
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2bg')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 4061 |
- 特徴:
{
"translation": {
"languages": [
"en",
"bg"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2cs
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2cs')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 3351 |
- 特徴:
{
"translation": {
"languages": [
"en",
"cs"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2da
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2da')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 3757 |
- 特徴:
{
"translation": {
"languages": [
"en",
"da"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
エンツーデ
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2de')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 4473 |
- 特徴:
{
"translation": {
"languages": [
"en",
"de"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
エンツーエル
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2el')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 2818 |
- 特徴:
{
"translation": {
"languages": [
"en",
"el"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2es
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2es')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 4303 |
- 特徴:
{
"translation": {
"languages": [
"en",
"es"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
エント
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2et')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 2270 |
- 特徴:
{
"translation": {
"languages": [
"en",
"et"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2fi
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2fi')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 1458年 |
- 特徴:
{
"translation": {
"languages": [
"en",
"fi"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2fr
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2fr')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 4476 |
- 特徴:
{
"translation": {
"languages": [
"en",
"fr"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2hu
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2hu')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 3455 |
- 特徴:
{
"translation": {
"languages": [
"en",
"hu"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2is
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2is')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 2206 |
- 特徴:
{
"translation": {
"languages": [
"en",
"is"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
エンイット
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2it')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 2170 |
- 特徴:
{
"translation": {
"languages": [
"en",
"it"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2lt
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2lt')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 3386 |
- 特徴:
{
"translation": {
"languages": [
"en",
"lt"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2lv
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2lv')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 3880 |
- 特徴:
{
"translation": {
"languages": [
"en",
"lv"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2mt
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2mt')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 1722年 |
- 特徴:
{
"translation": {
"languages": [
"en",
"mt"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2nb
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2nb')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 642 |
- 特徴:
{
"translation": {
"languages": [
"en",
"nb"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2nl
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2nl')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 1805年 |
- 特徴:
{
"translation": {
"languages": [
"en",
"nl"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2pl
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2pl')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 4027 |
- 特徴:
{
"translation": {
"languages": [
"en",
"pl"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2pt
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2pt')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 3501 |
- 特徴:
{
"translation": {
"languages": [
"en",
"pt"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
エンロ
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2ro')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 3159 |
- 特徴:
{
"translation": {
"languages": [
"en",
"ro"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2sk
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2sk')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 2972 |
- 特徴:
{
"translation": {
"languages": [
"en",
"sk"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2sl
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2sl')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 4644 |
- 特徴:
{
"translation": {
"languages": [
"en",
"sl"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2sv
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2sv')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 2909 |
- 特徴:
{
"translation": {
"languages": [
"en",
"sv"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2tr
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:europa_eac_tm/en2tr')
- 説明:
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- ライセンス: クリエイティブ コモンズ 表示 4.0 インターナショナル(CC BY 4.0) ライセンス © European Union, 1995-2020
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'train' | 3198 |
- 特徴:
{
"translation": {
"languages": [
"en",
"tr"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}