References:
en2bg
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2bg')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 4061 |
- Features:
{
"translation": {
"languages": [
"en",
"bg"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
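The feature spec above stores `sentence_type` as a class label whose integer values index into the `names` list. A minimal sketch of how the config name and the label names could be handled in Python follows. The helper names (`config_name`, `sentence_type_name`, `load_pair`) are hypothetical and not part of TFDS, and loading a `huggingface:` dataset is assumed to need network access and most likely the Hugging Face `datasets` package alongside `tensorflow_datasets`.

```python
# Label names copied from the "sentence_type" feature spec above.
SENTENCE_TYPE_NAMES = ["form_data", "sentence_data"]


def config_name(target_lang: str) -> str:
    """Build the TFDS name for an English-to-<target_lang> config."""
    return f"huggingface:europa_eac_tm/en2{target_lang}"


def sentence_type_name(label: int) -> str:
    """Map a class label index to its string name."""
    return SENTENCE_TYPE_NAMES[int(label)]


def load_pair(target_lang: str, split: str = "train"):
    """Load one language pair (assumes tensorflow_datasets is installed)."""
    # Imported lazily so the pure-Python helpers above work without TFDS.
    import tensorflow_datasets as tfds

    return tfds.load(config_name(target_lang), split=split)
```

For example, `load_pair("bg")` would issue the same call as the command shown for `en2bg`, restricted to the `train` split.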
en2cs
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2cs')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 3351 |
- Features:
{
"translation": {
"languages": [
"en",
"cs"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
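Each config listed here ships only a `train` split, so a validation set has to be carved out by the user. One possible approach, sketched below, relies on the TFDS split slicing syntax (`'train[:90%]'`). The helper names and the 10% default are illustrative assumptions, not something the dataset prescribes.

```python
def holdout_splits(valid_percent: int = 10, base: str = "train"):
    """Return two TFDS split strings: a training slice and a held-out slice."""
    if not 0 < valid_percent < 100:
        raise ValueError("valid_percent must be between 1 and 99")
    cut = 100 - valid_percent
    return [f"{base}[:{cut}%]", f"{base}[{cut}%:]"]


def load_with_holdout(name: str, valid_percent: int = 10):
    """Load a config as (train, validation); assumes tensorflow_datasets is installed."""
    # Imported lazily so holdout_splits() can be used without TFDS.
    import tensorflow_datasets as tfds

    # Passing a list of splits makes tfds.load return one dataset per entry.
    train_ds, valid_ds = tfds.load(name, split=holdout_splits(valid_percent))
    return train_ds, valid_ds
```

Calling `load_with_holdout('huggingface:europa_eac_tm/en2cs')` would then keep roughly nine tenths of the 3351 examples for training.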
en2da
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2da')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 3757 |
- Features:
{
"translation": {
"languages": [
"en",
"da"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
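Because every example carries a `sentence_type` label, a user may want to keep only the `sentence_data` rows and drop the `form_data` ones. The sketch below filters plain Python dicts, such as those produced by iterating over `tfds.as_numpy(ds)`. The example layout (a `translation` dict keyed by language code plus an integer `sentence_type`) is an assumption based on the feature spec above, and the function name is hypothetical.

```python
# Index of "sentence_data" in the names list of the feature spec above.
SENTENCE_DATA = 1


def only_sentences(examples):
    """Yield the examples whose sentence_type is "sentence_data"."""
    for example in examples:
        # int() also accepts NumPy integer scalars.
        if int(example["sentence_type"]) == SENTENCE_DATA:
            yield example


# Placeholder records for illustration only, not real dataset content.
placeholder_examples = [
    {"translation": {"en": "<en text 0>", "da": "<da text 0>"}, "sentence_type": 0},
    {"translation": {"en": "<en text 1>", "da": "<da text 1>"}, "sentence_type": 1},
]
kept = list(only_sentences(placeholder_examples))
```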
en2de
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2de')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 4473 |
- Features:
{
"translation": {
"languages": [
"en",
"de"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2el
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2el')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 2818 |
- Features:
{
"translation": {
"languages": [
"en",
"el"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2es
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2es')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 4303 |
- Features:
{
"translation": {
"languages": [
"en",
"es"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2et
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2et')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 2270 |
- Features:
{
"translation": {
"languages": [
"en",
"et"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2fi
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2fi')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1458 |
- Features:
{
"translation": {
"languages": [
"en",
"fi"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2fr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2fr')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 4476 |
- Features:
{
"translation": {
"languages": [
"en",
"fr"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2hu
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2hu')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 3455 |
- Features:
{
"translation": {
"languages": [
"en",
"hu"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2is
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2is')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 2206 |
- Features:
{
"translation": {
"languages": [
"en",
"is"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2it
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2it')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 2170 |
- Features:
{
"translation": {
"languages": [
"en",
"it"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2lt
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2lt')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 3386 |
- Features:
{
"translation": {
"languages": [
"en",
"lt"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2lv
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2lv')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 3880 |
- Features:
{
"translation": {
"languages": [
"en",
"lv"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2mt
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2mt')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1722 |
- Features:
{
"translation": {
"languages": [
"en",
"mt"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2nb
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2nb')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 642 |
- Features:
{
"translation": {
"languages": [
"en",
"nb"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2nl
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2nl')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 1805 |
- Features:
{
"translation": {
"languages": [
"en",
"nl"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2pl
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2pl')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International (CC BY 4.0) licence © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 4027 |
- Features:
{
"translation": {
"languages": [
"en",
"pl"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2pt
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2pt')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International License (CC BY 4.0) © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 3501 |
- Features:
{
"translation": {
"languages": [
"en",
"pt"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2ro
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2ro')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International License (CC BY 4.0) © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 3159 |
- Features:
{
"translation": {
"languages": [
"en",
"ro"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2sk
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2sk')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International License (CC BY 4.0) © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 2972 |
- Features:
{
"translation": {
"languages": [
"en",
"sk"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2sl
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2sl')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International License (CC BY 4.0) © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 4644 |
- Features:
{
"translation": {
"languages": [
"en",
"sl"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2sv
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2sv')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International License (CC BY 4.0) © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 2909 |
- Features:
{
"translation": {
"languages": [
"en",
"sv"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
en2tr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:europa_eac_tm/en2tr')
- Description:
In October 2012, the European Union's (EU) Directorate General for Education and Culture (DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, or EAC-TM for short.
EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.
All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
- License: Creative Commons Attribution 4.0 International License (CC BY 4.0) © European Union, 1995-2020
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'train' | 3198 |
- Features:
{
"translation": {
"languages": [
"en",
"tr"
],
"id": null,
"_type": "Translation"
},
"sentence_type": {
"num_classes": 2,
"names": [
"form_data",
"sentence_data"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}
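Every configuration in this listing follows the same naming pattern, `huggingface:europa_eac_tm/en2<target>`. The sketch below rebuilds those names from the target-language codes and compares the 'train' split sizes quoted in the tables above. The dictionary `TRAIN_SIZES` and the helper `tfds_name` are illustrative and not part of TFDS; the numbers are copied from this section and are not recomputed from the data.

```python
# 'train' split sizes as listed in the tables above, keyed by target language.
TRAIN_SIZES = {
    "pl": 4027,
    "pt": 3501,
    "ro": 3159,
    "sk": 2972,
    "sl": 4644,
    "sv": 2909,
    "tr": 3198,
}


def tfds_name(target: str) -> str:
    """Build the TFDS dataset name for an English-to-target config (illustrative helper)."""
    return f"huggingface:europa_eac_tm/en2{target}"


names = [tfds_name(target) for target in TRAIN_SIZES]
total = sum(TRAIN_SIZES.values())
largest = max(TRAIN_SIZES, key=TRAIN_SIZES.get)
smallest = min(TRAIN_SIZES, key=TRAIN_SIZES.get)

for name in names:
    print(name)
print("total examples:", total)
print("largest config:", tfds_name(largest))
print("smallest config:", tfds_name(smallest))
```

Each generated name can be passed to `tfds.load` exactly as in the commands above, provided `tensorflow_datasets` is installed and the data can be downloaded.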