europa_eac_tm

Tài liệu tham khảo:

en2bg

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2bg')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 4061
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "bg"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2cs

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2cs')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 3351
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "cs"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2da

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2da')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 3757
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "da"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2de

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2de')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 4473
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2el

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2el')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 2818
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2es

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2es')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 4303
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2et

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2et')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 2270
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2fi

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2fi')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 1458
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2fr

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2fr')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 4476
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2hu

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2hu')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 3455
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2is

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2is')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 2206
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "is"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2it

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2it')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 2170
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2lt

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2lt')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 3386
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2lv

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2lv')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 3880
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2mt

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2mt')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 1722
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "mt"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2nb

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2nb')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 642
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "nb"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2nl

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2nl')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 1805
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2pl

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2pl')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 4027
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2pt

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2pt')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 3501
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2ro

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2ro')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 3159
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2sk

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2sk')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 2972
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2sl

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2sl')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 4644
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2sv

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2sv')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 2909
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

en2tr

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:europa_eac_tm/en2tr')
  • Sự miêu tả :
In October 2012, the European Union's (EU) Directorate General for Education and Culture ( DG EAC) released a translation memory (TM), i.e. a collection of sentences and their professionally produced translations, in twenty-six languages. This resource bears the name EAC Translation Memory, short EAC-TM.

EAC-TM covers up to 26 languages: 22 official languages of the EU (all except Irish) plus Icelandic, Croatian, Norwegian and Turkish. EAC-TM thus contains translations from English into the following 25 languages: Bulgarian, Czech, Danish, Dutch, Estonian, German, Greek, Finnish, French, Croatian, Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish and Turkish.

All documents and sentences were originally written in English (source language is English) and then translated into the other languages. The texts were translated by staff of the National Agencies of the Lifelong Learning and Youth in Action programmes. They are typically professionals in the field of education/youth and EU programmes. They are thus not professional translators, but they are normally native speakers of the target language.
  • Giấy phép : Giấy phép Creative Commons Ghi công 4.0 Quốc tế (CC BY 4.0) © Liên minh Châu Âu, 1995-2020
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 3198
  • Đặc trưng :
{
    "translation": {
        "languages": [
            "en",
            "tr"
        ],
        "id": null,
        "_type": "Translation"
    },
    "sentence_type": {
        "num_classes": 2,
        "names": [
            "form_data",
            "sentence_data"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}