europarl_bilingual

bg-cs

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-cs')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	402657

Features:

{
    "translation": {
        "languages": [
            "bg",
            "cs"
        ],
        "id": null,
        "_type": "Translation"
    }
}
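
The load command above can be generalized to any language pair. The following is a minimal sketch, not part of the TFDS documentation: the helper names `europarl_config` and `load_pair` are hypothetical, and the alphabetical ordering of the two language codes is an assumption inferred from the config names listed on this page (for example `bg-cs`, never `cs-bg`). Only `tfds.load` is an actual TFDS call.

```python
def europarl_config(lang_a: str, lang_b: str) -> str:
    """Build the TFDS dataset name for a language pair.

    Assumption: config names put the two codes in alphabetical order,
    as every config listed on this page does.
    """
    if lang_a.lower() == lang_b.lower():
        raise ValueError("a language pair needs two different codes")
    first, second = sorted((lang_a.lower(), lang_b.lower()))
    return f"huggingface:europarl_bilingual/{first}-{second}"


def load_pair(lang_a: str, lang_b: str, split: str = "train"):
    """Load one split of the given language pair (hypothetical helper)."""
    # Imported lazily so the name helper works without TFDS installed.
    import tensorflow_datasets as tfds

    return tfds.load(europarl_config(lang_a, lang_b), split=split)


if __name__ == "__main__":
    print(europarl_config("cs", "bg"))  # huggingface:europarl_bilingual/bg-cs
```

Calling `load_pair("bg", "cs")` would then be equivalent to the `tfds.load` command shown for this config, restricted to the `'train'` split, which is the only split listed.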

bg-da

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-da')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	393449

Features:

{
    "translation": {
        "languages": [
            "bg",
            "da"
        ],
        "id": null,
        "_type": "Translation"
    }
}
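
The feature specification above is plain JSON, so the language pair of a config can be read from it programmatically. This is a small sketch using only the Python standard library; the function name `translation_languages` is hypothetical, and the JSON literal is copied from the `bg-da` entry above.

```python
import json

# Feature specification copied from the bg-da entry on this page.
FEATURES_JSON = """
{
    "translation": {
        "languages": ["bg", "da"],
        "id": null,
        "_type": "Translation"
    }
}
"""


def translation_languages(features_json: str) -> list[str]:
    """Return the language codes declared by a Translation feature spec."""
    spec = json.loads(features_json)
    field = spec["translation"]
    if field.get("_type") != "Translation":
        raise ValueError("expected a feature of type 'Translation'")
    return list(field["languages"])


languages = translation_languages(FEATURES_JSON)
print(languages)  # ['bg', 'da']
```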

bg-de

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-de')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	393298

Features:

{
    "translation": {
        "languages": [
            "bg",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-el

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-el')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	377341

Features:

{
    "translation": {
        "languages": [
            "bg",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-en')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	408290

Features:

{
    "translation": {
        "languages": [
            "bg",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-es

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-es')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	388226

Features:

{
    "translation": {
        "languages": [
            "bg",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-et

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-et')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	400712

Features:

{
    "translation": {
        "languages": [
            "bg",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-fi

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-fi')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	396624

Features:

{
    "translation": {
        "languages": [
            "bg",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-fr

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-fr')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	393644

Features:

{
    "translation": {
        "languages": [
            "bg",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-hu

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-hu')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	382773

Features:

{
    "translation": {
        "languages": [
            "bg",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-it')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	377822

Features:

{
    "translation": {
        "languages": [
            "bg",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-lt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-lt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	392554

Features:

{
    "translation": {
        "languages": [
            "bg",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-lv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-lv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	398355

Features:

{
    "translation": {
        "languages": [
            "bg",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	388273

Features:

{
    "translation": {
        "languages": [
            "bg",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	395269

Features:

{
    "translation": {
        "languages": [
            "bg",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	388972

Features:

{
    "translation": {
        "languages": [
            "bg",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	389381

Features:

{
    "translation": {
        "languages": [
            "bg",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	393815

Features:

{
    "translation": {
        "languages": [
            "bg",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	380231

Features:

{
    "translation": {
        "languages": [
            "bg",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/bg-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	398236

Features:

{
    "translation": {
        "languages": [
            "bg",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-da

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-da')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	618055

Features:

{
    "translation": {
        "languages": [
            "cs",
            "da"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-de

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-de')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	568589

Features:

{
    "translation": {
        "languages": [
            "cs",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-el

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-el')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	599489

Features:

{
    "translation": {
        "languages": [
            "cs",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-en')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	647095

Features:

{
    "translation": {
        "languages": [
            "cs",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-es

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-es')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	619774

Features:

{
    "translation": {
        "languages": [
            "cs",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-et

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-et')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	636512

Features:

{
    "translation": {
        "languages": [
            "cs",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-fi

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-fi')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	619320

Features:

{
    "translation": {
        "languages": [
            "cs",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-fr

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-fr')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	628200

Features:

{
    "translation": {
        "languages": [
            "cs",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-hu

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-hu')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	616160

Features:

{
    "translation": {
        "languages": [
            "cs",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-it')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	607017

Features:

{
    "translation": {
        "languages": [
            "cs",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-lt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-lt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	624292

Features:

{
    "translation": {
        "languages": [
            "cs",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-lv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-lv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	627873

Features:

{
    "translation": {
        "languages": [
            "cs",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	618414

Features:

{
    "translation": {
        "languages": [
            "cs",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	621387

Features:

{
    "translation": {
        "languages": [
            "cs",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	609729

Features:

{
    "translation": {
        "languages": [
            "cs",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	392085

Features:

{
    "translation": {
        "languages": [
            "cs",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	636128

Features:

{
    "translation": {
        "languages": [
            "cs",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	611624

Features:

{
    "translation": {
        "languages": [
            "cs",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/cs-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	631544

Features:

{
    "translation": {
        "languages": [
            "cs",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-de

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-de')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1928414

Features:

{
    "translation": {
        "languages": [
            "da",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-el

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-el')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1280579

Features:

{
    "translation": {
        "languages": [
            "da",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-en')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1991647

Features:

{
    "translation": {
        "languages": [
            "da",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-es

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-es')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1943931

Features:

{
    "translation": {
        "languages": [
            "da",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-et

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-et')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	635018

Features:

{
    "translation": {
        "languages": [
            "da",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-fi

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-fi')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1917260

Features:

{
    "translation": {
        "languages": [
            "da",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-fr

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-fr')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1992590

Features:

{
    "translation": {
        "languages": [
            "da",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-hu

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-hu')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	617519

Features:

{
    "translation": {
        "languages": [
            "da",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-it')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1876703

Features:

{
    "translation": {
        "languages": [
            "da",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-lt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-lt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	614923

Features:

{
    "translation": {
        "languages": [
            "da",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-lv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-lv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	627809

Features:

{
    "translation": {
        "languages": [
            "da",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1987498

Features:

{
    "translation": {
        "languages": [
            "da",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	642544

Features:

{
    "translation": {
        "languages": [
            "da",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1930454

Features:

{
    "translation": {
        "languages": [
            "da",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	388156

Features:

{
    "translation": {
        "languages": [
            "da",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	621907

Features:

{
    "translation": {
        "languages": [
            "da",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	595944

Features:

{
    "translation": {
        "languages": [
            "da",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/da-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1871171

Features:

{
    "translation": {
        "languages": [
            "da",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-el

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-el')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1223026

Features:

{
    "translation": {
        "languages": [
            "de",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    }
}
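
The feature specification above can be read programmatically to find out which two languages a configuration holds. The following is a minimal sketch using only the standard library. The helper names are hypothetical, and the example layout in `split_pair` (a `translation` dict keyed by language code) is an assumption based on how the Hugging Face `Translation` feature is usually represented; the tensors TFDS returns may be encoded differently (for instance as bytes).

```python
import json
from typing import Dict, List, Tuple

# The feature specification shown for the de-el configuration.
FEATURES_JSON = """
{
    "translation": {
        "languages": ["de", "el"],
        "id": null,
        "_type": "Translation"
    }
}
"""


def translation_languages(features_json: str) -> List[str]:
    """Return the language codes declared by a Translation feature spec."""
    spec = json.loads(features_json)
    field = spec["translation"]
    if field.get("_type") != "Translation":
        raise ValueError("expected a feature of type 'Translation'")
    return list(field["languages"])


def split_pair(example: Dict[str, Dict[str, str]], languages: List[str]) -> Tuple[str, str]:
    """Pull the two sentences out of one example.

    Assumes the layout {"translation": {"de": "...", "el": "..."}}.
    """
    first, second = languages
    return example["translation"][first], example["translation"][second]


if __name__ == "__main__":
    langs = translation_languages(FEATURES_JSON)
    # Made-up sentence pair for illustration only, not taken from the corpus.
    sample = {"translation": {"de": "Guten Morgen", "el": "Καλημέρα"}}
    print(langs, split_pair(sample, langs))
```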

de-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-en')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1961119

Features:

{
    "translation": {
        "languages": [
            "de",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-es

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-es')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1887879

Features:

{
    "translation": {
        "languages": [
            "de",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-et

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-et')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	578248

Features:

{
    "translation": {
        "languages": [
            "de",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-fi

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-fi')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1871185

Features:

{
    "translation": {
        "languages": [
            "de",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-fr

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-fr')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1942666

Features:

{
    "translation": {
        "languages": [
            "de",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-hu

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-hu')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	563571

Features:

{
    "translation": {
        "languages": [
            "de",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-it')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1832989

Features:

{
    "translation": {
        "languages": [
            "de",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-lt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-lt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	565892

Features:

{
    "translation": {
        "languages": [
            "de",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-lv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-lv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	573226

Features:

{
    "translation": {
        "languages": [
            "de",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1934111

Features:

{
    "translation": {
        "languages": [
            "de",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	579166

Features:

{
    "translation": {
        "languages": [
            "de",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1884176

Features:

{
    "translation": {
        "languages": [
            "de",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	385663

Features:

{
    "translation": {
        "languages": [
            "de",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	569381

Features:

{
    "translation": {
        "languages": [
            "de",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	546212

Features:

{
    "translation": {
        "languages": [
            "de",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/de-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1842026

Features:

{
    "translation": {
        "languages": [
            "de",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-en')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1292180
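
Only a `'train'` split is listed, so a held-out set has to be carved out of it. One option is the percent slicing syntax that `tfds.load` accepts in its `split` argument. The sketch below only builds the split strings; `holdout_splits` and `load_with_holdout` are hypothetical helpers, and whether slicing behaves identically for `huggingface:` datasets is an assumption worth checking on your installation.

```python
from typing import List


def holdout_splits(holdout_percent: int) -> List[str]:
    """Return two TFDS split strings: the training part and the held-out tail."""
    if not 0 < holdout_percent < 100:
        raise ValueError("holdout_percent must be between 1 and 99")
    cut = 100 - holdout_percent
    return [f"train[:{cut}%]", f"train[{cut}%:]"]


def load_with_holdout(name: str, holdout_percent: int = 1):
    """Load a dataset as (train, holdout) using the split strings above."""
    # Imported lazily so the string helper works without TFDS installed.
    import tensorflow_datasets as tfds

    train_ds, holdout_ds = tfds.load(name, split=holdout_splits(holdout_percent))
    return train_ds, holdout_ds


if __name__ == "__main__":
    print(holdout_splits(1))  # ['train[:99%]', 'train[99%:]']
```

With the 1292180 examples listed for `el-en`, a 1% holdout would contain roughly 12900 sentence pairs.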

Features:

{
    "translation": {
        "languages": [
            "el",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-es

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-es')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1272383

Features:

{
    "translation": {
        "languages": [
            "el",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-et

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-et')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	599915

Features:

{
    "translation": {
        "languages": [
            "el",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-fi

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-fi')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1227612

Features:

{
    "translation": {
        "languages": [
            "el",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-fr

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-fr')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1290796

Features:

{
    "translation": {
        "languages": [
            "el",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-hu

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-hu')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	586250

Features:

{
    "translation": {
        "languages": [
            "el",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-it')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1231222

Features:

{
    "translation": {
        "languages": [
            "el",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-lt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-lt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	590850

Features:

{
    "translation": {
        "languages": [
            "el",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-lv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-lv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	596929

Features:

{
    "translation": {
        "languages": [
            "el",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1277297

Features:

{
    "translation": {
        "languages": [
            "el",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	591069

Features:

{
    "translation": {
        "languages": [
            "el",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1261188

Features:

{
    "translation": {
        "languages": [
            "el",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	372839

Features:

{
    "translation": {
        "languages": [
            "el",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	600684

Features:

{
    "translation": {
        "languages": [
            "el",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	579109

Features:

{
    "translation": {
        "languages": [
            "el",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/el-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1273743

Features:

{
    "translation": {
        "languages": [
            "el",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-es

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-es')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	2009073

Features:

{
    "translation": {
        "languages": [
            "en",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}
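
The configuration names on this page list the two language codes in alphabetical order (`en-es`, not `es-en`). This is an observation about the entries shown here, not a documented guarantee. Under that assumption, the sketch below builds the name passed to `tfds.load` from two codes given in any order. The helper `europarl_config_name` is hypothetical, and it does not check that the pair actually exists in the catalog.

```python
def europarl_config_name(lang_a: str, lang_b: str) -> str:
    """Build the TFDS name for a language pair, codes sorted alphabetically."""
    first, second = sorted([lang_a.strip().lower(), lang_b.strip().lower()])
    if first == second:
        raise ValueError("a bilingual configuration needs two different languages")
    return f"huggingface:europarl_bilingual/{first}-{second}"


name = europarl_config_name("es", "en")
print(name)

# Assumed usage (requires tensorflow_datasets and network access):
# import tensorflow_datasets as tfds
# ds = tfds.load(name, split="train")
```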

en-et

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-et')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	651236

Features:

{
    "translation": {
        "languages": [
            "en",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fi

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-fi')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1969624

Features:

{
    "translation": {
        "languages": [
            "en",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fr

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-fr')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	2051014

Features:

{
    "translation": {
        "languages": [
            "en",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-hu

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-hu')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	625178

Features:

{
    "translation": {
        "languages": [
            "en",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-it')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1946253

Features:

{
    "translation": {
        "languages": [
            "en",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-lt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-lt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	634284

Features:

{
    "translation": {
        "languages": [
            "en",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-lv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-lv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	639318

Features:

{
    "translation": {
        "languages": [
            "en",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	2027447

Features:

{
    "translation": {
        "languages": [
            "en",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	631160

Features:

{
    "translation": {
        "languages": [
            "en",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	2002943

Features:

{
    "translation": {
        "languages": [
            "en",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	400356

Features:

{
    "translation": {
        "languages": [
            "en",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	639958

Features:

{
    "translation": {
        "languages": [
            "en",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	624803

Features:

{
    "translation": {
        "languages": [
            "en",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/en-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1892723

Features:

{
    "translation": {
        "languages": [
            "en",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-et

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-et')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	618350

Features:

{
    "translation": {
        "languages": [
            "es",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-fi

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-fi')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1901596

Features:

{
    "translation": {
        "languages": [
            "es",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-fr

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-fr')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1982990

Features:

{
    "translation": {
        "languages": [
            "es",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-hu

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-hu')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	604007

Features:

{
    "translation": {
        "languages": [
            "es",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-it')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1880982

Features:

{
    "translation": {
        "languages": [
            "es",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-lt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-lt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	611082

Features:

{
    "translation": {
        "languages": [
            "es",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-lv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-lv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	615496

Features:

{
    "translation": {
        "languages": [
            "es",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1954351

Features:

{
    "translation": {
        "languages": [
            "es",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	609297

Features:

{
    "translation": {
        "languages": [
            "es",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1933321

Features:

{
    "translation": {
        "languages": [
            "es",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	387653

Features:

{
    "translation": {
        "languages": [
            "es",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	619027

Features:

{
    "translation": {
        "languages": [
            "es",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	599168

Features:

{
    "translation": {
        "languages": [
            "es",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/es-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1826855

Features:

{
    "translation": {
        "languages": [
            "es",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}
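
Each configuration on this page lists only a `'train'` split, so a validation set has to be carved out by the user. The sketch below builds a pair of split specifications using the percent-slicing syntax of the TFDS split API. Whether slicing behaves identically for this Hugging Face community dataset is an assumption, and `holdout_splits` is a hypothetical helper.

```python
from typing import Tuple


def holdout_splits(holdout_percent: int) -> Tuple[str, str]:
    """Return (train_spec, validation_spec) slicing the single 'train' split."""
    if not 0 < holdout_percent < 100:
        raise ValueError("holdout_percent must be between 1 and 99")
    cut = 100 - holdout_percent
    # The first spec keeps the leading `cut` percent, the second the remainder.
    return f"train[:{cut}%]", f"train[{cut}%:]"


train_spec, valid_spec = holdout_splits(5)
print(train_spec, valid_spec)

# Assumed usage (requires tensorflow_datasets and network access):
# import tensorflow_datasets as tfds
# train_ds, valid_ds = tfds.load(
#     "huggingface:europarl_bilingual/es-sv", split=[train_spec, valid_spec])
```

The two specifications do not overlap, so no sentence pair appears in both the training and the held-out portion.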

et-fi

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/et-fi')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	620939

Features:

{
    "translation": {
        "languages": [
            "et",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-fr

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/et-fr')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	630126

Features:

{
    "translation": {
        "languages": [
            "et",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-hu

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/et-hu')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	628044

Features:

{
    "translation": {
        "languages": [
            "et",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/et-it')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	607088

Features:

{
    "translation": {
        "languages": [
            "et",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-lt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/et-lt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	622003

Features:

{
    "translation": {
        "languages": [
            "et",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-lv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/et-lv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	637468

Features:

{
    "translation": {
        "languages": [
            "et",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/et-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	621150

Features:

{
    "translation": {
        "languages": [
            "et",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/et-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	639046

Features:

{
    "translation": {
        "languages": [
            "et",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/et-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	616238

Features:

{
    "translation": {
        "languages": [
            "et",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/et-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	389087

Features:

{
    "translation": {
        "languages": [
            "et",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/et-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	634168

Features:

{
    "translation": {
        "languages": [
            "et",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/et-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	609731

Features:

{
    "translation": {
        "languages": [
            "et",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/et-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	656646

Features:

{
    "translation": {
        "languages": [
            "et",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-fr

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fi-fr')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1964126

Features:

{
    "translation": {
        "languages": [
            "fi",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-hu

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fi-hu')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	606348

Features:

{
    "translation": {
        "languages": [
            "fi",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fi-it')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1845203

Features:

{
    "translation": {
        "languages": [
            "fi",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-lt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fi-lt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	613113

Features:

{
    "translation": {
        "languages": [
            "fi",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-lv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fi-lv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	616816

Features:

{
    "translation": {
        "languages": [
            "fi",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fi-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1940808

Features:

{
    "translation": {
        "languages": [
            "fi",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fi-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	612689

Features:

{
    "translation": {
        "languages": [
            "fi",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fi-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1885062

Features:

{
    "translation": {
        "languages": [
            "fi",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fi-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	391430

Features:

{
    "translation": {
        "languages": [
            "fi",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fi-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	623686

Features:

{
    "translation": {
        "languages": [
            "fi",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fi-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	596661

Features:

{
    "translation": {
        "languages": [
            "fi",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fi-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1883314

Features:

{
    "translation": {
        "languages": [
            "fi",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-hu

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fr-hu')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	615791

Features:

{
    "translation": {
        "languages": [
            "fr",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}
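
The Features block above is plain JSON, so the language codes of a config can be read from it programmatically. A minimal sketch using only the standard library; `pair_languages` is a hypothetical helper name, and the JSON string is the fr-hu block copied from this entry.

```python
import json

# The fr-hu Features block from this entry, copied as-is.
FEATURES_JSON = """
{
    "translation": {
        "languages": [
            "fr",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}
"""


def pair_languages(features_json: str) -> tuple:
    """Return the language codes declared by a Translation feature spec."""
    spec = json.loads(features_json)
    translation = spec["translation"]
    if translation.get("_type") != "Translation":
        raise ValueError("expected a feature of _type 'Translation'")
    return tuple(translation["languages"])


print(pair_languages(FEATURES_JSON))  # prints ('fr', 'hu')
```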

fr-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fr-it')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1943673

Features:

{
    "translation": {
        "languages": [
            "fr",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-lt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fr-lt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	620660

Features:

{
    "translation": {
        "languages": [
            "fr",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-lv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fr-lv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	626280

Features:

{
    "translation": {
        "languages": [
            "fr",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fr-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	2029551

Features:

{
    "translation": {
        "languages": [
            "fr",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}
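
With roughly two million examples, fr-nl is one of the larger configs in this part of the list, and a small slice is often enough for a first experiment. The sketch below builds a TFDS percent-slice split string and estimates how many examples it would cover. `percent_split` and `approx_examples` are hypothetical helper names, and the estimate uses floor division, which is an assumption; the boundary TFDS actually picks may differ by an example or so.

```python
# Train example count copied from the fr-nl Splits table above.
FR_NL_TRAIN_EXAMPLES = 2029551


def percent_split(percent: int) -> str:
    """Return a split string selecting the first `percent` percent of train."""
    if not 0 < percent <= 100:
        raise ValueError("percent must be in the range 1..100")
    return f"train[:{percent}%]"


def approx_examples(total: int, percent: int) -> int:
    """Estimate the slice size with floor division (an approximation)."""
    return total * percent // 100


print(percent_split(1))                          # prints train[:1%]
print(approx_examples(FR_NL_TRAIN_EXAMPLES, 1))  # prints 20295
```

The resulting string could be passed as the `split` argument, for example `tfds.load('huggingface:europarl_bilingual/fr-nl', split=percent_split(1))`.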

fr-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fr-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	621402

Features:

{
    "translation": {
        "languages": [
            "fr",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fr-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1980132

Features:

{
    "translation": {
        "languages": [
            "fr",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fr-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	387846

Features:

{
    "translation": {
        "languages": [
            "fr",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fr-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	631846

Features:

{
    "translation": {
        "languages": [
            "fr",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fr-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	606897

Features:

{
    "translation": {
        "languages": [
            "fr",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/fr-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1880390

Features:

{
    "translation": {
        "languages": [
            "fr",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/hu-it')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	589563

Features:

{
    "translation": {
        "languages": [
            "hu",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}
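
The Features block suggests that each example carries a `translation` mapping keyed by the two language codes. Assuming that layout, a source/target pair could be unpacked as sketched below. `to_pair` is a hypothetical helper name, the sample dictionary holds placeholder strings rather than real corpus text, and the exact runtime types under TFDS (tensors, bytes, or str) are not confirmed by this page, so the helper accepts both bytes and str.

```python
def to_pair(example: dict, src: str, tgt: str) -> tuple:
    """Return (source_text, target_text) from one translation example.

    Assumes example["translation"] maps language codes to text.
    """
    def as_text(value) -> str:
        # Values converted to NumPy typically arrive as UTF-8 bytes.
        return value.decode("utf-8") if isinstance(value, bytes) else str(value)

    translation = example["translation"]
    return as_text(translation[src]), as_text(translation[tgt])


# Placeholder example shaped like the Features block; not real data.
placeholder = {"translation": {"hu": "<hu sentence>", "it": b"<it sentence>"}}

print(to_pair(placeholder, "hu", "it"))  # prints ('<hu sentence>', '<it sentence>')
```

When iterating a loaded dataset, wrapping it in `tfds.as_numpy(ds)` would yield NumPy values, which is the case the bytes branch is meant to handle.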

hu-lt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/hu-lt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	610298

Features:

{
    "translation": {
        "languages": [
            "hu",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-lv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/hu-lv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	621101

Features:

{
    "translation": {
        "languages": [
            "hu",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/hu-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	605806

Features:

{
    "translation": {
        "languages": [
            "hu",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/hu-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	621820

Features:

{
    "translation": {
        "languages": [
            "hu",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/hu-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	599639

Features:

{
    "translation": {
        "languages": [
            "hu",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/hu-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	377239

Features:

{
    "translation": {
        "languages": [
            "hu",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/hu-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	618247

Features:

{
    "translation": {
        "languages": [
            "hu",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/hu-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	601671

Features:

{
    "translation": {
        "languages": [
            "hu",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/hu-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	631872

Features:

{
    "translation": {
        "languages": [
            "hu",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-lt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/it-lt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	593003

Features:

{
    "translation": {
        "languages": [
            "it",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-lv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/it-lv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	599394

Features:

{
    "translation": {
        "languages": [
            "it",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/it-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1919855

Features:

{
    "translation": {
        "languages": [
            "it",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/it-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	594472

Features:

{
    "translation": {
        "languages": [
            "it",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/it-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1877432

Features:

{
    "translation": {
        "languages": [
            "it",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/it-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	367904

Features:

{
    "translation": {
        "languages": [
            "it",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/it-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	603467

Features:

{
    "translation": {
        "languages": [
            "it",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/it-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	579968

Features:

{
    "translation": {
        "languages": [
            "it",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/it-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1766096

Features:

{
    "translation": {
        "languages": [
            "it",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-lv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lt-lv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	621857

Features:

{
    "translation": {
        "languages": [
            "lt",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lt-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	613308

Features:

{
    "translation": {
        "languages": [
            "lt",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lt-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	617296

Features:

{
    "translation": {
        "languages": [
            "lt",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lt-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	603223

Features:

{
    "translation": {
        "languages": [
            "lt",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lt-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	384679

Features:

{
    "translation": {
        "languages": [
            "lt",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lt-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	622997

Features:

{
    "translation": {
        "languages": [
            "lt",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lt-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	602442

Features:

{
    "translation": {
        "languages": [
            "lt",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lt-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	628817

Features:

{
    "translation": {
        "languages": [
            "lt",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lv-nl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	618352

Features:

{
    "translation": {
        "languages": [
            "lv",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lv-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	638453

Features:

{
    "translation": {
        "languages": [
            "lv",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lv-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	615580

Features:

{
    "translation": {
        "languages": [
            "lv",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lv-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	390857

Features:

{
    "translation": {
        "languages": [
            "lv",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lv-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	629803

Features:

{
    "translation": {
        "languages": [
            "lv",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lv-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	607381

Features:

{
    "translation": {
        "languages": [
            "lv",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/lv-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	643600

Features:

{
    "translation": {
        "languages": [
            "lv",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-pl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/nl-pl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	612797

Features:

{
    "translation": {
        "languages": [
            "nl",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/nl-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1957189

Features:

{
    "translation": {
        "languages": [
            "nl",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}
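
With close to two million examples in `nl-pt`, it can be convenient to read only a slice of the `train` split while experimenting. The sketch below uses the TFDS split slicing syntax (`train[:1%]`); that slicing behaves the same way for `huggingface:` community datasets is an assumption here, and `slice_spec` and `load_slice` are hypothetical helpers.

```python
def slice_spec(split: str = "train", percent: int = 1) -> str:
    """Return a TFDS split string selecting the first `percent` percent."""
    if not 1 <= percent <= 100:
        raise ValueError("percent must be between 1 and 100")
    return f"{split}[:{percent}%]"


def load_slice(name: str, percent: int = 1):
    """Load only the first `percent` percent of the train split."""
    # Imported lazily so slice_spec works without TFDS installed.
    import tensorflow_datasets as tfds

    return tfds.load(name, split=slice_spec("train", percent))


print(slice_spec("train", 1))
```

Slicing most likely limits only what is read: the dataset would still be downloaded and prepared in full on first use.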

nl-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/nl-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	380736

Features:

{
    "translation": {
        "languages": [
            "nl",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/nl-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	622650

Features:

{
    "translation": {
        "languages": [
            "nl",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/nl-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	600023

Features:

{
    "translation": {
        "languages": [
            "nl",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/nl-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1870685

Features:

{
    "translation": {
        "languages": [
            "nl",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-pt

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/pl-pt')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	608181

Features:

{
    "translation": {
        "languages": [
            "pl",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/pl-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	389341

Features:

{
    "translation": {
        "languages": [
            "pl",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/pl-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	624330

Features:

{
    "translation": {
        "languages": [
            "pl",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/pl-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	600511

Features:

{
    "translation": {
        "languages": [
            "pl",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/pl-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	657951

Features:

{
    "translation": {
        "languages": [
            "pl",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pt-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/pt-ro')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	381404

Features:

{
    "translation": {
        "languages": [
            "pt",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pt-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/pt-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	611895

Features:

{
    "translation": {
        "languages": [
            "pt",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pt-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/pt-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	593455

Features:

{
    "translation": {
        "languages": [
            "pt",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pt-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/pt-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	1823402

Features:

{
    "translation": {
        "languages": [
            "pt",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ro-sk

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/ro-sk')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	387839

Features:

{
    "translation": {
        "languages": [
            "ro",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ro-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/ro-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	374859

Features:

{
    "translation": {
        "languages": [
            "ro",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ro-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/ro-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	390133

Features:

{
    "translation": {
        "languages": [
            "ro",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

sk-sl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/sk-sl')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	609698

Features:

{
    "translation": {
        "languages": [
            "sk",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

sk-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/sk-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	636353

Features:

{
    "translation": {
        "languages": [
            "sk",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

sl-sv

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:europarl_bilingual/sl-sv')

Description:

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

License: The data set comes with the same license as the original sources. Please, check the information about the source that is given on http://opus.nlpl.eu/Europarl-v8.php
Version: 8.0.0
Splits:

Split	Examples
`'train'`	608740

Features:

{
    "translation": {
        "languages": [
            "sl",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}
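
The `Translation` feature shown above suggests that each example holds one text per language code. The following is a minimal sketch of pulling out a sentence pair, assuming each example is a nested dict of the form `{"translation": {"sl": ..., "sv": ...}}` after conversion with `tfds.as_numpy` (values may arrive as bytes). `to_pair` is a hypothetical helper, and the sample record is a placeholder rather than real corpus data.

```python
def to_pair(example: dict, src: str, tgt: str) -> tuple[str, str]:
    """Extract (source, target) text from one translation example."""
    translation = example["translation"]

    def as_text(value) -> str:
        # String features may come back as UTF-8 bytes; decode if so.
        return value.decode("utf-8") if isinstance(value, bytes) else str(value)

    return as_text(translation[src]), as_text(translation[tgt])


# Placeholder record standing in for one element of the dataset.
sample = {"translation": {"sl": b"<Slovenian sentence>", "sv": "<Swedish sentence>"}}
pair = to_pair(sample, "sl", "sv")
print(pair)
```

With the dataset loaded as `ds = tfds.load('huggingface:europarl_bilingual/sl-sv')`, the same helper could be applied to each element of `tfds.as_numpy(ds['train'])`, provided the example layout matches the assumption above.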