TFDS obsługuje teraz format Croissant 🥐 ! Przeczytaj dokumentację , aby dowiedzieć się więcej.

Ta strona została przetłumaczona przez Cloud Translation API.

opus100

Referencje:

af-en

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/af-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	275512
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "af",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

Amen

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/am-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	89027
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "am",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

an-en

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/an-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'train'`	6961

Cechy :

{
    "translation": {
        "languages": [
            "an",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ar-en

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/ar-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "ar",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

jak-en

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/as-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	138479
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "as",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

az-en

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/az-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	262089
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "az",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

został

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/be-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	67312
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "be",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-pl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/bg-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "bg",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-en

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/bn-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "bn",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

br-pl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/br-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	153447
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "br",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bs-en

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/bs-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "bs",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ca-en

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/ca-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "ca",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-en

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/cs-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "cs",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cy-pl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/cy-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	289521
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "cy",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-en

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/da-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "da",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-en

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/de-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "de",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

dz-en

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/dz-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'train'`	624

Cechy :

{
    "translation": {
        "languages": [
            "dz",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-pl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/el-en')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "el",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-eo

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-eo')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	337106
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "eo"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-es

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-es')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-et

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-et')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-eu

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-eu')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "eu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fa

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-fa')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "fa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fi

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-fi')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fr

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-fr')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fy

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-fy')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	54342
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "fy"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ga

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ga')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	289524
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ga"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-gd

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-gd')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	1606
`'train'`	16316
`'validation'`	1605

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "gd"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-gl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-gl')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	515344
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "gl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-gu

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-gu')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	318306
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "gu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ha

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ha')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	97983
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ha"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-on

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-he')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "he"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-cześć

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-hi')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	534319
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "hi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-godz

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-hr')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "hr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-hu

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-hu')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-hy

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-hy')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'train'`	7059

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "hy"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-id

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-id')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "id"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-ig

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ig')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	1843
`'train'`	18415
`'validation'`	1843

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ig"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-jest

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-is')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "is"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-to

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-it')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ja

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ja')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ja"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ka

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ka')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	377306
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ka"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-kk

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-kk')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	79927
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "kk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-km

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-km')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	111483
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "km"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ko

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ko')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ko"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-kn

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-kn')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	918
`'train'`	14537
`'validation'`	917

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "kn"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ku

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ku')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	144844
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ku"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ky

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ky')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	27215
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ky"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-li

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-li')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	25535
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "li"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-lt

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-lt')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-lv

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-lv')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-mg

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-mg')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	590771
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "mg"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-mk

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-mk')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "mk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-ml

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ml')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	822746
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ml"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-mn

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-mn')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'train'`	4294

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "mn"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pan

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-mr')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	27007
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "mr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ms

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ms')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ms"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-mt

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-mt')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "mt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-moje

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-my')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	24594
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "my"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-nb

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-nb')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	142906
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "nb"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ne

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ne')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	406381
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ne"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-nl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-nl')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-nn

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-nn')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	486055
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "nn"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-nie

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-no')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "no"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-oc

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-oc')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	35791
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "oc"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-or

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-or')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	1318
`'train'`	14273
`'validation'`	1317

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pa

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-pa')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	107296
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-pl')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ps

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ps')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	79127
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ps"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pt

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-pt')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ro

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ro')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ru

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ru')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ru"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-rw

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-rw')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	173823
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "rw"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-se

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-se')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	35907
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "se"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sz

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-sh')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	267211
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "sh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-si

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-si')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	979109
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "si"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sk

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-sk')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-sl')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-kw

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-sq')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "sq"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sr

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-sr')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "sr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sv

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-sv')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ta

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ta')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	227014
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-te

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-te')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	64352
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-tg

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-tg')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	193882
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "tg"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-t

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-th')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "th"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-tk

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-tk')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	1852
`'train'`	13110
`'validation'`	1852

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "tk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-tr

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-tr')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "tr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-tt

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-tt')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	100843
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "tt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-ug

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ug')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	72170
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ug"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-uk

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-uk')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "uk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ur

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-ur')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	753913
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-uz

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-uz')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	173157
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "uz"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-vi

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-vi')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "vi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-wa

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-wa')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	104496
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "wa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-xh

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-xh')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	439671
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "xh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-yi

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-yi')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	15010
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "yi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-yo

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-yo')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'train'`	10375

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "yo"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-z

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-zh')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "zh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-zu

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/en-zu')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000
`'train'`	38616
`'validation'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "en",
            "zu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ar-de

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/ar-de')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "ar",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ar-fr

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/ar-fr')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "ar",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ar-nl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/ar-nl')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "ar",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ar-ru

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/ar-ru')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "ar",
            "ru"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ar-zh

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/ar-zh')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "ar",
            "zh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-fr

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/de-fr')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "de",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-nl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/de-nl')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "de",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-ru

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/de-ru')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "de",
            "ru"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-z

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/de-zh')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "de",
            "zh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-nl

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/fr-nl')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "fr",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-ru

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/fr-ru')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "fr",
            "ru"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-z

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/fr-zh')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "fr",
            "zh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-ru

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/nl-ru')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "nl",
            "ru"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-z

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/nl-zh')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "nl",
            "zh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ru-zh

Użyj następującego polecenia, aby załadować ten zestaw danych do TFDS:

ds = tfds.load('huggingface:opus100/ru-zh')

Opis :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

Licencja : Brak znanej licencji
Wersja : 0.0.0
Podziały :

Podział	Przykłady
`'test'`	2000

Cechy :

{
    "translation": {
        "languages": [
            "ru",
            "zh"
        ],
        "id": null,
        "_type": "Translation"
    }
}