TFDS תומך כעת בפורמט קרואסון 🥐 ! קרא את התיעוד כדי לדעת יותר.

דף זה תורגם על ידי Cloud Translation API.

אופוס 100

הפניות:

af-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/af-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	275512
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "af",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

אָמֵן

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/am-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	89027
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "am",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

an-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/an-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	6961

תכונות :

{
    "translation": {
        "languages": [
            "an",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

אר-אן

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/ar-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "ar",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

as-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/as-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	138479
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "as",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

az-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/az-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	262089
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "az",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

be-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/be-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	67312
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "be",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/bg-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "bg",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/bn-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "bn",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

br-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/br-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	153447
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "br",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bs-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/bs-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "bs",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ca-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/ca-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "ca",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/cs-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "cs",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cy-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/cy-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	289521
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "cy",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/da-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "da",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/de-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "de",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

dz-en

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/dz-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	624

תכונות :

{
    "translation": {
        "languages": [
            "dz",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

אל-אן

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/el-en')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "el",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-eo

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-eo')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	337106
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "eo"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-es

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-es')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-et

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-et')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-eu

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-eu')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "eu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fa

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-fa')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "fa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fi

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-fi')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fr

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-fr')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fy

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-fy')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	54342
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "fy"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ga

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ga')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	289524
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ga"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-gd

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-gd')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	1606
`'train'`	16316
`'validation'`	1605

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "gd"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-gl

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-gl')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	515344
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "gl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-gu

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-gu')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	318306
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "gu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ha

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ha')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	97983
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ha"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-הוא

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-he')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "he"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-hi

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-hi')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	534319
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "hi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

שעה

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-hr')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "hr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-hu

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-hu')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-hy

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-hy')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	7059

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "hy"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-id

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-id')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "id"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ig

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ig')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	1843
`'train'`	18415
`'validation'`	1843

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ig"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-is

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-is')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "is"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-it

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-it')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ja

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ja')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ja"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ka

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ka')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	377306
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ka"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-kk

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-kk')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	79927
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "kk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-km

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-km')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	111483
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "km"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ko

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ko')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ko"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-kn

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-kn')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	918
`'train'`	14537
`'validation'`	917

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "kn"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ku

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ku')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	144844
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ku"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ky

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ky')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	27215
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ky"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-li

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-li')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	25535
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "li"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-lt

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-lt')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-lv

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-lv')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-mg

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-mg')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	590771
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "mg"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-mk

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-mk')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "mk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ml

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ml')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	822746
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ml"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-mn

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-mn')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	4294

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "mn"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-mr

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-mr')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	27007
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "mr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ms

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ms')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ms"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-mt

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-mt')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "mt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-my

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-my')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	24594
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "my"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-nb

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-nb')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	142906
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "nb"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ne

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ne')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	406381
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ne"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-nl

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-nl')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-nn

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-nn')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	486055
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "nn"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-no

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-no')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "no"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-oc

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-oc')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	35791
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "oc"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-or

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-or')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	1318
`'train'`	14273
`'validation'`	1317

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pa

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-pa')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	107296
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pl

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-pl')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ps

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ps')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	79127
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ps"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pt

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-pt')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ro

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ro')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ru

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ru')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ru"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-rw

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-rw')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	173823
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "rw"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-se

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-se')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	35907
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "se"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sh

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-sh')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	267211
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "sh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-si

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-si')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	979109
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "si"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sk

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-sk')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sl

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-sl')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sq

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-sq')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "sq"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sr

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-sr')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "sr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sv

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-sv')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ta

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ta')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	227014
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-te

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-te')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	64352
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-tg

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-tg')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	193882
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "tg"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-th

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-th')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "th"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-tk

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-tk')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	1852
`'train'`	13110
`'validation'`	1852

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "tk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-tr

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-tr')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "tr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-tt

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-tt')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	100843
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "tt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ug

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ug')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	72170
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ug"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-uk

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-uk')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "uk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ur

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-ur')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	753913
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-uz

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-uz')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	173157
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "uz"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-vi

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-vi')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "vi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-wa

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-wa')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	104496
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "wa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-xh

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-xh')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	439671
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "xh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-yi

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-yi')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	15010
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "yi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-yo

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-yo')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'train'`	10375

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "yo"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-zh

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-zh')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	1000000
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "zh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-zu

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/en-zu')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000
`'train'`	38616
`'validation'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "en",
            "zu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

אר-דה

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/ar-de')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "ar",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ar-fr

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/ar-fr')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "ar",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ar-nl

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/ar-nl')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "ar",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ar-ru

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/ar-ru')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "ar",
            "ru"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ar-zh

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/ar-zh')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "ar",
            "zh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-fr

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/de-fr')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "de",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-nl

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/de-nl')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "de",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

דה-רו

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/de-ru')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "de",
            "ru"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-zh

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/de-zh')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "de",
            "zh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-nl

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/fr-nl')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "fr",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-ru

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/fr-ru')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "fr",
            "ru"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-zh

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/fr-zh')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "fr",
            "zh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-ru

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/nl-ru')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "nl",
            "ru"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-zh

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/nl-zh')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "nl",
            "zh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ru-zh

השתמש בפקודה הבאה כדי לטעון מערך נתונים זה ב-TFDS:

ds = tfds.load('huggingface:opus100/ru-zh')

תיאור :

OPUS-100 is English-centric, meaning that all training pairs include English on either the source or target side.
The corpus covers 100 languages (including English).OPUS-100 contains approximately 55M sentence pairs.
Of the 99 language pairs, 44 have 1M sentence pairs of training data, 73 have at least 100k, and 95 have at least 10k.

רישיון : אין רישיון ידוע
גרסה : 0.0.0
פיצולים :

לְפַצֵל	דוגמאות
`'test'`	2000

תכונות :

{
    "translation": {
        "languages": [
            "ru",
            "zh"
        ],
        "id": null,
        "_type": "Translation"
    }
}