TFDS hiện hỗ trợ định dạng Croissant 🥐 ! Đọc tài liệu để biết thêm.

Trang này được dịch bởi Cloud Translation API.

thịt lợn

Tài liệu tham khảo:

hoặc-bạn

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/or-ur')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	43766

Đặc trưng :

{
    "translation": {
        "languages": [
            "or",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ml-hoặc

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/ml-or')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	19413

Đặc trưng :

{
    "translation": {
        "languages": [
            "ml",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-ta

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/bn-ta')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	33005

Đặc trưng :

{
    "translation": {
        "languages": [
            "bn",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

gu-anh

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/gu-mr')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	30766

Đặc trưng :

{
    "translation": {
        "languages": [
            "gu",
            "mr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

chào hoặc

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/hi-or')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	61070

Đặc trưng :

{
    "translation": {
        "languages": [
            "hi",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-hoặc

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/en-or')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	98230

Đặc trưng :

{
    "translation": {
        "languages": [
            "en",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

anh-anh

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/mr-ur')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	49691

Đặc trưng :

{
    "translation": {
        "languages": [
            "mr",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ta

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/en-ta')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	118759

Đặc trưng :

{
    "translation": {
        "languages": [
            "en",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

chào-ta

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/hi-ta')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	64945

Đặc trưng :

{
    "translation": {
        "languages": [
            "hi",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-en

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/bn-en')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	93560

Đặc trưng :

{
    "translation": {
        "languages": [
            "bn",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-hoặc

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/bn-or')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	26456

Đặc trưng :

{
    "translation": {
        "languages": [
            "bn",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ml-ta

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/ml-ta')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	23609

Đặc trưng :

{
    "translation": {
        "languages": [
            "ml",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

gu-ur

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/gu-ur')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	29938

Đặc trưng :

{
    "translation": {
        "languages": [
            "gu",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-ml

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/bn-ml')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	18149

Đặc trưng :

{
    "translation": {
        "languages": [
            "bn",
            "ml"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ml-pa

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/ml-pa')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	21978

Đặc trưng :

{
    "translation": {
        "languages": [
            "ml",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pa

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/en-pa')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	103296

Đặc trưng :

{
    "translation": {
        "languages": [
            "en",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

chào bạn

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/bn-hi')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	49598

Đặc trưng :

{
    "translation": {
        "languages": [
            "bn",
            "hi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

chào-pa

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/hi-pa')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	75200

Đặc trưng :

{
    "translation": {
        "languages": [
            "hi",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

gu-te

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/gu-te')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	16335

Đặc trưng :

{
    "translation": {
        "languages": [
            "gu",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pa-ta

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/pa-ta')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	46349

Đặc trưng :

{
    "translation": {
        "languages": [
            "pa",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

chào-ml

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/hi-ml')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	27167

Đặc trưng :

{
    "translation": {
        "languages": [
            "hi",
            "ml"
        ],
        "id": null,
        "_type": "Translation"
    }
}

or-te

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/or-te')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	10475

Đặc trưng :

{
    "translation": {
        "languages": [
            "or",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ml

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/en-ml')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	44986

Đặc trưng :

{
    "translation": {
        "languages": [
            "en",
            "ml"
        ],
        "id": null,
        "_type": "Translation"
    }
}

chào

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/en-hi')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	269594

Đặc trưng :

{
    "translation": {
        "languages": [
            "en",
            "hi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-pa

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/bn-pa')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	35109

Đặc trưng :

{
    "translation": {
        "languages": [
            "bn",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ông-te

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/mr-te')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	18179

Đặc trưng :

{
    "translation": {
        "languages": [
            "mr",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ông-pa

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/mr-pa')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	50418

Đặc trưng :

{
    "translation": {
        "languages": [
            "mr",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-te

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/bn-te')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	17605

Đặc trưng :

{
    "translation": {
        "languages": [
            "bn",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

gu-chào

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/gu-hi')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	41587

Đặc trưng :

{
    "translation": {
        "languages": [
            "gu",
            "hi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ta-ur

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/ta-ur')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	48892

Đặc trưng :

{
    "translation": {
        "languages": [
            "ta",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bạn

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/te-ur')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	21148

Đặc trưng :

{
    "translation": {
        "languages": [
            "te",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

or-pa

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/or-pa')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	43159

Đặc trưng :

{
    "translation": {
        "languages": [
            "or",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

gu-ml

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/gu-ml')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	18252

Đặc trưng :

{
    "translation": {
        "languages": [
            "gu",
            "ml"
        ],
        "id": null,
        "_type": "Translation"
    }
}

gu-pa

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/gu-pa')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	35566

Đặc trưng :

{
    "translation": {
        "languages": [
            "gu",
            "pa"
        ],
        "id": null,
        "_type": "Translation"
    }
}

chào

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/hi-te')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	28569

Đặc trưng :

{
    "translation": {
        "languages": [
            "hi",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-te

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/en-te')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	44888

Đặc trưng :

{
    "translation": {
        "languages": [
            "en",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ml-te

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/ml-te')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	10480

Đặc trưng :

{
    "translation": {
        "languages": [
            "ml",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pa-ur

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/pa-ur')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	51831

Đặc trưng :

{
    "translation": {
        "languages": [
            "pa",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

chào-bạn

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/hi-ur')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	109951

Đặc trưng :

{
    "translation": {
        "languages": [
            "hi",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ông-hoặc

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/mr-or')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	47001

Đặc trưng :

{
    "translation": {
        "languages": [
            "mr",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ur

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/en-ur')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	202578

Đặc trưng :

{
    "translation": {
        "languages": [
            "en",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ml-ur

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/ml-ur')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	20913

Đặc trưng :

{
    "translation": {
        "languages": [
            "ml",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-mr

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/bn-mr')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	34043

Đặc trưng :

{
    "translation": {
        "languages": [
            "bn",
            "mr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

gu-ta

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/gu-ta')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	29187

Đặc trưng :

{
    "translation": {
        "languages": [
            "gu",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pa-te

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/pa-te')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	25684

Đặc trưng :

{
    "translation": {
        "languages": [
            "pa",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-gu

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/bn-gu')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	25166

Đặc trưng :

{
    "translation": {
        "languages": [
            "bn",
            "gu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bn-ur

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/bn-ur')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	39290

Đặc trưng :

{
    "translation": {
        "languages": [
            "bn",
            "ur"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ml-mr

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/ml-mr')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	22796

Đặc trưng :

{
    "translation": {
        "languages": [
            "ml",
            "mr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

or-ta

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/or-ta')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	44035

Đặc trưng :

{
    "translation": {
        "languages": [
            "or",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ta-te

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/ta-te')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	17359

Đặc trưng :

{
    "translation": {
        "languages": [
            "ta",
            "te"
        ],
        "id": null,
        "_type": "Translation"
    }
}

gu-hoặc

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/gu-or')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	27162

Đặc trưng :

{
    "translation": {
        "languages": [
            "gu",
            "or"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-gu

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/en-gu')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	59739

Đặc trưng :

{
    "translation": {
        "languages": [
            "en",
            "gu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

xin chào ông

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/hi-mr')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	69186

Đặc trưng :

{
    "translation": {
        "languages": [
            "hi",
            "mr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ông-ta

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/mr-ta')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	48535

Đặc trưng :

{
    "translation": {
        "languages": [
            "mr",
            "ta"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-mr

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:pib/en-mr')

Sự miêu tả :

Sentence aligned parallel corpus between 11 Indian Languages, crawled and extracted from the press information bureau
website.

Giấy phép : Không có giấy phép được biết đến
Phiên bản : 1.3.0
Chia tách :

Tách ra	Ví dụ
`'train'`	117199

Đặc trưng :

{
    "translation": {
        "languages": [
            "en",
            "mr"
        ],
        "id": null,
        "_type": "Translation"
    }
}