paws-x

References:

en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:paws-x/en')

Description:

PAWS-X, a multilingual version of PAWS (Paraphrase Adversaries from Word Scrambling) for six languages.

This dataset contains 23,659 human translated PAWS evaluation pairs and 296,406 machine
translated training pairs in six typologically distinct languages: French, Spanish, German,
Chinese, Japanese, and Korean. English language is available by default. All translated
pairs are sourced from examples in PAWS-Wiki.

For further details, see the accompanying paper: PAWS-X: A Cross-lingual Adversarial Dataset
for Paraphrase Identification (https://arxiv.org/abs/1908.11828)

Note: There might be some missing or wrong labels in the dataset and we have replaced them with -1.
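
The note above suggests that consumers may want to drop examples carrying the -1 sentinel before training or evaluation. A minimal sketch, assuming examples are available as plain dictionaries with a `label` key as in the feature schema; the helper name `drop_unlabeled` and the sample records are hypothetical and not taken from the dataset:

```python
from typing import Any, Dict, Iterable, List

MISSING_LABEL = -1  # sentinel that, per the note, replaces missing or wrong labels


def drop_unlabeled(examples: Iterable[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Return the examples whose label is not the -1 sentinel."""
    return [ex for ex in examples if ex["label"] != MISSING_LABEL]


# Hypothetical records for illustration only.
sample = [
    {"id": 1, "sentence1": "A met B.", "sentence2": "B met A.", "label": 1},
    {"id": 2, "sentence1": "A saw B.", "sentence2": "B left A.", "label": -1},
    {"id": 3, "sentence1": "A paid B.", "sentence2": "B paid A.", "label": 0},
]
clean = drop_unlabeled(sample)
```

Whether to discard such examples or handle them differently depends on the task; this sketch only shows the filtering step.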

License: The dataset may be freely used for any purpose, although acknowledgement of Google LLC ("Google") as the data source would be appreciated. The dataset is provided "AS IS" without any warranty, express or implied. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
Version: 1.1.0
Splits:

Split	Examples
`'test'`	2000
`'train'`	49401
`'validation'`	2000

Features:

{
    "id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "sentence1": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentence2": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 2,
        "names": [
            "0",
            "1"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}
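
The feature schema above can be read as a per-example contract: an int32 `id`, two string sentences, and a two-class `label`. A minimal sketch of a check against that contract, assuming examples arrive as Python dictionaries with integer class indices and that -1 is accepted because of the note on missing labels; `matches_schema` and its helper are hypothetical names, not part of TFDS:

```python
from typing import Any, Dict

INT32_MIN, INT32_MAX = -(2**31), 2**31 - 1
EXPECTED_KEYS = {"id", "sentence1", "sentence2", "label"}
VALID_LABELS = {0, 1, -1}  # two classes plus the -1 sentinel from the note


def is_int(value: Any) -> bool:
    """True for integers, excluding booleans (bool is a subclass of int)."""
    return isinstance(value, int) and not isinstance(value, bool)


def matches_schema(example: Dict[str, Any]) -> bool:
    """Check one example against the feature schema listed above."""
    if set(example) != EXPECTED_KEYS:
        return False
    id_ok = is_int(example["id"]) and INT32_MIN <= example["id"] <= INT32_MAX
    text_ok = all(isinstance(example[k], str) for k in ("sentence1", "sentence2"))
    label_ok = is_int(example["label"]) and example["label"] in VALID_LABELS
    return id_ok and text_ok and label_ok
```

For instance, `matches_schema({"id": 7, "sentence1": "a", "sentence2": "b", "label": 1})` evaluates to `True`, while a record with `"label": 2` is rejected.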

de

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:paws-x/de')

Description:

PAWS-X, a multilingual version of PAWS (Paraphrase Adversaries from Word Scrambling) for six languages.

This dataset contains 23,659 human translated PAWS evaluation pairs and 296,406 machine
translated training pairs in six typologically distinct languages: French, Spanish, German,
Chinese, Japanese, and Korean. English language is available by default. All translated
pairs are sourced from examples in PAWS-Wiki.

For further details, see the accompanying paper: PAWS-X: A Cross-lingual Adversarial Dataset
for Paraphrase Identification (https://arxiv.org/abs/1908.11828)

Note: There might be some missing or wrong labels in the dataset and we have replaced them with -1.

License: The dataset may be freely used for any purpose, although acknowledgement of Google LLC ("Google") as the data source would be appreciated. The dataset is provided "AS IS" without any warranty, express or implied. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
Version: 1.1.0
Splits:

Split	Examples
`'test'`	2000
`'train'`	49401
`'validation'`	2000

Features:

{
    "id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "sentence1": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentence2": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 2,
        "names": [
            "0",
            "1"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

es

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:paws-x/es')

Description:

PAWS-X, a multilingual version of PAWS (Paraphrase Adversaries from Word Scrambling) for six languages.

This dataset contains 23,659 human translated PAWS evaluation pairs and 296,406 machine
translated training pairs in six typologically distinct languages: French, Spanish, German,
Chinese, Japanese, and Korean. English language is available by default. All translated
pairs are sourced from examples in PAWS-Wiki.

For further details, see the accompanying paper: PAWS-X: A Cross-lingual Adversarial Dataset
for Paraphrase Identification (https://arxiv.org/abs/1908.11828)

Note: There might be some missing or wrong labels in the dataset and we have replaced them with -1.

License: The dataset may be freely used for any purpose, although acknowledgement of Google LLC ("Google") as the data source would be appreciated. The dataset is provided "AS IS" without any warranty, express or implied. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
Version: 1.1.0
Splits:

Split	Examples
`'test'`	2000
`'train'`	49401
`'validation'`	2000

Features:

{
    "id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "sentence1": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentence2": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 2,
        "names": [
            "0",
            "1"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

fr

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:paws-x/fr')

Description:

PAWS-X, a multilingual version of PAWS (Paraphrase Adversaries from Word Scrambling) for six languages.

This dataset contains 23,659 human translated PAWS evaluation pairs and 296,406 machine
translated training pairs in six typologically distinct languages: French, Spanish, German,
Chinese, Japanese, and Korean. English language is available by default. All translated
pairs are sourced from examples in PAWS-Wiki.

For further details, see the accompanying paper: PAWS-X: A Cross-lingual Adversarial Dataset
for Paraphrase Identification (https://arxiv.org/abs/1908.11828)

Note: There might be some missing or wrong labels in the dataset and we have replaced them with -1.

License: The dataset may be freely used for any purpose, although acknowledgement of Google LLC ("Google") as the data source would be appreciated. The dataset is provided "AS IS" without any warranty, express or implied. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
Version: 1.1.0
Splits:

Split	Examples
`'test'`	2000
`'train'`	49401
`'validation'`	2000

Features:

{
    "id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "sentence1": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentence2": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 2,
        "names": [
            "0",
            "1"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

ja

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:paws-x/ja')

Description:

PAWS-X, a multilingual version of PAWS (Paraphrase Adversaries from Word Scrambling) for six languages.

This dataset contains 23,659 human translated PAWS evaluation pairs and 296,406 machine
translated training pairs in six typologically distinct languages: French, Spanish, German,
Chinese, Japanese, and Korean. English language is available by default. All translated
pairs are sourced from examples in PAWS-Wiki.

For further details, see the accompanying paper: PAWS-X: A Cross-lingual Adversarial Dataset
for Paraphrase Identification (https://arxiv.org/abs/1908.11828)

Note: There might be some missing or wrong labels in the dataset and we have replaced them with -1.

License: The dataset may be freely used for any purpose, although acknowledgement of Google LLC ("Google") as the data source would be appreciated. The dataset is provided "AS IS" without any warranty, express or implied. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
Version: 1.1.0
Splits:

Split	Examples
`'test'`	2000
`'train'`	49401
`'validation'`	2000

Features:

{
    "id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "sentence1": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentence2": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 2,
        "names": [
            "0",
            "1"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

ko

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:paws-x/ko')

Description:

PAWS-X, a multilingual version of PAWS (Paraphrase Adversaries from Word Scrambling) for six languages.

This dataset contains 23,659 human translated PAWS evaluation pairs and 296,406 machine
translated training pairs in six typologically distinct languages: French, Spanish, German,
Chinese, Japanese, and Korean. English language is available by default. All translated
pairs are sourced from examples in PAWS-Wiki.

For further details, see the accompanying paper: PAWS-X: A Cross-lingual Adversarial Dataset
for Paraphrase Identification (https://arxiv.org/abs/1908.11828)

Note: There might be some missing or wrong labels in the dataset and we have replaced them with -1.

License: The dataset may be freely used for any purpose, although acknowledgement of Google LLC ("Google") as the data source would be appreciated. The dataset is provided "AS IS" without any warranty, express or implied. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
Version: 1.1.0
Splits:

Split	Examples
`'test'`	2000
`'train'`	49401
`'validation'`	2000

Features:

{
    "id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "sentence1": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentence2": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 2,
        "names": [
            "0",
            "1"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}

zh

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:paws-x/zh')

Description:

PAWS-X, a multilingual version of PAWS (Paraphrase Adversaries from Word Scrambling) for six languages.

This dataset contains 23,659 human translated PAWS evaluation pairs and 296,406 machine
translated training pairs in six typologically distinct languages: French, Spanish, German,
Chinese, Japanese, and Korean. English language is available by default. All translated
pairs are sourced from examples in PAWS-Wiki.

For further details, see the accompanying paper: PAWS-X: A Cross-lingual Adversarial Dataset
for Paraphrase Identification (https://arxiv.org/abs/1908.11828)

Note: There might be some missing or wrong labels in the dataset and we have replaced them with -1.

License: The dataset may be freely used for any purpose, although acknowledgement of Google LLC ("Google") as the data source would be appreciated. The dataset is provided "AS IS" without any warranty, express or implied. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
Version: 1.1.0
Splits:

Split	Examples
`'test'`	2000
`'train'`	49401
`'validation'`	2000

Features:

{
    "id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "sentence1": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentence2": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 2,
        "names": [
            "0",
            "1"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}
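
The seven load commands on this page differ only in the language code at the end of the dataset name. A minimal sketch that builds those names from one list, so a caller can loop over configurations; the helper `tfds_name` is a hypothetical convenience and not part of TFDS:

```python
# Language codes of the configurations listed on this page.
LANGUAGES = ("en", "de", "es", "fr", "ja", "ko", "zh")


def tfds_name(lang: str) -> str:
    """Build the dataset name used in the load commands above."""
    if lang not in LANGUAGES:
        raise ValueError(f"unknown paws-x configuration: {lang!r}")
    return f"huggingface:paws-x/{lang}"


names = [tfds_name(lang) for lang in LANGUAGES]
```

Each resulting string can be passed to `tfds.load` exactly as in the commands shown for the individual configurations, assuming `tensorflow_datasets` is imported as `tfds`.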