
journalists_questions

plain_text

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds

ds = tfds.load('huggingface:journalists_questions/plain_text')
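As a minimal sketch of how the load command might be used in practice, the snippet below requests only the `train` split and prints a few examples. The helper names `load_train_split` and `preview` are hypothetical and not part of TFDS. The sketch assumes `tensorflow_datasets` is installed and that the `huggingface:` namespace is available in your TFDS version.

```python
# Dataset name exactly as given in the load command on this page.
DATASET_NAME = "huggingface:journalists_questions/plain_text"


def load_train_split():
    """Load the single 'train' split listed on this page (hypothetical helper)."""
    # Imported lazily so this module can be imported without TensorFlow installed.
    import tensorflow_datasets as tfds

    return tfds.load(DATASET_NAME, split="train")


def preview(n=3):
    """Print the first n examples as dictionaries of NumPy values (hypothetical helper)."""
    import tensorflow_datasets as tfds

    # tfds.as_numpy converts a tf.data.Dataset into an iterable of NumPy structures.
    for example in tfds.as_numpy(load_train_split().take(n)):
        print(example)
```

Calling `preview()` downloads and prepares the dataset on first use, so it needs network access.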

Description:

The journalists_questions corpus (version 1.0) is a collection of about 10K human-written Arabic
tweets posted by journalists, manually labeled for question identification.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'train'`	10077

Features:

{
    "tweet_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 2,
        "names": [
            "no",
            "yes"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    },
    "label_confidence": {
        "dtype": "float32",
        "id": null,
        "_type": "Value"
    }
}
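The schema above stores `label` as a class index and `label_confidence` as a float. A minimal sketch of decoding the index back to its name and filtering by confidence follows. The helper names, the sample record, and the 0.8 threshold are hypothetical. Reading "yes" as "the tweet is a question" is an assumption based on the description, not something the schema states.

```python
import json

# The "label" entry copied from the feature schema above.
LABEL_FEATURE = json.loads("""
{
    "num_classes": 2,
    "names": ["no", "yes"],
    "_type": "ClassLabel"
}
""")

LABEL_NAMES = LABEL_FEATURE["names"]


def label_name(index):
    """Map a ClassLabel index to its name (0 -> "no", 1 -> "yes")."""
    return LABEL_NAMES[index]


def is_confident_question(example, threshold=0.8):
    """Return True for examples labeled "yes" at or above the threshold.

    The threshold of 0.8 is an illustrative assumption, not a documented value.
    """
    return (
        label_name(example["label"]) == "yes"
        and example["label_confidence"] >= threshold
    )


# Made-up record with the same three fields as the schema.
sample = {"tweet_id": "0", "label": 1, "label_confidence": 0.9}
print(label_name(sample["label"]), is_confident_question(sample))
```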


Last updated 2022-06-28 UTC.