TFDS now supports the Croissant 🥐 format! Read the documentation to know more.

tashkeela

References:

Code
Huggingface

plain_text

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:tashkeela/plain_text')

Description:

Arabic vocalized texts.
it contains 75 million of fully vocalized words mainly97 books from classical and modern Arabic language.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'train'`	97

Features:

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "book": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2022-06-28 UTC.