TFDS CLI

TFDS CLI is a command-line tool that provides various commands for working with TensorFlow Datasets.

Disable TF logs on import

%%capture
%env TF_CPP_MIN_LOG_LEVEL=1  # Disable logs on TF import

Installation

The CLI tool is installed with tensorflow-datasets (or tfds-nightly).

pip install -q tfds-nightly
tfds --version
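
To check from Python which of the two packages is installed, and whether the tfds executable is on the PATH, a minimal standard-library sketch could look like the following. The helper name find_tfds_install is hypothetical and is not part of TFDS.

```python
import shutil
from importlib import metadata

# Distribution names taken from the installation note above.
TFDS_PACKAGES = ("tensorflow-datasets", "tfds-nightly")


def find_tfds_install(packages=TFDS_PACKAGES):
    """Return (package_name, version) for the first installed package, or None."""
    for name in packages:
        try:
            return name, metadata.version(name)
        except metadata.PackageNotFoundError:
            continue
    return None


if __name__ == "__main__":
    print("package:", find_tfds_install())
    # shutil.which returns None when no `tfds` executable is on the PATH.
    print("tfds executable:", shutil.which("tfds"))
```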

For the list of all CLI commands:

tfds --help

usage: tfds [-h] [--helpfull] [--version] {build,new} ...

Tensorflow Datasets CLI tool

optional arguments:
  -h, --help   show this help message and exit
  --helpfull   show full help message and exit
  --version    show program's version number and exit

command:
  {build,new}
    build      Commands for downloading and preparing datasets.
    new        Creates a new dataset directory from the template.

`tfds new`: Implementing a new dataset

This command helps you get started writing a new Python dataset by creating a <dataset_name>/ directory containing default implementation files.

Usage:

tfds new my_dataset

2022-02-07 04:04:10.397902: E tensorflow/stream_executor/cuda/cuda_driver.cc:271] failed call to cuInit: CUDA_ERROR_NO_DEVICE: no CUDA-capable device is detected
Dataset generated at /tmpfs/src/temp/docs/my_dataset
You can start searching `TODO(my_dataset)` to complete the implementation.
Please check https://www.tensorflow.org/datasets/add_dataset for additional details.

Output:

ls -1 my_dataset/

__init__.py
checksums.tsv
dummy_data/
my_dataset.py
my_dataset_test.py

See our writing datasets guide for more info.
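
As a quick sanity check after running tfds new, the listing above can be verified programmatically. This is a sketch that assumes the exact layout shown in this document; other TFDS versions may generate different file names. The functions expected_entries and missing_entries are hypothetical helpers, not TFDS APIs.

```python
from pathlib import Path


def expected_entries(dataset_name):
    """Entries shown in the `ls -1 my_dataset/` listing, for a given name."""
    return {
        "__init__.py",
        "checksums.tsv",
        "dummy_data",
        f"{dataset_name}.py",
        f"{dataset_name}_test.py",
    }


def missing_entries(dataset_dir):
    """Return the sorted names of expected entries absent from dataset_dir."""
    dataset_dir = Path(dataset_dir).resolve()
    present = {p.name for p in dataset_dir.iterdir()}
    # The directory name doubles as the dataset name (e.g. `my_dataset`).
    return sorted(expected_entries(dataset_dir.name) - present)
```

Calling missing_entries("my_dataset") returns an empty list when every entry from the listing is present.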

Available options:

tfds new --help

usage: tfds new [-h] [--helpfull] [--dir DIR] dataset_name

positional arguments:
  dataset_name  Name of the dataset to be created (in snake_case)

optional arguments:
  -h, --help    show this help message and exit
  --helpfull    show full help message and exit
  --dir DIR     Path where the dataset directory will be created. Defaults to
                current directory.
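
When scripting dataset creation, the options above can be assembled into an argument list before invoking the CLI. The sketch below uses only the dataset_name argument and the --dir flag from the help text. The snake_case pattern is an assumption about what the CLI accepts, and tfds_new_command and run_tfds_new are hypothetical helper names.

```python
import re
import subprocess

# Assumed definition of snake_case: lowercase words joined by single underscores.
_SNAKE_CASE = re.compile(r"[a-z][a-z0-9]*(_[a-z0-9]+)*")


def tfds_new_command(dataset_name, target_dir=None):
    """Build the argv list for `tfds new`, optionally with `--dir`."""
    if not _SNAKE_CASE.fullmatch(dataset_name):
        raise ValueError(f"dataset name should be snake_case: {dataset_name!r}")
    cmd = ["tfds", "new", dataset_name]
    if target_dir is not None:
        cmd += ["--dir", str(target_dir)]
    return cmd


def run_tfds_new(dataset_name, target_dir=None):
    """Run `tfds new`; raises CalledProcessError on a non-zero exit code."""
    return subprocess.run(tfds_new_command(dataset_name, target_dir), check=True)
```

Passing the command as a list avoids shell quoting issues when the target directory contains spaces.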

`tfds build`: Download and prepare a dataset

Use tfds build <my_dataset> to generate a new dataset. <my_dataset> can be:

A path to a dataset/ folder or a dataset.py file (empty for the current directory):
- tfds build datasets/my_dataset/
- cd datasets/my_dataset/ && tfds build
- cd datasets/my_dataset/ && tfds build my_dataset
- cd datasets/my_dataset/ && tfds build my_dataset.py
A registered dataset:
- tfds build mnist
- tfds build my_dataset --imports my_project.datasets
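
For the registered-dataset case, a rough Python counterpart is to import the modules that register the dataset and then call download_and_prepare() on the builder. This is a sketch, not the CLI's actual implementation: it assumes --imports takes a comma-separated list of modules whose import registers the datasets, and import_dataset_modules and build_registered_dataset are hypothetical helper names.

```python
import importlib


def import_dataset_modules(imports):
    """Import each module named in a comma-separated string; return the modules."""
    names = [n.strip() for n in (imports or "").split(",") if n.strip()]
    return [importlib.import_module(n) for n in names]


def build_registered_dataset(name, imports=None):
    """Download and prepare a registered dataset, e.g. `mnist`."""
    # Imported lazily so this module loads even where TFDS is not installed.
    import tensorflow_datasets as tfds

    # Assumption: importing these modules registers the custom datasets.
    import_dataset_modules(imports)
    builder = tfds.builder(name)
    builder.download_and_prepare()
    return builder
```

Under these assumptions, build_registered_dataset("mnist") would correspond to tfds build mnist, and build_registered_dataset("my_dataset", imports="my_project.datasets") to the last command above.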