オックスフォード_iiit_pet

説明：

Oxford-IIIT ペットデータセットは、クラスごとに約 200 枚の画像を含む 37 カテゴリのペット画像データセットです。画像にはスケール、ポーズ、照明が大きく異なります。すべての画像には、品種と種に関するグラウンドトゥルースの注釈が関連付けられています。さらに、トレーニング分割用に頭部境界ボックスが提供され、このデータセットを単純な物体検出タスクに使用できるようになります。テスト分割では、境界ボックスは空です。

追加ドキュメント:コード付きの論文について調べる
ホームページ: http://www.robots.ox.ac.uk/~vgg/data/pets/
ソースコード: tfds.datasets.oxford_iiit_pet.Builder
バージョン:
- 4.0.0 (デフォルト) : 頭の境界ボックスを追加します。破損した画像を修正します。データセットの URL を更新します。
ダウンロードサイズ: 773.52 MiB
データセットのサイズ: 773.68 MiB
自動キャッシュ(ドキュメント): いいえ
分割:

スプリット	例
`'test'`	3,669
`'train'`	3,680

機能の構造:

FeaturesDict({
    'file_name': Text(shape=(), dtype=string),
    'head_bbox': BBoxFeature(shape=(4,), dtype=float32),
    'image': Image(shape=(None, None, 3), dtype=uint8),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=37),
    'segmentation_mask': Image(shape=(None, None, 1), dtype=uint8),
    'species': ClassLabel(shape=(), dtype=int64, num_classes=2),
})

機能ドキュメント:

特徴	クラス	形	Dタイプ
	特徴辞書
ファイル名	文章		弦
head_bbox	BBox機能	(4,)	float32
画像	画像	(なし、なし、3)	uint8
ラベル	クラスラベル		int64
セグメンテーションマスク	画像	(なし、なし、1)	uint8
種	クラスラベル		int64

監視キー( as_supervised doc を参照): ('image', 'label')
図( tfds.show_examples ): サポートされていません。
例( tfds.as_dataframe ):

引用：

@InProceedings{parkhi12a,
  author       = "Parkhi, O. M. and Vedaldi, A. and Zisserman, A. and Jawahar, C.~V.",
  title        = "Cats and Dogs",
  booktitle    = "IEEE Conference on Computer Vision and Pattern Recognition",
  year         = "2012",
}

オックスフォード_iiit_pet コレクションでコンテンツを整理 必要に応じて、コンテンツの保存と分類を行います。

オックスフォード_iiit_pet