piece_anything

وصف :

تحميل SA-1B

Segment Anything 1 Billion (SA-1B) عبارة عن مجموعة بيانات مصممة لتدريب نماذج تجزئة الكائنات للأغراض العامة من صور العالم المفتوح. تم تقديم مجموعة البيانات في الورقة "تقسيم أي شيء" .

تتكون مجموعة البيانات SA-1B من 11 مليون صورة متنوعة وعالية الدقة ومرخصة ومحمية للخصوصية و1.1B من التعليقات التوضيحية للقناع. يتم تقديم الأقنعة بتنسيق ترميز طول التشغيل (RLE) COCO، ولا تحتوي على فئات.

الترخيص مخصص. يرجى قراءة الشروط والأحكام الكاملة على https://ai.facebook.com/datasets/segment-anything-downloads

جميع الميزات موجودة في مجموعة البيانات الأصلية باستثناء image.content (محتوى الصورة).

يمكنك فك تشفير أقنعة التجزئة باستخدام:

import tensorflow_datasets as tfds

pycocotools = tfds.core.lazy_imports.pycocotools

ds = tfds.load('segment_anything', split='train')
for example in tfds.as_numpy(ds):
  segmentation = example['annotations']['segmentation']
  for counts, size in zip(segmentation['counts'], segmentation['size']):
    encoded_mask = {'size': size, 'counts': counts}
    mask = pycocotools.decode(encoded_mask)  # np.array(dtype=uint8) mask
    ...

الصفحة الرئيسية : https://ai.facebook.com/datasets/segment-anything-downloads
كود المصدر : tfds.datasets.segment_anything.Builder
الإصدارات :
- 1.0.0 (افتراضي): الإصدار الأولي.
حجم التحميل : 10.28 TiB
حجم مجموعة البيانات : 10.59 TiB
تعليمات التنزيل اليدوي : تتطلب مجموعة البيانات هذه تنزيل البيانات المصدر يدويًا إلى download_config.manual_dir (الإعداد الافتراضي هو ~/tensorflow_datasets/downloads/manual/ ):
قم بتنزيل ملف الروابط من https://ai.facebook.com/datasets/segment-anything-downloads manual_dir يجب أن يحتوي على ملف الروابط المحفوظ كـ section_anything_links.txt.
التخزين المؤقت التلقائي ( الوثائق ): لا
الإنشقاقات :

ينقسم	أمثلة
`'train'`	11,185,362

هيكل الميزة :

FeaturesDict({
    'annotations': Sequence({
        'area': Scalar(shape=(), dtype=uint64, description=The area in pixels of the mask.),
        'bbox': BBoxFeature(shape=(4,), dtype=float32, description=The box around the mask, in TFDS format.),
        'crop_box': BBoxFeature(shape=(4,), dtype=float32, description=The crop of the image used to generate the mask, in TFDS format.),
        'id': Scalar(shape=(), dtype=uint64, description=Identifier for the annotation.),
        'point_coords': Tensor(shape=(1, 2), dtype=float64, description=The point coordinates input to the model to generate the mask.),
        'predicted_iou': Scalar(shape=(), dtype=float64, description=The model's own prediction of the mask's quality.),
        'segmentation': FeaturesDict({
            'counts': string,
            'size': Tensor(shape=(2,), dtype=uint64),
        }),
        'stability_score': Scalar(shape=(), dtype=float64, description=A measure of the mask's quality.),
    }),
    'image': FeaturesDict({
        'content': Image(shape=(None, None, 3), dtype=uint8, description=Content of the image.),
        'file_name': string,
        'height': uint64,
        'image_id': uint64,
        'width': uint64,
    }),
})

وثائق الميزة :

ميزة	فصل	شكل	نوع D	وصف
	المميزاتDict
التعليقات التوضيحية	تسلسل
الشروح/المنطقة	العددية		uint64	المساحة بالبكسل للقناع.
التعليقات التوضيحية/bbox	ميزة BBox	(4،)	float32	الصندوق المحيط بالقناع، بصيغة TFDS.
التعليقات التوضيحية/crop_box	ميزة BBox	(4،)	float32	اقتصاص الصورة المستخدمة لإنشاء القناع بتنسيق TFDS.
التعليقات التوضيحية/المعرف	العددية		uint64	معرف للتعليق التوضيحي.
التعليقات التوضيحية/point_coords	الموتر	(1، 2)	float64	تقوم النقطة بتنسيق المدخلات إلى النموذج لإنشاء القناع.
التعليقات التوضيحية/predicted_iou	العددية		float64	توقع النموذج الخاص لجودة القناع.
التعليقات التوضيحية/التجزئة	المميزاتDict			قناع التجزئة المشفر بتنسيق COCO RLE (إملاء `size` المفاتيح `counts` ).
التعليقات التوضيحية/التجزئة/التهم	الموتر		خيط
الشروح/التجزئة/الحجم	الموتر	(2،)	uint64
التعليقات التوضيحية/stability_score	العددية		float64	مقياس لجودة القناع.
صورة	المميزاتDict
الصورة/المحتوى	صورة	(لا شيء، لا شيء، 3)	uint8	محتوى الصورة.
الصورة/اسم_الملف	الموتر		خيط
الصورة/الارتفاع	الموتر		uint64
الصورة/image_id	الموتر		uint64
الصورة/العرض	الموتر		uint64

المفاتيح الخاضعة للإشراف (راجع as_supervised doc ): None
الشكل ( tfds.show_examples ): غير مدعوم.
أمثلة ( tfds.as_dataframe ):

الاقتباس :

@misc{kirillov2023segment,
  title={Segment Anything},
  author={Alexander Kirillov and Eric Mintun and Nikhila Ravi and Hanzi Mao and Chloe Rolland and Laura Gustafson and Tete Xiao and Spencer Whitehead and Alexander C. Berg and Wan-Yen Lo and Piotr Dollár and Ross Girshick},
  year={2023},
  eprint={2304.02643},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}