scene_parse150

  • Description:

Scene parsing segments an image into regions, each associated with a semantic category such as sky, road, person, or bed. The MIT Scene Parsing Benchmark (SceneParse150) provides a standard training and evaluation platform for scene parsing algorithms.

  • Splits:
Split    Examples
'test'   2,000
'train'  20,210
  • Feature structure:
FeaturesDict({
    'annotation': Image(shape=(None, None, 3), dtype=uint8),
    'image': Image(shape=(None, None, 3), dtype=uint8),
})
  • Feature documentation:
Feature     Class         Shape            Dtype  Description
            FeaturesDict
annotation  Image         (None, None, 3)  uint8
image       Image         (None, None, 3)  uint8
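The splits and features above can be read with TensorFlow Datasets. The following is a minimal sketch, not an official recipe: it assumes the `tensorflow_datasets` package is installed and that the data can be downloaded on first use. The helper names `split_spec` and `iter_examples` are hypothetical and introduced only for illustration; `tfds.load` and `tfds.as_numpy` are the library calls.

```python
# Split sizes as listed in the split table above.
SPLIT_SIZES = {"test": 2_000, "train": 20_210}


def split_spec(split, percent=100):
    """Build a TFDS split string such as 'train[:10%]' (hypothetical helper).

    `percent` is expected to be an integer between 1 and 100.
    """
    if split not in SPLIT_SIZES:
        raise ValueError(f"unknown split: {split!r}")
    if not isinstance(percent, int) or not 0 < percent <= 100:
        raise ValueError("percent must be an integer in [1, 100]")
    return split if percent == 100 else f"{split}[:{percent}%]"


def iter_examples(split="train", percent=100):
    """Yield (image, annotation) pairs as NumPy arrays (hypothetical helper)."""
    # Imported lazily because it is a heavy, optional dependency.
    import tensorflow_datasets as tfds

    ds = tfds.load("scene_parse150", split=split_spec(split, percent))
    for example in tfds.as_numpy(ds):
        # Per the feature structure, both entries are uint8 arrays
        # of shape (height, width, 3).
        yield example["image"], example["annotation"]
```

For example, `next(iter_examples("train", percent=1))` would return the first image and its annotation from the first 1% of the training split, once the dataset has been downloaded.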
  • Citation:
@inproceedings{zhou2017scene,
title={Scene Parsing through ADE20K Dataset},
author={Zhou, Bolei and Zhao, Hang and Puig, Xavier and Fidler, Sanja and Barriuso, Adela and Torralba, Antonio},
booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
year={2017}
}