SAIAPR TC-12

From Chorus
Revision as of 09:55, 12 January 2011 by Cimpaniulia (Talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search
SAIAPR TC-12
Domain Picture
Media Image, Text
Size
Instances 20,000
File Format JPEG
Creation Date
Task Classification, Annotation
Copyright cite paper - see below
URL http://www-i6.informatik.rwth-aachen.de/imageclef/resources/saiaprtc12/


Description

  • This site describes the segmented and annotated IAPR-TC12 benchmark (SAIAPR TC-12): an extension of the IAPR TC-12 collection for the evaluation of automatic image annotation methods and for studying their impact on multimedia information retrieval. This includes the pictures from the IAPR TC-12 collection plus:
    • Segmentation masks and segmented images for the 20,000 pictures;
    • Features extracted from the regions and labels assigned to them;
    • Region-level annotations according an annotation hierarchy;
    • Spatial relationships information.
  • Each image has been manually segmented and the resultant regions have been annotated according to a predefined vocabulary of labels; the vocabulary is organized according to a hierarchy of concepts. Visual features have been extracted from each region.


Quality

Source

  • Is an extension of the IAPR TC-12 collection for the evaluation of automatic image annotation methods and for studying their impact on multimedia information retrieval


Ground Truth Annotation

  • The following resources constitute the SAIAPR TC-12 resource:
    • Segmentation masks.. One per region: 99,535 files; one per image: 20,000 files. Each object of reasonable size is segmented by using ISATOOL. In average 5 objects per image have been segmented. The average area of such objects is of ~16% of the total of their respective image. The resultant segmented images are provided as well.
    • Annotations. One per region: 99,535 regions were manually annotated. Each segmented region is assigned a label from a carefully defined vocabulary, see [1]; the annotation vocabulary has been organized according to a conceptual hierarchy. For annotation the annotator went through the hierarchy from top to bottom looking for the best label for each object.
    • Spatial relationships. One per image: 20,000 files. The following relationships have been calculated for each pair of regions in every image: adjacent, disjoint, beside, X-aligned, above, below and Y-aligned.
    • Visual features. A vector of features per region: 99,535 vectors of attributes. The following features have been extracted from each region: area, boundary/area, width and height of the region, average and standard deviation in x and y, convexity, average, standard deviation and skewness in both color spaces RGB and CIE-Lab.



Features


Copyright Remarks

  • SAIAPR TC-12 Benchmark is available free of charge and without any copyright restrictions


Citation

  • In publications based on the SAIAPR TC-12 Benchmark and/or the use of its data or a subset thereof, please cite the following publication:

The Segmented and Annotated IAPR TC-12 Benchmark.. Escalante, H. J., Hernández, C., Gonzalez, J.., López, A., Montes, M., Morales, E., Sucar, E., , L., Grubinger, M.. Computer Vision and Image Understanding, doi:10.1016/j.cviu.2009.03.008, 2009.


External Links

Personal tools
CHORUS+