SAIAPR TC-12
Domain | Picture |
Media | Image, Text |
Size | |
Instances | 20,000 |
File Format | JPEG |
Creation Date | |
Task | Classification, Annotation |
Copyright | cite paper - see below |
URL | http://www-i6.informatik.rwth-aachen.de/imageclef/resources/saiaprtc12/ |
Description
- This site describes the segmented and annotated IAPR-TC12 benchmark (SAIAPR TC-12): an extension of the IAPR TC-12 collection for the evaluation of automatic image annotation methods and for studying their impact on multimedia information retrieval. This includes the pictures from the IAPR TC-12 collection plus:
- Segmentation masks and segmented images for the 20,000 pictures;
- Features extracted from the regions and labels assigned to them;
- Region-level annotations according to an annotation hierarchy;
- Spatial relationship information.
- Each image has been manually segmented and the resultant regions have been annotated according to a predefined vocabulary of labels; the vocabulary is organized according to a hierarchy of concepts. Visual features have been extracted from each region.
Quality
Source
- SAIAPR TC-12 is an extension of the IAPR TC-12 collection for the evaluation of automatic image annotation methods and for studying their impact on multimedia information retrieval.
Ground Truth Annotation
- The SAIAPR TC-12 benchmark consists of the following resources:
- Segmentation masks. One per region: 99,535 files; one per image: 20,000 files. Each object of reasonable size was segmented using ISATOOL. On average, 5 objects per image were segmented, and each object covers about 16% of the area of its image. The resulting segmented images are provided as well.
- Annotations. One per region: 99,535 regions were manually annotated. Each segmented region is assigned a label from a carefully defined vocabulary, see [1]; the vocabulary is organized as a conceptual hierarchy. To annotate an object, the annotator went through the hierarchy from top to bottom, looking for the best label.
- Spatial relationships. One per image: 20,000 files. The following relationships have been calculated for each pair of regions in every image: adjacent, disjoint, beside, X-aligned, above, below and Y-aligned.
- Visual features. One feature vector per region: 99,535 vectors of attributes. The following features have been extracted from each region: area, boundary/area, width and height of the region, average and standard deviation in x and y, convexity, and the average, standard deviation and skewness in both the RGB and CIE-Lab color spaces.
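- As a rough illustration of the geometric part of the visual features listed above, the sketch below computes area, width, height, and the average and standard deviation in x and y for one region. It is not the code used to build the benchmark: the function name `region_shape_features`, the mask representation (a list of rows of 0/1 values), the use of the population standard deviation, and the toy mask are all assumptions made for this example, and the exact feature definitions in SAIAPR TC-12 may differ.

```python
from math import sqrt


def region_shape_features(mask):
    """Compute simple shape features for one region.

    mask: list of equally long rows, each a list of 0/1 values,
    where 1 marks a pixel that belongs to the region (assumed format).
    """
    # Collect the (x, y) coordinates of every pixel in the region.
    pixels = [
        (x, y)
        for y, row in enumerate(mask)
        for x, value in enumerate(row)
        if value
    ]
    if not pixels:
        raise ValueError("mask contains no region pixels")

    xs = [x for x, _ in pixels]
    ys = [y for _, y in pixels]
    n = len(pixels)
    image_pixels = len(mask) * len(mask[0])
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n

    return {
        "area_px": n,                        # region size in pixels
        "area_fraction": n / image_pixels,   # share of the whole image
        "width": max(xs) - min(xs) + 1,      # bounding-box width
        "height": max(ys) - min(ys) + 1,     # bounding-box height
        "mean_x": mean_x,
        "mean_y": mean_y,
        # Population standard deviation (an assumption of this sketch).
        "std_x": sqrt(sum((x - mean_x) ** 2 for x in xs) / n),
        "std_y": sqrt(sum((y - mean_y) ** 2 for y in ys) / n),
    }


# Hypothetical 4x4 image with a 2x2 region in the centre.
toy_mask = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
features = region_shape_features(toy_mask)
print(features)
```

- The `area_fraction` value corresponds to the kind of figure quoted for the segmentation masks (region area as a share of the image); boundary/area, convexity and the color statistics are left out because they need a contour and the original pixel values.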
Features
Copyright Remarks
- The SAIAPR TC-12 Benchmark is available free of charge and without any copyright restrictions.
Citation
- In publications based on the SAIAPR TC-12 Benchmark and/or the use of its data or a subset thereof, please cite the following publication:
[1] The Segmented and Annotated IAPR TC-12 Benchmark. Escalante, H. J., Hernández, C., Gonzalez, J., López, A., Montes, M., Morales, E., Sucar, E., Villaseñor, L., Grubinger, M. Computer Vision and Image Understanding, doi:10.1016/j.cviu.2009.03.008, 2009.
External Links