I-SEARCH Multimodal Dataset

From Chorus

Revision as of 10:15, 13 February 2013 by Gl (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

**I-SEARCH Multimodal Dataset**
Domain	Multimodal Search
Media	Text, Image, 3D, Audio, Video
Size	45MB
Instances	10305
File Format	XML
Creation Date	2012-06-08
Task	Retrieval
Copyright	Creative Commons Attribution 3.0 Unported License.
URL	http://vcl.iti.gr/is/UC/

Description

The EU-funded project I-SEARCH aims to provide a novel unified framework for multimodal content indexing, search and retrieval. The I-SEARCH framework will be able to handle specific types of multimedia and multimodal content (text, 2D image, sketch, video, 3D objects and audio) along with real-world and user-related information, which can be used as queries and retrieve any available relevant content of any of the aforementioned types.

The searchable items within I-SEARCH will span from very simple media items (e.g., a single image or an audio file) to highly complex multimedia collections (e.g., a 3D object together with multiple 2D images and audio files) along with accompanying information. All the above multimedia collections are called Content Objects (CO). For a formal representation of COs, a novel description framework is introduced by I-SEARCH: the Rich Unified Content Description (RUCoD).

The latest version of RUCoD schema (v1.4.1) is now available [1]

A multimodal dataset has been created in I-SEARCH to demonstrate the Generic Multimodal Search Use Case. The dataset consists of 10305 COs classified into 51 categories. The COs consist of images, 3D objects, sounds and videos accompanied by textual information, tags and location information (if available).

Source

content is data mined from various web sources, images : flickr [2], 3d : Google 3D warehouse [3], audio : freesound [4], videos : YouTube [5],

Ground Truth Annotation

The RUCoDs are stored in separate folders (one folder for each category) thus classification information can be directly extracted. This can be used as ground truth for search and retrieval tasks.

Features

Low-level descriptors have been extracted for the 3D objects and images of the dataset. The links to descriptors are available at the RUCoD XML files (<L_Descriptor type="ImageType"> for the image descriptors and <L_Descriptor type="Object3D"> for the 3D object descriptors).

Licensing / Copyright

Creative Commons Attribution 3.0 Unported License.

Citation

P. Daras, A. Axenopoulos, V. Darlagiannis, D. Tzovaras, X. Le Bourdon, L. Joyeux, A. Verroust-Blondet, V. Croce, T. Steiner, A. Massari, A. Camurri, S. Morin, A-D. Mezaour, L. Sutton, S. Spiller, "Introducing a Unified Framework for Content Object Description", International Journal of Multimedia Intelligence and Security, Special Issue on “Challenges in Scalable Context Aware Multimedia Computing”, Volume 2, Number 3–4/2011, DOI 10.1504/IJMIS.2011.044765, Pages: 351-375, January 2012.

External Links

I-SEARCH Multimodal Dataset [6] I-SEARCH Project [7]

I-SEARCH Multimodal Dataset

Description

Source

Ground Truth Annotation

Features

Licensing / Copyright

Citation

External Links

Views

Personal tools

Navigation

CHORUS+

Search

Toolbox