Multi-modal Analysis of Music: A large-scale Evaluation

R. Mayer, R. Neumayer:
"Multi-modal Analysis of Music: A large-scale Evaluation";
Talk: WEMIS 2009 - Workshop on Exploring Musical Information Spaces, Corfu; 01.10.2009 - 02.10.2009; in: "WEMIS 2009 - Workshop on Exploring Musical Information Spaces", N. Orio, A. Rauber, D. Rizo (eds.); University of Alicante, Alicante, Spain (2009), ISBN: 978-84-692-6082-1; pp. 30 - 35.

Multimedia data by definition comprises several different content modalities. Music in particular has audio at its core, but also includes text in the form of lyrics, images in the form of album covers, and video in the form of music videos. Yet many Music Information Retrieval applications use only the audio content. Recent studies have shown the usefulness of incorporating other modalities; most of them employed textual information in the form of song lyrics or artist biographies. Following this direction, the contribution of this paper is a large-scale evaluation of the combination of audio and text (lyrics) features for genre classification, on a database comprising over 20,000 songs. We present the audio and lyrics features employed and provide an in-depth discussion of the experimental results.