Evaluation of Feature Extractors and Psycho-acoustic Transformations for Music Genre Classification

T. Lidy, A. Rauber:
"Evaluation of Feature Extractors and Psycho-acoustic Transformations for Music Genre Classification";
Vortrag: International Conference on Music Information Retrieval (ISMIR), London, UK; 11.09.2005 - 15.09.2005; in:"Proceedings of the Sixth International Conference on Music Information Retrieval", (2005), ISBN: 0-9551179-0-9; S. 34 - 41.

[ Publication Database ]

Abstract:


We present a study on the importance of psycho-acoustic transformations for effective audio feature calculation. From the results, both crucial and problematic parts of the algorithm for Rhythm Patterns feature extraction are identified. We furthermore introduce two new feature representations in this context: Statistical Spectrum Descriptors and Rhythm Histogram features. Evaluation on both the individual and combined feature sets is accomplished through a music genre classification task, involving 3 reference audio collections. Results are compared to published measures on the same data sets. Experiments confirmed that in all settings the inclusion of psycho-acoustic transformations provides significant improvement of classification accuracy.