Modeling concept dynamics for large scale music search

Jialie Shen, Hwee Hwa Pang, Meng Wang, Shuicheng Yan

Research output: Chapter in Book/Report/Conference proceedingConference Paper published in Proceedingspeer-review

Abstract

Continuing advances in data storage and communication technologies have led to an explosive growth in digital music collections. To cope with their increasing scale, we need effective Music Information Retrieval (MIR) capabilities like tagging, concept search and clustering. Integral to MIR is a framework for modelling music documents and generating discriminative signatures for them. In this paper, we introduce a multimodal, layered learning framework called DMCM. Distinguished from the existing approaches that encode music as an ensemble of order-less feature vectors, our framework extracts from each music document a variety of acoustic features, and translates them into low-level encodings over the temporal dimension. From them, DMCM elucidates the concept dynamics in the music document, representing them with a novel music signature scheme called Stochastic Music Concept Histogram (SMCH) that captures the probability distribution over all the concepts. Experiment results with two large music collections confirm the advantages of the proposed framework over existing methods on various MIR tasks.

Original languageEnglish
Title of host publicationSIGIR'12 - Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval
Place of PublicationPortland, Oregon, USA
PublisherAssociation for Computing Machinery (ACM)
Pages455-464
Number of pages10
ISBN (Print)9781450316583
DOIs
Publication statusPublished - 28 Sep 2012
Externally publishedYes
Event35th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2012 - Portland, OR, United States
Duration: 12 Aug 201216 Aug 2012

Conference

Conference35th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2012
Country/TerritoryUnited States
CityPortland, OR
Period12/08/1216/08/12

Fingerprint

Dive into the research topics of 'Modeling concept dynamics for large scale music search'. Together they form a unique fingerprint.

Cite this