Paper
26 November 2003 Audio thumbnailing using MPEG-7 low-level audio descriptors
Author Affiliations +
Abstract
In this paper we present an audio thumbnailing technique based on audio segmentation by similarity search. The segmentation is performed on MPEG-7 low level audio feature descriptors as a growing source of multimedia meta data. Especially for database applications or audio-on-demand services this technique could be very helpful, because there is no need to have access to the probably copyright protected original audio material. The result of the similarity search is a matrix which contains off-diagonal stripes representing similar regions, which are usually the refrains of a song and thus a very suitable segment to be used as audio thumbnail. Using the a priori knowledge that we search off-diagonal stripes which must represent several seconds of audio data and that the adjustment of the stripes must be characteristically, we implemented a filter to enhance the structure of the similarity matrix and to extract a relevant segment as an audio thumbnail.
© (2003) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jens Wellhausen and Michael Hoeynck "Audio thumbnailing using MPEG-7 low-level audio descriptors", Proc. SPIE 5242, Internet Multimedia Management Systems IV, (26 November 2003); https://doi.org/10.1117/12.511486
Lens.org Logo
CITATIONS
Cited by 10 scholarly publications and 1 patent.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Multimedia

Databases

Image segmentation

Compact discs

Computer programming

Feature extraction

Fourier transforms

RELATED CONTENT

Temporal audio segmentation using MPEG-7 descriptors
Proceedings of SPIE (January 10 2003)
Power-spectrum-based shape matching for MPEG-7
Proceedings of SPIE (August 30 2002)
Music classification with MPEG-7
Proceedings of SPIE (January 10 2003)
FFT-based technique for image-signature generation
Proceedings of SPIE (January 15 1997)
Music identification with MPEG-7
Proceedings of SPIE (December 18 2003)

Back to Top