Paper
16 August 2023 Multi-granularity time domain single-channel music source separation
Zhiying Liu, Yan Guo, Hui Zhang
Author Affiliations +
Proceedings Volume 12787, Sixth International Conference on Advanced Electronic Materials, Computers, and Software Engineering (AEMCSE 2023); 127870S (2023) https://doi.org/10.1117/12.3004581
Event: 6th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE 2023), 2023, Shenyang, China
Abstract
Music source separation isolates or separates the desired accompaniment or singing voice from the song recording. This study solves this problem in the time domain using only one channel. Music contains rich information over multiple temporal scales, such as rhythm, tonality, melody, harmony, chord, etc. These different temporal scales information should be represented by features in different granularity. Inspired by this underlying nature of music, a multi-granularity solution is adopted. Furthermore, music is usually self-similar and can be treated as a fractal structure, meaning multiscale pattern repetition exists in music. We have captured these patterns in a batch of different granularity features using the multi-granularity solution. A self-attention mechanism is adopted to link these regular, predictable, and repeated features. Finally, we combine the multi-granularity and self-attention and adopt the Sandglasset model, which can utilize the information in different temporal scales and the relationship of the repeated pattern among different temporal scales. Experimental results on public datasets show that the proposed method outperforms existing systems in the single-channel music source separation task.
(2023) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Zhiying Liu, Yan Guo, and Hui Zhang "Multi-granularity time domain single-channel music source separation", Proc. SPIE 12787, Sixth International Conference on Advanced Electronic Materials, Computers, and Software Engineering (AEMCSE 2023), 127870S (16 August 2023); https://doi.org/10.1117/12.3004581
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Matrices

Synthetic aperture radar

Education and training

Fractal analysis

Neural networks

Artificial neural networks

Process modeling

Back to Top