Music separation guided by cover tracks: designing the joint NMF model

1 PANAMA - Parcimonie et Nouveaux Algorithmes pour le Signal et la Modélisation Audio, Inria Rennes – Bretagne Atlantique, IRISA-D5 - SIGNAUX ET IMAGES NUMÉRIQUES, ROBOTIQUE
2 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication, Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery

Abstract: In audio source separation, reference-guided approaches are a class of methods that use reference signals to guide the separation. In prior work, we proposed a general framework to model the deformation between the sources and the references. In this paper, we investigate a specific scenario within this framework: music separation guided by the multitrack recording of a cover interpretation of the song to be processed. We report a series of experiments highlighting the relevance of joint Non-negative Matrix Factorization (NMF), dictionary transformation, and specific transformation models for different types of sources. A signal-to-distortion ratio improvement (SDRI) of almost 11 decibels (dB) is achieved, a 2 dB improvement over a previous study on the same data set. These observations help validate the relevance of the general theoretical framework and can be useful in practice for designing models for other reference-guided source separation problems.
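As a reading aid for the SDRI figure quoted above, the following is a minimal sketch of how an SDR improvement can be computed. It assumes a simplified energy-ratio SDR (true source energy over error energy), which is not necessarily the evaluation protocol used in the paper. The function names `sdr_db` and `sdr_improvement_db` and the synthetic signals are illustrative assumptions.

```python
import numpy as np

def sdr_db(reference, estimate, eps=1e-12):
    """Simplified SDR in dB: energy of the true source over energy of the error.

    Assumption: plain energy ratio, not a full BSS Eval style decomposition.
    """
    reference = np.asarray(reference, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    error = reference - estimate
    return 10.0 * np.log10((np.sum(reference ** 2) + eps) / (np.sum(error ** 2) + eps))

def sdr_improvement_db(reference, estimate, mixture):
    """SDRI: SDR of the separated estimate minus SDR of the unprocessed mixture."""
    return sdr_db(reference, estimate) - sdr_db(reference, mixture)

# Synthetic toy signals, for illustration only (not data from the paper).
rng = np.random.default_rng(0)
source = rng.standard_normal(16000)
interference = rng.standard_normal(16000)
mixture = source + interference          # input to a separation system
estimate = source + 0.1 * interference   # hypothetical separated output

sdri = sdr_improvement_db(source, estimate, mixture)
print(f"SDRI: {sdri:.2f} dB")
```

In this toy case the residual interference is scaled by 0.1, so its energy drops by a factor of 100 and the SDRI is about 20 dB. Reporting the improvement rather than the raw SDR removes the dependence on how hard the original mixture was.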

Keywords: Cover song, Music separation, Joint-NMF

Authors: Nathan Souviraà-Labastie, Emmanuel Vincent, Frédéric Bimbot



