A New Recursive Semi-Supervised Non-Negative Matrix Factorization for Separation of Harmonic and Percussive Elements in Digital Sounds
×
Cite This
Citation & Abstract
W. Fonseca, Z. Peixoto, F. Magalhaes, and R. Faria, "A New Recursive Semi-Supervised Non-Negative Matrix Factorization for Separation of Harmonic and Percussive Elements in Digital Sounds," J. Audio Eng. Soc., vol. 66, no. 10, pp. 779-790, (2018 October.). doi: https://doi.org/10.17743/jaes.2018.0039
W. Fonseca, Z. Peixoto, F. Magalhaes, and R. Faria, "A New Recursive Semi-Supervised Non-Negative Matrix Factorization for Separation of Harmonic and Percussive Elements in Digital Sounds," J. Audio Eng. Soc., vol. 66 Issue 10 pp. 779-790, (2018 October.). doi: https://doi.org/10.17743/jaes.2018.0039
Abstract: With the ever-increasing applications for digital signal processing, there is a strong motivation to discover new processing techniques. Methods based on matrix rank minimization have been increasingly used for signal analysis, particularly for signal separation. This research considers the analysis and application of the Non-Negative Matrix Factorization (NMF), associated with Kullback-Leibler and Itakura-Saito divergences, for the separation of digital sound sources consisting of harmonic and percussive elements. The NMF algorithm and divergence functions were implemented in a MATLAB environment and applied to musical mixes composed of electric guitar, bass, kick, ride, and snare. Then, comparative analyses of the divergence functions performance used SNR-based metrics. Considering the inconsistencies between the objective metrics and the human perception, two alternative objective metrics were proposed for the Signal-Interference Ratio (SIR), called Windowed SIR (W-SIR) and Average Windowed SIR (AW-SIR). Based on the W-SIR metric, the authors present the new Recursive Semi-Supervised NMF (RSS-NMF), for which the training information is extracted from the original signal. In both cases, the results demonstrated better performance of the RSS-NMF technique in relation to the non-supervised NMF technique.
@article{fonseca2018a,
author={fonseca, wellington and peixoto, zelia and magalhaes, flavia and faria, regis},
journal={journal of the audio engineering society},
title={a new recursive semi-supervised non-negative matrix factorization for separation of harmonic and percussive elements in digital sounds},
year={2018},
volume={66},
number={10},
pages={779-790},
doi={https://doi.org/10.17743/jaes.2018.0039},
month={october},}
@article{fonseca2018a,
author={fonseca, wellington and peixoto, zelia and magalhaes, flavia and faria, regis},
journal={journal of the audio engineering society},
title={a new recursive semi-supervised non-negative matrix factorization for separation of harmonic and percussive elements in digital sounds},
year={2018},
volume={66},
number={10},
pages={779-790},
doi={https://doi.org/10.17743/jaes.2018.0039},
month={october},
abstract={with the ever-increasing applications for digital signal processing, there is a strong motivation to discover new processing techniques. methods based on matrix rank minimization have been increasingly used for signal analysis, particularly for signal separation. this research considers the analysis and application of the non-negative matrix factorization (nmf), associated with kullback-leibler and itakura-saito divergences, for the separation of digital sound sources consisting of harmonic and percussive elements. the nmf algorithm and divergence functions were implemented in a matlab environment and applied to musical mixes composed of electric guitar, bass, kick, ride, and snare. then, comparative analyses of the divergence functions performance used snr-based metrics. considering the inconsistencies between the objective metrics and the human perception, two alternative objective metrics were proposed for the signal-interference ratio (sir), called windowed sir (w-sir) and average windowed sir (aw-sir). based on the w-sir metric, the authors present the new recursive semi-supervised nmf (rss-nmf), for which the training information is extracted from the original signal. in both cases, the results demonstrated better performance of the rss-nmf technique in relation to the non-supervised nmf technique.},}
TY - paper
TI - A New Recursive Semi-Supervised Non-Negative Matrix Factorization for Separation of Harmonic and Percussive Elements in Digital Sounds
SP - 779
EP - 790
AU - Fonseca, Wellington
AU - Peixoto, Zelia
AU - Magalhaes, Flavia
AU - Faria, Regis
PY - 2018
JO - Journal of the Audio Engineering Society
IS - 10
VO - 66
VL - 66
Y1 - October 2018
TY - paper
TI - A New Recursive Semi-Supervised Non-Negative Matrix Factorization for Separation of Harmonic and Percussive Elements in Digital Sounds
SP - 779
EP - 790
AU - Fonseca, Wellington
AU - Peixoto, Zelia
AU - Magalhaes, Flavia
AU - Faria, Regis
PY - 2018
JO - Journal of the Audio Engineering Society
IS - 10
VO - 66
VL - 66
Y1 - October 2018
AB - With the ever-increasing applications for digital signal processing, there is a strong motivation to discover new processing techniques. Methods based on matrix rank minimization have been increasingly used for signal analysis, particularly for signal separation. This research considers the analysis and application of the Non-Negative Matrix Factorization (NMF), associated with Kullback-Leibler and Itakura-Saito divergences, for the separation of digital sound sources consisting of harmonic and percussive elements. The NMF algorithm and divergence functions were implemented in a MATLAB environment and applied to musical mixes composed of electric guitar, bass, kick, ride, and snare. Then, comparative analyses of the divergence functions performance used SNR-based metrics. Considering the inconsistencies between the objective metrics and the human perception, two alternative objective metrics were proposed for the Signal-Interference Ratio (SIR), called Windowed SIR (W-SIR) and Average Windowed SIR (AW-SIR). Based on the W-SIR metric, the authors present the new Recursive Semi-Supervised NMF (RSS-NMF), for which the training information is extracted from the original signal. In both cases, the results demonstrated better performance of the RSS-NMF technique in relation to the non-supervised NMF technique.
With the ever-increasing applications for digital signal processing, there is a strong motivation to discover new processing techniques. Methods based on matrix rank minimization have been increasingly used for signal analysis, particularly for signal separation. This research considers the analysis and application of the Non-Negative Matrix Factorization (NMF), associated with Kullback-Leibler and Itakura-Saito divergences, for the separation of digital sound sources consisting of harmonic and percussive elements. The NMF algorithm and divergence functions were implemented in a MATLAB environment and applied to musical mixes composed of electric guitar, bass, kick, ride, and snare. Then, comparative analyses of the divergence functions performance used SNR-based metrics. Considering the inconsistencies between the objective metrics and the human perception, two alternative objective metrics were proposed for the Signal-Interference Ratio (SIR), called Windowed SIR (W-SIR) and Average Windowed SIR (AW-SIR). Based on the W-SIR metric, the authors present the new Recursive Semi-Supervised NMF (RSS-NMF), for which the training information is extracted from the original signal. In both cases, the results demonstrated better performance of the RSS-NMF technique in relation to the non-supervised NMF technique.
Authors:
Fonseca, Wellington; Peixoto, Zelia; Magalhaes, Flavia; Faria, Regis
Affiliations:
Pontifical Catholic University of Minas Gerais, Minas Gerais, Brazil; University of Sao Paulo, Sao Paulo, Brazil(See document for exact affiliation information.) JAES Volume 66 Issue 10 pp. 779-790; October 2018
Publication Date:
October 16, 2018Import into BibTeX
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=19861