On-Line NMF-Based Stereo Up-Mixing of Speech Improves Perceived Reduction of Non-Stationary Noise

Kirst, Christian; Weninger, Felix; Joder, Cyril; Grosche, Peter; Geiger, Jürgen; Schuller, Björn

AES E-Library

On-Line NMF-Based Stereo Up-Mixing of Speech Improves Perceived Reduction of Non-Stationary Noise

Speech de-noising algorithms often suffer from introduction of artifacts, either by removal of parts of the speech signal, or imperfect noise reduction causing the remaining noise to sound unnatural and disturbing. This contribution proposes to spatially distribute monaural noisy speech signals based on single-channel source separation, in order to improve the perceived speech quality. Stereo up-mixing is utilized on the estimated speech and noise sources instead of simply suppressing the noise. This paper investigates the case of non-negative matrix factorization (NMF) speech enhancement applied to high levels of non-stationary noise. NMF-based and spectral subtraction speech enhancement algorithms are evaluated in a listening test in terms of speech intelligibility, presence of interfering noises and overall quality with respect to the unprocessed signal. In the result, the listening test provides evidence for superior noise reduction by NMF, yet also a drop in perceived speech quality that is not covered by the employed set of common objective metrics. However, stereo up-mixing of NMF-separated speech and noise delivers high subjective noise reduction while preserving the perceived speech quality.

Authors: Kirst, Christian; Weninger, Felix; Joder, Cyril; Grosche, Peter; Geiger, Jürgen; Schuller, Björn
Affiliations: HUAWEI Technologies Duesseldorf GmbH, European Research Center, Germany; Technische Universität München, Munich, Germany(See document for exact affiliation information.)
AES Conference: 53rd International Conference: Semantic Audio (January 2014)
Paper Number: 4-2
Publication Date: January 27, 2014 Import into BibTeX
Subject: Intelligent Audio Effects
Permalink: https://www.aes.org/e-lib/browse.cfm?elib=17090

Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member and would like to subscribe to the E-Library then Join the AES!

This paper costs $33 for non-members and is free for AES members and E-Library subscribers.

Learn more about the AES E-Library

E-Library Location: (CD 53rdPapers) /conf/53/aes53-000023.pdf

Start a discussion about this paper!

AES E-Library

On-Line NMF-Based Stereo Up-Mixing of Speech Improves Perceived Reduction of Non-Stationary Noise

ABOUT AES

Contact Us