A perceptually constrained GSVD-based approach for enhancing speech corrupted by colored noise
Journal
IEEE Transactions on Audio, Speech and Language Processing
Journal Volume
15
Journal Issue
1
Pages
119-134
Date Issued
2007
Author(s)
Ju, G.-H.
Abstract
The singular value decomposition (SVD)-based method for single-channel speech enhancement has been shown to be very useful when the additive noise is white. For colored noise, with this approach, one needs to whiten the noise spectrum prior to SVD-based approach and perform the inverse whitening processing afterwards. A truncated quotient SVD (QSVD)-based approach has been proposed to handle this problem and found very useful. In this paper, a generalized SVD (GSVD)-based subspace approach for speech enhancement is first extended from the concept of the truncated QSVD-based approach, in which the dimension of the signal subspace can be precisely and automatically determined for each frame of the noisy signal. But with this new approach some residual noise is still perceivable under lower signal-to-noise ratio conditions. Therefore a perceptually constrained GSVD (PCGSVD)-based approach is further proposed to incorporate the masking properties of human auditory system to make sure the undesired residual noise to be nearly un-perceivable. Closed-form solutions are obtained for both the GSVD- and PCGSVD-based enhancement approaches. Very carefully performed objective evaluations and subjective listening tests show that the PCGSVD-based approach proposed here can offer improved speech quality, intelligibility and recognition accuracy, whether the noise is stationary or nonstationary, especially when the additive noise is nonwhite. © 2006 IEEE.
Subjects
Auditory masking thresholds; Colored noise; Generalized singular value decomposition (GSVD); Signal subspace; Speech enhancement
Other Subjects
Additive noise; Audition; Inverse problems; Signal to noise ratio; Speech enhancement; Speech intelligibility; Speech recognition; White noise; Auditory masking threshold; Closed form solutions; Colored noise; Generalized singular value decomposition; Human auditory system; Recognition accuracy; Signal sub-space; Subjective listening test; Singular value decomposition
Type
journal article