A perceptually constrained GSVD-based approach for enhancing speech corrupted by colored noise

Ju, G.-H.; Lee, Lin-Shan

doi:10.1109/TASL.2006.876868

Journal

IEEE Transactions on Audio, Speech and Language Processing

Journal Volume

15

Journal Issue

1

Pages

119-134

Date Issued

2007

Author(s)

Ju, G.-H.

LIN-SHAN LEE

DOI

10.1109/TASL.2006.876868

URI

https://www.scopus.com/inward/record.uri?eid=2-s2.0-47949093953&doi=10.1109%2fTASL.2006.876868&partnerID=40&md5=d3ab253295e0fe71c0dfbc5c7c065f26

Abstract

The singular value decomposition (SVD)-based method for single-channel speech enhancement has been shown to be very useful when the additive noise is white. For colored noise, this approach requires whitening the noise spectrum before the SVD-based processing and applying the inverse whitening afterwards. A truncated quotient SVD (QSVD)-based approach has been proposed to handle this problem and has been found very useful. In this paper, a generalized SVD (GSVD)-based subspace approach for speech enhancement is first extended from the concept of the truncated QSVD-based approach, in which the dimension of the signal subspace can be precisely and automatically determined for each frame of the noisy signal. With this new approach, however, some residual noise is still perceivable under lower signal-to-noise ratio conditions. Therefore, a perceptually constrained GSVD (PCGSVD)-based approach is further proposed, which incorporates the masking properties of the human auditory system so that the undesired residual noise is nearly imperceptible. Closed-form solutions are obtained for both the GSVD- and PCGSVD-based enhancement approaches. Carefully performed objective evaluations and subjective listening tests show that the proposed PCGSVD-based approach can offer improved speech quality, intelligibility, and recognition accuracy, whether the noise is stationary or nonstationary, especially when the additive noise is nonwhite. © 2006 IEEE.
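
The prewhitening baseline that the abstract mentions for colored noise (whiten, apply a truncated SVD, then undo the whitening) can be illustrated with a minimal NumPy sketch. This is not the paper's GSVD or PCGSVD algorithm. The function name `prewhitened_svd_estimate`, the fixed `rank` argument, and the assumption that a noisy data matrix and a noise-only data matrix (for example, built from signal frames) are already available are all hypothetical choices made for illustration.

```python
import numpy as np


def prewhitened_svd_estimate(noisy, noise, rank):
    """Rank-limited estimate of the clean data matrix with prewhitening.

    noisy : (m, n) data matrix built from the noisy signal (assumed given).
    noise : (p, n) noise-only data matrix, p >= n, full column rank (assumed).
    rank  : number of singular components kept (hypothetical fixed choice).
    """
    # QR factorization noise = Q R; noise @ inv(R) then has orthonormal
    # columns, so multiplying by inv(R) acts as a whitening transform.
    _, r = np.linalg.qr(noise)

    # Whiten the noisy matrix: whitened = noisy @ inv(R),
    # computed by solving R^T X^T = noisy^T instead of inverting R.
    whitened = np.linalg.solve(r.T, noisy.T).T

    # Truncated SVD in the whitened domain.
    u, s, vt = np.linalg.svd(whitened, full_matrices=False)
    low_rank = (u[:, :rank] * s[:rank]) @ vt[:rank]

    # Inverse whitening returns the estimate to the original domain.
    return low_rank @ r
```

Keeping all components (`rank` equal to the number of columns) returns the noisy matrix unchanged, which is a quick sanity check that the whitening and inverse whitening steps cancel. The explicit whitening and inverse-whitening steps here are the ones the abstract says the QSVD- and GSVD-based formulations are meant to handle for colored noise.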

Subjects

Auditory masking thresholds; Colored noise; Generalized singular value decomposition (GSVD); Signal subspace; Speech enhancement

Other Subjects

Additive noise; Audition; Inverse problems; Signal to noise ratio; Speech enhancement; Speech intelligibility; Speech recognition; White noise; Auditory masking threshold; Closed form solutions; Colored noise; Generalized singular value decomposition; Human auditory system; Recognition accuracy; Signal sub-space; Subjective listening test; Singular value decomposition

Type

journal article
