Improved robust features for speech recognition by integrating time-frequency principal components (TFPC) and histogram equalization (HEQ)
Journal
2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003
Pages
297-302
Date Issued
2003-12
Date
2003-12
Author(s)
Tsai, Shang-Nien
DOI
N/A
Abstract
Robustness for speech recognition technologies with respect to adverse environments has been a key issue for real applications. Time-frequency principal components (TFPC) features were shown to be a set of powerful data-driven features under matched circumstances, while histogram equalization (HEQ) was proposed as an efficient feature transformation approach to reduce the mismatch between training and testing conditions. In this paper, it is proposed that TFPC features can be well integrated with HEQ. HEQ generates a well-matched environment, in which TFPC features can be properly utilized. Extensive experiments with respect to the AURORA2 database verified that improved performance in adverse circumstances can be achieved. © 2003 IEEE.
Event(s)
IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003
Other Subjects
Graphic methods; Metadata; Adverse environment; Feature transformations; Histogram equalizations; Principal Components; Real applications; Speech recognition technology; Time frequency; Training and testing; Speech recognition
Type
conference paper
File(s)![Thumbnail Image]()
Loading...
Name
01318457.pdf
Size
465.15 KB
Format
Adobe PDF
Checksum
(MD5):de448351f26844ec9777d3e0a2916428