Histogram-based quantization for robust and/or distributed speech recognition
Journal
IEEE Transactions on Audio, Speech and Language Processing
Journal Volume
16
Journal Issue
4
Pages
859-873
Date Issued
2008
Author(s)
Wan, Chia-Yu
Abstract
In a distributed speech recognition (DSR) framework, the speech features are quantized and compressed at the client and recognized at the server. However, recognition accuracy is degraded by environmental noise at the input, quantization distortion, and transmission errors. In this paper, histogram-based quantization (HQ) is proposed, in which the partition cells for quantization are dynamically defined by the histogram or order statistics of a segment of the most recent past values of the parameter to be quantized. This scheme is shown to be able to solve to a good degree many problems related to DSR. A joint uncertainty decoding (JUD) approach is further developed to consider the uncertainty caused by both environmental noise and quantization errors. A three-stage error concealment (EC) framework is also developed to handle transmission errors. The proposed HQ is shown to be an attractive feature transformation approach for robust speech recognition outside of a DSR environment as well. All the claims have been verified by experiments using the Aurora 2 testing environment, and significant performance improvements for both robust and/or distributed speech recognition over conventional approaches have been achieved. © 2008 IEEE.
Subjects
Error compensation; Robustness; Speech recognition; Vector quantization (VQ)
SDGs
Other Subjects
Conventional approaches; Distributed speech recognition; Environmental noise; Error concealments; Feature transformations; Joint uncertainties; Order statistics; Partition cells; Performance improvements; Quantization distortions; Quantization errors; Recognition accuracies; Robust speech recognition; Robustness; Speech features; Testing environments; Transmission errors; Decoding; Error compensation; Error detection; Speech analysis; Vector quantization; Speech recognition
Type
journal article
File(s)![Thumbnail Image]()
Loading...
Name
Histogram-based quantization for robust and-or distributed speech recognition.pdf
Size
1.95 MB
Format
Adobe PDF
Checksum
(MD5):2cb447faa2258d0d7cea19720b57e6b6
