Perception of speech signals using self-organization on linear neuron array
Resource
Neural Networks, 1993. IJCNN '93-Nagoya. Proceedings of 1993 International Joint Conference on
Journal
1993 International Joint Conference on Neural Networks
Journal Volume
1
Pages
251-254
Date Issued
1993-10
Date
1993-10
Author(s)
Shiah, Chwan-Yi
DOI
N/A
Abstract
A continuous speech recognition system with finite set of Chinese words is devised for selected applications. With proper design of the self-organizing map for the speech signals, the precedence relations among the spectral patterns within a token period can be preserved by the topology preservations and the serious nonlinear time warping can thus be overcome. The one dimensional hierarchical relations among the sequential spectral patterns are able to be represented by the topology map developed on the linear array of neurons. We then devise two kinds of perception energies based on the trained map. One of the energies is derived from properly fitting a precedence curve on the sequential excitation patterns of the map during a whole word period. The other energy is obtained from the accumulation of total excitations on the map during a word period. Thresholds for the perception energies are then designed experimentally. A set of 1309 linear array maps are used for representing the total 1309 standard Chinese word pronunciations. Each linear array contains 100 equally spaced and linearly ordered neurons. A verification of the system on a personal computer with a modern DSP board has been performed and the result was quite satisfactory.
Other Subjects
Algorithms; Hierarchical systems; Maps; Mathematical models; Neural networks; Parameter estimation; Personal computers; Speech analysis; Topology; Dimensional hierarchical relation; Hidden Markov model; Linear neuron array; Nonlinear time warping; Self organizing map; Sequential excitation pattern; Time delay neural network; Topology preservation; Speech recognition
Type
conference paper
File(s)![Thumbnail Image]()
Loading...
Name
00713904.pdf
Size
426.19 KB
Format
Adobe PDF
Checksum
(MD5):de0297a116b9d240535e0cbb24f90920
