Implement Mandarin Speech Conversion on Mixed Excitation Linear Prediction (MELP) CODEC
Date Issued
2006
Date
2006
Author(s)
Hsiao, Chun-Jung
DOI
en-US
Abstract
In this work we focused on reusing parameters of 2.4kbps Mixed Excitation Linear Prediction (MELP) voice coder, implement the speech conversion from source speaker to the specified target speaker.
Using MELP algorithm to analyze the speech, statistically we found that for the same phoneme of the same speaker, the first and second stage indexes of MELP 4-stage vector quantized Line Spectral Frequency (LSF) tend to collect around some certain index values. We proposed a method that based on Mandarin syllable to build up a mapping table of these indexes between the spectral features of the source and the target speakers. To avoid the discontinued voice that caused by mismatching of the syllable, we proposed a new segmental technique based on feature vector frame. The pitch periods of residual signal were also modified using linear relationship. The simulation results show that the source speaker can be changed to the target speaker, and the quality of synthesized voice is good.
Subjects
語音轉換
混合激發線性預測
MELP
Speech Conversion
Mandarin syllable
Type
thesis
File(s)![Thumbnail Image]()
Loading...
Name
ntu-95-P92921005-1.pdf
Size
23.31 KB
Format
Adobe PDF
Checksum
(MD5):21417a27ecb4c93f44e3b6a34a342d70