An on-the-fly mandarin singing voice synthesis system
Journal
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Journal Volume
2532
Pages
631 - 638
Date Issued
2002
Author(s)
Abstract
An on-the-fly Mandarin singing voice synthesis system, called SINVOIS (singing voice synthesis), is proposed in this paper. The SINVOIS system can receive the continuous speech of the lyrics of a song, and generate the singing voice immediately based on the music score information (embedded in a MIDI file) of the song. Two sub-systems are designed and embedded into the system. One is the synthesis unit generator and the other is the pitch-shifting module. In the first one, the Viterbi decoding algorithm is employed on a continuous speech to generate the synthesis unit for singing voice. And the PSOLA method is employed to implement the pitch-shifting function in the second one. Moreover, the energy, duration, and spectrum modifications on the synthesis unit are also implemented in the second part. The synthesized singing voice sounds reasonably good. From the subjective listening test, the MOS (mean opinion score) of 3.1 are obtained for synthesized singing voices. © Springer-Verlag Berlin Heidelberg 2002.
Event(s)
3rd IEEE Pacific Rim Conference on Multimedia, PCM 2002
Other Subjects
Viterbi algorithm; Continuous speech; Mean opinion scores; Pitch shifting; Singing voices; Singing-voice synthesis; Spectrum modification; Subjective listening test; Viterbi decoding algorithms; Embedded systems
Type
conference paper