An on-the-fly mandarin singing voice synthesis system

Lin, Cheng-Yuan; JYH-SHING JANG; Hwang, Shaw-Hwa; Lin, Cheng-Yuan;Jang, Jyh-Shing Roger;Hwang, Shaw-Hwa

doi:10.1007/3-540-36228-2_78

An on-the-fly mandarin singing voice synthesis system

Journal

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Journal Volume

2532

Pages

631 - 638

Date Issued

2002

Author(s)

Lin, Cheng-Yuan

JYH-SHING JANG

Hwang, Shaw-Hwa

DOI

10.1007/3-540-36228-2_78

URI

https://scholars.lib.ntu.edu.tw/handle/123456789/488875

https://doi.org/10.1007/3-540-36228-2_78

https://www.scopus.com/inward/record.uri?eid=2-s2.0-84949968741&doi=10.1007%2f3-540-36228-2_78&partnerID=40&md5=cf1783a16f0a2b3a8dd5399534e767a4

Abstract

An on-the-fly Mandarin singing voice synthesis system, called SINVOIS (singing voice synthesis), is proposed in this paper. The SINVOIS system can receive the continuous speech of the lyrics of a song, and generate the singing voice immediately based on the music score information (embedded in a MIDI file) of the song. Two sub-systems are designed and embedded into the system. One is the synthesis unit generator and the other is the pitch-shifting module. In the first one, the Viterbi decoding algorithm is employed on a continuous speech to generate the synthesis unit for singing voice. And the PSOLA method is employed to implement the pitch-shifting function in the second one. Moreover, the energy, duration, and spectrum modifications on the synthesis unit are also implemented in the second part. The synthesized singing voice sounds reasonably good. From the subjective listening test, the MOS (mean opinion score) of 3.1 are obtained for synthesized singing voices. © Springer-Verlag Berlin Heidelberg 2002.

Event(s)

3rd IEEE Pacific Rim Conference on Multimedia, PCM 2002

Other Subjects

Viterbi algorithm; Continuous speech; Mean opinion scores; Pitch shifting; Singing voices; Singing-voice synthesis; Spectrum modification; Subjective listening test; Viterbi decoding algorithms; Embedded systems

Type

conference paper

An on-the-fly mandarin singing voice synthesis system

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)