Automatic phonetic segmentation by score predictive model for the corpora of mandarin singing voices

Lin, C.-Y.; JYH-SHING JANG; Lin, C.-Y.;Jang, J.-S.

doi:10.1109/TASL.2007.902051

Automatic phonetic segmentation by score predictive model for the corpora of mandarin singing voices

Journal

IEEE Transactions on Audio, Speech and Language Processing

Journal Volume

15

Journal Issue

7

Pages

2151 - 2159

Date Issued

2007

Author(s)

Lin, C.-Y.

JYH-SHING JANG

DOI

10.1109/TASL.2007.902051

URI

https://www.scopus.com/inward/record.uri?eid=2-s2.0-64249131507&doi=10.1109%2fTASL.2007.902051&partnerID=40&md5=560bc9a6dc89609ceccb630db154f763

http://scholars.lib.ntu.edu.tw/handle/123456789/331151

Abstract

This paper proposes the concept of a score predictive model (SPM) that can refine the phoneme boundaries obtained by a hidden Markov model (HMM) and dynamic time warping (DTW) for a Mandarin singing voice corpus. An SPM is constructed by using support vector regression. It predicts the score of a phoneme boundary according to the boundary's 58-dimensional feature vector. The correctly identified boundaries of a singing corpus can then be used for corpus-based singing voice synthesis. Several experiments with different settings, including the use of different initial estimates, different acoustic features, and various regression approaches, were designed to verify the feasibility of the proposed approach. Experimental results demonstrate that the proposed SPM is able to effectively refine the results of the HMM and DTW. © 2006 IEEE.

Subjects

Automatic phonetic segmentation; Boundary refinement; Score predictive model (SPM); Singing voice synthesis

SDGs

[SDGs]SDG4

Other Subjects

Acoustic features; Automatic phonetic segmentation; Boundary refinement; Dynamic Time Warping; Feature vectors; Initial estimates; Score predictive model (SPM); Singing voice synthesis; Support vector regressions; Hidden Markov models; Linguistics; Predictive control systems; Refining; Software agents

Type

journal article

Automatic phonetic segmentation by score predictive model for the corpora of mandarin singing voices

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)