An Automatic Syllabic-Level Lyrics-Vocal Alignment System for Mandarin Popular Vocal Music
Date Issued
2007
Date
2007
Author(s)
Yang, Tzung-Shuo
DOI
zh-TW
Abstract
This paper presents a novel method that automatically aligns textual lyrics with the corresponding Mandarin popular music in pure vocal form (i.e., singing voice only, without any musical instruments). Our goal is to automatically annotate the accurate time index of each syllable in the lyrics. Forced alignment is the baseline algorithm of the system. Because the beginning of each word may coincide with an onset, we need to identify the real onsets, and a support vector machine (SVM) is used to separate them out. In addition, we add an acoustic model to improve the results. The idea is that some Mandarin consonants (such as stops, fricatives, and affricates) produce a strong burst of airflow that causes the zero-crossing rate (ZCR) to increase suddenly. We use this characteristic to increase the accuracy of the alignment results.
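The ZCR cue described in the abstract can be illustrated with a minimal sketch. It is not the thesis's implementation: the function names `frame_zcr` and `zcr_jumps`, the frame length, the `min_rise` threshold, and the synthetic test signal are all assumptions chosen for illustration. The sketch computes the zero-crossing rate of non-overlapping frames and flags frames where the rate rises sharply, as might happen at a fricative-like burst.

```python
import math
import random


def frame_zcr(samples, frame_len=400):
    """Zero-crossing rate of each non-overlapping frame.

    The rate is the fraction of adjacent sample pairs whose signs differ.
    frame_len=400 is an assumed value (25 ms at 16 kHz).
    """
    rates = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        crossings = sum(
            1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
        )
        rates.append(crossings / (frame_len - 1))
    return rates


def zcr_jumps(rates, min_rise=0.2):
    """Indices of frames whose ZCR exceeds the previous frame's by min_rise.

    min_rise is a hypothetical threshold, not a value from the thesis.
    """
    return [i for i in range(1, len(rates)) if rates[i] - rates[i - 1] >= min_rise]


# Synthetic signal at an assumed 16 kHz sampling rate:
# 0.1 s of a 200 Hz tone (vowel-like), then 0.1 s of noise (fricative-like).
sr = 16000
rng = random.Random(0)
voiced = [math.sin(2 * math.pi * 200 * n / sr) for n in range(1600)]
noisy = [rng.uniform(-1.0, 1.0) for _ in range(1600)]

rates = frame_zcr(voiced + noisy)
jumps = zcr_jumps(rates)
print(rates)
print(jumps)
```

In this toy signal the tone frames have a low ZCR and the noise frames a high one, so the sharp rise is flagged at the first noise frame. In a real system such candidates would presumably be combined with the forced-alignment and SVM stages rather than used alone.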
Subjects
歌詞
強制校準
起始點
支援向量機
越零率
lyrics
forced-alignment
onset
SVM
ZCR
Type
thesis
File(s)
Name
ntu-96-R94942119-1.pdf
Size
23.31 KB
Format
Adobe PDF
Checksum
(MD5):c204881f66ab56e3a2e0fc75b696f099
