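Advisor: 貝蘇章
National Taiwan University: Graduate Institute of Communication Engineering
羅安然 (Lo, An-Jan)
2014-11-30
2018-07-05
2014
http://ntur.lib.ntu.edu.tw//handle/246246/264244

Speech processing occupies an important place in signal processing research. Human speech production can be modeled as the convolution of two parts: the excitation signal emitted by the glottis and the filter formed by the physical structure of the oral cavity. Studying the details of this process and obtaining speech feature parameters is useful in many fields, such as speech synthesis and conversion. In this thesis we discuss the zeros of the z-transform (ZZT) algorithm and its application to extracting the speech excitation signal. After the z-transform, the zeros of the speech signal reveal its mixed-phase property on the z-plane (zeros lie both inside and outside the circle of radius 1), which allows the speech excitation signal and the vocal tract filter to be separated. In addition, based on the study of ZZT plots, the phase information obtained from the group delay function can be greatly improved by the chirp group delay method, which is then used to obtain the formant peaks of the vocal tract filter. The results are compared with existing speech processing tools, and the improvement that the Attenuated Main Excitation (AME) method brings to formant peak extraction is tested.

Speech processing is one of the major topics in signal processing research. Speech production can be modeled as the convolution of a glottal source excitation with a vocal tract filter. Research into the details of speech production and into feature extraction can be applied in fields such as speech synthesis and transformation. In this thesis, we discuss the zeros of the z-transform (ZZT) algorithm developed by Dr. Baris Bozkurt [1] and its application to the extraction of the excitation pulse in the source-tract model of human speech signals. After the z-transform, the zeros of the speech signal can be represented on the z-plane, revealing the mixed-phase property that is used in source-tract separation. In addition, the study of the ZZT plot shows that the phase information obtained from the group delay spectrum can be considerably improved by using the chirp group delay.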
Moreover, we demonstrate formant tracking with ZZT, compare the performance of ZZT with that of other speech signal processing tools, and apply Attenuated Main Excitation (AME) for further improvement.

Chinese Abstract
ABSTRACT
LIST OF FIGURES
LIST OF TABLES
Chapter 1 Introduction
1.1 Motivation
1.2 Background
1.3 LF model of speech voice
1.4 ZZT representation of speech signals
Chapter 2 ZZT Algorithm
2.1 Definition
2.2 ZZT of Glottal Signal
2.3 Windowing effects on ZZT
2.4 Glottal Closure Instant (GCI) Detection
2.5 Conclusion
Chapter 3 The Source Excitation Extraction using ZZT
3.1 Introduction
3.2 ZZT Decomposition
3.3 Complex Cepstrum
3.4 Test with noise
3.5 Chirp decomposition
3.6 Conclusion
Chapter 4 Applications of ZZT Chirp Group Delay and Formant Tracking
4.1 Definition
4.2 Application in formant tracking
4.2.1 Spectrogram
4.2.2 Hilbert-Huang Transform
4.2.3 Chirp Group Delay of Zero-Phase Version Signal (CGDZP)
4.2.4 Disturbance of High-Pitched Frequency to Formant Tracking
4.2.5 Advanced Test for CGDZP and Praat
4.2.6 The Effect of Attenuated Main Excitation (AME) on CGDZP
4.3 Conclusion
Chapter 5 Conclusion and Future Works
References

5971675 bytes
application/pdf
Thesis release date: 2014/08/17
Thesis usage rights: paid licensing authorized (royalties returned to the university)
ZZT
Glottal source / vocal tract signal separation
Group delay function
Formant peak extraction
ZZT演算法之應用於語音激發信號擷取
The Source Excitation Extraction of Speech Signal Using ZZT Method
thesis
http://ntur.lib.ntu.edu.tw/bitstream/246246/264244/1/ntu-103-R96942127-1.pdf
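
The ZZT idea summarized in the abstract can be illustrated with a minimal sketch. It is not taken from the thesis: the function names `zzt` and `split_by_unit_circle` and the three-sample toy frame are assumptions made for illustration only. The sketch treats a finite frame as polynomial coefficients, finds its zeros, and splits them by whether they lie inside or outside the unit circle, which is the mixed-phase separation the abstract refers to.

```python
import numpy as np

def zzt(frame):
    """Return the zeros of the z-transform of a finite frame x[0..N-1].

    X(z) = sum_n x[n] z^(-n). Multiplying by z^(N-1) gives a polynomial in z
    whose coefficients, highest power first, are x[0], x[1], ..., x[N-1],
    so its roots are the zeros of X(z).
    """
    return np.roots(np.asarray(frame, dtype=float))

def split_by_unit_circle(zeros):
    """Split zeros into those strictly inside and strictly outside |z| = 1.

    Zeros lying exactly on the unit circle are dropped in this sketch.
    """
    radius = np.abs(zeros)
    return zeros[radius < 1.0], zeros[radius > 1.0]

# Toy frame (hypothetical): one zero at 0.5 (inside) and one at 2.0 (outside).
x = np.convolve([1.0, -0.5], [1.0, -2.0])

zeros = zzt(x)
inside, outside = split_by_unit_circle(zeros)

# Rebuild each component from its zeros. Both are monic, so in general the
# original frame is recovered only up to the gain factor x[0].
min_phase = np.real(np.poly(inside))   # zeros inside the unit circle
max_phase = np.real(np.poly(outside))  # zeros outside the unit circle
```

Convolving `min_phase` with `max_phase` and scaling by `x[0]` reproduces the toy frame. A real speech frame would first need the GCI-synchronous windowing covered in Chapter 2 of the thesis, which this sketch omits.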