Research of Low Bit-Rate Speech Coding Technology

闕志達

Title:	高壓縮比語音編碼技術之研究 (Research of Low Bit-Rate Speech Coding Technology)
Author:	闕志達
Keywords:	audio coding; masking effect; sound quality measure
Issue date:	31-Jul-1998
Publisher:	Taipei: Department and Graduate Institute of Electrical Engineering, National Taiwan University
Abstract:	This project studies the masking effect in audio signals and presents a new forward masking model for perceptual audio coding, applied to improving the quality of compressed audio. The model exploits adaptation of the peripheral sensory and neural elements in the auditory system, which psychoacoustics often regards as the cause of forward masking. The nonlinearity of the ear is modeled by the difference equations of a nonlinear analog circuit. We incorporate this model into the masking computation of the MPEG Layer III audio coding scheme and construct a masking surface in the time-frequency space. At the cost of some extra computation, the new scheme yields better decoded sound quality at the same compression ratio. In our experiments, subjective and objective sound quality measurements show that, to achieve the same reconstructed sound quality, the new scheme requires 12% to 23% fewer bits than the original MPEG Layer III scheme (the Chinese-language abstract of this record states 12% to 25%).
URI:	http://ntur.lib.ntu.edu.tw//handle/246246/7654
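Other identifiers:	872213E002019
Rights:	Department and Graduate Institute of Electrical Engineering, National Taiwan University
Appears in collections:	Department of Electrical Engineering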

Files in this item:

File	Description	Size	Format
872213E002019.pdf		43.15 kB	Adobe PDF
