https://scholars.lib.ntu.edu.tw/handle/123456789/641272
標題: | Lightly Weighted Automatic Audio Parameter Extraction for the Quality Assessment of Consensus Auditory-Perceptual Evaluation of Voice | 作者: | Lin, Yi Heng Tseng, Wen Hsuan Chen, Li Chin Tan, Ching Ting Tsao, Yu |
關鍵字: | audio feature extraction | Consensus auditory-perceptual evaluation of voice | pre-Trained model | voice evaluation | voice quality | 公開日期: | 1-一月-2024 | 來源出版物: | Digest of Technical Papers - IEEE International Conference on Consumer Electronics | 摘要: | The Consensus Auditory-Perceptual Evaluation of Voice is a widely employed tool in clinical voice quality assessment that is significant for streaming communication among clinical professionals and benchmarking for the determination of further treatment. Currently, because the assessment relies on experienced clinicians, it tends to be inconsistent, and thus, difficult to standardize. To address this problem, we propose to leverage lightly weighted automatic audio parameter extraction, to increase the clinical relevance, reduce the complexity, and enhance the interpretability of voice quality assessment. The proposed method utilizes age, sex, and five audio parameters: jitter, absolute jitter, shimmer, harmonic-To-noise ratio (HNR), and zero crossing. A classical machine learning approach is employed. The result reveals that our approach performs similar to state-of-The-Art (SOTA) methods, and outperforms the latent representation obtained by using popular audio pre-Trained models. This approach provide insights into the feasibility of different feature extraction approaches for voice evaluation. Audio parameters such as jitter and the HNR are proven to be suitable for characterizing voice quality attributes, such as roughness and strain. Conversely, pre-Trained models exhibit limitations in effectively addressing noise-related scorings. This study contributes toward more comprehensive and precise voice quality evaluations, achieved by a comprehensively exploring diverse assessment methodologies. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/641272 | ISBN: | 9798350324136 | ISSN: | 0747668X | DOI: | 10.1109/ICCE59016.2024.10444177 |
顯示於: | 醫學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。