https://scholars.lib.ntu.edu.tw/handle/123456789/632451
標題: | Spectral-Temporal Receptive Field-Based Descriptors and Hierarchical Cascade Deep Belief Network for Guitar Playing Technique Classification | 作者: | Wang C.-Y Chang P.-C JIAN-JIUN DING Tai T.-C Santoso A Liu Y.-T Wang J.-C. |
關鍵字: | Deep belief network (DBN); guitar playing technique (GPT) classification; neural network; spectral-temporal receptive fields (STRFs) | 公開日期: | 2022 | 卷: | 52 | 期: | 5 | 起(迄)頁: | 3684-3695 | 來源出版物: | IEEE Transactions on Cybernetics | 摘要: | Music information retrieval is of great interest in audio signal processing. However, relatively little attention has been paid to the playing techniques of musical instruments. This work proposes an automatic system for classifying guitar playing techniques (GPTs). Automatic classification for GPTs is challenging because some playing techniques differ only slightly from others. This work presents a new framework for GPT classification: it uses a new feature extraction method based on spectral-temporal receptive fields (STRFs) to extract features from guitar sounds. This work applies a supervised deep learning approach to classify GPTs. Specifically, a new deep learning model, called the hierarchical cascade deep belief network (HCDBN), is proposed to perform automatic GPT classification. Several simulations were performed and the datasets of: 1) data on onsets of signals; 2) complete audio signals; and 3) audio signals in a real-world environment are adopted to compare the performance. The proposed system improves upon the F-score by approximately 11.47% in setup 1) and yields an F-score of 96.82% in setup 2). The results in setup 3) demonstrate that the proposed system also works well in a real-world environment. These results show that the proposed system is robust and has very high accuracy in automatic GPT classification. © 2013 IEEE. |
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85130766437&doi=10.1109%2fTCYB.2020.3014207&partnerID=40&md5=5836aa46f04c95d1dcd254476180a4f4 https://scholars.lib.ntu.edu.tw/handle/123456789/632451 |
ISSN: | 21682267 | DOI: | 10.1109/TCYB.2020.3014207 | SDG/關鍵字: | Audio acoustics; Audio signal processing; Deep learning; Music; Musical instruments; Audio signal; Deep belief network; Deep belief networks; F-score; Guitar playing technique classification; Neural-networks; Playing techniques; Real world environments; Receptive fields; Spectral-temporal receptive field; Classification (of information); music; signal processing; Music; Neural Networks, Computer; Signal Processing, Computer-Assisted |
顯示於: | 電機工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。