Spectral-Temporal Receptive Field-Based Descriptors and Hierarchical Cascade Deep Belief Network for Guitar Playing Technique Classification

Wang C.-Y.; Chang P.-C.; Ding J.-J.; Tai T.-C.; Santoso A.; Liu Y.-T.; Wang J.-C.

Title:	Spectral-Temporal Receptive Field-Based Descriptors and Hierarchical Cascade Deep Belief Network for Guitar Playing Technique Classification
Authors:	Wang C.-Y; Chang P.-C; JIAN-JIUN DING; Tai T.-C; Santoso A; Liu Y.-T; Wang J.-C.
Keywords:	Deep belief network (DBN); guitar playing technique (GPT) classification; neural network; spectral-temporal receptive fields (STRFs)
Issue Date:	2022
Volume:	52
Issue:	5
Pages (start-end):	3684-3695
Source:	IEEE Transactions on Cybernetics
Abstract:	Music information retrieval is of great interest in audio signal processing. However, relatively little attention has been paid to the playing techniques of musical instruments. This work proposes an automatic system for classifying guitar playing techniques (GPTs). Automatic classification for GPTs is challenging because some playing techniques differ only slightly from others. This work presents a new framework for GPT classification: it uses a new feature extraction method based on spectral-temporal receptive fields (STRFs) to extract features from guitar sounds. This work applies a supervised deep learning approach to classify GPTs. Specifically, a new deep learning model, called the hierarchical cascade deep belief network (HCDBN), is proposed to perform automatic GPT classification. Several simulations were performed and the datasets of: 1) data on onsets of signals; 2) complete audio signals; and 3) audio signals in a real-world environment are adopted to compare the performance. The proposed system improves upon the F-score by approximately 11.47% in setup 1) and yields an F-score of 96.82% in setup 2). The results in setup 3) demonstrate that the proposed system also works well in a real-world environment. These results show that the proposed system is robust and has very high accuracy in automatic GPT classification. © 2013 IEEE.
URI:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85130766437&doi=10.1109%2fTCYB.2020.3014207&partnerID=40&md5=5836aa46f04c95d1dcd254476180a4f4 https://scholars.lib.ntu.edu.tw/handle/123456789/632451
ISSN:	2168-2267
DOI:	10.1109/TCYB.2020.3014207
SDG/Keywords:	Audio acoustics; Audio signal processing; Deep learning; Music; Musical instruments; Audio signal; Deep belief network; Deep belief networks; F-score; Guitar playing technique classification; Neural-networks; Playing techniques; Real world environments; Receptive fields; Spectral-temporal receptive field; Classification (of information); music; signal processing; Music; Neural Networks, Computer; Signal Processing, Computer-Assisted
Appears in Collections:	Department of Electrical Engineering
