Su, LiLiSuYI-HSUAN YANG2023-10-202023-10-202014-01-019789604661374https://scholars.lib.ntu.edu.tw/handle/123456789/636394A robust onset detection method has to deal with wide dynamic ranges and diverse transient behaviors prevalent in real-world music signals. This paper presents contributions to robust onset detection by proposing two novel onset detection methods. The first one, termed power-scaled spectral flux (PSSF), applies power scaling to the spectral flux to better balance the wide dynamic range in the spectrogram. The second method, called peak-valley groupdelay (PVGD), enhances the robustness to noise terms by detecting peak-valley pairs from the summed group-delay function to capture the attack-decay envelope. The proposed methods are evaluated on a piano dataset and a diverse dataset of 12 different Western and Turkish instruments. To tackle the problem from a fundamental signal processing perspective, in this study we do not consider advanced methods such as late fusion, multi-band processing, and neural networks. Experimental result shows that the proposed methods yield competitive accuracy for the two datasets, improving the F-score for the former dataset from 0.956 to 0.963, and the F-score for the latter dataset from 0.712 to 0.754, comparing to existing methods.Power-scaled spectral flux and peak-valley group-delay methods for robust musical onset detectionconference paper2-s2.0-84908876171https://api.elsevier.com/content/abstract/scopus_id/84908876171