On the preparation and validation of a large-scale dataset of singing transcription

Wang J.-Y; Jang J.-S.R.

doi:10.1109/ICASSP39728.2021.9414601

Journal

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Journal Volume

2021-June

Pages

276-280

Date Issued

2021

Author(s)

Wang J.-Y

JYH-SHING JANG

DOI

10.1109/ICASSP39728.2021.9414601

URI

https://www.scopus.com/inward/record.uri?eid=2-s2.0-85115050788&doi=10.1109%2fICASSP39728.2021.9414601&partnerID=40&md5=70bedf26b4c297347e361f22fd12f43a

https://scholars.lib.ntu.edu.tw/handle/123456789/607419

Abstract

This paper proposes a large-scale dataset for singing transcription, along with methods for fine-tuning and validating its contents. The dataset, named MIR-ST500, consists of more than 160,000 notes from 500 pop songs. To create it, we set labeling criteria and ask non-experts to label the notes. We also adjust the annotations to correct minor errors. Finally, to validate the dataset, we train a singing transcription model on MIR-ST500 and evaluate it on various datasets. The results show that MIR-ST500, being properly labeled and validated, can be used to build a better singing transcription model for various purposes. © 2021 IEEE

Event(s)

2021 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2021

Subjects

Automatic singing transcription

Dataset preparation

Dataset validation

Music information retrieval

Signal processing

Fine tuning

Large-scale dataset

Large dataset

Type

conference paper
