Publication:
On the preparation and validation of a large-scale dataset of singing transcription

cris.lastimport.scopus2025-04-20T22:21:06Z
cris.virtual.departmentNetworking and Multimediaen_US
cris.virtual.departmentComputer Science and Information Engineeringen_US
cris.virtual.departmentCenter for Artificial Intelligence and Advanced Roboticsen_US
cris.virtual.departmentFinTech Centeren_US
cris.virtual.orcid0000-0002-7319-9095en_US
cris.virtualsource.departmentc584a094-1560-413c-9e15-083ce2a92ffb
cris.virtualsource.departmentc584a094-1560-413c-9e15-083ce2a92ffb
cris.virtualsource.departmentc584a094-1560-413c-9e15-083ce2a92ffb
cris.virtualsource.departmentc584a094-1560-413c-9e15-083ce2a92ffb
cris.virtualsource.orcidc584a094-1560-413c-9e15-083ce2a92ffb
dc.contributor.authorWang J.-Yen_US
dc.contributor.authorJYH-SHING JANGen_US
dc.creatorWang J.-Y;Jang J.-S.R.
dc.date.accessioned2022-04-25T06:43:46Z
dc.date.available2022-04-25T06:43:46Z
dc.date.issued2021
dc.description.abstractThis paper proposes a large-scale dataset for singing transcription, along with some methods for fine-tuning and validating its contents. The dataset is named MIR-ST500, which consists of more than 160,000 notes from 500 pop songs. To create this large-scale dataset, we set some labeling criteria and ask non-experts to label notes. We also perform some adjustments on the annotation to correct minor errors. Finally, to validate the dataset, we train a singing transcription model on MIR-ST500 dataset and evaluate it on various datasets. The result shows that we can certainly construct a better singing transcription model for various purposes using MIR-ST500, which is properly labeled and validated. ? 2021 IEEE
dc.identifier.doi10.1109/ICASSP39728.2021.9414601
dc.identifier.issn15206149
dc.identifier.scopus2-s2.0-85115050788
dc.identifier.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85115050788&doi=10.1109%2fICASSP39728.2021.9414601&partnerID=40&md5=70bedf26b4c297347e361f22fd12f43a
dc.identifier.urihttps://scholars.lib.ntu.edu.tw/handle/123456789/607419
dc.relation.conference2021 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2021
dc.relation.ispartofICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
dc.relation.journalvolume2021-June
dc.relation.pages276-280
dc.subjectAutomatic singing transcription
dc.subjectDataset preparation
dc.subjectDataset validation
dc.subjectMusic information retrieval
dc.subjectSignal processing
dc.subjectFine tuning
dc.subjectLarge-scale dataset
dc.subjectLarge dataset
dc.titleOn the preparation and validation of a large-scale dataset of singing transcriptionen_US
dc.typeconference paperen
dspace.entity.typePublication

Files