MediaEval 2019 emotion and theme recognition task: A VQ-VAE based approach
Journal
CEUR Workshop Proceedings
Journal Volume
2670
Date Issued
2019-01-01
Author(s)
Abstract
In this paper, we, Taiinn (Taiwan) team, use pre-trained VQ-VAE as a feature extractor and compare two types of classifier for audio-based emotion and theme recognition. The VQ-VAE is pre-trained on the Million Song Dataset (MSD). We found better performance in ROC-AUC by fixing the pre-trained parameters of VQ-VAE while training the classifier. In addition, an embedding with bigger shape works better than the one-dimensional counterpart. The code and submitted models can be found at: https://github.com/annahung31/ moodtheme-tagging.
Type
conference paper
