https://scholars.lib.ntu.edu.tw/handle/123456789/490622
標題: | The power of words: Enhancing music mood estimation with textual input of lyrics. | 作者: | Chi, Chung-Yi Wu, Ying-Shian Chu, Wei-rong Wu, Daniel C. Tsai, Richard Tzong-Han YUNG-JEN HSU |
公開日期: | 2009 | 來源出版物: | Proceedings - 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops, ACII 2009 | 摘要: | Music mood estimation (MME) is a key technology in mood-based music recommendation. While mainstream MME research nowadays relies on audio music analysis, exploring the significance of lyrics text in predicting song emotion is gaining attention in recent years. One major impediment to MME research is the lack of a clearly labeled and publicly available dataset annotating the emotion ratings of lyrics text and audio separately. In light of this, we compiled a dataset of 600 pop songs (iPop) from the mood ratings of 246 participants who experienced three different song sessions, lyrics text (L), audio music track (M), and the combination of lyrics text and audio music track (C). We then applied statistical analysis to estimate how lyrics text and audio contribute to a song's overall valence-arousal (V-A) mood ratings. Our results show that lyrics text are not only a valid measure for estimating a song's mood ratings but also provide supplementary information that can improve audio-only MME systems. Furthermore, a detailed examination suggests that lyrics text (L) ratings are better estimators of the overall mood ratings of a song (C) in cases where L and M ratings conflict. We then construct a MME system that employs both features extracted from lyrics text and audio music track and validate the conclusions acquired in our statistical analysis. In estimating either V or A rating, the model with lyrics text plus audio track features performs better than only the model with only lyrics text or audio track features. These results validate the statement acquired by the statistical analysis. ©2009 IEEE. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/490622 https://www.scopus.com/inward/record.uri?eid=2-s2.0-77949451324&doi=10.1109%2fACII.2009.5349591&partnerID=40&md5=5cd0c24dfcad3859fcd85f68cd0cfded |
DOI: | 10.1109/ACII.2009.5349591 | SDG/關鍵字: | Audio music; Audio track; Data sets; Key technologies; Music recommendation; Statistical analysis; Supplementary information; Audio systems; Estimation; Intelligent computing; Statistical methods; Audio acoustics |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。