Evaluating music recommendation in a real-world setting: On data splitting and evaluation metrics
Journal
Proceedings - IEEE International Conference on Multimedia and Expo
Journal Volume
2015-August
ISBN
9781479970827
Date Issued
2015-08-04
Author(s)
Abstract
Evaluation is important for assessing how well a computer system fulfills a given user need. In the context of recommendation, researchers usually evaluate a recommender system by holding out a random subset of observed ratings and measuring how accurately the system reproduces them. This strategy, however, ignores the fact that in a real-world setting we are given past observed ratings and must predict future ones. New songs may appear, creating a cold-start problem, and users' musical preferences may change over time. Moreover, user satisfaction with a recommender system may depend on factors other than accuracy. In light of these observations, we propose in this paper a novel evaluation framework that uses various time-based data splitting methods and evaluation metrics to assess the performance of recommender systems. Using millions of listening records collected from a commercial music streaming service, we compare the performance of collaborative filtering (CF) and content-based (CB) models with low-level audio features and semantic audio descriptors. Our evaluation shows that the CB model with semantic descriptors achieves a better trade-off among accuracy, novelty, diversity, freshness, and popularity, and handles the cold-start problem of new songs well.
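To make the splitting idea concrete, the following is a minimal Python sketch of a time-based split of listening records, contrasted with the usual random hold-out, together with one possible accuracy metric. The column names ("user", "song", "timestamp"), the cutoff date, and the recall@k helper are illustrative assumptions for this record, not the paper's actual data schema or metric definitions.

    # A minimal sketch, assuming listening logs in a pandas DataFrame with
    # columns "user", "song", and "timestamp" (all hypothetical names).
    import pandas as pd

    def time_based_split(logs: pd.DataFrame, cutoff: str):
        """Split listening records at a fixed point in time: everything
        before `cutoff` becomes training data, everything at or after it
        becomes test data. Unlike a random hold-out, songs that first
        appear after the cutoff are unseen at training time, which
        reproduces the cold-start setting the abstract describes."""
        logs = logs.sort_values("timestamp")
        train = logs[logs["timestamp"] < cutoff]
        test = logs[logs["timestamp"] >= cutoff]
        return train, test

    def recall_at_k(recommended: list, relevant: set, k: int = 10) -> float:
        """Fraction of a user's held-out (future) songs that appear in the
        top-k recommendations; one of several possible accuracy metrics."""
        if not relevant:
            return 0.0
        hits = sum(1 for song in recommended[:k] if song in relevant)
        return hits / len(relevant)

    # Example usage (cutoff date is illustrative only):
    # train, test = time_based_split(logs, cutoff="2014-06-01")

The point of the time-based split is that the test set can contain songs absent from the training set, so a model's behavior under cold-start conditions is measured rather than hidden, which a random hold-out cannot do.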
Subjects
cold-start | Collaborative filtering | content-based recommendation | data splitting | evaluation metrics
Type
conference paper
