https://scholars.lib.ntu.edu.tw/handle/123456789/581353
Title: | Issues and perspectives from 10,000 annotated financial social media data | Authors: | Chen C.-C; Huang H.-H; HSIN-HSI CHEN |
Keywords: | Commerce; Social networking (online); Domain specific; Sentiment dictionaries; Social media datum; Investments | Publication date: | 2020 | Pages: | 6106-6110 | Source publication: | LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings | Abstract: | In this paper, we investigate the annotation of financial social media data from several angles. We present Fin-SoMe, a dataset with 10,000 labeled financial tweets annotated by experts from both the front desk and the middle desk in a bank's treasury. These annotated results reveal that (1) writer-labeled market sentiment may be a misleading label; (2) writer's sentiment and market sentiment of an investor may be different; (3) most financial tweets provide unfounded analysis results; and (4) almost no investors write down the gain/loss results for their positions, which would otherwise greatly facilitate detailed evaluation of their performance. Based on these results, we address various open problems and suggest possible directions for future work on financial social media data. We also provide an experiment on the key snippet extraction task to compare the performance of using a general sentiment dictionary and using the domain-specific dictionary. The results echo our findings from the experts' annotations. © European Language Resources Association (ELRA), licensed under CC-BY-NC |
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85095035209&partnerID=40&md5=af39622d0bba45c528474dd98d5cb6a5 https://scholars.lib.ntu.edu.tw/handle/123456789/581353 |
Appears in collections: | Department of Computer Science and Information Engineering |