https://scholars.lib.ntu.edu.tw/handle/123456789/413087
標題: | Fusing domain-specific data with general data for in-domain applications | 作者: | Yen A.-Z. Huang H.-H. HSIN-HSI CHEN |
關鍵字: | Cross-domain data fusion;Outlier detection;Sentiment analysis | 公開日期: | 2017 | 起(迄)頁: | 566-572 | 來源出版物: | 2017 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2017 | 摘要: | This paper analyzes the lexical semantics of domain-specific terms based on various pre-Trained specific domain and general domain word vectors, and addresses the semantic drift between domains. To capture lexical semantics in the specific domain, we propose a bridge mechanism to introduce domain-specific data into general data, and re-Train word vectors. We find that even a small-scale fusion can result in the similar lexical semantics learned by using the large-scale domain-specific dataset. Experiments on sentiment analysis and outlier detection show that application of word embedding by the fusion dataset has the better performance than applications of word embeddings by pure large domain-specific and pure large general datasets. The simple, but effective methodology facilitates the domain adaptation of distributed word representations. ? 2017 ACM. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/413087 | ISBN: | 9781450349512 | DOI: | 10.1145/3106426.3106473 |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。