https://scholars.lib.ntu.edu.tw/handle/123456789/634398
標題: | Effective string processing and matching for author disambiguation | 作者: | Chin, Wei Sheng Juan, Yu Chin Zhuang, Yong Wu, Felix Tung, Hsiao Yu Yu, Tong Wang, Jui Pin Chang, Cheng Xia Yang, Chun Pai Chang, Wei Cheng Huang, Kuan Hao Kuo, Tzu Ming Lin, Shan Wei Lin, Young San Lu, Yu Chen Su, Yu Chuan Wei, Cheng Kuang Yin, Tu Chun Li, Chun Liang Lin, Ting Wei Tsai, Cheng Hao SHOU-DE LIN HSUAN-TIEN LIN CHIH-JEN LIN |
公開日期: | 1-一月-2013 | 來源出版物: | Proceedings of the 2013 KDD Cup 2013 Workshop | 摘要: | Track 2 in KDD Cup 2013 aims at determining duplicated authors in a data set from Microsoft Academic Search. This type of problems appears in many large-scale applications that compile information from different sources. This paper describes our solution developed at National Taiwan University to win the first prize of the competition. We propose an effective name matching framework and realize two implementations. An important strategy in our approach is to consider Chinese and non-Chinese names separately because of their different naming conventions. Post-processing including merging results of two predictions further boosts the performance. Our approach achieves F1-score 0.99202 on the private leader board, while 0.99195 on the public leader board. © 2013 ACM. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/634398 | ISBN: | 9781450324953 | DOI: | 10.1145/2517288.2517295 |
顯示於: | 資訊工程學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。