https://scholars.lib.ntu.edu.tw/handle/123456789/626273
標題: | Essential Step Toward Mining Big Polymer Data: PolyName2Structure, Mapping Polymer Names to Structures | 作者: | Lin C Wang P.-H Hsiao Y YI-TSU CHAN Engler A.C Pitera J.W Sanders D.P Cheng J YUFENG JANE TSENG |
關鍵字: | cheminformatics; machine learning; polymer informatics | 公開日期: | 2020 | 卷: | 2 | 期: | 8 | 起(迄)頁: | 3107-3113 | 來源出版物: | ACS Applied Polymer Materials | 摘要: | Advances in polymer science have made polymers essential in our everyday life and have yielded unprecedented quantities of data over the past several decades. However, it is still challenging and inefficient to organize such scattered and accumulated “big data” in a text format through mass journals, patents, and web pages due to the complexity and ambiguity of polymer representations. In this paper, we report the first automated framework, PolyName2Structure (PN2S), which is able to convert various polymer name representations to their corresponding polymer structures. In PN2S, machine learning models were built to predict the polymerization pathway, identify the reacting group(s), and generate repeating units after polymerization. This PN2S system achieved over 90% accuracy when applied to polymer names listed in a commercial catalog, embodying the first step toward resolving the complexity of the data structure for polymers by building a practical model that enables text mining of structural polymer information. Copyright © 2020 American Chemical Society |
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85102519956&doi=10.1021%2facsapm.0c00273&partnerID=40&md5=05a846b0eede4a4e439f7e631089923e https://scholars.lib.ntu.edu.tw/handle/123456789/626273 |
ISSN: | 26376105 | DOI: | 10.1021/acsapm.0c00273 | SDG/關鍵字: | Polymerization; Websites; Machine learning models; Polymer science; Polymer structure; Practical model; Reacting groups; Repeating unit; Structural polymers; Text format; Text mining |
顯示於: | 化學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。