https://scholars.lib.ntu.edu.tw/handle/123456789/457946
Title: | Decision tree-based classifier in providing telehealth service | Authors: | Chern C.-C.; Chen Y.-J.; Hsiao B.; CHING-CHIN CHERN |
Keywords: | Data mining; Data preprocessing; Decision tree; Telehealth service | Issue Date: | 2019 | Publisher: | BioMed Central Ltd. | Volume: | 19 | Issue: | 1 | Source Publication: | BMC Medical Informatics and Decision Making | Abstract: | Background: Although previous research showed that telehealth services can reduce the misuse of resources and urban-rural disparities, most healthcare insurers do not include telehealth services in their health insurance schemes. Therefore, no target variable exists for the classification approaches to learn from or train with. The problem of identifying the potential recipients of telehealth services when introducing telehealth services into health welfare or health insurance schemes becomes an unsupervised classification problem without a target variable. Methods: We propose a HDTTCA approach, which is a systematic approach (the main process of HDTTCA involves (1) data set preprocessing, (2) decision tree model building, and (3) predicting and explaining of the most important attributes in the data set for patients who qualify for telehealth service) to identify those who are eligible for telehealth services. Results: This work uses data from the NHIRD provided by the NHIA in Taiwan in 2012 as our research scope, which consist of 55,389 distinct hospitals and 653,209 distinct patients with 15,882,153 outpatient and 135,775 inpatient records. After HDTTCA produces the final version of the decision tree, the rules can be used to assign the values of the target variables in the entire NHIRD. Our data indicate that 3.56% (23,262 out of 653,209) of the patients are eligible for telehealth services in 2012. This study verifies the efficiency and validity of HDTTCA by using a large data set from the NHI of Taiwan. Conclusion: This study conducts a series of experiments 30 times to compare the HDTTCA results with the logistic regression findings by measuring their average performance and determining which model addresses the telehealth patient classification problem better.
Four important metrics are used to compare the results. In terms of sensitivity, the decision trees generated by HDTTCA and the logistic regression model are on equal grounds. In terms of accuracy, specificity, and precision, the decision tree generated by HDTTCA provides a better performance than that of the logistic regression model. When HDTTCA is applied, the decision tree model generates a competitive performance and provides clear, easily understandable rules. Therefore, HDTTCA is a suitable choice in solving telehealth service classification problems. © 2019 The Author(s). |
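The four metrics named in the abstract (accuracy, sensitivity, specificity, precision) are conventionally derived from confusion-matrix counts, and the abstract reports them as averages over 30 repeated experiments. The following is a minimal sketch of that calculation using the standard definitions. It is not the authors' code: the function names `classification_metrics` and `average_metrics` and the example counts are hypothetical and do not come from the paper.

```python
from statistics import mean


def classification_metrics(tp, fp, tn, fn):
    """Standard confusion-matrix metrics for a binary classifier.

    Assumes every denominator is non-zero (no empty classes or predictions).
    """
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total,      # share of all cases classified correctly
        "sensitivity": tp / (tp + fn),      # share of actual positives found
        "specificity": tn / (tn + fp),      # share of actual negatives found
        "precision": tp / (tp + fp),        # share of predicted positives that are correct
    }


def average_metrics(runs):
    """Average each metric over repeated runs given as (tp, fp, tn, fn) tuples."""
    per_run = [classification_metrics(*counts) for counts in runs]
    return {name: mean(m[name] for m in per_run) for name in per_run[0]}


# Hypothetical counts for two runs (illustrative only, NOT results from the paper).
runs = [(40, 10, 45, 5), (30, 20, 40, 10)]
avg = average_metrics(runs)
print(avg)
```

Comparing two models under this scheme would mean calling `average_metrics` once per model on that model's per-run counts and comparing the resulting values metric by metric.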
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85066609115&doi=10.1186%2fs12911-019-0825-9&partnerID=40&md5=1f36da22ab07e4ac3851427728336f29 https://scholars.lib.ntu.edu.tw/handle/123456789/457946 |
DOI: | 10.1186/s12911-019-0825-9 | SDG/Keywords: | adult; article; classifier; controlled study; data mining; decision tree; female; hospital patient; human; major clinical study; male; outpatient; patient coding; Taiwan; target variable; telehealth; validity; classification; statistical analysis; telemedicine; theoretical model; Classification; Data Interpretation, Statistical; Data Mining; Decision Trees; Humans; Models, Theoretical; Taiwan; Telemedicine |
Appears in Collections: | Department of Information Management |
Items in the IR system are protected by copyright, with all rights reserved, unless their copyright terms are specifically indicated.