Ping, X.-O.X.-O.PingTseng, Y.-J.Y.-J.TsengJA-DER LIANGGUAN-TARN HUANGPEI-MING YANGFEI-PEI LAI2021-04-222021-04-2220150922-6389https://www.scopus.com/inward/record.uri?eid=2-s2.0-84926444199&doi=10.3233%2f978-1-61499-484-8-1930&partnerID=40&md5=6297328b96f9951571b4c71c43dd142ahttps://scholars.lib.ntu.edu.tw/handle/123456789/557479The problem of missing values frequently occurs during data analysis. Imputation is one of the solutions to handle missing data. Clinical data often contain multiple measurements such as laboratory test results which are measured at different time points. In this study, we compared three imputation methods and their effects on different multiple measurement data sets with different sampling time periods. Data sets of liver cancer were used in this study for classification of liver cancer recurrence based on two types of classification models built by support vector machine (SVM) and random forests. The results report appropriate combinations of imputation methods and sampling time periods which achieve better classification results than those of other imputation methods and periods. These reported the leading imputation method with SVM is significantly different (P<0.001) from mean imputation with SVM which is frequently used by data sets with missing values. ? 2015 The authors and IOS Press. All rights reserved.[SDGs]SDG3Autocorrelation; Classification (of information); Decision trees; Diseases; Intelligent control; Intelligent systems; Classification models; Classification results; Comparative studies; imputation; Missing value imputation; Missing values; Multiple measurements; Random forests; Support vector machinesMissing value imputation on multiple measurements for prediction of liver cancer recurrence: A comparative studyconference paper10.3233/978-1-61499-484-8-19302-s2.0-84926444199