https://scholars.lib.ntu.edu.tw/handle/123456789/638106
標題: | The Impact of Feature Normalization on Different Feature Types of Medical Datasets | 作者: | Hu, Ya Han KANG ERNEST LIU Tsai, Chih Fong |
關鍵字: | data preprocessing | feature normalization | medical datasets | pattern classification | 公開日期: | 12-五月-2023 | 來源出版物: | ACM International Conference Proceeding Series | 摘要: | To obtain quality data mining results, data pre-processing is usually performed in the knowledge discovery in databases (KDD) process. Particularly, feature normalization or scaling is one important step in data pre-processing. This is because many datasets usually contain some features that have broad ranges of values, and feature normalization is applied to normalize or rescale each feature value to a fixed range, usually between 0 and 1. For the medical domain datasets, they usually contain three different kinds of data including categorical, numerical, and the mixed data type, this paper examines the effect of performing feature normalization on the three different types of medical datasets. Our experimental results indicate that for the categorical and some mixed types of datasets performing feature normalization does not necessarily make the k-NN and SVM classifiers perform better than the ones without feature normalization. On the other hand, for the numerical type of datasets k-NN and SVM by feature normalization perform better than the baseline classifiers. |
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85178030324&doi=10.1145%2f3608298.3608304&partnerID=40&md5=4e32bcb1e94e7163cf586d999e8d3e7b https://scholars.lib.ntu.edu.tw/handle/123456789/638106 |
ISBN: | 9798400700712 | DOI: | 10.1145/3608298.3608304 |
顯示於: | 農業經濟學系 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。