Patnaik, Rakesh KumarRakesh KumarPatnaikMING-CHIH HOYeh, J. AndrewJ. AndrewYeh2023-11-232023-11-232023-01-019798350324174https://scholars.lib.ntu.edu.tw/handle/123456789/637321In the medical field, acquiring a sufficient number of medical samples can be challenging, and the collected datasets may be imbalanced and small. To address these issues, we propose a weighted SMOTE algorithm that targets imbalanced datasets. This technique has been applied to a dataset of breath biomarkers of liver disease as a feature set and a supervised learning model. Our results show that the proposed method significantly improves the prediction probability and classification performance of the chosen model in both the original imbalanced dataset and the balanced dataset. This study demonstrates the potential of the proposed approach to enhance machine learning performance while dealing with small and imbalanced datasets in medical applications.enBiomarker | Liver | Machine Learning | Oversampling | Weighted SMOTEWeighted SMOTE Algorithm: A Tool To Improve Disease Prediction With Imbalanced Dataconference paper10.1109/ICCE-Taiwan58799.2023.102267032-s2.0-85174942975https://api.elsevier.com/content/abstract/scopus_id/85174942975