https://scholars.lib.ntu.edu.tw/handle/123456789/546920
標題: | Using Partial Combination Models to Improve Prediction Quality and Transparency in Mixed Datasets | 作者: | Wu, Y.-H. Chang, Y.-H. Tien, Y.-J. Yu, C.-J. SHENG-DE WANG |
關鍵字: | expert systems; hierarchical clustering; Hierarchical method; manufacturing; prediction methods; regression analysis | 公開日期: | 2020 | 卷: | 8 | 起(迄)頁: | 132106-132120 | 來源出版物: | IEEE Access | 摘要: | Mixed Datasets with complex interactions between categorical and numerical attributes are common in engineering and business applications. For example, production rates in manufacturing systems are jointly influenced by several categorical and numerical attributes, such as machine and product types and their numerical attributes. This study aims to improve the prediction performance and transparency of mixed datasets with complex interactions using machine learning (ML) methods. The proposed method requires lesser data and computational effort than existing hierarchical or clustering regression methods. Multiple prediction models can be generated by partitioning a dataset into subsets with different categorical attribution combinations. One- and two-stage model selection methods are proposed to use the training and validation datasets in selecting better models among all the prediction models. Numerical results demonstrate the potential of the model selection approach in a mixed dataset from a semiconductor manufacturer. In comparison with regression models, more than 30% reduction in root mean square error is observed using the proposed model selection approach. The cross-validation test results also demonstrated a 10% improvement in accuracy against the properly tuned XGBoost models. Moreover, the proposed model selection approach is compatible with other regression or ML prediction methods and can be used to improve the model's transparency of any existing methods on mixed datasets. © 2013 IEEE. |
URI: | https://www.scopus.com/inward/record.url?eid=2-s2.0-85089543893&partnerID=40&md5=0fc2ae8f94fcfdd6840da527793f15b1 https://scholars.lib.ntu.edu.tw/handle/123456789/546920 |
ISSN: | 21693536 | DOI: | 10.1109/ACCESS.2020.3008475 | SDG/關鍵字: | Forecasting; Hierarchical clustering; Manufacture; Mean square error; Regression analysis; Transparency; Business applications; Computational effort; Cross-validation tests; Numerical attributes; Prediction methods; Prediction performance; Root mean square errors; Semiconductor manufacturers; Predictive analytics |
顯示於: | 工業工程學研究所 |
在 IR 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。