Yang, Chao-HsuChao-HsuYangLin, Tony EightTony EightLinHsieh, Jui-HuaJui-HuaHsiehHsu, Kai-ChengKai-ChengHsuPEI-TE CHIUEH2025-12-182025-12-182025-11-1015499596https://www.scopus.com/record/display.uri?eid=2-s2.0-105021084923&origin=resultslisthttps://scholars.lib.ntu.edu.tw/handle/123456789/734755Assessing the mutagenicity of chemical compounds is crucial for ensuring their safety and minimizing potential environmental and public health risks. However, traditional mutagenicity assessments, such as the Ames test, are time-consuming, resource-intensive, and often limited in their capacity to screen a large number of compounds. To address this gap, predictive models powered by deep learning offer a promising alternative for rapid and cost-effective mutagenicity screening. In this study, we propose an integrated deep learning framework utilizing diverse molecular features to predict compound mutagenicity. In the total usage of 5866 compounds, 5279 compounds were utilized for model training, and the other 587 compounds were utilized for model evaluation. A total of 78 integrated models were developed by systematically combining 13 types of molecular descriptors and fingerprints. The MACCS-Mordred model demonstrated the best performance, achieving a balanced accuracy of 0.885 and a precision score of 0.922 in the testing data set. In addition, we performed an activity cliff analysis to examine potential sources of mispredictions. Applicability domain analysis further confirmed the robustness of the model, indicating that most compounds in our data set fell within the reliable prediction space. Notably, feature importance analysis revealed that mutagenic compounds are more likely to contain nitrogen-containing and ring-related substructures, offering insights into structural characteristics associated with mutagenic risk. Our results support AI-enabled screening tools for prioritizing hazardous compounds and improving early stage chemical risk assessment. This work provides practical value for environmental monitoring and regulatory decision-making.true[SDGs]SDG3Establishment of an Integrated Model for Predicting Compound Mutagenicity with a Feature Importance Analysisjournal article10.1021/acs.jcim.5c015862-s2.0-105021084923