A general optimization protocol for molecular property prediction using a deep learning network

Chen J.-H; Tseng Y.J.

doi:10.1093/bib/bbab367

Journal

Briefings in bioinformatics

Journal Volume

23

Journal Issue

1

Date Issued

2022

Author(s)

Chen J.-H

YUFENG JANE TSENG

DOI

10.1093/bib/bbab367

URI

https://www.scopus.com/inward/record.uri?eid=2-s2.0-85123814144&doi=10.1093%2fbib%2fbbab367&partnerID=40&md5=38c59138ad63634cf8fd157af8b53763

https://scholars.lib.ntu.edu.tw/handle/123456789/607494

Abstract

The key to generating the best deep learning model for predicting molecular properties is to test and apply various optimization methods. Individual optimization methods from past works outside the pharmaceutical domain have each succeeded in improving model performance, but greater improvement may be achieved when specific combinations of these methods and practices are applied. In this work, three high-performance optimization methods from the literature that have been shown to dramatically improve model performance in other fields are used and discussed, eventually resulting in a general procedure for generating optimized CNN models for different molecular properties. The three techniques are a dynamic batch size strategy for different enumeration ratios of the SMILES representation of compounds; Bayesian optimization for selecting the hyperparameters of a model; and feature learning using chemical features obtained by a feedforward neural network, which are concatenated with the learned molecular feature vector. A total of seven different molecular properties (water solubility, lipophilicity, hydration energy, electronic properties, blood-brain barrier permeability and inhibition) are used. We demonstrate how each of the three techniques can affect the model and how the best model can generally benefit from using Bayesian optimization combined with dynamic batch size tuning. © The Author(s) 2021. Published by Oxford University Press.
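
The third technique named in the abstract, passing chemical features through a feedforward network and concatenating the result with the learned molecular feature vector, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names (`dense_relu`, `fuse_features`), the single ReLU layer, and all numeric values are hypothetical and chosen only to show the shape of the operation.

```python
def dense_relu(x, weights, biases):
    """One feedforward layer: ReLU(W x + b), with W given as a list of rows."""
    return [
        max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(weights, biases)
    ]


def fuse_features(learned_vec, descriptors, weights, biases):
    """Concatenate the learned molecular vector with the processed descriptors."""
    chem_vec = dense_relu(descriptors, weights, biases)
    return learned_vec + chem_vec  # list concatenation, not element-wise addition


# Hypothetical values for illustration only (assumptions, not from the paper).
learned_vec = [0.2, -0.1, 0.7]        # stands in for a CNN output over a SMILES string
descriptors = [1.0, 2.0]              # stands in for precomputed chemical descriptors
weights = [[0.5, -0.25], [1.0, 1.0]]  # 2 hidden units x 2 descriptors
biases = [0.0, -1.0]

fused = fuse_features(learned_vec, descriptors, weights, biases)
print(len(fused))  # 3 learned dimensions + 2 descriptor dimensions
```

In a model of this kind, the fused vector would typically feed the final prediction layers, so the descriptor branch adds dimensions to the learned representation without altering it.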

Subjects

CNN

deep learning

drug discovery

optimization

article

blood brain barrier

facial expression

feed forward neural network

hydration

lipophilicity

prediction

water solubility

SDGs

[SDGs]SDG3

[SDGs]SDG6

Other Subjects

Bayes Theorem; Deep Learning; Neural Networks, Computer; Solubility

Type

journal article
