Machine Learning Models Combined with Virtual Screening and Molecular Docking to Predict Human Topoisomerase I Inhibitors
Machine Learning Models Combined with Virtual Screening and Molecular Docking to Predict Human Topoisomerase I Inhibitors
In this work, random forest (RF), support vector machine, k-nearest neighbor and C4.5 decision tree, were used to establish classification models for predicting whether an unknown molecule is an inhibitor of human topoisomerase I (Top1) protein. All these models have achieved satisfactory results, with total prediction accuracies from 89.70% to 97.12%. Through comparative analysis, it can be found that the RF model has the best forecasting effect. The parameters were further optimized to generate the best-performing RF model. At the same time, features selection was implemented to choose properties most relevant to the inhibition of Top1 from 189 molecular descriptors through a special RF procedure. Subsequently, a ligand-based virtual screening was performed from the Maybridge database by the optimal RF model and 596 hits were picked out. Then, 67 molecules with relative probability scores over 0.7 were selected based on the screening results. Next, the 67 molecules above were docked to Top1 using AutoDock Vina. Finally, six top-ranked molecules with binding energies less than −10.0 kcal/mol were screened out and a common backbone, which is entirely different from that of existing Top1 inhibitors reported in the literature, was found.
- North Sichuan Medical University China (People's Republic of)
- Nantong Institute of Technology China (People's Republic of)
- Nanchang Institute of Technology China (People's Republic of)
- Chengdu Normal University China (People's Republic of)
- Nanyang Institute of Technology China (People's Republic of)
Molecular Structure, Organic chemistry, Quantitative Structure-Activity Relationship, Reproducibility of Results, molecular docking, Molecular Dynamics Simulation, virtual screening, Article, human topoisomerase I, Machine Learning, Molecular Docking Simulation, molecular descriptors, machine learning, QD241-441, ROC Curve, inhibitors, Humans, Computer Simulation, Topoisomerase I Inhibitors, Algorithms
Molecular Structure, Organic chemistry, Quantitative Structure-Activity Relationship, Reproducibility of Results, molecular docking, Molecular Dynamics Simulation, virtual screening, Article, human topoisomerase I, Machine Learning, Molecular Docking Simulation, molecular descriptors, machine learning, QD241-441, ROC Curve, inhibitors, Humans, Computer Simulation, Topoisomerase I Inhibitors, Algorithms
1 Research products, page 1 of 1
- 2004IsRelatedTo
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).13 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Top 10% influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Average impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Top 10%
