Deep Learning-Based Imbalanced Data Classification for Drug Discovery.

Selçuk Korkmaz

Abstract

Drug discovery studies have become increasingly expensive and time-consuming. In the early phase of drug discovery, an extensive search is performed to find drug-like compounds, which can then be optimized over time to become marketed drugs. One conventional way of detecting active compounds is to perform a high-throughput screening (HTS) experiment. As of July 2019, the PubChem repository contained 1.3 million bioassays generated through HTS experiments. This makes PubChem a valuable resource for applying machine learning algorithms to develop classification models that detect active compounds for drug discovery studies. However, data sets obtained from PubChem are highly imbalanced, and this imbalance has a negative impact on the classification performance of machine learning algorithms. Here, we explored the classification performance of deep neural networks (DNNs) on imbalanced compound data sets after applying various data balancing methods. We used five confirmatory HTS bioassays from the PubChem repository and applied one undersampling and three oversampling methods as data balancing methods. We used a fully connected DNN model with two hidden layers to classify active and inactive molecules. To evaluate the performance of the network, we calculated six performance metrics: balanced accuracy, precision, recall, F1 score, Matthews correlation coefficient, and area under the ROC curve. The results showed that the effect of imbalanced data on network performance could be mitigated to a degree by applying the data balancing methods. The level of imbalance, however, has a negative effect on the performance of the network.
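To make the evaluation setup concrete, the following is a minimal Python sketch of the six metrics named in the abstract, together with simple random undersampling and random oversampling. It is an illustration under stated assumptions, not the author's implementation: the abstract does not name the specific balancing methods used, so the two random samplers are stand-ins, and all function names (`classification_metrics`, `roc_auc`, `random_undersample`, `random_oversample`) are hypothetical. Labels are assumed to be coded 1 for active and 0 for inactive.

```python
import math
import random


def safe_div(num, den):
    """Return num / den, or 0.0 when the denominator is zero."""
    return num / den if den else 0.0


def classification_metrics(y_true, y_pred):
    """Five threshold-based metrics from binary labels (1 = active, 0 = inactive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    recall = safe_div(tp, tp + fn)          # sensitivity
    specificity = safe_div(tn, tn + fp)
    precision = safe_div(tp, tp + fp)
    f1 = safe_div(2 * precision * recall, precision + recall)
    mcc_den = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = safe_div(tp * tn - fp * fn, mcc_den)
    return {
        "balanced_accuracy": (recall + specificity) / 2,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "mcc": mcc,
    }


def roc_auc(y_true, scores):
    """AUC as the probability that a random active outscores a random
    inactive, counting ties as one half (quadratic time; fine for a sketch)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return safe_div(wins, len(pos) * len(neg))


def _split_indices(y):
    """Return (minority_indices, majority_indices) for binary labels."""
    pos = [i for i, t in enumerate(y) if t == 1]
    neg = [i for i, t in enumerate(y) if t == 0]
    return (pos, neg) if len(pos) <= len(neg) else (neg, pos)


def random_undersample(X, y, seed=0):
    """Assumed stand-in: drop random majority rows until classes are equal."""
    rng = random.Random(seed)
    minority, majority = _split_indices(y)
    keep = sorted(minority + rng.sample(majority, len(minority)))
    return [X[i] for i in keep], [y[i] for i in keep]


def random_oversample(X, y, seed=0):
    """Assumed stand-in: duplicate random minority rows until classes are equal."""
    rng = random.Random(seed)
    minority, majority = _split_indices(y)
    if not minority:  # nothing to duplicate
        return list(X), list(y)
    extra = [rng.choice(minority) for _ in range(len(majority) - len(minority))]
    keep = sorted(minority + majority + extra)
    return [X[i] for i in keep], [y[i] for i in keep]
```

In a workflow of this kind, balancing would normally be applied to the training split only, with the metrics computed on an untouched, still-imbalanced test split. Balanced accuracy and the Matthews correlation coefficient are useful here because plain accuracy can look high on imbalanced data even when few actives are recovered.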

Year:  2020        PMID: 32573225     DOI: 10.1021/acs.jcim.9b01162

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  8 in total

1.  Target Prediction Model for Natural Products Using Transfer Learning.

Authors:  Bo Qiang; Junyong Lai; Hongwei Jin; Liangren Zhang; Zhenming Liu
Journal:  Int J Mol Sci       Date:  2021-04-28       Impact factor: 5.923

2.  Deep Learning Algorithms Achieved Satisfactory Predictions When Trained on a Novel Collection of Anticoronavirus Molecules.

Authors:  Emna Harigua-Souiai; Mohamed Mahmoud Heinhane; Yosser Zina Abdelkrim; Oussama Souiai; Ines Abdeljaoued-Tej; Ikram Guizani
Journal:  Front Genet       Date:  2021-11-29       Impact factor: 4.599

3.  Critical Review of Synthesis, Toxicology and Detection of Acyclovir. (Review)

Authors:  Yan-Ping Wei; Liang-Yuan Yao; Yi-Yong Wu; Xia Liu; Li-Hong Peng; Ya-Ling Tian; Jian-Hua Ding; Kang-Hua Li; Quan-Guo He
Journal:  Molecules       Date:  2021-10-29       Impact factor: 4.411

4.  Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study.

Authors:  Emna Harigua-Souiai; Rafeh Oualha; Oussama Souiai; Ines Abdeljaoued-Tej; Ikram Guizani
Journal:  Bioinform Biol Insights       Date:  2022-04-22

5.  Evaluation of Effective Class-Balancing Techniques for CNN-Based Assessment of Aphanomyces Root Rot Resistance in Pea (Pisum sativum L.).

Authors:  L G Divyanth; Afef Marzougui; Maria Jose González-Bernal; Rebecca J McGee; Diego Rubiales; Sindhuja Sankaran
Journal:  Sensors (Basel)       Date:  2022-09-24       Impact factor: 3.847

6.  In silico prediction of chemical-induced hematotoxicity with machine learning and deep learning methods.

Authors:  Yuqing Hua; Yinping Shi; Xueyan Cui; Xiao Li
Journal:  Mol Divers       Date:  2021-07-01       Impact factor: 2.943

7.  Deep Learning Approach for Discovery of In Silico Drugs for Combating COVID-19.

Authors:  Nishant Jha; Deepak Prashar; Mamoon Rashid; Mohammad Shafiq; Razaullah Khan; Catalin I Pruncu; Shams Tabrez Siddiqui; M Saravana Kumar
Journal:  J Healthc Eng       Date:  2021-07-20       Impact factor: 2.682

8.  Experimental Study and Comparison of Imbalance Ensemble Classifiers with Dynamic Selection Strategy.

Authors:  Dongxue Zhao; Xin Wang; Yashuang Mu; Lidong Wang
Journal:  Entropy (Basel)       Date:  2021-06-28       Impact factor: 2.524
