Literature DB >> 25437231

Identification of human drug targets using machine-learning algorithms.

Priyanka Kumari1, Abhigyan Nath1, Radha Chaube2.   

Abstract

Identification of potential drug targets is a crucial task in the drug-discovery pipeline. Successful identification of candidate drug targets in entire genomes is very useful, and computational prediction methods can speed up this process. In the current work we have developed a sequence-based prediction method for the successful identification and discrimination of human drug target proteins, from human non-drug target proteins. The training features include sequence-based features, such as amino acid composition, amino acid property group composition, and dipeptide composition for generating predictive models. The classification of human drug target proteins presents a classic example of class imbalance. We have addressed this issue by using SMOTE (Synthetic Minority Over-sampling Technique) as a preprocessing step, for balancing the training data with a ratio of 1:1 between drug targets (minority samples) and non-drug targets (majority samples). Using ensemble classification learning method-Rotation Forest and ReliefF feature-selection technique for selecting the optimal subset of salient features, the best model with selected features can achieve 87.1% sensitivity, 83.6% specificity, and 85.3% accuracy, with 0.71 Matthews correlation coefficient (mcc) on a tenfold stratified cross-validation test. The subset of identified optimal features may help in assessing the compositional patterns in human drug targets. For further validation, using a rigorous leave-one-out cross-validation test, the model achieved 88.1% sensitivity, 83.0% specificity, 85.5% accuracy, and 0.712 mcc. The proposed method was tested on a second dataset, for which the current pipeline gave promising results. We suggest that the present approach can be applied successfully as a complementary tool to existing methods for novel drug target prediction.
Copyright © 2014 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Dipeptide composition; Drug targets; Ensemble learning; Property group composition; ReliefF; SMOTE

Mesh:

Year:  2014        PMID: 25437231     DOI: 10.1016/j.compbiomed.2014.11.008

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  8 in total

1.  Recent trends in artificial intelligence-driven identification and development of anti-neurodegenerative therapeutic agents.

Authors:  Kushagra Kashyap; Mohammad Imran Siddiqi
Journal:  Mol Divers       Date:  2021-07-19       Impact factor: 3.364

2.  Use of big data in drug development for precision medicine: an update.

Authors:  Tongqi Qian; Shijia Zhu; Yujin Hoshida
Journal:  Expert Rev Precis Med Drug Dev       Date:  2019-05-20

3.  Probing an optimal class distribution for enhancing prediction and feature characterization of plant virus-encoded RNA-silencing suppressors.

Authors:  Abhigyan Nath; Karthikeyan Subbiah
Journal:  3 Biotech       Date:  2016-03-21       Impact factor: 2.406

4.  Identification of DEP domain-containing proteins by a machine learning method and experimental analysis of their expression in human HCC tissues.

Authors:  Zhijun Liao; Xinrui Wang; Yeting Zeng; Quan Zou
Journal:  Sci Rep       Date:  2016-12-21       Impact factor: 4.379

5.  Genome-wide investigation of gene-cancer associations for the prediction of novel therapeutic targets in oncology.

Authors:  Adrián Bazaga; Dan Leggate; Hendrik Weisser
Journal:  Sci Rep       Date:  2020-07-01       Impact factor: 4.379

6.  Prediction of Drug Targets for Specific Diseases Leveraging Gene Perturbation Data: A Machine Learning Approach.

Authors:  Kai Zhao; Yujia Shi; Hon-Cheong So
Journal:  Pharmaceutics       Date:  2022-01-20       Impact factor: 6.321

Review 7.  A Review on Applications of Computational Methods in Drug Screening and Design.

Authors:  Xiaoqian Lin; Xiu Li; Xubo Lin
Journal:  Molecules       Date:  2020-03-18       Impact factor: 4.411

8.  Systems biology and machine learning approaches identify drug targets in diabetic nephropathy.

Authors:  Maryam Abedi; Hamid Reza Marateb; Mohammad Reza Mohebian; Seyed Hamid Aghaee-Bakhtiari; Seyed Mahdi Nassiri; Yousof Gheisari
Journal:  Sci Rep       Date:  2021-12-06       Impact factor: 4.379

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.