Literature DB >> 33709119

PreDTIs: prediction of drug-target interactions based on multiple feature information using gradient boosting framework with data balancing and feature selection techniques.

S M Hasan Mahmud1, Wenyu Chen1, Yongsheng Liu1, Md Abdul Awal2, Kawsar Ahmed3, Md Habibur Rahman4, Mohammad Ali Moni5.   

Abstract

Discovering drug-target (protein) interactions (DTIs) is of great significance for researching and developing novel drugs, having a tremendous advantage to pharmaceutical industries and patients. However, the prediction of DTIs using wet-lab experimental methods is generally expensive and time-consuming. Therefore, different machine learning-based methods have been developed for this purpose, but there are still substantial unknown interactions needed to discover. Furthermore, data imbalance and feature dimensionality problems are a critical challenge in drug-target datasets, which can decrease the classifier performances that have not been significantly addressed yet. This paper proposed a novel drug-target interaction prediction method called PreDTIs. First, the feature vectors of the protein sequence are extracted by the pseudo-position-specific scoring matrix (PsePSSM), dipeptide composition (DC) and pseudo amino acid composition (PseAAC); and the drug is encoded with MACCS substructure fingerings. Besides, we propose a FastUS algorithm to handle the class imbalance problem and also develop a MoIFS algorithm to remove the irrelevant and redundant features for getting the best optimal features. Finally, balanced and optimal features are provided to the LightGBM Classifier to identify DTIs, and the 5-fold CV validation test method was applied to evaluate the prediction ability of the proposed method. Prediction results indicate that the proposed model PreDTIs is significantly superior to other existing methods in predicting DTIs, and our model could be used to discover new drugs for unknown disorders or infections, such as for the coronavirus disease 2019 using existing drugs compounds and severe acute respiratory syndrome coronavirus 2 protein sequences.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  SARS-CoV-2; data imbalance; drug chemical structure; drug–target interaction; feature selection; protein sequence

Year:  2021        PMID: 33709119      PMCID: PMC7989622          DOI: 10.1093/bib/bbab046

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  66 in total

1.  Protein secondary structure prediction based on position-specific scoring matrices.

Authors:  D T Jones
Journal:  J Mol Biol       Date:  1999-09-17       Impact factor: 5.469

2.  TTD: Therapeutic Target Database.

Authors:  X Chen; Z L Ji; Y Z Chen
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

Review 3.  A guide to drug discovery: Target selection in drug discovery.

Authors:  Jonathan Knowles; Gianni Gromo
Journal:  Nat Rev Drug Discov       Date:  2003-01       Impact factor: 84.694

4.  Nuc-PLoc: a new web-server for predicting protein subnuclear localization by fusing PseAA composition and PsePSSM.

Authors:  Hong-Bin Shen; Kuo-Chen Chou
Journal:  Protein Eng Des Sel       Date:  2007-11-10       Impact factor: 1.650

5.  Predicting drug-target interactions using Lasso with random forest based on evolutionary information and chemical structure.

Authors:  Han Shi; Simin Liu; Junqi Chen; Xuan Li; Qin Ma; Bin Yu
Journal:  Genomics       Date:  2018-12-11       Impact factor: 5.736

6.  A Systematic Prediction of Drug-Target Interactions Using Molecular Fingerprints and Protein Sequences.

Authors:  Yu-An Huang; Zhu-Hong You; Xing Chen
Journal:  Curr Protein Pept Sci       Date:  2018       Impact factor: 3.272

Review 7.  A Survey of Data Mining and Deep Learning in Bioinformatics.

Authors:  Kun Lan; Dan-Tong Wang; Simon Fong; Lian-Sheng Liu; Kelvin K L Wong; Nilanjan Dey
Journal:  J Med Syst       Date:  2018-06-28       Impact factor: 4.460

8.  ChemoPy: freely available python package for computational biology and chemoinformatics.

Authors:  Dong-Sheng Cao; Qing-Song Xu; Qian-Nan Hu; Yi-Zeng Liang
Journal:  Bioinformatics       Date:  2013-03-14       Impact factor: 6.937

9.  Coupled matrix-matrix and coupled tensor-matrix completion methods for predicting drug-target interactions.

Authors:  Maryam Bagherian; Renaid B Kim; Cheng Jiang; Maureen A Sartor; Harm Derksen; Kayvan Najarian
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

10.  PyBioMed: a python library for various molecular representations of chemicals, proteins and DNAs and their interactions.

Authors:  Jie Dong; Zhi-Jiang Yao; Lin Zhang; Feijun Luo; Qinlu Lin; Ai-Ping Lu; Alex F Chen; Dong-Sheng Cao
Journal:  J Cheminform       Date:  2018-03-20       Impact factor: 5.514

View more
  6 in total

1.  AntiDMPpred: a web service for identifying anti-diabetic peptides.

Authors:  Xue Chen; Jian Huang; Bifang He
Journal:  PeerJ       Date:  2022-06-14       Impact factor: 3.061

2.  DeepMHADTA: Prediction of Drug-Target Binding Affinity Using Multi-Head Self-Attention and Convolutional Neural Network.

Authors:  Lei Deng; Yunyun Zeng; Hui Liu; Zixuan Liu; Xuejun Liu
Journal:  Curr Issues Mol Biol       Date:  2022-05-19       Impact factor: 2.976

3.  Identification of Crucial Genes and Key Functions in Type 2 Diabetic Hearts by Bioinformatic Analysis.

Authors:  Xin Huang; Kai-Jie Zhang; Jun-Jie Jiang; Shou-Yin Jiang; Jia-Bin Lin; Yi-Jia Lou
Journal:  Front Endocrinol (Lausanne)       Date:  2022-02-15       Impact factor: 5.555

4.  An ensemble-based drug-target interaction prediction approach using multiple feature information with data balancing.

Authors:  Heba El-Behery; Abdel-Fattah Attia; Nawal El-Fishawy; Hanaa Torkey
Journal:  J Biol Eng       Date:  2022-08-08       Impact factor: 6.248

5.  A Bioinformatic Approach Based on Systems Biology to Determine the Effects of SARS-CoV-2 Infection in Patients with Hypertrophic Cardiomyopathy.

Authors:  Xiao Han; Fei Wang; Ping Yang; Bin Di; Xiangdong Xu; Chunya Zhang; Man Yao; Yaping Sun; Yangyi Lin
Journal:  Comput Math Methods Med       Date:  2022-09-27       Impact factor: 2.809

6.  ACP-DA: Improving the Prediction of Anticancer Peptides Using Data Augmentation.

Authors:  Xian-Gan Chen; Wen Zhang; Xiaofei Yang; Chenhong Li; Hengling Chen
Journal:  Front Genet       Date:  2021-06-30       Impact factor: 4.599

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.