Literature DB >> 25513722

iDrug-Target: predicting the interactions between drug compounds and target proteins in cellular networking via benchmark dataset optimization approach.

Xuan Xiao1, Jian-Liang Min, Wei-Zhong Lin, Zi Liu, Xiang Cheng, Kuo-Chen Chou.   

Abstract

Information about the interactions of drug compounds with proteins in cellular networking is very important for drug development. Unfortunately, all the existing predictors for identifying drug-protein interactions were trained by a skewed benchmark data-set where the number of non-interactive drug-protein pairs is overwhelmingly larger than that of the interactive ones. Using this kind of highly unbalanced benchmark data-set to train predictors would lead to the outcome that many interactive drug-protein pairs might be mispredicted as non-interactive. Since the minority interactive pairs often contain the most important information for drug design, it is necessary to minimize this kind of misprediction. In this study, we adopted the neighborhood cleaning rule and synthetic minority over-sampling technique to treat the skewed benchmark datasets and balance the positive and negative subsets. The new benchmark datasets thus obtained are called the optimized benchmark datasets, based on which a new predictor called iDrug-Target was developed that contains four sub-predictors: iDrug-GPCR, iDrug-Chl, iDrug-Ezy, and iDrug-NR, specialized for identifying the interactions of drug compounds with GPCRs (G-protein-coupled receptors), ion channels, enzymes, and NR (nuclear receptors), respectively. Rigorous cross-validations on a set of experiment-confirmed datasets have indicated that these new predictors remarkably outperformed the existing ones for the same purpose. To maximize users' convenience, a public accessible Web server for iDrug-Target has been established at http://www.jci-bioinfo.cn/iDrug-Target/ , by which users can easily get their desired results. It has not escaped our notice that the aforementioned strategy can be widely used in many other areas as well.

Keywords:  NCR; SMOTE; chou’s PseAAC; iDrug-Chl; iDrug-Ezy; iDrug-GPCR; iDrug-NR; molecular fingerprints; optimized training data-set; target-jackknife validation

Mesh:

Substances:

Year:  2015        PMID: 25513722     DOI: 10.1080/07391102.2014.998710

Source DB:  PubMed          Journal:  J Biomol Struct Dyn        ISSN: 0739-1102


  41 in total

1.  Prediction of Protein-Protein Interaction Sites with Machine-Learning-Based Data-Cleaning and Post-Filtering Procedures.

Authors:  Guang-Hui Liu; Hong-Bin Shen; Dong-Jun Yu
Journal:  J Membr Biol       Date:  2015-11-12       Impact factor: 1.843

2.  DAKB-GPCRs: An Integrated Computational Platform for Drug Abuse Related GPCRs.

Authors:  Maozi Chen; Yankang Jing; Lirong Wang; Zhiwei Feng; Xiang-Qun Xie
Journal:  J Chem Inf Model       Date:  2019-03-14       Impact factor: 4.956

3.  Classifying Multifunctional Enzymes by Incorporating Three Different Models into Chou's General Pseudo Amino Acid Composition.

Authors:  Hong-Liang Zou; Xuan Xiao
Journal:  J Membr Biol       Date:  2016-04-25       Impact factor: 1.843

4.  Computational Methods for Drug Repurposing.

Authors:  Rosaria Valentina Rapicavoli; Salvatore Alaimo; Alfredo Ferro; Alfredo Pulvirenti
Journal:  Adv Exp Med Biol       Date:  2022       Impact factor: 2.622

5.  Imbalanced multi-label learning for identifying antimicrobial peptides and their functional types.

Authors:  Weizhong Lin; Dong Xu
Journal:  Bioinformatics       Date:  2016-08-26       Impact factor: 6.937

6.  Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition.

Authors:  Khurshid Ahmad; Muhammad Waris; Maqsood Hayat
Journal:  J Membr Biol       Date:  2016-01-08       Impact factor: 1.843

7.  iACP: a sequence-based tool for identifying anticancer peptides.

Authors:  Wei Chen; Hui Ding; Pengmian Feng; Hao Lin; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2016-03-29

8.  Benchmark data for identifying DNA methylation sites via pseudo trinucleotide composition.

Authors:  Zi Liu; Xuan Xiao; Wang-Ren Qiu; Kuo-Chen Chou
Journal:  Data Brief       Date:  2015-05-07

9.  Identifying new targets in leukemogenesis using computational approaches.

Authors:  Archana Jayaraman; Kaiser Jamil; Haseeb A Khan
Journal:  Saudi J Biol Sci       Date:  2015-01-20       Impact factor: 4.219

10.  BioTriangle: a web-accessible platform for generating various molecular representations for chemicals, proteins, DNAs/RNAs and their interactions.

Authors:  Jie Dong; Zhi-Jiang Yao; Ming Wen; Min-Feng Zhu; Ning-Ning Wang; Hong-Yu Miao; Ai-Ping Lu; Wen-Bin Zeng; Dong-Sheng Cao
Journal:  J Cheminform       Date:  2016-06-21       Impact factor: 5.514

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.