Literature DB >> 28423655

Sequence-based predictive modeling to identify cancerlectins.

Hong-Yan Lai1, Xin-Xin Chen1, Wei Chen1,2, Hua Tang3, Hao Lin1.   

Abstract

Lectins are a diverse type of glycoproteins or carbohydrate-binding proteins that have a wide distribution to various species. They can specially identify and exclusively bind to a certain kind of saccharide groups. Cancerlectins are a group of lectins that are closely related to cancer and play a major role in the initiation, survival, growth, metastasis and spread of tumor. Several computational methods have emerged to discriminate cancerlectins from non-cancerlectins, which promote the study on pathogenic mechanisms and clinical treatment of cancer. However, the predictive accuracies of most of these techniques are very limited. In this work, by constructing a benchmark dataset based on the CancerLectinDB database, a new amino acid sequence-based strategy for feature description was developed, and then the binomial distribution was applied to screen the optimal feature set. Ultimately, an SVM-based predictor was performed to distinguish cancerlectins from non-cancerlectins, and achieved an accuracy of 77.48% with AUC of 85.52% in jackknife cross-validation. The results revealed that our prediction model could perform better comparing with published predictive tools.

Entities:  

Keywords:  SVM; binomial distribution; cancerlectins; optimal tripeptides

Mesh:

Substances:

Year:  2017        PMID: 28423655      PMCID: PMC5438640          DOI: 10.18632/oncotarget.15963

Source DB:  PubMed          Journal:  Oncotarget        ISSN: 1949-2553


  53 in total

Review 1.  Lectins as bioactive plant proteins: a potential in cancer treatment.

Authors:  Elvira González De Mejía; Valentin I Prisecaru
Journal:  Crit Rev Food Sci Nutr       Date:  2005       Impact factor: 11.176

2.  PseKRAAC: a flexible web server for generating pseudo K-tuple reduced amino acids composition.

Authors:  Yongchun Zuo; Yuan Li; Yingli Chen; Guangpeng Li; Zhenhe Yan; Lei Yang
Journal:  Bioinformatics       Date:  2016-08-26       Impact factor: 6.937

Review 3.  A glycobiology review: carbohydrates, lectins and implications in cancer therapeutics.

Authors:  Haike Ghazarian; Brian Idoni; Steven B Oppenheimer
Journal:  Acta Histochem       Date:  2010-03-02       Impact factor: 2.479

4.  SMOQ: a tool for predicting the absolute residue-specific quality of a single protein model with support vector machines.

Authors:  Renzhi Cao; Zheng Wang; Yiheng Wang; Jianlin Cheng
Journal:  BMC Bioinformatics       Date:  2014-04-28       Impact factor: 3.169

5.  iDPF-PseRAAAC: A Web-Server for Identifying the Defensin Peptide Family and Subfamily Using Pseudo Reduced Amino Acid Alphabet Composition.

Authors:  Yongchun Zuo; Yang Lv; Zhuying Wei; Lei Yang; Guangpeng Li; Guoliang Fan
Journal:  PLoS One       Date:  2015-12-29       Impact factor: 3.240

6.  DeepQA: improving the estimation of single protein model quality with deep belief networks.

Authors:  Renzhi Cao; Debswapna Bhattacharya; Jie Hou; Jianlin Cheng
Journal:  BMC Bioinformatics       Date:  2016-12-05       Impact factor: 3.169

7.  Some remarks on protein attribute prediction and pseudo amino acid composition.

Authors:  Kuo-Chen Chou
Journal:  J Theor Biol       Date:  2010-12-17       Impact factor: 2.691

8.  CD-HIT: accelerated for clustering the next-generation sequencing data.

Authors:  Limin Fu; Beifang Niu; Zhengwei Zhu; Sitao Wu; Weizhong Li
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

9.  Accurate prediction of nuclear receptors with conjoint triad feature.

Authors:  Hongchu Wang; Xuehai Hu
Journal:  BMC Bioinformatics       Date:  2015-12-03       Impact factor: 3.169

10.  Prediction of phosphothreonine sites in human proteins by fusing different features.

Authors:  Ya-Wei Zhao; Hong-Yan Lai; Hua Tang; Wei Chen; Hao Lin
Journal:  Sci Rep       Date:  2016-10-04       Impact factor: 4.379

View more
  31 in total

1.  IDDLncLoc: Subcellular Localization of LncRNAs Based on a Framework for Imbalanced Data Distributions.

Authors:  Yan Wang; Xiaopeng Zhu; Lili Yang; Xuemei Hu; Kai He; Cuinan Yu; Shaoqing Jiao; Jiali Chen; Rui Guo; Sen Yang
Journal:  Interdiscip Sci       Date:  2022-02-22       Impact factor: 2.233

2.  IonchanPred 2.0: A Tool to Predict Ion Channels and Their Types.

Authors:  Ya-Wei Zhao; Zhen-Dong Su; Wuritu Yang; Hao Lin; Wei Chen; Hua Tang
Journal:  Int J Mol Sci       Date:  2017-08-24       Impact factor: 5.923

3.  iRNA-3typeA: Identifying Three Types of Modification at RNA's Adenosine Sites.

Authors:  Wei Chen; Pengmian Feng; Hui Yang; Hui Ding; Hao Lin; Kuo-Chen Chou
Journal:  Mol Ther Nucleic Acids       Date:  2018-03-30       Impact factor: 8.886

4.  iRSpot-Pse6NC: Identifying recombination spots in Saccharomyces cerevisiae by incorporating hexamer composition into general PseKNC.

Authors:  Hui Yang; Wang-Ren Qiu; Guoqing Liu; Feng-Biao Guo; Wei Chen; Kuo-Chen Chou; Hao Lin
Journal:  Int J Biol Sci       Date:  2018-05-22       Impact factor: 6.580

5.  A novel feature ranking method for prediction of cancer stages using proteomics data.

Authors:  Ehsan Saghapour; Saeed Kermani; Mohammadreza Sehhati
Journal:  PLoS One       Date:  2017-09-21       Impact factor: 3.240

6.  mLoc-mRNA: predicting multiple sub-cellular localization of mRNAs using random forest algorithm coupled with feature selection via elastic net.

Authors:  Prabina Kumar Meher; Anil Rai; Atmakuri Ramakrishna Rao
Journal:  BMC Bioinformatics       Date:  2021-06-24       Impact factor: 3.169

7.  AIPpred: Sequence-Based Prediction of Anti-inflammatory Peptides Using Random Forest.

Authors:  Balachandran Manavalan; Tae H Shin; Myeong O Kim; Gwang Lee
Journal:  Front Pharmacol       Date:  2018-03-27       Impact factor: 5.810

8.  PrESOgenesis: A two-layer multi-label predictor for identifying fertility-related proteins using support vector machine and pseudo amino acid composition approach.

Authors:  Mohammad Reza Bakhtiarizadeh; Maryam Rahimi; Abdollah Mohammadi-Sangcheshmeh; Vahid Shariati J; Seyed Alireza Salami
Journal:  Sci Rep       Date:  2018-06-13       Impact factor: 4.379

9.  iterb-PPse: Identification of transcriptional terminators in bacterial by incorporating nucleotide properties into PseKNC.

Authors:  Yongxian Fan; Wanru Wang; Qingqi Zhu
Journal:  PLoS One       Date:  2020-05-15       Impact factor: 3.240

10.  Characterization of the relationship between FLI1 and immune infiltrate level in tumour immune microenvironment for breast cancer.

Authors:  Shiyuan Wang; Yakun Wang; Chunlu Yu; Yiyin Cao; Yao Yu; Yi Pan; Dongqing Su; Qianzi Lu; Wuritu Yang; Yongchun Zuo; Lei Yang
Journal:  J Cell Mol Med       Date:  2020-04-05       Impact factor: 5.310

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.