Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Sequence-based predictive modeling to identify cancerlectins.

Literature DB >> 28423655

Sequence-based predictive modeling to identify cancerlectins.

Hong-Yan Lai¹, Xin-Xin Chen¹, Wei Chen^1,2, Hua Tang³, Hao Lin¹.

Abstract

Lectins are a diverse type of glycoproteins or carbohydrate-binding proteins that have a wide distribution to various species. They can specially identify and exclusively bind to a certain kind of saccharide groups. Cancerlectins are a group of lectins that are closely related to cancer and play a major role in the initiation, survival, growth, metastasis and spread of tumor. Several computational methods have emerged to discriminate cancerlectins from non-cancerlectins, which promote the study on pathogenic mechanisms and clinical treatment of cancer. However, the predictive accuracies of most of these techniques are very limited. In this work, by constructing a benchmark dataset based on the CancerLectinDB database, a new amino acid sequence-based strategy for feature description was developed, and then the binomial distribution was applied to screen the optimal feature set. Ultimately, an SVM-based predictor was performed to distinguish cancerlectins from non-cancerlectins, and achieved an accuracy of 77.48% with AUC of 85.52% in jackknife cross-validation. The results revealed that our prediction model could perform better comparing with published predictive tools.

Entities: Chemical Disease Gene Species

Keywords: SVM; binomial distribution; cancerlectins; optimal tripeptides

Mesh：

Substances：
Lectins

Year: 2017 PMID： 28423655 PMCID： PMC5438640 DOI： 10.18632/oncotarget.15963

Source DB: PubMed Journal: Oncotarget ISSN： 1949-2553

53 in total

Review 1. Lectins as bioactive plant proteins: a potential in cancer treatment.

Authors: Elvira González De Mejía; Valentin I Prisecaru
Journal: Crit Rev Food Sci Nutr Date: 2005 Impact factor: 11.176

2. PseKRAAC: a flexible web server for generating pseudo K-tuple reduced amino acids composition.

Authors: Yongchun Zuo; Yuan Li; Yingli Chen; Guangpeng Li; Zhenhe Yan; Lei Yang
Journal: Bioinformatics Date: 2016-08-26 Impact factor: 6.937

Review 3. A glycobiology review: carbohydrates, lectins and implications in cancer therapeutics.

Authors: Haike Ghazarian; Brian Idoni; Steven B Oppenheimer
Journal: Acta Histochem Date: 2010-03-02 Impact factor: 2.479

4. SMOQ: a tool for predicting the absolute residue-specific quality of a single protein model with support vector machines.

Authors: Renzhi Cao; Zheng Wang; Yiheng Wang; Jianlin Cheng
Journal: BMC Bioinformatics Date: 2014-04-28 Impact factor: 3.169

5. iDPF-PseRAAAC: A Web-Server for Identifying the Defensin Peptide Family and Subfamily Using Pseudo Reduced Amino Acid Alphabet Composition.

Authors: Yongchun Zuo; Yang Lv; Zhuying Wei; Lei Yang; Guangpeng Li; Guoliang Fan
Journal: PLoS One Date: 2015-12-29 Impact factor: 3.240

6. DeepQA: improving the estimation of single protein model quality with deep belief networks.

Authors: Renzhi Cao; Debswapna Bhattacharya; Jie Hou; Jianlin Cheng
Journal: BMC Bioinformatics Date: 2016-12-05 Impact factor: 3.169

7. Some remarks on protein attribute prediction and pseudo amino acid composition.

Authors: Kuo-Chen Chou
Journal: J Theor Biol Date: 2010-12-17 Impact factor: 2.691

8. CD-HIT: accelerated for clustering the next-generation sequencing data.

Authors: Limin Fu; Beifang Niu; Zhengwei Zhu; Sitao Wu; Weizhong Li
Journal: Bioinformatics Date: 2012-10-11 Impact factor: 6.937

9. Accurate prediction of nuclear receptors with conjoint triad feature.

Authors: Hongchu Wang; Xuehai Hu
Journal: BMC Bioinformatics Date: 2015-12-03 Impact factor: 3.169

10. Prediction of phosphothreonine sites in human proteins by fusing different features.

Authors: Ya-Wei Zhao; Hong-Yan Lai; Hua Tang; Wei Chen; Hao Lin
Journal: Sci Rep Date: 2016-10-04 Impact factor: 4.379

31 in total

1. IDDLncLoc: Subcellular Localization of LncRNAs Based on a Framework for Imbalanced Data Distributions.

Authors: Yan Wang; Xiaopeng Zhu; Lili Yang; Xuemei Hu; Kai He; Cuinan Yu; Shaoqing Jiao; Jiali Chen; Rui Guo; Sen Yang
Journal: Interdiscip Sci Date: 2022-02-22 Impact factor: 2.233

2. IonchanPred 2.0: A Tool to Predict Ion Channels and Their Types.

Authors: Ya-Wei Zhao; Zhen-Dong Su; Wuritu Yang; Hao Lin; Wei Chen; Hua Tang
Journal: Int J Mol Sci Date: 2017-08-24 Impact factor: 5.923

3. iRNA-3typeA: Identifying Three Types of Modification at RNA's Adenosine Sites.

Authors: Wei Chen; Pengmian Feng; Hui Yang; Hui Ding; Hao Lin; Kuo-Chen Chou
Journal: Mol Ther Nucleic Acids Date: 2018-03-30 Impact factor: 8.886

4. iRSpot-Pse6NC: Identifying recombination spots in Saccharomyces cerevisiae by incorporating hexamer composition into general PseKNC.

Authors: Hui Yang; Wang-Ren Qiu; Guoqing Liu; Feng-Biao Guo; Wei Chen; Kuo-Chen Chou; Hao Lin
Journal: Int J Biol Sci Date: 2018-05-22 Impact factor: 6.580

5. A novel feature ranking method for prediction of cancer stages using proteomics data.

Authors: Ehsan Saghapour; Saeed Kermani; Mohammadreza Sehhati
Journal: PLoS One Date: 2017-09-21 Impact factor: 3.240

6. mLoc-mRNA: predicting multiple sub-cellular localization of mRNAs using random forest algorithm coupled with feature selection via elastic net.

Authors: Prabina Kumar Meher; Anil Rai; Atmakuri Ramakrishna Rao
Journal: BMC Bioinformatics Date: 2021-06-24 Impact factor: 3.169

7. AIPpred: Sequence-Based Prediction of Anti-inflammatory Peptides Using Random Forest.

Authors: Balachandran Manavalan; Tae H Shin; Myeong O Kim; Gwang Lee
Journal: Front Pharmacol Date: 2018-03-27 Impact factor: 5.810

8. PrESOgenesis: A two-layer multi-label predictor for identifying fertility-related proteins using support vector machine and pseudo amino acid composition approach.

Authors: Mohammad Reza Bakhtiarizadeh; Maryam Rahimi; Abdollah Mohammadi-Sangcheshmeh; Vahid Shariati J; Seyed Alireza Salami
Journal: Sci Rep Date: 2018-06-13 Impact factor: 4.379

9. iterb-PPse: Identification of transcriptional terminators in bacterial by incorporating nucleotide properties into PseKNC.

Authors: Yongxian Fan; Wanru Wang; Qingqi Zhu
Journal: PLoS One Date: 2020-05-15 Impact factor: 3.240

10. Characterization of the relationship between FLI1 and immune infiltrate level in tumour immune microenvironment for breast cancer.

Authors: Shiyuan Wang; Yakun Wang; Chunlu Yu; Yiyin Cao; Yao Yu; Yi Pan; Dongqing Su; Qianzi Lu; Wuritu Yang; Yongchun Zuo; Lei Yang
Journal: J Cell Mol Med Date: 2020-04-05 Impact factor: 5.310