Literature DB >> 18697770

Computational prediction of human proteins that can be secreted into the bloodstream.

Juan Cui1, Qi Liu, David Puett, Ying Xu.   

Abstract

We present a novel computational method for predicting which proteins from highly and abnormally expressed genes in diseased human tissues, such as cancers, can be secreted into the bloodstream, suggesting possible marker proteins for follow-up serum proteomic studies. A main challenging issue in tackling this problem is that our understanding about the downstream localization after proteins are secreted outside the cells is very limited and not sufficient to provide useful hints about secretion to the bloodstream. To bypass this difficulty, we have taken a data mining approach by first collecting, through extensive literature searches, human proteins that are known to be secreted into the bloodstream due to various pathological conditions as detected by previous proteomic studies, and then asking the question: 'what do these secreted proteins have in common in terms of their physical and chemical properties, amino acid sequence and structural features that can be used to predict them?' We have identified a list of features, such as signal peptides, transmembrane domains, glycosylation sites, disordered regions, secondary structural content, hydrophobicity and polarity measures that show relevance to protein secretion. Using these features, we have trained a support vector machine-based classifier to predict protein secretion to the bloodstream. On a large test set containing 98 secretory proteins and 6601 non-secretory proteins of human, our classifier achieved approximately 90% prediction sensitivity and approximately 98% prediction specificity. Several additional datasets are used to further assess the performance of our classifier. On a set of 122 proteins that were found to be of abnormally high abundance in human blood due to various cancers, our program predicted 62 as blood-secreted proteins. By applying our program to abnormally highly expressed genes in gastric cancer and lung cancer tissues detected through microarray gene expression studies, we predicted 13 and 31 as blood secreted, respectively, suggesting that they could serve as potential biomarkers for these two cancers, respectively. Our study demonstrated that our method can provide highly useful information to link genomic and proteomic studies for disease biomarker discovery. Our software can be accessed at http://csbl1.bmb.uga.edu/cgi-bin/Secretion/secretion.cgi.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18697770      PMCID: PMC2562011          DOI: 10.1093/bioinformatics/btn418

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  49 in total

1.  Prediction of membrane protein types based on the hydrophobic index of amino acids.

Authors:  Z P Feng; C T Zhang
Journal:  J Protein Chem       Date:  2000-05

2.  A human transporter protein that mediates the final excretion step for toxic organic cations.

Authors:  Masato Otsuka; Takuya Matsumoto; Riyo Morimoto; Shigeo Arioka; Hiroshi Omote; Yoshinori Moriyama
Journal:  Proc Natl Acad Sci U S A       Date:  2005-12-05       Impact factor: 11.205

3.  Kernel methods for predicting protein-protein interactions.

Authors:  Asa Ben-Hur; William Stafford Noble
Journal:  Bioinformatics       Date:  2005-06       Impact factor: 6.937

4.  Computer prediction of allergen proteins from sequence-derived protein structural and physicochemical properties.

Authors:  Juan Cui; Lian Yi Han; Hu Li; Choong Yong Ung; Zhi Qun Tang; Chan Juan Zheng; Zhi Wei Cao; Yu Zong Chen
Journal:  Mol Immunol       Date:  2006-03-23       Impact factor: 4.407

5.  Biomarker discovery from uveal melanoma secretomes: identification of gp100 and cathepsin D in patient serum.

Authors:  María Pardo; Angel García; Robin Antrobus; María José Blanco; Raymond A Dwek; Nicole Zitzmann
Journal:  J Proteome Res       Date:  2007-06-01       Impact factor: 4.466

6.  Prediction of secondary structural content of proteins from their amino acid composition alone. I. New analytic vector decomposition methods.

Authors:  F Eisenhaber; F Imperiale; P Argos; C Frömmel
Journal:  Proteins       Date:  1996-06

7.  Use of serological proteomic methods to find biomarkers associated with breast cancer.

Authors:  Zhao Rui; Ji Jian-Guo; Tong Yuan-Peng; Pu Hai; Ru Bing-Gen
Journal:  Proteomics       Date:  2003-04       Impact factor: 3.984

8.  Mimicking cellular sorting improves prediction of subcellular localization.

Authors:  Rajesh Nair; Burkhard Rost
Journal:  J Mol Biol       Date:  2005-04-22       Impact factor: 5.469

Review 9.  The unique physiology of solid tumors: opportunities (and problems) for cancer therapy.

Authors:  J M Brown; A J Giaccia
Journal:  Cancer Res       Date:  1998-04-01       Impact factor: 12.701

10.  Secreted and cell surface genes expressed in benign and malignant colorectal tumors.

Authors:  P Buckhaults; C Rago; B St Croix; K E Romans; S Saha; L Zhang; B Vogelstein; K W Kinzler
Journal:  Cancer Res       Date:  2001-10-01       Impact factor: 12.701

View more
  18 in total

1.  In-silico prediction of blood-secretory human proteins using a ranking algorithm.

Authors:  Qi Liu; Juan Cui; Qiang Yang; Ying Xu
Journal:  BMC Bioinformatics       Date:  2010-05-14       Impact factor: 3.169

2.  Identification of N-glycan serum markers associated with hepatocellular carcinoma from mass spectrometry data.

Authors:  Zhiqun Tang; Rency S Varghese; Slavka Bekesova; Christopher A Loffredo; Mohamed Abdul Hamid; Zuzana Kyselova; Yehia Mechref; Milos V Novotny; Radoslav Goldman; Habtom W Ressom
Journal:  J Proteome Res       Date:  2010-01       Impact factor: 4.466

3.  A computational method for prediction of excretory proteins and application to identification of gastric cancer markers in urine.

Authors:  Celine S Hong; Juan Cui; Zhaohui Ni; Yingying Su; David Puett; Fan Li; Ying Xu
Journal:  PLoS One       Date:  2011-02-18       Impact factor: 3.240

4.  A comparative analysis of gene-expression data of multiple cancer types.

Authors:  Kun Xu; Juan Cui; Victor Olman; Qing Yang; David Puett; Ying Xu
Journal:  PLoS One       Date:  2010-10-27       Impact factor: 3.240

5.  The Trypanosoma brucei MitoCarta and its regulation and splicing pattern during development.

Authors:  Xiaobai Zhang; Juan Cui; Daniel Nilsson; Kapila Gunasekera; Astrid Chanfon; Xiaofeng Song; Huinan Wang; Ying Xu; Torsten Ochsenreiter
Journal:  Nucleic Acids Res       Date:  2010-07-26       Impact factor: 16.971

6.  Prediction of body fluids where proteins are secreted into based on protein interaction network.

Authors:  Le-Le Hu; Tao Huang; Yu-Dong Cai; Kuo-Chen Chou
Journal:  PLoS One       Date:  2011-07-29       Impact factor: 3.240

7.  An integrated transcriptomic and computational analysis for biomarker identification in gastric cancer.

Authors:  Juan Cui; Yunbo Chen; Wen-Chi Chou; Liankun Sun; Li Chen; Jian Suo; Zhaohui Ni; Ming Zhang; Xiaoxia Kong; Lisabeth L Hoffman; Jinsong Kang; Yingying Su; Victor Olman; Darryl Johnson; Daniel W Tench; I Jonathan Amster; Ron Orlando; David Puett; Fan Li; Ying Xu
Journal:  Nucleic Acids Res       Date:  2010-10-21       Impact factor: 16.971

8.  Regulation of gene expression in ovarian cancer cells by luteinizing hormone receptor expression and activation.

Authors:  Juan Cui; Brooke M Miner; Joanna B Eldredge; Susanne W Warrenfeltz; Phuongan Dam; Ying Xu; David Puett
Journal:  BMC Cancer       Date:  2011-06-28       Impact factor: 4.430

9.  Exploiting heterogeneous features to improve in silico prediction of peptide status - amyloidogenic or non-amyloidogenic.

Authors:  Smitha Sunil Kumaran Nair; N V Subba Reddy; K S Hareesha
Journal:  BMC Bioinformatics       Date:  2011-11-30       Impact factor: 3.169

10.  A comparative study of gene-expression data of basal cell carcinoma and melanoma reveals new insights about the two cancers.

Authors:  Kun Xu; Xizeng Mao; Minesh Mehta; Juan Cui; Chi Zhang; Ying Xu
Journal:  PLoS One       Date:  2012-01-25       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.