| Literature DB >> 28097125 |
André L Fonseca1, Vandeclécio L da Silva1, Marbella M da Fonsêca2, Isabella T J Meira2, Thayná E da Silva2, José E Kroll3, André M Ribeiro-Dos-Santos4, Cléber R Freitas2, Raimundo Furtado2, Jorge E de Souza5, Beatriz Stransky6, Sandro J de Souza2.
Abstract
It is estimated that 10 to 20% of all genes in the human genome encode cell surface proteins and due to their subcellular localization these proteins represent excellent targets for cancer diagnosis and therapeutics. Therefore, a precise characterization of the surfaceome set in different types of tumor is needed. Using TCGA data from 15 different tumor types and a new method to identify cancer genes, the S-score, we identified several potential therapeutic targets within the surfaceome set. This allowed us to expand a previous analysis from us and provided a clear characterization of the human surfaceome in the tumor landscape. Moreover, we present evidence that a three-gene set-WNT5A, CNGA2, and IGSF9B-can be used as a signature associated with shorter survival in breast cancer patients. The data made available here will help the community to develop more efficient diagnostic and therapeutic tools for a variety of tumor types.Entities:
Year: 2016 PMID: 28097125 PMCID: PMC5206789 DOI: 10.1155/2016/8346198
Source DB: PubMed Journal: Int J Genomics ISSN: 2314-436X Impact factor: 2.326
Figure 1Methodology workflow. (a) The NCBI RefSeq dataset was submitted to TMHMM and selected for the presence of a transmembrane domain. Proteins containing only a signal peptide (classified as secreted) or belonging exclusively to other membranes were excluded, giving a final set of 3.758 genes coding for cell surface proteins. (b) TCGA data as used to calculate the S-score for the surfaceome set allowing the identification of putative surfaceome cancer genes.
Figure 2Surfaceome cancer genes identified by the S-score method. (a) Heatmap representation of all 248 surfaceome cancer genes for all 15 tumor types. The S-score distribution is represented as a range of colors (red as negative S-score, blue as positive S-scores). Tumor types are SKCM (Skin Cutaneous Melanoma), COADREAD (Colorectal Adenocarcinoma), LUAD (Lung Adenocarcinoma), UCEC (Uterine Corpus Endometrial Carcinoma), LGG (Low Grade Glioma), LAML (Acute Mieloid Leukemia), SARC (Sarcoma), OV (Ovarian Serous Carcinoma), PRAD (Prostate Adenocarcinoma), GBM (Glioblastoma), KIRC (Kidney Renal Clear Cell Carcinoma), BRCA (Breast Invasive Carcinoma), LUSC (Lung Squamous Cell Carcinoma), BLCA (Bladder Adenocarcinoma), and HSNC (Head-Neck Squamous Cell Carcinoma). (b) GO enrichment analysis for the three groups of genes as identified in (Figure 2(a)). The adjusted p values are sorted from least (blue) to most (red) significant. Furthermore, the dot size is based on gene ratio, which is the observed number of genes in the experimental set within the respective gene ontology category.
Figure 3Kaplan-Meier overall survival curves in breast cancer patients. Samples were classified as having the three-gene signature (WNT5A, CNGA2, and IGSF9B) altered (red) or not (blue).