| Literature DB >> 26292924 |
Li Peng1, Xiu Wu Bian2, Di Kang Li3, Chuan Xu2,4, Guang Ming Wang5, Qing You Xia1, Qing Xiong3.
Abstract
The Cancer Genome Atlas (TCGA) has accrued RNA-Seq-based transcriptome data for more than 4000 cancer tissue samples across 12 cancer types, translating these data into biological insights remains a major challenge. We analyzed and compared the transcriptomes of 4043 cancer and 548 normal tissue samples from 21 TCGA cancer types, and created a comprehensive catalog of gene expression alterations for each cancer type. By clustering genes into co-regulated gene sets, we identified seven cross-cancer gene signatures altered across a diverse panel of primary human cancer samples. A 14-gene signature extracted from these seven cross-cancer gene signatures precisely differentiated between cancerous and normal samples, the predictive accuracy of leave-one-out cross-validation (LOOCV) were 92.04%, 96.23%, 91.76%, 90.05%, 88.17%, 94.29%, and 99.10% for BLCA, BRCA, COAD, HNSC, LIHC, LUAD, and LUSC, respectively. A lung cancer-specific gene signature, containing SFTPA1 and SFTPA2 genes, accurately distinguished lung cancer from other cancer samples, the predictive accuracy of LOOCV for TCGA and GSE5364 data were 95.68% and 100%, respectively. These gene signatures provide rich insights into the transcriptional programs that trigger tumorigenesis and metastasis, and many genes in the signature gene panels may be of significant value to the diagnosis and treatment of cancer.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26292924 PMCID: PMC4544034 DOI: 10.1038/srep13413
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
The percentage of common genes and gene sets in top 3% most differentially expressed genes and gene sets between 12 cancer types*.
*Results from DE gene comparisons lie below the diagonal line; results from DE gene set comparisons are above the diagonal line.
Figure 1Two possible carcinogenic mechanisms.
(1) gene expression aberrations in cell cycle-associated pathways can directly lead to carcinogenesis, these pathways are cross-cancer gene signatures altered across a range of cancer types; (2) gene expression aberrations in organ-specific pathways can indirectly lead to carcinogenesis by interacting with cell cycle-associated pathways, these pathways are cancer-specific gene signatures altered in a single cancer type. Stars represent driver mutations that can alter the expression levels of their target genes.
Figure 2The normalized expression levels of seven cross-cancer gene signatures across 12 types of cancer and normal samples.
Significant gene sets at FDR < 0.15 from 12 types of cancers.
Figure 3The normalized expression levels of gene signatures significantly altered in one type of cancer across 12 types of cancer and normal samples.
List of lung disease-associated genes in CLUSTER1520.
| Symbol | Full name | Lung diseases |
|---|---|---|
| SFTPA2 | surfactant, pulmonary-associated protein A2 | lung cancer |
| SFTPA1 | surfactant, pulmonary-associated protein A1 | lung cancer |
| ROS1 | ROS proto-oncogene 1, receptor tyrosine kinase | lung cancer |
| ABCA3 | ATP-binding cassette, sub-family A (ABC1), member 3 | pulmonary fibrosis |
| AQP4 | aquaporin 4 | lung cancer |
| HHIP | hedgehog interacting protein | lung cancer |
| SLC34A2 | solute carrier family 34, member 2 | pulmonary alveolar microlithiasis |
| IL6R | interleukin 6 receptor | chronic obstructive pulmonary disease |
Figure 4The normalized expression levels of SFTPA1 and SFTPA2 across 12 types of cancer and normal samples.
Figure 5The predictive accuracy and error rates of LOOCV for each cancer type using the 14-gene signature.
Red indicates the predictive accuracy; Blue represents error rates.
Figure 6Pipeline of the analysis.
Number of cancer and normal samples of 12 cancer types.
| Abbreviation | Full Name | Number of Cancers | Number of Normals |
|---|---|---|---|
| BLCA | bladder urothelial carcinoma | 185 | 16 |
| BRCA | breast invasive carcinoma | 955 | 106 |
| COAD | colon adenocarcinoma | 244 | 23 |
| HNSC | head and neck squamous cell carcinoma | 353 | 39 |
| LIHC | liver hepatocellular carcinoma | 123 | 46 |
| LUAD | lung adenocarcinoma | 450 | 58 |
| LUSC | lung squamous cell carcinoma | 398 | 44 |
| KICH | kidney chromophobe | 66 | 25 |
| KIRC | kidney renal clear cell carcinoma | 497 | 72 |
| KIRP | kidney renal papillary cell carcinoma | 127 | 28 |
| PRAD | prostate adenocarcinoma | 166 | 38 |
| THCA | thyroid carcinoma | 479 | 53 |