| Literature DB >> 21829676 |
Yen-Tsung Huang1, Xihong Lin, Lucian R Chirieac, Ray McGovern, John C Wain, Rebecca S Heist, Vidar Skaug, Shanbeh Zienolddiny, Aage Haugen, Li Su, David C Christiani.
Abstract
Lung cancer, of which more than 80% is non-small cell, is the leading cause of cancer-related death in the United States. Copy number alterations (CNAs) in lung cancer have been shown to be positionally clustered in certain genomic regions. However, it remains unclear whether genes with copy number changes are functionally clustered. Using a dense single nucleotide polymorphism array, we performed genome-wide copy number analyses of a large collection of non-small cell lung tumors (n = 301). We proposed a formal statistical test for CNAs between different groups (e.g., non-involved lung vs. tumors, early vs. late stage tumors). We also customized the gene set enrichment analysis (GSEA) algorithm to investigate the overrepresentation of genes with CNAs in predefined biological pathways and gene sets (i.e., functional clustering). We found that CNAs events increase substantially from germline, early stage to late stage tumor. In addition to genomic position, CNAs tend to occur away from the gene locations, especially in germline, non-involved tissue and early stage tumors. Such tendency decreases from germline to early stage and then to late stage tumors, suggesting a relaxation of selection during tumor progression. Furthermore, genes with CNAs in non-small cell lung tumors were enriched in certain gene sets and biological pathways that play crucial roles in oncogenesis and cancer progression, demonstrating the functional aspect of CNAs in the context of biological pathways that were overlooked previously. We conclude that CNAs increase with disease progression and CNAs are both positionally and functionally clustered. The potential functional capabilities acquired via CNAs may be sufficient for normal cells to transform into malignant cells.Entities:
Mesh:
Year: 2011 PMID: 21829676 PMCID: PMC3149069 DOI: 10.1371/journal.pone.0022961
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Genome-wide copy number alterations in blood, non-involved lung tissue and tumor of NSCLC patients.
The x axis represents genomic locations, which were ordered by the somatic chromosomes. The y axis represents the prevalence (%) of NSCLC patients having copy number ≥2.7 (red or pink) and ≤1.3 (blue or light blue) in non-involved lung tissue and total tumor (A), early stage tumor and late stage tumor (B), early stage tumor of adenocarcinoma (C) and early stage tumor of squamous cell carcinoma (SCC) (D). The corresponding plots of −log10(q values) were shown in Fig. S1. The p values of comparing genome-wide CNAs patterns between non-involved tissue samples and total tumors are <0.0001 for gains and 0.40 for losses by the permutation-based global test with details described in Methods. The p values comparing early stage and late stage tumors are <0.0001 for gains and 0.046 for losses; the p values comparing early stage adenocarcinoma (C) and early stage squamous cell carcinoma (D) are 0.016 for gains and 0.027 for losses.
Figure 2Total probes (TP) and probes locating within genes (GP) in which CNAs were detected, and the mean of G/T ratios.
A, B, Counts of the total probes (TP) where CNAs events (A: copy number gains, B: copy number losses) occur were plotted for blood and non-involved tissue, total tumor, early stage tumor, late stage tumor, early stage adenocarcinoma (ACA) and early stage squamous cell carcinoma (SCC). C, D, Counts of the probes within genes (GP) in which CNAs events (C: copy number gains, D: copy number losses) were detected in the same six subgroups. E, F, Mean and its 95% confidence interval of G/T ratios in the six subgroups for copy number gains (E) and losses (F); and the dashed lines represent the null G/T ratio on the chip (104,256/256,554 = 40.64%). Non-tumor: blood (n = 63) and non-involved tissue (n = 50); All tumor: total 301 NSCLC tumors; Early tumor: stage I and II NSCLC tumors (n = 246); Late tumor: stage III and IV NSCLC tumors (n = 25); Early ACA: early stage adenocarcinoma tumors (n = 208); Early SCC: early stage squamous cell carcinoma tumors (n = 93).
Thirty two oncogenes with >10% NSCLC patients having copy number gains.
| Symbol | Full name of oncogene | Position | % of patients with CN gains | P value | |
| 1 |
| Triple functional domain | 5p15.2 | 38.21 | 3.30×10−50 |
| 2 |
| epidermal growth factor receptor | 7p12 | 26.58 | 2.26×10−23 |
| 3 |
| v-erb-a erythroblastic leukemia viral oncogene homolog 4 | 2q33.3–34 | 26.58 | 2.26×10−23 |
| 4 |
| Runt-related transcription factor 1 | 8q22 | 26.25 | 9.97×10−23 |
| 5 |
| v-abl Abelson murine leukemia viral oncogene homolog 2 | 1q24–25 | 23.59 | 7.66×10−18 |
| 6 |
| ets variant 5 | 3q28 | 22.92 | 1.07×10−16 |
| 7 |
| ets variant 1 | 7p21.3 | 22.26 | 1.38×10−15 |
| 8 |
| ataxin 2-binding protein 1 | 16p13.3 | 22.26 | 1.38×10−15 |
| 9 |
| v-myc myelocytomatosis viral oncogene homolog (avian) | 8q24.21 | 21.59 | 1.66×10−14 |
| 10 |
| telomerase reverse transcriptase | 5p15.33 | 20.60 | 5.97×10−13 |
| 11 |
| v-akt murine thymoma viral oncogene homolog 3 | 1q43–44 | 20.60 | 5.97×10−13 |
| 12 |
| v-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog | 12p12.1 | 20.27 | 1.90×10−12 |
| 13 |
| ets variant 6 | 12p13 | 19.93 | 5.90×10−12 |
| 14 |
| protein tyrosine phosphatase, non-receptor type 1 | 20q13.1–13.2 | 19.27 | 5.37×10−11 |
| 15 |
| phosphoinositide-3-kinase, catalytic, alpha polypeptide | 3q26.3 | 18.60 | 4.50×10−10 |
| 16 |
| progesterone receptor | 11q22–23 | 16.61 | 1.58×10−7 |
| 17 |
| v-raf murine sarcoma viral oncogene homolog B1 | 7q34 | 15.95 | 9.28×10−7 |
| 18 |
| v-ets erythroblastosis virus E26 oncogene homolog (avian) | 21q22.3 | 15.61 | 2.17×10−6 |
| 19 |
| CUB and Sushi multiple domains 1 | 8p23.2 | 15.28 | 4.97×10−6 |
| 20 |
| E74-like factor 5 (ets domain transcription factor) | 11p13–12 | 14.95 | 1.11×10−5 |
| 21 |
| phosphodiesterase 4D | 5q12 | 14.62 | 2.41×10−5 |
| 22 |
| fibroblast growth factor receptor 1 | 8p11.2–11.1 | 14.62 | 2.41×10−5 |
| 23 |
| EPH receptor A3 | 3p11.2 | 14.62 | 2.41×10−5 |
| 24 |
| cyclin-dependent kinase 6 | 7q21–q22 | 14.62 | 2.41×10−5 |
| 25 |
| Estrogen receptor 2 | 14q23.2 | 14.62 | 2.41×10−5 |
| 26 |
| Janus kinase 2 | 9p24 | 14.62 | 2.41×10−5 |
| 27 |
| WW domain containing oxidoreductase | 16q23.3–24.1 | 13.95 | 0.000106 |
| 28 |
| platelet-derived growth factor receptor, alpha polypeptide | 4q11–13 | 13.95 | 0.000106 |
| 29 |
| cyclin D2 | 12p13 | 13.95 | 0.000106 |
| 30 |
| cyclin E1 | 19q12 | 13.62 | 0.000214 |
| 31 |
| retinoblastoma-like 1 | 20q11.2 | 11.30 | 0.0136 |
| 32 |
| met proto-oncogene (hepatocyte growth factor receptor) | 7q31 | 11.30 | 0.0136 |
*P values were calculated to test the significance of observing the percentage (%) of patients with copy number gains (copy number ≥2.7) on the oncogenes, given that the null probability of copy number gains is 0.076, which is empirically estimated from the data.
Sixteen tumor suppressor genes with >1% NSCLC patients having copy number losses.
| Symbol | Full name of tumor suppressor gene | Position | % of patients with copy number losses | P value | |
| 1 |
| cyclin-dependent kinase inhibitor 2A | 9p12 | 9.63 | 1.87×10−19 |
| 2 |
| Fragile histidine triad gene | 3p14.2 | 6.64 | 6.33×10−11 |
| 3 |
| breast cancer 2, early onset | 13q12.3 | 4.65 | 3.2×10−6 |
| 4 |
| transforming growth factor, beta receptor II | 3p22 | 4.31 | 1.55×10−5 |
| 5 |
| Folliculin | 17p11.2 | 3.65 | 0.00029 |
| 6 |
| retinoblastoma 1 | 13q14.2 | 2.66 | 0.012 |
| 7 |
| neurofibromin 2 (merlin) | 22q12.2 | 2.66 | 0.012 |
| 8 |
| phosphatase and tensin homolog | 10q23.3 | 1.66 | 0.19 |
| 9 |
| E1A binding protein p300 | 22q13.2 | 1.66 | 0.19 |
| 10 |
| F-box and WD repeat domain containing 7 | 4q31.3 | 1.66 | 0.19 |
| 11 |
| adenomatous polyposis coli | 5q21–q22 | 1.66 | 0.19 |
| 12 |
| Notch homolog 1, translocation-associated | 9q34.3 | 1.66 | 0.19 |
| 13 |
| FAT tumor suppressor homolog 3 | 11q14.3 | 1.33 | 0.36 |
| 14 |
| SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily b, member 1 | 22q11 | 1.33 | 0.36 |
| 15 |
| mothers against decapentaplegic homolog 4 | 18q21.1 | 1.33 | 0.36 |
| 16 |
| mothers against decapentaplegic homolog 2 | 18q21.1 | 1.33 | 0.36 |
*P values were calculated to test the significance of observing the percentage (%) of patients with copy number losses (copy number ≤1.3) on the tumor suppressor genes, given that the null probability of copy number losses is 0.010, which is empirically estimated from the data.
P values of the selective sixteen out of eighty-nine pathways and gene sets with enrichment of genes with copy number gains (the complete eighty-nine gene sets are listed in Table S3).
| MSigDB ID of gene set: brief description | Blood | Non-involved lung | Discovery set | Validation set | Early stage tumors | Late stage tumors | All tumors | |
| 1 | AGUIRRE_PANCREAS_CHR8: Genes on chromosome 8 with copy-number-driven expression in pancreatic adenocarcinoma. | 0.46 | <5×10−5 | <5×10−5 | <5×10−5 | <5×10−5 | 5×10−5 | <5×10−5 |
| 2 | BRCA_PROGNOSIS_NEG: Genes whose expression is consistently negatively correlated with breast cancer outcomes - higher expression is associated with metastasis and poor prognosis | 0.64 | <5×10−5 | <5×10−5 | <5×10−5 | <5×10−5 | <5×10−5 | <5×10−5 |
| 3 | SMITH_HCV_INDUCED_HCC_UP: Genes highly expressed in hepatitis C-related hepatocellular carcinoma | 0.57 | <5×10−5 | 0.00025 | 5×10−5 | <5×10−5 | 0.00085 | 5×10−5 |
| 4 | HSA04514_CELL_ADHESION_MOLECULES: Genes involved in cell adhesion molecules (CAMs) | 0.029 | 0.39 | 0.00085 | 0.00040 | 0.0049 | 0.00025 | 0.00040 |
| 5 | ONCOGENE: Census of human cancer genes | 0.37 | 0.51 | 0.0011 | 0.0047 | 0.0018 | 0.015 | 0.00065 |
| 6 | G_PROTEIN_SIGNALING: Genes involved in G protein signaling | 0.41 | <5×10−5 | 0.0016 | 0.0021 | 0.0030 | 0.011 | 0.0010 |
| 7 | HSA04610_COMPLEMENT_AND_COAGULATION_CASCADES: Genes involved in complement and coagulation cascades | 0.14 | <5×10−5 | 0.0012 | 0.0028 | 0.00065 | 0.0036 | 0.00145 |
| 8 | CELL_ADHESION: The attachment of a cell, either to another cell or to the extracellular matrix, via cell adhesion molecules. | 0.00050 | 0.76 | 0.0026 | 0.0041 | 0.012 | 0.0012 | 0.0024 |
| 9 | AT1RPATHWAY: Binding of angiotensin II to | 0.00050 | <5×10−5 | 0.0041 | 0.0038 | 0.0068 | 0.016 | 0.0031 |
| 10 | AGUIRRE_PANCREAS_CHR7: Genes on chromosome 7 with copy-number-driven expression in pancreatic adenocarcinoma. | 0.93 | <5×10−5 | 0.018 | 0.00025 | 0.0026 | 0.0019 | 0.0033 |
| 11 | EDG1PATHWAY: The lipid | 0.031 | <5×10−5 | 0.015 | 0.0089 | 0.010 | 0.022 | 0.0085 |
| 12 | LOTEM_LEUKEMIA_UP: Genes upregulated in myeloid leukemia and normally expressed in other bodily tissues. | 0.62 | <5×10−5 | 0.0048 | 0.033 | 0.0077 | 0.041 | 0.0089 |
| 13 | BRCA_ER_NEG: Genes whose expression is consistently negatively correlated with estrogen receptor status in breast cancer - higher expression is associated with | 0.84 | 0.94 | 0.0069 | 0.0099 | 0.0037 | 0.012 | 0.0090 |
| 14 | MCALPAINPATHWAY: In integrin-mediated cell migration, calpains digest links between the actin cytoskeleton and focal adhesion proteins. | 0.80 | <5×10−5 | 0.019 | 0.0080 | 0.0078 | 0.047 | 0.010 |
| 15 | HSA05211_RENAL_CELL_CARCINOMA: Genes involved in renal cell carcinoma | 0.32 | 0.32 | 0.023 | 0.031 | 0.0082 | 0.025 | 0.020 |
| 16 | WNT_SIGNALING: | 0.17 | 0.31 | 0.020 | 0.025 | 0.021 | 0.0077 | 0.024 |
P values of the selective ten out of twenty seven pathways and gene sets with enrichment of genes with copy number losses (the complete twenty-seven gene sets are listed in Table S4).
| MSigDB ID of gene set: brief description | Blood | Non-involved lung | Discovery set | Validation set | Early stage tumors | Late stage tumors | All tumors | |
| 1 | AGUIRRE_PANCREAS_CHR9: Genes on chromosome 9 with copy-number-driven expression in pancreatic adenocarcinoma. | 0.25 | 0.57 | <5×10−5 | 5×10−5 | <5×10−5 | <5×10−5 | <5×10−5 |
| 2 | HSA04080_NEUROACTIVE_LIGAND _RECEPTOR_INTERACTION: Genes involved in neuroactive ligand-receptor interaction | 0.061 | 0.00050 | 0.00070 | <5×10−5 | <5×10−5 | 0.0042 | 5×10−5 |
| 3 | CELL_CELL_SIGNALING: Genes annotated by the GO term GO:0007267. Any process that mediates the transfer of information from one cell to another. | 0.80 | 0.061 | 0.0015 | 0.013 | 0.0017 | 0.0030 | 0.00070 |
| 4 | HSA04140_REGULATION_OF_AUTOPHAGY: Genes involved in regulation of autophagy | 0.16 | <5×10−5 | 0.00065 | 0.0046 | 0.00095 | 0.00010 | 0.00085 |
| 5 | SIGNAL_TRANSDUCTION: Genes involved in signal transduction | 0.21 | 0.23 | 0.012 | 0.0019 | 0.011 | 0.022 | 0.0015 |
| 6 | CELL_SURFACE_RECEPTOR_LINKED_SIGNAL_TRANSDUCTION_GO_0007166: Genes annotated by the GO term GO:0007166. Any series of molecular signals initiated by the binding of an extracellular ligand to a receptor on the surface of the target cell. | 0.35 | 0.034 | 0.021 | 0.0017 | 0.027 | 0.00075 | 0.0031 |
| 7 | CELL_CELL_ADHESION: Genes annotated by the GO term GO:0016337. The attachment of one cell to another cell via adhesion molecules. | 0.53 | 0.069 | 0.0071 | 0.0085 | 0.0028 | 0.00035 | 0.0039 |
| 8 | HDACI_COLON_CUR24HRS_DN: Downregulated by curcumin at 24 hrs in SW260 colon carcinoma cells | 0.92 | <5×10−5 | 0.0070 | 0.0020 | 0.0094 | 0.037 | 0.0050 |
| 9 | REGULATION_OF_MITOTIC_CELL_CYCLE: Genes annotated by the GO term GO:0007346. Any process that modulates the rate or extent of progress through the mitotic cell cycle. | 0.64 | <5×10−5 | 0.0010 | 0.049 | 0.0075 | 0.017 | 0.0053 |
| 10 | CHROMATIN_MODIFICATION: Genes annotated by the GO term GO:0016568. The alteration of DNA or protein in chromatin, which may result in changing the chromatin structure. | 0.98 | 0.71 | 0.019 | 0.022 | 0.049 | 0.0095 | 0.021 |