| Literature DB >> 24205279 |
Maria Planck1, Karolina Edlund, Johan Botling, Patrick Micke, Sofi Isaksson, Johan Staaf.
Abstract
INTRODUCTION: In lung adenocarcinoma, the mutational spectrum is dominated by EGFR and KRAS mutations. Improved knowledge about genomic and transcriptional alterations in and between mutation-defined subgroups may identify genes involved in disease development or progression.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24205279 PMCID: PMC3812039 DOI: 10.1371/journal.pone.0078614
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Schematic diagram of genomic and transcriptional analyses.
(A) Genomic analyses. (B) Transcriptional analyses. Individual cohorts are portrayed.
Characteristics of individual aCGH and SNP genomic adenocarcinoma cohorts stratified by microarray platform.
| Genomic cohort | No. of ACA | Microarray platform | Included in CN / GAP analysisB | No. of tumors / cell lines | No. of | No. of stage I/II/III/IV tumorsC | Gender female / maleC | Smoking status NS/SC,D |
|---|---|---|---|---|---|---|---|---|
| Zhao [ | 36 | Affymetrix 100K | Yes/No | 36/0 | - | - | 2/1 | - |
| GSE18252 [ | 4 | Affymetrix 100K | Yes/Yes | 4/0 | 0/4/0 | - | - | - |
| Weir [ | 112 | Affymetrix 250K Sty | Yes/No | 112/0 | - | - | - | - |
| Weir [ | 196 | Affymetrix 250K Sty | Yes/No | 196/0 | 15/49/51 | 41/14/15/3 | 110/83 | 17/116 |
| GSE19399 [ | 19 | Affymetrix 250K Sty | Yes/Yes | 19/0 | 1/1/8 | 10/1/4/0 | 10/9 | 2/16 |
| GSE17247 [ | 49 | Affymetrix 250K Sty | Yes/No | 0/49 | - | - | - | - |
| GSE28572 [ | 44 | Affymetrix 250K Nsp | Yes/Yes | 44/0 | 9/18/17 | 21/10/8/2 | 27/17 | 4/37 |
| GSE34140 [ | 141 | Affymetrix 250K Nsp | Yes/No | 141/0 | - | 79/26/9/7 | - | 10/107 |
| GSK [ | 10 | Affymetrix 250K Nsp | Yes/No | 0/10 | - | - | - | - |
| GSE19804 [ | 12 | Affymetrix 6.0 | Yes/No | 12/0 | - | 6/3/2/1 | 12/0 | 12/0 |
| GSE25016 [ | 58 | Affymetrix 6.0 | Yes/No | 58/0 | - | - | - | - |
| GSE33848 [ | 146 | Affymetrix 6.0 | Yes/No | 146/0 | - | 95/0/0/0 | - | - |
| TCGA-AC [ | 135 | Affymetrix 6.0 | Yes/No | 135/0 | - | 73/23/27/9 | 78/57 | 19/109 |
| GSE36363 [ | 62 | Affymetrix 6.0 & 250K Nsp | Yes/Yes | 62/0 | 5/14/24 | 39/8/11/0 | 37/25 | 5/56 |
| Chitale [ | 184 | Agilent 44K | Yes/No | 184/0 | 42/45/97 | 121/26/32/5 | 107/77 | 39/145 |
| GSE20393 [ | 2 | Agilent 244K | Yes/No | 2/0 | - | - | - | - |
| E-TABM-926 [ | 17 | Agilent 244K | Yes/No | 17/0 | 13/1/3 | 8/1/8/0 | 14/3 | 17/0 |
| E-TABM-1169 [ | 40 | Illumina 370K | Yes/Yes | 40/0 | 28/2/10 | 22/5/13/0 | 36/4 | 40/0 |
| GSE31586 | 5 | ROMA 85K | Yes/No | 0/5 | - | - | - | - |
| TOTAL | 1272 | - | 1272/733 | 1208/64 | 113/134/210 | 515/117/129/27 | 433/276 | 165/586 |
A: Number of used adenocarcinoma cases per cohort.B: Included in overall GISTIC analysis (CN) and/or GAP-analysis for EGFR/KRAS mutation groups.C: For tumors only.
D: NS = never-smoker, S = smoker
Clinical characteristics of patients with available mutation status in gene expression cohorts.
|
|
| ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
|
| 226 | 91 | 102 | 99 | 106 | 117 | 58 | 85 | 30 |
|
| |||||||||
| Male | 105 | 41 | 42 | 15 | 46 | 60 | 13 | 37 | 20 |
| Female | 121 | 50 | 60 | 84 | 60 | 57 | 45 | 48 | 10 |
|
| |||||||||
|
| 127 | 15 | 24 | 49 | 18 | 45 | 17 | 11 | 3 |
|
| 20 | 11 | 36 | 17 | 43 | 15 | 22 | 20 | 8 |
| EGFRwt/KRASwt | 79 | 65 | 42 | 33 | 45 | 57 | 19 | 54 | 19 |
|
| |||||||||
| I | 168 | 53 | 70 | 57 | 70 | 79 | - | 48 | - |
| II | 58 | 20 | 10 | 10 | 19 | 13 | - | 18 | - |
| III | 0 | 18 | 17 | 32 | 13 | 25 | - | 11 | - |
| IV | 0 | 0 | 5 | 0 | 4 | 0 | - | 3 | - |
|
| |||||||||
| Differential gene expression | x | x | x | x | x | x | x | x | x |
| Unsupervised analysis | x | x | x | x | |||||
|
| Affymetrix U133 2plus | Affymetrix U133A | Affymetrix U133 2plus | Affymetrix U133 2plus | Affymetrix U133 2plus | Agilent 44K | Illumina WG6 V3 | Agilent 44K | Custom cDNA |
These cohorts contain additional samples with unknown EGFR and KRAS mutation status.
Figure 2Copy number alterations in lung adenocarcinoma.
Frequency of copy number gain (red) and loss (green) for adenocarcinoma stratified by EGFR and KRAS mutational status using log2ratio ± 0.12 as threshold for identification of copy number gain and loss. Probes matched to known copy number variations are excluded. Black regions indicate genomic position of significant mGISTIC regions, which were identified from analysis of the entire 1272-sample cohort across chromosomes. Arrows indicate genomic regions with apparently different copy number alteration frequency between EGFR/KRAS mutation groups (6q, 7p, and 16p). (A) All 1272 adenocarcinomas. (B) 113 EGFR-mutated adenocarcinoma tumors. (C) 134 KRAS-mutated adenocarcinoma tumors. (D) 210 EGFRwt/KRASwt adenocarcinoma tumors.
Figure 3Copy number alterations and tumor ploidy in EGFR/KRAS mutation groups.
(A) Pattern of gross copy number alterations measured as fraction of the genome altered by copy number gain or loss in adenocarcinoma tumors stratified by EGFR and KRAS mutation status (EGFR:red, KRAS:light blue, EGFRwt/KRASwt:gray), stage, gender and patient smoking status. Copy number alterations were called using log2ratio ± 0.12 as thresholds for identification of copy number gain and loss. P-values were calculated using ANOVA for indicated groups, ***: P< 0.001, **: P< 0.01, *: P< 0.05. Top axis indicates number of cases per group. (B) mGISTIC regions discriminating between EGFR-mutated (red), KRAS-mutated (light blue) and EGFRwt/KRASwt (gray) adenocarcinoma tumors. mGISTIC regions identified by Fisher’s exact test (Bonferroni adjusted p-value < 0.05) with an additional requirement of > 20% frequency difference between the lowest and highest groups. The y-axis describes the frequency of copy number gain or loss in respective group. (C) Distribution of GAP-ploidy across the adenocarcinoma EGFR/KRAS mutation groups for 141 tumors analyzed by GAP. A GAP-ploidy of two equals a diploid, three a triploid genome and four a tetraploid genome. Curves were generated by an Epanechnikov smoothing kernel with 0.1 smoothing bandwidth.
Differences and similarities in genomic alterations and allelic imbalances between EGFR/KRAS mutation groups.
|
|
|
|
|
|---|---|---|---|
| Fraction of the genome altered by copy number gain and loss | More a | Less | Less |
| Fraction of the genome altered by LOH | More | Less | Less |
| Fraction of the genome altered by copy number neutral LOH or copy-neutral allelic imbalance | Less | Less | Less |
| Overall frequency of recurrent amplifications and genomic complexity | More | Less | Less |
| Amplifications differing between mutation groups (mGISTIC regions) | 7p11.2 ( | 8p12 ( | |
| Copy number alterations differing between mutation groups (mGISTIC regions) | +1p34.2, +5p15.33, +5q35.1, +7p22.3-p22.2, +7p21.1, +7p11.2, +7q11.21, +14q21.2, +16p13.13, -8p23.2-p23.1, -8p21.2-p12, -9p24.3, -9p23, -9p21.3, -10q23.2-q23.31 | +12p12.1 ( | |
| Copy number alterations differing between mutation groups (genome-wide screen) | +1p36.33-p31.1, +5q34-q35.3, +7p22.3-p11.1, +7q11.21, +16p13.3-p12.1, +16p11.2-q11.1, +21q22.11-q22.3, -8p22-p11.21, -8q11.23 | ||
| Characteristic total LOH regions (≥50% frequency)b | 8p, 9, 13q, 17p | 19p | |
| Predominant tumor ploidy (GAP-ploidy) | 2N (highest), 3N | 2N (highest), 3N | 2N (highest), 3N |
a Less indicates relatively lower estimates or frequencies between groups. More indicates relatively higher estimates or frequencies.
b Includes LOH caused by copy number loss as well as copy-neutral LOH.
Differentially expressed genes between EGFR/KRAS mutation groups in ≥4 of five Affymetrix adenocarcinoma cohorts.
| Gene | Name | Gene | Name |
|---|---|---|---|
| ACSF2 | acyl-CoA synthetase family member 2 | ISG20 | interferon stimulated exonuclease gene 20kDa |
| ADCY9 | adenylate cyclase 9 | ITPR3 | inositol 1,4,5-triphosphate receptor, type 3 |
| AGFG1 |
| KCNK5 * | potassium channel, subfamily K, member 5 |
| AHR | aryl hydrocarbon receptor | KIAA0319L * | polycystic kidney disease 1-like |
| APOH | apolipoprotein H precursor | KIAA0494 | hypothetical protein LOC9813 |
| ARMCX6 | armadillo repeat containing, X-linked 6 | KIAA0495 | hypothetical protein LOC57212 |
| ARSD | arylsulfatase D | KIAA1033 * | hypothetical protein LOC23325 |
| BAG1 | BCL2-associated athanogene 1 | KRAS | c-K-ras2 protein isoform b |
| BLVRA | biliverdin reductase A | LDLRAP1 * | low density lipoprotein receptor adaptor protein |
| C16orf58 | hypothetical protein LOC64755 | LRRC31 | leucine rich repeat containing 31 |
| C7orf23 | chromosome 7 open reading frame 23 | MANBA | mannosidase, beta A, lysosomal |
| CADPS2 | Ca2+-dependent activator protein for secretion 2 | MEAF6 * | MYST/Esa1-associated factor 6 |
| CAMTA1 | calmodulin-binding transcription activator 1 | MMP15 | matrix metalloproteinase 15 preproprotein |
| CLDN10 | claudin 10 | MTPAP | mitochondrial poly(A) polymerase |
| COL21A1 | collagen, type XXI, alpha 1 precursor | MYST1 | MYST histone acetyltransferase 1 |
| CTNNBIP1 | catenin, beta interacting protein 1 | NAT15 | N(alpha)-acetyltransferase 60, NatF catalytic subunit |
| DDAH1 * | dimethylarginine dimethylaminohydrolase 1 | NBPF10 | hypothetical protein LOC440673 |
| DDX21 | DEAD (Asp-Glu-Ala-Asp) box polypeptide 21 | NFYC * | nuclear transcription factor Y, gamma |
| DNAJC9 | DnaJ homolog, subfamily C, member 9 | NIPAL3 | NIPA-like domain containing 3 |
| DUSP4 * | dual specificity phosphatase 4 | PDSS1 | prenyl diphosphate synthase, subunit 1 |
| EFHC2 | EF-hand domain (C-terminal) containing 2 | PEF1 | penta-EF-hand domain containing 1 |
| EGFR * | epidermal growth factor receptor | PER3 | period 3 |
| ELN | elastin | PIGV * | phosphatidylinositol glycan class V |
| ENC1 | ectodermal-neural cortex (with BTB-like domain) | PIK3IP1 | HGFL protein |
| ENTPD4 | ectonucleoside triphosphate diphosphohydrolase | PPCS | phosphopantothenoylcysteine synthetase isoform |
| ETV5 | ets variant gene 5 (ets-related molecule) | PPFIBP2 | PTPRF interacting protein, binding protein 2 |
| FAAH | fatty acid amide hydrolase | PPIF | peptidylprolyl isomerase F precursor |
| FAM184A | family with sequence similarity 184, member A | PRDM4 | PR domain containing 4 |
| FGF13 | fibroblast growth factor 13 | PYROXD1 | pyridine nucleotide-disulphide oxidoreductase domain 1 |
| FGG | fibrinogen, gamma chain | RAPGEF5 | Rap guanine nucleotide exchange factor (GEF) 5 |
| FGGY | FGGY carbohydrate kinase domain containing | RFK | riboflavin kinase |
Figure 4Supervised classification of adenocarcinoma gene expression cohorts with respect to EGFR and KRAS mutation status.
Sensitivity (solid line) and specificity (dashed line) by EGFR/KRAS mutation group for classification of four independent validation cohorts using a 96-gene centroid classifier. The x-axis shows Pearson correlation cut-off for assigning a sample to the centroid with the highest correlation. Increasing correlation cut-offs correspond to increased stringency in classification, but introduces growing numbers of unclassified samples excluded in the calculation of sensitivity and specificity.