| Literature DB >> 33535033 |
Jian Carrot-Zhang1, Xiaotong Yao2, Siddhartha Devarakonda3, Aditya Deshpande2, Jeffrey S Damrauer4, Tiago Chedraoui Silva5, Christopher K Wong6, Hyo Young Choi7, Ina Felau8, A Gordon Robertson9, Mauro A A Castro10, Lisui Bao11, Esther Rheinbay12, Eric Minwei Liu13, Tuan Trieu13, David Haan6, Christina Yau14, Toshinori Hinoue15, Yuexin Liu16, Ofer Shapira17, Kiran Kumar18, Karen L Mungall9, Hailei Zhang17, Jake June-Koo Lee19, Ashton Berger17, Galen F Gao17, Binyamin Zhitomirsky12, Wen-Wei Liang20, Meng Zhou1, Sitapriya Moorthi21, Alice H Berger21, Eric A Collisson22, Michael C Zody23, Li Ding20, Andrew D Cherniack18, Gad Getz12, Olivier Elemento24, Christopher C Benz25, Josh Stuart6, J C Zenklusen8, Rameen Beroukhim26, Jason C Chang27, Joshua D Campbell28, D Neil Hayes7, Lixing Yang11, Peter W Laird15, John N Weinstein16, David J Kwiatkowski29, Ming S Tsao30, William D Travis27, Ekta Khurana13, Benjamin P Berman31, Katherine A Hoadley4, Nicolas Robine23, Matthew Meyerson32, Ramaswamy Govindan33, Marcin Imielinski34.
Abstract
RTK/RAS/RAF pathway alterations (RPAs) are a hallmark of lung adenocarcinoma (LUAD). In this study, we use whole-genome sequencing (WGS) of 85 cases found to be RPA(-) by previous studies from The Cancer Genome Atlas (TCGA) to characterize the minority of LUADs lacking apparent alterations in this pathway. We show that WGS analysis uncovers RPA(+) in 28 (33%) of the 85 samples. Among the remaining 57 cases, we observe focal deletions targeting the promoter or transcription start site of STK11 (n = 7) or KEAP1 (n = 3), and promoter mutations associated with the increased expression of ILF2 (n = 6). We also identify complex structural variations associated with high-level copy number amplifications. Moreover, an enrichment of focal deletions is found in TP53 mutant cases. Our results indicate that RPA(-) cases demonstrate tumor suppressor deletions and genome instability, but lack unique or recurrent genetic lesions compensating for the lack of RPAs. Larger WGS studies of RPA(-) cases are required to understand this important LUAD subset.Entities:
Keywords: TCGA; driver; genome analysis; lung adenocarcinoma; noncoding; oncogene; precision oncology; structural variation; tumor suppressor; whole genome sequencing
Mesh:
Substances:
Year: 2021 PMID: 33535033 PMCID: PMC8009291 DOI: 10.1016/j.celrep.2021.108707
Source DB: PubMed Journal: Cell Rep Impact factor: 9.423
Figure 1.Identification of RPA(–) LUADs
(A) Identification of 118 RPA(–)E LUAD cases from the 501 TCGA LUAD cohort, defined by WES or RNA-seq analysis. Eighty-five of the 118 samples were sent for WGS. The RTK/RAS/RAF pathway alterations used to define the RPA(+) or RPA(–) cases are listed in Table S1.
(B) WGS uncovered genomic alterations in the RTK/RAS/RAF pathway in 28/85 samples; 57/85 samples remain as RPA(–) after WGS analysis.
(C) Visualization of sequencing reads covering a KRAS p.G12C mutation in WGS (upper panel) and WES (lower panel) for sample (TCGA-55–7574). Both read depth and the number of reads supporting the mutation are higher in WGS than in WES.
(D) An example of EGFR amplification coupled with EGFR overexpression in TCGA-50–5939. In the second panel, purity-adjusted copy number and SV junctions (red lines) support a BFBC event underlying the amplification. Lower panels indicate WGS read depth and gene location in the region. CN, copy number.
(E) Example of a RASA1 simple deletion spanning from exon 21 to the end of the gene (12 kbp) coupled with RASA1 loss of expression in TCGA-55–8614.
See also Figure S1.
Figure 2.Recurrent coding alterations in RPA(–)G LUADs
(A) Overview of genomic alterations in 57 RPA(–)G LUADs. Genes significantly mutated (*) or significantly amplified/deleted in the RPA(–)G samples are listed.
(B and C) Example of KEAP1 (3 kbp length) (B) and (C) STK11 (8 kbp length) simple, homozygous deletion (CN = 0), resulting in loss of expression. The distribution of the KEAP1 or STK11 expression is plotted based on the full TCGA LUAD cohort.
(D) Expression comparison of samples with loss-of-function alterations in STK11 and (E) KEAP1 to other RPA(–)G LUAD samples. p values are calculated from Mann-Whitney U tests. Boxplots show median, interquartile range, and 1.5 times the interquartile range.
See also Figure S1.
Figure 3.Identification of ILF2 promoter mutations in RPA(–)G LUADs
(A) Three genes with non-coding mutations nominated through recurrence analysis across LUAD-related ATAC peaks (left). Red dots indicate loci with FDR < 0.25.
(B) Same as (A), but restricted to ATAC-seq peaks in genes with RSEM ≥ 10 across TCGA LUAD and recurrently amplified in RPA(–)G samples. Red dots indicate FDR < 0.1.
(C) Among 57 RPA(–)G samples, 6 SNVs are observed in the promoter region of ILF2; all are located within ATAC-seq peaks.
(D) Expression comparison of RPA(–)G samples with ILF2 promoter mutations and amplifications. p values are calculated from linear regression analysis correlating expression, adjusting for the local copy number of ILF2 and purity. Boxplot shows median, interquartile range, and 1.5 times the interquartile range. See also Figure S3.
Figure 4.Classification of SVs in RPA(–)G LUADs
(A) Identification of simple and complex SV events. Upper panel: SVs resulting in copy-number gain (double minute, BFBC, tyfonas, pyrgo, simple duplication). Lower panel: SVs resulting in copy-number loss (chromothripsis, rigma, chromoplexy, templated insertion chain, simple deletion). Key indicates the range of event count of SV types observed in each sample.
(B) Expression quantile of genes located in SV types with copy-number gain.
(C) Simple deletion count is more significantly enriched in the TP53 mutant RPA(–)G samples than in the TP53-wild-type RPA(–)G samples. p value is obtained from Mann-Whitney U test. Violin plots reflect kernel density estimations.
(D and E) Example of a double minute in TCGA-55–5899 spanning 3 genes (D), and (E) 2 of which (UBL3 and LIG4) showed marked overexpression relative to RNA-seq data for the full TCGA LUAD cohort. See also Figure S4.
KEY RESOURCES TABLE
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Deposited data | ||
| TCGA somatic mutation, copy | Genomic Data Commons; | |
| number, aneuploidy, genetic ancestry, | ( | |
| clinical data | ||
| COSMIC | N/A | |
| 1000 Genomes Project data | ( | |
| ExAC | ( | |
| CIVic | ( | |
| UniProt | ( | |
| TCGA ATAC-seq data | ( | |
| TCGA normalized mRNA data | Genomic Data Commons | |
| Software and algorithms | ||
| BWA v0.6.2 | ( | |
| NovoSort v1.03.01 | N/A | |
| GATK v3.4 | ( | |
| MuTect v1.1.7 | ( | |
| Strelka v1.0.14 | ( | |
| LoFreq v2.1.3a | ( | |
| Pindel v0.2.5 | ( | |
| Scalpel v0.5.3 | ( | |
| EBI validator v 0.4.3 | N/A | |
| snpEff and snpSift | ( | |
| Funseq2 | ( | |
| SignatureAnalyzer | ( | |
| GISTIC 2.0 | ( | |
| Titan | ( | |
| SvABA | ( | |
| JaBbA | ( | |
| Circular Binary Segmentation | ( | |
| Sequenza | ( | |
| Samtools | ( | |
| gGnome | ( | |
| SCISSOR | ( | |
| fishHook | ( | |