| Literature DB >> 35511172 |
Aurélie A G Gabriel1, Joshua R Atkins1, Ricardo C C Penha1, Karl Smith-Byrne1,2, Valerie Gaborieau1, Catherine Voegele1, Behnoush Abedi-Ardekani1, Maja Milojevic1, Robert Olaso3, Vincent Meyer3, Anne Boland3, Jean François Deleuze3, David Zaridze4, Anush Mukeriya4, Beata Swiatkowska5, Vladimir Janout6, Miriam Schejbalová7, Dana Mates8, Jelena Stojšić9, Miodrag Ognjanovic10, John S Witte11, Sara R Rashkin11,12, Linda Kachuri11, Rayjean J Hung13, Siddhartha Kar14,15, Paul Brennan1, Anne-Sophie Sertier16, Anthony Ferrari16, Alain Viari16,17, Mattias Johansson1, Christopher I Amos18, Matthieu Foll1, James D McKay1.
Abstract
BACKGROUND: Germline genetic variation contributes to lung cancer (LC) susceptibility. Previous genome-wide association studies (GWAS) have implicated susceptibility loci involved in smoking behaviors and DNA repair genes, but further work is required to identify susceptibility variants.Entities:
Mesh:
Year: 2022 PMID: 35511172 PMCID: PMC9360465 DOI: 10.1093/jnci/djac087
Source DB: PubMed Journal: J Natl Cancer Inst ISSN: 0027-8874 Impact factor: 11.816
Figure 1.Manhattan plot of the meta-analysis of the genome-wide by proxy (GWAx) with genome-wide association study (GWAS) into lung cancer. The Manhattan plot displays the results of the meta-analysis of the GWAx (48 843 proxy patients and 195 387 controls without a family history of any cancer) and the Transdisciplinary Research In Cancer of the Lung GWAS (29 266 patients and 56 450 controls) with already identified and novel loci noted with the likely candidate gene name presented. The table represents the newly 11 independent loci across 8 distinct cytoband regions (sites with 2 independent hits are denoted by * within the cytoband column). The x-axis is the chromosome position across the autosomal chromosomes, and the y-axis contains the association level displayed as the -log10(P value), derived by a multivariate logistic regression model. The dotted line displays the genome-wide significance threshold (5 x 10-8). L95% = lower bound confidence interval; OR = odds ratio; U95% = upper bound confidence interval.
Figure 2.Brain and lung eQTLs discovered within the 8 novel loci. Colocalization plots between lung cancer (x-axis) and CHRNB2 putamen expression (1q21.3) A) CHRNA4 putamen expression (20q13.33); (B) CHEK1 lung expression (11q24.2); (C) RP11-10O17.1 lung gene expression (15q24); (D) (y-axis). Each variant and eQTL status were compared using COLOC for colocalization to confirm that the lung cancer SNP was the same SNP driving the eQTL effect in both brain and lung tissues, the Bayesian posterior probability (PP4) of each gene was tested. Stars indicate the variant of interest and shading scaled representing the level of LD shared between other markers with sentinel variant (r2 > 0.8; r2 > 0.4; r2 > 0.1). eQTL = expression quantitative trait loci; LC = lung cancer; LD = linkage disequilibrium; SNP = single nucleotide polymorphism.
Figure 3.Germline polygenic risk score construction using smoking and eQTL related SNPs and performance testing within the UK Biobank lung cancer cohort. A) The mean lung cancer association statistics calculated by variant bins (100 variants per bin) ranked by partial least squares (PLS) components. Variants (clumped on LD based on lung cancer P values) were ranked based on PLS components for smoking propensity (Component1_smoking, top) and eQTLs (Component1_eQTL, [B]) (x-axis) and plotted against the mean lung cancer Z statistics calculated across variants in each bin (y-axis). Bin values that exceed 3 SDs from the mean are noted, with the excess observed (number of bins smoking propensity = 9, number of bins eQTL = 37) implying that the variants within these bins are enriched for LC-susceptibility alleles. C) A forest plot of the performance of the constructed PRSs in comparison to the PRS based on the 65 GWS independent loci as a baseline which included array type, sex, age of recruitment and the first 5 principal components from genetic-inferred ancestry). CI = confidence interval; eQTL = expression quantitative trait loci; LC = lung cancer; LD = linkage disequilibrium; GWS = genome-wide significant; OR = odds ratio; PRS = polygenic risk scores; SNP = single nucleotide polymorphism.
Figure 4.Polygenic risk scores for smoking (smPRS) associations with total number of mutations and mutations attributable to SBS4 in TCGA cohort. A) Associations with total number of mutations. B) Associations with SBS4 mutations. The left panels represent the distribution of the number of mutations in the smPRS quintiles. The right panels correspond, respectively, to the forest plots of smPRS associations with total mutational burden (panel A) and SBS4 mutations (panel B). For each PRS, the association was tested 1) in all lung cancer patients when considering all SNPs in the smPRS SNPs selection, 2) in all lung cancer patients when considering different subsets of SNPs in the PRS computation, 3) stratifying by histology, and 4) stratifying by smoking status. CI = confidence interval; IRR = incidence rate ratios; LUAD = Lung adenocarcinoma; LUSC = Lung Squamous Cell Carcinoma; NA = Not available; Q = quintile; TCGA = The Cancer Genome Atlas; SNP = single nucleotide polymorphism.