| Literature DB >> 28273134 |
Albert Rosenberger1, Melanie Sohns1, Stefanie Friedrichs1, Rayjean J Hung2,3, Gord Fehringer2, John McLaughlin4, Christopher I Amos5, Paul Brennan6, Angela Risch7, Irene Brüske8, Neil E Caporaso9, Maria Teresa Landi9, David C Christiani10, Yongyue Wei10, Heike Bickeböller1.
Abstract
INTRODUCTION: Gene-set analysis (GSA) is an approach using the results of single-marker genome-wide association studies when investigating pathways as a whole with respect to the genetic basis of a disease.Entities:
Mesh:
Year: 2017 PMID: 28273134 PMCID: PMC5342225 DOI: 10.1371/journal.pone.0173339
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Study selection flow cart.
Characteristics of lung cancer GWASs of the International Lung Cancer Consortium (ILCCO).
| Study | Cases | Controls | Location | Study design | Illumina genotyping platform | Number of SNPs |
|---|---|---|---|---|---|---|
| MDACC | 1 150 | 1 134 | Texas, USA | Hospital-based case–control | 317K | 312 829 |
| TORONTO | 331 | 499 | Toronto, CA | Hospital-based case–control | 317K | 314 285 |
| CE (IARC | 1 854 | 2 453 | Romania, Hungary, Slovakia, Poland, Russia, Czech Republic | Multicenter hospital-based case–control | 317K, 370Duo | |
| GLC | 487 | 480 | Germany | Population-based case–control (<50 years) | HumanHap550K | 503 381 |
| DeCODE Genetics | 830 | 11 228 | Iceland | Population-based case–control | 317K, 370Duo | 290 386 |
| HARVARD | 984 | 970 | Massachusetts, USA | Hospital-based case–control | 610Quad | 543 697 |
| 506 062 | ||||||
| EAGLE | 1 920 | 1 979 | Italy | Population-based case–control | HumanHap550v3_B, 610Quad | |
| ATBC | 1 732 | 1 271 | Finland | Cohort | HumanHap550K, HumanHap610 | |
| PLCO | 1 380 | 1 817 | 10 US Centers | Cohort-Cancer Prevention Trial | 317K / 240S, HumanHap550v3_B, HumanHap610 | |
| CPS-II | 697 | 674 | all US states | Cohort | HumanHap550K, 610Quad | |
a MD Anderson Cancer Center.
b Toronto study by Lunenfeld-Tanenbaum Research Institute.
c Central Europe Study of the International Agency for Research on Cancer.
d German Lung Cancer Study.
e Environment And Genetics in Lung cancer Etiology study.
f Alpha-Tocopherol, Beta-Carotene Cancer Prevention study.
g Prostate, Lung, Colon, Ovary screening trial.
h Cancer Prevention Study II nutrition cohort.
Significant results of META-GSA.
| KEGG pathways | number of genes | screening | replication | MSP combination | all | ||
|---|---|---|---|---|---|---|---|
| 4 studies | 3 studies | 7 studies | |||||
| n genes | |||||||
| 128 | 0.0857 | 0.0615 | |||||
| 13 | 0.0543 | 0.9122 | 0.6672 | 0.3154 | |||
| 42 | 0.1940 | 0.4890 | 0.3570 | 0.3952 | |||
| 231 other gene sets | >0.0043 | futility | stopping | ||||
SLE—systemic lupus erythematosus; folate bio folate biosynthesis; T1DM—type I diabetes mellitus, MSP—combined p-values according to the method based on the sum of p-values (adaptive designed approach for early futility stopping); p—p-value of the screening phase; p—p-value of the screening phase corrected for multiple testing; p—p-value of the replication phase, p—p-value of the gene set (combining p and p); p—p-value of the gene set corrected for multiple testing; effective number of independent gene sets according the lowest slope method (LSM).
$: t = 171.5.
* P ≤ 0.05.
** P ≤ 0.01.
*** P ≤ 0.001.
Significance and concordance of selected genes of interest.
| gene | location | number of studies with | concordance | significance |
|---|---|---|---|---|
| 6p22.1 | 2 | -0.84 | 0.0056 | |
| 6p22.1 | 2 | -0.80 | 0.0091 | |
| 6p22.1 | 2 | -0.80 | 0.0091 | |
| 6p22.1 | 2 | +0.75 | 0.0093 | |
| 6p22.1 | 2 | +0.75 | 0.0093 | |
| 6p21.3 | 2 | +0.33 | 0.0109 | |
| 6p21.3 | 1 | -0.41 | 0.0319 | |
| 5q15.33 | 4 | -0.53 | < .0001 | |
| 5q15.33 | 4 | +0.49 | 0.0013 | |
| 15q24 | 3 | -0.63 | < .0001 | |
| 15q24 | 4 | -0.58 | < .0001 | |
| 15q24 | 3 | -0.45 | 0.0009 | |
| 6p21.3 | 3 | +0.67 | < .0001 | |
| 6p21.3 | -- | +0.39 | 0.1425 | |
| 12p13.33 | 1 | +0.23 | 0.3143 | |
| 9p21.3 | -- | -0.13 | 0.6729 | |
p is the study specific p-value for gene; is the mean of study specific PDRs for a gene (95% random interval derived from all 16.000 assigned genes: [±0.306]); pooled p—p-values combined by Fisher’s inverse χ2-method.
Markers with <0.5 in genes of interest on 6p21-22.
| SNP | allocated to | Position | MAF | r2 to | D‘ to | LC | SqCLC | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| (A) | (B) | (A) | (B) | OR | p-value | OR | p-value | |||||
| rs200991 | 27847716 | 0.12 | 0.646 | 1 | 0.598 | 1.14 | 0.0021 | 1.16 | 3.1×10−5 | |||
| 27847861 | 0,08 | 1 | 1 | 0.719 | 1.23 | 0.0032 | 1.22 | 9.7×10−6 | ||||
| rs9262143 | 30685004 | 0.16§ | 0.769 | 1.25 | 0.0027 | 1.25 | 1.3×10−7 | |||||
| rs3094127 | 30729670 | 0.18 | 0.664 | 0.84 | 0.0029 | 1.10 | 4.0×10−2 | |||||
| rs3128982 | 31449414 | 0.30 | 0.578 | 1.07 | 0.0032 | 1.12 | 1.1×10−3 | |||||
| rs3117582 | 31652743 | 0.09 | 0.881 | 1 | 0.485 | 1.27 | 0.0049 | 1.30 | 4.5×10−10 | |||
| rs3131379 | 31753256 | 0.09 | 0.881 | 1 | 0.461 | 1.20 | 0.0074 | 1.28 | 3.8×10−7 | |||
| rs652888 | 31883457 | 0.17 | 0.336 | 1 | 0.538 | 1.14 | 0.0013 | 1.18 | 1.3×10−4 | |||
| rs535586 | 31892560 | 0.35 | 0.131 | 1 | 0.606 | 1.09 | 0.0001 | 1.11 | 1.2×10−3 | |||
| rs659445 | 31896527 | 0.35 | 0.131 | 1 | 0.711 | 1.09 | 3.7×10−6 | 1.10 | 3.1×10−3 | |||
| 31951083 | 0.09 | 1 | 1 | 0.728 | 1.27 | 0.0090 | 1.29 | 5.8×10−6 | ||||
| rs438999 | 31960529 | 0.06 | 0.005 | 1 | -.517 | 0.91 | 0.0027 | 0.85 | 1.0×10−2 | |||
| rs454212 | 31966595 | 0.08 | -.556 | 0.95 | 0.0034 | 0.84 | 1.7×10−2 | |||||
| 31973120 | 0.09 | 1$ | 1 | 0.724 | 1.27 | 0.0080 | 1.28 | 7.2×10−6 | ||||
Odds ratios (OR), corresponding p-values from a random effects meta-analysis model; single study ORs were adjusted for age, sex, smoking and genetic background; r2 and D’ were calculated according to the HapMap3-panel.
(a) or the 1000 Genome Pilot 1-panel.
(b) using SNAP Version 2.2; HIST1++ denotes the gene cluster HIST1-H4L/H2BN/H2AK/H2BN/H4K; LC—lung cancer (all histological subtypes), SqCLC – squamous-cell lung cancer; markers with largest with genes driving the significance of the SLE gene set (HIST1++, C2 and C4A) are printed in bold. Position of SNPs is given according to NCBI Build 37. MAF … minor allele frequencies in controlls.
Subgroup analysis for hsa05322: histological subtypes, sex, age, smoking.
| META-GSA | Gene | Location | concordance | significance | |
|---|---|---|---|---|---|
| AdenoLC | HIST2-1q21.2 | 1q21.2 | -0.6 | 0.1666 | |
| SqCLC | H2AFV | 7p13 | 0.5 | 0.7209 | |
| SCLC | 0.0626 | C1QA | 1p36.12 | -0.5 | 0.7101 |
| HIST2-1q21.2 | 1q21.2 | -0.5 | 0.0577 | ||
| ELANE | 19p13.3 | 0.5 | 0.4864 | ||
| HIST1-6p22.2 | 6p22.2 | 0.5 | 0.2177 | ||
| HIST1-6p22.2 | 6p22.2 | 0.5 | 0.2177 | ||
| LCLC | 0.2056 | -- | |||
| male | 0.1453 | HIST1H3C | 6p22.2 | -0.5 | 0.3726 |
| female | HIST1H2AL | 6p22.1 | 0.5 | 0.1229 | |
| old (>50) | HIST1-6p22.1a | 6p22.1 | -0.7 | 0.0054 | |
| HIST1-6p22.1b | 6p22.1 | 0.5 | 0.1578 | ||
| C2 | 6p21.3 | 0.5 | 0.0013 | ||
| H2AFV | 7p1 | 0.6 | 0.4005 | ||
| young (≤50) | 0.0588 | -- | |||
| current smokers | 0.3563 | HIST1-6p22.1a | 6p22.1 | 0.4 | 0.1720 |
| H3F3C | 12p11.21 | 0.4 | 0.5821 | ||
| HIST3H3 | 1q42 | 0.4 | 0.6468 | ||
| HIST1-6p22.1b | 6p22.1 | 0.4 | 0.2028 | ||
| HIST1-6p22.1c | 6p22.1 | 0.4 | 0.3375 | ||
| former smokers | 0.4691 | -- | |||
| ever smokers | 0.5132 | HIST1-6p22.1a | 6p22.1 | 0.5 | 0.0462 |
| HIST1-6p22.1c | 6p22.1 | 0.5 | 0.1587 | ||
| never smokers | 0.5429 | FCGR3A | 1q23 | -0.5 | 0.2300 |
| CTSG | 14q11.2 | -0.5 | 0.3403 |
Listed are genes, respectively regions containing genes with .
HIST2-1q21.2: HIST2H2AA3 / HIST2H2AA4 / HIST2H3C / HIST2H4B.
HIST3-1q42: HIST3H2A / HIST3H2BB / HIST3H3.
HIST1-6p22.1a: HIST1H4K / HIST1H2AK / HIST1H2AL / HIST1H2BM / HIST1H2BN / HIST1H3I / HIST1H4L / HIST1H3J / HIST1H4J (27.800K).
HIST1-6p22.1b: HIST1H2AG / HIST1H2BK (27.150 K).
HIST1-6p22.1c: HIST1H2BI / HIST1H3G / HIST1H4H (26.280 K).
HIST1-6p22.2: HIST1H3E / HIST1H2AE / HIST1H2BG / HIST1H4E (26.200 K).
The numbers in brackets are the approximate locations according to dbGENE.
Fig 2Association and correlation with gene expression in the chromosome 6p21-22 region.
LC—lung cancer, SLE—systemic lupus erythematosus; correlation to gene expression: pooled p-values as reported by Nguyen et al., 2014 [40]; association with LC: pooled p-values as reported by Timofeeva et al. 2012 [13].