| Literature DB >> 22235315 |
Fenna C M Sillé1, Reuben Thomas, Martyn T Smith, Lucia Conde, Christine F Skibola.
Abstract
Recent genome-wide association studies (GWAS) have identified several gene variants associated with sporadic chronic lymphocytic leukemia/small lymphocytic lymphoma (CLL/SLL). Many of these CLL/SLL susceptibility loci are located in non-coding or intergenic regions, posing a significant challenge to determine their potential functional relevance. Here, we review the literature of all CLL/SLL GWAS and validation studies, and apply eQTL analysis to identify putatively functional SNPs that affect gene expression that may be causal in the pathogenesis of CLL/SLL. We tested 12 independent risk loci for their potential to alter gene expression through cis-acting mechanisms, using publicly available gene expression profiles with matching genotype information. Sixteen SNPs were identified that are linked to differential expression of SP140, a putative tumor suppressor gene previously associated with CLL/SLL. Three additional SNPs were associated with differential expression of DACT3 and GNG8, which are involved in the WNT/β-catenin- and G protein-coupled receptor signaling pathways, respectively, that have been previously implicated in CLL/SLL pathogenesis. Using in silico functional prediction tools, we found that 14 of the 19 significant eQTL SNPs lie in multiple putative regulatory elements, several of which have prior implications in CLL/SLL or other hematological malignancies. Although experimental validation is needed, our study shows that the use of existing GWAS data in combination with eQTL analysis and in silico methods represents a useful starting point to screen for putatively causal SNPs that may be involved in the etiology of CLL/SLL.Entities:
Mesh:
Year: 2012 PMID: 22235315 PMCID: PMC3250464 DOI: 10.1371/journal.pone.0029632
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
CLL/SLL susceptibility loci identified through genome-wide association studies and follow-up studies.
| Risk locus | SNP ID | r2 | Chr Pos GRCh37 (bp) | Major/minor allele | Nearest gene(s) | SNP location | OR (risk allele) | (95% CI) (risk allele) |
| Ref. | Further validation ref. |
| 2q13 |
| 111797458 | A/ |
| Intron 10 of | 1.39 | (1.25–1.53) | 2.36E-10 |
|
| |
| 2q37.1 |
| 231091223 | T |
| Intron 1 of | 1.41 | (1.26–1.57) | 5.40E-10 |
|
| |
| 2q37.3 |
| 242371101 | G/ |
| Exon 9 | 1.39 | (1.25–1.56) | 2.11E-09 |
|
| |
|
| 1 | 242344333 | A/ |
| Intron 4 | 1.38 | (1.24–1.54) | 4.53E-09 |
| ||
| 6p25.3 |
| 411064 | A/ |
| 3′UTR | 1.54 | (1.41–1.69) | 1.91E-20 |
|
| |
|
| 0.74 | 417727 | A/ |
| ∼10 kb from 3′UTR | 1.51 | (1.38–1.65) | 4.62E-19 |
|
| |
|
| 0.99 | 410417 | A/ |
| 3′UTR | 1.59 | (1.43–1.78) | 3.68E-17 |
| ||
|
| 0.99 | 409119 | A/ |
| 3′UTR | 1.59 | (1.42–1.77) | 5.61E-17 |
| ||
|
| 0.99 | 408079 | C/ |
| 3′UTR | 1.58 | (1.41–1.76) | 2.13E-16 |
| ||
| 8q24.21 |
| 128192981 | A/ |
| – | 1.26 | (1.17–1.35) | 7.84E-10 |
|
| |
|
| 0.75 | 128188019 | G/ |
| – | 1.2 | (1.12–1.29) | 4.60E-07 |
| ||
| 11q24.1 |
| 123361397 | G/ |
| ∼50 kb centromeric | 1.45 | (1.31–1.61) | 3.78E-12 |
|
| |
| 15q23 |
| 70018990 | G/ |
| – | 1.37 | (1.26–1.50) | 4.54E-12 |
|
| |
|
| >0.9 | 69987882 |
|
|
| 1.54 | (1.24–1.92) | 1.00E-04 |
| ||
|
| >0.9 | 69992114 | C/T/ |
|
| 1.56 | (1.25–1.95) | 9.00E-05 |
| ||
|
| >0.9 | 70004352 |
|
|
| 1.52 | (1.22–1.90) | 2.10E-04 |
| ||
|
| >0.9 | 70007323 | C/ |
|
| 1.52 | (1.22–1.89) | 2.10E-04 |
| ||
| 15q25.2 |
| 83254708 | A/ |
| Intron 2 | 1.17 | (1.11–1.24) | 1.10E-07 |
|
| |
| 16q24.1 |
| 85975659 | C/ |
| ∼19 kb telomeric | 1.22 | (1.12–1.32) | 3.60E-07 |
|
| |
|
| 0.93 | 85973866 |
|
| ∼17 kb telomeric | 0.77 | (0.61–0.97) | 2.37E-02 |
| ||
|
| 85944439 | A/ |
| Intron 3 | 0.64 | (0.55–0.74) | 3.16E-09 |
| |||
|
| 0.99 | 85943466 | T/ |
| Intron 3 | 0.66 | (0.57–0.77) | 3.37E-08 |
| ||
|
| 0.99 | 85945076 | T/ |
| Intron 3 | 0.66 | (0.56–0.76) | 1.89E-08 |
| ||
|
| 0.99 | 85944823 | T/ |
| Intron 3 | 0.64 | (0.55–0.74) | 3.16E-09 |
| ||
| 18q21.1 |
| 47843534 | C |
| Telomeric | 1.16 | (1.09–1.25) | 1.30E-05 |
|
| |
| 19q13.32 |
| 47207654 | G/ |
| Intron 3 of | 1.35 | (1.22–1.49) | 3.96E-09 |
|
| |
|
| 0.27 | 47268373 | G/ |
| Intergenic | 1.37 | (1.21–1.55) | 3.65E-07 |
|
Notes: In regular font are original independent SNPs identified through GWAS. In italics font are SNPs in LD with the original GWAS SNPs. In bold font are independently validated SNPs used for eQTL analysis and risk alleles as called by the primary study. Nearest gene(s) map within ∼200 kb of each SNP.
OR, CI and P-trend quoted are per copy of risk allele (bold in column 5) from all data combined in the primary study. P-trend, significance of the association between each SNP and risk of CLL/SLL.
Conditional analysis reportedly provided no evidence for an independent role compared to original SNP [4], [7], [12].
Acquired after fine-scale mapping.
Significance obtained from combined analysis from refs [12] and [8].
Abbreviations: LD, Linkage disequilibrium; OR, odds ratio; CLL/SLL, chronic lymphocytic leukemia/small lymphocytic lymphoma; CI, confidence interval.
Figure 1CLL/SLL susceptibility loci identified through genome-wide association studies.
Karyotype depicts CLL/SLL-associated SNPs and SNPs in LD with those SNPs that were identified through previous genome-wide association studies (GWAS) and follow-up studies. Independent loci are color-coded with the primary GWAS SNP in dark and SNPs in LD in a lighter shade. Chromosome locations are based on chromosome build 37.1 GRCh37.
CLL/SLL-associated SNPs that alter gene expression through cis-acting mechanisms.
| Differential gene expression | SNP ID | r2 | Chr | Chr Pos GRCh37 (bp) | SNP location | Major/minor allele | Raw | Adjusted | Expression |
|
| rs6716753 | 0.95 | 2 | 231097129 | Intron 1 of | T/ | 5.88E-03 | 0.186 | ↓ |
| rs6743984 | 0.95 | 2 | 231109329 | Intron 5 of | T/ | 5.88E-03 | 0.186 | ↓ | |
| rs7423615 | 1 | 2 | 231116874 | Intron 10 of | C/ | 1.87E-03 | 0.114 | ↓ | |
| rs7563433 | 1 | 2 | 231095678 | Intron 1 of | T/ | 6.10E-03 | 0.186 | ↓ | |
| rs9989746 | 0.89 | 2 | 231115790 | Intron 10 of | G/ | 6.72E-04 | 0.082 | ↓ | |
| rs9989835 | 1 | 2 | 231115630 | Intron 9 of | A/ | 4.49E-03 | 0.171 | ↓ | |
| rs9989899 | 1 | 2 | 231114131 | Intron 9 of | G/ | 1.87E-03 | 0.114 | ↓ | |
| rs10198539 | 0.89 | 2 | 231105646 | Intron 5 of | C/ | 8.43E-04 | 0.086 | ↓ | |
| rs10201872 | 0.89 | 2 | 231106724 | Intron 5 of | C/ | 6.72E-04 | 0.082 | ↓ | |
| rs10202244 | 0.89 | 2 | 231106935 | Intron 5 of | G/ | 6.72E-04 | 0.082 | ↓ | |
| rs10209615 | 0.83 | 2 | 231089943 | Upstream of | T/ | 4.66E-04 | 0.082 | ↓ | |
| rs10498246 | 1 | 2 | 231116417 | Intron 10 of | T/ | 3.09E-03 | 0.157 | ↓ | |
| rs13384787 | 0.89 | 2 | 231088084 | Upstream of | A/ | 6.72E-04 | 0.082 | ↓ | |
| rs13385151 | 1 | 2 | 231115976 | Intron 10 of | C/ | 3.75E-03 | 0.164 | ↓ | |
|
| - | 2 | 231091223 | Intron 1 of | T/ | 3.09E-03 | 0.157 | ↓ | |
| rs28445040 | 1 | 2 | 231110582 | Exon 7 of | C/ | 1.87E-03 | 0.114 | ↓ | |
|
| rs4802322 | 0.95 | 19 | 47242992 | Intron 1 of STRN4 |
| 6.63E-03 | 0.193 | ↑ |
|
| - | 19 | 47207654 | Intron 5 of | G/ | 3.34E-03 | 0.157 | ↑ | |
|
| rs4802322 | 0.95 | 19 | 47242992 | Intron 1 of | G/ | 4.01E-03 | 0.164 | ↑ |
|
| - | 19 | 47207654 | Intron 5 of | G/ | 1.85E-03 | 0.114 | ↑ | |
| rs11670473 | 0.88 | 19 | 47257481 | Intron 3 of | G/ | 5.01E-03 | 0.180 | ↑ |
Notes: Depicted are CLL/SLL-associated SNPs and SNPs in LD that are significantly linked to differential gene expression (BH<0.20). Highlighted in bold are the original GWAS SNPs and the risk allele, or the minor allele when the risk allele is not known.
Same SNP influencing expression of two distinct genes.
The synonymous rs28445040 variation (TCC→TCT) does not lead to a substitution for the serine ([Ser]→[Ser]) at amino acid position 223.
Abbreviations: BH, Benjamini-Hochberg; ChrPos, chromosome position; CLL/SLL, chronic lymphocytic leukemia/small lymphocytic lymphoma.
Figure 2CLL/SLL-associated SNPs that significantly alter gene expression.
Expression quantitative trait loci (eQTL) analysis identified 19 significant CLL/SLL-associated SNPs linked to differential expression of SP140 on chromosome 2 (A), and DACT3 and GNG8 on chromosome 19 (BH<0.20) (B). eQTL SNPs are depicted on a partial chromosome map that includes the differentially expressed gene(s). Chromosome locations are based on chromosome build 37.1 GRCh37.