| Literature DB >> 30678091 |
Alina S Ustiugova1,2, Kirill V Korneev3,4, Dmitry V Kuprash5,6, And Marina A Afanasyeva7.
Abstract
Genome-wide association studies (GWASes) revealed several single-nucleotide polymorphisms (SNPs) in the human 17q12-21 locus associated with autoimmune diseases. However, follow-up studies are still needed to identify causative SNPs directly mediating autoimmune risk in the locus. We have chosen six SNPs in high linkage disequilibrium with the GWAS hits that showed the strongest evidence of causality according to association pattern and epigenetic data and assessed their functionality in a local genomic context using luciferase reporter system. We found that rs12946510, rs4795397, rs12709365, and rs8067378 influenced the reporter expression level in leukocytic cell lines. The strongest effect visible in three distinct cell types was observed for rs12946510 that is predicted to alter MEF2A/C and FOXO1 binding sites.Entities:
Keywords: Crohn’s disease; enhancer; luciferase reporter assay; multiple sclerosis; post-GWAS; primary biliary cirrhosis; regulatory variant; rheumatoid arthritis; type 1 diabetes; ulcerative colitis
Mesh:
Substances:
Year: 2019 PMID: 30678091 PMCID: PMC6409600 DOI: 10.3390/genes10020077
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Primer sequences used for enhancer cloning.
| SNP | Enhancer (Chr17 Hg19 Coordinates) | Length, bp | Forward Primer (5′-3′) | Reverse Primer (5′-3′) |
|---|---|---|---|---|
| rs12946510 | 37912040- | 1057 | TTTggatccAATGCAATTCCAGTGGGGGT | TTTgtcgacGCCCCTCAGTAGCTGGTTTT |
| rs2313430 | 37929651- | 718 | TTTggatccGGACATCAGGCCTTTGGGAA | TTAgtcgacTGAACTGGGGAAGAGGGACA |
| rs4795397 | 38023183- | 1705 | AAAggatccTGGTTAAGTCTCCTCTCATAGGATT | AAAgtcgacTAAAACGCGGGCATTGGACT |
| rs12709365/rs13380815 | 38026822- | 3029 | TTTagatctAAGGACTTCAGACGAGCGTT | TTTgtcgacTACTCCAGCTCTCTTTTGAGAA |
| rs8067378 | 38050770- | 1448 | TTTggatccCACCTGCTCCTGTCTGATGC | TTTgtcgacGAGCCCATTGCAAGCAGTCT |
Primers used for the cloning of the size-matched control sequences.
| Control For | Hg19 Coordinates | Length, bp | Forward Primer (5′-3′) | Reverse Primer (5′-3′) |
|---|---|---|---|---|
| rs2313430 | chr17:38087769- | 713 | TTTggatccATACAGTGATTGCATTTGCTTCG | TTTgtcgacTGATCATCGCCATCTTCATTTACTT |
| rs12946510 | chr10:6101364- | 909 | AAggatccGCTGTACCCAGTGCGTAG | TATgtcgacTACTGCAAAGTGGCTATGAAG |
| rs4795397/rs806737 | chr17:38087831- | 1566 | AAAggatccGAGCCATGAGGTGATAATTATGGAA | AAAgtcgacATGAAAAAGATCACCCTAAATCCCT |
| rs12709365/rs13380815 | chr18:69576417- | 3013 | ATTTggatccTGGAAGTTCAGTGAGTGTGTC | TATTgtcgacTCCTCATGCTTCCGGTTGTC |
* contained the minor allele (T) of rs12722489 making the sequence non-active as enhancer.
Primers used for PCR mutagenesis.
| SNP | Forward primer (5′-3′) | Reverse primer (5′-3′) |
|---|---|---|
| rs12946510 | GAGTTAAAAA | CTGTGGTTTT |
| rs2313430 | GAGATCTTTT | GAACATGAA |
| rs4795397 | GAAAAGGCCA | TGGAGCCCGA |
| rs12709365 | CCTTGGAACATA | ATTAATAATAC |
| rs13380815 | ATGACAGAATT | CCACAAAATCT |
| rs8067378 | CGTTATAAAT | TTTTTCCC |
Electroporation parameters used for transfection.
| Cell Line | Pulse Voltage, V | Pulse Width, ms | Pulse Number |
|---|---|---|---|
| MP1 | 1300 | 30 | 1 |
| Jurkat | 1350 | 10 | 3 |
| Nalm6 | 1300 | 30 | 1 |
| MT-2 | 1400 | 30 | 1 |
| U-937 | 1400 | 30 | 1 |
Figure 1Candidate functional SNPs in the human autoimmunity-associated locus 17q12-21 and the corresponding putative enhancers. (a) University of California Santa Cruz (UCSC) Genome Browser [51,52] view of the candidate functional SNPs (black lines) associated with autoimmune diseases in the 17q12-21 locus and the corresponding putative enhancers (highlighted in green). Histone acetylation tracks provided by ENCODE [53,54] illustrate enhancer activity in a variety of primary human leukocytes. Gene schemes are shown in purple. Thin lines correspond to introns, thicker parts correspond to exons with coding sequences being the thickest. Direction of transcription is indicated by arrows next to gene names. DHS—DNaseI hypersensitivity clusters in 125 cell types from ENCODE (V3). TF binding—ChIP-seq peaks for 161 transcription factors in 91 cell types from ENCODE. (b) Linkage disequilibrium between candidate SNPs (r2 in CEU (Utah Residents with Northern and Western European Ancestry) population according to 1000 Genomes Phase 3 [44,55]. (c) The design of reporter vectors visualized using SnapGene software (from GSL Biotech; available at snapgene.com). The construct used for rs12946510 is shown as an example. Genomic segments highlighted in (a) were inserted downstream of the luciferase gene placed under the synthetic minimal promoter (mP) from the pGL4.24 vector. Two allelic variants were created to test each candidate SNP.
Association data for the candidate functional single-nucleotide polymorphisms (SNPs) in the 17q12-21 locus.
| Fine-Mapping Study | Candidate SNP | Risk Allele | RAF in European Population (KGph3) | Ancestral/Alternative Allele | Index SNP | GWAS (Ref. #) | Associated Disease | Association | OR |
|---|---|---|---|---|---|---|---|---|---|
| Farh et al., 2015; Schmiedel et al., 2016 | rs12946510 | T | 0,47 | C/T | rs12946510 | [ | Multiple sclerosis | 2.90 × 10−9 | 1.07 |
| Schmiedel et al., 2016 | rs2313430 | T | 0,52 | T/C | rs2305480 | [ | Ulcerative colitis | 3.01 × 10−8 | 1.15 |
| Schmiedel et al., 2016 | rs4795397 | G | 0,48 | A/G | rs2305480 | [ | Ulcerative colitis | 3.01 × 10−8 | 1.15 |
| Farh et al., 2015 | rs12709365 | G | 0,47 | A/G | rs2872507 | [ | Ulcerative colitis | 5 × 10−11 | 1.15 |
| [ | Crohn’s disease | 5 × 10−9 | 1.12 | ||||||
| [ | Crohn’s disease | 2 × 10−9 | 1.14 | ||||||
| [ | Type 1 diabetes autoantibodies | 2 × 10−6 | 1.10 | ||||||
| [ | Rheumatoid arthritis | 9 × 10−7 | 1.10 | ||||||
| rs12936409 | [ | Rheumatoid arthritis | 2.8 × 10−9 | 1.10 | |||||
| Farh et al., 2015 | rs13380815 | G | 0,47 | A/G | See above | ||||
| Farh et al., 2015 | rs8067378 | G | 0,51 | G/A | rs8067378 | [ | Ulcerative colitis | 9.74 × 10−8 | 1.12 |
| rs8067378 | [ | Primary biliary cirrhosis | 6.05 × 10−14 | 1.26 | |||||
KGph3 = phase 3 of the 1000 Genomes Project [44]; RAF = risk allele frequency; OR = odds ratio; GWAS = genome-wide association study (GWAS).
Association of studied variants with expression levels of genes located within 100 kb distance from the candidate functional SNPs.
| Gene | Candidate SNP | Cell Type | Evidence Type * | Reference |
|---|---|---|---|---|
| STARD3 | rs13380815/ | primary monocytes (24 h LPS-stimulated; IFNγ-stimulated; naïve) | by LD (r2 = 1) | [ |
| PGAP3 | rs2313430/ | primary monocytes (2 h LPS-stimulated) | by LD (r2 = 1) | [ |
| rs12709365/ | primary monocytes (24 h LPS-stimulated) | by LD (r2 = 1) | [ | |
| rs2313430/ | primary monocytes (IFNγ-stimulated) | by LD (r2 = 1) | [ | |
| rs12709365/ | naïve primary monocytes | by LD (r2 = 1) | [ | |
| IKZF3 | rs2313430 | naïve primary monocytes | by LD (r2 = 1) | [ |
| rs2313430/ | whole blood (meta-analysis) | by LD (r2 = 1) | [ | |
| rs13380815/ | whole blood | by LD (r2 = 0.967) | [ | |
| rs8067378/ | whole blood | by LD (r2 = 0.875) | [ | |
| rs12946510 | whole blood | direct data | [ | |
| ZPBP2 | rs2313430/ | lymphoblastoid cell lines | by LD (r2 = 0.967) | [ |
| GSDMB | all | whole blood, spleen, EBV-immortalized B cells (except rs12946510) | direct data | GTEx V7 [ |
| rs8067378 | whole blood | direct data | [ | |
| rs12709365/ | whole blood | by LD (r2 = 1) | [ | |
| rs8067378 | EBV-immortalized B cells | direct data | [ | |
| rs2313430/ | primary peripheral blood CD4+ lymphocytes | by LD (r2 = 1) | [ | |
| rs8067378/ | whole blood, meta-analysis | by LD (r2 = 1) | [ | |
| rs8067378 | lymphocytes (inferred) | direct data | [ | |
| rs2313430/ | whole blood | by LD (r2 = 1) | [ | |
| ORMDL3 | all | whole blood, spleen, EBV-immortalized B cells | direct data | GTEx V7 [ |
| rs13380815/ | whole blood | by LD (r2 = 1) | [ | |
| rs8067378 | EBV-transformed B cell lines | direct data | [ | |
| rs13380815/ | primary peripheral blood CD4+ lymphocytes | by LD (r2 = 0.874) | [ | |
| rs8067378/ | primary monocytes (24 h LPS-stimulated) | by LD (r2 = 0.091) | [ | |
| rs8067378/ | naïve primary monocytes | by LD (r2 = 1) | [ | |
| rs4795397 | whole blood, meta-analysis | direct data | [ | |
| rs8067378 | lymphocytes (inferred) | direct data | [ | |
| rs2313430/ | whole blood | by LD (r2 = 1) | [ | |
| rs2313430/ | lymphoblastoid cell lines | by LD (r2 = 1) | [ | |
| GSDMA | rs8067378/ | primary monocytes (2 h and 24 h LPS-stimulated; IFNγ-stimulated) | by LD (r2 = 1) | [ |
| PSMD3 | rs12709365/ | primary monocytes (24 h LPS-stimulated) | by LD (r2 = 0.874) | [ |
* “Direct data” type of evidence indicates that a candidate SNP was present on the genotyping chip used in the corresponding study. Otherwise, we looked for expression quantitative trait loci (eQTLs) in strong linkage disequilibrium (LD; r2 > 0.8 according to the 1000 Genomes Pilot Project [44]) with any of the candidate SNPs. We provide here the variant in the highest LD with an index eQTL, or a pair of variants if they are equally associated. LPS = lipopolysaccharide; IFNγ = interferon gamma; EBV = Epstein–Barr virus.
Cell lines used for transfection.
| Name | Origin | Phenotype | References |
|---|---|---|---|
| Nalm6 | Pre-B cells cultured from the blood of a patient with non-T, non-B acute lymphoblastic leukemia | B cell precursors | [ |
| MP1 | EBV-transformed peripheral B lymphocytes | Mature IgM-producing B cells | [ |
| Jurkat | Cells derived from the peripheral blood of a patient with acute lymphoblastic leukemia | T helpers | [ |
| MT-2 | Human T cell leukemia virus type 1 (HTLV-1) infected leukocytes from cord blood | Regulatory T cells | [ |
| U-937 | Monocytic cell line derived from diffuse histiocytic lymphoma | Monocytes | [ |
Figure 2Effects of the candidate functional SNPs on enhancer activity in luciferase reporter assay. Two allelic variants of putative enhancers containing each tested SNP were cloned into modified pGL3 vector downstream of the firefly luciferase gene. A panel of leukocytic cell lines were transfected with these reporter constructs together with pRL. Luminescence was measured 24 h after. Relative firefly to Renilla signal was normalized to the value obtained from the promoter-only control. White boxes—protective alleles; black boxes—risk alleles; green boxes—size-matched negative controls. N ≥ 6. Mean ± SEM. * p < 0.05 for the difference between alternative alleles.
Figure 3Polymorphism rs12946510 alters MEF2A/C and FOXO1 binding sites. (a) University of California Santa Cruz (UCSC) Genome Browser [51,52] view of the putative enhancer surrounding rs12946510 (highlighted in blue) with selected ENCODE [53,54] regulation tracks and conservation among 100 vertebrates. Histone modification H3K27ac is indicative of an active enhancer and shows a local drop in the area of transcription factors binding. DNase I hypersensitivity (DHS) density signal corresponds to open chromatin and shows similar profile in primary cells and lymphoblastoid cell line GM12878, for which ChIP-seq signals of MEF2A and MEF2C binding are available. (b) DNA sequence surrounding rs12946510 and ChIP-seq based motif logos for MEF2A, MEF2C, and FOXO1 according to JASPAR 2018 [65]. C to T substitution dramatically alters motif P-value for MEF2A and moderately for MEF2C and FOXO1 (fold-change is given according to PERFECTOS-APE [63,64]).