Literature DB >> 23160276

Identification of a functional variant in the KIF5A-CYP27B1-METTL1-FAM119B locus associated with multiple sclerosis.

Antonio Alcina1, Maria Fedetz, Oscar Fernández, Albert Saiz, Guillermo Izquierdo, Miguel Lucas, Laura Leyva, Juan-Antonio García-León, María Del Mar Abad-Grau, Iraide Alloza, Alfredo Antigüedad, María J Garcia-Barcina, Koen Vandenbroeck, Jezabel Varadé, Belén de la Hera, Rafael Arroyo, Manuel Comabella, Xavier Montalban, Natalia Petit-Marty, Arcadi Navarro, David Otaegui, Javier Olascoaga, Yolanda Blanco, Elena Urcelay, Fuencisla Matesanz.   

Abstract

BACKGROUND AND AIM: Several studies have highlighted the association of the 12q13.3-12q14.1 region with coeliac disease, type 1 diabetes, rheumatoid arthritis and multiple sclerosis (MS); however, the causal variants underlying diseases are still unclear. The authors sought to identify the functional variant of this region associated with MS.
METHODS: Tag-single nucleotide polymorphism (SNP) analysis of the associated region encoding 15 genes was performed in 2876 MS patients and 2910 healthy Caucasian controls together with expression regulation analyses.
RESULTS: rs6581155, which tagged 18 variants within a region where 9 genes map, was sufficient to model the association. This SNP was in total linkage disequilibrium (LD) with other polymorphisms that associated with the expression levels of FAM119B, AVIL, TSFM, TSPAN31 and CYP27B1 genes in different expression quantitative trait loci studies. Functional annotations from Encyclopedia of DNA Elements (ENCODE) showed that six out of these rs6581155-tagged-SNPs were located in regions with regulatory potential and only one of them, rs10877013, exhibited allele-dependent (ratio A/G=9.5-fold) and orientation-dependent (forward/reverse=2.7-fold) enhancer activity as determined by luciferase reporter assays. This enhancer is located in a region where a long-range chromatin interaction among the promoters and promoter-enhancer of several genes has been described, possibly affecting their expression simultaneously.
CONCLUSIONS: This study determines a functional variant which alters the enhancer activity of a regulatory element in the locus affecting the expression of several genes and explains the association of the 12q13.3-12q14.1 region with MS.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 23160276      PMCID: PMC3538279          DOI: 10.1136/jmedgenet-2012-101085

Source DB:  PubMed          Journal:  J Med Genet        ISSN: 0022-2593            Impact factor:   6.318


Introduction

Multiple sclerosis (MS) is a chronic autoimmune disease with a complex pathogenesis in which demyelination and neurodegeneration are the main contributors to disability.1 Susceptibility to MS is thought to be conferred by a combination of genetic and environmental factors.2 The best characterised region implicated in predisposition to MS is the Major Histocompatibility Complex on chromosome 6p21, specifically the HLA-DRB1* 15:01 class II allele, but this accounts for less than 50% of MS heritability.3 New powerful methods as gene expression profiling and genome-wide association study (GWAS) have provided evidence for new susceptibility loci implicated in MS.4–6 These comprise over 40 loci including IL2RA, CBLB, IL7R, CLEC16A, TNFRSF1A, PTGER4 and IRF8.7–9 For many risk loci, the association signals do not directly implicate a single gene and the causative role for candidate genes in the region can only be speculated. A striking number of loci have demonstrated genome-wide evidence for association in multiple, distinct autoimmune disorders.10 11 One of these loci is 12q13–14 which has been associated in GWAS for rheumatoid arthritis (RA), coeliac disease (CD) and MS.8 12–14 However, different genes have been suggested in each study based on the main associated signal. A meta-analysis of two published GWAS totalling 3393 RA cases and 12 462 healthy controls identified an association at rs1678542 localised in the KIF5A intronic region.13 Another meta-analysis of two published GWAS on CD (4533 cases and 10 750 controls) and RA (5539 cases and 17 231 controls) described the association of both diseases at rs10876993 localised in the intergenic region between B4GALNT1 and OS9 genes.14 Association at this locus was also described in a MS GWAS performed by the Australian and New Zealand Multiple Sclerosis Genetics Consortium in 1618 MS-cases. In this case, an associated single nucleotide polymorphism (SNP) (rs703842) was located at the 3' untranslated region (3' UTR) of the METTL1 gene.12 The last GWAS performed by the International Multiple Sclerosis Genetics Consortium with 10 000 MS patients also reported the association with this region at rs12368653 in theAGAP2 gene.8 In candidate gene studies, the KIF5A variant was demonstrated to be associated to MS15 and type 1 diabetes.16 Also rare variants in the CYP27B1 gene have been associated with MS.17 Other candidate-gene studies had demonstrated association of variants located at the CYP27B1 gene with type 1 diabetes18 and MS.19 The FAM119B gene has been pinpointed as the causal gene of the locus since its expression has been shown to be much lower in leukocytes of MS patients carrying the MS risk allele.20 On the other hand, since, in several studies, MS patients have lower levels of precursors of vitamin D in serum compared with controls, it has been suggested that CYP27B1, an enzyme implicated in Vitamin D activation, may be the causal gene.19 In this work we performed a fine mapping of the 12q13.3–12q14.1 region by a Tag-SNP approach to identify the polymorphism leading the association with MS. We also performed functional studies to determine the involvement of the associated MS risk variants in gene expression.

Material and methods

Study subjects and SNP genotyping

The entire data set studied consisted of 2876 MS patients diagnosed following Poser's criteria21 and 2910 age and gender matched healthy controls, mostly blood donors and staff, matched by age, gender, ethnicity and place of recruitment. Patients and control subjects were included in the study based upon written informed consent. Demographic and clinical characteristics of the sample collections have been published before15 22 (a summary is in the online supplementary table S1). SNPs were genotyped using the iPLEX Sequenom MassARRAY platform in the Spanish National Genotyping Center's facilities at Santiago de Compostela University (http://www.cegen.org). All DNAs ware received in the Centro Nacional de Genotipado (CEGEN) centre for genotyping in 96-well plates; the concentration of double-stranded DNA was assessed using PicoGreen (Molecular Probes). As quality control, two trios of Center for the Study of Human Polymorphisms (CEPH) samples were genotyped (CEPH Na10830, Na10831, Na12147, Na10860, Na10861, Na11984). The genotypes of these samples corresponded with the ones deposited in HapMap with no detection of Mendelian inconsistencies. Additionally, a 10% of random samples were subjected to regenotyping. The resulting data were concordant with an average accuracy over 99.9%. We performed quality filtering of both samples and SNPs to ensure robust association testing.

Ethics statement

All participants provided a written informed consent to participate in this study, which was approved by the Institutional Review Board of Hospital Regional Universitario Carlos Haya of Malaga, Hospital Clinic of Barcelona, Hospital Virgen Macarena of Sevilla, Hospital de Basurto of Bilbao, Hospital Clínico San Carlos of Madrid, Hospital Universitari Vall d´Hebron of Barcelona and Hospital Donostia of San Sebastian.

Luciferase assays

Six fragments in both orientations with potential regulatory activity, as indicated by ENCODE, containing the tagged variants, 24 inserts in total, plus a plasmid without insert, were assayed for luciferase activity. Thus, fragments I–VI were PCR amplified (the coordinates and the primer used are indicated in online supplementary table S2) using DNA from homozygous individual for each allele of rs6581155. The fragments were previously cloned in Vaccinia DNA Topoisomerase I (TOPO) T/A vector (Invitrogen) and sequenced. Then, each clone was introduced in forward and reverse orientation in pGL4.25 (luc2CP/minP) vector which contains a minimal promoter upstream of the luciferase reporter gene luc2CP. Triplicates were transfected into human Raji cells (ATCC CCL86). The Raji cell line was maintained in Roswell Park Memorial Institute (RPMI) 1640 medium, supplemented with 10% fetal calf serum, 2.0 mM glutamine, 10 mM 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES), 1 mM streptomycin, 100 U/ml penicillin, 0.05 mM 2-mercaptoethanol. Exponentially growing cells were transfected by nucleofection Amaxa Cell Line Nucleofector Kit V (Lonza) according to manufacture protocols. Briefly, Raji cells (2×106) were mixed with I–VI plasmid constructs or pGL4.25 control plasmid 2 μg of DNA and 0.2 μg of DNA of Renilla luciferase reporter gene plasmid pRL-TK(Promega) for normalising transfection efficiency. The transfected cells were cultured in 12 well-plates for 24 h. Cells were harvested and washed twice in Phosphate Buffered Saline (PBS) at 4°C. Luciferase activity was evaluated using Dual-Luciferase Assay System (Promega) according to manufacturers. For analysis, lysates were thawed on ice and the protein concentration was measured by the Bradford method. Then, 40 μg of protein from all supernatants were analysed for both firefly and Renilla luciferase activities in a luminometer F12 (Berthold Detection Systems). The following equation was used to determine the normalised fold changes in activity between tested groups: The normalised fold changes in activity from each of the three experiments are then averaged together.

Statistical analysis

Departure from Hardy-Weinberg equilibrium for all the biallelic SNP markers was tested using an exact test.23 Raw data were analysed for comparison of allele and genotype counts using PLINK V.1.05. To avoid false-positive results due to multiple testing and considering that the SNPs analysed are not in complete disequilibrium, we applied the Benjamini-Hochberg method24 that is robust against positive dependence and controls the false discovery rate.25 In order to determine the effect and independence of variant association with MS, multiple logistic regression models were computed. Linkage disequilibrium (LD) patterns between SNPs were analysed with Haploview 4.2.26 Gender was analysed for potential differential risk contribution. We used the Breslow-Day Test to evaluate the homogeneity between gender cohorts. The age of onset with respect to the genotypes was analysed by the Kruskal-Wallis test.

Results

A tag-SNP association study of the region

We performed the fine mapping of the 12q13.3–12q14.1 region between coordinates 56227468 to 56469005 (B36) (figure 1). To analyse this region, 16 SNPs were chosen by pairwise tagging from the HapMap B36 Utah residents with Northern and Western European ancestry from the CEPH collection (CEU) population which captured 89 markers with r2≥0.8 (mean r2=0.93) and a minor-allele frequency ≥0.1. These tag-SNPs captured the variants associated with RA, MS and CD previously reported in different GWAS (see online supplementary table S3).
Figure 1

Scheme of the 12q13.3–12q14.1 region. The illustration shows from top to bottom the following parts as indicated : Scale and chromosomal position (B36/gh18); (in blue) diseases-associated variants as reported in different genome-wide association study (RA, rheumatoid arthritis; CD, coeliac disease; MS, multiple sclerosis); Tag-SNPs analysed in the present work; (in red) MS-associated rs6581155-tagged variants with r2≥0.8 and a minor-allele frequency <0.1; Ref-Seq genes present in the region; the LD structure represented by r-square for the CEU population.

Scheme of the 12q13.3–12q14.1 region. The illustration shows from top to bottom the following parts as indicated : Scale and chromosomal position (B36/gh18); (in blue) diseases-associated variants as reported in different genome-wide association study (RA, rheumatoid arthritis; CD, coeliac disease; MS, multiple sclerosis); Tag-SNPs analysed in the present work; (in red) MS-associated rs6581155-tagged variants with r2≥0.8 and a minor-allele frequency <0.1; Ref-Seq genes present in the region; the LD structure represented by r-square for the CEU population. Tag-SNPs were genotyped in 2895 MS patients and 2942 healthy controls, mostly blood donors, matched by age, gender, ethnicity and place of recruitment. The genotypic distribution between cases and controls is shown in table 1. The genotype frequencies were fitted to an additive model. The top associated SNP was rs6581155 (p=4.57E−7; OR=0.79; 95% CI 0.73 to 0.87). We performed a logistic regression analysis to determine the effect of further independent variants, by testing the addition of each SNP to rs6581155 and by adding this polymorphism to each one of the other Tag-SNPs (table 2). The results showed that, in addition to generate the strongest signal, rs6581155 alone was sufficient to model the association in the locus.
Table 1

Association study of the different Tag-SNPs with multiple sclerosis

SNPAllelesMultiple sclerosisHealthy controlsAllelic p value
12111222111222UncorrectedBenjamini-HochbergAdjustedPer allele OR (95% CI)
rs1678542CG248 (9)1149 (41.7)1361 (49.3)318 (11.2)1229 (43.3)1291 (45.5)0.00050.00170.87 (0.8 to 0.94)
rs2888334CG134 (4.7)946 (33.2)1770 (62.1)166 (5.7)1030 (36.9)1676 (58.4)0.0020.0040.87 (0.8 to 0.95)
rs775249TC181 (6.4)1047 (36.8)1619 (56.9)181 (6.3)1100 (38.2)1601 (55.5)0.440.510.97 (0.89 to 1.05)
rs1678536GC129 (4.5)922 (32.4)1796 (63.1)163 (5.7)1013 (35.2)1701 (59.1)0.0010.00230.86 (0.79 to 0.94)
rs7313599GA752 (26.4)1399 (49.1)696 (24.5)660 (23)1430 (49.7)784 (27.3)0.00080.00211.13 (1.05 to 1.22)
rs1078109AG6 (0.2)366 (12.9)2472 (86.9)8 (0.28)306 (10.6)2568 (89.1)0.0180.0261.2 (1.03 to 1.4)
rs7305391AG128 (4.5)908 (31.9)1813 (63.6)119 (4.1)967 (33.6)1792 (62.3)0.50.5370.97 (0.89 to 1.06)
rs1564374GA477 (17.2)1383 (49.9)909 (32.8)570 (20.9)1349 (49.4)812 (29.7)0.000370.00150.87 (0.81 to 0.94)
rs11172290GA58 (2)735 (25.8)2052 (72.1)73 (2.5)726 (25.2)2082 (72.2)0.790.790.99 (0.89 to 1.09)
rs3825078AT176 (6.2)1115 (39.1)1558 (54.7)238 (8.2)1149 (40)1497 (51.9)0.0030.0060.88 (0.81 to 0.96)
rs701006AG222 (7.8)1225 (43)1403 (49.2)318 (11)1266 (44)1295 (45)0.0000140.000080.84 (0.78 to 0.91)
rs2301551GC76 (2.7)867 (30.5)1897 (66.8)130 (4.5)967 (33.7)1769 (61.7)3.22E−60.000030.8 (0.73 to 0.88)
rs12307841CT26 (0.9)518 (18.1)2311 (80.9)28 (0.97)469 (16.3)2386 (82.7)0.10970.151.11 (0.98 to 1.25)
rs12368653GA430 (15.1)1403 (49.2)1016 (35.7)502 (17.5)1413 (49.2)955 (33.3)0.0090.0150.91 (0.84 to 0.98)
rs1048691TC135 (4.8)958 (33.8)1738 (61.4)127 (4.4)940 (32.7)1805 (62.8)0.240.291.06 (0.96 to 1.15)
rs6581155GA110 (4)909 (32.7)1757 (63.3)173 (6.2)1002 (36.2)1596 (57.6)4.57E−77.3E−60.79 (0.73 to 0.87)

Counts (percentages in brackets) are represented for each genotype.

Table 2

Regression analyses of the 16 Tag SNPs

SNPA1*Add locus to rs6581155†Add rs6581155 to locus
pOR (95% CI)pOR (95% CI)
rs1678542C0.22230.94 (0.86 to 1.04)0.00013930.82 (0.74 to 0.91)
rs2888334C0.84870.99 (0.89 to 1.1)0.00010010.8 (0.72 to 0.9)
rs775249T0.17190.94 (0.86 to 1.03)3.99E−070.79 (0.72 to 0.87)
rs1678536G0.79640.99 (0.88 to 1.1)9.86E−050.8 (0.72 to 0.9)
rs7313599G0.14791.06 (0.98 to 1.15)6.20E−050.82 (0.74 to 0.9)
rs1078109A0.093911.15 (0.98 to 1.35)1.42E−060.8 (0.73 to 0.88)
rs7305391A0.26150.95 (0.87 to 1.04)6.29E−070.8 (0.73 to 0.87)
rs1564374G0.24870.95 (0.86 to 1.04)0.0007590.83 (0.74 to 0.93)
rs11172290G0.6340.98 (0.88 to 1.08)4.10E−070.79 (0.73 to 0.87)
rs3825078A0.5891.03 (0.93 to 1.14)1.36E−050.78 (0.7 to 0.87)
rs701006A0.47270.96 (0.85 to 1.08)0.0030560.82 (0.72 to 0.94)
rs2301551G0.15870.88 (0.75 to 1.05)0.10690.88 (0.75 to 1.03)
rs12307841C0.62061.03 (0.91 to 1.17)1.23E−060.8 (0.73 to 0.87)
rs12368653G0.55731.03 (0.93 to 1.14)1.74E−050.78 (0.69 to 0.87)
rs1048691T0.7951.01 (0.92 to 1.11)1.80E−060.8 (0.72 to 0.87)
rs6581155GNANANANA

Results for models assuming additive effects.

*Denotes results adjusted for rs6581155.

†Denotes results adjusted for locus tested.

NA, not applicable.

Association study of the different Tag-SNPs with multiple sclerosis Counts (percentages in brackets) are represented for each genotype. Regression analyses of the 16 Tag SNPs Results for models assuming additive effects. *Denotes results adjusted for rs6581155. †Denotes results adjusted for locus tested. NA, not applicable. Stratification by gender and age at onset and genotypes of rs6581155 SNP showed absence of statistical significance. Moreover, we genotyped the rare variant rs118204009 located in the CYP27B gene that has been recently associated with MS in a subgroup of 1006 patients and 1044 healthy controls.17 We did not detect this polymorphism in any individual of the MS or control groups (data not shown).

Association of rs6581155-tagged SNPs with gene transcription levels

To determine if selected Tag-SNPs could have a functional effect on the transcription levels of the genes located in the 12q13.3–12q141 region, we first used expression quantitative trait loci (eQTL) data from Zeller et al27 obtained from monocyte samples of 1490 German individuals (http://genecanvas.ecgene.net/news.php?readmore=45). There were 23 eQTLs for FAM119B, 6 for AVIL, 18 for XRCC6BP1, 18 for TSFM and 20 for TSPAN31 genes in the whole region. The variants best correlated with expression of FAM119B, AVIL, TSFM and TSPAN31were also SNPs tagged by rs6581155 and, therefore, in high LD with it (r2>0.8) (table 3). The eQTLs for the XRCC6BP gene were in low LD with rs6581155 (r2=0.18) indicating that they were not involved in MS risk. Interestingly, the minor alleles for the rs6581155-tagged SNPs, which were protective for MS, correlated with high expression of FAM119B and AVIL but low expression of TSFM and TSPAN31 (see online supplementary figure S1) although there were large differences between the correlation coefficients obtained for each gene. FAM119B was the gene most strongly correlated with the rs6581155-tagged SNPs in monocytes.
Table 3

rs6581155 tagged SNPs that have been reported to be eQTLs in the region

StudyZeller et al27Dixon et al28Montgomery et al29Stranger et al30Dimas et al31
Population1490 German individuals206 families of British descent60 CEU individuals of the HapMap collection60 individuals of the HapMap collection CEU80 individuals of Western European origin
GeneAVILXRCC6BP1TSFMFAM119BTSPAN31FAM119BTSFMCYP27B1FAM119BFAM119BFAM119BFAM119BFAM119B






Cell typeMonocytesLCLLCLLCLT-cellsLCLFibroblasts






Probe/TranscriptILMN_20148ILMN_16915ILMN_27946ILMN_12426ILMN_11441213861_s_at212656_atENST00000228606ENST00000300209GI_24308058-SILMN_1723846ILMN_1723846ILMN_1723846





p Value/effectp Value/effectp Value/effectp Value/effectp Value
SNP
rs65811552.1E−10/−0.57
rs107477835.00E−21/−0.7319.00E−13/0.5647.5E−11/−0.58
rs9238295.00E−21/−0.7319.00E−13/0.5646E−11/−0.58
rs104315522.80E−21/−0.7451.00E−12/0.5692.1E−10/−0.57
rs20720527.70E−19/−6891.00E−13/0.5885E−10/−0.56
rs22916174.10E−11/−0.627.60E−11/0.6266E−11/−0.58
rs46465361.7E−07/−0.0215E−38/0.117.71E−19/0.0553.34E−236/−0.521.4E−11/0.0332.1E−10/−0.57
rs7248346E−11/−0.58
rs20695021.84E−08/−0.0244.4E−38/0.116.19E−17/0.0497.78E−221/−0.52.29E−12/0.0367.7E−19/−0.6891.40E−13/0.5835.3E−10/−0.56
rs108770138.80E−20/−0.7111.80E−12/0.564.1E−10/−0.560.000012/−0.4810.000292/−0.4070.0000001/−0.571
rs108770118.90E−14/−0.6129.00E−13/0.60.00014157/−0.4720.00013264/−0.4749.5E−11/−0.57
rs7010082.30E−12/−0.5632.30E−11/0.551.6E−8/−0.51
rs81816441.2E−07/−0.0223.8E−38/0.111.4E−19/0.0573.9E−234/−0.525.5E−12/0.0342.1E−10/−0.57
rs108770194.2E−07/−0.0236.31E−38/0.111.024E−190.0588.32E−233/−0.521.52E−11/0.0342.1E−10/−0.57
rs111723352.1E−10/−0.57
rs7038421.67E−07/−0.0212.42E−37/0.111.42E−18/0.0547.5E−235/−0.521.41E−11/0.0377.80E−17/−0.6942.50E−11/0.5682.1E−10/−0.57
rs108770152.1E−10/−0.57
rs111723331.11E−07/−0.0237.31E−39/0.111.74E−19/0.0572.07E−233/−0.515.63E−12/0.0352.1E−10/−0.57

Zeller et al Genome-Wide Expression analysis performed with Illumina HT-12 v3 BeadChip. Association significance was obtained by ANalysis Of VAriance (ANOVA).

Dixon et al Genome-Wide Expression analysis performed by U133 Plus 2.0 GeneChips (Affymetrix). Association analysis was obtained by fitting a simple regression model to each trait and used a variance component approach to account for correlation between different observed phenotypes within each family.

Montgomery et al Genome-Wide Expression analysis performed by RNA sequencing in an Illumina GAII. Association analysis and multiple testing corrections were conducted by spearman rank correlation and permuting the expression phenotype 10 000 times and summarising the extreme p value distribution for each particular exon, transcript and probe.

Stranger et al Genome-Wide Expression analysis performed with Sentrix Human-6 Expression BeadChip V.1, Illumina. Association significance was obtained by spearman rank correlation with 10 000 permutations of the expression phenotypes.

Dimas et al Gene expression was performed by commercial whole-genome expression array (WG-6 V.3 Expression BeadChip, Illumina). Association analysis and multiple testing corrections were conducted by spearman rank correlation. Thresholds for each gene were assigned after 10 000 permutations of expression values relative to genotypes where the haplotypic (LD) structure was maintained in each interaction.

eQTL, expression quantitative trait loci; LCL, lymphoblastic cell lines.

rs6581155 tagged SNPs that have been reported to be eQTLs in the region Zeller et al Genome-Wide Expression analysis performed with Illumina HT-12 v3 BeadChip. Association significance was obtained by ANalysis Of VAriance (ANOVA). Dixon et al Genome-Wide Expression analysis performed by U133 Plus 2.0 GeneChips (Affymetrix). Association analysis was obtained by fitting a simple regression model to each trait and used a variance component approach to account for correlation between different observed phenotypes within each family. Montgomery et al Genome-Wide Expression analysis performed by RNA sequencing in an Illumina GAII. Association analysis and multiple testing corrections were conducted by spearman rank correlation and permuting the expression phenotype 10 000 times and summarising the extreme p value distribution for each particular exon, transcript and probe. Stranger et al Genome-Wide Expression analysis performed with Sentrix Human-6 Expression BeadChip V.1, Illumina. Association significance was obtained by spearman rank correlation with 10 000 permutations of the expression phenotypes. Dimas et al Gene expression was performed by commercial whole-genome expression array (WG-6 V.3 Expression BeadChip, Illumina). Association analysis and multiple testing corrections were conducted by spearman rank correlation. Thresholds for each gene were assigned after 10 000 permutations of expression values relative to genotypes where the haplotypic (LD) structure was maintained in each interaction. eQTL, expression quantitative trait loci; LCL, lymphoblastic cell lines. The eQTL data from Dixon et al28 obtained in lymphoblastic cell lines (LCL) from 206 families of British descent confirmed the eQTLs previously observed for FAM119B and TSFM, although the correlation coefficients were similar for both genes in contrast to the data from Zeller et al.27 The eQTLs for FAM119B were also reported in three additional data sets carried out in different cell types, as shown in table 3. The data from Montgomery et al,29 obtained by RNA-Seq from LCL of 60 HapMap CEU individuals, have reported a rs6581155-tagged SNP to be eQTLs for FAM119B and CYP27B1, in which the minor allele inversely correlated with high expression of these genes (correlation coefficient (r) for CYP27B1=−0.472, and for FAM119B=−0.474).

Enhancer activity

Since all the rs6581155-tagged polymorphisms were located in intronic and intergenic non-coding regions, another strategy to search for functionality was to use annotations of gene regulatory regions from ENCODE (http://genome.ucsc.edu/cgi-bin/hgTrackUi?hgsid=173506627&c=chr8&g=wgEncodeReg). Six out of 18 captured SNPs were located in remarkable regions associated with histone methylation, binding sites of transcription factors or in DNAse I hypersensitive site as shown in figure 2A.
Figure 2

Enhancer-suppressor activity of the rs6581155-tagged variants. (A) Schematic illustration to show the localisation of the six tagged SNPs (in red) in potential regulatory regions (in black, I–VI) as indicated by ENCODE for the GM12878 lymphoblastoid cell line and the genes present in the region: Enhancer H3K4Me1 track shows where modification of histone proteins is suggestive of enhancer; Promoter H3K4Me3 track shows a histone mark associated with promoters; Layered H3K4Me3 track shows histone mark associated with promoters that are active or poised to be activated; DNase Clusters track shows regions where chromatin is hypersensitive to DNase I enzyme; Txn Factor ChIP track shows DNA regions where transcription factors bind to DNA as assayed by chromatin immunoprecipitation (ChIP) with antibodies specific to the transcription factor followed by sequencing of the precipitated DNA (ChIP-Seq). (B) Δ Fold luciferase activity respect to the empty vector of the different constructs corresponding to the six regions (I–VI) with potential regulatory activity, containing the rs6581155 tagged polymorphisms transfected into Raji B cells. Four clones for each region bearing the different alleles (allele m, minor and M, major) and in both orientations (Forward and Reverse) from three independent transfection experiments are represented. Luciferase activity levels are referred to the level of the control plasmid containing only the basic promoter and the Renilla activity.

Enhancer-suppressor activity of the rs6581155-tagged variants. (A) Schematic illustration to show the localisation of the six tagged SNPs (in red) in potential regulatory regions (in black, I–VI) as indicated by ENCODE for the GM12878 lymphoblastoid cell line and the genes present in the region: Enhancer H3K4Me1 track shows where modification of histone proteins is suggestive of enhancer; Promoter H3K4Me3 track shows a histone mark associated with promoters; Layered H3K4Me3 track shows histone mark associated with promoters that are active or poised to be activated; DNase Clusters track shows regions where chromatin is hypersensitive to DNase I enzyme; Txn Factor ChIP track shows DNA regions where transcription factors bind to DNA as assayed by chromatin immunoprecipitation (ChIP) with antibodies specific to the transcription factor followed by sequencing of the precipitated DNA (ChIP-Seq). (B) Δ Fold luciferase activity respect to the empty vector of the different constructs corresponding to the six regions (I–VI) with potential regulatory activity, containing the rs6581155 tagged polymorphisms transfected into Raji B cells. Four clones for each region bearing the different alleles (allele m, minor and M, major) and in both orientations (Forward and Reverse) from three independent transfection experiments are represented. Luciferase activity levels are referred to the level of the control plasmid containing only the basic promoter and the Renilla activity. To test their potential activities as enhancers or repressors of gene expression, we cloned the fragments containing these six variants upstream of the basic promoter of a luciferase reporter system and transfected into Raji lymphoblastic B cells. We cloned four inserts for each of the six regions carrying both alleles in forward and reverse orientation. Only region III containing variant SNP rs10877013 had allele-dependent and orientation-dependent enhancer activity. Thus, allele A versus allele G in forward orientation showed 9.5-fold higher ratio of luciferase activity (figure 2B). We observed a 2.7 times higher enhancer activity of the T allele in the reverse fragment. TFSEARCH, Match and TESS software were used to predict potential transcription factor binding sites that could be responsible of the different allele-depending activity of fragment III. The SNP rs10877013 falls within the putative binding sites for the CCAAT/enhancer binding proteins (C/EBP) α and β. Although both alleles appear to be consistent with the consensus sequence for C/EBP TF it is possible that a base change could affect the affinity or isotype binding of the C/EBP transcription factor to the sequence.

Discussion

The 12q13.3–12q14.1 region has been associated to several autoimmune diseases8 12–16 but no causal variant has been found so far for any of them exception made for a rare variant in the coding region of CYP27B1 gene associated with MS.17 This variant was not detected in our sample set. In the present work, by integrating several strategies, we have uncovered the variant most strongly associated with MS in the region. This variant also affects the activity of an enhancer in an allele-dependent and orientation-dependent manner and correlates with the expression of five genes of the locus. Interestingly, the major allele of this SNP (rs10877013) correlates with high luciferase activity and at the same time with high expression of TSFM and TSPAN31 and low expression of FAM119B, CYP27B1 and AVIL. Although it could seem contradictory that one enhancer may differentially affect the expression of different genes, parallel situations have been described extensively in two scenarios: first, the same transcription factor is capable of activating or repressing depending on the context;32 33 second, a single DNA sequence can be bound by different factors with opposite activities.34 On the other hand, we cannot forget the important role of H3K4me preventing aberrant gene expression or modulating transcriptional response. H3K4me turns to mark for both positive and negative regulation. The role of H3K4me2 in histones deacetylation could serve as a fine-tuning regulatory mechanism.35 It has been described that interacting promotes are coexpressed in a tissue-specific manner. Based on the Chromatin Interaction Analysis with Paired-End-Tag sequencing results mapping long-range chromatin interactions associated with RNAPII and H3K4me2,36 37 we observed that the genes in the analysed region are engaged through promoter–promoter, promoter–enhancer interactions forming a multigene complex that could provide a structural framework for cotranscription (figure 3). In this sense, the effect of a polymorphism on one enhancer embedded in this multigene complex could have diverging effects depending on the interaction of this enhancer with the different promoters. Additionally, the interactions could be tissue-type or cell-type specific, as has been shown by Li et al.36 Therefore, it is not evident which of the genes in the region may be responsible for the genetic association with MS, since they seem to be coordinately regulated, and the enhancer containing the MS-associated variant may be affecting either of them; in fact, it acts as an eQTL for all of them. We observed that the Spearman's correlation coefficients (r) between the expression levels of the different genes in this cluster and the rs6581155-tagged SNPs were different in the different assays performed. For instance, in monocytes the best correlation is observed with FAM119B (r=−0.52), however in data obtained from LCL by RNA-seq a medium-high correlation coefficient is found for both CYP27B1 (r=−0.472) and FAM119B (r=−0.474), and with microarrays the correlation coefficient is very similar for TSFM and FAM119B. Based on data from LCL cells, TSFM, FAM119B or CYP27B1 would be candidates, while FAM119B appears to emerge from monocytes. In fact, Gandhi et al20 reported that FAM119B expression level was much lower in MS patients carrying the susceptibility haplotype. We observed that this correlation occurs also in samples from healthy donors as they are the CEU HapMap samples. However, we do not know the precise conditions (cell type, stage of disease, etc) under which expression of these genes could be determinant for the development of MS. For the same reason the variant-specific enhancer activity of fragment III, reported in this work, could vary in different cell-types. Here we have used in the activity assays the same cell-type used in the eQTL analysis to reproduce the effect observed in the expression of these genes. The Raji cells are lymphoblast-like cells derived from B-lymphocyte as the LCL from HapMap. The effect of the enhancer on the gene expression of proximal tubule cells of the kidney and the disease-activated macrophages, which are the major source of CYP27B1,38 could shed light on the involvement of this variant in MS.
Figure 3

Datasets on Long-Range Chromatin Interaction with Paired-End-Tag sequencing (ChIA-PET) in the studied region. (A) Coordinates of chr12 in B37 and known genes by University of California Santa Cruz (UCSC) browser. (B) H3K4me2 ChIA-PET data obtained from CD4 T cells, data from Chepelev et al.37 (C) RNAPII ChIA-PET data obtained in K562 cells (data from Li et al36) In red, the number of the PET-counts (PETc) of the most frequent interactions is indicated. (D) RNAPII ChIA-PET data obtained in MCF7 cells.36 The most frequent interactions are indicated in red. (E) ENCODE information on transcription levels assayed by sequencing of polyadenylated RNA from K562 cells. (F) ENCODE information on the RNAPII binding sites by ChIP-seq. (G) H3K4Me3 histone mark in K562 cells associated with promoters. (H) H3K4Me1 modification of histone proteins in K562 cells, suggestive of enhancer and, to a lesser extent, other regulatory activity.

Datasets on Long-Range Chromatin Interaction with Paired-End-Tag sequencing (ChIA-PET) in the studied region. (A) Coordinates of chr12 in B37 and known genes by University of California Santa Cruz (UCSC) browser. (B) H3K4me2 ChIA-PET data obtained from CD4 T cells, data from Chepelev et al.37 (C) RNAPII ChIA-PET data obtained in K562 cells (data from Li et al36) In red, the number of the PET-counts (PETc) of the most frequent interactions is indicated. (D) RNAPII ChIA-PET data obtained in MCF7 cells.36 The most frequent interactions are indicated in red. (E) ENCODE information on transcription levels assayed by sequencing of polyadenylated RNA from K562 cells. (F) ENCODE information on the RNAPII binding sites by ChIP-seq. (G) H3K4Me3 histone mark in K562 cells associated with promoters. (H) H3K4Me1 modification of histone proteins in K562 cells, suggestive of enhancer and, to a lesser extent, other regulatory activity. Relying on published data from other groups on the concentration of precursors of vitamin D in serum, two independent groups reported a correlation between the major alleles of rs703842, tagged by rs6581155 in the present work, with high 25-hydroxyvitamin D (25(OH)D) concentrations in MS patients.39 40 On the other hand, Lange et al41 reported a correlation of the major allele of rs10877012, in total LD with rs6581155, and low 1,25(OH)2D serum levels. The major allele of the MS-associated SNPs, in total LD with the previously mentioned SNPs, correlated with low expression of the CYP27B1 gene. This enzyme catalyses the conversion of 25(OH)D to 1,25(OH)2D, thus a low expression of this enzyme, could reconcile the above seemingly conflicting data, via accumulation of 25(OH)D and a low 1,25(OH)2D product. These data support the CYP27B1 gene as a very plausible candidate gene for association of the region with MS. Vitamin D, besides having well-known control functions of calcium and phosphorus metabolism, bone formation and mineralisation, also has a complex role in the maintenance of immune homeostasis. The administration of vitamin D in animal models leads to improvement of immune-mediated symptoms. The correlation between the MS- associated variants and circulating levels of vitamin D supports the important role of vitamin D in susceptibility to MS. Thus, if this is the case in human MS, both the pharmacogenomics and the expression analysis of all the genes affecting the causing factor can reveal the kind of intervention necessary for neutralising a pathogenic situation.42 In this work we localise a group of variants in almost total LD that explain the association of the 12q13.3–12q14.1 region with MS. One of these variants (rs10877013) strongly affects the activity of one enhancer in the region, in an allele-dependent and orientation-dependent way, that could be the cause of the alteration of TSFM, TSPAN31, FAM119B, CYP27B1 and AVIL gene expression, due to the promoter multigene interactions observed in the region. Some of the tagged variants have been described to be associated with the vitamin D serum concentration. Our results point to one of these as the causal gene for MS association in the locus.
  40 in total

Review 1.  New approaches to investigating heterogeneity in complex traits.

Authors:  R Bomprezzi; P E Kovanen; R Martin
Journal:  J Med Genet       Date:  2003-08       Impact factor: 6.318

2.  Haploview: analysis and visualization of LD and haplotype maps.

Authors:  J C Barrett; B Fry; J Maller; M J Daly
Journal:  Bioinformatics       Date:  2004-08-05       Impact factor: 6.937

3.  A note on exact tests of Hardy-Weinberg equilibrium.

Authors:  Janis E Wigginton; David J Cutler; Goncalo R Abecasis
Journal:  Am J Hum Genet       Date:  2005-03-23       Impact factor: 11.025

4.  A genome-wide association study of global gene expression.

Authors:  Anna L Dixon; Liming Liang; Miriam F Moffatt; Wei Chen; Simon Heath; Kenny C C Wong; Jenny Taylor; Edward Burnett; Ivo Gut; Martin Farrall; G Mark Lathrop; Gonçalo R Abecasis; William O C Cookson
Journal:  Nat Genet       Date:  2007-09-16       Impact factor: 38.330

5.  The histone-like NF-Y is a bifunctional transcription factor.

Authors:  Michele Ceribelli; Diletta Dolfini; Daniele Merico; Raffaella Gatta; Alessandra M Viganò; Giulio Pavesi; Roberto Mantovani
Journal:  Mol Cell Biol       Date:  2008-01-22       Impact factor: 4.272

Review 6.  Multiple sclerosis genetics.

Authors:  J P McElroy; J R Oksenberg
Journal:  Curr Top Microbiol Immunol       Date:  2008       Impact factor: 4.291

Review 7.  The neurobiology of multiple sclerosis: genes, inflammation, and neurodegeneration.

Authors:  Stephen L Hauser; Jorge R Oksenberg
Journal:  Neuron       Date:  2006-10-05       Impact factor: 17.173

8.  New diagnostic criteria for multiple sclerosis: guidelines for research protocols.

Authors:  C M Poser; D W Paty; L Scheinberg; W I McDonald; F A Davis; G C Ebers; K P Johnson; W A Sibley; D H Silberberg; W W Tourtellotte
Journal:  Ann Neurol       Date:  1983-03       Impact factor: 10.422

9.  Population genomics of human gene expression.

Authors:  Barbara E Stranger; Alexandra C Nica; Matthew S Forrest; Antigone Dimas; Christine P Bird; Claude Beazley; Catherine E Ingle; Mark Dunning; Paul Flicek; Daphne Koller; Stephen Montgomery; Simon Tavaré; Panos Deloukas; Emmanouil T Dermitzakis
Journal:  Nat Genet       Date:  2007-09-16       Impact factor: 38.330

10.  Association of the vitamin D metabolism gene CYP27B1 with type 1 diabetes.

Authors:  Rebecca Bailey; Jason D Cooper; Lauren Zeitels; Deborah J Smyth; Jennie H M Yang; Neil M Walker; Elina Hyppönen; David B Dunger; Elizabeth Ramos-Lopez; Klaus Badenhoop; Sergey Nejentsev; John A Todd
Journal:  Diabetes       Date:  2007-07-02       Impact factor: 9.461

View more
  21 in total

Review 1.  Dissecting Complex and Multifactorial Nature of Alzheimer's Disease Pathogenesis: a Clinical, Genomic, and Systems Biology Perspective.

Authors:  Puneet Talwar; Juhi Sinha; Sandeep Grover; Chitra Rawat; Suman Kushwaha; Rachna Agarwal; Vibha Taneja; Ritushree Kukreti
Journal:  Mol Neurobiol       Date:  2015-09-09       Impact factor: 5.590

2.  A kidney-specific genetic control module in mice governs endocrine regulation of the cytochrome P450 gene Cyp27b1 essential for vitamin D3 activation.

Authors:  Mark B Meyer; Nancy A Benkusky; Martin Kaufmann; Seong Min Lee; Melda Onal; Glenville Jones; J Wesley Pike
Journal:  J Biol Chem       Date:  2017-08-14       Impact factor: 5.157

3.  Targeted genomic deletions identify diverse enhancer functions and generate a kidney-specific, endocrine-deficient Cyp27b1 pseudo-null mouse.

Authors:  Mark B Meyer; Nancy A Benkusky; Martin Kaufmann; Seong Min Lee; Robert R Redfield; Glenville Jones; J Wesley Pike
Journal:  J Biol Chem       Date:  2019-05-03       Impact factor: 5.157

Review 4.  Transcriptional regulation and its misregulation in disease.

Authors:  Tong Ihn Lee; Richard A Young
Journal:  Cell       Date:  2013-03-14       Impact factor: 41.582

5.  Genetic contribution of CYP1A1 variant on treatment outcome in epilepsy patients: a functional and interethnic perspective.

Authors:  P Talwar; N Kanojia; S Mahendru; R Baghel; S Grover; G Arora; G K Grewal; S Parween; A Srivastava; M Singh; S Vig; S Kushwaha; S Sharma; K Bala; S Kukreti; R Kukreti
Journal:  Pharmacogenomics J       Date:  2016-03-08       Impact factor: 3.550

Review 6.  In search of regulatory circuits that control the biological activity of vitamin D.

Authors:  Sylvia Christakos
Journal:  J Biol Chem       Date:  2017-10-20       Impact factor: 5.157

7.  KIF5A and the contribution of susceptibility genotypes as a predictive biomarker for multiple sclerosis.

Authors:  Kelly Hares; K Kemp; S Loveless; C M Rice; N Scolding; E Tallantyre; N Robertson; A Wilkins
Journal:  J Neurol       Date:  2021-01-23       Impact factor: 4.849

8.  Non-coding genomic regions possessing enhancer and silencer potential are associated with healthy aging and exceptional survival.

Authors:  Sangkyu Kim; David A Welsh; Leann Myers; Katie E Cherry; Jennifer Wyckoff; S Michal Jazwinski
Journal:  Oncotarget       Date:  2015-02-28

9.  Bi-allelic Variants in METTL5 Cause Autosomal-Recessive Intellectual Disability and Microcephaly.

Authors:  Elodie M Richard; Daniel L Polla; Muhammad Zaman Assir; Minerva Contreras; Mohsin Shahzad; Asma A Khan; Attia Razzaq; Javed Akram; Moazzam N Tarar; Thomas A Blanpied; Zubair M Ahmed; Rami Abou Jamra; Dagmar Wieczorek; Hans van Bokhoven; Sheikh Riazuddin; Saima Riazuddin
Journal:  Am J Hum Genet       Date:  2019-09-26       Impact factor: 11.025

10.  The splice site variant rs11078928 may be associated with a genotype-dependent alteration in expression of GSDMB transcripts.

Authors:  Faer S Morrison; Jonathan M Locke; Andrew R Wood; Marcus Tuke; Dorota Pasko; Anna Murray; Tim Frayling; Lorna W Harries
Journal:  BMC Genomics       Date:  2013-09-17       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.