Literature DB >> 26320842

Susceptibility Genes for Multiple Sclerosis Identified in a Gene-Based Genome-Wide Association Study.

Xiang Lin1,2,3, Fei Yan Deng1,2, Xin Lu1,2, Shu Feng Lei2,4.   

Abstract

BACKGROUND AND
PURPOSE: Multiple sclerosis (MS) is a demyelinating and inflammatory disease of the central nervous system. The aim of this study was to identify more genes associated with MS.
METHODS: Based on the publicly available data of the single-nucleotide polymorphism-based genome-wide association study (GWAS) from the database of Genotypes and Phenotypes, we conducted a powerful gene-based GWAS in an initial sample with 931 family trios, and a replication study sample with 978 cases and 883 controls. For interesting genes, gene expression in MS-related cells between MS cases and controls was examined by using publicly available datasets.
RESULTS: A total of 58 genes was identified, including 20 "novel" genes significantly associated with MS (p<1.40×10⁻⁴). In the replication study, 44 of the 58 identified genes had been genotyped and 35 replicated the association. In the gene-expression study, 21 of the 58 identified genes exhibited differential expressions in MS-related cells. Thus, 15 novel genes were supported by replicated association and/or differential expression. In particular, four of the novel genes, those encoding myelin oligodendrocyte glycoprotein (MOG), coiled-coil alpha-helical rod protein 1 (CCHCR1), human leukocyte antigen complex group 22 (HCG22), and major histocompatibility complex, class II, DM alpha (HLA-DMA), were supported by the evidence of both.
CONCLUSIONS: The results of this study emphasize the high power of gene-based GWAS in detecting the susceptibility genes of MS. The novel genes identified herein may provide new insights into the molecular genetic mechanisms underlying MS.

Entities:  

Keywords:  gene expression; gene-based GWAS; human leukocyte antigen; multiple sclerosis

Year:  2015        PMID: 26320842      PMCID: PMC4596110          DOI: 10.3988/jcn.2015.11.4.311

Source DB:  PubMed          Journal:  J Clin Neurol        ISSN: 1738-6586            Impact factor:   3.077


INTRODUCTION

Multiple sclerosis (MS) is a chronic demyelinating and inflammatory disease of the the central nervous system (CNS).12 Genetic factors contribute markedly to the susceptibility to MS, since the children of affected parents have a tenfold higher risk of developing the condition than the general population.3 It has been demonstrated that the human leukocyte antigen (HLA) gene is closely related to MS susceptibility. The HLA gene contains a great many genes (HLA class I, II, and III) residing on chromosome 6, which is related to immune system function in humans. The proteins encoded by the HLA class I and II regions are involved in antigen processing and presentation, and play a major role in autoimmune events. MS is believed to be an immune-mediated disorder that leads to recurrent immune attacks on the CNS.4 Recent genome-wide association studies (GWASs) have identified many loci with modest effects.567 However, previous GWASs used single-nucleotide polymorphisms (SNPs) as a basic analysis unit,5689 and adopted stringent thresholds of significance to control for the false-positive rate. This approach resulted in a large number of SNPs with potential effects being filtered out and ignored. The gene-based GWAS study strategy, involving analyzing all variants within a putative gene, has proved to be more powerful than regular single-SNP-based GWASs for detecting disease susceptibility genes.101112 To detect "novel" genes associated with MS, we performed a gene-based GWAS using the Knowledge-based mining system for Genome-wide Genetic studies (KGG; http://statgenpro.psychiatry.hku.hk/limx/kgg/index.html)10 in an initial study sample containing 931 family trios.8 We also performed other functional analyses to supplement the evidence regarding the relevance of the novel genes to MS.

METHODS

Samples

The initial GWAS sample included a total of 931 family trios, each of which consisted of an affected MS child and both parents. The replication sample contained 978 cases and 883 controls. The MS patient was diagnosed according to the McDonald criteria.13 Both the initial and replication study samples included all clinical subtypes and partial clinically isolated syndromes. However, the replication study had the priority to include patients with the relapsing onset form of MS. Institutional Review Board was exempted because this study used public available database of Genotypes and Phenotypes (dbGaP) which dose not involve any personal information. The study samples, genotyping, quality control, and SNP exclusion criteria have been described previously.89

Gene-based GWAS

The gene-based GWAS and replication analysis were based on the probability values generated in previous genome-wide SNP-based association studies and downloaded from the dbGaP (http://www.ncbi.nlm.nih.gov/gap/?term=multiple+sclerosis, accession number: phs000139 and phs000171). We used the Gene-Based Association Test Using Extended Simes Procedure analysis method modeled in KGG 2.5 (http://statgenpro.psychiatry.hku.hk/limx/kgg/index.html).10 The SNPs, ranging from the upstream 5 kb at the 5' end to the downstream 5 kb at the 3' end, were assigned to one gene. In total, 48% of SNPs across the whole genome were assigned to genes.

Differential expression analysis for MS-associated genes

We downloaded four gene-expression data sets from the Gene Expression Omnibus (GEO) of the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/geo; GSE21942, GSE27694, GSE16461, and GSE52139). A case-control study design was used to investigate all four data sets, and multiple cell types were investigated, including peripheral blood mononuclear cells (PBMCs), CD34+ hematopoietic progenitor cells (HPCs), CD8+ T lymphocytes, and spinal cord. The study design data analysis are described in detail elsewhere.14151617 Differentially expressed genes between MS cases and controls were identified by comparing mean gene-expression signals in MS cases versus controls and analyzing the findings using t-tests.

Protein-protein interaction network

MS-associated gene interactions and associations were detected by protein-protein interaction analysis, conducted by searching the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database (http://string-db.org/). The STRING database integrates known and predicted associations derived from a genomic context, high-throughput experiments, coexpression, and previous knowledge (text mining).18

Functional annotation clustering analysis

The probability of the identified MS-associated genes clustering in a Gene Ontology (GO) term or a particular biological pathway as defined by the GO project and Kyoto Encyclopedia of Genes and Genomes database was tested by performing a functional annotation clustering analysis using the Database for Annotation, Visualization and Integrated Discovery integrated database query tools (http://david.abcc.ncifcrf.gov/).1920 The enrichment was measured quantitatively using Fisher's exact test, and the Bonferroni method21 was adopted to correct for multiple testing.

RESULTS

Quantile-quantile plots of the association results for the original genome scan and the present gene-based GWAS are shown in Fig. 1, in which probability values (shown as -log10 values) for all 20,761 genes and all SNPs are plotted against the expected null distribution. The tail of the distribution of gene-based probability values deviated more significantly than those of SNPs inside or outside of the gene. The distribution of the probability values for the present gene-based GWAS is shown in Fig. 2.
Fig. 1

Quantile-quantile plots of the association results for the initial genome scan. The tail of the distribution of gene-based probability values deviated more significantly than those of SNPs inside or outside of the gene, which suggests that the power was higher for a gene-based association analysis than a single SNP-based GWAS in detecting associations. GWAS: genome-wide association study, SNP: single-nucleotide polymorphism.

Fig. 2

Manhattan plot of gene probability values on chromosomes. Most of the genes significantly associated with MS are mapped to the HLA region (chromosome 6). HLA: human leukocyte antigen, MS: multiple sclerosis.

Based on a false discovery rate threshold of 1.40×10-4, 58 genes were found to be significantly associated with MS in the initial gene-based GWAS (Supplementary Table 1 in the online-only Data Supplement). Among those 58 genes, 53 were located in the HLA region on chromosome 6. In contrast, according to the raw SNP-based probability values, 75 SNPs in 31 genes were found to be significantly associated with MS (threshold of p<1.49×10-7). Therefore, the present gene-based GWAS detected 27 novel genes that had not previously been detected in the original SNP-based GWAS. After searching the Phenotype-Genotype Integrator (www.ncbi.nlm.nih.gov/gap/phegeni/), a database that archives previous association results, we found that 7 of the 27 genes had already been reported for significant associations (threshold of p<5.0×10-5). Therefore, among the 58 MS-associated genes that we identified, 20 were considered as novel MS candidate genes. Of note, five of these novel genes were located outside the HLA region. Among the total 58 identified MS-associated genes, 44 genes with genotype data in the replication sample were subjected to further association tests, which revealed that 35 of them (80%) were still significantly associated with MS (p<0.05). Furthermore, 21 of these 58 genes, including 8 novel genes, exhibited differential expression in the differential expression analysis (Table 1). Most interestingly, the genes encoding histone-lysine N-methyltransferase, H3 lysine-9 specific 3 (EHMT2), major histocompatibility complex, Class I, A (HLA-C), and negative elongation factor E (RDBP) exhibited significantly differential expressions between cases and controls in both PBMCs and CD34+ HPCs, and the gene encoding myelin oligodendrocyte glycoprotein (MOG) in PBMCs, CD8+ T lymphocytes, and spinal cord, simultaneously (Table 1).
Table 1

MS-associated genes with significantly differential expressions in MS-related cells/tissue

SampleS1S2S3S4
Target cells/tissuePBMCCD34+ HPCCD8+ T lymphocytesSpinal cord
Sample size12:158:54:48:8
Platform[HG-U133_Plus_2] Affymetrix HumanAgilent-014850 Whole Human[HG-U133_Plus_2] Affymetrix Human[HG-U133_Plus_2] Affymetrix Human
Genome U133 Plus 2.0 ArrayGenome Microarray 4×44K G4112FGenome U133 Plus 2.0 ArrayGenome U133 Plus 2.0 Array
PMID22021740222524662121682924910450
GSE No.GSE21942GSE27694GSE16461GSE52139

The ratio listed in the line of "Sample size" is the number of multiple sclerosis (MS) cases compared with that of controls. The genes listed here that are not defined in the main text are major histocompatibility complex, class II, DO beta (HLA-DOB), major histocompatibility complex, class II, DR alha (HLA-DRA), POU class 5 homeobox 1 (POU5F1), uncharacterized LOC100294145 (LOC100294145), tenascin XB (TNXB), psoriasis susceptibility 1 candidate 1 (PSORS1C1), major histocompatibility complex, class II, DM beta (HLA-DMB), transporter 2, ATP-binding cassette, sub-family B (TAP2), allograft inflammatory factor 1 (AIF1), major histocompatibility complex, class I, F (HLA-F), major histocompatibility complex, class II, DQ alpha 1 (HLA-DQA1), respectively.

*Only the most significant probe is listed, even if more than one probe was detected for a gene.

GSE No.: Gene Expression Omnibus number, www.ncbi.nlm.nih.gov/geo/, PMID: PubMed unique identifier.

The 58 identified MS-associated genes were retrieved from the STRING database. Only 36 genes, including 10 novel genes, were annotated in this database. The genes at the HLA regions were clearly enriched into two clusters: HLA class I and II clusters (Fig. 3). Two novel genes, those encoding major histocompatibility complex, class, I, A (HLA-A) and major histocompatibility complex, class II, DM alpha (HLA-DMA), were involved in HLA-class I and II clusters, respectively. Another four novel genes, MOG, and those encoding coiled-coil alpha-helical rod protein 1 (CCHCR1), bromodomain-containing protein 2 (BRD2), and chromosome 6 open reading frame 15 (C6orf15), were directly connected with the HLA clusters (Fig. 3).
Fig. 3

Protein-protein interactions between MS-associated genes. Only the connected genes are shown. The novel genes are labeled in red. Different line colors represent the types of evidence for the association. HLA class I cluster: major histocompatibility complex, class I, A (HLA-A), Major histocompatibility complex, class I, C (HLA-C), and major histocompatibility complex, class I, F (HLA-F); HLA class II cluster: major histocompatibility complex, class II, DQ alpha 1 (HLA-DQA1), major histocompatibility complex, class II, DQ alpha 2 (HLA-DQA2), major histocompatibility complex, class II, DQ beta 2 (HLA-DQB2), major histocompatibility complex, class II, DR alha (HLA-DRA), major histocompatibility complex, class II, DO beta (HLA-DOB), major histocompatibility complex, class II, DM alpha (HLA-DMA), and major histocompatibility complex, class II, DM beta (HLA-DMB). HLA: human leukocyte antigen, MS: Multiple sclerosis.

Taking both the replication data and gene-expression data together, we found that 15 novel genes were supported by replicated association and/or differential expression (Table 2). Most interestingly, the significance of four genes—HLA-DMA, CCHCR1, MOG, and the gene encoding HLA complex group 22 (HCG22)—in MS was supported by both the replication and gene-expression studies. In addition, the significance of two non-HLA genes—those encoding family with sequence similarity 69, member A (FAM69A) and POU domain class 2 transcription factor 3 (POU2F3)—in MS was supported by evidence of differential expressions in PBMCs and CD8+ T lymphocytes, respectively.
Table 2

Fifteen novel MS-associated genes, supported by replication studies and/or expression studies

GeneChromosomeStart positionLength#SNP*Initial pReplication pDifferential expression p, sample
MOG6296197582539121.21×10-55.09×10-42.12×10-2S1
3.65×10-2S3
2.92×10-2S4
HLA-DMA6329113911450823.15×10-74.08×10-28.62×10-4S1
CCHCR16311052162579924.03×10-51.08×10-31.05×10-2S1
HCG226310169841566961.21×10-42.29×10-23.85×10-2S1
BRD26329314372284511.07×10-5NS5.65×10-6S1
FAM69A193302721129358114.47×10-5NA1.63×10-2S1
LOC1002941456328569531958232.27×10-5NA4.21×10-2S2
POU2F3111200000008970231.13×10-4NS8.41×10-3S3
TRIM26P6302010781397823.72×10-57.80×10-6NANA
HCP5P146297333241233911.33×10-49.94×10-5NANA
3.8-1.4 (HLA complex group 26)6298286921117271.58×10-51.08×10-3NANA
C6orf156310740001133239.26×10-52.22×10-3NSNS
MUC216309464851619044.40×10-53.42×10-3NANA
HLA-A6299053091335241.61×10-58.38×10-3NSNS
HLA-F-AS16296893783244871.83×10-62.52×10-2NANA

The genes listed here that are not defined in the main text are tripartite motif containing 26 B (TRIM26P), HLA complex P5 pseudogene 14 (HCP5P14), HLA complex group 26 pseudogene (3.8-1.4), mucin 21, cell surface associated (MUC21), HLA-F antisense RNA 1 (HLA-F-AS1), respectively.

*Number of SNPs included in the gene with p<0.05, †Multiple sclerosis (MS)-related cells/tissue sample used for the differential expression analysis; S1, PBMC; S2, CD34+ HPC; S3, CD8+ T lymphocytes; S4, spinal cord.

NA: not available, NS: not significant.

The identified 58 MS-associated genes were found enriched in 54 GO terms, even after Bonferroni correction (p<0.05) (Supplementary Table 2 in the online-only Data Supplement). Most significantly, 12 genes were enriched in "antigen processing and presentation" (GO: 0019882; p=271.09×10-16), and 11 genes were enriched in "MHC protein complex" (GO: 0012611; p=1.06×10-16).

DISCUSSION

Gene-based association analysis is an efficient method for detecting associations between candidate genes and complex diseases, as it combines signals for all variants within a putative gene. By using this method, several studies have identified new associations between genes and diseases.2223 The present study again highlights the superior power of gene-based association analysis for detecting associations for MS. Specifically, we detected 58 genes significantly associated with MS, including 20 novel genes that were undetected in previous single SNP-based GWASs. Furthermore, associations for 80% of the identified MS-associated genes were replicated. Most previous association studies have identified only the statistical relevance of genes to MS (at the DNA level), without dissecting the functional mechanisms underlying those associations. In contrast, in the present study we not only established statistical associations between genetic markers and MS at the DNA level, but also performed follow-up differential gene expression analyses and functional annotation clustering analyses as important supplementary methods to analyze the function of causal variants. This supplementary evidence strengthens the likelihood that the eight novel genes identified in this study with significantly differential expressions are directly involved in the pathogenesis of MS. Recent GWASs have identified a great number of MS-associated genetic loci,59242526 most of which have been mapped to the HLA region. Most of the loci identified in the present study were also located in this region. As we know, the major HLAs are essential elements for immune function. Therefore, our findings also highlight the importance of the autoimmune system in the etiology of MS. Most interestingly, evidence from association, replication, and differential expression studies strongly supports the significance of the following four genes to MS: MOG, HLA-DMA, CCHCR1, and HCG22. Up to now, the biological function of HCG22 in MS or immunity is unknown. HLA-DMA and CCHCR1 exhibit suggestive associations with rheumatoid arthritis (RA) and psoriasis,272829 respectively, suggesting that both genes are involved in the immune response. The protein encoded by MOG, myelin oligodendrocyte glycoprotein (MOG) is a membrane protein expressed on the surface of oligodendrocyte cells and myelin sheaths.30 MOG is an important candidate target antigen in MS.313233 Monoclonal antibodies against MOG were used to develop an animal model of MS.34 The functions of these genes in MS need further investigation. In addition, three genes (POU2F3, BRD2, and FAM69A) were expressed differentially in MS-related cells. POU2F3, a non-HLA gene (chromosome 11), is associated with melanoma and cervical cancer.3536 BRD2 is associated with cancer, obesity, type 2 diabetes, RA, and inflammation,373839 and was differentially expressed in the PBMCs of patients with RA (p=2.31×10-3, data sets from GEO; www.ncbi.nlm.nih.gov/geo, GSE#: GSE15573). The associations with other autoimmune diseases suggested the possible relevance of BRD2 and POU2F3 to MS. The specific functions of FAM69A in MS are unknown. A previous study found that 21 SNPs located at the GFI-EVI5-RPL5-FAM69A locus were positively associated with MS.40 In-depth studies are needed to disclose the functional mechanism of these genes in MS. It should be noted that since there is strong linkage disequilibrium (LD) at the HLA region, it is reasonable to infer that the significant signals for some of the genetic markers are partially due to their strong LD with true functional variants within the HLA region. Although the supplementary analysis could play a part in analyzing the function of causal variants, it is also necessary to research the functional mechanisms underlying the associations further, especially those between the HLAs and MS. In conclusion, this gene-based GWAS has identified 20 novel MS-associated genes. The results highlight the advantages of gene-based association analysis over single SNP-based GWASs for detecting susceptibility genes for MS. The new findings may provide novel insights into the molecular mechanisms underlying MS.
Supplementary Table 1

Results of gene-based and differential expression analyses of 58 MS-associated genes

GeneChromosomeStart positionLength#SNP*Initial pGenes significant in initial study based on SNP-based pGenes in PheGenIReplication pDifferential expression pSample
C6orf1063225547589181213.97×10-39C6orf10C6orf103.16×10-3NSNS
NOTCH46321576203922472.51×10-31NOTCH4NOTCH4NSNSNS
TNXB6320039327822126.61×10-28TNXBTNXB3.37×10-34.45×10-2S2
HLA-DRA6324026191520493.73×10-24HLA-DRAHLA-DRA4.08×10-35.82×10-5S1
AIF16315779941180413.15×10-23AIF1AIF1NA2.04×10-2S1
SNORA386315858561013113.15×10-23SNORA38NANANANA
BAT26315834502710431.18×10-22BAT2BAT23.42×10-2NSNS
BAT36316018052367234.41×10-19BAT3BAT39.52×10-3NSNS
PPIAP96314816541152511.46×10-15PPIAP9PPIAP94.67×10-2NANA
HLA-DQB26327188751745569.02×10-15HLA-DQB2HLA-DQB21.01×10-2NSNS
HLA-DQA26327041631550161.25×10-14HLA-DQA2HLA-DQA23.93×10-3NSNS
EHMT26318425362792818.91×10-14EHMT2EHMT2NS4.30×10-3S1
4.50×10-2S2
LOC1002872726312383521317951.79×10-13LOC100287272NA4.76×10-3NANA
MCCD16314917391126924.39×10-13MCCD1MCCD1NANSNS
LOC1004628126313404941031154.71×10-13LOC100462812NA8.07×10-5NANA
HCG276311605371620836.34×10-13HCG27HCG273.84×10-2NSNS
HLA-DRB96324225971026969.42×10-13HLA-DRB9HLA-DRB91.60×10-3NANA
HLA-S6313443461091864.89×10-12HLA-SHLA-S3.36×10-2NANA
HLA-DQA16326001831624611.68×10-11HLA-DQA1HLA-DQA15.96×10-43.51×10-2S1
TAP26327846102693728.02×10-11TAP2TAP2NS3.35×10-3S1
BTNL26323575132238771.43×10-10BTNL2BTNL28.76×10-5NSNS
HLA-DOB6327755401428541.78×10-10HLA-DOBHLA-DOBNA1.63×10-5S1
RDBP6319148641700024.31×10-10RDBPRDBP4.16×10-22.01×10-2S1
3.70×10-2S2
PSORS1C163107760835261108.59×10-10PSORS1C1PSORS1C1NS4.85×10-2S2
HLA-DMB6328974061644112.63×10-9HLA-DMBHLA-DMBNS4.45×10-2S3
DHX166306158962993417.13×10-9DHX16DHX162.73×10-3NSNS
POU5F16311271141633749.96×10-9POU5F1POU5F14.68×10-38.28×10-3S2
TCF196311213031568922.02×10-7TCF19TCF191.78×10-2NSNS
3.8-1.5 (HLA complex group 26)6297278931118042.40×10-73.8-1.5NA1.27×10-3NANA
HLA-F6296861171395674.67×10-7HLA-FNA1.17×10-22.64×10-2S1
LOC1005074366313625617602584.98×10-7LOC100507436NANSNANA
HLA-X6314246231064411.55×10-6NAHLA-XNANANA
HCG4P46299179821042818.60×10-6NAHCG4P46.98×10-4NANA
RPL3P26312431081124021.66×10-5NARPL3P2NANANA
HLA-C6312315291332621.72×10-5NAHLA-C4.29×10-26.15×10-3S1
2.74×10-2S2
DHFRP26313262441349822.89×10-5NADHFRP2NANANA
MICE6297042341764614.63×10-5NAMICENANANA
TRIM316300656742019311.18×10-4NATRIM311.31×10-4NSNS
MOG6296197582539121.21×10-5NANA5.09×10-42.12×10-2S1
3.65×10-2S3
2.92×10-2S4
HLA-DMA6329113911450823.15×10-7NANA4.08×10-28.62×10-4S1
CCHCR16311052162579924.03×10-5NANA1.08×10-31.05×10-2S1
HCG226310169841566961.21×10-4NANA2.29×10-23.85×10-2S1
BRD26329314372284511.07×10-5NANANS5.65×10-6S1
FAM69A193302721129358114.47×10-5NANANA1.63×10-2S1
LOC1002941456328569531958232.27×10-5NANANA4.21×10-2S2
POU2F3111200000008970231.13×10-4NANANS8.41×10-3S3
TRIM26P6302010781397823.72×10-5NANA7.80×10-6NANA
HCP5P146297333241233911.33×10-4NANA9.94×10-5NANA
3.8-1.4(HLA complex group 26)6298286921117271.58×10-5NANA1.08×10-3NANA
C6orf156310740001133239.26×10-5NANA2.22×10-3NSNS
MUC216309464851619044.40×10-5NANA3.42×10-3NANA
HLA-A6299053091335241.61×10-5NANA8.38×10-3NSNS
HLA-F-AS16296893783244871.83×10-6NANA2.52×10-2NANA
MXD351770000001679127.60×10-7NANANSNSNS
LOC10028724722--12.71×10-5NANANANANA
LOC1001279341933921361050916.18×10-5NANANANANA
RPL15P46314908531064512.08×10-7NANANANANA
HCG2P86297678961354313.96×10-5NANANANANA

PheGenI, Phenotype-Genotype Integrator.

*The number of SNP included in a gene with p-value<0.05, †Ms-related cells sample used to differential expression ananlysis. S1: PBMC, S2: CD34+ HPC, S3: CD8+ T lymphocytes, S4, spinal cord.

The genes listed here that are not defined in the main text are chromosome 6 open reading frame 10 (C6orf10), notch 4 (NOTCH4), small nucleolar RNA, H/ACA box 38 (SNORA38), proline-rich coiled-coil 2A (BAT2), BCL2-associated athanogene 6 (BAT3), peptidylprolyl isomerase A pseudogene 9 (PPIAP9), major histocompatibility complex, class II, DQ beta 2 (HLA-DQB2), major histocompatibility complex, class II, DQ alpha 2 (HLA-DQA2), ubiquitin specific peptidase 8 pseudogene 1 (LOC100287272), mitochondrial coiled-coil domain 1 (MCCD1), fibroblast growth factor receptor 3 pseudogene 1 (LOC100462812), HLA complex group 27 (HCG27), major histocompatibility complex, class II, DR beta 9 (HLA-DRB9), major histocompatibility complex, class I, S (HLA-S), butyrophilin-like 2 (BTNL2), DEAH (Asp-Glu-Ala-His) box polypeptide 16 (DHX16) transcription factor 19 (TCF19), HLA complex group 26 pseudogene (3.8-1.5), MHC class I polypeptide-related sequence A (LOC100507436), major histocompatibility complex, class I, x (HLA-X), HLA complex group 4 pseudogene 4 (HCG4P4), ribosomal protein L3 pseudogene 2 (RPL3P2), dihydrofolate reductase pseudogene 2 (DHFRP2), MHC class I polypeptide-related sequence E (MICE), ripartite motif containing 31 (TRIM31), MAX dimerization protein 3 (MXD3), similar to hCG1987428 (LOC100287247), meiotic nuclear divisions 1 homolog (LOC100127934), ribosomal protein L15 pseudogene 4 (RPL15P4), HLA complex group 2 pseudogene 8 (HCG2P8) respectively.

Supplementary Table 2

Enrichment of GO terms and KEGG pathways of the 58 MS-associated genes

CategoryTermCount%GenesList totalRelative enrichmentBonferroni*
GOTERM_BP_FATGO:0019882~antigen processing and presentation1221.433112, 285830, 3117, 3118, 6891, 3107, 3109, 3108, 3134, 3105, 3132, 31222578.231.09×10-16
GOTERM_CC_FATGO:0042611~MHC protein complex1119.643112, 285830, 3117, 3118, 3107, 3109, 3108, 3134, 3105, 3132, 312224102.781.06×10-16
SP_PIR_KEYWORDSmhc ii916.073112, 3117, 3118, 3107, 3109, 3108, 3105, 3132, 312234164.251.84×10-14
UP_SEQ_FEATURERegion of interest:Connecting peptide916.073112, 3117, 3118, 3107, 3109, 3108, 3134, 3105, 312233137.173.39×10-13
UP_SEQ_FEATUREDomain:Ig-like C1-type916.073112, 3117, 3118, 3107, 3109, 3108, 3134, 3105, 312233130.325.42×10-13
KEGG_PATHWAYhsa05330:Allograft rejection916.073112, 3117, 3118, 3107, 3109, 3108, 3134, 3105, 31221490.807.23×10-14
KEGG_PATHWAYhsa05332:Graft-versus-host disease916.073112, 3117, 3118, 3107, 3109, 3108, 3134, 3105, 31221483.821.47×10-13
KEGG_PATHWAYhsa04940:Type I diabetes mellitus916.073112, 3117, 3118, 3107, 3109, 3108, 3134, 3105, 31221477.832.80×10-13
KEGG_PATHWAYhsa04612:Antigen processing and presentation1017.863112, 3117, 3118, 6891, 3107, 3109, 3108, 3134, 3105, 31221443.767.53×10-13
KEGG_PATHWAYhsa05320:Autoimmune thyroid disease916.073112, 3117, 3118, 3107, 3109, 3108, 3134, 3105, 31221464.101.50×10-12
INTERPROIPR003597:Immunoglobulin C1-set916.073112, 3117, 3118, 3107, 3109, 3108, 3134, 3105, 31222976.031.16×10-11
UP_SEQ_FEATURERegion of interest:Alpha-1712.503117, 3118, 3107, 3108, 3134, 3105, 312233202.711.56×10-10
UP_SEQ_FEATURERegion of interest: Alpha-2712.503117, 3118, 3107, 3108, 3134, 3105, 312233202.711.56×10-10
INTERPROIPR014745:MHC class II, alpha/beta chain, N-terminal712.503112, 3117, 3118, 3109, 3108, 3132, 312229182.787.35×10-11
INTERPROIPR003006:Immunoglobulin/major histocompatibility complex, conserved site916.073112, 3117, 3118, 3107, 3109, 3108, 3134, 3105, 31222959.438.98×10-11
KEGG_PATHWAYhsa05416:Viral myocarditis916.073112, 3117, 3118, 3107, 3109, 3108, 3134, 3105, 31221446.042.47×10-11
SMARTSM00407:IGc1916.073112, 3117, 3118, 3107, 3109, 3108, 3134, 3105, 31222450.077.27×10-11
GOTERM_CC_FATGO:0042613~MHC class II protein complex712.503112, 3117, 3118, 3109, 3108, 3132, 312224128.555.56×10-10
GOTERM_BP_FATGO:0002504~antigen processing and presentation of peptide or polysaccharide antigen via MHC class II712.503112, 3117, 3118, 3109, 3108, 3132, 312225114.783.42×10-9
GOTERM_MF_FATGO:0032395~MHC class II receptor activity610.713112, 3117, 3118, 3107, 3108, 312223178.267.85×10-9
SP_PIR_KEYWORDSImmune response1017.863112, 3117, 3118, 6891, 3107, 3109, 3108, 3134, 3105, 31223425.268.42×10-9
KEGG_PATHWAYhsa04514:Cell adhesion molecules (CAMs)916.073112, 3117, 3118, 3107, 3109, 3108, 3134, 3105, 31221424.764.04×10-9
GOTERM_CC_FATGO:0042825~TAP complex58.933112, 6891, 3109, 3108, 312224380.422.00×10-8
GOTERM_CC_FATGO:0042824~MHC class I peptide loading complex58.933112, 6891, 3109, 3108, 312224295.887.18×10-8
GOTERM_BP_FATGO:0048002~antigen processing and presentation of peptide antigen610.716891, 3107, 3108, 3134, 3105, 312225115.952.17×10-7
PIR_SUPERFAMILYPIRSF001991:Class II histocompatibility antigen610.713112, 3117, 3118, 3109, 3108, 31221894.822.79×10-8
INTERPROIPR007110:Immunoglobulin-like1119.643112, 3117, 3118, 4340, 3107, 3109, 3108, 3134, 56244, 3105, 31222912.613.53×10-7
KEGG_PATHWAYhsa05310:Asthma610.713112, 3117, 3118, 3109, 3108, 31221475.151.10×10-7
GOTERM_BP_FATGO:0006955~immune response1221.433112, 285830, 3117, 3118, 6891, 3107, 3109, 3108, 3134, 3105, 3132, 3122259.411.53×10-6
GOTERM_MF_FATGO:0042288~MHC class I protein binding58.933112, 6891, 3109, 3108, 312223176.408.88×10-7
INTERPROIPR013783:Immunoglobulin-like fold1119.643112, 3117, 3118, 4340, 3107, 3109, 3108, 3134, 56244, 3105, 31222911.439.06×10-7
GOTERM_MF_FATGO:0042287~MHC protein binding58.933112, 6891, 3109, 3108, 312223112.906.11×10-6
KEGG_PATHWAYhsa04672:Intestinal immune network for IgA production610.713112, 3117, 3118, 3109, 3108, 31221444.481.72×10-6
SP_PIR_KEYWORDSHeterodimer610.713117, 3118, 3107, 3134, 3105, 31223432.966.98×10-5
SP_PIR_KEYWORDSTransmembrane protein1017.863112, 3117, 3118, 4340, 6891, 3107, 3108, 3134, 3105, 3122348.817.59×10-5
INTERPROIPR001003:MHC class II, alpha chain, N-terminal47.143117, 3118, 3108, 312229191.487.32×10-5
KEGG_PATHWAYhsa05322:Systemic lupus erythematosus610.713112, 3117, 3118, 3109, 3108, 31221422.016.04×10-5
GOTERM_BP_FATGO:0002474~antigen processing and presentation of peptide antigen via MHC class I47.146891, 3107, 3134, 310525127.326.60×10-4
GOTERM_CC_FATGO:0044459~plasma membrane part1526.794855, 3112, 3117, 3118, 6891, 394263, 285830, 199, 3107, 3109, 3108, 3134, 3105, 3132, 3122243.632.43×10-4
INTERPROIPR001039:MHC class I, alpha chain, alpha1, and alpha247.14285830, 3107, 3134, 31052985.109.56×10-4
INTERPROIPR011161:MHC class I-like antigen recognition47.14285830, 3107, 3134, 31052982.061.07×10-3
GOTERM_CC_FATGO:0042612~MHC class I protein complex47.14285830, 3107, 3134, 31052476.081.17×10-3
BIOCARTAh_mhcPathway:Antigen processing and presentation35.366891, 3105, 31223143.705.67×10-4
UP_SEQ_FEATURESequence variant3155.36170679, 3112, 4855, 11074, 54535, 6891, 56244, 7148, 10919, 6941, 199, 6046, 83463, 5460, 3107, 3109, 3108, 3134, 3105, 401250, 3117, 3118, 4340, 29113, 8449, 10665, 7916, 394263, 25833, 3122, 7917331.501.94×10-2
SP_PIR_KEYWORDSPolymorphism3155.36170679, 3112, 4855, 11074, 54535, 6891, 56244, 7148, 10919, 6941, 199, 6046, 83463, 5460, 3107, 3109, 3108, 3134, 3105, 401250, 3117, 3118, 4340, 29113, 8449, 10665, 7916, 394263, 25833, 3122, 7917341.527.47×10-3
UP_SEQ_FEATURERegion of interest:Alpha-335.363107, 3134, 310533193.062.92×10-2
SP_PIR_KEYWORDSmhc i35.363107, 3134, 310534169.721.05×10-2
GOTERM_BP_FATGO:0002478~antigen processing and presentation of exogenous peptide antigen35.366891, 3108, 312225147.583.25×10-2
PIR_SUPERFAMILYPIRSF001990:Class I histocompatibility antigen35.363107, 3134, 310518136.962.30×10-3
GOTERM_BP_FATGO:0019884~antigen processing and presentation of exogenous antigen35.366891, 3108, 312225115.955.30×10-2
GOTERM_MF_FATGO:0042277~peptide binding58.933112, 6891, 3109, 3108, 31222313.902.69×10-2
INTERPROIPR000353:MHC class II, beta chain, N-terminal35.363112, 3109, 313229101.372.84×10-2
GOTERM_MF_FATGO:0032393~MHC class I receptor activity35.363107, 3134, 31052399.612.89×10-2
GOTERM_CC_FATGO:0005886~plasma membrane1628.574855, 3112, 3117, 3118, 4340, 6891, 394263, 285830, 199, 3107, 3109, 3108, 3134, 3105, 3132, 3122242.262.98×10-2

Count, the number of genes enriched in particular GO terms or KEGG pathways; Genes, potential genes enriched in particular GO terms or KEGG pathways.

*p value with Bonferroni correction for multiple tests. Only significant results with p<0.05 are listed.

GO: Gene Ontology, KEGG: Kyoto Encyclopedia of Genes and Genomes, MHC: major histocompatibility complex, MS: mltiple sclerosis.

  39 in total

1.  The future of association studies: gene-based analysis and replication.

Authors:  Benjamin M Neale; Pak C Sham
Journal:  Am J Hum Genet       Date:  2004-07-22       Impact factor: 11.025

2.  Risk alleles for multiple sclerosis in multiplex families.

Authors:  M J D'Netto; H Ward; K M Morrison; S V Ramagopalan; D A Dyment; G C DeLuca; L Handunnetthi; A D Sadovnick; G C Ebers
Journal:  Neurology       Date:  2009-06-09       Impact factor: 9.910

3.  A common haplotype of interferon regulatory factor 5 (IRF5) regulates splicing and expression and is associated with increased risk of systemic lupus erythematosus.

Authors:  Robert R Graham; Sergey V Kozyrev; Emily C Baechler; M V Prasad Linga Reddy; Robert M Plenge; Jason W Bauer; Ward A Ortmann; Thearith Koeuth; Ma Francisca González Escribano; Bernardo Pons-Estel; Michelle Petri; Mark Daly; Peter K Gregersen; Javier Martín; David Altshuler; Timothy W Behrens; Marta E Alarcón-Riquelme
Journal:  Nat Genet       Date:  2006-04-16       Impact factor: 38.330

4.  Increased frequencies of myelin oligodendrocyte glycoprotein/MHC class II-binding CD4 cells in patients with multiple sclerosis.

Authors:  Khadir Raddassi; Sally C Kent; Junbao Yang; Kasia Bourcier; Elizabeth M Bradshaw; Vicki Seyfert-Margolis; Gerald T Nepom; William W Kwok; David A Hafler
Journal:  J Immunol       Date:  2011-06-08       Impact factor: 5.422

Review 5.  The bromodomain: a chromatin browser?

Authors:  P Filetici; O P; P Ballario
Journal:  Front Biosci       Date:  2001-08-01

6.  Pathway analysis of genome-wide association studies on rheumatoid arthritis.

Authors:  Gwan Gyu Song; Sang-Cheol Bae; Young Ho Lee
Journal:  Clin Exp Rheumatol       Date:  2013-04-09       Impact factor: 4.473

7.  Tag-SNP analysis of the GFI1-EVI5-RPL5-FAM69 risk locus for multiple sclerosis.

Authors:  Antonio Alcina; Oscar Fernández; Juan Ramón Gonzalez; Antonio Catalá-Rabasa; María Fedetz; Dorothy Ndagire; Laura Leyva; Miguel Guerrero; Carmen Arnal; Concepción Delgado; Miguel Lucas; Guillermo Izquierdo; Fuencisla Matesanz
Journal:  Eur J Hum Genet       Date:  2010-01-20       Impact factor: 4.246

8.  Multiple sclerosis.

Authors:  Alastair Compston; Alasdair Coles
Journal:  Lancet       Date:  2008-10-25       Impact factor: 79.321

9.  Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis.

Authors:  Stephen Sawcer; Garrett Hellenthal; Matti Pirinen; Chris C A Spencer; Nikolaos A Patsopoulos; Loukas Moutsianas; Alexander Dilthey; Zhan Su; Colin Freeman; Sarah E Hunt; Sarah Edkins; Emma Gray; David R Booth; Simon C Potter; An Goris; Gavin Band; Annette Bang Oturai; Amy Strange; Janna Saarela; Céline Bellenguez; Bertrand Fontaine; Matthew Gillman; Bernhard Hemmer; Rhian Gwilliam; Frauke Zipp; Alagurevathi Jayakumar; Roland Martin; Stephen Leslie; Stanley Hawkins; Eleni Giannoulatou; Sandra D'alfonso; Hannah Blackburn; Filippo Martinelli Boneschi; Jennifer Liddle; Hanne F Harbo; Marc L Perez; Anne Spurkland; Matthew J Waller; Marcin P Mycko; Michelle Ricketts; Manuel Comabella; Naomi Hammond; Ingrid Kockum; Owen T McCann; Maria Ban; Pamela Whittaker; Anu Kemppinen; Paul Weston; Clive Hawkins; Sara Widaa; John Zajicek; Serge Dronov; Neil Robertson; Suzannah J Bumpstead; Lisa F Barcellos; Rathi Ravindrarajah; Roby Abraham; Lars Alfredsson; Kristin Ardlie; Cristin Aubin; Amie Baker; Katharine Baker; Sergio E Baranzini; Laura Bergamaschi; Roberto Bergamaschi; Allan Bernstein; Achim Berthele; Mike Boggild; Jonathan P Bradfield; David Brassat; Simon A Broadley; Dorothea Buck; Helmut Butzkueven; Ruggero Capra; William M Carroll; Paola Cavalla; Elisabeth G Celius; Sabine Cepok; Rosetta Chiavacci; Françoise Clerget-Darpoux; Katleen Clysters; Giancarlo Comi; Mark Cossburn; Isabelle Cournu-Rebeix; Mathew B Cox; Wendy Cozen; Bruce A C Cree; Anne H Cross; Daniele Cusi; Mark J Daly; Emma Davis; Paul I W de Bakker; Marc Debouverie; Marie Beatrice D'hooghe; Katherine Dixon; Rita Dobosi; Bénédicte Dubois; David Ellinghaus; Irina Elovaara; Federica Esposito; Claire Fontenille; Simon Foote; Andre Franke; Daniela Galimberti; Angelo Ghezzi; Joseph Glessner; Refujia Gomez; Olivier Gout; Colin Graham; Struan F A Grant; Franca Rosa Guerini; Hakon Hakonarson; Per Hall; Anders Hamsten; Hans-Peter Hartung; Rob N Heard; Simon Heath; Jeremy Hobart; Muna Hoshi; Carmen Infante-Duarte; Gillian Ingram; Wendy Ingram; Talat Islam; Maja Jagodic; Michael Kabesch; Allan G Kermode; Trevor J Kilpatrick; Cecilia Kim; Norman Klopp; Keijo Koivisto; Malin Larsson; Mark Lathrop; Jeannette S Lechner-Scott; Maurizio A Leone; Virpi Leppä; Ulrika Liljedahl; Izaura Lima Bomfim; Robin R Lincoln; Jenny Link; Jianjun Liu; Aslaug R Lorentzen; Sara Lupoli; Fabio Macciardi; Thomas Mack; Mark Marriott; Vittorio Martinelli; Deborah Mason; Jacob L McCauley; Frank Mentch; Inger-Lise Mero; Tania Mihalova; Xavier Montalban; John Mottershead; Kjell-Morten Myhr; Paola Naldi; William Ollier; Alison Page; Aarno Palotie; Jean Pelletier; Laura Piccio; Trevor Pickersgill; Fredrik Piehl; Susan Pobywajlo; Hong L Quach; Patricia P Ramsay; Mauri Reunanen; Richard Reynolds; John D Rioux; Mariaemma Rodegher; Sabine Roesner; Justin P Rubio; Ina-Maria Rückert; Marco Salvetti; Erika Salvi; Adam Santaniello; Catherine A Schaefer; Stefan Schreiber; Christian Schulze; Rodney J Scott; Finn Sellebjerg; Krzysztof W Selmaj; David Sexton; Ling Shen; Brigid Simms-Acuna; Sheila Skidmore; Patrick M A Sleiman; Cathrine Smestad; Per Soelberg Sørensen; Helle Bach Søndergaard; Jim Stankovich; Richard C Strange; Anna-Maija Sulonen; Emilie Sundqvist; Ann-Christine Syvänen; Francesca Taddeo; Bruce Taylor; Jenefer M Blackwell; Pentti Tienari; Elvira Bramon; Ayman Tourbah; Matthew A Brown; Ewa Tronczynska; Juan P Casas; Niall Tubridy; Aiden Corvin; Jane Vickery; Janusz Jankowski; Pablo Villoslada; Hugh S Markus; Kai Wang; Christopher G Mathew; James Wason; Colin N A Palmer; H-Erich Wichmann; Robert Plomin; Ernest Willoughby; Anna Rautanen; Juliane Winkelmann; Michael Wittig; Richard C Trembath; Jacqueline Yaouanq; Ananth C Viswanathan; Haitao Zhang; Nicholas W Wood; Rebecca Zuvich; Panos Deloukas; Cordelia Langford; Audrey Duncanson; Jorge R Oksenberg; Margaret A Pericak-Vance; Jonathan L Haines; Tomas Olsson; Jan Hillert; Adrian J Ivinson; Philip L De Jager; Leena Peltonen; Graeme J Stewart; David A Hafler; Stephen L Hauser; Gil McVean; Peter Donnelly; Alastair Compston
Journal:  Nature       Date:  2011-08-10       Impact factor: 49.962

10.  A new gene-based association test for genome-wide association studies.

Authors:  Alfonso Buil; Angel Martinez-Perez; Alexandre Perera-Lluna; Leonor Rib; Pere Caminal; Jose Manuel Soria
Journal:  BMC Proc       Date:  2009-12-15
View more
  4 in total

1.  Multi-strategy genome-wide association studies identify the DCAF16-NCAPG region as a susceptibility locus for average daily gain in cattle.

Authors:  Wengang Zhang; Junya Li; Yong Guo; Lupei Zhang; Lingyang Xu; Xue Gao; Bo Zhu; Huijiang Gao; Hemin Ni; Yan Chen
Journal:  Sci Rep       Date:  2016-11-28       Impact factor: 4.379

Review 2.  The Relationship of the Mechanisms of the Pathogenesis of Multiple Sclerosis and the Expression of Endogenous Retroviruses.

Authors:  Vera R Lezhnyova; Ekaterina V Martynova; Timur I Khaiboullin; Richard A Urbanowicz; Svetlana F Khaiboullina; Albert A Rizvanov
Journal:  Biology (Basel)       Date:  2020-12-11

Review 3.  Epstein-Barr virus and multiple sclerosis.

Authors:  Samantha S Soldan; Paul M Lieberman
Journal:  Nat Rev Microbiol       Date:  2022-08-05       Impact factor: 78.297

4.  Association of EVI5 rs11808092, CD58 rs2300747, and CIITA rs3087456 polymorphisms with multiple sclerosis risk: A meta-analysis.

Authors:  Jiahe Liu; Xu Liu; Yong Liu; Shimin Deng; Hongbin Huang; Qicong Chen; Weidong Liu; Zunnan Huang
Journal:  Meta Gene       Date:  2016-04-25
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.