| Literature DB >> 34367251 |
Ting Li1, He Li2, Yue Li1, Shu-An Dong3, Ming Yi1, Qiu-Xia Zhang1, Bin Feng1, Li Yang1, Fu-Dong Shi1,4,5, Chun-Sheng Yang1.
Abstract
BACKGROUND: Neuromyelitis optica spectrum disorder (NMOSD) is an inflammatory disease of the central nervous system and it is understandable that environmental and genetic factors underlie the etiology of NMOSD. However, the susceptibility genes and associated pathways of NMOSD patients who are AQP4-Ab positive and negative have not been elucidated.Entities:
Keywords: Genome-wide association study; Neuromyelitis optica spectrum disorder; gene differential expression; gene sets; pathway
Year: 2021 PMID: 34367251 PMCID: PMC8335167 DOI: 10.3389/fgene.2021.690537
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Flow diagram of the three-phase analysis design. In phase I, two new gene-based tests for the GWAS approach, called VEGAS2 and PLINK, were used to conduct a gene-based association study. In phase II, Gene sets identified with meta p < 0.05/n were carried forward to the next phase. In phase III, protein-protein association networks were performed by STRING. Meanwhile, KEGG pathways and GO analysis were carried out for these NMOSD susceptibility genes. VEGAS2 and PLINK are two kinds of new gene-based tests. “n” represents the number of genes shared by the VEGAS2 and PLINK tests.
Susceptibility genes by using more stringent p-Values and the intersection of VEGAS2 and PLINK in AQP4-positive NMOSD.
| Gene set | Position | nSNPs | |
| ATP6V1G2 | chr6:31620699..31621843 | 11 | 1E-06 |
| ATP6V1G2-DDX39B | chr6:31530219..31546848 | 20 | 1E-06 |
| CFB* | chr6:32022003..32027809 | 18 | 1E-06 |
| CLIC1 | chr6:31810689..31812273 | 4 | 1E-06 |
| EHMT2* | chr6:31956199..31972526 | 16 | 1E-06 |
| HLA-DQA1* | chr6:32717784..32717784 | 22 | 1E-06 |
| MB21D1 | chr6:73423711..73452297 | 29 | 1E-06 |
| MSH5* | chr6:31816126..31837338 | 34 | 1E-06 |
| MSH5-SAPCD1 | chr6:31739948..31764850 | 36 | 1E-06 |
| SLC44A4* | chr6:31942176..31954213 | 44 | 1E-06 |
| STK19_1 | chr6:31971175..31981446 | 10 | 1E-06 |
| TNXB_2 | chr6:32041153..32109338 | 85 | 1E-06 |
| SLC17A5 | chr6:74361136..74419621 | 334 | 2E-06 |
| AGER | chr6:32257794..32259972 | 5 | 1.09E-06 |
| BTNL2 | chr6:32470908..32482618 | 35 | 1.43E-06 |
| C6orf10 | chr6:32368537..32447625 | 213 | 1.88E-09 |
| C6orf136 | chr6:30726885..30726885 | 1 | 7.09E-08 |
| C6orf27 | chr6:31841629..31850569 | 4 | 6.22E-07 |
| C6orf48 | chr6:31912625..31915519 | 4 | 1E-08 |
| CFB* | chr6:32022003..32027809 | 15 | 1.2E-06 |
| EGFL8 | chr6:32242488..32242488 | 1 | 1.9E-08 |
| EHMT2* | chr6:31956199..31972526 | 10 | 1.81E-09 |
| HLA-DQA1* | chr6:32717784..32717784 | 1 | 3.29E-09 |
| HLA-DRA | chr6:32515687..32520787 | 37 | 8.77E-10 |
| MSH5* | chr6:31816126..31837338 | 19 | 3.78E-10 |
| PBX2 | chr6:32262263..32265617 | 4 | 3.54E-07 |
| PRRT1 | chr6:32225949..32225949 | 1 | 2.21E-08 |
| RNF5 | chr6:32254470..32256381 | 6 | 9.36E-08 |
| SKIV2L | chr6:32035321..32045016 | 17 | 6.39E-07 |
| SLC44A4* | chr6:31942176..31954213 | 19 | 2.82E-06 |
| STK19 | chr6:32048876..32055439 | 7 | 2.24E-13 |
Comparison of significant shared gene sets in AQP4-positive NMOSD produced with VEGAS2 and PLINK.
| Gene set | Position | PLINK | VEGAS2 | |||
| NSNP | NSNP | P meta | ||||
| MSH5 | chr6:31816126..31837338 | 19 | 3.78E-10 | 34 | 1.00E-06 | 1.38E-14 |
| EHMT2 | chr6:31956199..31972526 | 10 | 1.81E-09 | 16 | 1.00E-06 | 6.31E-14 |
| HLA-DQA1# | chr6:32717784..32717784 | 1 | 3.29E-09 | 22 | 1.00E-06 | 1.13E-13 |
| CFB | chr6:32022003..32027809 | 15 | 1.20E-06 | 18 | 1.00E-06 | 3.42E-11 |
| SLC44A4 | chr6:31942176..31954213 | 19 | 2.82E-06 | 44 | 1.00E-06 | 7.77E-11 |
| C6orf10 | chr6:32368537..32447625 | 213 | 1.88E-09 | 14 | 0.0027 | 1.37E-10 |
| CLIC1 | chr6:31810689..31812273 | 2 | 7.76E-06 | 4 | 1.00E-06 | 2.06E-10 |
| SLC17A5 | chr6:74361136..74419621 | 92 | 4.78E-06 | 334 | 2.00E-06 | 2.52E-10 |
| HIST1H1B | chr6:27943197..27943251 | 2 | 8.83E-06 | 12 | 1.20E-05 | 2.54E-09 |
| HIST1H2BL | chr6:27883653..27883676 | 2 | 2.57E-05 | 6 | 2.10E-05 | 1.21E-08 |
| HIST1H2AL | chr6:27941153..27941153 | 1 | 2.66E-05 | 9 | 2.30E-05 | 1.36E-08 |
| ATP6V1G2 | chr6:31620699..31621843 | 5 | 0.003494 | 11 | 1.00E-06 | 7.15E-08 |
| ZSCAN16 | chr6:28200582..28205172 | 8 | 7.64E-05 | 14 | 5.40E-05 | 8.38E-08 |
| BTN3A2 | chr6:26473574..26486128 | 39 | 0.000377 | 89 | 2.20E-05 | 1.62E-07 |
| EEF1A1 | chr6:74283039..74286513 | 7 | 9.38E-05 | 20 | 9.40E-05 | 1.72E-07 |
| MTO1 | chr6:74228188..74266790 | 17 | 0.000618 | 78 | 2.30E-05 | 2.71E-07 |
| VARS | chr6:31856799..31870822 | 5 | 1.10E-05 | 2 | 0.00213 | 4.36E-07 |
| C6orf15 | chr6:31187055..31188078 | 9 | 0.000135 | 24 | 0.000181 | 4.54E-07 |
| ARSI | chr5:149656490..149660452 | 3 | 0.000152 | 18 | 0.000197 | 5.48E-07 |
| ZFP57 | chr6:29750488..29756485 | 16 | 0.000276 | 17 | 0.000148 | 7.37E-07 |
| BTN2A1 | chr6:26566244..26576639 | 20 | 0.003007 | 66 | 3.60E-05 | 1.84E-06 |
| C2 | chr6:32003952..32019988 | 12 | 0.001923 | 41 | 8.30E-05 | 2.66E-06 |
| LTA | chr6:31648120..31648763 | 5 | 0.001067 | 8 | 0.000202 | 3.52E-06 |
| HIST1H3J | chr6:27966400..27966400 | 1 | 0.000498 | 6 | 0.000467 | 3.79E-06 |
| PSORS1C1 | chr6:31190939..31215712 | 65 | 0.005206 | 149 | 6.40E-05 | 5.3E-06 |
| GLS | chr2:191457280..191537657 | 43 | 0.000938 | 108 | 0.000379 | 5.63E-06 |
| CDSN | chr6:31190939..31192771 | 17 | 0.001358 | 35 | 0.000402 | 8.42E-06 |
| APPL1 | chr3:57238434..57282001 | 17 | 0.000839 | 63 | 0.000788 | 1.01E-05 |
| NFKBIL1 | chr6:31623742..31633891 | 12 | 0.003355 | 32 | 0.000241 | 1.22E-05 |
| ZNF165 | chr6:28154652..28163474 | 8 | 0.003189 | 29 | 0.00042 | 1.95E-05 |
| ZKSCAN3 | chr6:28426460..28441375 | 24 | 0.008439 | 64 | 0.000175 | 2.13E-05 |
| HSPA1A | chr6:31891486..31891486 | 1 | 0.000957 | 2 | 0.00157 | 2.16E-05 |
| C6orf203 | chr6:107458491..107478905 | 17 | 0.000463 | 87 | 0.00427 | 2.79E-05 |
| ZMYND19 | chr9:139597070..139602897 | 4 | 0.000315 | 19 | 0.00638 | 2.83E-05 |
| EIF3F | chr11:7965996..7973800 | 16 | 0.004203 | 35 | 0.000498 | 2.95E-05 |
| PSORS1C2 | chr6:31213289..31215066 | 12 | 0.004212 | 16 | 0.000612 | 3.57E-05 |
| HRASLS2 | chr11:63078313..63086972 | 5 | 0.003647 | 14 | 0.000846 | 4.22E-05 |
| KLHL30 | chr2:238715095..238725505 | 11 | 0.000507 | 54 | 0.0069 | 4.75E-05 |
| CAPN2 | chr1:221967442..222030091 | 65 | 0.002096 | 206 | 0.00187 | 5.27E-05 |
| CDC16 | chr13:114021570..114055715 | 19 | 0.003988 | 90 | 0.00103 | 5.51E-05 |
| PKN3 | chr9:130508561..130522288 | 2 | 0.001899 | 13 | 0.00222 | 5.64E-05 |
| TBCB | chr19:41301630..41306302 | 5 | 0.000416 | 25 | 0.015898 | 8.55E-05 |
| TJP2 | chr9:70980117..71059730 | 108 | 0.007071 | 418 | 0.001 | 9.09E-05 |
| TAP2 | chr6:32897785..32914439 | 65 | 0.006917 | 112 | 0.00111 | 9.81E-05 |
| RBBP6 | chr16:24458431..24490907 | 30 | 0.005909 | 75 | 0.00134 | 0.000101 |
| CYP2A6 | chr19:46042434..46046373 | 3 | 0.001634 | 26 | 0.00522 | 0.000108 |
| PSMB9 | chr6:32930164..32934963 | 14 | 0.002805 | 23 | 0.00336 | 0.000118 |
| TCP10 | chr6:167715748..167716814 | 4 | 0.004822 | 28 | 0.00241 | 0.000144 |
| POLA2 | chr11:64786765..64821494 | 15 | 0.003271 | 36 | 0.00391 | 0.000157 |
| LST1 | chr6:31662586..31664560 | 6 | 0.0259 | 7 | 0.000496 | 0.000158 |
Comparison of significant shared gene sets in AQP4 negative NMOSD produced with VEGAS2 and PLINK.
| Gene set | Position | VEGAS2 | PLINK | |||
| NSNP | NSNP | P meta | ||||
| RGS1 | chr1:190811722..190815166 | 9 | 0.001563 | 25 | 0.000215 | 5.35E-06 |
| COL8A2 | chr1:36338171..36338204 | 2 | 4.32E-05 | 15 | 0.015598 | 1.02E-05 |
| AGTPBP1 | chr9:87354033..87545005 | 115 | 0.000611 | 413 | 0.00196 | 1.75E-05 |
| SYT9 | chr11:7232814..7446395 | 376 | 0.02569 | 743 | 0.000047 | 1.77E-05 |
| SGIP1 | chr1:66772691..66982406 | 374 | 0.002918 | 654 | 0.000461 | 1.95E-05 |
| CEACAM4 | chr19:46817929..46824888 | 10 | 0.03613 | 31 | 0.000043 | 2.23E-05 |
| INTS5 | chr11:62177128..62177128 | 1 | 0.000506 | 5 | 0.00558 | 3.89E-05 |
| RAD51AP2 | chr2:17557392..17562159 | 7 | 0.003099 | 36 | 0.000931 | 3.97E-05 |
| CDR2L | chr17:70496373..70513013 | 6 | 0.000389 | 20 | 0.00889 | 4.7E-05 |
| ZNF652 | chr17:44728416..44791811 | 39 | 0.005543 | 148 | 0.000784 | 5.8E-05 |
| WDR86 | chr7:150709151..150736999 | 23 | 0.008985 | 101 | 0.000599 | 7.07E-05 |
| HSD11B1 | chr1:207927291..207973298 | 28 | 0.005835 | 76 | 0.000976 | 7.45E-05 |
| SYCE1 | chr10:135217645..135231917 | 54 | 0.001929 | 100 | 0.00354 | 8.81E-05 |
| SSFA2 | chr2:182469362..182502779 | 16 | 0.003418 | 48 | 0.00205 | 9.02E-05 |
| WT1 | chr11:32366048..32412429 | 65 | 0.002659 | 159 | 0.00276 | 9.41E-05 |
| RAX2 | chr19:3720253..3720899 | 4 | 0.001959 | 16 | 0.004 | 1E-04 |
| CAPG | chr2:85475570..85489706 | 11 | 0.009872 | 77 | 0.000851 | 0.000107 |
| PIGX | chr3:197926529..197946898 | 12 | 0.003838 | 75 | 0.00228 | 0.000111 |
| RPS3 | chr11:74788237..74794135 | 4 | 0.001819 | 31 | 0.00564 | 0.000128 |
| PGRMC2 | chr4:129414746..129427709 | 5 | 0.003674 | 19 | 0.0028 | 0.000128 |
| TRAF3IP3 | chr1:207996202..208020329 | 32 | 0.00363 | 61 | 0.0033 | 0.000148 |
| PTPLAD1 | chr15:63611672..63657183 | 24 | 0.005597 | 89 | 0.00231 | 0.000158 |
FIGURE 2Network of known and predicted interactions between proteins encoded by NMOSD potential susceptibility genes were identified by GWAS of AQP4-positive NMOSD. Network nodes represent the proteins produced by a single, protein-coding gene locus. Edges represent protein-protein associations meant to be specific and meaningful. In STRING, blue edge represents protein-protein associations with known interactions from curated databases, purple edge represents protein-protein associations with known interactions experimentally determined, green edge represents protein-protein associations with predicted interactions with gene neighborhood, red edge represents protein-protein associations with predicted interactions with gene fusions, Navy blue edge represents protein-protein associations with predicted interactions with gene co-occurrence, and other associations with text mining (yellow edge), co-expression (black edge), and protein homology (lavender edge).
FIGURE 3Network of known and predicted interactions between proteins encoded by NMOSD potential susceptibility genes were identified by GWAS of AQP4-negative NMOSD.
Significant KEGG pathways of AQP4-positive and negative with P < 0.05 by pathway analysis of NMOSD GWAS.
| Pathway ID | Pathway name | Pathway classification | Related genes | |
| hsa05322 | Systemic lupus erythematosus | Immune disease | 6.62E-05 | HLA-DQA1; HIST1H2BL; HIST1H2AL; C2; HIST1H3J |
| hsa05150 | Staphylococcus aureus infection | Infectious disease | 8.17E-04 | HLA-DQA1; CFB; C2 |
| hsa04612 | Antigen processing and presentation | Immune system | 0.002058 | HLA-DQA1; HSPA1A; TAP2 |
| hsa04940 | Type I diabetes mellitus | Endocrine and metabolic disease | 0.008931 | HLA-DQA1; LTA |
| hsa05110 | Vibrio cholerae infection | Infectious disease | 0.011943 | ATP6V1G2; TJP2 |
| hsa04145 | Phagosome | Transport and catabolism | 0.013689 | HLA-DQA1; ATP6V1G2; TAP2 |
| hsa05134 | Legionellosis | Infectious disease | 0.014332 | EEF1A1; HSPA1A |
| hsa05034 | Alcoholism | Substance dependence | 0.021444 | HIST1H2BL; HIST1H2AL; HIST1H3J |
| hsa05168 | Herpes simplex infection | Infectious disease | 0.023038 | HLA-DQA1; LTA; TAP2 |
| hsa04610 | Complement and coagulation cascades | Immune system | 0.028318 | CFB; C2 |
| hsa04211 | Longevity regulating pathway | Aging | 0.03527 | EHMT2; APPL1 |
| hsa05323 | Rheumatoid arthritis | Immune disease | 0.035998 | HLA-DQA1; ATP6V1G2 |
| hsa00563 | Glycosylphosphatidylinositol (GPI)-anchor biosynthesis | Glycan biosynthesis and metabolism | 1.33E-02 | PIGX |
| hsa00140 | Steroid hormone biosynthesis | Lipid metabolism | 3.18E-02 | HSD11B1 |
| hsa00980 | Metabolism of xenobiotics by cytochrome P450 | Xenobiotics biodegradation and metabolism | 4.01E-02 | HSD11B1 |
| hsa05204 | Chemical carcinogenesis | Cancer | 4.32E-02 | HSD11B1 |
FIGURE 4The top ten GO annotations of AQP4-positive NMOSD, (A) is for biological process, (B) is for molecular function, and (C) is for cellular component.
FIGURE 5The top ten GO annotations of AQP4- negative NMOSD, (A) is for biological process, (B) is for molecular function, and (C) is for cellular component.