Literature DB >> 28382934

Connecting myelin-related and synaptic dysfunction in schizophrenia with SNP-rich gene expression hubs.

Hedi Hegyi1.   

Abstract

Combining genome-wide mapping of SNP-rich regions in schizophrenics and gene expression data in all brain compartments across the human life span revealed that genes with promoters most frequently mutated in schizophrenia are expression hubs interacting with far more genes than the rest of the genome. We summed up the differentially methylated "expression neighbors" of genes that fall into one of 108 distinct schizophrenia-associated loci with high number of SNPs. Surprisingly, the number of expression neighbors of the genes in these loci were 35 times higher for the positively correlating genes (32 times higher for the negatively correlating ones) than for the rest of the ~16000 genes. While the genes in the 108 loci have little known impact in schizophrenia, we identified many more known schizophrenia-related important genes with a high degree of connectedness (e.g. MOBP, SYNGR1 and DGCR6), validating our approach. Both the most connected positive and negative hubs affected synapse-related genes the most, supporting the synaptic origin of schizophrenia. At least half of the top genes in both the correlating and anti-correlating categories are cancer-related, including oncogenes (RRAS and ALDOA), providing further insight into the observed inverse relationship between the two diseases.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28382934      PMCID: PMC5382542          DOI: 10.1038/srep45494

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Gene expression correlation, protein-protein interaction and other high-throughput experiments in the post-genomic era have revealed that genes tend to form complex, scale-free networks where most genes have a few connections with others and a few have a high number of interactions, commonly referred to as “hubs”, establishing them as important central genes in these gene networks1. These highly interconnected genes have become the targets of intense research expecting them to play prominent roles in genetic diseases. However, measurements found only a weak correlation between disease genes and hubs, e.g. Barabasi et al.2 found that disease genes have 32% more interactions with other proteins than non-disease genes, arguing that genetic mutations in topologically central, widely expressed genes are more likely to result in severe impairment of normal development, leading to lethality in utero and eventual deletion from the population2. This is apparently not the case in schizophrenia, a disease that steadily affects about one percent of the population despite the lower fecundity of the affected individuals3. In most cases of schizophrenia no known mutations exist in protein-coding genes4 and the role of gene expression dysregulation in schizophrenia is increasingly recognized56. A recent meta-analysis, examining the mutations in the genomes of 36,000 schizophrenics, identified 108 distinct genomic regions with significantly higher mutation rates7. We integrated this set with gene expression data from the Allen Brain Atlas89. After determining the correlating and anti-correlating neighbors of genes in the 108 loci we found that the median number of correlating neighbors were about 35 times higher for the genes in the 108 regions than in the rest of the genome (32 times for the anti-correlating pairs). We also integrated the data with a recent methylome study in schizophrenics10. Ranking the genes for the hypermethylated probes of the positively correlating gene pairs identified the top gene as SYNGR1, a synapse-related gene whose regulatory region overlaps with one of the 108 SNP-rich loci in ref. 7. Ranking the genes for the hypermethylated probes of the negatively correlating gene pairs identified MBP and MOBP. They are both myelin-related, anti-correlating with a large number of synapse-related and glutamate receptor genes, offering a model that connects these frequently observed but so far disjoint pathologies in schizophrenia.

Results

Genes in the most mutated genomic regions have the highest number of correlating partners

In the first step we identified all human gene expression pairs with a Pearson correlation of >=0.8 or <=−0.7 in human brain tissues using the Allen Brain Atlas (website: brain-map.org)89, and a recent database with pre-calculated Pearson correlations for all relevant gene pairs at http://www.szdb.org/download.html#coexpression by Wu et al.11. This resulted in 1,257,407 positively correlating and 1,108,585 negatively correlating unique gene pairs belonging to 16829 and 16761 individual genes, respectively (Supplementary Tables 1 and 2). Subsequently we mapped all known human genes in 108 highly mutated human genomic regions identified in ref. 7 separately for genes whose promoters or only cis regulatory elements fall into these 108 loci. Filtering for genes present in the correlating or anti-correlating pairs by Wu et al.11 resulted in 254 promoter-selected and 462 cis-selected genes (indicated by 1p/1c in Supplementary Tables 1 and 2, respectively). Counting the positively and negatively correlating partners for all the genes except in the 108 loci in ref. 7 resulted in a median number of 71 positively and 63 negatively correlating pairs, respectively (Fig. 1). Unexpectedly, the median numbers of correlating pairs for the promoter-selected genes were 2472 and 2013, corresponding to a 35-fold and 32-fold increase for the positive and negative pairs, respectively.
Figure 1

Boxplots of inteacting partners for all genes and promoter-selected and cis-selected genes in 108 SNP-rich genomic regions taken from ref. 7.

(A) Positive, (B) negative correlating partners.

The number of correlating pairs for the cis-selected genes also proved to be higher (386 and 320 pairs on average for positively and negatively correlating pairs, respectively), corresponding to an approximately 2.5-fold increase when compared to the general gene population. The median values were similar to the average gene population in this category (Fig. 1) but an unpaired t-test showed that the cis-derived genes and the general gene population (excluding the 108-loci genes) are significantly different (p-value = 0.012). We also investigated the number of differentially methylated gene pairs using a table of 56001 differentially methylated probes from a genome-wide methylome study in schizophrenics10. We counted separately the number of hypermethylated and hypomethylated genes for the highly correlating and anti-correlating gene pairs and also the total sum of hyper- and hypomethylated probes belonging to these genes. The sums of scores for both the number of differentially methylated gene pairs and the total sum of their differentially methylated probes are shown in Supplementary Tables 1 and 2, for the positively and negatively correlating pairs, respectively. To determine which measurement distinguishes the best between schizophrenia-related and non-specific genes we calculated the effect size for the 108-loci genes and also for genes annotated by Genecards and Malacards1213 as schizophrenia-related (Fig. 2), using Cliff’s delta for five different measurements (see legend for details).
Figure 2

The effect sizes for the ratios of gene neighbor numbers between specific genes and their complementary gene sets (i.e. the rest of the ~16 k genes present in the study): 108-loci cis genes; Malacards-annotated schizophrenia genes; Genecards-annotated schizophrenia genes; 108-loci promoter genes.

Effect sizes were calculated for 5 different measurements: (i) pairs, the total number of correlating gene pairs; (ii) pairs_hypometh, the number of correlating gene pairs that are hypomethylated (defined as in at least one probe the gene is hypomethylated); (iii) pairs_hypermeth, the number of correlating gene pairs that are hypermethylated; (iv) pairs_hypoSum, the total number of hypomethylated probes of the correlating gene pairs; (v) pairs_hyperSum, the total number of hypermethylated pairs for the correlating gene pairs. The numbers were filtered using only pairs of genes where both genes were differentially methylated in ref. 10. (A) Positively, (B) Negatively correlating gene pairs.

Figure 2a and b shows the effect sizes for all relevant pairs of complementary gene sets (i.e. promoter-derived vs. non-promoter-derived, Genecards vs. non-Genecards genes, etc.) comparing the numbers of correlating gene partners in one set to the numbers of the gene partners in the other gene set. As expected the promoter-derived 108-loci genes had the greatest effect size, followed by Malacards and Genecards, the 108-loci cis-derived genes having the smallest effect. The results were similar for the negatively correlating pairs (Fig. 2b). Here we also filtered for gene pairs that were both differentially methylated in ref. 10, which increased the effect size by 0.039 for “pairs” in Fig. 2 for the promoter-derived positive set and by 0.061 for the same for the negative set. (Unfiltered charts shown in Supplementary Figure 1A,B).

Specific genes with high numbers of correlating partners

We ranked all genes according to the number of hypermethylated probes (taken from ref. 10) of the correlating gene partners. Tables 1 and 2 show the top 20 genes with the most hypermethylated probes, in the positively and negatively correlating partners, respectively. The top-ranking gene in Table 1 is SYNGR1, synaptogyrin, followed by GRIN1, a glutamate receptor and CHRNB2, a cholinergic receptor. All three genes have synapse-related functions, SYNGR1 and GRIN1 regulate synaptic plasticity whereas CHRNB2 regulates synapse assembly14.
Table 1

The top 20 positively correlating genes, ranked according to the hypermethylated probes (listed in column 6) in the correlating gene partners.

genepairspairs hypo methpairs hyper methpairs hypo Sumpairs hyper Sumprobes hypoprobes hypergenecards Scoremalacards Scoreprom/ cis?prot lendescription
SYNGR143361437208635538290088.57NA1c233synaptogyrin 1
GRIN138001261191533027959219.077.55NA959glutamate receptor, ionotropic, NMDA 1
CHRNB240701391199234657880139.075.10NA502cholinergic receptor, nicotinic beta 2
ARHGEF11*4024139419513506783203NANANA1562Rho guanine nucleotide exchange factor 11
ALDOA*4949155822083983781125−2.01NA1p418aldolase, fructose-bisphosphate A
SREBF2*3750129518103352764310−1.79NA1c1141sterol regulatory element binding TF 2
PITPNM233081202173431887626218−2.01NA1p1349phosphatidylinositol transfer protein
GRAMD1B4159142819803418756926NANA1c745GRAM domain containing 1B
NISCH*4152133119513250749503−1.60NA1c1504nischarin
RIMS1*4679147420673461746503NANA1c1692regulating synaptic membrane exocytosis 1
CPLX234331149167731397345359.072.48NA134complexin 2
CHGB*36651205174631547316109.073.83NA677chromogranin B
STX1A3941125918803023728316−0.60NANA288syntaxin 1 A
SEZ6L2*3406116917473046726501NANA1c923seizure related 6 homolog (mouse)-like 2
L3MBTL24406136119723127722710NANA1p705l(3)mbt-like 2 (Drosophila)
SLC45A1*3454116017402988721814NANA1c782solute carrier family 45 member 1
GRIA1*43371377195032567141039.06NA1c916glutamate receptor, ionotropic, AMPA 1
RANGAP13704123217723074709101NANA1p587Ran GTPase activating protein 1
TCF20*3497124217003258704401NANA1p1960transcription factor 20 (AR1)
EPC25124151420843436690910NANA1p807enhancer of polycomb homolog 2

The total number of positively correlating genes (column 2), of which hypo- or hypermethylated (columns 3 and 4), are also listed. Genecards and Malacards scores are listed in columns 7 and 8. The column “prom/cis?” indicates if the promoter of the gene (“1p”) or the cis regulatory region of the gene (“1c”) falls into the 108 highly mutated loci in ref. 7. A “*” next to the gene identifier indicates a role in cancer.

Table 2

The top 20 negatively correlating genes, ranked according to the hypermethylated probes (listed in column 6) in the anti-correlating gene partners.

genepairspairs hypo methpairs hyper methpairs hypo Sumpairs hyper Sumprobes hypoprobes hypergenecards Scoremalacards Scoreprom/ cis?protlendescription
DBI*60131882260144009376NANA−1.01NANA143diazepam binding inhibitor (GABA receptor modulator)
C1orf545656179225074233916401NANA1p131chromosome 1 open reading frame 54
RRAS*6426185126694197898610NANANA218related RAS viral (r-ras) oncogene homolog
C2orf826837194827204274893421−0.01NA1p121chromosome 2 open reading frame 82
NDUFA4L26517183726354132866851NANA1p93NADH dehydrogenase 1 alpha subcomplex, 4-like 2
MOBP57411678246236228576909.072.40NA206myelin-associated oligodendrocyte basic protein
MBP*546716562349364482662588.56NANA304myelin basic protein
CDK2AP1*4638149621203668813050−0.79NA1p115cyclin-dependent kinase 2 associated protein 1
S100A1*4791152921143501749710NANANA147S100 calcium binding protein A1
FABP7*39211299185232247490310.00NANA166fatty acid binding protein 7, brain
TMEM219*51881603215337957218NANANANA1p240transmembrane protein 219
DGCR6*599317212310383670400110.4813.8NA220DiGeorge syndrome critical region gene 6
ESRP2*3767118817702842682801NANA1p727epithelial splicing regulatory protein 2
AS3MT37111269171831206780020.22NA1p375arsenite methyltransferase
ANP32E3907120417473092673810NANA1p268acidic nuclear phosphoprotein 32 family member E
ALDH1A1*39921236181528986720NANA−1.00NANA501aldehyde dehydrogenase 1 family member A1
PSMB10*52141565205835796528NANANANA1p273proteasome subunit beta 10
HIRIP355031514218331586524NANANANA1p556HIRA interacting protein 3
SELENBP1*50441426209029526514400.15NANA514selenium binding protein 1
RENBP*52011438211129566507NANA−0.60NANA427renin binding protein

The total number of positively correlating genes (column 2), of which hypo- or hypermethylated (columns 3 and 4), are also listed. Genecards and Malacards scores are listed in columns 7 and 8. The column “prom/cis?” indicates if the promoter of the gene (“1p”) or the cis regulatory region of the gene (“1c”) falls into the 108 highly mutated loci in ref. 7. A “*” next to the gene identifier indicates a role in cancer.

SYNGR1 is also present in the 108 most mutated loci in ref. 7 although only for its regulatory region, not for its promoter. Further functional analysis identified 5 more synapse-related genes in Table 1: NISCH regulates synaptic transmission, RIMS1 and STX1A regulate postsynaptic potential, CPLX2 regulates synaptic plasticity and GRIA1 regulates long-term synaptic depression. Altogether 11 out of the 20 genes in Table 1 are annotated in Genecards as schizophrenia-related and 14 appear in the 108 genomic regions in ref. 7. The only gene in Table 1 that appears in neither is ARHGEF11, a glutamate transport enhancer, however it had a significantly higher expression in the thalamus of schizophrenics in ref. 15. Table 2 lists the top 20 genes with the highest number of hypermethylated probes for the negatively correlating partners. The top-ranking gene, DBI, is a GABA receptor modulator; its role having been contemplated in schizophrenia16 the authors concluded that DBI might have a symptom modulatory rather than an etiological role in schizophrenia. Unexpectedly, MOBP, myelin-associated oligodendrocyte basic protein and MBP, myelin basic protein, both appear in Table 2. While myelin-related abnormalities are one of the hallmarks of schizophrenia, surprisingly, not much functional information is available about MOBP beyond its role in the formation of the myelin sheath17 and a knockout mouse was phenotypically indistinguishable from the wild type18. Nevertheless, the authors argue that MOBP probably has a so far undiscovered function, due to the conservation of several alternatively spliced variants in rat and mouse18. Altogether, 11 of the 20 genes with the greatest numbers of hypermethylated probes for their anti-correlating gene partners (Table 2) are annotated as schizophrenia-related in Genecards and 10 appear in the 108 loci in ref. 7. Interestingly, several genes in both tables are cancer-related (indicated with a “*” next to the gene identifiers in Tables 1 and 2). Among the top 20 genes for hypermethylated probes of correlating partners are ARHGEF111920 and SREPF2, both associated with prostate and breast cancer2122 while ALDOA functions as an oncogene in highly metastatic pancreatic cancer23. NISCH has a tumor-suppressive function in breast cancer24, CHGB is associated with aggressive VHL-associated pancreatic neuroendocrine tumors25. SEZ6L2 is a prognostic marker for lung cancer26, SLC45A1 is deleted in neuroblastoma27. GRIA1 might play a role in glioma proliferation28 while TCF20 is again associated with prostate29 and breast cancer30. Among the top 20 genes for hypermethylated probes of anti-correlating partners are RRAS, an oncogene, also involved in neuronal axon guidance31 and MBP, associated with oligodendrogliomas32. TMEM219 regulates apoptosis33; CDK2AP1 is a putative oral cancer suppressor34; PSMB10, a proteasome subunit, is upregulated via the NFKB1 pathway in cancer cells35. S100A1 is also associated with several tumor types and inhibits apoptosis in ventricular cardiomyocytes36. DGCR6 is associated with DiGeorge syndrome, a consequence of microdeletions in chromosomal region 22q11.2 and also has increased levels in metastatic mammary tumour cells37. FABP7 plays a role in neurogenesis and is a marker of glioma stem cells38. Epithelial splicing regulatory protein 2 (ESRP2) suppresses cancer cell motility39. ALDH1A1 is a marker and prognostic factor for several human cancers, bladder40, pancreatic41, lung42, colorectal and breast cancer43, among others. Altogether, 10 genes among the top 20 for the greatest number of hypermethylated probes for the positively correlating gene partners (Table 1) and 13 of the 20 genes with the greatest numbers of hypermethylated probes for their anti-correlating gene partners (Table 2) are annotated as cancer-related in Genecards.

The connection between the positive and negative hubs

The existence of the two kinds of hub genes with high numbers of positive or negative correlating partners raises the question about their functionality and the underlying neurobiological pathways: are they related or do they form mostly separate networks? To answer this question we constructed three gene networks (Fig. 3): two representing the positive and negative correlations only for SYNGR1 and MOBP, respectively (Fig. 3a,b), and one (Fig. 3c) showing a combined network of the two. We chose SYNGR1 for the positive hub for it is the top ranking gene in Table 1 and is also located in one of the PGC regions whereas we chose MOBP because it ranks highly among the anti-correlating genes, indicating a putative regulatory function, which, however, has not been observed yet experimentally.
Figure 3

Gene networks of hub genes generated from hypermethylated Malacards-annotated genes.

(A) SYNGR1-centered, positively correlating gene network. The red-colored edges show correlations with SYNGR1. (B) Negatively correlating gene network, centered on MOBP. All the red-colored edges show negative correlations with MOBP. All the other pairwise negative correlations are shown in grey. The purple-colored genes overlap with 108 highly mutated genomic loci in schizophrenics (see text for details). (C) The combined network in (A and B). All light green-colored genes correlate positively with SYNGR1 and negatively with MOBP. The purple-colored and turquoise-colored genes correlate only with MOBP or SYNGR1, respectively.

To reduce the size of the networks we selected only Malacards-annotated schizophrenia-related genes that are hypermethylated in ref. 10. SYNGR1 correlates positively with 49 genes (Fig. 3a); MOBP also has 49 negatively correlating partners (Fig. 3b). The combined network is shown in Fig. 3c. Strikingly, 41 genes are shared interacting partners between MOBP and SYNGR1. Each of the two hub genes interacts with only 8 genes that are not shared with the other hub. Clearly, the positive and negative correlations - and such interactions – form highly interconnected networks that provide synaptic functions, the core functionality of the human brain. The shared genes reflect remarkably on the nature and most consistently observed features of schizophrenia: 6 shared genes are glutamate receptors (GRIA4, GRIK3, GRIK5, GRIN1, GRM1, GRM2), one GABA receptor (GABBR1), a cannabinoid receptor (CNR1), GAD2, a glutamate decarboxylase, two synapse-related (SYN2, SYT11) and several neuron-specific genes are also present among the shared genes (Fig. 3c). We repeated the selection process replacing the Malacards-genes with the 108-loci genes. This resulted in a similar number of interacting genes that correlate positively with SYNGR1 (50 genes) and negatively with MOBP (48 genes). They share 39 common genes (Fig. 4). Biological processes derived from a Gene Ontology44 analysis of the shared genes either in the Malacards set or the 108-loci set and in both cases shared between SYNGR1 and MOBP and significantly enriched (p-value < 0.05) for both sets are shown in Table 3. For both gene sets the top two biological processes with the highest significance are “synaptic transmission” and “modulation of synaptic transmission”.
Figure 4

(A) All positively correlating genes in the 108 loci. The network is again centered on SYNGR1. The red-colored edges show correlations with SYNGR1. (B) All the genes in the 108 loci that correlate negatively with MOBP (edges colored red) or one another (grey edges). (C) The combined network in (A and B). All light green-colored genes correlate positively with SYNGR1 and negatively with MOBP (edges show in red in both cases). The purple-colored and turquoise-colored genes correlate only with MOBP or SYNGR1 (of the two), respectively.

Table 3

Biological processes shared between the Malacards gene set interacting with both SYNGR1 and MOBP in Fig. 3C (41 light green-colored genes) and the 108-loci genes interacting with both SYNGR1 and MOBP in Fig. 4C (39 light green-colored genes).

GO_identifierGO_Biological Processshared Genes_108 locip_val (108 loci)shared Genes Malacardsp_val (Malac Genes)
GO:0050804modulation of synaptic transmission70.00359141.23e-13
GO:0007268synaptic transmission90.00359168.05e-12
GO:0061564axon development70.0318112.14e-06
GO:0031175neuron projection development80.0289111.08e-05
GO:0048167regulation of synaptic plasticity40.046964.71e-05
GO:0048666neuron development90.0233114.71e-05
GO:0048812neuron projection morphogenesis70.031890.000146
GO:0007411axon guidance60.041570.00128
GO:0030182neuron differentiation90.031890.00545
GO:0044765single-organism transport150.0318150.0062
GO:0007399nervous system development140.0138120.011
GO:1902578single-organism localization160.0289150.0111

12 biological processes in the table are common in the two shared sets, despite the very limited number of actually shared genes (only NRGN is shared besides MOBP and SYNGR1 but these two were not included in the GO analysis).

Functional analysis of schizophrenia genes correlating with the top 20 hub genes

To get a further insight into the biological processes the top ranking genes and their network neighbors (i.e. their correlating and anti-correlating gene partners) partake in we took the top 20 genes with the most hypermethylated probes for the correlating (and anti-correlating) gene partners and selected those gene neighbors that correlated with minimum 19 of the top 20 genes. This resulted in 421 genes for the positively correlating set and 460 genes for the negatively correlating one. For this in silico functional analysis we used an online tool provided by STRING45 to identify Gene Ontology terms that are over-represented in our gene set. The positive gene set was associated with 278 significantly enriched GO terms for biological processes while the negative set had 151 such GO terms. Interestingly, while the two gene sets share only 117 common genes i.e. about 1/4th of the total gene number for either set, they also share 117 common biological processes (Table 4), a much higher fraction for either set (42% of the positive set-derived biological processes and 77% of the negative set-derived processes are shared).
Table 4

Biological processes shared between the positively correlating and negatively correlating gene neighbors of the top 20 genes in Table 1 and Table 2.

GO_identifierGO_Biological ProcessPos Neighborsp_val (Pos Neigh)Neg Neighborsp_val (Neg Neigh)
GO:0007399nervous system development885.21e-121054.23e-17
GO:0007268synaptic transmission482.95e-14497.45e-14
GO:0048699generation of neurons681.57e-11703.36e-10
GO:0048666neuron development526.61e-12523.78e-10
GO:0031175neuron projection development482.83e-12469.22e-10
GO:0022008neurogenesis681.62e-10702.03e-09
GO:0007267cell-cell signaling572.83e-12531.32e-08
GO:0048731system development990.0005941252.02e-08
GO:0030182neuron differentiation565.4e-11542.31e-08
GO:0048667cell morphogenesis involved in neuron differentiation393.77e-10382.98e-08
GO:0048812neuron projection morphogenesis397.5e-10385.45e-08
GO:0061564axon development371.74e-09361.16e-07
GO:0007409axonogenesis362.41e-09351.49e-07
GO:0007275multicellular organismal development1060.003471327.65e-07
GO:0030030cell projection organization512.12e-08501.66e-06
GO:0007610behavior321.34e-06341.8e-06
GO:0048858cell projection morphogenesis421.1e-07415.72e-06
GO:0007156homophilic cell adhesion via plasma membrane adhesion molecules100.0277177.86e-06
GO:0048856anatomical structure development1050.006191281.01e-05
GO:0050803regulation of synapse structure or activity282.83e-12201.43e-05
GO:0044767single-organism developmental process1180.002251382.61e-05
GO:0098742cell-cell adhesion via plasma-membrane adhesion molecules110.0369182.65e-05
GO:0050804modulation of synaptic transmission272.94e-10213.04e-05
GO:0044708single-organism behavior261.07e-05273.14e-05
GO:1902580single-organism cellular localization370.000174418.75e-05
GO:0007626locomotory behavior193.27e-06180.000103
GO:0032502developmental process1160.006281360.000108
GO:0030001metal ion transport363.9e-07330.00011
GO:0051650establishment of vesicle localization167.43e-05170.000111
GO:0051640organelle localization255.23e-06240.000152
GO:1902578single-organism localization1162.83e-12980.000153
GO:0006811ion transport468.77e-05490.000157
GO:0007611learning or memory160.000443180.000177
GO:0000902cell morphogenesis481.22e-07430.000182
GO:0030534adult behavior153.73e-05150.000195
GO:0044765single-organism transport1107.01e-12930.000195
GO:0050890cognition170.000535190.000241
GO:0007417central nervous system development350.0013400.000252
GO:0098660inorganic ion transmembrane transport341.76e-06310.000405
GO:0051641cellular localization761.56e-06730.000436
GO:0031344regulation of cell projection organization287.75e-06260.000577
GO:0051656establishment of organelle localization226.8e-06200.000577
GO:0061024membrane organization370.000377390.000832
GO:0051960regulation of nervous system development386.8e-07330.000962
GO:0048468cell development655.66e-07590.00104
GO:0032989cellular component morphogenesis495.34e-07430.00118
GO:0007420brain development280.00376320.0012
GO:0006810transport1252.83e-121020.00145
GO:0043087regulation of GTPase activity296.67e-05280.00148
GO:0010975regulation of neuron projection development246.8e-06210.00152
GO:0045664regulation of neuron differentiation282.54e-05260.00152
GO:0007613memory80.032110.00167
GO:0051234establishment of localization1266.68e-121040.0017
GO:0030154cell differentiation890.00066940.00201
GO:0048869cellular developmental process910.00124970.00259
GO:0006836neurotransmitter transport110.00842130.00268
GO:0044700single organism signaling1391.56e-061340.00275
GO:0051179localization1462.83e-121200.00276
GO:0071702organic substance transport663.73e-05640.00276
GO:0060322head development280.00956320.00351
GO:0065008regulation of biological quality941.07e-05900.004
GO:1902582single-organism intracellular transport410.00307440.00444
GO:0043547positive regulation of GTPase activity279.52e-05250.00452
GO:0007154cell communication1436.53e-071350.00462
GO:0007411axon guidance253.26e-05220.00462
GO:0034220ion transmembrane transport413.01e-06350.00483
GO:0050767regulation of neurogenesis314.27e-05280.00483
GO:0007269neurotransmitter secretion90.0209110.00491
GO:0070838divalent metal ion transport160.00343170.00515
GO:0023061signal release140.00137140.00516
GO:0043269regulation of ion transport397.33e-09280.00569
GO:0051649establishment of localization in cell689.18e-07590.00675
GO:0006812cation transport350.000182330.00683
GO:0035637multicellular organismal signaling100.00233100.00764
GO:0050806positive regulation of synaptic transmission131.78e-05100.00826
GO:0044093positive regulation of molecular function550.00211570.00829
GO:0051336regulation of hydrolase activity512.23e-05460.00829
GO:0048167regulation of synaptic plasticity171.64e-07110.00978
GO:0098662inorganic cation transmembrane transport304.56e-06240.0106
GO:0008088axon cargo transport95.65e-0660.0132
GO:0072384organelle transport along microtubule70.00064560.0132
GO:0008104protein localization622.35e-05560.0139
GO:0045665negative regulation of neuron differentiation100.041120.0139
GO:0007264small GTPase mediated signal transduction337.44e-05290.0144
GO:0023051regulation of signaling760.00514800.0146
GO:0031345negative regulation of cell projection organization100.00473100.0146
GO:0001764neuron migration110.00124100.0154
GO:0015031protein transport511.07e-05440.0154
GO:0043085positive regulation of catalytic activity490.00174490.0159
GO:0098655cation transmembrane transport331.16e-05270.0159
GO:0010646regulation of cell communication885.43e-05830.0163
GO:0010977negative regulation of neuron projection development80.022690.0163
GO:0045184establishment of protein localization531.32e-05460.0166
GO:0001508action potential90.0060290.0172
GO:0006816calcium ion transport140.00545140.019
GO:0044802single-organism membrane organization280.00842290.0227
GO:0033036macromolecule localization695.82e-05630.0246
GO:0055085transmembrane transport491.53e-06390.0251
GO:0035556intracellular signal transduction653.26e-05580.0265
GO:0046907intracellular transport470.00101450.0265
GO:0051899membrane depolarization70.0170.0265
GO:0065009regulation of molecular function800.00064780.027
GO:0007612learning100.01100.0298
GO:0048489synaptic vesicle transport100.0029690.0324
GO:0050773regulation of dendrite development110.0006690.0324
GO:0032940secretion by cell220.00835220.0356
GO:0051345positive regulation of hydrolase activity413.94e-06320.0358
GO:0060284regulation of cell development365.56e-05300.0365
GO:0097479synaptic vesicle localization110.00078290.0365
GO:0050790regulation of catalytic activity670.00203660.0383
GO:0050770regulation of axonogenesis100.014100.0405
GO:0034765regulation of ion transmembrane transport311.05e-08190.0418
GO:0050807regulation of synapse organization146.24e-0690.0426
GO:0035725sodium ion transmembrane transport120.00121100.0438
GO:0016043cellular component organization1250.0002261210.0456
GO:0047496vesicle transport along microtubule40.021540.0457
GO:0051128regulation of cellular component organization763.59e-06640.0495
This result also shows that gene networks as a whole are remarkably redundant, i.e. similar functions are often carried out by several functional homologs in the brain. This network view of the genes associated with schizophrenia is also revelatory for the polygenic nature of schizophrenia: once a mutation perturbs the expression of a highly connected hub gene with many interacting partners, this in turn will lead to the perturbation of several hundred or even thousand interacting genes. The differential methylation of several thousands of genes in the prefrontal cortex of schizophrenics might be one such manifestation of the complexity of the disease. The shared GO terms ranked by significance are listed in Table 4. The top process is “nervous system development”, followed by “synaptic transmission”, reflecting two major aspects of schizophrenia, i.e. it is considered a (neuro)developmental disease46 affecting mostly the synapse47. On a different note, the most frequently occurring word in Table 4 is “regulation” (occurring 29 times), followed by “transport” (24x), perhaps underlining somewhat less pronounced aspects of schizophrenia.

Positive hub genes are longer, negative hub genes are shorter than average

We also calculated the protein length and protein disorder statistics for both the top 20 positive and negative hubs. Surprisingly, while the top 20 positive hub proteins were significantly longer (p-value < 10−5) than the average human protein (894 and 442 amino acids, on average, respectively), the top 20 negative hub proteins were significantly shorter (327 amino acids on average). We did not find their disorder to be significantly different from that of the general human protein population. Likewise, the 108 region-derived proteins among the top 20 positive and negative hubs in Tables 1 and 2, were also longer (14 proteins in Table 1a with an average length of 983 amino acids) and shorter (10 proteins in Table 1b with an average length of 261), respectively, than the average human protein (442 aa).

Discussion

It has been known for at least a decade that myelin has an inhibitory role in axonal regeneration48 in the CNS. Myelin is dysfunctional in schizophrenia49 and this dysfunctionality leads to changes in synaptic formation and function, another hallmark of schizophrenia49. Several studies have identified genes whose expression is abnormal in schizophrenic brains affecting myelin-related50 and synapse-related biochemical pathways47. However, this is the first time, to our knowledge, that a model for a complete network of gene interactions is presented that would account for both of these recurring anomalies in schizophrenia. Our gene network has two hubs, one, SYNGR1, with a synaptic function, correlates positively and apparently interacts with a high number of genes that also interact with one another and at the same time interact negatively with MOBP, a myelin gene. Myelin is known to inhibit axonal sprouting, a step considered important for synaptic formation51. While neither MOBP, nor MBP are known to have such inhibitory functions, another myelin-related protein, RTN4 (also called Nogo-A), ranking 88th among the negatively correlating genes (Supplementary Table 2), does have such a function, inhibiting axon growth51. It is tempting to hypothesize that either MOBP or MBP also have such a - so far undiscovered - inhibitory function. MOBP is the 3rd most abundant protein in the CNS myelin and has several alternatively spliced variants. As highlighted by Montague et al.18 a physiological function for MOBP has not been found yet. Our model with two antagonistic hub genes would also account for the antagonistic relationship between myelin genes and synapse-related functions52. We also found a high representation of cancer-related genes in both the top correlating and anti-correlating genes (Tables 1 and 2). While only limited association has been found so far connecting schizophrenia- and cancer-related gene hubs53, there are several studies pinpointing the essential role of hub proteins in cancer254. Only a handful of papers have been published on gene networks in schizophrenia, e.g. a study highlighting shared hubs and gene networks between schizophrenia and diabetes55 and our own work on microRNA-driven networks and hub genes in schizophrenia56. Perhaps the most relevant study connecting cancer- and schizophrenia-related gene networks is by Ibanez et al.57 who found an inverse comorbidity between the two diseases, observing expression deregulations in opposite directions of the same genes and gene networks. Our model is also supported by the extraordinary enrichment of correlating partners for the genes encoded in the 108 genomic regions with the highest mutation rates in the genomes of schizophrenics identified in ref. 7. As mentioned above, the median number of correlating partners for those genes whose promoters are located in the 108 loci is 35 times higher for the positively correlating genes than for the rest of the genes in this study and 32 times higher for the negatively correlating genes. The effect size for these comparisons is in the range of 0.8–0.9, making the number of interacting partners, reflecting the centrality of a gene, the single most important factor when considering the biological significance of the individual schizophrenia-related genes and their contribution to the disease. There is a significant but smaller enrichment (2.5 times on average) for those 108-loci genes whose promoters are not, only their enhancers are present in these loci, which raises the intriguing possibility that for most genes in this set the mutations fall into or close to the promoter regions, compromising their functionality as in most cases of schizophrenia the protein sequences are not corrupted by mutations. The assumption that it is the regulatory regions, not the protein-coding regions that are affected mostly in schizophrenia is also apparent in the fact that despite the strong centrality (“hubness”) of the affected genes we do find surviving phenotypes, which are the patients, exactly. As raised in the introduction, hub genes with a mutation in the coding region would make these mutations lethal in most instances2. The robustness of our findings is also supported by the fact that when we replace the Malacards genes with the 108-loci genes in Fig. 3 (sharing only SYNGR1, SRR and NRGN as indicated in Fig. 3) and analyze the Gene Ontology terms for the 39 genes correlated positively by SYNGR1 and negatively by MOBP, we find that the most significant biological process associated with this gene set is again “synaptic transmission”. This also shows the remarkable redundancy of the gene networks in the human brain. Altogether, beyond providing an intriguing new model for schizophrenia, with more details for the underlining gene networks than before, it is also fascinating and quite fitting that synaptic transmission, perhaps the most complex and dynamic part of the human brain also entails the most genetic complexity, i.e. the most connected gene networks with the highest number of correlating/interacting gene partners.

Methods

Determining pairwise gene expression correlations in the human brain

In the first step all pairwise Pearson correlations were determined for those genes expressed in the brain that have expression data in the Allen Brain Atlas (website: brain-map.org)89. Gene expression was measured for 50,000 genes in 524 different tissues taken from several compartments of the brains of several individuals spanning the human lifetime between 2 weeks of post-conception and 40 years of age. We used an in-house Perl script to calculate pairwise correlations complemented by correlation data taken from SZDB.org11. We filtered the results keeping only pairwise Pearson correlation that were either minimum 0.8 or maximum −0.7. If there were several values for the same gene pair we used the most extreme ones, the highest and lowest values for the positive and negative correlations, respectively. We mapped all known human genes in 108 highly mutated human genomic regions identified in ref. 7 separately for genes whose promoters or only cis regulatory elements fall into these 108 loci. We determined the location of the promoters and cis elements in the human genome using a study by Thurman et al.58.

Counting correlating gene pairs and their differentially methylated probes

For each gene whose expression correlated (r >= 0.8) or anti-correlated (r <= −0.7) with other genes in our data set (16830 and 16762 genes with correlating and anti-correlating partners, respectively) we counted the number of correlating and anti-correlating partners. Using a methylome data set in ref. 10 that recorded 56001 differentially methylated probes between two subgroups of schizophrenia patients we also counted gene partners for each gene that were hypermethylated or hypomethylated (with at least one hypermethylated or hypomethylated probe, respectively) and also the total number of hyper- or hypomethylated probes for the correlating and anti-correlating gene partners. The genes were ranked for the total sum of hypermethylated probes of the ‘neighboring’ (i.e. either correlating or anti-correlating) genes as this ranking placed SYNGR1, a gene present with its regulatory region in the 108 SNP rich regions in schizophrenics, identified by the Psychiatric Genomics Consortium (PGC) to the top of the list (in Table 1). This ranking also produced 14 genes out of the top 20 genes in Table 1 with either its promoter or cis-regulatory region overlapping with the SNP-rich PGC regions in ref. 7.

Determining the effect size to distinguish between schizophrenia-related and unrelated genes

To determine which measurement distinguishes the best between schizophrenia-related and non-specific genes we calculated the effect size for the 108-loci genes and also for genes annotated by Genecards and Malacards1213 as schizophrenia-related (Fig. 2) regarding each specific feature mentioned in the previous paragraph (i.e. the number of correlating genes, the number of hypermethylated, hypomethylated correlating genes and the total sum of the differentially methylated probes of the correlating pairs). We repeated the calculations for the negatively correlating pairs as well. To calculate the effect size we used Cliff’s delta. Cliff’s delta is defined as where the two distributions are of size m and n with items x and x (with running from 1 to m and running from 1 to n) respectively, and # is defined as the number of times. Cliff’s delta shows that out of all possible pairwise comparisons between the numbers in set A and set B in what proportion will the numbers in set A be bigger than in set B. Cliff’s delta does not require any assumptions about distribution types.

Functional analysis with the Gene Ontology module of STRING

To carry out functional analysis of the top 20 genes and their network neighbors we used the Gene Ontology (GO) module14 of the STRING45 webserver. The server lists all functional categories that are significantly enriched in the provided gene set, and supplies the corresponding p-values. We recorded the biological processes for the top 20 positive and negative genes and also their network neighbors.

Network visualization, statistical calculations

To visualize the gene correlation networks in Figs 3 and 4 Cytoscape59 was used. To carry out t-test calculations and calculate the corresponding p-values we used cpan’s Statistics package, a Perl library. Whenever not mentioned explicitly, calculations and data manipulation was carried out with in-house Perl script (available on request from the author).

Additional Information

How to cite this article: Hegyi, H. Connecting myelin-related and synaptic dysfunction in schizophrenia with SNP-rich gene expression hubs. Sci. Rep. 7, 45494; doi: 10.1038/srep45494 (2017). Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
  59 in total

Review 1.  Network biology: understanding the cell's functional organization.

Authors:  Albert-László Barabási; Zoltán N Oltvai
Journal:  Nat Rev Genet       Date:  2004-02       Impact factor: 53.242

2.  The human disease network.

Authors:  Kwang-Il Goh; Michael E Cusick; David Valle; Barton Childs; Marc Vidal; Albert-László Barabási
Journal:  Proc Natl Acad Sci U S A       Date:  2007-05-14       Impact factor: 11.205

3.  Aldehyde dehydrogenase 1A1 and gelsolin identified as novel invasion-modulating factors in conditioned medium of pancreatic cancer cells.

Authors:  Naomi Walsh; Paul Dowling; Norma O'Donovan; Michael Henry; Paula Meleady; Martin Clynes
Journal:  J Proteomics       Date:  2008-09-25       Impact factor: 4.044

4.  Fertility and fecundity of an outpatient sample with schizophrenia.

Authors:  Angela Cristina Cesar Terzian; Sérgio Baxter Andreoli; Denise Razzouk; Ana Cristina Chaves; Jair de Jesus Mari
Journal:  Braz J Psychiatry       Date:  2006-12       Impact factor: 2.697

5.  Chromosome 17q12 variants contribute to risk of early-onset prostate cancer.

Authors:  Albert M Levin; Mitchell J Machiela; Kimberly A Zuhlke; Anna M Ray; Kathleen A Cooney; Julie A Douglas
Journal:  Cancer Res       Date:  2008-08-15       Impact factor: 12.701

Review 6.  Early developmental disturbances of cortical inhibitory neurons: contribution to cognitive deficits in schizophrenia.

Authors:  David W Volk; David A Lewis
Journal:  Schizophr Bull       Date:  2014-07-22       Impact factor: 9.306

Review 7.  Chromosome 8p as a potential hub for developmental neuropsychiatric disorders: implications for schizophrenia, autism and cancer.

Authors:  R Tabarés-Seisdedos; J L R Rubenstein
Journal:  Mol Psychiatry       Date:  2009-02-10       Impact factor: 15.992

8.  Dysregulation of sterol response element-binding proteins and downstream effectors in prostate cancer during progression to androgen independence.

Authors:  Susan L Ettinger; Richard Sobel; Tanis G Whitmore; Majid Akbari; Dawn R Bradley; Martin E Gleave; Colleen C Nelson
Journal:  Cancer Res       Date:  2004-03-15       Impact factor: 12.701

9.  QuickGO: a web-based tool for Gene Ontology searching.

Authors:  David Binns; Emily Dimmer; Rachael Huntley; Daniel Barrell; Claire O'Donovan; Rolf Apweiler
Journal:  Bioinformatics       Date:  2009-09-10       Impact factor: 6.937

10.  MicroRNA-derived network analysis of differentially methylated genes in schizophrenia, implicating GABA receptor B1 [GABBR1] and protein kinase B [AKT1].

Authors:  Vadim Gumerov; Hedi Hegyi
Journal:  Biol Direct       Date:  2015-10-08       Impact factor: 4.540

View more
  3 in total

1.  Microglial Homeostasis Requires Balanced CSF-1/CSF-2 Receptor Signaling.

Authors:  Violeta Chitu; Fabrizio Biundo; Gabriel G L Shlager; Eun S Park; Ping Wang; Maria E Gulinello; Şölen Gokhan; Harmony C Ketchum; Kusumika Saha; Michael A DeTure; Dennis W Dickson; Zbignew K Wszolek; Deyou Zheng; Andrew L Croxford; Burkhard Becher; Daqian Sun; Mark F Mehler; E Richard Stanley
Journal:  Cell Rep       Date:  2020-03-03       Impact factor: 9.423

2.  Dentate gyrus volume deficit in schizophrenia.

Authors:  Soichiro Nakahara; Jessica A Turner; Vince D Calhoun; Kelvin O Lim; Bryon Mueller; Juan R Bustillo; Daniel S O'Leary; Sarah McEwen; James Voyvodic; Aysenil Belger; Daniel H Mathalon; Judith M Ford; Fabio Macciardi; Mitsuyuki Matsumoto; Steven G Potkin; Theo G M van Erp
Journal:  Psychol Med       Date:  2019-06-03       Impact factor: 7.723

3.  Coexpression of the discoidin domain receptor 1 gene with oligodendrocyte-related and schizophrenia risk genes in the developing and adult human brain.

Authors:  Gerard Muntané; Marc Chillida; Selena Aranda; Arcadi Navarro; Elisabet Vilella
Journal:  Brain Behav       Date:  2021-07-29       Impact factor: 2.708

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.