| Literature DB >> 32245959 |
Guangzao Huang1,2,3, Daniel Osorio4, Jinting Guan1,2, Guoli Ji5,6,7, James J Cai8,9,10.
Abstract
Schizophrenia (SCZ) is a severe, highly heterogeneous psychiatric disorder with varied clinical presentations. The polygenic genetic architecture of SCZ makes identification of causal variants a daunting task. Gene expression analyses hold the promise of revealing connections between dysregulated transcription and underlying variants in SCZ. However, the most commonly used differential expression analysis often assumes grouped samples are from homogeneous populations and thus cannot be used to detect expression variance differences between samples. Here, we applied the test for equality of variances to normalized expression data, generated by the CommonMind Consortium (CMC), from brains of 212 SCZ and 214 unaffected control (CTL) samples. We identified 87 genes, including VEGFA (vascular endothelial growth factor) and BDNF (brain-derived neurotrophic factor), that showed a significantly higher expression variance among SCZ samples than CTL samples. In contrast, only one gene showed the opposite pattern. To extend our analysis to gene sets, we proposed a Mahalanobis distance-based test for multivariate homogeneity of group dispersions, with which we identified 110 gene sets with a significantly higher expression variability in SCZ, including sets of genes encoding phosphatidylinositol 3-kinase (PI3K) complex and several others involved in cerebellar cortex morphogenesis, neuromuscular junction development, and cerebellar Purkinje cell layer development. Taken together, our results suggest that SCZ brains are characterized by overdispersed gene expression-overall gene expression variability among SCZ samples is significantly higher than that among CTL samples. Our study showcases the application of variability-centric analyses in SCZ research.Entities:
Year: 2020 PMID: 32245959 PMCID: PMC7125213 DOI: 10.1038/s41537-020-0097-5
Source DB: PubMed Journal: NPJ Schizophr ISSN: 2334-265X
Fig. 1Expression profiles of VEGFA show more dispersed expression in SCZ than CTL.
a Normalized expression values in 212 CTL and 214 SCZ samples generated by the CMC study. B–F test p-value = 3.6e−6. b Normalized expression values in 266 CTL and 139 SCZ samples, as reported in ref. [42]. B–F test P-value = 1.7e−9.
Functional gene sets showing higher expression variability in SCZ compared with CTL.
| Gene set | Number of genesa | Top 5 genes contributing most to multivariate expression variability of the gene set | ||
|---|---|---|---|---|
| 1 | GO_PHOSPHATIDYLINOSITOL_3_KINASE_COMPLEX | 15 (20) | 7.47E−05 | PIK3R1, PIK3R4, NRBF2, PIK3CA, PIK3C3 |
| 2 | GO_REGULATION_OF_B_CELL_PROLIFERATION | 28 (55) | 1.28E−04 | AHR, INPP5D, PKN1, SLC39A10, CD74 |
| 3b | GO_CEREBELLAR_CORTEX_MORPHOGENESIS | 23 (30) | 1.85E−04 | KNDC1, SPTBN2, LDB1, SERPINE2, FAIM2 |
| 4 | GO_REGULATION_OF_ENDOPLASMIC_RETICULUM_UNFOLDED_PROTEIN_RESPONSE | 22 (28) | 2.48E−04 | PIK3R1, DAB2IP, HSPA5, BBC3, POMT2 |
| 5 | GO_GENETIC_IMPRINTING | 15 (20) | 4.42E−04 | ARID4A, BRCA1, ARID4B, ZFP57, MECP2 |
| 6 | GO_POSITIVE_REGULATION_OF_STRIATED_MUSCLE_CELL_DIFFERENTIATION | 24 (52) | 5.44E−04 | HOPX, CYP26B1, MAPK14, PROX1, THRA |
| 7 | GO_ENDORIBONUCLEASE_ACTIVITY | 33 (54) | 5.57E−04 | FEN1, POP4, DROSHA, RNASEH2B, RPP30 |
| 8 | GO_INSULIN_LIKE_GROWTH_FACTOR_RECEPTOR_SIGNALING_PATHWAY | 14 (14) | 7.87E−04 | PIK3R1, TSC2, GRB10, AKT1, IGF2R |
| 9 | GO_LIPOPROTEIN_PARTICLE_RECEPTOR_ACTIVITY | 12 (16) | 9.76E−04 | LDLR, LRP6, OLR1, SCARB1, LRP10 |
| 10 | GO_RNA_POLYMERASE_II_ACTIVATING_TRANSCRIPTION_FACTOR_BINDING | 24 (36) | 9.83E−04 | LDB1, BHLHE40, BEX1, RB1, NCOR1 |
| 11 | GO_FOLIC_ACID_CONTAINING_COMPOUND_METABOLIC_PROCESS | 22 (29) | 1.10E−03 | MTHFD1, MTRR, ALDH1L1, SLC19A1, FTCD |
| 12b | GO_CEREBELLAR_CORTEX_FORMATION | 19 (22) | 1.74E−03 | KNDC1, LDB1, CDK5, FAIM2, CEND1 |
| 13 | GO_NEGATIVE_REGULATION_OF_VASCULATURE_DEVELOPMENT | 40 (80) | 1.79E−03 | ITGB1BP1, PDCD10, SEMA4A, PML, DAB2IP |
| 14 | GO_RETROGRADE_TRANSPORT_VESICLE_RECYCLING_WITHIN_GOLGI | 19 (24) | 1.97E−03 | OPTN, RAB6A, COG1, PACS1, GOLGA1 |
| 15b | GO_NEUROMUSCULAR_JUNCTION_DEVELOPMENT | 28 (36) | 2.13E−03 | UNC13A, CACNB1, ETV5, AFG3L2, ERBB2 |
| 16b | GO_CEREBELLAR_PURKINJE_CELL_LAYER_DEVELOPMENT | 20 (24) | 2.39E−03 | LDB1, SPTBN2, SEZ6L, UQCRQ, ATP2B2 |
| 17 | GO_NEGATIVE_REGULATION_OF_EPITHELIAL_CELL_APOPTOTIC_PROCESS | 18 (35) | 3.55E−03 | SEMA5A, SCG2, WFS1, TEK, KRIT1 |
| 18 | GO_PYRIMIDINE_DEOXYRIBONUCLEOTIDE_METABOLIC_PROCESS | 14 (18) | 4.02E−03 | NT5C, MBD4, NEIL2, DTYMK, UNG |
| 19 | GO_NEGATIVE_REGULATION_OF_CIRCADIAN_RHYTHM | 11 (17) | 4.14E−03 | ADORA1, CRY2, PER2, SUV39H2, SIN3A |
| 20 | GO_NEGATIVE_REGULATION_OF_OXIDOREDUCTASE_ACTIVITY | 13 (26) | 4.29E−03 | NFKB1, ATP2B4, TMLHE, CNR1, PRDX5 |
| 21 | GO_ACTIVATING_TRANSCRIPTION_FACTOR_BINDING | 40 (57) | 4.51E−03 | LDB1, BHLHE40, RB1, BEX1, NCOR1 |
| 22 | GO_FOUR_WAY_JUNCTION_DNA_BINDING | 13 (15) | 4.97E−03 | HMGB3, MECP2, HMGB1, RAD51C, MSH6 |
| 23b | GO_CELL_DIFFERENTIATION_IN_HINDBRAIN | 15 (21) | 5.24E−03 | KNDC1, LDB1, FAIM2, CEND1, CACNA1A |
| 24b | GO_CEREBELLAR_PURKINJE_CELL_LAYER_MORPHOGENESIS | 11 (14) | 5.59E−03 | LDB1, SPTBN2, FAIM2, CACNA1A, ATP2B2 |
| 25 | GO_ESTABLISHMENT_OF_MITOTIC_SPINDLE_ORIENTATION | 16 (20) | 6.20E−03 | HTT, SPRY1, SPRY2, PAFAH1B1, NUMA1 |
| 26 | GO_PTERIDINE_CONTAINING_COMPOUND_METABOLIC_PROCESS | 28 (36) | 6.64E−03 | PTS, MTR, MTHFD1, MTRR, ALDH1L1 |
| 27b | GO_HINDBRAIN_MORPHOGENESIS | 29 (40) | 7.09E−03 | KNDC1, SPTBN2, LRP6, FAIM2, LDB1 |
| 28b | GO_REGULATION_OF_ASTROCYTE_DIFFERENTIATION | 19 (27) | 7.42E−03 | BMP2, PRPF19, SERPINE2, NOTCH1, NF1 |
| 29 | GO_PHOSPHATIDYLCHOLINE_BIOSYNTHETIC_PROCESS | 18 (27) | 8.39E−03 | ACHE, FGF7, LPIN1, SLC44A5, PCYT1A |
| 30 | GO_REGULATION_OF_GENE_EXPRESSION_BY_GENETIC_IMPRINTING | 12 (16) | 8.53E−03 | ARID4A, BRCA1, ARID4B, ZFP57, DIRAS3 |
For each gene sets, the top five genes that contribute most to expression variability are given.
aNumber of genes used in the analysis (number of genes in the gene set).
bGene sets with functions involved in the central and peripheral nervous system.
Fig. 2Gene set, cerebellar cortex morphogenesis, show more dispersed expression in SCZ.
The PCA analysis was performed with the gene set expression matrix of pooled samples that contain all SCZ and CTL samples. a Distribution of CTL samples on the PCA space defined by the first two PCs. SCZ samples are made invisible by plotting in white color. b Distribution of SCZ samples with CTL samples made invisible. c Distribution of all samples in the PCA space. Dashed lines indicate the 99% confidence ellipses. d Boxplot of MD vectors in SCZ and CTL groups, showing the high within-group variance in SCZ.
Fig. 3Genetic variants associated with gene expression variability in SCZ.
a Two examples of SCZ-specific evQTLs, showing significant differences in expression variances between genotypes in SCZ but not in CTL. P-values of the B–F test for three genotype groups in SCZ and CTL, respectively, are given. b Quantile-quantile plot of P-values for evQTL associations in SCZ (y-axis) against those in CTL (x-axis). c Difference in the density of highly significant evQTL associations between variants and genes in SCZ and CTL.