| Literature DB >> 29632383 |
Alexander Gusev1,2,3, Nicholas Mancuso4, Hyejung Won5,6, Maria Kousi7, Hilary K Finucane8,9,10, Yakir Reshef11, Lingyun Song12,13, Alexias Safi12,13, Steven McCarroll14,15, Benjamin M Neale9,15,16, Roel A Ophoff6,17, Michael C O'Donovan18, Gregory E Crawford12,13, Daniel H Geschwind5,6,19,20, Nicholas Katsanis7, Patrick F Sullivan21,22, Bogdan Pasaniuc23,24, Alkes L Price25,26.
Abstract
Genome-wide association studies (GWAS) have identified over 100 risk loci for schizophrenia, but the causal mechanisms remain largely unknown. We performed a transcriptome-wide association study (TWAS) integrating a schizophrenia GWAS of 79,845 individuals from the Psychiatric Genomics Consortium with expression data from brain, blood, and adipose tissues across 3,693 primarily control individuals. We identified 157 TWAS-significant genes, of which 35 did not overlap a known GWAS locus. Of these 157 genes, 42 were associated with specific chromatin features measured in independent samples, thus highlighting potential regulatory targets for follow-up. Suppression of one identified susceptibility gene, mapk3, in zebrafish showed a significant effect on neurodevelopmental phenotypes. Expression and splicing from the brain captured most of the TWAS effect across all genes. This large-scale connection of associations to target genes, tissues, and regulatory features is an essential step in moving toward a mechanistic understanding of GWAS.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29632383 PMCID: PMC5942893 DOI: 10.1038/s41588-018-0092-1
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1Schematic of TWAS approach
Illustration of the TWAS approach: genetic predictor of gene expression (E) is learned in a reference panel (top); integrated with SCZ GWAS association statistics to infer SCZ-E association (middle); further integrated with individual-level chromatin phenotypes to infer genes with SCZ and chromatin-E associations (bottom). See Supplementary Fig. 1 for detailed analysis flowchart.
Figure 2SCZ TWAS associations and polygenic effects
(top) Manhattan plot of all TWAS associations. Each point represents a single gene tested, with physical position plotted on x-axis and Z-score of association between gene and SCZ plotted on y-axis. Transcriptome-wide significant associations are highlighted as red points, with jointly significant independent associations (see Methods) labeled with gene names and color-coded by expression reference (red CMC; blue METSIM, purple YFS, green NTR, black ALL). (bottom) Polygenic TWAS effects across reference tissues. Out of sample SCZ prediction R2 for gene-based polygenic risk scores (GE-PRS) as a function of significance cutoff. Significant correlations (after Bonferroni correction for number of thresholds tested) are indicated with a (*) and the most significant P-value reported. Right-most panel shows prediction from all tissues jointly (black) and from CMC/brain genes + splicing events jointly (red). R2 computed after subtracting ancestry principal components and converting to liability scale with population prevalence of 1%.
Number of TWAS-associated genes across all phenotypes and tissues.
| CMC/brain introns | CMC/brain | NTR/blood | YFS/blood | MET/adipose | Total | |
|---|---|---|---|---|---|---|
| Heritable | (9,009) 3,890 | 5,514 | 2,743 | 5,418 | 4,654 | 11,749 |
| SCZ associated | (80) 46 | 44 | 35 | 48 | 39 | 157 |
| SCZ associated (novel | (12) 10 | 9 | 6 | 6 | 7 | 35 |
|
| ||||||
| Chromatin associated | (224) 125 | 244 | 182 | 346 | 232 | 806 |
| SCZ and chromatin associated | (10) 8 | 11 | 10 | 13 | 7 | 42 |
Number of splicing events shown in parenthesis.
Total number of unique gene associations.
Novel defined as not overlapping (±500kb) 108 published PGC SCZ GWAS regions.
Figure 3Chromatin TWAS associations compared to top eSNP/cQTL associations
Number of unique genes significantly associated with a chromatin peak after Bonferroni correction for a given distance from the gene (x-axis): (left) using top eSNP in chromatin cohort; (right) using chromatin TWAS from all reference panels. Results from CEU (YRI) populations shown in top (bottom) panels.
Figure 4Chromatin and SCZ TWAS association at PPP2R3C
Example association of PPP2R3C gene expression and SCZ and four nearby chromatin peaks. (A) locus schematic with all nearby genes and chromatin peaks; TWAS associated features highlighted in blue and green. (B–G left) Manhattan plots of marginal association statistics before and after conditioning on the TWAS predicted expression (colored/dark dots, respectively). Dashed line shows local significance threshold after Bonferroni correction for number of SNPs. (B–G right) Relationship between marginal GWAS/QTL association (y axis) and the correlation (x-axis) between TWAS predicted expression (GEpred estimated in the 1000 Genomes reference) and marginal GWAS/QTL association. The color of each point reflects the eQTL effect size of the expression used for GEpred and size of each point reflects absolute significance of the eQTL. (B) SCZ GWAS association; (C) PPP2R3C expression phenotype used for TWAS prediction and associated with SCZ/chromatin; (D) 1st TWAS associated H3k27ac peak in CEU; (E) 2nd TWAS associated H3k27ac peak in CEU; (F) 1st TWAS associated H3k4me1 peak in CEU; (G) 2nd TWAS associated H3k4me1 peak in CEU. See Supplementary Note, Supplementary Fig. 32, 33, 34 for additional examples and simulations.
Figure 5Chromatin and SCZ TWAS association at KLC1
Example association of KLC1 splice event to SCZ with evidence of chromatin interaction in Hi-C from developing brain. (A) locus schematic with all nearby genes and chromatin peaks; TWAS associated features highlighted in blue and green. Hi-C GZ/CP rows show significance of Hi-C chromatin interaction between the 10kb block containing the associated chromatin peaks (gray, with neighboring white blocks not tested) and every other 10kb block in the region; 10kb being the highest resolution for this Hi-C data. Dark red shading indicates more significant and interactions significant at 0.01 FDR are labeled with stars. The most significant interaction in the locus overlaps the KLC1 promoter. The interactions are shown for fetal brain data from CP (cortical and subcortical plate) and GZ (germinal zone) and corresponding topological domains (TADs) are outlined with solid black lines. (B–F left) Manhattan plots of marginal association statistics before and after conditioning on the TWAS predicted expression (colored/dark dots, respectively). Dashed line shows local significance threshold after Bonferroni correction for number of SNPs. (B–F right) Relationship between marginal GWAS/QTL association (y axis) and the correlation (x-axis) between TWAS predicted expression (GEpred estimated in the 1000 Genomes reference) and marginal GWAS/QTL association. The color of each point reflects the eQTL effect size of the expression used for GEpred and size of each point reflects absolute significance of the eQTL.(B) SCZ GWAS association; (C) KLC1 total expression, both panels show independence from the TWAS predicted expression; (D) KLC1 splicing event phenotype used for TWAS prediction and associated with SCZ/chromatin; (E) TWAS associated H3k4me1 chromatin peak in YRI; (F) TWAS associated H3k4me3 chromatin peak in YRI. See Supplementary Note, Supplementary Fig. 32, 33, 34 for additional examples and simulations.
Figure 6Suppression of endogenous mapk3 rescues the microcephaly and neuronal proliferation phenotypes induced by overexpression of wild-type KCTD13
Dorsal view of 4 days post fertilization (dpf) control larvae (A) and embryos injected with either morpholino (MO) against endogenous mapk3 (B), human capped wild-type (WT) KCTD13 mRNA (C) or combinatorial administration of mapk3 MO and WT human KCTD13 mRNA (D). Quantification of the headsize phenotype across the four conditions (E). Dorsal view of 3dpf embryos stained with an antibody against phospho-histone 3 (PH3), a marker of neuronal proliferation of control larvae (F) or embryos injected with either MO against mapk3 (G), human capped wild-type (WT) KCTD13 mRNA (H) or combinatorial administration (I). Graph showing quantification of the proliferating neuronal count across the four conditions (J). Student’s t-test was used to determine statistical significance.
TWAS genes with association to schizophrenia and chromatin phenotypes
42 genes (including 7 genes at novel loci, highlighted with a [*]) had a significant TWAS association with SCZ and chromatin phenotypes. For each significant TWAS association with SCZ, the number of significant genechromatin associations (FWER 5% among TWAS gene-mark associations, by Bonforroni correction) are reported. In the middle columns ‘.’ represents genes that were not heritable in the study and therefore not TWAS-associated. In the right columns ‘.’ represents no identified association; genes with no chromatin associations are not shown. Top panel shows results from genes, with TSS listed as position; bottom panel shows results from splicing events in CMC with exon-exon junction listed as position (details in Supplementary Table 18). The minority of loci without subsequent evidence of colocalziation of molecular phenotype with SCZ are indicated with † (see Supplementary Table 3 for full numerical results).
| Gene | Chr | Position | TWAS P-value
| Associated marks
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| YFS/blood | MET/adipose | NTR/blood | CMC/brain | DHS | H3K27AC | H3K4ME1 | H3K4ME3 | PU1 | RPB2 | |||
| 1 | 8,483,747 | 4e-07 | 2e-06 | 2e-06 | . | . | . | . | . | . | 3 | |
| 1 | 8,378,144 | . | . | . | 4e-08 | . | . | . | . | . | 1 | |
| 1 | 36,621,565 | 6e-04 | . | 1e-06 | . | . | . | . | . | . | 1 | |
| 1 | 43,855,483 | 5e-01 | . | . | 2e-06 | . | . | 1 | . | . | . | |
| 1 | 150,207,026 | . | . | 1e-08 | . | . | . | . | 1 | . | . | |
| 1 | 150,266,261 | 3e-06 | 3e-03 | 6e-03 | 2e-02 | . | . | 1 | . | . | . | |
| 1 | 176,176,380 | 4e-06 | . | . | . | . | . | 1 | . | . | . | |
| 2 | 200,775,978 | . | 6e-10 | . | . | . | . | . | 1 | . | . | |
| 3 | 52,737,714 | . | . | 5e-08 | 3e-08 | . | 1 | . | . | . | . | |
| 3 | 52,321,835 | 2e-08 | . | . | . | . | 1 | . | . | . | . | |
| 3 | 52,719,935 | 7e-09 | 6e-07 | . | 5e-02 | . | . | . | 1 | . | . | |
| 3 | 52,804,965 | . | . | . | 2e-09 | . | . | . | 1 | . | . | |
| 3 | 52,567,793 | 6e-06 | 6e-06 | . | 7e-01 | . | 1 | . | 1 | . | . | |
| 3 | 52,279,808 | 2e-07 | 2e-07 | . | 2e-03 | . | 1 | . | . | . | . | |
| 3 | 52,931,597 | 1e-02 | 4e-01 | 1e-08 | 6e-06 | . | 1 | 1 | 2 | . | . | |
| 3 | 135,969,166 | 1e-08 | 1e-10 | . | 3e-10 | 1 | . | 3 | . | . | . | |
| 5 | 44,826,178 | . | 6e-06 | . | . | . | . | 1 | . | . | . | |
| 5 | 140,053,171 | . | 8e-07 | 1e-02 | . | . | 1 | . | 1 | . | . | |
| 5 | 140,027,383 | 4e-06 | 1e-06 | . | 5e-05 | . | 1 | . | 1 | . | . | |
| 5 | 140,027,370 | 2e-06 | . | . | 4e-06 | . | 1 | . | 2 | . | . | |
| 5 | 140,174,443 | . | . | . | 7e-06 | . | 1 | . | 1 | . | . | |
| 5 | 140,080,031 | 5e-06 | 1e-03 | . | 3e-06 | . | . | . | 1 | . | . | |
| 10 | 104,629,209 | . | 6e-08 | 7e-09 | 1e-05 | . | 1 | . | . | . | . | |
| 12 | 123,717,785 | 4e-09 | 1e-05 | . | 2e-08 | . | 1 | . | . | 1 | . | |
| 14 | 35,591,526 | 7e-01 | 2e-07 | 5e-01 | . | . | 2 | 1 | . | . | . | |
| 14 | 35,591,748 | 6e-05 | 1e-01 | 3e-06 | 2e-02 | . | 2 | 2 | . | . | . | |
| 16 | 30,134,630 | 5e-05 | . | . | 1e-06 | . | 1 | . | . | . | 1 | |
| 16 | 67,753,273 | . | . | 6e-07 | 2e-05 | . | 1 | . | 2 | . | . | |
| 16 | 67,840,780 | . | . | . | 2e-06 | . | . | 1 | 2 | . | . | |
| 16 | 68,038,024 | 1e-06 | . | 3e-06 | 4e-04 | . | . | . | 2 | . | . | |
| 16 | 68,344,876 | 1e-05 | 8e-04 | . | 8e-06 | . | . | 1 | 1 | . | . | |
| 17 | 18,950,336 | . | . | 5e-07 | . | . | . | . | . | . | 1 | |
| 17 | 19,314,490 | 8e-06 | . | . | . | . | . | . | . | . | 1 | |
| 20 | 37,377,096 | 2e-07 | 2e-04 | . | 7e-01 | 1 | . | 1 | . | . | . | |
| 21 | 37,507,262 | 6e-03 | 2e-03 | 2e-06 | 5e-04 | 1 | . | 2 | . | . | . | |
|
| ||||||||||||
| CMC/brain splicing | ||||||||||||
| 3 | 17,255,862 - 17,279,655 | . | . | . | 3e-06 | . | . | 1 | . | . | . | |
| 3 | 52,800,010 - 52,800,194 | . | . | . | 1e-06 | . | . | 1 | . | . | . | |
| 11 | 82,985,783 - 82,991,184 | . | . | . | 3e-07 | . | 1 | . | . | . | . | |
| 12 | 123,821,038 - 123,825,535 | . | . | . | 4e-10 | . | . | . | . | 1 | . | |
| 14 | 104,145,855 - 104,151,323 | . | . | . | 7e-12 | . | . | 1 | 1 | . | . | |
| 14 | 60,074,210 - 60,193,637 | . | . | . | 1e-06 | . | . | 1 | . | . | . | |
| 16 | 29,997,825 - 29,998,165 | . | . | . | 4e-06 | . | . | . | . | . | ||
| 16 | 30,094,168 - 30,094,715 | . | . | . | 2e-06 | . | . | . | . | . | ||
Novel, not overlapping 108 PGC SCZ GWAS loci.
No significant SCZ colocalization posterior in any reference (exlucding chromatin features in YRI).