| Literature DB >> 24901509 |
Dilafruz Juraeva1, Britta Haenisch2, Marc Zapatka3, Josef Frank4, Stephanie H Witt4, Thomas W Mühleisen5, Jens Treutlein4, Jana Strohmaier4, Sandra Meier6, Franziska Degenhardt7, Ina Giegling8, Stephan Ripke9, Markus Leber10, Christoph Lange11, Thomas G Schulze12, Rainald Mössner13, Igor Nenadic14, Heinrich Sauer14, Dan Rujescu8, Wolfgang Maier13, Anders Børglum15, Roel Ophoff16, Sven Cichon17, Markus M Nöthen7, Marcella Rietschel4, Manuel Mattheisen18, Benedikt Brors1.
Abstract
In the present study, an integrated hierarchical approach was applied to: (1) identify pathways associated with susceptibility to schizophrenia; (2) detect genes that may be potentially affected in these pathways since they contain an associated polymorphism; and (3) annotate the functional consequences of such single-nucleotide polymorphisms (SNPs) in the affected genes or their regulatory regions. The Global Test was applied to detect schizophrenia-associated pathways using discovery and replication datasets comprising 5,040 and 5,082 individuals of European ancestry, respectively. Information concerning functional gene-sets was retrieved from the Kyoto Encyclopedia of Genes and Genomes, Gene Ontology, and the Molecular Signatures Database. Fourteen of the gene-sets or pathways identified in the discovery dataset were confirmed in the replication dataset. These include functional processes involved in transcriptional regulation and gene expression, synapse organization, cell adhesion, and apoptosis. For two genes, i.e. CTCF and CACNB2, evidence for association with schizophrenia was available (at the gene-level) in both the discovery study and published data from the Psychiatric Genomics Consortium schizophrenia study. Furthermore, these genes mapped to four of the 14 presently identified pathways. Several of the SNPs assigned to CTCF and CACNB2 have potential functional consequences, and a gene in close proximity to CACNB2, i.e. ARL5B, was identified as a potential gene of interest. Application of the present hierarchical approach thus allowed: (1) identification of novel biological gene-sets or pathways with potential involvement in the etiology of schizophrenia, as well as replication of these findings in an independent cohort; (2) detection of genes of interest for future follow-up studies; and (3) the highlighting of novel genes in previously reported candidate regions for schizophrenia.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24901509 PMCID: PMC4046913 DOI: 10.1371/journal.pgen.1004345
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Flowchart for (1) detection and replication of schizophrenia associated pathways and (2) identification of the most informative genes, and (3) functional annotation of single nucleotide polymorphisms in the genes of interest.
Description of individual samples.
| Sample | Ancestry | Case (n) | Control (n) | Platform | Reference |
| BOMA | German | 1 531 | 2 168 | I5, I6Q, IWQ |
|
| UTR | Dutch | 699 | 642 | I5 |
|
| GAIN | European | 1 157 | 1 364 | A6 |
|
| MGS | European | 1 279 | 1 282 | A6 |
|
Platforms are: I5, Illumina HumanHap 550; I6Q, Illumina Human610 Quad; IWQ, Illumina Human660W-Quad; A6, Affymetrix Genome-Wide Human SNP Array 6.0.
Publication reporting individual sample level genotypes for Schizophrenia is listed.
Discovery set: single nucleotide polymorphisms (SNPs) before pruning – 491,393; after pruning – 419,267.
Replication set: SNPs before pruning – 669,059; after pruning – 552,988.
Comparisons of FDRs (BH) and p-values (P) for the BOMA-UTR and the GAIN-MGS data sets for the replicated pathways.
| Description | Pathway ID | BOMA-UTR | GAIN-MGS | Number of SNPs | |||
| BH | P | BH | P | BOMA-UTR | GAIN-MGS | ||
| dbMIR:gagcctg,mir-484 | GAGCCTG,MIR-484 | 1.60E-02 | 1.32E-04 | 1.01E-04 | 6.66E-06 | 1,658 | 2,332 |
| dbGO:0008270:zinc ion binding | GO:0008270 | 1.58E-02 | 8.13E-06 | 1.01E-04 | 7.46E-06 | 14,839 | 34,704 |
| dbGO:0046914:transition metal ion binding | GO:0046914 | 5.17E-04 | 8.20E-07 | 1.02E-04 | 1.14E-05 | 17,193 | 40,248 |
| dbTFT::v$hnf4 q6 | V$HNF4_Q6 | 5.85E-04 | 6.42E-06 | 5.04E-04 | 9.33E-05 | 3,450 | 4,375 |
| dbGO:0010628:positive regulation of gene expression | GO:0010628 | 2.34E-02 | 3.21E-04 | 7.88E-04 | 1.75E-04 | 8,878 | 18,006 |
| dbTFT:v$chop 01 | V$CHOP_01 | 3.76E-05 | 1.65E-07 | 5.51E-03 | 1.63E-03 | 4,365 | 5,436 |
| dbKEGG:04514:cell adhesion molecules (cams) | hsa04514 | 2.02E-02 | 8.45E-04 | 1.21E-02 | 4.02E-03 | 3,562 | 4,846 |
| dbTFT:v$ciz 01 | V$CIZ_01 | 3.76E-05 | 1.63E-07 | 1.59E-02 | 5.88E-03 | 4,443 | 5,987 |
| dbKEGG:04210:apoptosis | hsa04210 | 1.97E-02 | 6.42E-04 | 3.33E-02 | 1.48E-02 | 985 | 1,304 |
| dbTFT:v$sox5 01 | V$SOX5_01 | 1.02E-03 | 2.01E-05 | 5.32E-02 | 2.56E-02 | 5,067 | 6,159 |
| dbTFT:v$cebpa 01 | V$CEBPA_01 | 4.03E-04 | 3.54E-06 | 7.84E-02 | 4.07E-02 | 4,113 | 5,133 |
| dbTFT:v$ptf1beta q6 | V$PTF1BETA_Q6 | 1.02E-03 | 1.60E-05 | 1.07E-01 | 6.73E-02 | 4,849 | 5,911 |
| dbCGP:Kyng dna damage by uv | KYNG_DNA_DAMAGE_BY_UV | 2.89E-02 | 9.89E-05 | 1.49E-01 | 9.96E-02 | 577 | 732 |
| dbGO:0050808:synapse organization | GO:0050808 | 3.21E-02 | 2.89E-05 | 3.28E-01 | 2.92E-01 | 2,504 | 3,862 |
* - Significant pathways identified by more than one pathway analysis method within the BOMA-UTR data set. The test statistics obtained using the alternative algorithms are provided in Table S1B.
Note: FDR – False Discovery Rate; BH – Benjamini-Hochberg.
Figure 2(A) Circos plots integrating the Global Test and FORGE analysis and heatmaps for the levels of single nucleotide polymorphism (SNP)- and gene significance.
(B) Inset legend providing information represented by each data ring. Notes: for visibility, the implicated gene locations were zoomed in upon by up to 1200%. The inset legend image provides information represented by each ideogram. −log10 of the individual SNP and the gene p-values increase radially outward. The arc of each heatmap wedge maps directly to the location of the SNP in the genome. The arc width is proportional to the size of the associated gene (plus 20 kb upstream and downstream). Individual SNP p-values for the BOMA-UTR and the GAIN-MGS data sets are shown as scatterplots on ideograms A and B. The gene p-values for Psychiatric Genetics Consortium (PGC) datasets are shown as a scatterplot on ideogram C. The significance scores for genes contributing to a pathway significance are shown as heatmaps on ideograms 1–14. 1 - dbGO:0050808:synapse organization; 2 - dbKEGG:04514:cell adhesion molecules; 3 - dbCGP:Kyng dna damage by UV; 4 - dbKEGG:04210:apoptosis; 5 - dbGO:0046914:transition metal ion binding; 6 - dbGO:0008270:zinc ion binding; 7 - dbGO:0010628:positive regulation of gene expression; 8 - dbMIR:gagcctg,mir-484; 9 - dbTFT:v$cebpa 01; 10 - dbTFT::v$hnf4 q6; 11 - dbTFT:v$chop 01; 12 - dbTFT:v$ptf1bea q6; 13 - dbTFT:v$ciz 01; 14 - dbTFT:v$sox5 01. The darker the red, the higher the contribution of the SNP/gene to the association of the respective pathway. Comparing the overlapping of important genes in different pathways allows investigation of whether they lie within intersections of those pathways.
Figure 3RegulomeDB functional annotation for SNPs in CTCF and its regulatory regions.
Notes: * genotyped in the BOMA-UTR data set and sorted by their genomic coordinates. SNPs are within or 20 kb upstream and downstream of CTCF. ** AR FOXA1 USF1 CDX2 HNF4A TRIM28 USF2 TCF4 HDAC2 SP1 BHLHE40. *** KROX SP4 SP1:SP3 HIC1 Zif268 Sp4 Sp1 SP1 Egr. § RegulomeDB score: [1f] - likely to affect binding and linked to expression of a gene target; [2b] - likely to affect binding; [4], [5], [6] - minimum binding evidence.