| Literature DB >> 29249828 |
Tatiana Polushina1,2, Sudheer Giddaluru1,2, Francesco Bettella3,4, Thomas Espeseth4,5, Astri J Lundervold6,7, Srdjan Djurovic1, Sven Cichon8,9,10,11, Per Hoffmann8,9,10, Markus M Nöthen9,10, Vidar M Steen1,2, Ole A Andreassen3,4, Stéphanie Le Hellard12,13.
Abstract
We have tested published methods for capturing allelic heterogeneity and identifying loci of joint effects to uncover more of the "hidden heritability" of schizophrenia (SCZ). We used two tools, cojo-GCTA and multi-SNP, to analyze meta-statistics from the latest genome-wide association study (GWAS) on SCZ by the Psychiatric Genomics Consortium (PGC). Stepwise regression on markers with p values <10-7 in cojo-GCTA identified 96 independent signals. Eighty-five passed the genome-wide significance threshold. Cross-validation of cojo-GCTA by CLUMP was 76%, i.e., 26 of the loci identified by the PGC using CLUMP were found to be dependent on another locus by cojo-GCTA. The overlap between cojo-GCTA and multi-SNP was better (up to 92%). Three markers reached genome-wide significance (5 × 10-8) in a joint effect model. In addition, two loci showed possible allelic heterogeneity within 1-Mb genomic regions, while CLUMP analysis had identified 16 such regions. Cojo-GCTA identified fewer independent loci than CLUMP and seems to be more conservative, probably because it accounts for long-range LD and interaction effects between markers. These findings also explain why fewer loci with possible allelic heterogeneity remained significant after cojo-GCTA analysis. With multi-SNP, 86 markers were selected at the threshold 10-7. Multi-SNP identifies fewer independent signals, due to splitting of the data and use of smaller samples. We recommend that cojo-GCTA and multi-SNP are used for post-GWAS analysis of all traits to call independent loci. We conclude that only a few loci in SCZ show joint effects or allelic heterogeneity, but this could be due to lack of power for that data set.Entities:
Mesh:
Year: 2017 PMID: 29249828 PMCID: PMC5802566 DOI: 10.1038/s41398-017-0033-2
Source DB: PubMed Journal: Transl Psychiatry ISSN: 2158-3188 Impact factor: 6.222
Number of independent SNPs for different thresholds
| Threshold | 10−3 | 10−4 | 10−5 | 10−6 | 10−7 | 5 × 10−8 |
|---|---|---|---|---|---|---|
| Number of markers below threshold before CRa | 82,041 | 35,630 | 18,710 | 11,028 | 6533 | 5706 |
| Number of markers below threshold after CRb | 7038 | 1048 | 460 | 200 | 101 | 88 |
| Number of markers below 5 × 10−8 after CRc | 2504 | 186 | 134 | 92 | 90 | 88 |
| Number of markers validated with German cohort as LD referenced | — | 69 | 74 | 80 | 83 | 83 |
| % Markers validated with German cohort as LD referencee | 37% | 55% | 87% | 92% | 94% |
CR conditional regression, LD linkage disequilibrium, p Germ joint p values with German LD reference, p Norg joint p values with Norwegian LD reference
aNumber of SNPs below the indicated threshold in the initial data set
bNumber of markers that were selected using the stepwise procedure with the Norwegian LD reference sample
cNumber of signals that passed the genome-wide significance threshold in the joint model
dNumber of validated SNPs: a marker is deemed validated for joint effect if it passes the genome-wide significance level (5 × 10−8) after stepwise analysis with the Norwegian LD reference sample and after joint analysis with the German LD reference sample, and if −log10(pGerm)/−log10(pNorg) <2 (the joint p values estimated using the Norwegian sample as LD reference do not differ essentially from the joint p values estimated using the German LD reference)
ePercentages of validated SNPs for different thresholds. For the threshold 10−3, the list of selected markers after the stepwise procedure could not be fitted with the German sample because of redundant signals
SNPs that became significant in the joint analysis using the merged Norwegian and German cohorts as LD reference
| SNPa | Positionb |
| pJ after CRd | PGC |
|---|---|---|---|---|
| rs1509378 | 2: 22,754,466 | 8.37 × 10−8 | 4.23 × 10−8 | Not reported |
| rs12474906 | 2: 28,033,538 | 1.01 × 10−7 | 4.99 × 10−8 | 1.36 × 10−7 |
| rs12148337 | 15: 70,589,272 | 5.33 × 10−8 | 6.51 × 10−9 | 1.78 × 10−8 |
| rs2398180 | 15: 96,863,169 | 0.002 | 6.37 × 10−9 | Not reported |
adbSNP reference ID for the SNP
bGenomic position (chromosome:base pair) of the marker based on UCSC hg19/NCBI build 37
c p values in the PGC-SCZ discovery sample
d p values in the joint effect model with merged Norwegian and German LD reference samples
eInformation from the PGC for replication testing of each marker
Fig. 1Loci with multiple independent signals of association
The plots show the results for the joint and conditional analysis with merged Norwegian and German cohorts as the LD reference sample for the loci on chromosome 18 (a) and chromosome 22 (b). On each plot, several independent signals are identified using the stepwise procedure within a 10 Mb window in cojo-GCTA. SNPs are plotted according to their chromosomal positions based on UCSC hg19/NCBI build 37. The –log10(p values) of the SNPs are shown on each plot. LD values between the lead SNP and the other markers are indicated by color. Genes located in the region of interest are indicated at the bottom. Plots were generated using the LocusZoom tool[34]
Annotation of the independent markers to genes
| Positiona | Gene(s) in regionb |
|---|---|
| 2: 22,621,296–22,821,666 |
|
| 2: 27,784,034–28,281,545 |
|
| 15: 96,817,467–96,866,320 |
|
aLocus positions are displayed as chromosome:start–end based on UCSC hg19/NCBI build 37. Loci were delimited by taking into account all markers in LD with the marker selected by cojo-GCTA
bRegions were screened for gene content using RefSeq in the UCSC genome browser