| Literature DB >> 22723893 |
Thomas E Gladwin1, Eske M Derks, Marcella Rietschel, Manuel Mattheisen, René Breuer, Thomas G Schulze, Markus M Nöthen, Douglas Levinson, Jianxin Shi, Pablo V Gejman, Sven Cichon, Roel A Ophoff.
Abstract
Recent studies suggest that variation in complex disorders (e.g., schizophrenia) is explained by a large number of genetic variants with small effect size (Odds Ratio ≈ 1.05-1.1). The statistical power to detect these genetic variants in Genome Wide Association (GWA) studies with large numbers of cases and controls (v 15,000) is still low. As it will be difficult to further increase sample size, we decided to explore an alternative method for analyzing GWA data in a study of schizophrenia, dramatically reducing the number of statistical tests. The underlying hypothesis was that at least some of the genetic variants related to a common outcome are collocated in segments of chromosomes at a wider scale than single genes. Our approach was therefore to study the association between relatively large segments of DNA and disease status. An association test was performed for each SNP and the number of nominally significant tests in a segment was counted. We then performed a permutation-based binomial test to determine whether this region contained significantly more nominally significant SNPs than expected under the null hypothesis of no association, taking linkage into account. Genome Wide Association data of three independent schizophrenia case/control cohorts with European ancestry (Dutch, German, and US) using segments of DNA with variable length (2 to 32 Mbp) was analyzed. Using this approach we identified a region at chromosome 5q23.3-q31.3 (128-160 Mbp) that was significantly enriched with nominally associated SNPs in three independent case-control samples. We conclude that considering relatively wide segments of chromosomes may reveal reliable relationships between the genome and schizophrenia, suggesting novel methodological possibilities as well as raising theoretical questions.Entities:
Mesh:
Year: 2012 PMID: 22723893 PMCID: PMC3377732 DOI: 10.1371/journal.pone.0038828
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Metasignificance of segments located in chromosome 5 (128–160 Mbp).
| Segment width [Mbp] | Replicable region [Mbp] | p (Netherlands) | p (GAIN) | p (Germany) | Fisher’s combined probability test |
| 4 | chr5: 132–136 | 0.017 | 0.009 | 0.025 | 3.49 E-4 |
| 8 | chr5: 128–136 | 0.001 | 0.008 | 0.026 | 2.80 E-5 |
| 16 | chr5: 120–136 | 0.001 | 0.016 | 0.013 | 2.80 E-5 |
| 16 | chr5: 128–144 | 0.001 | 0.021 | 0.001 | 3.67 E-6 |
| 32 | chr5: 128–160 | 0.001 | 0.023 | 0.001 | 3.98 E-6 |
Note. Segment width refers to the width of regions over which tests of the number of nominally significant SNPs were tested. The replicable region indicates the location of the segment. The p-values provide the results of permutation based tests.
Figure 1“Manhattan plot of the 22 autosomal chromosomes”.
This figure shows at the y–axis the p-values of the SNPs in a GWA analysis. The chromosomes are shown at the x-axis. The red line indicates a p-value of 10-7, the blue line indicates a p-value of 10-5 and the green line indicates a p-value of .05.
Figure 2“Manhattan plot of the top segment located at chromosome 5 (128–136
Mbp)”. This figure shows at the y–axis the p-values of the SNPs located at chromosome 5 (128–136 Mbp). The chromosomes are shown at the x-axis. The red line indicates a p-value of 10-7, the blue line indicates a p-value of 10-5 and the green line indicates a p-value of .05.
Overview of disease associated genes located within the significantly associated region at chromosome 5 (128–136 Mbp).
| Start position(Mbp) | End position (Mbp) | Gene name | Database | Phenotype associations | Empirical p set-based test (Netherlands) | Empirical p set-based test (GAIN) | Empirical p set-based test (German) |
| 130,627,600 | 130,758,281 |
| GAD; SZGENE | Immune ( | .038 | .077 | .014 |
| 130,789,503 | 130,998,828 |
| SZGENE | .025 | .124 | .001 | |
| 131,170,738 | 131,375,769 |
| GAD; SZGENE | Psychiatric ( | .069 | .097 | .001 |
| 131,424,245 | 131,426,795 |
| GAD; SZGENE | Immune ( | .005 | 1 | 1 |
| 131,437,383 | 131,439,758 |
| GAD | Immune ( | .020 | 1 | 1 |
| 131,621,285 | 131,637,046 |
| GAD | Metabolic ( | .001 | .094 | 1 |
| 131,658,043 | 131,707,798 |
| GAD | Immune ( | .002 | .070 | 1 |
| 131,733,342 | 131,759,202 |
| GAD | Immune ( | 1 | .027 | 1 |
| 131,846,683 | 131,859,158 |
| GAD | Immune, Infection ( | .003 | 1 | 1 |
| 131,905,034 | 131,920,427 |
| GAD | Immune ( | 1 | 1 | 1 |
| 132,021,763 | 132,024,700 |
| GAD | Immune, Infection, Other, Pharmacogenomic, Renal (multiple phenotypes) | 1 | 1 | 1 |
| 132,037,271 | 132,046,267 |
| GAD; SZGENE | Cardiovascular, Hematological, Immune, Infection, Other, Renal, Reproduction (multiple phenotypes) | 1 | .091 | 1 |
| 132,224,776 | 132,228,376 |
| GAD | Reproduction ( | .041 | 1 | 1 |
| 132,415,560 | 132,468,608 |
| GAD | Reproduction ( | .053 | 1 | .018 |
| 133,478,300 | 133,511,819 |
| GAD | Immune ( | .049 | 1 | 1 |
| 134,807,802 | 134,810,937 |
| GAD | Immune ( | 1 | .119 | 1 |
| 134,897,870 | 134,899,538 |
| GAD; SZGENE | Psychiatric (schizophrenia) | .014 | 1 | 1 |
| 135,255,833 | 135,259,415 |
| GAD; SZGENE | Immune ( | 1 | 1 | 1 |
| 135,300,162 | 135,305,266 |
| SZGENE | 1 | .047 | 1 | |
| 135,392,596 | 135,427,406 |
| GAD | Other ( | 1 | .007 | 1 |
Note: The table shows genes that have previously reported to be associated with disease based on the UCSC Genome Bioinformatics site (NCBI36/hg18) (http://genome.ucsc.edu/) and genes previously found to be associated with schizophrenia based on the Schizophrenia Research Forum (www.schizophreniaresearchforum.org). The final column represents the phenotype and disease associations according to the UCSC Genome Bioinformatics site.
Set-based tests were performed in Plink to assess the association between SNPs within a particular gene and case-control status. This test uses permutation to determine the significance. The default values were used (r-squared = .5; p-value = .05; maximum number of SNPs within a gene = 5); more details can be found at http://pngu.mgh.harvard.edu/~purcell/plink.
Results of the sensitivity analyses: a comparison of different nominal p-values.
| Nominal p-value | p (Netherlands) | p (GAIN) | p (Germany) | Fisher’s combined probability test |
| .05 | 0.001 | 0.023 | 0.001 | 3.98 E-6 |
| 0.01 | 0.002 | 0.04 | 0.06 | 4.2 E-4 |
| 0.1 | 0.001 | 0.001 | 0.003 | 6.4 E-7 |
| EIGENSTRAT | 0.004 | 0.001 | 0.04 | 2.23 E-5 |
Note. Results for variations of the method for the 32 Mbp width region on chromosome 5, 128–160 bp. Nom 0.01: the analysis was performed using p = 0.01 as the cutoff for nominal SNP-wise significance. Nom 0.11: the analysis was performed using p = 0.1 as the cutoff for nominal SNP-wise significance. EIGENSTRAT: the analysis was performed on data corrected for population stratification using the EIGENSTRAT procedure.