| Literature DB >> 28615668 |
Huiying Zhao1, Dale R Nyholt2, Yuanhao Yang2, Jihua Wang3, Yuedong Yang4.
Abstract
Genome-wide association studies (GWAS) have successfully identified single variants associated with diseases. To increase the power of GWAS, gene-based and pathway-based tests are commonly employed to detect more risk factors. However, the gene- and pathway-based association tests may be biased towards genes or pathways containing a large number of single-nucleotide polymorphisms (SNPs) with small P-values caused by high linkage disequilibrium (LD) correlations. To address such bias, numerous pathway-based methods have been developed. Here we propose a novel method, DGAT-path, to divide all SNPs assigned to genes in each pathway into LD blocks, and to sum the chi-square statistics of LD blocks for assessing the significance of the pathway by permutation tests. The method was proven robust with the type I error rate >1.6 times lower than other methods. Meanwhile, the method displays a higher power and is not biased by the pathway size. The applications to the GWAS summary statistics for schizophrenia and breast cancer indicate that the detected top pathways contain more genes close to associated SNPs than other methods. As a result, the method identified 17 and 12 significant pathways containing 20 and 21 novel associated genes, respectively for two diseases. The method is available online by http://sparks-lab.org/server/DGAT-path .Entities:
Mesh:
Year: 2017 PMID: 28615668 PMCID: PMC5471232 DOI: 10.1038/s41598-017-03826-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The workflow of DGAT-path to assess pathway associations.
Figure 2Comparisons of methods by (a) the type I error rate and (b) the power at different nominal P-values.
Figure 3Comparisons of methods under different pathway sizes and percentages of significant SNPs by (a) the type I error rates and (b) the power at the nominal P-value of 0.05.
Figure 4The percentages of genes overlapping with the associated genes from recent GWAS catalogs as a function of the total number of genes in different numbers of top pathways detected by methods on the datasets of (a) SCZ (b) breast cancer.
Pathways identified by DGAT-path as significantly associated with SCZ and Breast cancer.
| Pathway ID | Pathway Name | P-value |
|---|---|---|
| SCZ | ||
| GO:0007156 | Homophilic cell adhesion | <10−6 |
| GO:0007628 | Adult walking behavior | <10−6 |
| GO:0014069 | Postsynaptic density | <10−6 |
| GO:0045211 | Postsynaptic membrane | <10−6 |
| GO:0046415 | Urate metabolic process | <10−6 |
| path:hsa04713 | Circadian entrainment | <10−6 |
| path:hsa04724 | Glutamatergic synapse | <10−6 |
| path:hsa04730 | Long-term depression | <10−6 |
| GO:0005245 | Voltage-gated calcium channel activity | 1.9 × 10−6 |
| P00019 | Endothelin signaling pathway | 1.9 × 10−6 |
| REACT_18325 | Regulation of insulin secretion | 1.9 × 10−6 |
| GO:0044325 | Ion channel binding | 3.0 × 10−6 |
| path:hsa04270 | Vascular smooth muscle contraction | 6.9 × 10−6 |
| GO:0007169 | Transmembrane receptor protein tyrosine kinase signaling pathway | 6.9 × 10−6 |
| path:hsa04720 | Long-term potentiation | 6.9 × 10−6 |
| P00057 | Wnt signaling pathway | 7.9 × 10−6 |
| path:hsa04261 | Adrenergic signaling in cardiomyocytes | 8.9 × 10−6 |
| Breast Cancer | ||
| GO:0002456 | T cell mediated immunity | <9.9. × 10−7 |
| GO:0003964 | RNA-directed DNA polymerase activity | <9.9 × 10−7 |
| GO:0005751 | Mitochondrial respiratory chain complex IV | <9.9 × 10−7 |
| GO:0007506 | Gonadal mesoderm development | <9.9 × 10−7 |
| GO:0019031 | Yiral envelope | <9.9 × 10−7 |
| GO:0042287 | MHC protein binding | <9.9 × 10−7 |
| GO:0042989 | Sequestering of actin monomers | <9.9 × 10−7 |
| GO:0050658 | RNA transport | <9.9 × 10−7 |
| GO:0070544 | Histone H3-K36 demethylation | <9.9 × 10−7 |
| P00012 | Cadherin signaling pathway | <9.9 × 10−7 |
| GO:0045211 | Postsynaptic membrane | <9.9 × 10−7 |
| GO:0045296 | Cadherin binding | 4.0 × 10−6 |