| Literature DB >> 31063461 |
Qi Yan1, Nianjun Liu2, Erick Forno1, Glorisa Canino3, Juan C Celedón1, Wei Chen1,4,5.
Abstract
The development of high-throughput biotechnologies allows the collection of omics data to study the biological mechanisms underlying complex diseases at different levels, such as genomics, epigenomics, and transcriptomics. However, each technology is designed to collect a specific type of omics data. Thus, the association between a disease and one type of omics data is usually tested individually, but this strategy is suboptimal. To better articulate biological processes and increase the consistency of variant identification, omics data from various platforms need to be integrated. In this report, we introduce an approach that uses a modified Fisher's method (denoted as Omnibus-Fisher) to combine separate p-values of association testing for a trait and SNPs, DNA methylation markers, and RNA sequencing, calculated by kernel machine regression into an overall gene-level p-value to account for correlation between omics data. To consider all possible disease models, we extend Omnibus-Fisher to an optimal test by using perturbations. In our simulations, a usual Fisher's method has inflated type I error rates when directly applied to correlated omics data. In contrast, Omnibus-Fisher preserves the expected type I error rates. Moreover, Omnibus-Fisher has increased power compared to its optimal version when the true disease model involves all types of omics data. On the other hand, the optimal Omnibus-Fisher is more powerful than its regular version when only one type of data is causal. Finally, we illustrate our proposed method by analyzing whole-genome genotyping, DNA methylation data, and RNA sequencing data from a study of childhood asthma in Puerto Ricans.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31063461 PMCID: PMC6524814 DOI: 10.1371/journal.pgen.1008142
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Simulated Type I error rates based on 10,000 datasets.
| Significance level | Optimal Omnibus-Fisher | Omnibus-Fisher | Usual Fisher |
|---|---|---|---|
| 0.05 | 0.0521 | 0.0516 | 0.0522 |
| 0.01 | 0.0100 | 0.0109 | 0.0108 |
| 0.001 | 0.0015 | 0.0015 | 0.0015 |
| 0.05 | 0.0483 | 0.0472 | 0.0478 |
| 0.01 | 0.0088 | 0.0095 | 0.0095 |
| 0.001 | 0.0007 | 0.0009 | 0.0009 |
| 0.05 | 0.0488 | 0.0526 | 0.0675 |
| 0.01 | 0.0089 | 0.0107 | 0.0161 |
| 0.001 | 0.0010 | 0.0015 | 0.0029 |
| 0.05 | 0.0525 | 0.0509 | 0.0655 |
| 0.01 | 0.0093 | 0.0096 | 0.0166 |
| 0.001 | 0.0005 | 0.0011 | 0.0029 |
Genes with P < 1×10−4 from the optimal or regular Omnibus-Fisher tests for the asthma status analysis in WBCs.
| Optimal Omnibus-Fisher | Regular Omnibus-Fisher | Gene-level SNP | Gene-level | Gene-level RNA expression | ||
|---|---|---|---|---|---|---|
| 17 | 1.40×10−5 | 3.39×10−5 | 2.89×10−6 | 0.1071 | 0.6728 | |
| 17 | 2.00×10−5 | 1.67×10−4 | 2.36×10−6 | 0.8816 | 0.6153 | |
| 17 | 7.00×10−4 | 8.06×10−5 | 0.0102 | 0.0047 | 0.0127 | |
| 6 | 1.90×10−4 | 5.65×10−5 | 0.2343 | 0.0109 | 2.00×10−4 | |
| 11 | 4.00×10−4 | 9.25×10−5 | 0.0086 | 4.76×10−4 | 0.1779 | |
| 14 | 7.00×10−5 | 5.13×10−5 | 0.1012 | 2.30×10−5 | 0.1757 | |
| 16 | 0.0020 | 7.25×10−5 | 3.57×10−4 | 0.0688 | 0.0197 | |