| Literature DB >> 21347143 |
Younghee Lee1, Jianrong Li, Eric Gamazon, James L Chen, Anna Tikhomirov, Nancy J Cox, Yves A Lussier.
Abstract
A key challenge for genome-wide association studies (GWAS) is to understand how single nucleotide polymorphisms (SNPs) mechanistically underpin complex diseases. While this challenge has been addressed partially by Gene Ontology (GO) enrichment of large list of host genes of SNPs prioritized in GWAS, these enrichment have not been formally evaluated. Here, we develop a novel computational approach anchored in information theoretic similarity, by systematically mining lists of host genes of SNPs prioritized in three adult-onset diabetes mellitus GWAS. The "gold-standard" is based on GO associated with 20 published diabetes SNPs' host genes and on our own evaluation. We computationally identify 69 similarity-predicted GO independently validated in all three GWAS (FDR<5%), enriched with those of the gold-standard (odds ratio=5.89, P=4.81e-05), and these terms can be organized by similarity criteria into 11 groupings termed "biomolecular systems". Six biomolecular systems were corroborated by the gold-standard and the remaining five were previously uncharacterized. http://lussierlab.org/publications/ITS-GWAS.Entities:
Year: 2010 PMID: 21347143 PMCID: PMC3041547
Source DB: PubMed Journal: Summit Transl Bioinform ISSN: 2153-6430
Reproducibility of Biomolecular Systems predicted in GWAS through . This table presents the count of GO terms enriched in each GWAS according to the unadjusted P-value (top row) and that have been replicated in two (FUSION and DGI, lower) or three studies (upper). Further, different rows present results for increasing numbers of prioritized host genes (PHG) derived from GWAS intragenic SNPs. Joint P-values for two and three GWAS were corrected with combinatorial Bonferroni correction ((P-value^m)*n, m=number of GWAS, n=count of GO studied∼2800). Odds ratios were calculated from a gold standard of biomolecular results (see Methods). Legend: empty sets (0) are presented in pale grey (see Methods, bootstrap); struck, FDR>5% unshaded, meets Bonferroni corrections; bold and red, odds ratio>confidence interval 95% according to our gold standard.
Reproducibility of Biomolecular Systems Predicted through This table present the count of GO terms enriched in each GWAS using information theoretic similarity (ITS) between GO terms (ITS>0.7, ). The lower table shows the number of GO terms predicted in two studies (FUSION and DGI) by ITS. The upper table is then associated with information theory to those of the WTCCC. Since a similarity of ITS=1 is the same thing as a join sets of GO (intersection of sets) produced in : see ; *odds ratio significant in every combination of GWAS at that PHG and P-value.