| Literature DB >> 19477981 |
Yang Huang1, Stefan Wuchty, Michael T Ferdig, Teresa M Przytycka.
Abstract
MOTIVATION: Analysis of expression quantitative trait loci (eQTL) significantly contributes to the determination of gene regulation programs. However, the discovery and analysis of associations of gene expression levels and their underlying sequence polymorphisms continue to pose many challenges. Methods are limited in their ability to illuminate the full structure of the eQTL data. Most rely on an exhaustive, genome scale search that considers all possible locus-gene pairs and tests the linkage between each locus and gene. RESULT: To analyze eQTLs in a more comprehensive and efficient way, we developed the Graph based eQTL Decomposition method (GeD) that allows us to model genotype and expression data using an eQTL association graph. Through graph-based heuristics, GeD identifies dense subgraphs in the eQTL association graph. By identifying eQTL association cliques that expose the hidden structure of genotype and expression data, GeD effectively filters out most locus-gene pairs that are unlikely to have significant linkage. We apply GeD on eQTL data from Plasmodium falciparum, the human malaria parasite, and show that GeD reveals the structure of the relationship between all loci and all genes on a whole genome level. Furthermore, GeD allows us to uncover additional eQTLs with lower FDR, providing an important complement to traditional eQTL analysis methods.Entities:
Mesh:
Year: 2009 PMID: 19477981 PMCID: PMC2687943 DOI: 10.1093/bioinformatics/btp189
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.(a) Each genomic locus l is assigned the genotype of the strain it was inherited from (0/1). In all strains s, we discretize expression levels of genes g as being either ‘up’, ‘unchanged’ or ‘down’. Considering the genotypes of l, we observe different gene expression patterns, indicating different expression mechanisms in different strains. For example, the expression of g is up-regulated in strain s, s, s and s, and down-regulated in strain s, s, s and s. In (b) we show the corresponding eQTL association graph. Specifically, we find an association clique including genes g and g, strains s, s, s, s and loci l and l, shown in light grey. The edges of the association clique are drawn with wider lines.
Fig. 2.Number of association cliques from real and random data with support 6. Numbers we obtained from randomized data were averaged over 100 runs. In (a) we perform the analysis varying |G|, the number of probes, while we show the analogous results with changing number of loci |L| in (b).
Fig. 3.Histogram of nominal P-values for all possible locus–gene pairs and pairs we selected from eQTL association cliques.
Fig. 4.In (a), we show genome-wide eQTL distributions in P.falciparum by testing all possible combination of loci and genes (Gonzales et al., 2008). In (b), we find similar, yet enriched patterns of eQTLs we detected with GeD.