| Literature DB >> 18319721 |
Silpa Suthram1, Andreas Beyer, Richard M Karp, Yonina Eldar, Trey Ideker.
Abstract
Analysis of expression quantitative trait loci (eQTLs) is an emerging technique in which individuals are genotyped across a panel of genetic markers and, simultaneously, phenotyped using DNA microarrays. Because of the spacing of markers and linkage disequilibrium, each marker may be near many genes making it difficult to finely map which of these genes are the causal factors responsible for the observed changes in the downstream expression. To address this challenge, we present an efficient method for prioritizing candidate genes at a locus. This approach, called 'eQTL electrical diagrams' (eQED), integrates eQTLs with protein interaction networks by modeling the two data sets as a wiring diagram of current sources and resistors. eQED achieved a 79% accuracy in recovering a reference set of regulator-target pairs in yeast, which is significantly higher than the performance of three competing methods. eQED also annotates 368 protein-protein interactions with their directionality of information flow with an accuracy of approximately 75%.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18319721 PMCID: PMC2290938 DOI: 10.1038/msb.2008.4
Source DB: PubMed Journal: Mol Syst Biol ISSN: 1744-4292 Impact factor: 11.429
Figure 1Examples of the electrical circuit approach and the eQED model. (A) Sample network. (B) The ‘greedy' random walk approach by Tu . (C) The single-locus model of eQED. Gene T in the blue octagon is the target gene. The locus marked by the red box, containing candidate genes L1, L2 and L3, associates significantly with the target T. The numbers next to the locus genes correspond to the number of times they were visited in the random walk approaches or the amount of current through them in the electric circuit approach. (D) The random walk derived from (C). (E) The sample network with two significant loci. (F) The multiple-loci model of eQED.
Figure 2Flowchart of the eQED method.
Causal gene prediction accuracy
| Methods | Number of correct predictions |
|---|---|
| Random | 118 |
| Tu | 262 |
| Shortest path | 351 |
| eQED (single locus) | 392 |
| eQED (multiple loci) | 438 |
aAll predictions were tested against a gold-standard data set of 548 causal gene–target pairs compiled from yeast gene expression knockout studies by Hughes and Hu and a gene overexpression study by Chua .
bA naïve method in which the causal gene is selected to be the gene at the locus that is connected by the shortest path to the target.
Figure 3Inferred pathways and directionality prediction. (A) The accuracy of the direction prediction methods. The ‘gold' standard protein interactions were ranked according to the different metrics (x-axis), and the cumulative percent accuracy represented as y-axis. (B–D) The regulatory networks for three example target genes. The nodes colored in shades of red correspond to predicted causal genes. The intensity of color corresponds to their P-value of association with the target.