| Literature DB >> 20964847 |
Andrew K Rider1, Geoffrey Siwo, Nitesh V Chawla, Michael Ferdig, Scott J Emrich.
Abstract
BACKGROUND: Complexity and noise in expression quantitative trait loci (eQTL) studies make it difficult to distinguish potential regulatory relationships among the many interactions. The predominant method of identifying eQTLs finds associations that are significant at a genome-wide level. The vast number of statistical tests carried out on these data make false negatives very likely. Corrections for multiple testing error render genome-wide eQTL techniques unable to detect modest regulatory effects. We propose an alternative method to identify eQTLs that builds on traditional approaches. In contrast to genome-wide techniques, our method determines the significance of an association between an expression trait and a locus with respect to the set of all associations to the expression trait. The use of this specific information facilitates identification of expression traits that have an expression profile that is characterized by a single exceptional association to a locus. Our approach identifies expression traits that have exceptional associations regardless of the genome-wide significance of those associations. This property facilitates the identification of possible false negatives for genome-wide significance. Further, our approach has the property of prioritizing expression traits that are affected by few strong associations. Expression traits identified by this method may warrant additional study because their expression level may be affected by targeting genes near a single locus.Entities:
Mesh:
Year: 2010 PMID: 20964847 PMCID: PMC2974753 DOI: 10.1186/1471-2105-11-526
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Lowest corrected p-value versus Hellinger distance. The Hellinger distance and p-value have a weak correlation, indicating that genome-wide significance is not a major consideration in the calculation of Hellinger distance. The r2 value for the linear regression model on this data is 0.248. The pluses represent expression traits for which the strongest association to a locus is on the same chromosome as the trait.
Figure 2Distribution of hellinger distance for all expression traits. A histogram displaying the distribution of Hellinger distances across all 7665 expression traits. The larger Hellinger distances represent expression traits that may be regulated equally by multiple loci while the smaller Hellinger distances correspond to expression traits which have single or few exceptionally strong associations.
Hotspots on each chromosome found using Hellinger distance (HD) and the genome-wide approach (GW) and the proportion of cis-acting eQTLs in hotspots on the chromosome.
| chromosome | HD | cis HD eQTLs | GW | cis GW eQTLs |
|---|---|---|---|---|
| 3 | 5 | 0/27 | 1 | 0/29 |
| 4 | 2 | 5/9 | 0 | 0 |
| 5 | 6 | 6/191 | 8 | 12/439 |
| 7 | 2 | 0/11 | 0 | 0 |
| 8 | 2 | 0/14 | 0 | 0 |
| 9 | 1 | 0/7 | 1 | 1/12 |
| 10 | 1 | 0/4 | 0 | 0 |
| 12 | 2 | 0/12 | 1 | 0/18 |
| 14 | 1 | 0/4 | 0 | 0 |
Cis-acting eQTLs were defined as those which are most strongly associated to markers on the chromosome they appear in.
Figure 3Comparison of eQTL hotspots using the genome-wide and Hellinger distance approaches. Marker positions on the genome versus the frequency of significant associations or eQTLs. The dashed line represents the cutoff for significance. The first bar graph shows the frequency of eQTLs by the genome-wide method while the second shows the frequency of expression traits with small Hellinger distance. Traits with significant Hellinger distance are assigned to the marker they are most strongly associated with.
Columns small tail and large tail indicate the number of total GO terms found for expression traits in the denoted tail of the Hellinger distance distribution that are not enriched in expression traits with eQTLs.
| GO category | small tail | large tail |
|---|---|---|
| process | 43 | 8 |
| function | 5 | 7 |
| component | 4 | 27 |
The small tail contained 28 cis-acting eQTLs and the large tail contained 12. Each tail containted 318 eQTLs.