| Literature DB >> 18466586 |
Abstract
With the availability of high-throughput microarray technologies, investigators can simultaneously measure the expression levels of many thousands of genes in a short period. Although there are rich statistical methods for analyzing microarray data in the literature, limited work has been done in mapping expression quantitative trait loci (eQTL) that influence the variation in levels of gene expression. Most existing eQTL mapping methods assume that the expression phenotypes follow a normal distribution and violation of the normality assumption may lead to inflated type I error and reduced power. QTL analysis of expression data involves the mapping of many expression phenotypes at thousands or hundreds of thousands of marker loci across the whole genome. An appropriate procedure to adjust for multiple testing is essential for guarding against an abundance of false positive results. In this study, we applied a semiparametric quantitative trait loci (SQTL) mapping method to human gene expression data. The SQTL mapping method is rank-based and therefore robust to non-normality and outliers. Furthermore, we apply an efficient Monte Carlo procedure to account for multiple testing and assess the genome-wide significance level. Particularly, we apply the SQTL mapping method and the Monte-Carlo approach to the gene expression data provided by Genetic Analysis Workshop 15.Entities:
Year: 2007 PMID: 18466586 PMCID: PMC2367566 DOI: 10.1186/1753-6561-1-s1-s83
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Number of mapped phenotypes at the genome-wide significance levels of 0.01, 0.05, 1.0 × 10-3, 1.0 × 10-4, and 1.0 × 10-5
| Count of mapped phenotypesa | ||||||
| Chromosome | Region | *α = 0.05 | α = 0.01 | α = 1.0 × 10-3 | α = 1.0 × 10-4 | α = 1.0 × 10-5 |
| 10 | 26.35817Mb-29.46754Mb | 244 | 117 | 61 | 56 | 51 |
| 19 | 60.12027Mb-63.63787Mb | 52 | 25 | 17 | 17 | 17 |
| 21 | 46.14523Mb-46.84680Mb | 91 | 34 | 18 | 15 | 15 |
| 22 | 21.40594Mb-24.67240Mb | 79 | 34 | 25 | 23 | 23 |
| 22 | 42.36342Mb-43.48228Mb | 54 | 19 | 15 | 15 | 15 |
aα, is the genome-wide significance level
Summary statistics of four non-normally distributed expression phenotypes
| Gene | Mininum | Mean | SD | Maximum | Skewness | Kurtosis | |
| 211518_s_at | 2.10 | 3.50 | 1.06 | 9.13 | 2.59 | 8.47 | 3.3 × 10-17 |
| 204695_at | 4.21 | 8.44 | 0.94 | 9.79 | -1.82 | 3.95 | 1.7 × 10-13 |
| 202982_s_at | 3.20 | 7.98 | 0.68 | 9.19 | -3.25 | 18.89 | 2.7 × 10-16 |
| 202950_at | 5.99 | 10.62 | 0.59 | 11.80 | -2.67 | 19.16 | 3.3 × 10-13 |
Figure 1Histograms showing distributions of four non-normally distributed expression phenotypes.
Figure 2LOD score plots for four non-normally distributed expression phenotypes: SQTL (red solid) and standard VC approach (black solid).