| Literature DB >> 22969789 |
Ovidiu D Iancu1, Priscila Darakjian, Sunita Kawane, Daniel Bottomly, Robert Hitzemann, Shannon McWeeney.
Abstract
Complex Mus musculus crosses, e.g., heterogeneous stock (HS), provide increased resolution for quantitative trait loci detection. However, increased genetic complexity challenges detection methods, with discordant results due to low data quality or complex genetic architecture. We quantified the impact of theses factors across three mouse crosses and two different detection methods, identifying procedures that greatly improve detection quality. Importantly, HS populations have complex genetic architectures not fully captured by the whole genome kinship matrix, calling for incorporating chromosome specific relatedness information. We analyze three increasingly complex crosses, using gene expression levels as quantitative traits. The three crosses were an F(2) intercross, a HS formed by crossing four inbred strains (HS4), and a HS (HS-CC) derived from the eight lines found in the collaborative cross. Brain (striatum) gene expression and genotype data were obtained using the Illumina platform. We found large disparities between methods, with concordance varying as genetic complexity increased; this problem was more acute for probes with distant regulatory elements (trans). A suite of data filtering steps resulted in substantial increases in reproducibility. Genetic relatedness between samples generated overabundance of detected eQTLs; an adjustment procedure that includes the kinship matrix attenuates this problem. However, we find that relatedness between individuals is not evenly distributed across the genome; information from distinct chromosomes results in relatedness structure different from the whole genome kinship matrix. Shared polymorphisms from distinct chromosomes collectively affect expression levels, confounding eQTL detection. We suggest that considering chromosome specific relatedness can result in improved eQTL detection.Entities:
Keywords: collaborative cross; eQTL detection; gene expression; mouse genetics; population substructure
Year: 2012 PMID: 22969789 PMCID: PMC3427913 DOI: 10.3389/fgene.2012.00157
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1HAPPY intervals are compared with EMMA individual markers. HAPPY eQTLs associated with interval 1 correspond to EMMA eQTLs associated with either marker A or marker B. EMMA eQTLs associated with marker C correspond with HAPPY eQTLs associated with either interval 2 or 3.
Figure 2Initial comparison of eQTL detection across mouse crosses and methods. (A) Level of overlap across the three mouse crosses; HS4 results are superior to both F2 and HS-CC. (B–D) Cis results are more reproducible in all three datasets.
Figure 3Results of data filtering on the concordance between HAPPY and EMMA. (A) Concordance comparison across the three data sets. HS4 concordance is best, with HS-CC and F2 slightly behind. (B–D) Concordance before and after data filtering. In all cases data filtering improves concordance between the methods.
Figure 4Concordance of results and number of eQTLs for the probes retained after data filtering. (A) The HAPPY and EMMA results for the retained probes are compared using ROC analysis. (B,C) Number of eQTLs detected by HAPPY and EMMA, respectively, before and after data filtering.
Figure 5Results of the JM procedure. (A) JM compared with HAPPY in the F2 data. There is no improvement in the ability to reproduce the original EMMA results. (B) HS4 results show better ability of JM to reproduce EMMA results. (C) HS-CC results, JM has best improvement of JM of HAPPY. (D) Overlap of eQTLs (p < 10–5) across the three methods with JM detecting a large portion of intersection of HAPPY and EMMA results.