| Literature DB >> 23979571 |
Riyan Cheng1, Justin Borevitz, R W Doerge.
Abstract
A major consideration in multitrait analysis is which traits should be jointly analyzed. As a common strategy, multitrait analysis is performed either on pairs of traits or on all of traits. To fully exploit the power of multitrait analysis, we propose variable selection to choose a subset of informative traits for multitrait quantitative trait locus (QTL) mapping. The proposed method is very useful for achieving optimal statistical power for QTL identification and for disclosing the most relevant traits. It is also a practical strategy to effectively take advantage of multitrait analysis when the number of traits under consideration is too large, making the usual multivariate analysis of all traits challenging. We study the impact of selection bias and the usage of permutation tests in the context of variable selection and develop a powerful implementation procedure of variable selection for genome scanning. We demonstrate the proposed method and selection procedure in a backcross population, using both simulated and real data. The extension to other experimental mapping populations is straightforward.Entities:
Keywords: multitrait mapping; quantitative trait locus (QTL); statistical power; variable selection
Mesh:
Substances:
Year: 2013 PMID: 23979571 PMCID: PMC3813856 DOI: 10.1534/genetics.113.155937
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562
Figure 1(A–C) Mapping profiles for single-trait analysis of the 16 traits (A) and multitrait analysis of selected traits vs. that of all 16 traits (B) and selected traits (C). The horizontal lines are 0.05 significance thresholds adjusted for all the markers (and all the traits in the case of single-trait analysis). Each vertical section displays one chromosome. C shows if a trait is selected for multitrait analysis (green), if the single-trait mapping curve goes above the threshold line at the locus (yellow), and if both occur (red).
Estimated type I error rates and standard errors
| Significance level | |||
|---|---|---|---|
| Method | 0.1 | 0.05 | 0.01 |
| A | 0.552 (0.0157) | 0.443 (0.0157) | 0.231 (0.0133) |
| B | 0.207 (0.0128) | 0.136 (0.0108) | 0.041 (0.0063) |
| C | 0.098 (0.0094) | 0.051 (0.0070) | 0.011 (0.0033) |
| D | 0.116 (0.0101) | 0.061 (0.0076) | 0.018 (0.0042) |
Four methods were implemented to select an optimal subset of traits for multitrait analysis of the permuted data: (A) use the same traits as selected in the analysis of the original data, (B) select the same number of traits as in the analysis of the original data, (C) use the same procedure as selecting traits in the analysis of the original data, and (D) select a predefined number (five) of traits. Standard errors of the estimated type I error rates are given in parentheses.
Figure 2(A–D) Statistical power estimated from 1000 replicate simulations at the significance level 0.05
Figure 3(A–C) Number of selected traits (A), estimated statistical power using random 50% subsamples (B), and difference in the estimated power between the proposed VSFOP procedure and any of other methods (C). Five methods were considered: (1) single-trait analysis (ST), (2) multitrait analysis of selected traits (SL), (3) multitrait analysis of all traits (AT), (4) multitrait analysis of clustered traits (CL), and (5) single-trait analysis of the first eight principal components (PC). Each vertical section displays one chromosome.