| Literature DB >> 21918629 |
Kajsa Ljungberg1, Kateryna Mishchenko, Sverker Holmgren.
Abstract
We present a two-phase strategy for optimizing a multidimensional, nonconvex function arising during genetic mapping of quantitative traits. Such traits are believed to be affected by multiple so called quantitative trait loci (QTL), and searching for d QTL results in a d-dimensional optimization problem with a large number of local optima. We combine the global algorithm DIRECT with a number of local optimization methods that accelerate the final convergence, and adapt the algorithms to problem-specific features. We also improve the evaluation of the QTL mapping objective function to enable exploitation of the smoothness properties of the optimization landscape. Our best two-phase method is demonstrated to be accurate in at least six dimensions and up to ten times faster than currently used QTL mapping algorithms.Entities:
Keywords: DIRECT; QTL mapping; global optimization
Year: 2010 PMID: 21918629 PMCID: PMC3170002 DOI: 10.2147/AABC.S9240
Source DB: PubMed Journal: Adv Appl Bioinform Chem ISSN: 1178-6949
Figure 2The search space for the outer problem is divided into chromosome combination boxes, cc-boxes. Each cc-box is further divided into marker boxes, m-boxes.
Figure 3A part of a typical objective function f(x) for the outer problem. The discontinuities at cc-box boundaries can be seen as straight lines in the contour plot below the surface.
Stopping rule parameters
| Algorithm | D | D-D | D-SD | D-QN |
|---|---|---|---|---|
| 41 | 32 | 25 | 22 |
The maximal number of function evaluations for different values of d, all back-cross data sets
| 2 | 3 | 4 | 5 | 6 | |
|---|---|---|---|---|---|
| Exhaustive grid search | 13778625 | 2 · 1010 | 3 · 1013 | 3 · 1016 | 3 · 1019 |
| D | 2409 | 13501 | 126022 | 995586 | >6650000 |
| D-D | 787 | 8571 | 59944 | 326606 | 1618725 |
| D-SD | 601 | 4296 | 25891 | 110433 | 418476 |
| D-QN | 530 | 3637 | 19980 | 71113 | 236636 |
The maximal number of function evaluations for different values of d, all intercross data sets
| 2 | 3 | 4 | 5 | 6 | |
|---|---|---|---|---|---|
| Exhaustive grid search | 13778625 | 2 · 1010 | 3 · 1013 | 3 · 1016 | 3 · 1019 |
| D | 1355 | 15010 | 124989 | 1341120 | >6676400 |
| D-D | 1341 | 8985 | 54572 | 486633 | 1618411 |
| D-SD | 958 | 5445 | 24204 | 217926 | 415408 |
| D-QN | 778 | 4035 | 17310 | 149691 | 246884 |
Figure 4The maximal number of function evaluations for different values of d, all backcross data sets.
Figure 5The maximal number of function evaluations for different values of d, all D-QN 0.49 0.54 0.41 0.48 0.52 intercross data sets.
The fraction of function evaluations in local algorithm, backcross data
| 2 | 3 | 4 | 5 | 6 | |
|---|---|---|---|---|---|
| D-D | 0.49 | 0.75 | 0.80 | 0.87 | 0.89 |
| D-SD | 0.39 | 0.40 | 0.52 | 0.58 | 0.60 |
| D-QN | 0.31 | 0.40 | 0.48 | 0.52 | 0.52 |
Fraction of function evaluations in local algorithm, intercross data
| 2 | 3 | 4 | 5 | 6 | |
|---|---|---|---|---|---|
| D-D | 0.71 | 0.79 | 0.79 | 0.82 | 0.89 |
| D-SD | 0.56 | 0.66 | 0.51 | 0.61 | 0.60 |
| D-QN | 0.49 | 0.54 | 0.41 | 0.48 | 0.52 |
Results from forward selection, ratio of actual error to accepted error
| 2 | 3 | 4 | 5 | 6 | |
|---|---|---|---|---|---|
| Backcross data | 51, wrong cc-box | 16 | 10, wrong cc-box | 69, wrong cc-box | 8 |
| Intercross data | 0.5 | 5 | 14 | 2, wrong cc-box | 23 |
Heritabilities
| d | 2 | 3 | 4 | 5 | 6 |
|---|---|---|---|---|---|
| 0.05 | 0.17 | 0.20 | 0.31 | 0.34 | |
| 0.08 | 0.15 | 0.20 | 0.27 | 0.34 |