| Literature DB >> 18405361 |
Sang Hong Lee1, Julius H J Van der Werf, Brian P Kinghorn.
Abstract
BACKGROUND: Haplotype reconstruction is important in linkage mapping and association mapping of quantitative trait loci (QTL). One widely used statistical approach for haplotype reconstruction is simulated annealing (SA), implemented in SimWalk2. However, the algorithm needs a very large number of sequential iterations, and it does not clearly show if convergence of the likelihood is obtained.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18405361 PMCID: PMC2375132 DOI: 10.1186/1471-2105-9-189
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Log likelihood of the best haplotypes found by SimWalk2, EA1 and EA2 with simulated data
| Complete genotypic data | ||||
| no. evaluations‡: | 20000 | 100000 | 500000 | 16000000* |
| SimWalk2 | -1960.81 | -1827.53 | -1804.93 | -1801.7 (32 minutes)¶ |
| EA1 | -2197.17 | -1852.43 | -1802.16 | -1800.55 (9 minutes) |
| EA2 | -2299.78 | -1848.51 | -1802.39 | -1800.55 (10 minutes) |
| Incomplete genotypic data | ||||
| no. evaluations‡: | 40000 | 400000 | 4000000 | 16000000 |
| SimWalk2 | -943.83 | -791.41 | -769.97 | -767.43 (12 minutes) |
| EA1 | -1108.24 | -791.18 | -766.74 | -764.66 (6 minutes) |
| EA2 | -1112.86 | -782.19 | -765.59 | -764.43 (7 minutes) |
‡number of evaluations of the likelihood function, *16000000 is from the default annealing schedule of SimWalk2
¶Computing time for SimWalk2, EA1 and EA2 (EA-population size N = 10 with 4 CPUs)
Log likelihood of the best haplotypes found by SimWalk2, EA1 and EA2 with real data
| No. evaluations: | 260000 | 520000 | 1050000 | 2100000 | 21000000 |
| SimWalk2 | -24439.02 | -23551.84 | -23312.14 | -23193.18 | -23159.19 (434 minutes)¶ |
| EA1 | -25992.16 | -24258.61 | -23368.69 | -23164.58 | -23158.60 (106 minutes) |
| EA2 | -25900.78 | -23871.49 | -23194.87 | -23158.39 | -23158.39 (126 minutes) |
¶Computing time for SimWalk2, EA1 and EA2 (EA-population size N = 10 with 4 CPUs)
Figure 1Convergence and computational efficiency. Log likelihood pattern of SimWalk2, EA1 and EA2 along with the actual computing time when using 4 processors simultaneously
Figure 2Simple example pedigree. Five founders with their numbered gametes (f1~ f10) and 5 descendents with their numbered meiosis (m1~m10). Unordered genotypes for the first and second marker are in the brackets underneath the animal ID5, ID9 and ID10 (other animals are not genotyped).
A legal configuration of segregation indicators for the meiosis, and allele assignment to the founder gametes given the segregation indicators at the first and second marker
| m1 | m2 | m3 | m4 | m5 | m6 | m7 | m8 | m9 | m10 | |
| marker 1 | 0 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 |
| marker 2 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| f1 | f2 | f3 | f4 | f5 | f6 | f7 | f8 | f9 | f10 | |
| marker 1 | a | - | - | a | - | c | - | - | a | - |
| marker 2 | - | y | x | - | - | - | - | y | x | - |
Log likelihood = -46.143
Updating segregation indicators for a meiosis at a marker subsequently, and the change of allele assignment to the founder gamete given the updated segregation indicators
| I. Log likelihood = -43.837 | ||||||||||
| m1 | m2 | m3 | m4 | m5 | m6 | m7 | m8 | m9 | m10 | |
| Marker 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | |
| Marker 2 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| f1 | f2 | f3 | f4 | f5 | f6 | f7 | f8 | f9 | f10 | |
| Marker 1 | - | - | a | - | c | - | - | a | - | |
| Marker 2 | - | y | x | - | - | - | - | y | x | - |
| II. Log likelihood = -41.531 | ||||||||||
| m1 | m2 | m3 | m4 | m5 | m6 | m7 | m8 | m9 | m10 | |
| marker 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | |
| marker 2 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| f1 | f2 | f3 | f4 | f5 | f6 | f7 | f8 | f9 | f10 | |
| marker 1 | - | a | - | - | c | - | - | a | - | |
| marker 2 | - | y | x | - | - | - | - | y | x | - |
| III. Log likelihood = -39.225 | ||||||||||
| m1 | m2 | m3 | m4 | m5 | m6 | m7 | m8 | m9 | m10 | |
| marker 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | |
| marker 2 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| f1 | f2 | f3 | f4 | f5 | f6 | f7 | f8 | f9 | f10 | |
| marker 1 | - | a | a | - | - | c | - | - | a | - |
| marker 2 | - | y | x | - | - | - | - | y | x | - |
| IV. Log likelihood = -36.919 | ||||||||||
| m1 | m2 | m3 | m4 | m5 | m6 | m7 | m8 | m9 | m10 | |
| marker 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | |
| marker 2 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| f1 | f2 | f3 | f4 | f5 | f6 | f7 | f8 | f9 | f10 | |
| marker 1 | - | a | a | - | - | - | - | a | - | |
| marker 2 | - | y | x | - | - | - | - | y | x | - |
| V. Log likelihood = -34.614 | ||||||||||
| m1 | m2 | m3 | m4 | m5 | m6 | m7 | m8 | m9 | m10 | |
| marker 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | |
| marker 2 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| f1 | f2 | f3 | f4 | f5 | f6 | f7 | f8 | f9 | f10 | |
| marker 1 | - | a | a | - | c | - | - | - | a | - |
| marker 2 | - | y | x | - | - | - | - | y | x | - |
| VI. Log likelihood = -32.308 | ||||||||||
| m1 | m2 | m3 | m4 | m5 | m6 | m7 | m8 | m9 | m10 | |
| marker 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | |
| marker 2 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| f1 | f2 | f3 | f4 | f5 | f6 | f7 | f8 | f9 | f10 | |
| marker 1 | - | a | a | - | c | - | - | - | a | - |
| marker 2 | - | y | x | - | - | - | - | y | x | - |
| VII. State is illegal | ||||||||||
| m1 | m2 | m3 | m4 | m5 | m6 | m7 | m8 | m9 | m10 | |
| marker 1 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | |
| marker 2 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| f1 | f2 | f3 | f4 | f5 | f6 | f7 | f8 | f9 | f10 | |
| marker 1 | - | a | - | - | - | - | - | - | ||
| marker 2 | - | y | x | - | - | - | - | y | x | - |
Segregation state of founder gametes for genotyped animals and the changes due to switching m7 at the first marker
| Genotyped animal | paternal | maternal |
| ID5 | f3 | |
| ID9 | F5 -> f2 | f9 |
| ID10 | F5 -> f2 |
Joint updates
| With joint update for m1 and m7 (log likelihood = -32.308) | ||||||||||
| m1 | m2 | m3 | m4 | m5 | m6 | m7 | m8 | m9 | m10 | |
| Marker 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | ||
| Marker 2 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| f1 | f2 | f3 | f4 | f5 | f6 | f7 | f8 | f9 | f10 | |
| marker 1 | a | - | - | - | - | - | a | - | ||
| marker 2 | - | y | x | - | - | - | - | y | x | - |
| With joint update for m7 and m9 (log likelihood = -27.185) | ||||||||||
| m1 | m2 | m3 | m4 | m5 | m6 | m7 | m8 | m9 | m10 | |
| marker 1 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | ||
| marker 2 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| f1 | f2 | f3 | f4 | f5 | f6 | f7 | f8 | f9 | f10 | |
| marker 1 | - | a | a | - | - | - | - | - | ||
| marker 2 | - | y | x | - | - | - | - | y | x | - |
| With joint update for m7 and m10 (log likelihood = -31.797) | ||||||||||
| m1 | m2 | m3 | m4 | m5 | m6 | m7 | m8 | m9 | m10 | |
| marker 1 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | ||
| marker 2 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| f1 | f2 | f3 | f4 | f5 | f6 | f7 | f8 | f9 | f10 | |
| marker 1 | - | a | a | - | - | - | - | - | - | |
| marker 2 | - | y | x | - | - | - | - | y | x | - |