| Literature DB >> 28984187 |
Yuan-De Tan1, Xiang H F Zhang2,3,4,5, Qianxing Mo6,7.
Abstract
BACKGROUND: Dominant markers in an F2 population or a hybrid population have much less linkage information in repulsion phase than in coupling phase. Linkage analysis produces two separate complementary marker linkage maps that have little use in disease association analysis and breeding. There is a need to develop efficient statistical methods and computational algorithms to construct or merge a complete linkage dominant marker maps. The key for doing so is to efficiently estimate recombination fractions between dominant markers in repulsion phases. RESULT: We proposed an expectation least square (ELS) algorithm and binomial analysis of three-point gametes (BAT) for estimating gamete frequencies from F2 dominant and codominant marker data, respectively. The results obtained from simulated and real genotype datasets showed that the ELS algorithm was able to accurately estimate frequencies of gametes and outperformed the EM algorithm in estimating recombination fractions between dominant loci and recovering true linkage maps of 6 dominant loci in coupling and unknown linkage phases. Our BAT method also had smaller variances in estimation of two-point recombination fractions than the EM algorithm.Entities:
Keywords: Codominant marker; Dominant marker; ELS algorithm; EM algorithm; Gamete frequency
Mesh:
Substances:
Year: 2017 PMID: 28984187 PMCID: PMC5629630 DOI: 10.1186/s12859-017-1804-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The ELS estimated frequencies of four nonsister gametes in 20 triplets of 6 dominant loci in 333 F2 micea
| locus | frequency of non-sister gamete | Chi-square test | ||||||
|---|---|---|---|---|---|---|---|---|
| a | b | c |
|
|
|
|
| ratio |
| 1 | 2 | 3 | 0.208668 | 0.086162 | 0.094698 | 0.110472 | 0.000339 | |
| 1 | 2 | 4 | 0.14751 | 0.087958 | 0.160866 | 0.103665 | 0.028 | |
| 1 | 2 | 5 | 0.200976 | 0.080676 | 0.108494 | 0.109854 | 0.00092 | |
| 1 | 2 | 6 | 0.192229 | 0.084408 | 0.126033 | 0.09733 | 0.0023 | |
| 1 | 3 | 4 | 0.140566 | 0.079252 | 0.181795 | 0.098387 | 0.0038 | |
| 1 | 3 | 5 | 0.209093 | 0.065783 | 0.12237 | 0.102753 | 0.00012 | |
| 1 | 3 | 6 | 0.16895 | 0.063323 | 0.165648 | 0.102079 | 0.0011 | |
| 1 | 4 | 5 | 0.173539 | 0.100396 | 0.079665 | 0.1464 | 0.0069 | |
| 1 | 4 | 6 | 0.16482 | 0.102771 | 0.113819 | 0.118591 | 0.0837 | 1:1:1:1 |
| 1 | 5 | 6 | 0.173447 | 0.07932 | 0.148645 | 0.098588 | 0.0059 | |
| 2 | 3 | 4 | 0.141958 | 0.088919 | 0.172784 | 0.096339 | 0.012 | |
| 2 | 3 | 5 | 0.202566 | 0.085775 | 0.112098 | 0.099561 | 0.00084 | |
| 2 | 3 | 6 | 0.139672 | 0.086032 | 0.16843 | 0.105866 | 0.0212 | |
| 2 | 4 | 5 | 0.173634 | 0.117152 | 0.085607 | 0.123607 | 0.0212 | |
| 2 | 4 | 6 | 0.10113 | 0.133668 | 0.133668 | 0.131535 | 0.3134 | 1:1:1:1 |
| 2 | 5 | 6 | 0.156859 | 0.096648 | 0.142758 | 0.103735 | 0.0634 | 1:1:1:1 |
| 3 | 4 | 5 | 0.143582 | 0.120278 | 0.098137 | 0.138002 | 0.1954 | 1:1:1:1 |
| 3 | 4 | 6 | 0.105025 | 0.139707 | 0.129181 | 0.126086 | 0.3581 | 1:1:1:1 |
| 3 | 5 | 6 | 0.140841 | 0.109125 | 0.152351 | 0.097682 | 0.1028 | 1:1:1:1 |
| 4 | 5 | 6 | 0.146847 | 0.096392 | 0.156905 | 0.099856 | 0.0476 | |
a: The data came from MAPMAKER/EXP(3.0b) [27]
The BAT estimated frequencies of nonsister gametes in 20 triplets of 6 codominant loci in 333 F2 micea
|
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| 1 | 2 | 3 | 0.242929 | 0.066568 | 0.080146 | 0.110358 | 3.348e-07 | |
| 1 | 2 | 4 | 0.145838 | 0.08977 | 0.162134 | 0.102258 | 0.0291 | |
| 1 | 2 | 5 | 0.196094 | 0.091387 | 0.121051 | 0.091467 | 0.0017 | |
| 1 | 2 | 6 | 0.17224 | 0.104184 | 0.143308 | 0.080268 | 0.0098 | |
| 1 | 3 | 4 | 0.165099 | 0.068697 | 0.191983 | 0.074221 | 7.1334e-05 | |
| 1 | 3 | 5 | 0.222931 | 0.079297 | 0.147828 | 0.049944 | 1.0943e-06 | |
| 1 | 3 | 6 | 0.177615 | 0.065713 | 0.187929 | 0.068743 | 2.3780e-05 | |
| 1 | 4 | 5 | 0.158699 | 0.103969 | 0.114759 | 0.122573 | 0.1336 | 1:1:1:1 |
| 1 | 4 | 6 | 0.165089 | 0.113628 | 0.139128 | 0.082155 | 0.0249 | |
| 1 | 5 | 6 | 0.155874 | 0.091722 | 0.16263 | 0.089774 | 0.0121 | |
| 2 | 3 | 4 | 0.142565 | 0.093943 | 0.179432 | 0.08406 | 0.0050 | |
| 2 | 3 | 5 | 0.216411 | 0.069533 | 0.134853 | 0.079203 | 2.0404e-05 | |
| 2 | 3 | 6 | 0.160337 | 0.092614 | 0.172787 | 0.074262 | 0.0020 | |
| 2 | 4 | 5 | 0.172459 | 0.100044 | 0.105018 | 0.12248 | 0.0365 | |
| 2 | 4 | 6 | 0.154284 | 0.140079 | 0.121173 | 0.084464 | 0.0559 | 1:1:1:1 |
| 2 | 5 | 6 | 0.167782 | 0.072156 | 0.153072 | 0.10699 | 0.0057 | |
| 3 | 4 | 5 | 0.154314 | 0.118649 | 0.10895 | 0.118086 | 0.2051 | 1:1:1:1 |
| 3 | 4 | 6 | 0.144358 | 0.108635 | 0.131647 | 0.115359 | 0.3080 | 1:1:1:1 |
| 3 | 5 | 6 | 0.16738 | 0.053399 | 0.176627 | 0.102594 | 0.0002 | |
| 4 | 5 | 6 | 0.153613 | 0.092081 | 0.124467 | 0.129838 | 0.1092 | 1:1:1:1 |
a: The data came from MAPMAKER/EXP(3.0b) [27]
The ELS estimated frequencies of nonsister gametes in triplets of dominant loci 1, 2, 3 and 5 in 333 F2 mice
| locus | frequency of gamete | |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| 1 | 2 | 3 | 0.208668 | 0.086162 | 0.094698 | 0.110472 |
| 1 | 2 | 5 | 0.200976 | 0.080676 | 0.108494 | 0.109854 |
| 1 | 3 | 5 | 0.209093 | 0.065783 | 0.12237 | 0.102753 |
| 2 | 3 | 5 | 0.202566 | 0.085775 | 0.112098 | 0.099561 |
The estimated recombination fractions between dominant loci in four triples
| triple | Recombination fraction between loci | ||||
|---|---|---|---|---|---|
|
|
|
|
|
|
|
| 1 | 2 | 3 | 0.39326 | 0.36172 | 0.41034 |
| 1 | 2 | 5 | 0.38106 | 0.37834 | 0.43669 |
| 1 | 3 | 5 | 0.33707 | 0.37631 | 0.45024 |
| 2 | 3 | 5 | 0.37067 | 0.39575 | 0.42332 |
Comparison between two estimators of recombination fractions between markers
| two loci | the ELS estimate in dominant genotype data | the BAT estimate in codominant genotype data | |
|---|---|---|---|
| 1 | 2 | 0.387164 | 0.271317 |
| 1 | 3 | 0.349396 | 0.275955 |
| 1 | 5 | 0.377323 | 0.439563 |
| 2 | 3 | 0.390506 | 0.339240 |
| 2 | 5 | 0.416221 | 0.426574 |
| 3 | 5 | 0.436782 | 0.402158 |
Fig. 1Variances of estimated recombination fractions between adjacent dominant loci in unknown linkage phase deviated from their respective true values Variance of estimated recombination fraction between adjacent dominant loci is given by simulating 100 estimates around true recombination fraction between adjacent loci. The variance here is equivalent to mean square error (MSE)
Efficiencies of estimators of recombination fractions in recovering the true linkage maps of 6 dominant loci in the case of random distance
| Estimator | Linkage phase | Sample size | |||
|---|---|---|---|---|---|
| 100 | 200 | 300 | |||
| Two-point EM | CP | 92.3 | 97.8 | 100.0 | |
| UP | 15.7 | 22.9 | 23.4 | ||
| ELS | CP | 96.7 | 100.0 | 100.0 | |
| UP | 50.5 | 77.0 | 85.1 | ||
| BAT | CP | 82.1 | 95.9 | 97.9 | |
| UP | 26.0 | 40.9 | 42.3 | ||
CP: Coupling phase and UP: unknown phase