| Literature DB >> 19671130 |
Albart Coster1, Henri C M Heuven, Rohan L Fernando, Jack C M Dekkers.
Abstract
BACKGROUND: Current methods for haplotype inference without pedigree information assume random mating populations. In animal and plant breeding, however, mating is often not random. A particular form of nonrandom mating occurs when parental individuals of opposite sex originate from distinct populations. In animal breeding this is called crossbreeding and hybridization in plant breeding. In these situations, association between marker and putative gene alleles might differ between the founding populations and origin of alleles should be accounted for in studies which estimate breeding values with marker data. The sequence of alleles from one parent constitutes one haplotype of an individual. Haplotypes thus reveal allele origin in data of crossbred individuals.Entities:
Mesh:
Year: 2009 PMID: 19671130 PMCID: PMC2741458 DOI: 10.1186/1297-9686-41-40
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Figure 1Graphical representation of the crossbreeding model. Arepresents the list of haplotype classes of population M and Arepresents the list of haplotype classes of population F. Grepresents the genotypes in population M, Grepresents the genotypes in population F, and Grepresents the genotypes in the crossbred population Cross. Haplotypes for Gare associated to classes in Aand A.
Average (standard deviation) of number of distinct haplotypes in (nHap), the average fraction of heterozygous loci within individuals (% het) and fraction observed recombinant haplotypes for the Cross population (% rec).
| MAF | Populations M, F | Cross populations | |||
| nHap | % het | nHap | % het | % rec | |
| 0.01 | 2 (1) | 0.02 (0.02) | 3 (1) | 0.98 (0.02) | 0.00 (0.00) |
| 0.25 | 19 (9) | 0.20 (0.07) | 32 (6) | 0.66 (0.08) | 0.01 (0.01) |
| 0.40 | 30 (9) | 0.29 (0.06) | 50 (12) | 0.54 (0.07) | 0.02 (0.01) |
| 0.49 | 32 (8) | 0.30 (0.06) | 48 (8) | 0.49 (0.07) | 0.01 (0.01) |
nHap and %het in populations M and F represent averages of these two populations. Minor Allele Frequency (MAF) in the base populations was simulated between 0.01 and 0.49. Ten replicates were simulated for each MAF.
Effect of Concentration Parameter (α) of the Dirichlet Process on Allele Origin Accuracy (AOAc), Similarity Index (If), and the average number of haplotype classes () for 1 replicate of populations M and Cross.
| Population M | Cross population | |||||||||
| 0.98 | 0.98 | 0.98 | 0.98 | 0.97 | ||||||
| 0.91 | 0.91 | 0.93 | 0.93 | 0.92 | 0.94 | 0.94 | 0.94 | 0.94 | 0.91 | |
| 18 | 18 | 18 | 19 | 27 | 47 | 47 | 47 | 49 | 67 | |
Analyses were run assuming crossbreeding and populations M, F and Cross were used in the analyses. Base populations for M and F were simulated with Minor Allele Frequency equal to 0.40.
Average (standard deviation) Allele Origin Accuracy (AOAc) and Similarity Index (If) of haplotypes inferred for genotypes of simulated populations M and Cross.
| Population | |||
| Random-Mating | |||
| M | 0.84 (0.05) | ||
| Cross | 0.30 (0.28) | ||
| Crossbreeding | |||
| M | 0.88 (0.03) | ||
| Cross | 0.95 (0.02) | 0.87 (0.05) | |
Data were analysed assuming Random-Mating or Crossbreeding. Genotypes of simulated population F were included in the analyses when Crossbreeding was assumed. Analyses were run with α equal to 1. Ten replicates where simulated for each scenario. Base populations for M and F were simulated with Minor Allele Frequency equal to 0.40.
Average Allele Origin Accuracy (AOAc) and Similarity Index (If) of haplotypes inferred for genotypes of simulated Cross population.
| 100% Pop. M, F | 0.95 (0.02) | 0.87 (0.05) |
| 100% Pop. M, 0% Pop. F | 0.94 (0.01) | 0.84 (0.03) |
| 0% Pop. M, F | 0.44 (0.19) | 0.36 (0.21) |
Analyses were run assuming Crossbreeding and purebred populations M and F were either included or not in the analyses. Analyses were run with α equal to 1. Populations were simulated with Minor Allele Frequency in the base populations equal to 0.40. Ten replicates were simulated for each scenario.
Average (standard deviation) of Similarity Indices If for haplotypes inferred with PHASE and with the DP algorithm from genotypes of simulated populations M and Cross.
| MAF | PHASE | DP | ||
| Pop. M | Cross pop. | Pop. M | Cross pop. | |
| 0.01 | 1.00 (0.01) | 0.00 (0.00) | 1.00 (0.01) | 1.00 (0.01) |
| 0.25 | 0.93 (0.04) | 0.12 (0.28) | 0.93 (0.04) | 0.92 (0.04) |
| 0.40 | 0.86 (0.05) | 0.42 (0.30) | 0.88 (0.03) | 0.87 (0.05) |
| 0.49 | 0.90 (0.03) | 0.55 (0.25) | 0.90 (0.03) | 0.89 (0.03) |
Minor Allele Frequency in the base populations (MAF) was simulated between 0.01 and 0.49, 10 replicates were simulated for each MAF. Genotypes of simulated population F were included in the analyses with the DP algorithm. Parameter α was set equal to 1 in the analyses with DP.
Allele OriginAccuracy (AOAc) and Similarity Index (If) for haplotypes inferred with the DP algorithm assuming crossbreeding (DP), with the DP algorithm assuming random mating (DP RM) and with PHASE.
| DP CB | DP RM | PHASE | ||
| AOAc | If | If | If | |
| 19 Dams | ||||
| Cross | 0.97 | 0.93 | 0.09 | 0.93 |
| Dams | 0.92 | 0.90 | 0.86 | |
| Sires | 0.75 | 0.78 | 0.80 | |
| 63 Dams | ||||
| Cross | 0.94 | 0.87 | 0.69 | 0.86 |
| Dams | 0.84 | 0.80 | 0.83 | |
| Sires | 0.76 | 0.77 | 0.77 | |
| 109 Dams | ||||
| Cross | 0.95 | 0.91 | 0.10 | 0.91 |
| Dams | 0.84 | 0.82 | 0.81 | |
| Sires | 0.76 | 0.77 | 0.77 | |
Parameter α of the DP algorithm was set equal to 1. Data from the Commercial × Meishan crossbreeding data. Indivuals in the Dam group were from the commercial breed and individuals in the Sire group were from the Meishan breed. Parameter α was set equal to 1 in the analyses with DP.