| Literature DB >> 35886031 |
Sofia Antão-Sousa1,2,3,4, Eduardo Conde-Sousa1,5, Leonor Gusmão4, António Amorim1,2,3, Nádia Pinto1,2,6.
Abstract
Microsatellites (or short-tandem repeats (STRs)) are widely used in anthropology and evolutionary studies. Their extensive polymorphism and rapid evolution make them the ideal genetic marker for dating events, such as the age of a gene or a population. This usage requires the estimation of mutation rates, which are usually estimated by counting the observed Mendelian incompatibilities in one-generation familial configurations (typically parent(s)-child duos or trios). Underestimations are inevitable when using this approach, due to the occurrence of mutational events that do not lead to incompatibilities with the parental genotypes ('hidden' or 'covert' mutations). It is known that the likelihood that one mutation event leads to a Mendelian incompatibility depends on the mode of genetic transmission considered, the type of familial configuration (duos or trios) considered, and the genotype(s) of the progenitor(s). In this work, we show how the magnitude of the underestimation of autosomal microsatellite mutation rates varies with the populations' allele frequency distribution spectrum. The Mendelian incompatibilities approach (MIA) was applied to simulated parent(s)/offspring duos and trios in different populational scenarios. The results showed that the magnitude and type of biases depend on the population allele frequency distribution, whatever the type of familial data considered, and are greater when duos, instead of trios, are used to obtain the estimates. The implications for molecular anthropology are discussed and a simple framework is presented to correct the naïf estimates, along with an informatics tool for the correction of incompatibility rates obtained through the MIA.Entities:
Keywords: STRs; autosomes; dating; evolution; hidden mutations; microsatellites; mutation rate estimates biases
Mesh:
Substances:
Year: 2022 PMID: 35886031 PMCID: PMC9323320 DOI: 10.3390/genes13071248
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1Examples of hidden mutations occurred in a parent–child duo and a parents–child trio, at an autosomal microsatellite. The arrows and circles indicate the allele transmission involving a mutation.
Figure 2Mock allele frequencies, considering predefined distributions. The full lines correspond to the situation where 10 alleles are considered and the dotted lines to the cases with 20 alleles (narrow and wide distributions, respectively).
Rates of hidden mutations per marker (real and mock allelic distributions) and familial configuration considered (either duos or trios). One single-step mutation was simulated in one, randomly selected, parental meiosis of each of the 1,000,000 parent–child duos and parents–child trios, considering the allelic distributions of 10 autosomal STRs for the populations of Norway, Somalia, and Spain [34] and the allelic distributions of the 6 artificially generated markers. In the latter, N refers to the number of alleles in the marker.
| Markers | Duos | Trios | ||||||
|---|---|---|---|---|---|---|---|---|
| Norway | Somalia | Spain | Norway | Somalia | Spain | |||
|
|
|
|
|
|
|
|
|
|
|
| 0.249 | 0.312 | 0.325 | 0.087 | 0.128 | 0.135 | ||
|
| 0.372 | 0.348 | 0.35 | 0.163 | 0.15 | 0.147 | ||
|
| 0.497 | 0.419 | 0.393 | 0.141 | 0.125 | 0.108 | ||
|
| 0.463 | 0.526 | 0.542 | 0.224 | 0.259 | 0.271 | ||
|
| 0.334 | 0.329 | 0.328 | 0.15 | 0.145 | 0.146 | ||
|
| 0.143 | 0.178 | 0.171 | 0.054 | 0.069 | 0.065 | ||
|
| 0.452 | 0.508 | 0.521 | 0.102 | 0.227 | 0.233 | ||
|
| 0.622 | 0.508 | 0.525 | 0.119 | 0.192 | 0.198 | ||
|
| 0.445 | 0.425 | 0.435 | 0.213 | 0.205 | 0.205 | ||
|
|
| 0.696 | 0.157 | |||||
|
| 0.614 | 0.09 | ||||||
|
| 0.663 | 0.132 | ||||||
|
| 0.586 | 0.066 | ||||||
|
| 0.63 | 0.105 | ||||||
|
| 0.57 | 0.052 | ||||||
Figure 3Graphical representations of the proportion of hidden mutations per marker (upper for real population distributions and lower for mock ones, considering the indicated distribution) and familial configuration. Full lines connect the dots corresponding to the proportion of hidden mutations for each marker in duos; dotted lines connect the dots corresponding to the proportion of hidden mutations for each marker in trios. For the mock distributions, N refers to the number of alleles considered in the marker. For example, “Normal (N = 10)” refers to the mock marker designed with a normal and narrow distribution, with 10 alleles, whereas “Normal (N = 20)” refers to the mock marker designed with a normal and wider distribution, with 20 alleles.
Figure 4Graphical representations of the correlation between the frequency of hidden mutations observed per marker (upper for real population distributions and lower for mock ones, considering the indicated distribution) and markers’ heterozygosity. Orange corresponds to the correlation of hidden mutations and the heterozygosity of each marker in duos, and blue in trios. For the mock distributions, N refers to the number of alleles considered in the marker. For example, “Normal (N = 10)” refers to the mock marker designed with a normal and narrow distribution, with 10 alleles, whereas “Normal (N = 20)” refers to the mock marker designed with a normal and wider distribution, with 20 alleles. Heterozygosity was calculated as .