| Literature DB >> 22808085 |
Teresinha Palha1, Leonor Gusmão, Elzemar Ribeiro-Rodrigues, João Farias Guerreiro, Andrea Ribeiro-Dos-Santos, Sidney Santos.
Abstract
In a large variety of genetic studies, probabilistic inferences are made based on information available in population databases. The accuracy of the estimates based on population samples are highly dependent on the number of chromosomes being analyzed as well as the correct representation of the reference population. For frequency calculations the size of a database is especially critical for haploid markers, and for countries with complex admixture histories it is important to assess possible substructure effects that can influence the coverage of the database. Aiming to establish a representative Brazilian population database for haplotypes based on 23 Y chromosome STRs, more than 2,500 Y chromosomes belonging to Brazilian, European and African populations were analyzed. No matter the differences in the colonization history of the five geopolitical regions that currently exist in Brazil, for the Y chromosome haplotypes of the 23 studied Y-STRs, a lack of genetic heterogeneity was found, together with a predominance of European male lineages in all regions of the country. Therefore, if we do not consider the diverse Native American or Afro-descendent isolates, which are spread through the country, a single Y chromosome haplotype frequency database will adequately represent the urban populations in Brazil. In comparison to the most commonly studied group of 17 Y-STRs, the 23 markers included in this work allowed a high discrimination capacity between haplotypes from non-related individuals within a population and also increased the capacity to discriminate between paternal relatives. Nevertheless, the expected haplotype mutation rate is still not enough to distinguish the Y chromosome profiles of paternally related individuals. Indeed, even for rapidly mutating Y-STRs, a very large number of markers will be necessary to differentiate male lineages from paternal relatives.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22808085 PMCID: PMC3393733 DOI: 10.1371/journal.pone.0040007
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Map of Brazil indicating the geographical location of the 17 different sampling places located in 5 geopolitical regions.
Figure 2MDS plot based on the pairwise RST genetic distance matrix (Stress = 0.134).
Single locus and haplotype genetic diversities calculated for the 5 different regions as well as in the total sample from Brazil.
| Brazil | |||||||||
| Loci | North | Northeast | Central West | Southeast | South | Brazil | Africa | Native America | Portugal |
| DYS392 | 0,628 | 0,620 | 0,608 | 0,593 | 0,603 | 0,616 | 0,201 | 0,332 | 0,585 |
| DYS391 | 0,574 | 0,541 | 0,572 | 0,564 | 0,576 | 0,568 | 0,348 | 0,381 | 0,594 |
| DYS390 | 0,657 | 0,724 | 0,660 | 0,666 | 0,681 | 0,673 | 0,444 | 0,679 | 0,689 |
| DYS393 | 0,507 | 0,502 | 0,532 | 0,438 | 0,486 | 0,491 | 0,632 | 0,364 | 0,507 |
| DYS389 I | 0,625 | 0,580 | 0,544 | 0,489 | 0,522 | 0,576 | 0,460 | 0,424 | 0,536 |
| DYS389 II | 0,687 | 0,722 | 0,727 | 0,695 | 0,673 | 0,694 | 0,703 | 0,702 | 0,665 |
| DYS19 | 0,649 | 0,645 | 0,630 | 0,643 | 0,658 | 0,647 | 0,643 | 0,322 | 0,679 |
| DYS385 | 0,914 | 0,919 | 0,902 | 0,886 | 0,860 | 0,901 | 0,947 | 0,869 | 0,898 |
| DYS458 | 0,791 | 0,791 | 0,799 | 0,791 | 0,797 | 0,792 | 0,727 | 0,644 | 0,749 |
| DYS439 | 0,681 | 0,677 | 0,705 | 0,660 | 0,678 | 0,678 | 0,656 | 0,537 | 0,548 |
| Y-GATA H4 | 0,578 | 0,632 | 0,645 | 0,596 | 0,633 | 0,606 | 0,579 | 0,637 | 0,564 |
| DYS456 | 0,689 | 0,711 | 0,666 | 0,682 | 0,664 | 0,685 | 0,563 | 0,666 | 0,708 |
| DYS437 | 0,585 | 0,625 | 0,597 | 0,588 | 0,623 | 0,599 | 0,202 | 0,406 | 0,607 |
| DYS635 | 0,761 | 0,727 | 0,788 | 0,778 | 0,733 | 0,763 | 0,611 | 0,468 | 0,718 |
| DYS448 | 0,673 | 0,699 | 0,693 | 0,699 | 0,691 | 0,687 | 0,578 | 0,577 | 0,700 |
| DYS438 | 0,696 | 0,703 | 0,666 | 0,700 | 0,673 | 0,693 | 0,448 | 0,255 | 0,678 |
| DYS576 | 0,821 | 0,795 | 0,793 | 0,778 | 0,774 | 0,800 | 0,802 | 0,704 | 0,812 |
| DYS447 | 0,724 | 0,743 | 0,700 | 0,692 | 0,728 | 0,719 | 0,729 | 0,635 | 0,814 |
| DYS460 | 0,552 | 0,600 | 0,562 | 0,577 | 0,593 | 0,573 | 0,529 | 0,333 | 0,551 |
| Y-GATA A10 | 0,652 | 0,682 | 0,709 | 0,660 | 0,669 | 0,666 | 0,688 | 0,645 | 0,584 |
| DYS449 | 0,833 | 0,848 | 0,832 | 0,834 | 0,815 | 0,833 | 0,859 | 0,826 | 0,818 |
| DYS570 | 0,776 | 0,791 | 0,733 | 0,777 | 0,795 | 0,779 | 0,821 | 0,713 | 0,776 |
|
| |||||||||
| 09/set | 0.655 | 0.656 | 0.647 | 0.622 | 0.632 | 0.646 | 0.547 | 0.509 | 0.644 |
| 14/set | 0.701 | 0.716 | 0.706 | 0.701 | 0.705 | 0.705 | 0.628 | 0.575 | 0.688 |
| 17/set | 0.668 | 0.676 | 0.671 | 0.654 | 0.659 | 0.667 | 0.546 | 0.516 | 0.652 |
| 23/set | 0.684 | 0.694 | 0.685 | 0.672 | 0.678 | 0.684 | 0.599 | 0.551 | 0.672 |
|
| 874 | 279 | 143 | 408 | 320 | 2024 | 148 | 310 | 50 |
| 09/set | |||||||||
| No. of haplot.: | 663 | 225 | 122 | 287 | 250 | 1270 | – | – | – |
| No. unique haplot.: | 569 | 198 | 109 | 240 | 230 | 1010 | – | – | – |
| Haplot. diversity: | 0.9969 | 0.9937 | 0.9881 | 0.9908 | 0.9910 | 0.9959 | – | – | – |
| 14/set | |||||||||
| No. of haplot.: | 871 | 277 | 143 | 408 | 320 | 2002 | – | – | – |
| No. unique haplot.: | 869 | 275 | 143 | 408 | 320 | 1977 | – | – | – |
| Haplot. diversity: | 0.9988 | 0.9964 | 0.9930 | 0.9975 | 0.9969 | 0.9995 | – | – | – |
| 17/set | |||||||||
| No. of haplot.: | 862 | 274 | 143 | 401 | 318 | 1956 | – | – | – |
| No. unique haplot.: | 851 | 269 | 143 | 397 | 316 | 1899 | – | – | – |
| Haplot. diversity: | 0.9987 | 0.9963 | 0.9930 | 0.9974 | 0.9968 | 0.9994 | – | – | – |
| 23/set | |||||||||
| No. of haplot.: | 874 | 279 | 143 | 408 | 320 | 2017 | – | – | – |
| No. unique haplot.: | 874 | 279 | 143 | 408 | 320 | 2010 | – | – | – |
| Haplot. diversity: | 0.9988 | 0.9965 | 0.9930 | 0.9975 | 0.9969 | 0.9995 | – | – | – |
Probability of finding at least one mutation between two Y-STR haplotypes one to four generations apart.
| Y-STR | Mutation rate | One Generation Prob.of at least one mutation | Two Generations Prob.of at least one mutation | Three Generations Prob.of at least one mutation | Four Generations Prob.of at least one mutation | ||||
| % | 1 in: | % | 1 in: | % | 1 in: | % | 1 in: | ||
| DYS19 | 0,002299 | 0.23 | 435 | 0.46 | 218 | 0.69 | 145 | 0.92 | 109 |
| DYS389I | 0,002523 | 0.25 | 396 | 0.50 | 198 | 0.75 | 132 | 1.01 | 99 |
| DYS389II | 0,003644 | 0.36 | 274 | 0.73 | 137 | 1.09 | 92 | 1.45 | 69 |
| DYS390 | 0,002102 | 0.21 | 476 | 0.42 | 238 | 0.63 | 159 | 0.84 | 119 |
| DYS391 | 0,002599 | 0.26 | 385 | 0.52 | 193 | 0.78 | 129 | 1.04 | 97 |
| DYS392 | 0,000412 | 0.04 | 2425 | 0.08 | 1213 | 0.12 | 809 | 0.16 | 607 |
| DYS393 | 0,001045 | 0.10 | 957 | 0.21 | 479 | 0.31 | 319 | 0.42 | 240 |
| DYS385a | 0,002080 | 0.21 | 481 | 0.42 | 241 | 0.62 | 161 | 0.83 | 121 |
| DYS385b | 0,004140 | 0.41 | 242 | 0.83 | 121 | 1.24 | 81 | 1.65 | 61 |
| DYS458 | 0,006444 | 0.64 | 155 | 1.28 | 78 | 1.92 | 52 | 2.55 | 39 |
| DYS439 | 0,005214 | 0.52 | 192 | 1.04 | 96 | 1.56 | 64 | 2.07 | 48 |
| GATA H4 | 0,002434 | 0.24 | 411 | 0.49 | 206 | 0.73 | 137 | 0.97 | 103 |
| DYS456 | 0,004243 | 0.42 | 236 | 0.85 | 118 | 1.27 | 79 | 1.69 | 59 |
| DYS437 | 0,001226 | 0.12 | 816 | 0.25 | 408 | 0.37 | 272 | 0.49 | 204 |
| GATA C4/DYS635 | 0,003467 | 0.35 | 288 | 0.69 | 144 | 1.04 | 96 | 1.38 | 72 |
| DYS448 | 0,001571 | 0.16 | 637 | 0.31 | 319 | 0.47 | 213 | 0.63 | 160 |
| DYS438 | 0,000306 | 0.03 | 3269 | 0.06 | 1635 | 0.09 | 1090 | 0.12 | 818 |
| DYS576 | 0,014300 | 1.43 | 70 | 2.84 | 35 | 4.23 | 24 | 5.60 | 18 |
| DYS447 | 0,002120 | 0.21 | 472 | 0.42 | 236 | 0.63 | 158 | 0.85 | 118 |
| DYS460 | 0,006220 | 0.62 | 161 | 1.24 | 81 | 1.85 | 54 | 2.46 | 41 |
| GATA A10 | 0,003320 | 0.33 | 301 | 0.66 | 151 | 0.99 | 101 | 1.32 | 76 |
| DYS449 | 0,012200 | 1.22 | 82 | 2.43 | 41 | 3.62 | 28 | 4.79 | 21 |
| DYS570 | 0,012400 | 1.24 | 81 | 2.46 | 41 | 3.67 | 27 | 4.87 | 21 |
| 09-Set | 0.0023 | 2.07 | 48 | 4.09 | 24 | 6.07 | 16 | 8.01 | 12 |
| 14-Set | 0.0054 | 7.29 | 14 | 14.04 | 7 | 20.31 | 5 | 26.12 | 4 |
| 17-Set | 0.0027 | 4.48 | 22 | 8.75 | 11 | 12.84 | 8 | 16.74 | 6 |
| 23/set | 0.0042 | 9.20 | 11 | 17.55 | 6 | 25.14 | 4 | 32.03 | 3 |
| 69-Set | 0.01 | 50.016 | |||||||
| 34-Set | 0.02 | 49.686 | |||||||
| 916-Set | 0.01 | 99.990 | |||||||
| 458-Set | 0.02 | 99.990 | |||||||
Single locus mutation rates are according to YHRD (www.yhrd.org) and Ballantyne et al. [35]; haplotype mutation rates for the different sets have been calculated as the average mutation of the corresponding.