| Literature DB >> 23226224 |
Ulises Toscanini1, Manuel Garcia-Magariños, Gabriela Berardi, Thore Egeland, Eduardo Raimondi, Antonio Salas.
Abstract
The statistical interpretation of the forensic genetic evidence requires the use of allelic frequency estimates in the reference population for the studied markers. Differences in the genetic make up of the populations can be reflected in statistically different allelic frequency distributions. One can easily figure out that collecting such information for any given population is not always possible. Therefore, alternative approaches are needed in these cases in order to compensate for the lack of information. A number of statistics have been proposed to control for population stratification in paternity testing and forensic casework, Fst correction being the only one recommended by the forensic community. In this study we aimed to evaluate the performance of Fst to correct for population stratification in forensics. By way of simulations, we first tested the dependence of Fst on the relative sizes of the sub-populations, and second, we measured the effect of the Fst corrections on the Paternity Index (PI) values compared to the ones obtained when using the local reference database. The results provide clear-cut evidence that (i) Fst values are strongly dependent on the sampling scheme, and therefore, for most situations it would be almost impossible to estimate real values of Fst; and (ii) Fst corrections might unfairly correct PI values for stratification, suggesting the use of local databases whenever possible to estimate the frequencies of genetic profiles and PI values.Entities:
Mesh:
Year: 2012 PMID: 23226224 PMCID: PMC3511507 DOI: 10.1371/journal.pone.0049832
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Census of the Argentinean population in the provinces and population groups used in the present study.
| Population | Type | Sample size |
| % | Ref. |
| Buenos Aires | Urban | 879 | 15,653,341 | 0.01 |
|
| Neuquen | Urban | 355 | 474,155 | 0.07 |
|
| La Pampa | Urban | 232 | 299,294 | 0.08 |
|
| Santa Cruz | Urban | 132 | 196,958 | 0.07 |
|
| Tucumán | Urban | 75 | 1,338,523 | 0.01 |
|
| San Luis | Urban | 61 | 367,933 | 0.02 |
|
| Colla | Native American | 43 | 53,106 | 0.08 |
|
| Toba | Native American | 129 | 47,591 | 0.27 |
|
Buenos Aires includes Buenos Aires city and Buenos Aires province. In the most right column, the percentage of sample size relative to the census population size is shown.
Figure 1Values of Fst under different sampling schemes and considering original STR profiles obtained from different Argentinean population samples (Native American and Urban) as described in the text.
Differences between population groups.
| U | U+N | U( | U+N( | |
| Local | 72.2/59.5 | 73.8/60.7 | 68.6/54.7 | 76.7/64.5 |
| Local | 22.8 | 23.1 | 20.6 | 22.1 |
Values in the first row indicate the percentages of individuals that show significant differences in pairwise comparisons under the test of Tukey for trios (the first term is for α = 0.01, while the second term is for the Bonferroni's correction assuming 1,906 comparisons). Values in the second row show the percentages of cases where WMD values were above 0.8. ‘Local’ = indicates the local (reference) database; U = urban; U+N = urban plus Native American, U(Fst) = urban with Fst corrections, U+N(Fst) = urban plus Native American with Fst corrections.
Figure 2WMD and Tukey test P-values distributions for the 1,906 profiles obtained for the comparison between the local reference database and the four remaining scenarios considered.
(a) Each histogram represents the impact on WMD for a given pair of frequency datasets over the 1,906. (b) the curve represents the minus log10(P-values) (Tukey test) obtained for the difference between the PI values for each case. The horizontal lines represent from bottom to top the log10 values for α = 0.05, α = 0.01, and the respective values for Bonferroni corrections. Cases on the right hand side of the vertical yellow line correspond to the cases where the differences where statistically significant for α = 0.01 after Bonferroni correction.
Numbers of pairwise comparisons exhibiting a difference of a given order of magnitude using Fst corrections versus the local database.
| [1] | [2] | [3] | [4] | [5] | [6] | |
| Urban | 2 (0.1) | 103 (5.9) | 0 (0.0) | 1 (0.1) | 0 (0.0) | 0 (0.0) |
| Native American | 70 (40.7) | 5 (2.9) | 26 (15.1) | 2 (1.2) | 3 (1.7) | 0 (0.0) |
|
|
|
|
|
|
|
|
For the computation, the Urban+Native American frequency database with the corresponding Fst corrections was employed. The values indicate the number of Urban or Native trios that exhibited a PI value higher than 1, 2 and 3 orders of magnitude (Columns 2, 4 & 6) and lower than 1, 2 and 3 orders of magnitude (Columns 1, 3 & 5) using the Urban+Native American database with the corresponding Fst corrections relative to the PI values obtained with the reference database. [1]: PI(Local)(×100)>PI(Fst)>PI(Local)(×10); [2]: PI(Local)(×0.01)