| Literature DB >> 35052480 |
Abstract
The inference of ancestry has become a part of the services many forensic genetic laboratories provide. Interest in ancestry may be to provide investigative leads or identify the region of origin in cases of unidentified missing persons. There exist many biostatistical methods developed for the study of population structure in the area of population genetics. However, the challenges and questions are slightly different in the context of forensic genetics, where the origin of a specific sample is of interest compared to the understanding of population histories and genealogies. In this paper, the methodologies for modelling population admixture and inferring ancestral populations are reviewed with a focus on their strengths and weaknesses in relation to ancestry inference in the forensic context.Entities:
Keywords: ancestry; biostatistics; classification; clustering; distance based; hypothesis tests; likehood
Mesh:
Substances:
Year: 2022 PMID: 35052480 PMCID: PMC8774801 DOI: 10.3390/genes13010141
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Plot of the first three PCs from the PCA of the reference samples.
Figure 2Barplots of the STRUCTURE admixture components () for the reference samples from six metapopulations. There is a clear visual difference between the admixture components for the six metapopulations.
Figure 3Boxplots of the Brier scores, , with in the STRUCTURE analysis and j running through the metapopulations of [58]. Reference: Samples constituting the metapopulations (and used to calculate ). Concordant: A sample is accepted in the metapopulation coinciding with its sampling region (with a likelihood value significantly larger than any other metapopulation). Discordant: A sample is accepted in a metapopulation different than its sampling region (with a likelihood value significantly larger than any other metapopulation). Ambiguous: A sample’s likelihood value is not significantly different in two or more of the accepted metapopulations. Rejected: A sample is rejected in all metapopulations. The concordant test samples (green) have similar Brier scores as those of the reference samples (grey). Discordant samples (red) tend to have the largest Brier scores, followed by the rejected samples (blue) with the ambiguous (yellow) in between (see text for definitions). The horizontal dashed and dotted lines in the top of the plot indicates and , respectively ([26], inspired by Supplementary Figure S8).