| Literature DB >> 34680954 |
Peter Gill1,2, Corina Benschop3, John Buckleton4,5, Øyvind Bleka1, Duncan Taylor6,7.
Abstract
Probabilistic genotyping has become widespread. EuroForMix and DNAStatistX are both based upon maximum likelihood estimation using a γ model, whereas STRmix™ is a Bayesian approach that specifies prior distributions on the unknown model parameters. A general overview is provided of the historical development of probabilistic genotyping. Some general principles of interpretation are described, including: the application to investigative vs. evaluative reporting; detection of contamination events; inter and intra laboratory studies; numbers of contributors; proposition setting and validation of software and its performance. This is followed by details of the evolution, utility, practice and adoption of the software discussed.Entities:
Keywords: DNAStatistX; EuroForMix; STRmixTM; probabilistic genotyping
Mesh:
Year: 2021 PMID: 34680954 PMCID: PMC8535381 DOI: 10.3390/genes12101559
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Receiver operating characteristic (ROC) plot where the rate of false positive support (FP) (horizontal axis) and true positives support (TP) (vertical axis) are plotted as a function of LR thresholds. The plot shows the results for the maximum likelihood estimation method (MLE) and the conservative method (CONS) for both LRmix and EuroForMix. The points on the curves show the FP and TP rates for different LR thresholds. Note that with this dataset, approximately 5% of samples were very low-level mixtures so that the POI was undetectable This caused the number of contributors to be underestimated, leading to very low (exclusionary) LRs. Therefore, the true positive rate does not reach 1.0 with the MLE method. Reprinted from [50], Copyright (2016), with permission from Elsevier.
Figure 2A diagram showing the evolution of probabilistic genotyping software developed by the NFI and Oslo University Hospital. Blue and orange boxes indicate qualitative and quantitative (γ) models, respectively. Green boxes are binary methods and purple boxes indicate software that include multiple types of methods.
Figure 3γ distributions for a simple case, where shape parameters = 3.312 and 8.381, respectively and the scale parameter is 86.2. The peak height expectation (µ) and Mx are shown for each contributor. The probability density function for the individual peak height contributions are derived from these curves. Reprinted from [62], chapter 7, Copyright (2020) with permission from Elsevier.
Figure 4Graphical network summarizing the connections between case samples through LR calculations. The references are the green nodes. Single contributor evidences are in cyan; two contributors are in orange and three or more contributors are in red. If the ’plotly’ function in R is used then the mouse can be hovered over a node and this displays a list of the matches, as shown for sample 30.01. The thickness of the edges between the nodes is inversely proportional to the size of the LR on a log10 scale. Reprinted from [62], chapter 11, Copyright (2020) with permission from Elsevier.
Figure 5Representation of the components involved in the analysis of a DNA profile.
Publications of conceptual components of STRmix™ modelling.
| Algorithms, Scientific Principles and Methods | Version Introduced | Reference |
|---|---|---|
| Allele and stutter peak height variability as separate constants within the MCMC | V2.0 | [ |
| Peak height variability as random variables within the MCMC | V2.3 | [ |
| Model for calibrating laboratory peak height variability | V2.0 | [ |
| Application of a Gaussian random walk to the MCMC process | V2.3 | [ |
| Modelling of back stutter by regressing stutter ratio against allelic designation | V2.0 | [ |
| Modelling of back stutter by regressing stutter ratio against LUS | V2.3 | [ |
| Modelling of forward stutter | V2.4 | [ |
| Modelling of allelic drop-in using a simple exponential or uniform distribution | V2.0 | [ |
| Modelling of allelic drop-in using a γ distribution | V2.3 | [ |
| Modelling of degradation and dropout | V2.0 | [ |
| Modelling of the uncertainties in the allele frequencies using the HPD | V2.0 | [ |
| Modelling of the uncertainties in the MCMC | V2.3 | [ |
| Database searching of mixed DNA profiles | V2.0 | [ |
| Familial searching of mixed DNA profiles | V2.3 | [ |
| Relatives as alternate contributors under the defence proposition | V2.3 | [ |
| Modelling expected stutter peak heights in saturated data | V2.3 | [ |
| Taking into account the ‘factor of two’ in | V2.3 | [ |
| Model for incorporating prior beliefs in mixture proportions | V2.3 | [ |
| Combining DNA profiles produced under different conditions into a single analysis | V2.5 | [ |
| Assigning a range for the number of contributors to a DNA profile | V2.6 | [ |
| Mixture-to-mixture comparison to identify common DNA donors | V2.7 | [ |
| A top-down DNA search approach | V2.8 | [ |
| The diagnostic outputs of | V2.3 | [ |
Publications of validation of STRmix™ models.
| Focus of Validation | Reference |
|---|---|
| Ability of | [ |
| Ability of | [ |
| Behaviour of | [ |
| Sensitivity of | [ |
| Tests to be performed when validating probabilistic genotyping, using | [ |
| Ability of individuals from different laboratories to standardise evaluations when using | [ |
| Ability of | [ |
| Ability of | [ |
| Sensitivity of | [ |
| Ability of | [ |
| Effect of mixture complexity, allele sharing and contributor proportions on the ability | [ |
| The ability of | [ |
| The sensitivity of | [ |
| Ability to use | [ |
| Testing the assumption of additivity of peak heights in | [ |
| Performance of the degradation model within | [ |
| The effect of relatedness of contributors to the | [ |
| Testing the calibration of | [ |
| Validation overviews of | [ |
| Comparison of | [ |
Figure 6Growth of STRmix™ use over eight years.