| Literature DB >> 25750707 |
Kenneth K Kidd1, William C Speed1.
Abstract
BACKGROUND: DNA sequencing is likely to become a standard typing method in forensics in the near future. We define a microhaplotype to be a locus with two or more single nucleotide polymorphisms (SNPs) that occur within a short segment of DNA (e.g., 200 bp) that can be covered by a single sequence run and collectively define a multiallelic locus. Microhaplotypes can be highly informative for many forensic questions, including detection of mixtures of two or more sources in a DNA sample, a common problem in forensic practice.Entities:
Keywords: DNA mixtures; Forensic identification; Microhaplotype; Population genetics
Year: 2015 PMID: 25750707 PMCID: PMC4351693 DOI: 10.1186/s13323-014-0018-3
Source DB: PubMed Journal: Investig Genet ISSN: 2041-2223
Figure 1Ternary plot of the probability of a qualitatively detectable mixture. The probability of having more than two haplotypes present, for a three locus system with allele frequencies of p, q, and r, is calculated for a set of genotypes from a random pair of individuals. The values range from zero along the margins with only two alleles present to the maximum at the “center” where all alleles are equally frequent.
Maximum probabilities of detecting a mixture of two random unrelated individuals for an N-allele microhap
|
|
|
|
|
|---|---|---|---|
| Three | 0.4444 | - | 0.4444 |
| Four | 0.5625 | 0.09375 | 0.65625 |
| Five | 0.5760 | 0.1920 | 0.7680 |
Maximum probabilities of detecting a mixture of two random unrelated individuals for three-, four-, and five-allele microhaps. These are the values when all alleles are equally frequent. As shown in Figure 1, the values are lower when the frequencies are not equal.
Cumulative probability of a mixture having three or more alleles at two or more loci
|
| ||||
|---|---|---|---|---|
|
|
|
|
|
|
| 3 | 0.69131 | 0.82849 | 0.90471 | 0.94706 |
| 4 | 0.88184 | 0.95938 | 0.98604 | 0.9952 |
| 5 | 0.94618 | 0.98751 | 0.99710 | 0.99933 |
Cumulative probability of a mixture having three or more alleles at two or more loci, for integral values of Ae. See text.
Figure 2Histogram of A e for the original 31 microhaps published in [ 18 ] .
Examples of 2-SNP, 3-SNP, and 4-SNP microhaplotypes with largest A e values
|
|
|
|
|
|---|---|---|---|
| Microhap048 (mh24:C14ORF43 [ | rs12717560 | 159 | 2.708 |
| rs12878166 | |||
| Microhap046 (mh22:SUDS3 [ | rs1503767 | 72 | 2.842 |
| rs11068953 | |||
| Microhap049 | rs9937467 | 59 | 2.888 |
| rs17670098 | |||
| rs17670111 | |||
| MicroHap061 | rs763040 | 146 | 3.192 |
| rs5764924 | |||
| rs763041 | |||
| MicroTetrad180 | rs12802112 | 193 | 4.008 |
| rs28631755 | |||
| rs7112918 | |||
| rs4752777 | |||
| MicroTetrad315 | rs8126597 | 145 | 4.763 |
| rs6517970 | |||
| rs8131148 | |||
| rs6517971 |
Examples of 2-SNP, 3-SNP, and 4-SNP microhaplotypes with largest Ae values characterized on our laboratory’s populations to date. The 2-SNP microhaps were published in [18] under the locus name appended to the “Provisional Locus Name” field; The microhap number indicates the number of that locus in [18] and in Figure 2.
Figure 3Haplotype frequency plots for best 2-SNP microhaps characterized to date.
Figure 4Haplotype frequency plots for best 3-SNP microhaps characterized to date.
Figure 5Haplotype frequency plots for best 4-SNP microhaps characterized to date.
Additional documented variation in Microhap048 and MicroTetrad315
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Microhap048 | |||||
| rs149195448 | 14 | 74250553 | 0.006 | ||
| rs12717560 | SNP 1 | 14 | 74250557 | 0.331 | |
| rs76446474 | 14 | 74250562 | 0.005 | ||
| rs374425620 | 14 | 74250591 | n/a | ||
| rs191001036 | 14 | 74250647 | 0.001 | ||
| rs113480934 | 14 | 74250694 | n/a | ||
| rs12878166 | SNP 2 | 14 | 74250715 | 0.377 | |
| rs12879393 | 14 | 74250730 | 0.286 | ||
| MicroTetrad315 | |||||
| rs8126597 | SNP 1 | 21 | 21880086 | 0.298 | |
| rs192464415 | 21 | 21880096 | 0.001 | ||
| rs76016088 | 21 | 21880100 | 0.027 | ||
| rs184686078 | 21 | 21880130 | 0.001 | ||
| rs138895664 | 21 | 21880157 | 0.073 | ||
| rs6517970 | SNP 2 | 21 | 21880158 | 0.444 | |
| rs202132081 | 21 | 21880159 | 0.064 | ||
| rs8131148 | SNP 3 | 21 | 21880191 | 0.320 | |
| rs6517971 | SNP 4 | 21 | 21880231 | 0.420 | |
| rs111754000 | 21 | 21880269 | n/a |
Additional documented variation in Microhap048 and MicroTetrad315. The “Clustered Allele Frequency” is the average in the 1000 Genomes data for the less frequent to vary rare allele at the SNP. The number of populations with data varies, and some have no frequency data available (n/a).