| Literature DB >> 36226179 |
Dan Wen1, Hao Xing1, Ying Liu2, Jienan Li1, Weifeng Qu1, Wei He1, Chudong Wang1, Ruyi Xu1, Yi Liu1, Hongtao Jia1, Lagabaiyila Zha1.
Abstract
Paternity testing and sibling testing become more complex and difficult when samples degrade. But the commonly used genetic markers (STR and SNP) cannot completely solve this problem due to some disadvantages. The novel genetic marker microhaplotype proposed by Kidd's research group combines the advantages of STR and SNP and is expected to become a promising genetic marker for kinship testing in degraded samples. Therefore, in this study, we intended to select an appropriate number of highly polymorphic SNP-based microhaplotype loci, detect them by the next-generation sequencing technology, analyze their ability to detect degraded samples, calculate their forensic parameters based on the collected 96 unrelated individuals, and evaluate their effectiveness in paternity testing and sibling testing by simulating kinship relationship pairs, which were also compared to 15 STR loci. Finally, a short and highly polymorphic microhaplotype panel was developed, containing 36 highly polymorphic SNP-based microhaplotype loci with lengths smaller than 100 bp and A e greater than 3.00, of which 29 microhaplotype loci could not reject the Hardy-Weinberg equilibrium and linkage equilibrium after the Bonferroni correction. The CPD and CPE of these 29 microhaplotype loci were 1-2.96E-26 and 1-5.45E-09, respectively. No allele dropout was observed in degraded samples incubated with 100°C hot water for 40min and 60min. According to the simulated kinship analysis, the effectiveness at the threshold of 4/-4 reached 98.39% for relationship parent-child vs. unrelated individuals, and the effectiveness at the threshold of 2/-2 for relationship full-sibling vs. unrelated individuals was 93.01%, which was greater than that of 15 STR loci (86.75% for relationship parent-child vs. unrelated individuals and 81.73% for relationship full-sibling vs. unrelated individuals). After combining our 29 microhaplotype loci with other 50 short and highly polymorphic microhaplotype loci, the effectiveness values at the threshold of 2/-2 were 82.42% and 90.89% for relationship half-sibling vs. unrelated individuals and full-sibling vs. half-sibling. The short and highly polymorphic microhaplotype panel we developed may be very useful for paternity testing and full sibling testing in degraded samples, and in combination with short and highly polymorphic microhaplotype loci reported by other researchers, may be helpful to analyze more distant kinship relationships.Entities:
Keywords: degraded samples; forensic; microhaplotype; paternity testing; sibling testing
Year: 2022 PMID: 36226179 PMCID: PMC9549137 DOI: 10.3389/fgene.2022.983811
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
The details of the selected microhaplotype loci.
| Locus | Chr | Position (GRCh37) | SNPs ID | Extent in bp |
|---|---|---|---|---|
| mh01zha018 | 1 | 4,573,068/4,573,134 |
| 67 |
| mh01zha034 | 1 | 228,494,357/228,494,382 | rs3795795/ | 26 |
| mh02zha025 | 2 | 68,237,549/68,237,578/68,237,608 | rs11689307/rs11678194/ | 60 |
| mh02zha033 | 2 | 140,986,567/140,986,573 |
| 7 |
| mh03zha016 | 3 | 32,037,852/32,037,880 |
| 29 |
| mh04zha007 | 4 | 115,480,309/115,480,344/115,480,387 |
| 79 |
| mh04zha012 | 4 | 18,798,844/18,798,877 |
| 34 |
| mh04zha020 | 4 | 59,942,563/59,942,599 |
| 37 |
| mh04zha027a | 4 | 137,729,311/137,729,331/137,729,375 | rs11936713/rs1551708/ | 65 |
| mh04zha031 | 4 | 166,705,962/166,705,990 |
| 29 |
| mh04zha032a | 4 | 188,290,891/188,290,908/188,290,915/188,290,948 | rs3860700/ | 58 |
| mh05zha004a | 5 | 174,968,649/174,968,732 | rs2644662/ | 84 |
| mh06zha012a | 6 | 170,554,249/170,554,250/170,554,285 | rs6456186/rs6456187/ | 37 |
| mh06zha025 | 6 | 67,847,591/67,847,632 |
| 42 |
| mh06zha026 | 6 | 77,669,385/77,669,395 | rs323232/ | 11 |
| mh07zha014 | 7 | 5,156,472/5,156,492 |
| 21 |
| mh07zha018 | 7 | 11,777,712/11,777,747/11,777,769 | rs9691520/ | 58 |
| mh07zha026 | 7 | 103,539,419/103,539,452/103,539,478 | rs17157141/rs73183735/ | 60 |
| mh07zha027 | 7 | 122,708,807/122,708,851 |
| 45 |
| mh08zha007a | 8 | 4,045,817/4,045,914 |
| 98 |
| mh09zha012 | 9 | 28,320,624/28,320,672 | rs72709379/ | 49 |
| mh09zha017 | 9 | 112,618,165/112,618,187 | rs10816899/ | 23 |
| mh09zha021 | 9 | 134,212,403/134,212,428 |
| 26 |
| mh10zha010 | 10 | 15,035,571/15,035,631 | rs9732205/ | 61 |
| mh10zha020 | 10 | 126,297,208/126,297,257/126,297,261 | rs11245314/rs7079225/ | 54 |
| mh11zha006b | 11 | 124,823,941/124,823,950/124,823,981 | rs3809057/ | 41 |
| mh11zha010 | 11 | 3,479,453/3,479,464/3,479,478 | rs10834159/ | 26 |
| mh12zha012 | 12 | 713,677/713,712/713,741 |
| 65 |
| mh12zha014 | 12 | 20,553,325/20,553,347 |
| 23 |
| mh13zha003 | 13 | 59,822,210/59,822,229/59,822,240 | rs2874768/ | 31 |
| mh13zha008 | 13 | 98,640,318/98,640,386 |
| 69 |
| mh14zha003 | 14 | 72,252,135/72,252,139/72,252,142 | rs4902946/rs8012670/ | 8 |
| mh14zha008 | 14 | 22,095,528/22,095,573 | rs185847116/ | 46 |
| mh14zha010 | 14 | 77,688,982/77,689,033/77,689,054 | rs10400728/rs72728998/ | 73 |
| mh16zha013 | 16 | 58,900,991/58,901,029/58,901,035 | rs9935162/rs76244992/ | 45 |
| mh18zha010a | 18 | 844,106/844,110/844,130/844,136/844,145 |
| 40 |
The multi-allelic SNPs, are marked in bold.
Allelic frequencies and forensic parameters of the 29 microhaplotype loci based on the dataset of 96 unrelated Shandong Han Chinese from our study.
|
|
|
|
|
|
|
|
|
|
| ||||||||||
| TC | 0.24 | TC | 0.14 | AT | 0.17 | AAT | 0.40 | AT | 0.36 | TC | 0.24 | AGT | 0.11 | TA | 0.16 | TAAA | 0.05 | TC | 0.22 |
| CC | 0.34 | CT | 0.33 | TC | 0.01 | AAC | 0.18 | AC | 0.23 | CT | 0.41 | CAT | 0.21 | TG | 0.33 | TAAG | 0.01 | GA | 0.35 |
| GT | 0.11 | CC | 0.35 | CT | 0.32 | CAC | 0.17 | TC | 0.19 | CC | 0.18 | CGT | 0.19 | CG | 0.34 | TTCG | 0.01 | GC | 0.08 |
| GC | 0.30 | CG | 0.18 | CC | 0.21 | GGC | 0.26 | CC | 0.22 | GT | 0.01 | CGC | 0.39 | GG | 0.17 | TGAA | 0.02 | GG | 0.35 |
| GT | 0.29 | GC | 0.17 | CGG | 0.10 | TGAG | 0.14 | ||||||||||||
| TGCA | 0.02 | ||||||||||||||||||
| TGCG | 0.06 | ||||||||||||||||||
| CAAA | 0.47 | ||||||||||||||||||
| CAAG | 0.09 | ||||||||||||||||||
| CACG | 0.02 | ||||||||||||||||||
| CTCG | 0.03 | ||||||||||||||||||
| CGAG | 0.06 | ||||||||||||||||||
| CGCG | 0.05 | ||||||||||||||||||
| PD | 0.86 | PD | 0.87 | PD | 0.88 | PD | 0.87 | PD | 0.86 | PD | 0.85 | PD | 0.89 | PD | 0.84 | PD | 0.91 | PD | 0.83 |
| PE | 0.53 | PE | 0.46 | PE | 0.44 | PE | 0.43 | PE | 0.56 | PE | 0.55 | PE | 0.58 | PE | 0.56 | PE | 0.46 | PE | 0.46 |
| Ho | 0.76 | Ho | 0.72 | Ho | 0.71 | Ho | 0.70 | Ho | 0.78 | Ho | 0.77 | Ho | 0.79 | Ho | 0.78 | Ho | 0.72 | Ho | 0.72 |
| | 3.57 |
| 3.54 |
| 3.81 |
| 3.54 |
| 3.74 |
| 3.53 |
| 3.94 |
| 3.58 |
| 3.84 |
| 3.32 |
|
|
|
|
|
|
|
|
|
|
| ||||||||||
| ATA | 0.35 | AC | 0.37 | AC | 0.17 | AT | 0.32 | TTC | 0.22 | TC | 0.33 | AA | 0.34 | AT | 0.23 | TT | 0.25 | AA | 0.23 |
| ATC | 0.30 | TC | 0.23 | GA | 0.33 | TT | 0.23 | TGA | 0.23 | CT | 0.32 | TA | 0.29 | GT | 0.22 | TC | 0.17 | AG | 0.33 |
| ATG | 0.02 | GT | 0.10 | GC | 0.36 | CT | 0.31 | TGT | 0.38 | CC | 0.21 | TC | 0.24 | GC | 0.22 | TG | 0.39 | TG | 0.17 |
| GAG | 0.15 | GC | 0.30 | GG | 0.14 | CC | 0.14 | TGC | 0.10 | GC | 0.14 | CA | 0.13 | GG | 0.33 | CG | 0.19 | GG | 0.28 |
| GTG | 0.18 | CGA | 0.07 | ||||||||||||||||
| PD | 0.89 | PD | 0.83 | PD | 0.86 | PD | 0.87 | PD | 0.87 | PD | 0.87 | PD | 0.88 | PD | 0.89 | PD | 0.87 | PD | 0.88 |
| PE | 0.32 | PE | 0.58 | PE | 0.49 | PE | 0.47 | PE | 0.55 | PE | 0.46 | PE | 0.43 | PE | 0.38 | PE | 0.47 | PE | 0.56 |
| Ho | 0.63 | Ho | 0.79 | Ho | 0.74 | Ho | 0.73 | Ho | 0.77 | Ho | 0.72 | Ho | 0.70 | Ho | 0.67 | Ho | 0.73 | Ho | 0.78 |
| | 3.73 |
| 3.47 |
| 3.47 |
| 3.70 |
| 3.82 |
| 3.65 |
| 3.62 |
| 3.87 |
| 3.57 |
| 3.79 |
|
|
|
|
|
|
|
|
|
| |||||||||||
| AA | 0.30 | AAT | 0.26 | TTT | 0.04 | TA | 0.14 | CAT | 0.10 | TA | 0.05 | TGC | 0.48 | TTT | 0.39 | ATGGA | 0.28 | ||
| AC | 0.43 | ATC | 0.33 | TTC | 0.01 | TG | 0.19 | CCA | 0.36 | TG | 0.10 | GTG | 0.12 | TTC | 0.15 | CCAGG | 0.09 | ||
| AG | 0.18 | AGC | 0.20 | TCT | 0.11 | CA | 0.36 | CCT | 0.05 | GA | 0.26 | GGA | 0.10 | TTG | 0.07 | CCGGG | 0.39 | ||
| GA | 0.09 | GGC | 0.21 | TCC | 0.02 | GG | 0.31 | GCA | 0.21 | GC | 0.11 | GGC | 0.05 | CTC | 0.24 | GTAAG | 0.13 | ||
| TGT | 0.18 | GCG | 0.29 | GG | 0.48 | GGG | 0.24 | CCC | 0.15 | GTAGG | 0.01 | ||||||||
| GTT | 0.05 | GTGGA | 0.11 | ||||||||||||||||
| GCT | 0.14 | ||||||||||||||||||
| GGT | 0.46 | ||||||||||||||||||
| PD | 0.85 | PD | 0.88 | PD | 0.88 | PD | 0.84 | PD | 0.89 | PD | 0.83 | PD | 0.85 | PD | 0.88 | PD | 0.88 | ||
| PE | 0.39 | PE | 0.49 | PE | 0.46 | PE | 0.49 | PE | 0.38 | PE | 0.49 | PE | 0.27 | PE | 0.56 | PE | 0.51 | ||
| Ho | 0.68 | Ho | 0.74 | Ho | 0.72 | Ho | 0.74 | Ho | 0.67 | Ho | 0.74 | Ho | 0.58 | Ho | 0.78 | Ho | 0.75 | ||
| | 3.19 |
| 3.83 |
| 3.53 |
| 3.52 |
| 3.75 |
| 3.08 |
| 3.11 |
| 3.86 |
| 3.74 | ||
FIGURE 1The heatmap of A distribution of 29 microhaplotype loci in 26 populations based on the data of the 1000 Genomes Project.
FIGURE 2The distributions of Log 10 of PI, FSI, HSI, and FHSI based on the data of 15 STR loci. (A) PI; (B) FSI; (C) HSI; (D) FHSI.
The UCR values of four kinds of relationship pairs using the 15 STR loci, 29 microhaplotype loci and 79 microhaplotype loci.
| Dataset | Relationship | Max (Hd|Log10LR) | UCR(Hp|Log10LR > Max (Hd|Log10LR)) | Min (Hp|Log10LR) | UCR(Hd|Log10LR < Min (Hp|Log10LR)) |
|---|---|---|---|---|---|
| 15 STR | Parent-child vs. Unrelated | 3.51 | 86.70% | −4.79 | 98.51% |
| Full-sibling vs .Unrelated | 3.84 | 52.70% | −4.89 | 12.79% | |
| Half-sibling vs. Unrelated | 3.24 | 3.01% | −2.87 | 1.16% | |
| Full-sibling vs. Half-sibling | 3.46 | 3.43% | −2.47 | 2.15% | |
| 29 MH | Parent-child vs. Unrelated | 2.81 | 99.75% | −3.74 | 99.99% |
| Full-sibling vs. Unrelated | 4.60 | 62.04% | −4.72 | 53.38% | |
| Half-sibling vs. Unrelated | 3.77 | 1.10% | −3.49 | 1.96% | |
| Full-sibling vs. Half-sibling | 3.64 | 8.58% | −3.25 | 5.52% | |
| 79 MH | Parent-child vs. Unrelated | −57.62 | 100.00% | 5.26 | 100.00% |
| Full-sibling vs. Unrelated | 1.13 | 99.99% | −0.37 | 100.00% | |
| Half-sibling vs. Unrelated | 3.75 | 48.32% | −4.14 | 38.97% | |
| Full-sibling vs. Half-sibling | 3.74 | 71.52% | −4.34 | 51.34% |
FIGURE 5The system power based on the data of 15 STR loci, 29 microhaplotype loci and 79 microhaplotype loci for relationship parent-child vs. unrelated individuals at different threshold values. (A) Threshold t1 = 1/t2 = −1; (B) Threshold t1 = 2/t2 = −2; (C) Threshold t1 = 3/t2 = −3; (D) Threshold t1 = 4/t2 = −4.
FIGURE 6The system power based on the data of 15 STR loci, 29 microhaplotype loci and 79 microhaplotype loci for relationship full-sibling vs. unrelated individuals at different threshold values. (A) Threshold t1 = 1/t2 = −1; (B) Threshold t1 = 2/t2 = −2; (C) Threshold t1 = 3/t2 = −3; (D) Threshold t1 = 4/t2 = −4.
FIGURE 7The system power based on the data of 15 STR loci, 29 microhaplotype loci and 79 microhaplotype loci for relationship half-sibling vs. unrelated individuals at different threshold values. (A) Threshold t1 = 1/t2 = −1; (B) Threshold t1 = 2/t2 = −2; (C) Threshold t1 = 3/t2 = −3; (D) Threshold t1 = 4/t2 = −4.
FIGURE 8The system power based on the data of 15 STR loci, 29 microhaplotype loci and 79 microhaplotype loci for relationship full-sibling vs. half-sibling at different threshold values. (A) Threshold t1 = 1/t2 = −1; (B) Threshold t1 = 2/t2 = −2; (C) Threshold t1 = 3/t2 = −3; (D) Threshold t1 = 4/t2 = −4.
FIGURE 3The distributions of Log 10 of PI, FSI, HSI and FHSI based on the data of 29 microhaplotype loci. (A) PI; (B) FSI; (C) HSI; (D) FHSI.
FIGURE 4The distributions of Log 10 of PI, FSI, HSI and FHSI based on the data of 79 microhaplotype loci. (A) PI; (B) FSI; (C) HSI; (D) FHSI.