| Literature DB >> 31727926 |
Hussain M Alsafiah1,2, Ali A Aljanabi3, Sibte Hadi3, Saleh S Alturayeif4, William Goodwin5.
Abstract
Short tandem repeat (STR) profiling has been routinely used in kinship testing since the introduction of commercial kits in the mid-1990s. While 15 to 23 STR loci normally give definitive results in simple kinship testing, additional loci are sometimes required to resolve complex cases. The SureID 23comp Human Identification Kit, recently released by Health Gene Technologies (China), multiplexes amelogenin and 22 autosomal STRs, 17 of which are non-CODIS STRs. This enables the profiling of 38-40 loci when used in conjunction with widely used commercial kits. In this study, the kit was evaluated for kinship applications as a supplementary STR kit following the minimum criteria for validation recommended by the European Network of Forensic Science Institutes (ENFSI) and the Scientific Working Group on DNA Analysis Methods (SWGDAM) using 500 samples. Performance was comparable with other commercial kits demonstrating: repeatability and reproducibility; precision (maximum s.d. 0.1048 nt); accuracy, all alleles were within ±0.41 nt compared to the actual sizes; heterozygous peak balances at all loci >68%; stutter ratios ranged from 3.8% to 16.15%; full profiles were generated with 125 pg DNA (95.12% of alleles at 62 pg),; and we found 100% concordance over 5 common STRs with the GlobalFiler kit.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31727926 PMCID: PMC6856129 DOI: 10.1038/s41598-019-52838-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
STR Markers included in the SureID 23comp Kit.
| Chr. | SureID 23comp | Investigator HDplex STRs(a,b) | PowerPlex CS7 | ||
|---|---|---|---|---|---|
| STRs(a) | Location (GRCh38) | Repeat structure* | |||
| 1 | D1S1656a | 230769616–230769683 | CCTA [TCTA]n | F13B | |
| 2 | D2S441a | 68011947–68011994 | [TCTA]n | D2S1360 | |
| 3 | D3S1744 | 147374752–147374828 | [ATAG]n | D3S1744 | |
| 4 | D4S2366 | 6483114–6483172 | [GATA]n [GATT]n [GATA]n | D4S2366 | |
| 5 | D5S2800 | 59403132–59403199 | [GGTA]n [GACA]n [GATA]n [GATT]n | D5S2500 | |
| 6 | D6S474 | 112557951–112558018 | [AGAT]n [GATA]n | D6S474, SE33b | F13A01 |
| 7 | D7S3048 | 21227099–21227174 | [TATC]n [TACC]n [CACC]n | D7S1517 | |
| 8 | D8S1132 | 106316692–106316774 | [TCTA]n | D8S1132 | LPL |
| 9 | D9S1122 | 77073826–77073873 | [TAGA]n | Penta C | |
| 10 | D10S1248a | 129294244–129294295 | [GGAA]n | D10S2325 | |
| 11 | D11S2368 | 19259601–19259684 | [TATC]n [TGTC]n [TATC]n | ||
| 12 | D12S391a | 12297020–12297095 | [AGAT]n [AGAC]n AGAT | D12S391a | |
| 13 | D13S325 | 42599304–42599382 | [TCTA]n | ||
| 14 | D14S1434 | 94842054–94842105 | [CTGT]n [CTAT]n | ||
| 15 | D15S659 | 46081911–46081966 | [TATC]n | FESFPS, Penta E | |
| 16 | D16S539a | 86352702–86352745 | [GATA]n | ||
| 17 | D17S1301 | 74684855–74684902 | [AGAT]n | ||
| 18 | D18S1364 | 65732998–65733056 | [TAGA]n TACA [TAGA]n | D18S51a | |
| 19 | D19S253 | 15617484–15617531 | [ATCT]n | ||
| 20 | D20S482 | 4525692–4525747 | [AGAT]n | ||
| 21 | D21S2055 | 39819508–39819649 | [CTAT]n CTAA [CTAT]n | D21S2055 | Penta D |
| 22 | D22GATA198B05 | 17169811–17169882 | CTCT [ATCT]n [ACCT]n | ||
This table shows the locations (GRCh38) and repeat structures of the 22 STRs included in the SureID 23comp Kit. Five loci are common with the CODIS and the ESS. Twelve loci are not included in other available supplementary kits (Investigator HDplex and PowerPlex CS7). All information was adopted from[17,18,39]. As proposed by Phillips[19], the D5S2500 locus name is corrected to D5S2800 (Please see Note S1).
aCODIS and ESS locus.
bGermany core locus.
*Lowercase and italic nucleotides are not counted in allele nomenclature system[39].
Figure 1Sensitivity and stochastic tests for the SureID 23comp Kit. Serial dilutions (500, 250, 125, 62, and 31) pg were prepared from the 2800 M control DNA (Promega Corporation). Each test was done on five replicates and the sample with the highest number of detected alleles are shown. Each cell represents an allele and merged cells represent homozygote loci in the 2800 M. Green cells identify detected alleles with ≥60% peak balance ratios. Yellow cells identify detected alleles with <60% peak balance ratios. Red cells represent undetected alleles with threshold of 50 RFU/150 RFU for heterozygotes/homozygotes.
Figure 2Peak balance ratios study for the D21S2055 locus. This figure shows a study of the correlation between the size differences between heterozygous alleles and the peak balance ratio for the D21S2055 locus using data of 500 samples. The peak ratios of all genotypes that have the same size difference (nt) (e.g. the genotypes 13, 17; 14, 18; and 15, 19 have the same size difference of 4 nt) were averaged and are represented by the black dots. The blue line shows the smoothed mean of the peak ratios. Heterozygote alleles with >50 nucleotides differences showed peak ratios <45%.
Figure 3Precision study of the SureID 23comp Kit. The figure shows standard deviation (s.d) values of the fragment sizes of 22,981 alleles generated from 500 samples tested by the SureID 23comp. The highest s.d. was observed in allele 21 at D7S3048 (0.1048 nt).
Figure 4Accuracy study of the SureID 23comp Kit. The average of the size values of each allele in the data of the 500 samples and in 21 allelic ladders were compared to the actual sizes of the corresponding allele (actual sizes provided by the manufacturer). The size differences per nucleotides were calculated and are represented by the coloured dots. All alleles fell within the range of ±0.41 nt of the allelic widow; the largest differences were seen at D6S474 allele 17 (0.4096 nt) and D7S3048 allele 26 (−0.4084 nt).
Alleles not represented by the allelic ladder of SureID 23comp Kit detected in the population of Saudi Arabia.
| STRs | Allele | frequency | STRs | Allele | frequency | ||
|---|---|---|---|---|---|---|---|
| Saudi | Ningbo | Saudi | Ningbo | ||||
| D18S1364 | 11 | 0.001 | 0.002 | D13S325 | 26.3 | 0.002 | 0 |
| D1S1656 | 7 | 0.001 | 0 | 27.3 | 0.001 | 0 | |
| 8 | 0.001 | 0 | D8S1132 | 13.1 | 0.001 | 0 | |
| 10 | 0.004 | 0.001 | 15 | 0.003 | 0 | ||
| 14.3 | 0.002 | 0 | D7S3048 | 30 | 0.001 | 0 | |
| 15.3 | 0.040 | 0 | D2S441 | 8.3 | 0.001 | 0 | |
| 16.3 | 0.061 | 0.007 | 9 | 0.005 | 0 | ||
| 18 | 0.003 | 0.011 | 11.3 | 0.066 | 0 | ||
| 19.3 | 0.006 | 0.003 | 13.3 | 0.001 | 0 | ||
| 20.3 | 0.001 | 0.002 | D19S253 | 6 | 0.004 | 0 | |
| D9S1122 | 7 | 0.001 | 0 | 16 | 0.001 | 0 | |
| D4S2366 | 16 | 0.002 | 0 | D22GATA198B05 | 11.2 | 0.001 | 0 |
| D3S1744 | 12 | 0.001 | 0 | 12 | 0.004 | 0 | |
| D12S391 | 18.3 | 0.005 | 0 | D6S474 | 10 | 0.001 | 0 |
| 19.1 | 0.001 | 0 | D14S1434 | 16 | 0.004 | 0.004 | |
| 19.3 | 0.004 | 0 | D15S659 | 6 | 0.001 | 0 | |
| 27 | 0.003 | 0.003 | 7 | 0.003 | 0 | ||
Thirty-four alleles were detected at 15 STRs that are not represented by the allelic ladder of the kit. It shows also the frequency of these alleles in Ningbo population (data provided by the Health Gene Technologies) and in the population of Saudi Arabia. The frequencies of detected alleles in the population of Saudi Arabia ranged from 0.001 (one observation) to 0.066 (66 observations).
Figure 5Alleles outside the allelic ladder windows of the SureID 23comp Kit. This figure shows ten alleles observed in the population of Saudi Arabia that are not represented in the allelic ladder and were situated outside the designated widow of their loci: (a) Alleles 7 and 8 at D1S1656; (b) Alleles 26.3 and 27.3 at D13S325; (c) Allele 30 at D7S3048; (d) Allele 16 at D4S2366; (e) Allele 12 at D3S1744; (f) Allele 10 at D6S474; (g) Alleles 6 and 7 at D15S659. Allele 7 at D1S1656 (a) was situated under the designated area of D18S1364 locus.
LD test results for 18 pairs of syntenic loci formed when combining loci included in the SureID 23comp and GlobalFiler kits.
| Chr. | Syntenic Pair | LD test | HapMap proxy SNP position (GRCh37)b | Cumulative genetic map distance in cMb | Genetic map distance in cM | RFs from Kosambi mapping functiona–c | RFs estimated based on family studiesd–f |
|---|---|---|---|---|---|---|---|
| Chr.2 | TPOX | 0.97764 | 1493487 | 1.6661 | 88.8129 | 0.4721a | 0.53d |
| D2S441 | 68239020 | 90.47903 | |||||
| Chr.2(p-q) | D2S1338 | 0.99164 | 218879435 | 223.4832 | 133.0042 | 0.4951c | 0.58d |
| D2S441 | 68239020 | 90.47903 | |||||
| Chr.2(p-q) | TPOX | 0.79338 | 1493487 | 1.6661 | 221.8171 | 0.4999c | 0.51d |
| D2S1338 | 218879435 | 223.4832 | |||||
| Chr.3(p-q) | D3S1358 | 0.99193 | 45582627 | 67.1789 | 90.0624 | 0.4735a | 0.64d |
| D3S1744 | 147092143 | 157.24131 | |||||
| Chr.4(p-q) | FGA | 0.99771 | 155508100 | 156.81293 | 143.8662 | 0.4968c | 0.51d |
| D4S2366 | 6484806 | 12.9467 | |||||
| Chr.5 | D5S2800 | 0.15198 | 58698677 | 70.3208 | 56.3520 | 0.4050b | N/A |
| D5S818 | 123111652 | 126.67284 | |||||
| Chr.5 | D5S2800 | 0.85646 | 58698677 | 70.3208 | 84.1132 | 0.4666c | N/A |
| CSF1PO | 149455757 | 154.43395 | |||||
| Chr.5 | D5S818 | 0.69008 | 123111652 | 126.67284 | 27.7611 | 0.2522a | 0.18d |
| CSF1PO | 149455757 | 154.43395 | |||||
| Chr.6 | SE33 | 0.99963 | 88986609 | 95.44921 | 23.2133 | 0.2168a | 0.19d |
| D6S474 | 112879893 | 118.66248 | |||||
| Chr.7(p-q) | D7S3048 | 0.98684 | 21266723 | 36.14071 | 64.0605 | 0.4284b | 0.4997e |
| D7S820 | 83789257 | 100.2012 | |||||
| Chr.8 | D8S1132 | 0.23577 | 107330479 | 119.96228 | 16.4809 | 0.1591a | 0.1443e |
| D8S1179 | 125907927 | 136.44313 | |||||
| Chr.11 | TH01 | 0.20421 | 2192166 | 4.48933 | 28.3996 | 0.2569b | 0.2152e |
| D11S2368 | 19281171 | 32.88891 | |||||
| Chr.12 | vWA | 0.89307 | 6093924 | 15.63031 | 11.9410 | 0.1172a | 0.1259e |
| D12S391 | 12450501 | 27.57129 | |||||
| Chr.13 | D13S325 | 0.97422 | 43173444 | 44.90825 | 34.9225 | 0.3016b | 0.2533e |
| D13S317 | 82721723 | 79.83074 | |||||
| Chr.18 | D18S51 | 0.04312 | 60949983 | 88.92051 | 2.2970 | 0.0229b | 0.0327e |
| D18S1364 | 63400151 | 91.21746 | |||||
| Chr.19(p-q) | D19S433 | 0.78742 | 30417603 | 51.72618 | 12.4538 | 0.1228b | 0.1101f |
| D19S253 | 15728103 | 39.27234 | |||||
| Chr.21 | D21S11 | 1 | 20554558 | 14.64555 | 34.8192 | 0.3010a | 0.32d |
| D21S2055 | 41191871 | 49.46478 | |||||
| Chr.22 | D22S1045 | 0.98893 | 37535663 | 46.21362 | 38.8178 | 0.3253b | N/A |
| D22GATA198B05 | 17651831 | 7.39585 | |||||
| Chr.15* | Penta E | — | 97377441 | 124.05054 | 74.5331 | 0.4517b | N/A |
| D15S659 | 46371620 | 49.51748 | |||||
| Chr.21* | Penta D | — | 45056178 | 59.37591 | 9.9111 | 0.0978b | N/A |
| D21S2055 | 41191871 | 49.46478 |
This test was performed for data generated from the 500 samples from Saudi population for the 38 loci. No significant deviation from linkage equilibrium (LE) was detected (P-value = 0.003). RFs that previously derived from HapMap data and from family studies were reviewed from literature[4,12,21,22,24]. RFs derived from HapMap data for D2S1338-D2S441, TPOX-D2S1338, FGA-D4S2366 and D5S2800-CSF1PO pairs, were calculated based on cumulative genetic map distance provided in Phillips[12] and using Excel tool developed by Phillips et al.[4].
aData reviewed from Phillips et al.[4].
bData reviewed from Phillips[12].
cRFs were calculated based on cumulative genetic map distance provided in Phillips[12] and using an excel tool developed by Phillips et al.[4].
dData reviewed from Westen et al.[21].
eData reviewed from Liu et al.[22].
fData reviewed from Wu et al.[24].
*Potential syntenic pairs when combining the SureID 23comp with kits that include the Penta E and Penta D loci (LD was not tested).
N/A: No data available from family studies.