Literature DB >> 34302451

Variants in linkage status at D5S818 detected by multiple STR kits comparison and Sanger sequencing.

Chengchen Shao1, Yining Yao1, Xinwei Pan2, Mengde Wu2, Beilei Zhang3, Hongmei Xu1, Jianhui Xie1, Kuan Sun4,1.   

Abstract

BACKGROUND: D5S818 discrepancies have been reported in forensic parental testing due to null alleles. However, more cases may be ignored since proportional null alleles were missed without detection of heredity discrepancy between parents and offspring.
RESULTS: In this study, null allele 12 at D5S818 was detected by the PowerPlex® 21 System with a higher occurrence rate on the basis of review on 2824 samples from the 1282 routine cases in Chinese Han population. Sequencing results revealed novel variant of guanine (G) into adenine (A) in the 7th [AGAT] repeats in the core repeat region accompanied by rs1187948322 in the samples with null allele 12.
CONCLUSIONS: Forensic STR typing may benefit from this discovery: (1) primer design of CE profiling system could be improved for sensitive population and (2) polymorphic information could be enriched for the accuracy and precision of NGS genotyping system. Peak area of D5S818 was also analyzed through different commercial STR kits. It is suggested that more attention should be paid on observed homozygosity with reduced peak area, especially for the samples from Chinese Han population.
© 2021 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals LLC.

Entities:  

Keywords:  D5S818; STR typing; linkage status; null alleles; variant

Mesh:

Substances:

Year:  2021        PMID: 34302451      PMCID: PMC8457698          DOI: 10.1002/mgg3.1765

Source DB:  PubMed          Journal:  Mol Genet Genomic Med        ISSN: 2324-9269            Impact factor:   2.183


INTRODUCTION

Short tandem repeats (STRs) are widely used in forensic applications of individual identification and paternity testing for its high polymorphism (Butler, 2011). Discrepancies have been reported in concordance studies when the same sample was genotyped with different STR kits (W. Chen et al., 2014; Li et al., 2014; Mizuno et al., 2008; Ricci et al., 2007; Tsuji et al., 2010). Null alleles, which also called silent alleles, may lead to mismatch between parents and children since defective amplification resulted from variations at primer binding sites. Point mutations or insertion or deletion (InDels) at the flanking regions of an STR locus may potentially affect annealing and/or elongation of primers in the PCR procedure, resulting in the dropout of one or both of the alleles. However, null alleles may be ignored in case the sister chromatid with the normal allele instead of that with the null allele was inherited to the offspring. Therefore, the probability of occurrence for null alleles are often underestimated. On the other hand, current studies are mainly aimed at solving the problem of Mendelian discrepancy across generations. Therefore, most researches focused on consistency study among different detection systems and sequence validation in the flanking regions. However, sequence characteristics and the relationship between the flanking variants and the core repeat region of the STR locus needs further exploration. Linkage study is considered as an excellent tool for the validation of GWAS results because of the additional genetic information reflected by the linkage status (Wen et al., 2014). While in the forensic science, the linkage information of the adopted makers may provide potential clues for forensic investigation, such as the inference of biogeographic ancestry (Gattepaille & Jakobsson, 2012). Furthermore, specifically to the sequence polymorphism of forensic makers detected by NGS platform, the explicit linkage relationship for a target population could enrich the database for forensic practice. Null alleles at D5S818 were more often observed, especially in Chinese Han population (Chen et al., ,2014, 2015). To date, five sequence variations at primer binding region of D5S818 locus including rs576058164 (Jiang et al., 2011), rs182073376 (Alves et al., 2003; Delamoye et al., 2004), rs25768 (Edwards & Allen, 2004; Fujii et al., 2016), rs951218455 (Chen et al., 2014), and rs1187948322 (Chen et al., 2015; Yao et al., 2018) have been reported to cause discrepancies in paternity testing since null alleles. In this study, null allele 12 was detected in D5S818 with a higher occurrence rate from 1282 routine forensic cases by the PowerPlex® 21 System. A novel variant from Chinese Han population named as “D5S818[CE12]‐Chr5‐GRCh38 123775556–123775599 [ATCT]12123775578‐A; 123775689‐T” was identified based on Sanger sequencing validation. Besides, Expressmarker® 22 PCR amplification kit, Identifiler® Plus PCR Amplification Kit. and PowerPlex® Fusion System were employed, respectively, for comparison study. Exploration on the sequence characteristics of D5S818 at the core repeat sequence and the flanking region was conducted to further enrich the polymorphic information of STRs for forensic genetics.

MATERIALS AND METHODS

DNA samples

A total of 1282 routine cases (including 26 individual identifications, 286 father–mother–child trios, 734 father–child duos, and 236 mother–child duos) from Chinese Han population were included in this study. Written informed consent from all individuals was obtained before sample collection. Ethics Approval was obtained from the ethics committee of School of Basic Medical Sciences, Fudan University. Genomic DNA was extracted from FTA card using a QIAamp® DNA Investigator Kit (Qiagen). DNA quantification was performed using a Qubit 3 Fluorometer together with Qubit dsDNA HS Assay Kit (Thermo Fisher).

STR genotyping

STR loci such as D3S1358, D1S1656, D6S1043, D13S317, Penta E, D16S539, D18S51, D2S1338, CSF1PO, Penta D, TH01, vWA, D21S11, D7S820, D5S818, TPOX, D8S1179, D12S391, D19S433 and FGA in the PowerPlex® 21 System (PP21, Promega), D3S1358, D13S317, D7S820, D16S539, Penta E, D2S441, TPOX, TH01, D2S1338, CSF1PO, Penta D, D10S1248, D19S433, vWA, D21S11, D18S51, D6S1043, D8S1179, D5S818, D12S391 and FGA in the Expressmarker® 22 PCR amplification kit (EX22, AGCU), D8S1179, D21S11, D7S820, CSF1PO, D3S1358, TH01, D13S317, D16S539, D2S1338, D19S433, vWA, TPOX and D18S51 in the Identifiler® Plus PCR Amplification Kit (ID+, ThermoFisher Scientific), and D3S1358, D1S1656, D2S441, D10S1248, D13S317, Penta E, D16S539, D18S51, D2S1338, CSF1PO, Penta D, TH01, vWA, D21S11, D7S820, D5S818, TPOX, DTS391, D8S1179, D12S391, D19S433, FGA and D22S1045 in the PowerPlex® Fusion System (Fusion, Promega) were genotyped strictly as recommended by respective manufacturers in this study. The polymerase chain reaction (PCR) was performed by Mastercycler® nexus GSX1 (Eppendorf) according to the manufacturers’ recommendations. PCR products were separated by capillary electrophoresis in an Applied Biosystems 3130xL Gene Analyzer (ThermoFisher Scientific). Allele designation was determined according to allelic ladders by using the GeneMapper® ID v3.2 or ID‐X v1.4 (ThermoFisher Scientific).

PCR amplification and DNA sequencing

PCR primers were designed on the basis of the GenBank D5S818 sequence (Accession No. AC008512.8): forward (FP): 5′‐ACTTTGAGCTATTAGGCATGGGAGAG‐3′ (26mer); and reverse (RP): 5′‐GCCTGTATAGTCATGTCCCTCTGTGTAG‐3′ (28mer). The thermal cycling parameters were enzyme activation at 95°C for 30s, followed by 30 cycles of denaturation at 95°C for 30 s, annealing at 60°C for 30 s and extension at 72°C for 1min, with a final extension at 72°C for 10 min. The PCR products were separated on nondenaturing polyacrylamide gel (T = 6%, C = 3.3%) and purified by the QIAquick1 Gel Extraction Kit (Qiagen). The separated allele fragments were cloned as per the standard procedure. Positive clones were selected and sequenced separately by sanger sequencing. Alignments of the sequences were performed with the reference sequence for verification.

Data analysis

Peak areas were analyzed for the suspected samples with the null alleles at locus D5S818. Peak areas of all 601 samples detected by PP21 were exported as combined table by GeneMapper® software and the ratios of peak areas between D5S818 and D7S820 (the locus next to D5S818) were calculated as follows:

Quality control

The main experiments were conducted at the Forensic Genetics Laboratory of Fudan University, P.R. China, in accordance with quality control measures. All methods were carried out in accordance with the approved guidelines of Fudan University, P.R. China. The laboratory has been accredited by the China National Accreditation Service for Conformity Assessment (CNAS) which is also approved by the International Laboratory Accreditation Cooperation (ILAC).

RESULTS

Concordance study of D5S818 among various profiling systems

Genotyping profiles of 2824 samples from the 1282 routine cases with PP21 System were reviewed. Amplification kits EX22, ID+, and Fusion were also employed on 601 samples with observed homozygosity on D5S818 for comparison study and validation. When amplified and analyzed with PP21 only, seven samples from three cases show heredity discrepancy between parent and offspring with the observed occurrence rate of 1.165%. However, discordant alleles were detected in 11 samples at D5S818 (occurrence rate 1.830%) with EX22. Four more samples were detected with null alleles at D5S818 except for the result from PP21. To confirm this results, ID+ and Fusion were also adopted for further comparison. Genotypes from ID+ were consistent with that from EX22 at locus D5S818, and the genotypes from Fusion supported the results from PP21. No discordant was detected in other overlapped STR loci. The genotyping results of D5S818 for the 11 suspected samples were listed in Table 1, and the electropherograms of one sample named C5C2 were displayed in Figure 1 as a representation. In the electropherograms analyzed with PP21 and Fusion, homozygous allele 11 was detected with deficient peak height at locus D5S818 when comparing with adjacent locus (as shown in Figure 1a,d). Profiling results of the same sample with EX22 and ID+ (as shown in Figure 1b,c) revealed that, genotype of D5S818 were heterozygous alleles 11 and 12 with balanced peak height and peak area. Besides, a tiny peak was observed in the position of allele 12 in the electropherogram from the PowerPlex® Fusion System. It cannot be recognized by GeneMapper since neither the peak height nor the peak area could meet the detection threshold.
TABLE 1

Genotypes of D5S818 from 11 suspected samples using different kits

SampleGenderCategoryPP21EX22Id+Fusion
C1AFMAlleged Father11,‐11,1211,1211,‐
C1CFChild10,‐10,1210,1210,‐
C2AFMAlleged Father10,‐10,1210,1210,‐
C3AFMAlleged Father9,‐9,129,129,‐
C4AFMAlleged Father11,‐11,1211,1211,‐
C5AFMAlleged Father9,‐9,129,129,‐
C5C1FChild11,‐11,1211,1211,‐
C5C2FChild11,‐11,1211,1211,‐
C6CFChild13,‐12,1312,1313,‐
C7AMFAlleged Mother10,‐10,1210,1210,‐
C7CFChild11,‐11,1211,1211,‐
FIGURE 1

Electropherograms of D5S818 from sample C5C2 using four different multiplex kit. The fluorescence channels with D5S818 were shown for comparison. (a) Electropherograms of C5C2 by using PP21; (b) Electropherograms of C5C2 by using EX22; (c) Electropherograms of C5C2 by using ID+; (d) Electropherograms of C5C2 by using Fusion

Genotypes of D5S818 from 11 suspected samples using different kits Electropherograms of D5S818 from sample C5C2 using four different multiplex kit. The fluorescence channels with D5S818 were shown for comparison. (a) Electropherograms of C5C2 by using PP21; (b) Electropherograms of C5C2 by using EX22; (c) Electropherograms of C5C2 by using ID+; (d) Electropherograms of C5C2 by using Fusion

Validation through clone sequencing with plasmid vector

Sanger sequencing was adopted to validate the variant sequence at locus D5S818. Amplicon with target DNA fragment from PCR products was loaded into plasmid vector and sequenced with clone sequencing. As a result, the genotypes of D5S818 from the 11 suspected samples were all sequenced as heterozygosity with a shared allele 12, which was not detected by PP21, in accordance with the profiling result using EX22 and ID+. Furthermore, a variant in the core repeat region was detected in all of the clones with null allele 12. According to STRBase (Ruitberg et al., 2001), D5S818 is defined as a [AGAT]n simple repeats locus based on GenBank top strand. However, the core region of the null allele 12 were all sequenced as [AGAT]6AAAT[AGAT]5 rather than [AGAT]12, as shown in Figure 2. Additionally, the null allele 12 from the 11 suspected samples present a C>A transversion at 90 bp upstream the 5′ end of the repeat region when comparing with the sequence of common allele 12 (as shown in Figure 2). This point mutation has been termed as rs1187948322 in dbSNP. The two variants were also reported in the Genome Aggregation Database (gnomAD), but the linkage situation needs further verification (Lek et al., 2016). In our study, allele 12 with variant at rs1187948322 always show [AGAT]6AAAT[AGAT]5 instead of [AGAT]12 in the core repeat region of D5S818 locus. Latent linkage was supposed between these two variants, which may lead to erroneous decision when genotyping STR loci with CE.
FIGURE 2

The chromatogram of D5S818 based on Sanger Sequencing. (a) The chromatogram of normal allele 12 from a common sample; (7) The chromatogram of null allele 12 from sample C5C2

The chromatogram of D5S818 based on Sanger Sequencing. (a) The chromatogram of normal allele 12 from a common sample; (7) The chromatogram of null allele 12 from sample C5C2

Peak area analysis in D5S818

Further investigation on the peak area was performed on D5S818. The ratios of peak areas between D5S818 and D7S820 (the locus next to D5S818) were calculated. And the relationship of the null alleles occurrence and the Ra values was analyzed, as shown by the violin plot in Figure 3. Ra of the common samples ranged from 0.799 to 1.556 while Ra of the 11 suspected samples with null alleles ranged from 0.504 to 0.684. A significant difference was revealed between them (p < 0.0001). Null alleles occur with a relatively high probability in samples where Ra values are appreciably lower, as the result of the reduced peak area of the observed homozygous peak in null alleles.
FIGURE 3

The violin plot for Ra analysis. The value of Ra was impacted by the presence of null alleles

The violin plot for Ra analysis. The value of Ra was impacted by the presence of null alleles

DISCUSSION

In the concordance study of D5S818 among various profiling systems, four additional samples were detected with null alleles at locus D5S818 by re‐genotyping, which may be easily ignored without extra genotyping since no discrepancy was detected between parent and offspring. The frequency of the null allele 12 at D5S818 among unrelated individuals was estimated as 0.1592% (4/2512) considering all reviewed samples in this study. Nevertheless, the frequency for the same null alleles in another Chinese Han population in Chen et al.’s study was about 0.0634% (3/4734) (Chen et al., 2015). Generally, null alleles are easily underestimated unless discrepant calls between alleged father/mother and child were observed. Another possible explanation may be the population difference of the polymorphism at the flanking region, which affects the primer binding in the process of DNA amplification for forensic detection. Null alleles may happen when the target DNA fragments are amplified inappropriately. For the same reason, the primer sequences in many common used amplification kits were mainly developed based on the sequence data from western populations. Mismatches occurred with a higher possibility in the primer binding process when the tested individual is from the Chinese Han population. Therefore, it is important to investigate and recognize the sequence characteristics in the flanking region of the STR loci for the target populations. More importantly, despite the caution of repeated experiments with different profiling systems, the discovery of variants in linkage status on locus D5S818 may enrich the genetic data for interpretation of STR typing with NGS. By combing the sequence polymorphism with length polymorphism, STR analysis based on NGS show promising prospect in forensic genetics. The study on the polymorphic feature in the flanking region and its potential relationship with the core repeat sequence of the STR locus would benefit the forensic STR research and therefore, improve the resolution of forensic markers. Additional information such as biogeographic origin may also be reflected from the linkage status of variants if more reference populations were included. In 2016, International Society for Forensic Genetics (ISFG) recommended a new system of STR allele nomenclature based on sequence information (Parson et al., 2016). The allele described in this study could be named as “D5S818[CE12]‐Chr5‐GRCh38 123775556–123775599 [ATCT]12123775574‐T; 123775689‐T” according to the recommendations. Furthermore, the balance within a locus always got enough focus since discordant proportions between two alleles of heterozygotes may imply mixed samples. However, the balance between different loci were less concerned in consideration of specific features for different loci. Peak area of D5S818 analyzed through 2824 samples (as listed in Figure 3) suggested that the balance between loci in a multiplex system should be paid more attention for the occurrence of null alleles.

CONCLUSIONS

In this study, a novel variant was discovered in the core repeat region of D5S818 in Chinese Han population, and it is observed to be in linkage with a reported variation named rs1187948322. Polymorphic information of forensic STRs could be enriched for forensic application. Besides, additional attention was suggested on observed homozygosity with reduced peak area in D5S818, since null alleles occur more frequently in this situation than previously estimated.

CONFLICT OF INTEREST

The authors have no conflicts of interest to declare that are relevant to the content of this article.

AUTHORS’ CONTRIBUTIONS

C.S. and K.S. conceived the idea of the study; X.P., M.W. and B.Z. analyzed the data; Y.Y., J.X., and H.X. interpreted the results; C.S. and K.S. wrote the paper; all authors discussed the results and revised the manuscript.

COMPLIANCE WITH ETHICAL STANDARDS

Approval was obtained from the ethics committee of School of Basic Medical Sciences, Fudan University. The procedures used in this study adhere to the tenets of the Declaration of Helsinki.
  17 in total

1.  STRBase: a short tandem repeat DNA database for the human identity testing community.

Authors:  C M Ruitberg; D J Reeder; J M Butler
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  False homozygosities at various loci revealed by discrepancies between commercial kits: implications for genetic databases.

Authors:  Magali Delamoye; Charlotte Duverneuil; Katia Riva; Michel Leterreux; Stéphane Taieb; Philippe De Mazancourt
Journal:  Forensic Sci Int       Date:  2004-06-30       Impact factor: 2.395

3.  A single mutation in the FGA locus responsible for false homozygosities and discrepancies between commercial kits in an unusual paternity test case.

Authors:  Ugo Ricci; German Melean; Carlo Robino; Maurizio Genuardi
Journal:  J Forensic Sci       Date:  2007-03       Impact factor: 1.832

4.  Combining markers into haplotypes can improve population structure inference.

Authors:  Lucie M Gattepaille; Mattias Jakobsson
Journal:  Genetics       Date:  2011-08-25       Impact factor: 4.562

5.  Null alleles and sequence variations at primer binding sites of STR loci within multiplex typing systems.

Authors:  Yining Yao; Qinrui Yang; Chengchen Shao; Baonian Liu; Yuxiang Zhou; Hongmei Xu; Yueqin Zhou; Qiqun Tang; Jianhui Xie
Journal:  Leg Med (Tokyo)       Date:  2017-10-28       Impact factor: 1.376

6.  A silent allele in the locus D19S433 contained within the AmpFlSTR Identifiler PCR Amplification Kit.

Authors:  Akiko Tsuji; Atsushi Ishiko; Takahiro Umehara; Yosuke Usumoto; Wakako Hikiji; Keiko Kudo; Noriaki Ikeda
Journal:  Leg Med (Tokyo)       Date:  2010-01-27       Impact factor: 1.376

7.  A D19S433 primer binding site mutation and the frequency in Japanese of the silent allele it causes.

Authors:  Natsuko Mizuno; Tetsushi Kitayama; Koji Fujii; Hiroaki Nakahara; Kanako Yoshida; Kazumasa Sekiguchi; Naoto Yonezawa; Minoru Nakano; Kentaro Kasai
Journal:  J Forensic Sci       Date:  2008-07-11       Impact factor: 1.832

8.  Metabolome-based genome-wide association study of maize kernel leads to novel biochemical insights.

Authors:  Weiwei Wen; Dong Li; Xiang Li; Yanqiang Gao; Wenqiang Li; Huihui Li; Jie Liu; Haijun Liu; Wei Chen; Jie Luo; Jianbing Yan
Journal:  Nat Commun       Date:  2014-03-17       Impact factor: 14.919

9.  Analysis of protein-coding genetic variation in 60,706 humans.

Authors:  Monkol Lek; Konrad J Karczewski; Eric V Minikel; Kaitlin E Samocha; Eric Banks; Timothy Fennell; Anne H O'Donnell-Luria; James S Ware; Andrew J Hill; Beryl B Cummings; Taru Tukiainen; Daniel P Birnbaum; Jack A Kosmicki; Laramie E Duncan; Karol Estrada; Fengmei Zhao; James Zou; Emma Pierce-Hoffman; Joanne Berghout; David N Cooper; Nicole Deflaux; Mark DePristo; Ron Do; Jason Flannick; Menachem Fromer; Laura Gauthier; Jackie Goldstein; Namrata Gupta; Daniel Howrigan; Adam Kiezun; Mitja I Kurki; Ami Levy Moonshine; Pradeep Natarajan; Lorena Orozco; Gina M Peloso; Ryan Poplin; Manuel A Rivas; Valentin Ruano-Rubio; Samuel A Rose; Douglas M Ruderfer; Khalid Shakir; Peter D Stenson; Christine Stevens; Brett P Thomas; Grace Tiao; Maria T Tusie-Luna; Ben Weisburd; Hong-Hee Won; Dongmei Yu; David M Altshuler; Diego Ardissino; Michael Boehnke; John Danesh; Stacey Donnelly; Roberto Elosua; Jose C Florez; Stacey B Gabriel; Gad Getz; Stephen J Glatt; Christina M Hultman; Sekar Kathiresan; Markku Laakso; Steven McCarroll; Mark I McCarthy; Dermot McGovern; Ruth McPherson; Benjamin M Neale; Aarno Palotie; Shaun M Purcell; Danish Saleheen; Jeremiah M Scharf; Pamela Sklar; Patrick F Sullivan; Jaakko Tuomilehto; Ming T Tsuang; Hugh C Watkins; James G Wilson; Mark J Daly; Daniel G MacArthur
Journal:  Nature       Date:  2016-08-18       Impact factor: 49.962

10.  Variants in linkage status at D5S818 detected by multiple STR kits comparison and Sanger sequencing.

Authors:  Chengchen Shao; Yining Yao; Xinwei Pan; Mengde Wu; Beilei Zhang; Hongmei Xu; Jianhui Xie; Kuan Sun
Journal:  Mol Genet Genomic Med       Date:  2021-07-24       Impact factor: 2.183

View more
  1 in total

1.  Variants in linkage status at D5S818 detected by multiple STR kits comparison and Sanger sequencing.

Authors:  Chengchen Shao; Yining Yao; Xinwei Pan; Mengde Wu; Beilei Zhang; Hongmei Xu; Jianhui Xie; Kuan Sun
Journal:  Mol Genet Genomic Med       Date:  2021-07-24       Impact factor: 2.183

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.