| Literature DB >> 34491421 |
Christian Faccinetto1, Daniele Sabbatini2, Patrizia Serventi3, Martina Rigato4, Cecilia Salvoro4, Gianluca Casamassima3, Gianluca Margiotta3, Sara De Fanti5,6, Stefania Sarno5, Nicola Staiti3, Donata Luiselli7, Alberto Marino3, Giovanni Vazza8.
Abstract
With the recent advances in next-generation sequencing (NGS), mitochondrial whole-genome sequencing has begun to be applied to the field of the forensic biology as an alternative to the traditional Sanger-type sequencing (STS). However, experimental workflows, commercial solutions, and output data analysis must be strictly validated before being implemented into the forensic laboratory. In this study, we performed an internal validation for an NGS-based typing of the entire mitochondrial genome using the Precision ID mtDNA Whole Genome Panel (Thermo Fisher Scientific) on the Ion S5 sequencer (Thermo Fisher Scientific). Concordance, repeatability, reproducibility, sensitivity, and heteroplasmy detection analyses were assessed using the 2800 M and 9947A standard control DNA as well as typical casework specimens, and results were compared with conventional Sanger sequencing and another NGS sequencer in a different laboratory. We discuss the strengths and limitations of this approach, highlighting some issues regarding noise thresholds and heteroplasmy detection, and suggesting solutions to mitigate these effects and improve overall data interpretation. Results confirmed that the Precision ID Whole mtDNA Genome Panel is highly reproducible and sensitive, yielding useful full mitochondrial DNA sequences also from challenging DNA specimens, thus providing further support for its use in forensic practice.Entities:
Keywords: Forensic genetics; Internal validation; Ion Torrent; Massive parallel sequencing; NGS; Whole mitochondrial genome
Mesh:
Substances:
Year: 2021 PMID: 34491421 PMCID: PMC8523450 DOI: 10.1007/s00414-021-02686-w
Source DB: PubMed Journal: Int J Legal Med ISSN: 0937-9827 Impact factor: 2.686
Short reads in amplification negatives
| Sample | Total mapped reads | Reads < 80 bp | % of reads < 80 bp |
|---|---|---|---|
| Ctrl-1.1 | 2972 | 1762 | 59.2 |
| Ctrl-2.1 | 7244 | 3958 | 54.6 |
| Ctrl-3.1 | 3049 | 2094 | 69.0 |
| Ctrl-4.1 | 2829 | 1807 | 63.8 |
| Ctrl-4.2 | 6301 | 5494 | 87.1 |
| Ctrl-4.3 | 2963 | 1852 | 62.0 |
| Ctrl-4.4 | 6386 | 4982 | 78.0 |
| Ctrl-5.1 | 8764 | 6334 | 72.2 |
| Ctrl-5.2 | 6949 | 5247 | 75.5 |
Fig. 1Mapped reads in amplification negatives. The scatterplots show the distribution of read length vs MAPQ in each amplification negative. The red line indicates the length cut-off value of 80 bp
Fig. 2The scatterplot shows the mean VarFreq of each positions across all the replicates of 9947A (x axis) and 2800 M (y axis). The red line indicates a VarFreq of 90% (see text). Positions with a VarFreq below 90% in both controls are labelled
Positions with VarFreq < 90% across multiple replicates of 9947A and 2800 M of samples
| Position | rCRS base | Sample | VarFreq mean | VarFreq standard deviation | Second most common variant |
|---|---|---|---|---|---|
| 309* | C | 9947A | 37.76 | 23.81 | DEL |
| 2800 M | 36.18 | 5.69 | |||
| 315 | C | 9947A | 77.40 | 10.21 | INS |
| 2800 M | 81.84 | 4.08 | |||
| 438 | C | 9947A | 86.36 | 6.83 | DEL |
| 2800 M | 87.33 | 4.34 | |||
| 498 | C | 9947A | 84.75 | 2.13 | DEL |
| 2800 M | 82.32 | 2.74 | |||
| 937 | T | 9947A | 88.09 | 6.60 | DEL |
| 2800 M | 86.77 | 3.67 | |||
| 1296* | A | 9947A | 85.84 | 6.01 | DEL |
| 2800 M | 81.91 | 5.73 | |||
| 2135 | A | 9947A | 83.75 | 4.54 | DEL |
| 2800 M | 83.21 | 5.95 | |||
| 5287* | A | 9947A | 85.76 | 6.11 | DEL |
| 2800 M | 86.34 | 1.48 | |||
| 5752* | A | 9947A | 75.32 | 14.18 | DEL |
| 2800 M | 73.43 | 9.89 | |||
| 7402 | C | 9947A | 74.03 | 6.31 | DEL |
| 2800 M | 77.93 | 9.17 | |||
| 7513 | T | 9947A | 72.39 | 13.67 | DEL |
| 2800 M | 72.28 | 9.02 | |||
| 8249 | G | 9947A | 56.47 | 14.13 | INS |
| 2800 M | 61.26 | 4.32 | |||
| 8252 | C | 9947A | 84.66 | 4.62 | G |
| 2800 M | 84.34 | 5.54 | |||
| 8254 | C | 9947A | 60.31 | 5.32 | DEL |
| 2800 M | 57.51 | 5.16 | |||
| 8255* | G | 9947A | 75.14 | 11.57 | DEL |
| 2800 M | 79.94 | 4.71 | |||
| 8256 | T | 9947A | 62.78 | 10.49 | DEL |
| 2800 M | 58.67 | 5.08 | |||
| 8962 | A | 9947A | 83.35 | 5.73 | DEL |
| 2800 M | 83.22 | 5.42 | |||
| 9100 | A | 9947A | 78.83 | 4.36 | G |
| 2800 M | 81.09 | 3.85 | |||
| 10,151 | A | 9947A | 87.12 | 5.91 | DEL |
| 2800 M | 85.59 | 4.16 | |||
| 11,038 | A | 9947A | 87.17 | 5.73 | DEL |
| 2800 M | 87.41 | 3.10 | |||
| 12,425 | A | 9947A | 66.95 | 12.42 | DEL |
| 2800 M | 76.17 | 4.26 | |||
| 13,237* | A | 9947A | 88.15 | 4.95 | DEL |
| 2800 M | 89.89 | 2.88 | |||
| 13,758 | C | 9947A | 54.32 | 6.28 | DEL |
| 2800 M | 58.76 | 4.84 | |||
| 14,510 | A | 9947A | 83.97 | 7.57 | DEL |
| 2800 M | 84.23 | 5.51 | |||
| 14,754* | C | 9947A | 81.30 | 12.77 | DEL |
| 2800 M | 79.64 | 5.44 | |||
| 14,774* | C | 9947A | 87.38 | 10.76 | DEL |
| 2800 M | 86.89 | 3.94 |
*NUMT positions reported in Li et al. [32]
Proportion of mtDNA sequenced for dilution series
| Sample | X1 | X2 | X3 | X4 | X5 | X6 | X7 | X8 | X9 | X10 | X11 | X12 | X13 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100%# | 100% | 90.8%* | 96.6%* | 80.7%*§ | 72.4%* | |
| 100% | 100% | 61.5%* | 100% | 100% | 100% | 100% | 100% | 100%# | 91.7%* | 63.4%* | 91.6%* | 47.3%*§ | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 93.6%* | 96.7%* | 62.4%*§ | 49.9%*§ |
*Incomplete mtDNA profile.
Undetected 7861Y heteroplasmy.
§Presence of false positive variant calls.
Fig. 3Cumulative VarFreq distribution computed in the range of 0–99% of each dilution series (considering all replicates). For a better interpretation, the 92.5–99% VarFreq range is shown
Fig. 4The plot shows the values of point heteroplasmies in the 9947A dilution series. A PHP 7861Y (T/C); B PHP 1393R (G/A); C PHP 3242R (G/A). The dashed red line indicates the expected heteroplasmy value