| Literature DB >> 33158032 |
Daniela Cuenca1, Jessica Battaglia1, Michelle Halsing1, Sandra Sheehan1.
Abstract
The advent of massively parallel sequencing (MPS) in the past decade has opened the doors to mitochondrial whole-genome sequencing. Mitochondrial (mt) DNA is used in forensics due to its high copy number per cell and maternal mode of inheritance. Consequently, we have implemented the Thermo Fisher Precision ID mtDNA Whole Genome panel coupled with the Ion Chef™ and Ion S5™ for MPS analysis in the California Department of Justice, Missing Persons DNA Program. Thirty-one mostly challenging samples (degraded, inhibited, low template, or mixed) were evaluated for this study. The majority of these samples generated single source full or partial genome sequences with MPS, providing information in cases where previously there was none. The quantitative and sensitive nature of MPS analysis was beneficial, but also led to detection of low-level contaminants. In addition, we found Precision ID to be more susceptible to inhibition than our legacy Sanger assay. Overall, the success rate (full single source hypervariable regions I and II (HVI/HVII) for Sanger and control region for MPS result) for these challenging samples increased from 32.3% with Sanger sequencing to 74.2% with the Precision ID assay. Considering the increase in success rate, the simple workflow and the higher discriminating potential of whole genome data, the Precision ID platform is a significant improvement for the CA Department of Justice Missing Persons DNA Program.Entities:
Keywords: forensics; human identification; massively parallel sequencing; mitochondrial DNA; next-generation sequencing
Mesh:
Substances:
Year: 2020 PMID: 33158032 PMCID: PMC7692767 DOI: 10.3390/genes11111303
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Samples Description Table. The table shows sample name, sample type, DNA concentration, legacy results, and Precision ID (PID) amplification volume used. STR and Sanger sequencing was attempted for all samples, “No STR” or “No HVI/HVII results” indicate unsuccessful results. Not every sample was quantified with both autosomal and mitochondrial targets; cells that show N/A indicate that sample was not quantified using that target. In the Volume Used column dilution is abbreviated to Dil.
| qPCR Sample Quantitation | ||||||
|---|---|---|---|---|---|---|
| Sample Name | Sample Type | Large Target (pg/µL) | Small Target (pg/µL) | mt Target (mtCopies/µL) | Legacy Results (Nuclear and Mitochondrial) | Precision ID Volume Used (µL) |
| Hair Case 1—Item 1 | Hair—2 cm proximal end | 0 | Not Applicable (N/A) | 16,000 | No short tandem repeat (STR); hypervariable regions I/II (HVI/HVII) Mixture | 3.4 1 |
| Hair Case 1—Item 2 | Hair Shaft—2 cm after item 1 sampling | 0 | N/A | 2700 | No STR; HVI/HVII Mixture | 2.8 1 |
| Hair Case 1—Item 3 | Hair Shaft—2 cm after item 2 sampling | 0 | N/A | 160 | No STR; HVI/HVII Mixture | 3.3 1 |
| Hair Case 2—Item 1 | Hair Root + Shaft | 0 | 0 | N/A | No STR; Partial HVI/HVII Results and a Mixture | 15 1 |
| Hair Case 2—Item 2 | Hair Shaft—2 cm after root | 0 | 0 | N/A | No STR; Partial HVI/HVII Results and a Mixture | 15 1 |
| Hair Case 3—Item 1 | Hair Shaft | N/A | N/A | N/A | No STR; No HVI/HVII Results | 15 1 |
| Hair Case 3—Item 2 | 2nd Hair Shaft | N/A | N/A | N/A | No STR; No HVI/HVII Results | 15 1 |
| Hair Case 4 | Hair Shaft | N/A | N/A | 3525 | HVI and HVII Results | 15 1 |
| Bone 1 | Tooth | 22.1 | 49.3 | 13,650 | Full STR Identifiler + MiniFiler; HVI/HVII Results Outsourced | 10 2 |
| Bone 2 | Foot Bone | 0.1 | 0.5 | 1935 | Full STR Identifiler + MiniFiler; HVI/HVII Results Outsourced | 6.5 1 |
| Bone 3 | Femur | 0.5 | 1.1 | 6550 | No STR; HVI/HVII Results Outsourced | 6.5 1 |
| Bone 4 | Skull | 0.1 | 0.2 | 670 | No STR; Partial HVI/HVII Results | 8 1 |
| Bone 5 | Femur | 0 | 0 | N/A | No STR; No HVI/HVII Results | 7.2 1 |
| Bone 6 | Femur | 0 | N/A | 1890 | 3 STRs Identifiler + MiniFiler; No HVI/HVII Results | 13.2 1 |
| Bone 7 | Pelvic Bone | 0 | 7200 | N/A | No STR; No HVI/HVII Results | 10 2 |
| Bone 8 | Tooth | 0 | 0 | 3.34 | No STR; No HVI/HVII Results | 10 2 |
| Bone 9 | Tooth | 0 | 48 | N/A | No STR; HVI/HVII Results Outsourced | 10 2 |
| Bone 10 | Femur | 2.5 | 5.3 | N/A | Partial STR; HVI/HVII Results | 15 1 |
| Bone 11 | Femur | 71 | 30,000 | N/A | Partial STR; No HVI/HVII Results | 5 4 |
| Bone 12 | Tooth | 4.6 | 71.4 | N/A | Partial STR; No HVI/HVII Results | 5 4 |
| Bone 13 | Tooth | 0 | 170 | N/A | Partial STR; No HVI/HVII Results | 3 4 |
| Bone 14 | Femur | 88.1 | 132 | 28,000 | Partial STR; No HVI/HVII Results | 1 3 |
| Tissue 1 | Tissue | 0 | 231 | 895,000 | No STR; Partial mtDNA Results | 10 2 |
| Tissue 2 | Tissue | 0.1 | 658 | N/A | Partial MiniFiler; No HVI/HVII Results | 1 4 |
| Tissue 3 | Tissue | N/A | N/A | N/A | Full STR with MiniFiler; Partial HVI/HVII Results | 15 1 |
| Nail 1 | Nail | 370 | N/A | 17,000,000 | Full STR; HVI/HVII Results | 1 of 1/3 Dil 3 |
| Nail 2 | Nail | 8900 | 53,000 | N/A | Full STR; HVI/HVII Results | 1 of 1/90 Dil 3 |
| Nail 3 | Nail | 0 | 41 | N/A | No STR; No HVI/HVII Results | 8.5 1 |
| Buccal 1 | Buccal | 180 | 260 | N/A | Full STR; HVI/HVII Results | 1 of 1/2 Dil 3 |
| Buccal 2 | Buccal | 460 | 560 | N/A | Full STR; HVI/HVII Results | 1 of 1/4 Dil 3 |
| Buccal 3 | Buccal | 320 | 360 | N/A | Full STR; HVI/HVII Results | 1 of 1/3 Dil 3 |
1 Maximum available DNA extract volume used. 2 Volume used equals what was used for Sanger sequencing. 3 A target of 80–100 picograms was used. 4 Volume used based on quantitation and degradation index (concentration of small DNA target/concentration of large DNA target) results.
Converge™ v2.1 analysis settings for sequencing runs of 8 and 16 samples. For the minimum coverage percent, the threshold is relative to the median number of reads for the amplicon of interest. The confirmed, point heteroplasmy, insertion and deletion percentage thresholds are related to the total number of reads for the base position of interest.
| Converge v2.1 Analysis Setting | 8 Sample Multiplex |
|
|---|---|---|
| Minimum Total Read Coverage per Base Position | 500 reads | 250 reads |
| Minimum Variants Coverage to Call | 50 reads | 25 reads |
| Coverage Threshold to Mark Region | 500 reads | 250 reads |
| Minimum Coverage Percent | 5% | 5% |
| Confirmed Call Threshold | 90% | 90% |
| Point Heteroplasmy Call or Mixed Base Threshold Call Threshold | 10% | 10% |
| Insertion Call Threshold | 20% | 20% |
| Deletion Call Threshold | 30% | 30% |
Sequencing-run metrics. Chip loading refers to the percent of wells in the chip that have a DNA containing bead. Polyclonal percentage is the percent of wells that have a mixed signal or had two or more DNA fragments during clonal amplification. Percent of low-quality reads refers to the percentage of reads filtered out by quality filters. The last metric is the total number of usable reads after all the filters are applied.
| Run Number | Chip Loading (%) | % Polyclonal | % Low Quality Reads | Usable Reads |
|---|---|---|---|---|
| Run 1 | 77 | 38 | 26 | 12,174,997 |
| Run 2 | 87 | 42 | 19 | 14,961,333 |
| Run 3 | 84 | 37 | 52 | 8,050,849 |
| Run 4 | 90 | 43 | 57 | 8,135,888 |
| Run 5 | 84 | 45 | 47 | 9,107,278 |
| Run 6 | 83 | 47 | 28 | 11,581,237 |
| Run 7 | 87 | 54 | 33 | 9,079,330 |
| Run 8 | 83 | 55 | 33 | 7,992,750 |
| Run 9 | 88 | 51 | 27 | 11,247,818 |
| Run 10 | 89 | 46 | 38 | 10,473,906 |
| Run 11 | 89 | 49 | 27 | 11,248,640 |
| Run 12 | 91 | 55 | 29 | 10,002,927 |
Figure 1Number of reads per sample and its relation to the analytical threshold (AT). The AT is dynamic in this chart because the samples were not all sequenced in the same run. The AT was determined by multiplying the total usable reads for each run by 0.025 or 0.0125, depending on the number of samples multiplexed in the same run. The value was then plotted on the chart as the orange threshold line. Blue and purple bar distinctions represent samples sequenced in an 8 (blue) or 16 (purple) multiplex.
Precision ID mtDNA assay success rate. The table shows the sample name, Precision ID result coverage percentage for the full genome, California Department of Justice (CA DOJ) Sanger results, CA DOJ and outsourcing combined Sanger results, the Precision ID assay results and the success rate for each. Success rate is determined based on a full single-source HVI and HVII Sanger result or a full control region massively parallel sequencing (MPS) result. The absence of a box indicates no typing attempt was made, the presence of an uncrossed box (☐) indicates typing was attempted but unsuccessful, and a crossed box (☒) indicates typing was attempted and successful.
| Sample Name | Full mtDNA Coverage | Sanger CA DOJ | Sanger CA DOJ and Outsourcing Combined Results | Precision ID Assay |
|---|---|---|---|---|
| Hair 1–1 | 100% | ☐ | ☐ | ☒ |
| Hair 1–2 | 93% | ☐ | ☐ | ☐ |
| Hair 1–3 | 75% | ☐ | ☐ | ☐ |
| Hair 2–1 | 95% | ☐ | ☐ | ☒ |
| Hair 2–2 | 93% | ☐ | ☐ | ☐ |
| Hair 3–1 | 96% | ☐ | ☐ | ☐ |
| Hair 3–2 | 93% | ☐ | ☐ | ☒ |
| Hair 4 | 99.4% | ☒ | ☒ | ☐ |
| Bone 1 | 99.5% | ☒ | ☒ | |
| Bone 2 | 99.4% | ☒ | ☒ | |
| Bone 3 | 100% | ☐ | ☒ | ☒ |
| Bone 4 | 98.8% | ☐ | ☐ | ☒ |
| Bone 5 | 52% * | ☐ | ☐ | ☐ * |
| Bone 6 | 96.7% | ☐ | ☐ | ☒ |
| Bone 7 | 84% | ☐ | ☐ | ☒ |
| Bone 8 | 0% | ☐ | ☐ | ☐ |
| Bone 9 | 0% | ☐ | ☐ | |
| Bone 10 | 85% | ☒ | ☒ | ☒ |
| Bone 11 | 100% | ☐ | ☐ | ☒ |
| Bone 12 | 100% | ☐ | ☐ | ☒ |
| Bone 13 | 99.5% | ☐ | ☐ | ☒ |
| Bone 14 | 100% | ☐ | ☐ | ☒ |
| Tissue 1 | 91.3% | ☐ | ☐ | ☒ |
| Tissue 2 | 100% | ☐ | ☐ | ☒ |
| Tissue 3 | 100% | ☐ | ☐ | ☒ |
| Nail 1 | 100% | ☒ | ☒ | ☒ |
| Nail 2 | 100% | ☒ | ☒ | ☒ |
| Nail 3 | 100% | ☐ | ☐ | ☒ |
| Buccal 1 | 99.2% | ☒ | ☒ | ☒ |
| Buccal 2 | 100% | ☒ | ☒ | ☒ |
| Buccal 3 | 85% | ☒ | ☒ | ☒ |
| Success | Rate | 7/28 = 25% | 10/31 = 32.3% | 23/31 = 74.2% |
* Bone 5—sample affected by the robotic error. It is marked unsuccessful for MPS results because it did not have a full control region. However, as discussed in Section 3.2, this is likely due to an automation error and not the sample.
Figure 2Linear coverage graphs. The x-axis shows the mitochondrial genome from position 1–16,569. The y-axis represents the number of usable reads sequenced. The pink shading shows the coverage threshold of 500 reads. (a) Bone 3 amplified well with full coverage for all amplicons. (b) Bone 5 is missing coverage for every other amplicon in the panel seen as a disconnected coverage map.
Figure 3Read histogram from sequencing report. This histogram shows the size of the reads within a sample. (a) Histogram for a positive control sample. (b) Histogram for tissue 1 showing degradation. (c) Histogram for bone sample 10.
Figure 4Read coverage for tissue 1. The x-axis shows the mitochondrial genome from position 1–16,569. The y-axis represents the number or reads sequenced. The smallest amplicons in the panel are represented in the coverage map by the two spikes just below 6 and 10 kb.
Figure 5Precision ID homopolymer detection causing an alignment miscall at positions 16,188 and 16189. The insertion at position 16,193 is represented as a purple i.