| Literature DB >> 21674826 |
Mitchell M Holland1, Megan R McQuillan, Katherine A O'Hanlon.
Abstract
AIM: To use parallel array pyrosequencing to deconvolute mixtures of mitochondrial DNA (mtDNA) sequence and provide high resolution analysis of mtDNA heteroplasmy.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21674826 PMCID: PMC3118725 DOI: 10.3325/cmj.2011.52.299
Source DB: PubMed Journal: Croat Med J ISSN: 0353-9504 Impact factor: 1.351
Figure 1First round polymerase chain reaction (frPCR) primer design. The squares represent the primer binding sequence, the circles represent the multiplex identifier (MID) sequence, and the thick line represents the fusion primer sequence for 454 applications. The length of each primer is 49 nucleotides. The total amplicon length is 503 bps; 49 nucleotides of forward and reverse primers +405 bps of hypervariable segment 1 (HV1) representing target sequence between nucleotide positions 15 997 and 16 401.
First round polymerase chain reaction (frPCR) primer sequences for hypervariable segment 1. Starting at the 5′-end of each primer, fusion primer sequence for forward and reverse emulsion PCR and pyrosequencing is in standard font, followed by multiplex identifier sequence in italics and primer binding sequence in bold
| Primer | Sequence |
|---|---|
| MID1 forward | 5′ – GCCTCCCTCGCGCCATCAG |
| MID1 reverse | 3′ – |
| MID9 forward | 5′ - GCCTCCCTCGCGCCATCAG |
| MID9 reverse | 3′ – |
| MID10 forward | 5′ - GCCTCCCTCGCGCCATCAG |
| MID10 reverse | 3′ – |
| MID11 forward | 5′ – GCCTCCCTCGCGCCATCAG |
| MID11 reverse | 3′ – |
| MID12 forward | 5′ – GCCTCCCTCGCGCCATCAG |
| MID12 reverse | 3′ – |
First round polymerase chain reaction (frPCR) primer sequences for primer set two (PS2) of hypervariable segment 1, the D18S51 short tandem repeat locus, and the DYS389 I/II loci. Starting at the 5′-end of each primer, fusion primer sequence for emulsion PCR and pyrosequencing is in standard font, followed by multiplex identifier sequences in italics and primer binding sequence in bold
| Primer | Sequence |
|---|---|
| PS2
MID1 forward | 5′ – GCCTCCCTCGCGCCATCAG |
| PS2
MID1 reverse | 3′ – |
| PS2
MID11 forward | 5′ – GCCTCCCTCGCGCCATCAG |
| PS2
MID11 reverse | 3′ – |
| D18S51
MID1 forward | 5′ – GCCTCCCTCGCGCCATCAG |
| D18S51
MID1 reverse | 3′ – |
| D18S51
MID11 forward | 5′ – GCCTCCCTCGCGCCATCAG |
| D18S51
MID11 reverse | 3′ – |
| DYS389 I/II
MID1 forward | 5′ – GCCTCCCTCGCGCCATCAG |
| DYS389 I/II
MID1 reverse | 3′ – |
| DYS389 I/II
MID11 forward | 5′ – GCCTCCCTCGCGCCATCAG |
| DYS389 I/II MID11 reverse | 3′ – |
The hypervariable segment 1 mitochondrial DNA (mtDNA) profiles for 25 of the 27 lineages; polymorphic nucleotide sites are in relation to the Cambridge Reference Sequence. The percentage of minor component heteroplasmy at specific sites is provided for each sample, when detected
| Sample | Sanger mtDNA profile | Percent of minor heteroplasmy and site | 454 GS Junior mtDNA profile | Percent of minor heteroplasmy and site |
|---|---|---|---|---|
| F2 | 16069T, 16093C, 16126C, 16261T, 16274A, 16355T | 16311 – 18.4% C | 16069T, 16093C, 16126C, 16261T, 16274A, 16355T | 16093 – 3.71% T 16261 – 1.29% C 16311 – 20.14% C |
| F3 | 16069T, 16126C, 16145A, 16172C, 16261T | Not detected | 16069T, 16126C, 16145A, 16172C, 16261T | Not detected |
| F4 | No polymorphisms | Not detected | No polymorphisms | Not detected |
| F5 | 16129A, 16172C, 16223T, 16311C | Not detected | 16129A, 16172C, 16223T, 16311C | 16129 – 0.51% G 16311 – 0.33% T |
| F7, F12-13, M13-14 | 16192T, 16256T, 16270T | Not detected | 16192T, 16256T, 16270T | 16192 – 2.64%-4.50% C |
| F8 | 16223T, 16362C | Not detected | 16223T, 16362C | 16223 – 1.86% C |
| F9 | 16356C | Not detected | 16356C | Not detected |
| F10 | 16298C | Not detected | 16298C | 16298 – 0.45% T |
| F16 | 16126C, 16239T, 16294T, 16296T, 16304C | Not detected | 16126C, 16239T, 16294T, 16296T, 16304C | Not detected |
| F25 | 16343G | Not detected | 16343G | Not detected |
| F26 | 16093C | Not detected | 16093C | Not detected |
| F27 | 16172C, 16278T | Not detected | 16172C, 16278T | Not detected |
| M3 | 16355T | Not detected | 16355T | Not detected |
| M4 | 16111T | Not detected | 16111T | 16111 – 0.52% C |
| M5 | 16114A, 16129A, 16192T, 16213A, 16223T, 16278T, 16355T, 16362C | Not detected | 16114A, 16129A, 16192T, 16213A, 16223T, 16278T, 16355T, 16362C | 16192 – 3.18% C |
| M7 | 16129A, 16223T, 16264T | Not detected | 16129A, 16223T, 16264T | Not detected |
| M8 | 16224C, 16311C | Not detected | 16224C, 16311C | Not detected |
| M9 | 16301T, 16343G, 16356C | Not detected | 16301T, 16343G, 16356C | Not detected |
| M10 | 16304C | Not detected | 16304C | 16209 – 2.62% C 16222 – 2.30% T 16304 – 2.99% T |
| M11 | 16129A, 16223T | Not detected | 16129A, 16223T | Not detected |
| M12 | 16069T, 16126C | Not detected | 16069T, 16126C | 16126 – 1.14% T |
| M15 | 16093C, 16224C, 16311C | Not detected | 16093C, 16224C, 16311C | 16093 – 3.04% T |
| M17 | 16126C, 16294T, 16296T | Not detected | 16126C, 16294T, 16296T | Not detected |
| M18 | 16278T, 16304C, 16311C | Not detected | 16278T, 16304C, 16311C | 16128 – 0.52% T 16278 – 0.77% C 16293 – 0.77% G 16304 – 1.00% T |
| M19, F22 | 16069T, 16126C, 16222T | Not detected | 16069T, 16126C, 16222T | Not detected |
Figure 2Heteroplasmy at positions 16 093, 16 261, and 16 311 of sample F2, as observed when using the Sanger method of DNA sequencing. The only position where heteroplasmy would have been called by the majority of forensic scientists is 16 311; laboratories using instruments with low levels of signal to noise may have called position 16 093. The corresponding percentages of heteroplasmy observed in the second generation sequencing data were, 3.71%, 1.29%, and 20.14%, respectively.
A summary of the data for the 3 sets of 5 mixture experiments at the estimated ratios of 1:5, 1:100, 1:250, 1:500, and 1:1000. Total coverage is the total number of sequencing reads generated by the instrument; minor component percentage is the percentage of the total reads that correspond to the minor nucleotide; minor component coverage is the total number of sequencing reads generated by the instrument for the minor component; forward:reverse sequence reads is the total number of sequencing reads generated by the instrument in the forward and reverse directions; and minor vs total is a reflection that the number on the top ratios are minor (eg, 662:363) and the bottom ratios are total (eg, 5425:2946)
| Experiment 1 (F12:M15) | Experiment 2 (M9:M11) | Experiment 3 (M15:F22) | ||||
|---|---|---|---|---|---|---|
| 1:5 | 8371 | 12.24 | 1950 | 7.18 | 11310 | 16.85 |
| 1:100 | 11126 | 0.56 | 2751 | 0.11 | 2436 | 0.86 |
| 1:250 | 7241 | 0.36 | 2830 | 0.11 | 3959 | 0.38 |
| 1:500 | 9941 | 0.24 | 7357 | 0.07 | 25705 | 0.051 |
| 1:1000 | 5848 | 0.07 | 1234 | 0.16 | 2991 | 0.100 |
| minor component coverage | forward: reverse sequence reads
minor vs total | minor component coverage | forward: reverse sequence reads
minor vs total | minor component coverage | forward: reverse sequence reads
minor vs total | |
| 1:5 | 1025 | 662:363
5425:2946 | 140 | 122:18
1651:299 | 1906 | 1181:725
7492:3818 |
| 1:100 | 62 | 46:16
7910:3216 | 3 | 3:0
2410:341 | 21 | 12:9
1783:653 |
| 1:250 | 26 | 19:7
4789:2452 | 3 | 2:1
2329:501 | 15 | 9:6
2490:1469 |
| 1:500 | 24 | 14:10
6945:2996 | 5 | 5:0
7012:345 | 13 | 8:5
18411:7294 |
| 1:1000 | 4 | 3:1
3954:1894 | 2 | 1:1
972:262 | 3 | 2:1
1971:1020 |
| actual mixture ratio | actual mixture ratio | actual mixture ratio | ||||
| 1:5 | 1:8 | 1:14 | 1:6 | |||
| 1:100 | 1:178 | 1:917 | 1:116 | |||
| 1:250 | 1:278 | 1:917 | 1:264 | |||
| 1:500 | 1:417 | 1:1428 | 1:1977 | |||
| 1:1000 | 1:1428 | 1:625 | 1:997 | |||
Figure 3Short tandem repeat results for the D18S51 locus of two samples mixed in a pre-polymerase chain reaction ratio of 1:1. The D18 profile for individual one was 13,14 and 14,17 for individual two. The 3 gray-shaded areas labeled in the figure are representative of the sequence density for alleles 13, 14, and 17; ~ 1000-2500 sequences for each allele. The cross above the 14 allele indicates which sequence is being displayed in the window below; sequences 1-28 for allele 14. The sequence density for allele 14 is illustrative of 2 doses of that allele. The area enclosed by the box in the window below is the 14 copies of the STR repeat, AGAA, for the 14 allele; for sequences 6-10. The numbered sequence above the window is a reference used to partition the sequences associated with the 14 allele away from other alleles.
A summary of the complete and chimeric sequences observed in the 1:5 and 1:100 mixtures for samples M15 (major) and F22 (minor). The front and back designations reflect the first and second halves of hypervariable segment 1, respectively. The mtDNA sequence for the major and minor components can be found in Table 3 and Figure 4
| Samples M15 and F22 | 1:5 ratio | 1:100 ratio |
|---|---|---|
| Total sequences analyzed | 1028 (of 11 310) | 2436 |
| Complete minor sequences | 130 | 6 |
| Major front, minor back sequences | 21 | 5 |
| Minor front, major back sequences | 22 | 9 |
| 069T + major sequences | 6 | 2 |
| 069C+ minor sequences | 23 | 9 |
| Major +311T sequences | 16 | 2 |
| Minor +311C sequences | 4 | 2 |
Figure 4A chimeric mitochondrial DNA sequence for a sample from a 1:100 mixture of M15 and F22. The second sequence from the top has the 16069 T, 16126 C, and 16222 T polymorphisms of the minor component; highlighted by the NextGENe software. The second sequence from the bottom is a chimeric product that contains the 16 069 T and 16 126 C from the minor component, and the 16 224 C and 16 311 C polymorphisms from the major component. This is referred to as a Minor Front, Major Back chimeric sequence. The missing C’s at position 16 224 for the 4th, 5th, and 7th sequences listed below are indicative of the difficulties of the 454 approach to resolve homopolymeric stretches of greater than 4 C’s in a row.
Figure 5The distance between mitochondrial DNA polymorphisms in the major and minor components of a 1:5 or 1:100 mixture of M15 and F22. The major component profile from M15 has 16093 C/T heteroplasmy, 16224 C, and 16311 C, and the Minor component profile from F22 has 16069 T, 16126 C, and 16222 T.