| Literature DB >> 23049901 |
Jessica Rothe1, Norman E Watkins, Marion Nagy.
Abstract
Allele-specific extension reactions (ASERs) use 3' terminus-specific primers for the selective extension of completely annealed matches by polymerase. The ability of the polymerase to extend non-specific 3' terminal mismatches leads to a failure of the reaction, a process that is only partly understood and predictable, and often requires time-consuming assay design. In our studies we investigated haplotype-specific extraction (HSE) for the separation of male DNA mixtures. HSE is an ASER and provides the ability to distinguish between diploid chromosomes from one or more individuals. Here, we show that the success of HSE and allele-specific extension depend strongly on the concentration difference between complete match and 3' terminal mismatch. Using the oligonucleotide-modeling platform Visual Omp, we demonstrated the dependency of the discrimination power of the polymerase on match- and mismatch-target hybridization between different probe lengths. Therefore, the probe specificity in HSE could be predicted by performing a relative comparison of different probe designs with their simulated differences between the duplex concentration of target-probe match and mismatches. We tested this new model for probe design in more than 300 HSE reactions with 137 different probes and obtained an accordance of 88%.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23049901 PMCID: PMC3457965 DOI: 10.1371/journal.pone.0045955
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Schematic illustration of HSE for the separation of male DNA mixtures.
A male DNA mixture is shown by the presents of two different Y-chromosomes from individual 1 (light-blue) and individual 2 (dark-blue) which differ in a theoretical SNP A/G. The principal of HSE is than shown by the separation of a Y-chromosomal fragment of individual 2 by the use of a probe specific to the SNP allele G. (1) HSE-probe shows 3′terminal mismatch for individual 1, therefore no extension occurs. (2) Probe matches completely at Y-chromosome of individual 2 and can be extended. (3) During extension reaction the polymerase incorporates biotinylated dNTPs. Streptavidin coated magnetic beads bind to biotin labeled DNA. (4) The DNA-biotin-streptavidin complex can be captured by the use of a magnetic field.
Probes tested in HSE.
| probe | mismatch | sequence | tested different probe-length in nucleotides |
| P30FG | G-T |
| 17, 19–22 |
| P30FG -1 | G-T |
| 21 |
| P30FA | A-C |
| 17, 19, 22 |
| P30FA -1 | A-C |
| 21, 22 |
| P30RC | C-A |
| 19, 21, 24, 25 |
| P30RC -1 | C-A |
| 23, 26 |
| P38FA | A-G |
| 10–13, 15, 16, 19–21 |
| P38FA -1 | A-G |
| 12, 16, 19 |
| P38FC | C-T |
| 10, 12, 13, 15, 18–20 |
| P38FC -1 | C-T |
| 12, 18 |
| P224FC | C-A |
| 9–21, 23 |
| P224FT | T-G |
| 18–21, 23 |
| P224RG | G-T |
| 17, 20 |
| P224RA | A-C |
| 17, 20 |
| P240FC | C-A |
| 17, 19–22 |
| P240FC -1 | C-A |
| 18–21 |
| P240FT | C-A |
| 22 |
| P240FT -1 | C-A |
| 20, 21 |
| P240RG | G-T |
| 16–20 |
| P240RG -1 | G-T |
| 19, 20 |
| P240RA | A-C |
| 17–21 |
| P244FA | A-C |
| 14, 16, 23 |
| P244FA -1 | A-C |
| 15, 16 |
| P244FG | G-T |
| 11–16, 23 |
| P244FG -1 | G-T |
| 13–15 |
| P244RC | C-A |
| 13–15, 23 |
| P244RC -1 | C-A |
| 13, 14 |
| P244RT | T-G |
| 11, 13, 16, 22 |
| P244RT -1 | T-G |
| 14 |
| TatFT | T-G |
| 20 |
| TatFC | C-A |
| 18, 20 |
| rs13304202FA | A-C |
| 18–21 |
| rs13304202FA -1 | A-C |
| 20, 22 |
| rs13304202FG | G-T |
| 21 |
| rs13304202FG -1 | G-T |
| 19 |
| rs13304202RC | A-C |
| 18, 19, 21 |
| rs13304202RC -1 | A-C |
| 19–21, 24 |
| Seq.E2071 RG | G-T |
| 20, 23 |
| Seq.E2071 RA | A-C |
| 20, 23 |
| Seq.F4204 FG | G-G |
| 17, 20 |
| Seq.F4204 FC | C-C |
| 17, 20 |
Probes were named after the corresponding SNP, orientation (F = forward, R = reverse), and discriminating 3′ terminal base, which is also underlined in the sequence. Probe names have been denoted with −1 when the discriminating base is placed on the second base-pair of the 3′ terminus.
Numbers refer to the different lengths of the probes that were tested. For example, probe rs13304202FA was tested using lengths of 18, 19, 20, and 21 nucleotides, whereas the probes have been shortened step-wise on their 5′ termini.
Figure 2Direct comparison of the thermodynamic stability of match and mismatch using Visual OMP.
The program allows the simulation of the concentration of hybridized matches and mismatches by adding the starting concentration (A) of the target and probe, as well as assay parameter (B) under the solution conditions. (C) Simulation of concentration versus temperature. Schematic representation of the comparison between match and mismatch by concentrations. Ta, optimal annealing temperature for PCR; Tm, melting temperature. Horizontal arrows indicate a switch in the hybridization curve for decreasing (↓) or increasing (↑) probe length (nt) and G/C content (GC%).
Figure 3Specific extraction at the P224 locus by HSE and evaluation by AmpFℓSTR® Yfiler PCR.
(A) Map of the 11 kb distant DYS390 and P224 loci on the human Y chromosome, which were selected for HSE analysis. Arrows indicate the positions of the probes used (RG, FC, FT, or RA), which were orientated in either the forward (F) or reverse (R) direction. Given chromosome positions are from a reference sequence (Hg18). (B) Electropherograms of AmpFℓSTR® Yfiler analysis of haplo-separated samples obtained from a DNA mixture (Contributor 1 and Contributor 2) using the probes P224FC (23 and 19 nucleotides long) and P224FT (23 and 20 nucleotides long), or without probes. Green bars with allele 23 correspond to Contributor 1, and brown bars with allele 24 correspond to Contributor 2. Percentages indicate the enrichment of one Contributor.
Figure 4Separation by HSE with different probes.
Enrichment of one contributor from a male DNA mixture is shown in relation to probe length for two different probe sets P224FC/FT (see also File S2). The bars indicate standard deviation of the mean enrichment. No HSE indicates analysis of the male DNA mixture without separation by HSE, and no probe indicates separation of a male DNA mixture by HSE without probe. n = number of extractions.
Summary of all analyzed probes in this study.
| A | B | C | D | |||||||
| probe (distance to STR marker) | mean GC% | distance of hits in kb with matching 3′end | HSE results | ΔΔG (M-MM) | simulation with Viusal Omp | Match (Target-Probe) | Mismatch (Target-Probe) | |||
| success in % | best tested length | best length | HSE fits with simulation | |||||||
|
|
| 36 | no |
| 21* | −1,15 | 21, 22 |
| C-G | T-G |
|
| 35 | no |
| 21 | −1,80 | 21 |
| C-G | T-G | |
|
| 33 | no |
| 20* | −0,40 | 21 |
| T-A | C-A | |
|
| 30 | no |
| 22 | −1,01 | 22 |
| T-A | C-A | |
|
| 27 | no |
| 24* | −0,49 | 24 |
| G-C | A-C | |
|
| 30 | 2, 24 |
| 23 | −2,18 | 23, 27 |
| G-C | A-C | |
|
|
| 77 | no |
| 21 | −0,60 | 11 |
| T-A | A-G |
|
| 71 | no |
| 20 | −1,04 | 12 |
| T-A | A-G | |
|
| 83 | 2, 9, 10, 13+ |
| 15 | −1,19 | 10 |
| G-C | T-C | |
|
| 79 | 7, 10+ |
| n.s. | −1,83 | 12 |
| G-C | T-C | |
|
|
| 40 | no |
| 19* | −0,37 | 18 |
| G-C | A-C |
|
| 34 | no |
| 19*, 20* | −0,70 | 19 |
| A-T | G-T | |
|
| 60 | no |
| 17 | −1,31 | 14 |
| C-G | T-G | |
|
| 53 | no |
| 17 | −0,48 | 15, 16 |
| T-A | C-A | |
|
|
| 37 | no |
| 22* | −0,48 | 22 |
| G-C | A-C |
|
| 40 | no |
| 20* | −2,43 | 20, 21 |
| G-C | A-C | |
|
| 32 | no |
| 22 | −0,60 | 22 |
| A-T | T-C | |
|
| 35 | no |
| 20 | −1,52 | 20, 21 |
| A-T | T-C | |
|
| 42 | no |
| 18* | −1,14 | 19 |
| C-G | T-G | |
|
| 40 | no |
| 20 | −1,21 | 20 |
| C-G | T-G | |
|
| 39 | no |
| 18 | −0,87 | 18, 20 |
| T-A | C-A | |
|
|
| 66 | 49 |
| 13* | −0,95 | 15 |
| C-G | T-G |
|
| 65 | no |
| 15 | −2,11 | 14 |
| C-G | T-G | |
|
| 57 | 116+ |
| n.s. | 0,05 | no |
| T-A | C-A | |
|
| 59 | 0,6+ |
| 15 | −1,04 | 15 |
| T-A | C-A | |
|
| 68 | 20, 29+ |
| 14* | −0,66 | 13 |
| G-C | A-C | |
|
| 71 | 20+ |
| 14 | −2,16 | 14 |
| G-C | A-C | |
|
| 60 | 2, 9, 13+ |
| 13 | −0,69 | 13 |
| A-T | G-T | |
|
| 64 | 9, 13+ |
| 14 | −1,01 | 14 |
| A-T | G-T | |
|
|
| 33 | no |
| 20 | −0,57 | 20 |
| T-A | G-T |
|
| 38 | no |
| 18 | −0,52 | 20 |
| G-C | A-C | |
|
| 40 | no |
| 19 | −1,39 | 20 |
| C-G | T-G | |
|
| 43 | no |
| 18 | −2,77 | 19, 20 |
| C-G | T-G | |
|
|
| 34 | no |
| 19 | 0,42 | no |
| T-A | C-A |
|
| 38 | no |
| 22 | −1,55 | 20, 22 |
| T-A | C-A | |
|
| 39 | no |
| 21 | −1,17 | 22 |
| G-C | A-C | |
|
| 43 | no |
| 21* | −3,00 | 21, 24 |
| G-C | A-C | |
|
|
| 35 | no |
| 20 | −1,19 | 19 |
| C-G | T-G |
|
| 30 | no |
| 20 | −0,13 | 20 |
| T-A | C-A | |
|
|
| 53 | 0.3 kb with matched 3′ end |
| 17 | −0,71 | 18 |
| C-G | G-G |
|
| 53 | 0.3 kb but no matched 3′ end |
| 17 | −0,83 | 14 |
| C-G | C-C | |
(A) and (B) Comparison of HSE results for all probe sets with distance to the STR marker, G/C content, hit frequency, and free energy differences of match and mismatch in kcal/mol.
The number is the distance in kb of strong hits to the extraction locus, + indicates the presence of further hits but greater distance (>30 kb).
Best simulated probes P38FA (11 nucleotides long) and P38FA-1 (12 nucleotides long) show many hits (4, 12, 16 kb+ and 4, 10 16 kb+).
HSE results show the best experimentally evaluated probe length with enrichment of one contributor. Low separation in HSE could be due to a higher hit frequency close to the extraction locus and great distance from the STR marker. Asterix indicate the best probe length. (C) The best probe length was evaluated after simulation and chosen after highest Δcon, and comparison of the prediction for best probe length after Δcon and best probe in HSE: (r) right prediction, (w) wrong prediction. (D) Listing of all analyzed match and mismatch constellations in this study. List of complete data is available in Data S3.
Probe design for P224 FC and P30RC.
| A | B | |||||||||
| probe (STR marker) | l | probe sequence | GC% | TmM | ΔG°M | HSE result in % | n | conM*10−17M | conMM*10−17M | ΔconM-MM*10−17M |
|
| 25 |
| 36 | 63 | −10,97 | – | 297 | 290 |
| |
| 24 |
| 38 | 63 | −11,09 | – | 297 | 292 |
| ||
| 23 |
| 39 | 63 | −10,73 |
| 4 | 295 | 285 |
| |
| 22 |
| 41 | 63 | −10,80 | – | 296 | 287 |
| ||
| 21 |
| 38 | 61 | −9,58 |
| 3 | 274 | 233 |
| |
| 20 |
| 40 | 59 | −8,69 |
| 6 | 219 | 141 |
| |
| 19 |
| 37 | 59 | −8,32 |
| 10 | 182 | 100 |
| |
| 18 |
| 39 | 58 | −7,95 |
| 7 | 141 | 68 |
| |
| 17 |
| 41 | 57 | −7,58 |
| 8 | 100 | 43 |
| |
| 16 |
| 44 | 54 | −6,40 |
| 7 | 23 | 9 |
| |
| 15 |
| 47 | 54 | −6,56 |
| 6 | 29 | 12 |
| |
| 14 |
| 43 | 52 | −6,02 |
| 6 | 13 | 5 |
| |
| 13 |
| 46 | 49 | −5,15 |
| 5 | 4 | 1 |
| |
| 12 |
| 42 | 45 | −4,26 |
| 3 | 1 | 0 |
| |
| 11 |
| 45 | 44 | −4,01 |
| 3 | 1 | 0 |
| |
| 10 |
| 40 | 36 | −2,85 |
| 3 | 0 | 0 |
| |
| 9 | GACATCTTC | 44 | 35 | −2,60 |
| 3 | 0 | 0 |
| |
| 8 | ACATCTTC | 38 | 26 | −2,06 | – | 0 | 0 |
| ||
| 6 | ATCTTC | 43 | 21 | −0,93 | – | 0 | 0 |
| ||
|
| 26 |
| 27 |
| −7,43 | 86 | 48 |
| ||
| 25 |
| 28 |
| −7,55 |
| 2 | 97 | 56 |
| |
| 24 |
| 29 |
| −7,63 |
| 2 | 106 | 61 |
| |
| 23 |
| 26 |
| −6,02 | – | 13 | 7 |
| ||
| 22 |
| 27 |
| −6,62 |
| 2 | 31 | 16 |
| |
| 21 |
| 28 |
| −5,47 | – | 6 | 3 |
| ||
| 20 |
| 30 |
| −6,04 | – | 14 | 7 |
| ||
| 19 |
| 26 |
| −4,14 |
| 2 | 1 | 0 |
| |
| 18 |
| 22 |
| −3,56 | 0 | 0 |
| |||
Probe P224FC and P30RC were designed for SNP P224 (C) and SNP P30 (G) as the extraction loci in forward (F) and reverse (R) orientation. (A) For different probe length (l), the G/C content, melting temperature (Tm), and ΔG value were calculated using Visual Omp. The success of separation by HSE with different probe lengths is given as the percent enrichment of one contributor with standard deviation and number of experiments (n). (−) indicates no data (B) Concentrations of hetero-duplex target probes (con) for match (M) and mismatch (MM) have been simulated for different probe lengths using Visual OMP with the assay parameters. Δ con presents the difference between match and mismatch concentrations.
Figure 5Influence of the G/C contents on Δcon and HSE success.
(A) Comparison of Δcon from six core probes with different G/C content. For all probe sets, G/C content has been calculated as a mean value for all simulated probe lengths. (B) Diagram showing the linear dependence of probe length and G/C content for optimum specific probes after simulation and HSE extraction. Data points are based on Table 2. Diamonds represent the length and G/C content of probes with the highest simulated Δcon (n = 48), crosses indicate length and G/C content of the probe with the best HSE success (n = 11, grey boxes in Table 2A).