| Literature DB >> 22014212 |
Izaskun Mallona1, Julia Weiss, Marcos Egea-Cortines.
Abstract
BACKGROUND: Relative calculation of differential gene expression in quantitative PCR reactions requires comparison between amplification experiments that include reference genes and genes under study. Ignoring the differences between their efficiencies may lead to miscalculation of gene expression even with the same starting amount of template. Although there are several tools performing PCR primer design, there is no tool available that predicts PCR efficiency for a given amplicon and primer pair.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22014212 PMCID: PMC3234296 DOI: 10.1186/1471-2105-12-404
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Algorithms for primer self-complementarity (primerSelfcom) and cross hybridization (primerDimers) computing. (a), (b) and (c) show three stages of the sliding window triplet extraction step. All the DNA string is reduced into overlapping triplets. (d) reflects the general overview of the algorithm. As a first step, triplets are extracted for each primer. Then, primers are reverse complemented and thereafter splitted into overlapping triplets. Comparison between triplets allows the generation of an estimate of similarity, which is employed as a hybridization predictor.
Statistical results for univariate analysis
| Univariate analysis | ||||||
|---|---|---|---|---|---|---|
| Primers length | Z = -7.4398 | - | -0.118 | -0.239 | -0.433 | 1.008e-13* |
| Sequence length | Z = -5.423 | - | -0.086 | -0.173 | -0.314 | 5.86e-08* |
| Sequence G+C content | Z = -10.2664 | - | -0.163 | -0.331 | -0.601 | <2.2e-16* |
| A repeats | Z = 2.1004 | - | 0.033 | 0.067 | 0.121 | 0.03569* |
| T repeats | Z = 3.9818 | - | 0.063 | 0.127 | 0.230 | 6.84e-05* |
| C repeats | Z = -5.294 | - | -0.084 | -0.169 | -0.307 | 1.196e-07* |
| G repeats | Z = -7.1808 | - | -0.114 | -0.230 | -0.418 | 6.929e-13* |
| Primers | Z = 1.4653 | - | 0.023 | 0.047 | 0.085 | 0.1428 |
| Primers self complementarity | Z = 11.9002 | - | 0.190 | 0.386 | 0.700 | <2.2e-16* |
| Primer dimers | Z = 4.4161 | - | 0.070 | 0.141 | 0.256 | 1.005e-05* |
| Primer GC imbalance | Z = 11.1367 | - | 0.177 | 0.360 | 0.654 | <2.2e-16* |
| Primers GC content | Z = 4.5921 | - | 0.073 | 0.147 | 0.266 | 4.388e-06* |
| Sequence palindromes | Z = -3.4951 | - | -0.056 | -0.111 | -0.202 | 0.0004738* |
| Species | 9 | - | - | - | <2.2e-16* | |
| Template | 12 | - | - | - | <2.2e-16* | |
| Variety or line | 18 | - | - | - | <2.2e-16* | |
| Template source | 24 | - | - | - | <2.2e-16* | |
| Operator | 8 | - | - | - | <2.2e-16* | |
| Primer's 3' last two nucleotides | 15 | - | - | - | <2.2e-16* | |
In the case of asymptotic Spearman tests, Z value are shown; χ2 value is written for asymptotic Kruskal-Wallis tests. An asterisk over the p-value reflects a significant influence (p-value ≤ 0.05). Sperman's correlation analysis shows the ρ statistic as well the Cohen's d and Log Odds estimates of effect size. For post-hoc asymptotic Wilcoxon Mann-Whitney rank sum tests and its effect size estimators see Additional File 2.
GAM overview
| GAM analysis | ||||
|---|---|---|---|---|
| (Intercept) | 1.73825 | 0.00153 | 1136 | <2e-16 |
| Approximate significance of smooth terms | ||||
| edf | Ref.df | F | p-value | |
| s(lengthSequence, gcSequence) | 17.58 | 18.08 | 2.332 | 0.00114 |
| s(primersLength, gcPrimers) | 27.96 | 28.46 | 22.950 | < 2e-16 |
| s(gcImbalance, primerDimers) | 28.83 | 29.33 | 16.717 | < 2e-16 |
| R-sq.(adj) = 0.41 Deviance explained = 42.1% | ||||
GAM summary as estimated by the gam function of the mgcv R package. Model formula corresponds to: efficiency s(lengthSequence, gcSequence) + s(primersLength, gcPrimers) + s(gcImbalance, primerDimers).
Figure 2Perspective plot views of the GAM. Results of the best-fitting smooths for the variables included in the model. The interaction between the two variables is presented as a surface; the z-axis shows the response and the relative importance of each variable is presented in the x- and y- axis.
Figure 3ROC and PR curves. ROC and PR curves are plotted for various experimental efficiency thresholds, which define the decision criteria of succesful PCR performance. A good behaviour in ROC space is to be in the upper-left-hand corner, whereas in PR space the goal is to locate at the upper-right-hand corner.