| Literature DB >> 19264805 |
Jorge Duitama1, Dipu Mohan Kumar, Edward Hemphill, Mazhar Khan, Ion I Mandoiu, Craig E Nelson.
Abstract
Rapid and reliable virus subtype identification is critical for accurate diagnosis of human infections, effective response to epidemic outbreaks and global-scale surveillance of highly pathogenic viral subtypes such as avian influenza H5N1. The polymerase chain reaction (PCR) has become the method of choice for virus subtype identification. However, designing subtype-specific PCR primer pairs is a very challenging task: on one hand, selected primer pairs must result in robust amplification in the presence of a significant degree of sequence heterogeneity within subtypes, on the other, they must discriminate between the subtype of interest and closely related subtypes. In this article, we present a new tool, called PrimerHunter, that can be used to select highly sensitive and specific primers for virus subtyping. Our tool takes as input sets of both target and nontarget sequences. Primers are selected such that they efficiently amplify any one of the target sequences, and none of the nontarget sequences. PrimerHunter ensures the desired amplification properties by using accurate estimates of melting temperature with mismatches, computed based on the nearest neighbor model via an efficient fractional programming algorithm. Validation experiments with three avian influenza HA subtypes confirm that primers selected by PrimerHunter have high sensitivity and specificity for target sequences.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19264805 PMCID: PMC2677861 DOI: 10.1093/nar/gkp073
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Features comparison between primer and probe selection tools most similar to PrimerHunter
| Design tool | Multiple targets | Nontargets | TM Model | Salt correction | Output |
|---|---|---|---|---|---|
| Primer3 (4) | No | Yes (DB) | NN | Yes | Multiple primer pairs |
| Insignia (9) | Yes (DB) | Yes (DB) | None | No | Multiple signatures |
| QPrimer (15) | No (DB) | No | NN | No | Multiple primers |
| DePict (14) | Yes (MSA) | No | None | No | Best primer |
| PROBEMer (8) | Yes | Yes | NN | No | Multiple probes |
| Greene SCPrimer (13) | Yes (MSA) | No | NN | Yes | Multiple primer pairs |
| OligoSpawn (16) | Yes | Yes | NN | No | Multiple probes |
| SLICSel (17) | Yes | Yes | NN | Yes | Multiple probes |
| Primaclade (18) | Yes (MSA) | No | NN | No | Multiple primers |
| OligoArray (19) | Yes | Yes (DB) | NN | No | Multiple probes |
| PrimerHunter (this article) | Yes | Yes | NN w/mismatches | Yes | Multiple primer pairs |
DB: user can select targets from a preconstructed database; MSA: input must be provided as a multiple sequence alignment; NN: nearest neighbor model.
Figure 1.Average and standard deviation of the difference (in degrees Celsius) between experimental melting temperatures and predictions obtained by fractional programming without salt correction (21) and with salt corrections performed using the SantaLucia model equation (1) for 812 duplexes of perfectly complementary oligonucleotides with lengths between 9 and 30 bp and GC content between 8% and 80%.
MSE for residuals calculated as the difference (in degrees Celsius) between experimental melting temperatures and predictions obtained by fractional programming without salt correction (21) and with salt corrections performed using the SantaLucia model equation (1)
| Salt Conc. | Primer length 9–30 | Primer length 20–25 | ||||
|---|---|---|---|---|---|---|
| (M) | GC content 8%–80% | GC content 25%–75% | ||||
| Number of duplexes | MSE w/o salt correction | MSE with salt correction | Number of duplexes | MSE w/o salt correction | MSE with salt correction | |
| 0.069–0.15 | 351 | 150.03 | 2.30 | 158 | 148.91 | 2.25 |
| 0.22 | 152 | 47.44 | 2.71 | 72 | 43.14 | 3.18 |
| 0.62–0.621 | 152 | 8.98 | 2.52 | 72 | 6.90 | 1.38 |
| 1–1.02 | 157 | 4.75 | 4.97 | 74 | 2.61 | 2.76 |
Average and standard deviation for the difference (in degrees Celsius) between experimental melting temperature and predictions made by the SantaLucia model equation (1) on duplexes with one and two mismatches
| Number of mismatches | Length range | GC content range | Number of duplexes | Average difference | Standard deviation |
|---|---|---|---|---|---|
| 1 | 9–16 | 21%–78% | 110 | 0.56 | 2.06 |
| 2 | 12–14 | 50%–75% | 28 | −1.25 | 2.70 |
Figure 2.Phylogenetic tree of avian influenza HA sequences of North American origin from the NCBI flu database (five complete sequences selected at random for each subtype).
Primers found for each subtype of avian influenza HA and comparison with number of probes generated by related tools
| Subtype | Number of targets | Number of Nontargets | Avg. percentage of diss. | Number of FP | Number of RP | Number of PP | Number of probes SlicSel | Number of probes OligoSpawn |
|---|---|---|---|---|---|---|---|---|
| H1 | 48 | 526 | 8.4 | 51 | 52 | 70 | 20 | 2 |
| H2 | 41 | 533 | 9.1 | 42 | 43 | 187 | 14 | 2 |
| H3 | 72 | 502 | 11.1 | 41 | 61 | 135 | 7 | 1 |
| H4 | 67 | 507 | 7.4 | 265 | 225 | 3724 | 18 | 2 |
| H5 | 69 | 505 | 9.1 | 68 | 66 | 160 | 17 | 1 |
| H6 | 100 | 474 | 15.4 | 36 | 27 | 3 | 4 | 3 |
| H7 | 55 | 519 | 8.9 | 77 | 81 | 260 | 2 | 1 |
| H8 | 9 | 565 | 6.3 | 489 | 482 | 14415 | 100 | 1 |
| H9 | 23 | 551 | 8.7 | 140 | 152 | 1222 | 58 | 1 |
| H10 | 16 | 558 | 6.8 | 243 | 302 | 3712 | 35 | 1 |
| H11 | 45 | 529 | 5.9 | 267 | 262 | 4117 | 32 | 1 |
| H12 | 15 | 559 | 7.1 | 472 | 494 | 12895 | 52 | 1 |
| H13 | 10 | 564 | 14.4 | 41 | 33 | 98 | 1 | 2 |
| H16 | 4 | 570 | 9.5 | 367 | 352 | 7629 | 68 | 1 |
The dissimilarity within a subtype is calculated as the average pairwise Hamming distance in the multiple sequence alignment expressed as percentage of the average sequence length. (FP: forward primers; RP: reverse primers; PP: primer pairs)
Figure 3.Amplification curves using an H5-specific primer pair and H3, H5, H7 plasmids or no template (three replicates each).
Figure 4.Average ΔCt for on-target (T), off-target (OT1 and OT2) and NTC Q-PCR amplification with nine primer pairs (three subtype-specific pairs for each of H3, H5 and H7; error bars indicate minimum/maximum values).
Figure 5.ΔCt for triplicate Q-PCR reactions performed with H3-, H5- and H7-specific primer pairs at 10 different dilutions of on- and off-target templates. Lines connect triplicate means at each dilution. The legend in each graph indicates the color for the primer (numerator) and target (denominator) combination.