| Literature DB >> 18405373 |
Kelvin Li1, Anushka Brownley, Timothy B Stockwell, Karen Beeson, Tina C McIntosh, Dana Busam, Steve Ferriera, Sean Murphy, Samuel Levy.
Abstract
BACKGROUND: Polymerase chain reaction (PCR) is used in directed sequencing for the discovery of novel polymorphisms. As the first step in PCR directed sequencing, effective PCR primer design is crucial for obtaining high-quality sequence data for target regions. Since current computational primer design tools are not fully tuned with stable underlying laboratory protocols, researchers may still be forced to iteratively optimize protocols for failed amplifications after the primers have been ordered. Furthermore, potentially identifiable factors which contribute to PCR failures have yet to be elucidated. This inefficient approach to primer design is further intensified in a high-throughput laboratory, where hundreds of genes may be targeted in one experiment.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18405373 PMCID: PMC2396641 DOI: 10.1186/1471-2105-9-191
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Comparison of JCVI Primer Design Tool versus other recently published primer design packages.
| 2003 | 2005 | 2006 | 2006 | 2007 | 2008 | ||
| Ensembl ID | - | - | - | - | ∗ | ∗ | |
| DbSNP/AFFY Probe ID | - | - | - | - | dbSNP/AFFY Probe ID | dbSNP ID | |
| Genomic coordinates | - | - | UCSC annotation | - | - | Ensembl coordinates | |
| RefSeq ID/Genbank ID | - | Genbank ID | - | Refseq mRNA ID | - | - | |
| Sequence | ∗ | ∗ | ∗ | - | - | ∗ | |
| - | - | - | ∗ | - | ∗ | ||
| - | - | - | - | - | ∗ | ||
| ∗ | ∗ | ∗ | ∗ | ∗ | - | ||
| - | ‡ | ‡ | ‡ | ‡ | ∗ | ||
| - | ∗ | ∗ | ∗ | ∗ | ∗ | ||
| ∗ | - | - | - | - | ∗ | ||
| - | stutter only | - | - | - | ∗ | ||
| - | - | - | - | - | ∗ | ||
| - | - | - | ∗† | - | ∗ | ||
| - | ∗ | - | - | - | ∗ | ||
∗ Feature supported
- Feature not supported
‡ Tiling pattern is fixed
† Searches for the full length of the primer, instead of just the critical region on the 3' end
Size and coverage rates of overall and exonic target regions by project
| Projects | # Genes | Total Target Region(bp) | % Target Region Covered | Exonic Target (bp) | % Exon Target Covered |
| JCVI #1 | 109 | 1,609,447 | 94.71% | 380,045 | 96.39% |
| JCVI #2 | 12 | 203,592 | 98.39% | 65,445 | 99.55% |
| JCVI #3 | 1 | 14,691 | 90.27% | 847 | 100.00% |
| JCVI #4 | 2 | 4,724 | 100.00% | 2,723 | 100.00% |
| JCVI #5 | 26 | 161,631 | 98.81% | 161,631 | 98.81% |
| Total | 150 | 1,994,085 | 95.40% | 610,691 | 97.14% |
Sequencing success rates for projects by amplification protocol and region type
| Standard | High GC | Combined Amplicon Statistics | ||||||
| Projects | # Amp. | % Amp. Success | # Amp. | % Amp. Success | # Total Amp. | % Amp. Success | # Exonic Amp. | % Exonic Amp. Success |
| JCVI #1 | 3,529 | 94.87% | 1,786 | 82.42% | 5,315 | 90.69% | 1,680 | 94.52% |
| JCVI #2 | 465 | 93.98% | 282 | 98.94% | 747 | 95.85% | 230 | 96.96% |
| JCVI #3 | 12 | 83.33% | 28 | 100.00% | 40 | 95.00% | 10 | 100.00% |
| JCVI #4 | 10 | 100.00% | 10 | 100.00% | 20 | 100.00% | 15 | 100.00% |
| JCVI #5 | 422 | 95.73% | 427 | 90.16% | 849 | 92.93% | 777 | 96.14% |
| Total | 4,438 | 94.84% | 2,533 | 85.83% | 6,971 | 91.57% | 2,712 | 95.24% |
Figure 1An overview of the JCVI High-throughput Directed Sequencing Pipeline.
Figure 2Sample output from a primer design run displaying amplicons in relation to genomic features.
Figure 3Trace phase shifting events occur when a PCR reaction creates a mixed population of DNA molecules that are then simultaneously sequenced.
Figure 4A trace file with PCR stutter artifact, before (top panel) and after (middle panel) PCR stutter filtering. The opposite strand trace (bottom panel) is shown in reverse complement, and confirms the discovery of the mixed base.
Figure 5The trace file in the top panel contains a 5 bp heterozygous indel. The middle and bottom panels show the result of using the correlation-based approach to split the original trace. The "long" allele trace appears in the middle panel and the "short" allele trace appears in the bottom panel.
Figure 6Identification of a palindrome element pair that could potentially produce stem loop interference. The stem consists of (3 × 3) + (5 × 2) = 19 hydrogen bonds and the loop has a circumference of 11 bases.
Figure 7Primer design run time distribution. Note that most jobs complete within the first hour.