| Literature DB >> 19465390 |
Bruno Contreras-Moreira1, Bernardo Sachman-Ruiz, Iraís Figueroa-Palacios, Pablo Vinuesa.
Abstract
Primers4clades is an easy-to-use web server that implements a fully automatic PCR primer design pipeline for cross-species amplification of novel sequences from metagenomic DNA, or from uncharacterized organisms, belonging to user-specified phylogenetic clades or taxa. The server takes a set of non-aligned protein coding genes, with or without introns, aligns them and computes a neighbor-joining tree, which is displayed on screen for easy selection of species or sequence clusters to design lineage-specific PCR primers. Primers4clades implements an extended CODEHOP primer design strategy based on both DNA and protein multiple sequence alignments. It evaluates several thermodynamic properties of the oligonucleotide pairs, and computes the phylogenetic information content of the predicted amplicon sets from Shimodaira-Hasegawa-like branch support values of maximum likelihood phylogenies. A non-redundant set of primer formulations is returned, ranked according to their thermodynamic properties. An amplicon distribution map provides a convenient overview of the coverage of the target locus. Altogether these features greatly help the user in making an informed choice between alternative primer pair formulations. Primers4clades is available at two mirror sites: http://maya.ccg.unam.mx/primers4clades/and http://floresta.eead.csic.es/primers4clades/. Three demo data sets and a comprehensive documentation/tutorial page are provided for easy testing of the server's capabilities and interface.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19465390 PMCID: PMC2703966 DOI: 10.1093/nar/gkp377
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.First output generated by primers4clades run in the interactive ‘cluster sequences’ mode using the provided alpha-proteobacterial atpD demo data set. (A) Provides a summary of the user-specified run parameters and alignment statistics for the input data set. (B) Shows the labeled NJ tree computed from the alignment of the input dataset. (C) Shows the cluster selection panel based on the labels shown on the NJ tree. Hitting the re-cluster button parses the alignment to use only the selected sequences. Hitting the get primers button starts the primer design and evaluation steps.
Figure 2.Output summary of the primer design and evaluation steps. (A) Shows the positions of the different amplicon sets mapped on the first sequence of the input alignment at the protein level. The nonredundant codon usage tables used for primer design are shown. All primer pair formulations and their thermodynamic properties can be downloaded using the TAB link. (B) The CODEHOP (bold) and the three codon-based primer formulations returned by the system (only the forward formulation is shown), aligned with the underlying codons. An ‘!’ sign denotes positions corrected by the system based on the codon alignment. (C) Output summary for the (forward) primer thermodynamic and phylogenetic quality evaluations.
Figure 3.Genome-scale benchmark analysis to test the influence of diverse alignment parameters on the numbers of predicted primer pairs obtained per locus. The analyses were performed on a set of 983 orthologous gene family alignments (at the protein level) for 19 Rhizobiales genomes. (A) Protein alignment length. (B) Percentage of gaps in the alignment. (C) Maximum WAG + G ML distance between pairs of sequences in a gene family multiple sequence alignment. (D) Among site rate variation in the protein alignment, measured as a function of the alpha (shape) parameter of the gamma distribution, estimated under ML using the WAG + G model with eight discrete rate categories. (E) Number of codon tables used per alignment.