| Literature DB >> 22385976 |
Bhakti Dwivedi1, Robert Schmieder, Dawn B Goldsmith, Robert A Edwards, Mya Breitbart.
Abstract
BACKGROUND: Phages (viruses that infect bacteria) have gained significant attention because of their abundance, diversity and important ecological roles. However, the lack of a universal gene shared by all phages presents a challenge for phage identification and characterization, especially in environmental samples where it is difficult to culture phage-host systems. Homologous conserved genes (or "signature genes") present in groups of closely-related phages can be used to explore phage diversity and define evolutionary relationships amongst these phages. Bioinformatic approaches are needed to identify candidate signature genes and design PCR primers to amplify those genes from environmental samples; however, there is currently no existing computational tool that biologists can use for this purpose.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22385976 PMCID: PMC3314551 DOI: 10.1186/1471-2105-13-37
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1An outline of the PhiSiGns workflow. PhiSiGns consists of two critical, interlinked processes: 1) the identification of signature genes conserved amongst a group of phages, and 2) the design of PCR primers for the amplification of these signature genes. Red stars indicate results that can be downloaded to a local machine
Figure 2Web interface for the identification of signature genes. The interface shows 1) the options to limit the display of available phage genomes, 2) a list of phage genomes meeting the selection criteria, 3) a list of the user-selected phage genomes from step (2) for signature gene identification and primer design, 4) the BLAST E-value cut-off for similarity searches, and 5) the BLAST alignment coverage cut-off for similarity searches. Users can click on the question marks for additional information on a given parameter
Figure 3Web interface for designing primers on a selected signature gene. The interface shows 1) an input box for minimum and maximum values for the primer parameters, 2) a list of genes within the selected signature gene group (users have the option to select/deselect genes from the table for alignment and primer design), 3) an option to download the sequence FASTA file, 4) an option to view the program-generated CLUSTALW alignment, and 5) an option to upload a user-generated alignment for the selected signature gene, to be used for primer design
Overview of signature genes (SiGs) identified amongst eight core T7-like phage genomes in the PhiSiGns case study
| # of phages | # of SiGs | Functional roles |
|---|---|---|
| 8 | 24 | DNA polymerase, RNA polymerase, primase/helicase, ssDNA binding protein, ligase, packaging protein A, exonuclease, endonuclease, terminase, RNA polymerase inhibitor, phage capsid and scaffold, portal connector protein, tail fiber, internal core proteins, holins, lysins, unknown phage proteins |
| 7 | 3 | endopeptidases, unknown phage proteins |
| 6 | 6 | kinase, ssDNA binding protein, dGTPase, unknown phage proteins |
| 5 | 6 | nuclease, lipoprotein, unknown phage proteins |
| 4 | 3 | primase/helicase, unknown phage proteins |
| 3 | 4 | adenosylmethionine hydrolase, unknown phage proteins |
| 2 | 12 | endonuclease, unknown phage proteins |
Figure 4Phylogenetic tree of T7-like primase/helicase sequences amplified from sewage samples with degenerate primers designed with PhiSiGns. The eight core T7-like phages and three cyanophage P60-like phages are shown in red and green, respectively. The sewage sequences amplified in this study are shown in blue. The SEWAGE clade represents the compressed view of 49 closely related sequences recovered from sewage in this study. Internal nodes with bootstrap support ≥ 70% are shown with the corresponding bootstrap value indicated. The scale bar represents the number of nucleotide substitutions per site.