| Literature DB >> 26713222 |
Anne G Rosenwald1, Bradley Murray1, Theodore Toth1, Ramana Madupu2, Alexandra Kyrillos1, Gaurav Arora1.
Abstract
Chlamydia-infecting bacteriophages, members of the Microviridae family, specifically the Gokushovirinae subfamily, are small (4.5-5 kb) single-stranded circles with 8-10 open-reading frames similar to E. coli phage ϕX174. Using sequence information found in GenBank, we examined related genes in Chlamydophila pneumoniae and Chlamydia-infecting bacteriophages. The 5 completely sequenced C. pneumoniae strains contain a gene orthologous to a phage gene annotated as the putative replication initiation protein (PRIP, also called VP4), which is not found in any other members of the Chlamydiaceae family sequenced to date. The C. pneumoniae strain infecting koalas, LPCoLN, in addition contains another region orthologous to phage sequences derived from the minor capsid protein gene, VP3. Phylogenetically, the phage PRIP sequences are more diverse than the bacterial PRIP sequences; nevertheless, the bacterial sequences and the phage sequences each cluster together in their own clade. Finally, we found evidence for another Microviridae phage-related gene, the major capsid protein gene, VP1 in a number of other bacterial species and 2 eukaryotes, the woodland strawberry and a nematode. Thus, we find considerable evidence for DNA sequences related to genes found in bacteriophages of the Microviridae family not only in a variety of prokaryotic but also eukaryotic species.Entities:
Keywords: gokushovirinae; horizontal gene transfer; microviridae; putative replication initiation protein (PRIP)
Year: 2014 PMID: 26713222 PMCID: PMC4589997 DOI: 10.4161/21597073.2014.965076
Source DB: PubMed Journal: Bacteriophage ISSN: 2159-7073
Chlamydia pneumoniae sequences used in this study
| Strain | Host | DNA Sequence | Accession Number |
|---|---|---|---|
| AR39 | Human | NC_002179 | ref. 28 |
| CWL029 | Human | NC_000922 | ref. 44 |
| J138 | Human | NC_002491 | ref. 12 |
| TW-183 | Human | NC_005043 | (unpublished) |
| LPCoLN | Koala | CP001713 | ref. 45 |
Host ranges of the six Chlamydia phages
| Chp1 | Chp2 | Chp3 | Chp4 | CPAR39 | CPG1 | |
|---|---|---|---|---|---|---|
| no | no | |||||
| no | no | |||||
| no | no | |||||
| yes | yes | |||||
| no | yes | yes | ||||
| yes | yes | no | ||||
| yes | yes | |||||
| no | no | |||||
| yes | no | no |
Information gleaned from the literature about host ranges of the 6 Chlamydia bacteriophages. YES indicates the species from which the phage was isolated; yes indicates that the phage can infect that species; no indicates that the phage does not infect that species; no entry means was no information available. Information about the following phages was obtained from: ϕChp1; ϕChp2; ϕChp3; ϕChp4; ϕCPAR39; and ϕCPG1.
Figure 1.Comparison among Chlamydia bacteriophages. (A) Matrix derived from MUSCLE analysis of the phage DNA sequences. The matrix shows the extent of identity between 2 phage sequences in pairwise alignment. (Chp1 shows ∼60% identity to the other phages, whereas the others show >90% identity to one another. (B) Phylogenetic analysis of the 6 phage genomes. The evolutionary history was inferred by using the maximum likelihood method based on the Hasegawa-Kishino-Yano model using 1000 bootstraps. The tree with the highest log likelihood (−11567.9705) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically as follows. When the number of common sites was < 100 or less than one fourth of the total number of sites, the maximum parsimony method was used; otherwise BIO neighbor joining method with maximum composite likelihood distance matrix was used. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 5.9605)). The analysis involved 6 nucleotide sequences. All positions containing gaps and missing data were eliminated. There were a total of 3729 positions in the final dataset. Evolutionary analyses were conducted in MEGA5.
Figure 2.Phylogenetic Analysis of the 11 PRIP Proteins. The evolutionary history was inferred by using the maximum likelihood method based on the Whelan and Goldman model. The tree with the highest log likelihood (−713.8725) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically by applying neighbor-join and BioNJ algorithms to a matrix of pairwise distances estimated using a JTT model, and then selecting the topology with superior log likelihood value. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 11 amino acid sequences. All positions containing gaps and missing data were eliminated. There were a total of 88 positions in the final data set. Evolutionary analyses were conducted in MEGA5.
Figure 3.Comparison of the 5 C. pneumoniae strains in the region containing the PRIP gene. Ten kb regions centered around the PRIP gene for each of the 5 sequenced C. pneumoniae strains were compared to each other using the WebACT (Artemis Comparison Tool). Light blue arrows represent the open-reading frames on each sequence; PRIP is outlined with a heavier black line. The red chords between sequences signify homology in those regions; blue chords show homology to the opposite strand. In LPCoLN and AR39, PRIP is found on the bottom strand, while in CWL029, J138, and TW183, PRIP is on the top strand. The sequence in LPCoLN that does not match anything in the other strains corresponds to the additional phage sequences similar to VP3 not found in the human-infecting strains.
VP1 homologs found in non-Chlamydia bacterial and plant species The protein sequence corresponding to VP1 from Chp4 (YP_338238.1) was submitted for BLASTP analysis. The top hits with e-values ∼<1e-50 are reported here. Hits to “uncultured bacteria” have been eliminated. Most bacteria are members of the Bacteroidetes family (Parabacteroides and Elizabethkingia) or the Firmicutes family (Clostridium) except for Richelia (a cyanobacterium). Surprisingly, 2 eukaryotes also had reasonably high e-values with >50% query coverage: Fragaria vesca (woodland strawberry) and Necator americanus (a nematode)
| Name | Max Score | Total Score | Query Cover | e-value | Max identity | Accession Number |
|---|---|---|---|---|---|---|
| capsid family protein [ | 336 | 336 | 97% | 2e-104 | 37% | KDS61824.1 |
| PREDICTED: capsid protein VP1-like [ | 321 | 321 | 64% | 3e-100 | 50% | XP_004309312.1 |
| putative major coat protein [ | 324 | 324 | 96% | 5e-100 | 36% | WP_022246927.1 |
| hypothetical protein [ | 313 | 313 | 97% | 1e-95 | 38% | WP_005867318.1 |
| capsid protein VP1 [ | 309 | 309 | 97% | 4e-94 | 36% | WP_022322420.1 |
| conserved hypothetical protein [ | 293 | 293 | 96% | 5e-88 | 34% | CDN73650.1 |
| hypothetical protein [ | 292 | 292 | 96% | 9e-88 | 35% | WP_024568106.1 |
| protein [ | 271 | 271 | 40% | 1e-83 | 59% | WP_008233569.1 |
| capsid family protein [ | 259 | 259 | 74% | 8e-77 | 38% | KDS75238.1 |
| capsid family protein [ | 206 | 206 | 46% | 2e-58 | 42% | KDS63784.1 |
| putative capsid protein [ | 188 | 188 | 88% | 2e-49 | 33% | ETN80368.1 |
Other Chlamydia spp. sequences used in this study
| Strain | DNA Sequence | Accession Number |
|---|---|---|
| NC_002620 | ref. 46 | |
| NC_000117 | ref. 47 | |
| NC_010287 | ref. 48 | |
| NC_004552 | ref. 49 | |
| NC_003361 | ref. 50 | |
| NC_003361 | ref. 51 | |
| NC_015408 | ref. 52 | |
| NC_015470 | ref. 53 |
Note – no complete sequence of Chlamydia suis is currently available at GenBank.
Chlamydia phage sequences used in this study
| Name | Isolated From | DNA Sequence | Accession Number |
|---|---|---|---|
| ϕChp1 | NC_001741 | ref. 16 | |
| ϕChp2 | NC_002194 | ref. 14 | |
| ϕChp3 | NC_008355 | ref. 21 | |
| ϕChp4 | NC_007461 | ref. 22 | |
| ϕCPAR39 | NC_002180 | ref. 15 | |
| ϕCPG1 | NC_001998 | (unpublished) |