| Literature DB >> 27302758 |
Caroline Chénard1, Jennifer F Wirth1, Curtis A Suttle2.
Abstract
UNLABELLED: Here we present the first genomic characterization of viruses infecting Nostoc, a genus of ecologically important cyanobacteria that are widespread in freshwater. Cyanophages A-1 and N-1 were isolated in the 1970s and infect Nostoc sp. strain PCC 7210 but remained genomically uncharacterized. Their 68,304- and 64,960-bp genomes are strikingly different from those of other sequenced cyanophages. Many putative genes that code for proteins with known functions are similar to those found in filamentous cyanobacteria, showing a long evolutionary history in their host. Cyanophage N-1 encodes a CRISPR array that is transcribed during infection and is similar to the DR5 family of CRISPRs commonly found in cyanobacteria. The presence of a host-related CRISPR array in a cyanophage suggests that the phage can transfer the CRISPR among related cyanobacteria and thereby provide resistance to infection with competing phages. Both viruses also encode a distinct DNA polymerase B that is closely related to those found in plasmids of Cyanothece sp. strain PCC 7424, Nostoc sp. strain PCC 7120, and Anabaena variabilis ATCC 29413. These polymerases form a distinct evolutionary group that is more closely related to DNA polymerases of proteobacteria than to those of other viruses. This suggests that the polymerase was acquired from a proteobacterium by an ancestral virus and transferred to the cyanobacterial plasmid. Many other open reading frames are similar to a prophage-like element in the genome of Nostoc sp. strain PCC 7524. The Nostoc cyanophages reveal a history of gene transfers between filamentous cyanobacteria and their viruses that have helped to forge the evolutionary trajectory of this previously unrecognized group of phages. IMPORTANCE: Filamentous cyanobacteria belonging to the genus Nostoc are widespread and ecologically important in freshwater, yet little is known about the genomic content of their viruses. Here we report the first genomic analysis of cyanophages infecting filamentous freshwater cyanobacteria, revealing that their gene content is unlike that of other cyanophages. In addition to sharing many gene homologues with freshwater cyanobacteria, cyanophage N-1 encodes a CRISPR array and expresses it upon infection. Also, both viruses contain a DNA polymerase B-encoding gene with high similarity to genes found in proteobacterial plasmids of filamentous cyanobacteria. The observation that phages can acquire CRISPRs from their hosts suggests that phages can also move them among hosts, thereby conferring resistance to competing phages. The presence in these cyanophages of CRISPR and DNA polymerase B sequences, as well as a suite of other host-related genes, illustrates the long and complex evolutionary history of these viruses and their hosts.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27302758 PMCID: PMC4916379 DOI: 10.1128/mBio.00667-16
Source DB: PubMed Journal: mBio Impact factor: 7.867
FIG 1 Comparative genomics of the two Nostoc cyanophages and the prophage-like element in the genome of Nostoc sp. strain PCC 7524. The genomes are presented as linear molecules for better comparison. A, cyanophage A-1; B, cyanophage N-1; C, prophage-like element in the genome of Nostoc sp. strain PCC 7524 (accession no. NC019684.1). Gene abbreviations and functions are as follows: ant, antirepressor; BP, baseplate; bnr, bnr/Asp-box repeat protein; dctp, dCTP deaminase; dhns, dihydroxynaphthoate synthase; exo, exonuclease; hel, helicase; incl, viral A inclusion factor; L, lysozyme; lipA, lipoprotein A; luX, LuxR transcription factor, lyz, lysozyme; MCP, major capsid protein; meth, methylase; ori, origin of replication; pep, peptidase; psp, phage shock protein; pol, DNA Pol B; prim, primase; T, tail; terL, terminase large subunit; thk, thymidylate kinase; ths, thymidylate synthase; trans, transposase.
Predicted ORFs in cyanophages A-1 and N-1 with similarity to T4-like genes
| T4 core gene product | CyanophageA-1 | Cyanophage N-1 | ||
|---|---|---|---|---|
| E value | Phage hit | E value | Phage hit | |
| gp5 baseplate hub + tail lysozyme | 4.00e−5 | Phage RB49 | 4.00e−5 | Cyanophage S-PM2 |
| gp17 terminase DNA-packaging enzyme large subunit | 4.00e−2 | Cyanophage S-PM2 | 4.00e−2 | Cyanophage S-PM2 |
| Gp41 DNA primase-helicase | 2.00e−13 | Cyanophage S-PM2 | 1.00e−11 | Cyanophage S-RSM4 |
| gp43 DNA Pol B | 4.00e−5 | Phage Aeh1 | 9.00e−6 | Phage Aeh1 |
| Thymidylate synthase | 1.00e−35 | Cyanophage S-CRM01 | 9.00e−32 | Cyanophage P-SSM4 |
FIG 2 Sequence logos of the promoter motifs predicted from alignments of the 5′ upstream regions (top) and putative promoter for cyanophage A-1 showing a sequence logo created from an alignment of 43 sequences (bottom). The putative promoter for cyanophage N1 shows a sequence logo created from an alignment of 21 sequences. The heights of the letters are proportional to the levels of sequence conservation of the nucleotides at the respective positions.
FIG 3 Unrooted ML phylogenetic tree of DNA Pol B protein sequences found in viruses, bacteria, and archaea. Bootstrap values of 90 to 100% (black circles) and 75 to 89 % (gray circles) are shown at the nodes. The sequence names are colored as follows: black, Bacteria and Archaea; light blue, marine cyanomyoviruses; dark blue, T4-like myoviruses; light green, Nostoc phages; dark green, cyanobacterial plasmid. The scale bar shows the number of amino acid substitutions per site.
FIG 4 Phylogenetic relationship of terminase large subunit protein sequences from phages. A ML tree is shown with bootstrap values of 90 to 100% (black circles) and 75 to 89 % (gray circles) shown at the nodes. The sequence names are colored as follows: light green, Nostoc cyanophages; dark green, cyanobacteria; light blue, other cyanophages; black, other viruses.
Predicted ORFs in cyanophage A-1 with high similarity to cyanobacterial genes
| ORF | Length (bp) | Strand | Significant hit | Organism (accession no.) | E value | % Identity (no. of shared aa) |
|---|---|---|---|---|---|---|
| 1 | 1,359 | F | Terminase large subunit | 2.28e−54 | 30 (129) | |
| 2 | 1,605 | F | Hypothetical protein | 7.16e−17 | 21 (113) | |
| 9 | 1,521 | F | Tail sheath protein | 1.14e−69 | 38 (137) | |
| 16 | 1,416 | R | Lysozyme-like domain, rare lipoprotein A | 2.3e−19 | 57.7 (60) | |
| 19 | 738 | R | Hypothetical protein | 9.39e−11 | 29 (67) | |
| 23 | 891 | R | Exonuclease RNase T and DNA polymerase | 1.63e−08 | 27 (48) | |
| 26 | 813 | F | Phage-related baseplate assembly protein | 1.46e−32 | 34 (91) | |
| 32 | 738 | F | Baseplate J phage tail | 3.76e−33 | 41 (94) | |
| 34 | 576 | F | Phage tail protein (tail_P2_I) | 5.63e−40 | 59 (77) | |
| 35 | 1,389 | F | Hypothetical protein | 4.83e−57 | 37 (157) | |
| 36 | 1,140 | F | Tail collar protein | 2.1e−52 | 50 (149) | |
| 39 | 2,016 | R | DNA Pol B | 7.00e−115 | 40.4 (237) | |
| 54 | 939 | R | Putative ant AntA/AntB antirepressor | 1.90e−24 | 45 (51) | |
| 59 | 618 | R | DNA | 1.00e−06 | 27.3 (44) | |
| 72 | 1,209 | F | Transposase | 0 | 388 (100) | |
| 75 | 417 | F | Hypothetical protein | 2.59e−48 | 58 (98) | |
| 86 | 600 | F | dCTP deaminase/dUTPase superfamily | 2.00e−76 | 67.3 (134) | |
| 89 | 888 | F | DNA methylase N-4/N-6 domain protein | 1.00e−65 | 49.4 (133) | |
| 90 | 432 | F | Endo-DNase RusA | 8.00e−04 | 28.4 (29) | |
| 92 | 780 | R | Thymidylate synthase complementing protein | 4.38e−39 | 38.4 (86) |
F, forward; R, reverse.
aa, amino acids.
Predicted ORFs in cyanophage N-1 with high similarity to cyanobacterial genes
| ORF | Length (bp) | Strand | Significant hit | Organism (accession no.) | E value | % Identity (no. |
|---|---|---|---|---|---|---|
| 1 | 1,359 | F | Phage terminase, large subunit | 2.32e−55 | 31 (134) | |
| 2 | 1,602 | F | Hypothetical protein | 3.55e−18 | 22 (106) | |
| 6 | 330 | F | Hypothetical protein | 9.62e−06 | 30 (30) | |
| 9 | 1,521 | F | Tail sheath protein | 3.74e−74 | 39 (138) | |
| 15 | 2,694 | R | Lysozyme-like domain, rare lipoprotein A | 1.48e−37 | 28 (137) | |
| 18 | 849 | R | Hypothetical protein | 9.82e−10 | 27.9 (51) | |
| 19 | 918 | F | Exonuclease RNase T and DNA Pol III | 4.14e−04 | 28.4 (38) | |
| 29 | 330 | F | Lysozyme | 2.51e−03 | 24 (32) | |
| 30 | 1,167 | F | Baseplate J phage tail protein | 4.18e−68 | 43 (144) | |
| 31 | 576 | F | Phage tail protein | 2.32E+00 | 26.3 (45) | |
| 32 | 1,395 | F | Phage tail fiber protein | 2.97e−64 | 37 (172) | |
| 33 | 1,155 | F | Tail collar protein | 1.62e−47 | 33 (137) | |
| 34 | 1,896 | R | DNA Pol B | 7.0e−118 | 41.5 (243) | |
| 48 | 600 | R | Hypothetical protein | 1.10e−06 | 21.4 (40) | |
| 52 | 633 | R | C-5 cytosine-specific DNA methylase | 1.81e−05 | 36.2 (34) | |
| 56 | 240 | R | ASCH domain protein | 4.06e−12 | 42.3 (33) | |
| 62 | 627 | F | Thymidylate kinase | 2.45e−23 | 36 (74) | |
| 68 | 1,047 | F | DNA-cytosine methyltransferase | 1.02e−56 | 33.2(127) | |
| 75 | 579 | R | Hypothetical protein | 8.67e−03 | 32.7 (33) | |
| 76 | 507 | R | Hypothetical protein | 1.73e−47 | 56 (94) | |
| 77 | 726 | R | Hypothetical protein | 4.94e−06 | 30.4 (78) | |
| 79 | 585 | F | dCTP deaminase | 1.45e−69 | 65.5 (131) |
F, forward; R, reverse.
aa, amino acids.
FIG 5 Characterization of the CRISPR in cyanophage N-1. (A) The CRISPR in cyanophage N-1 consists of five DRs (light gray boxes), four spacers (white boxes), and a leader sequence (dark gray box). The CRISPR array is surrounded by ORFs that putatively encode an exonuclease (blue arrows) or are hypothetical (black arrows). (B) A logo illustrating nucleotide differences in an alignment of the consensus DR from cyanophage N-1 and the consensus DR from CRISPR arrays 13 and 14 in Nostoc sp. strain PCC 7210. Differences are shown by placement of the letters for the nucleotides of Nostoc on the top and those for cyanophage N-1 on the bottom of the logo. (C) An unrooted neighbor-joining tree shows that the consensus DR in cyanophage N-1 (orange) is not most closely related to those found in its known hosts, Nostoc sp. strain PCC 7210 (green) and A. variabilis ATCC 29413 (light blue). Consensus DRs from other cyanobacteria, including those that are most closely related in Calothrix sp. strain PCC 6303, are shown in black. The scale bar represents 0.03 nucleotide change. (D) Schematic of CRISPR 8 in Nostoc sp. strain PCCC 7210. The CRISPR array consists of DRs (light gray boxes), seven spacers (white boxes), and a leader sequence (dark gray box). The CRISPR array is surrounded by CAS genes (purple arrows).
BLAST results for the DRs from the CRISPR array present in the genome of cyanophage N-1
| Cyanobacterial stain (CRISPR ID) | E value | Start | End | DR consensus sequence |
|---|---|---|---|---|
| 3.0e−08 | 2087079 | 2087333 | GTCTGAATTCCATATAATCCCTATCAGGGATTGAAAC | |
| 1.0e−07 | 3131505 | 3132053 | GTTTAAATTCCACTTAATCCCTATCAGGGATTGAAAC | |
| 1.0e−07 | 3859458 | 3860565 | GTTTCAATCCCTGATAGGGATTAAGTGGAATTTAAAC | |
| 1.0e−07 | 2916270 | 2917173 | GTTTAAATTCCACTTAATCCCTATCAGGGATTGAAAC | |
| 1e−07 | 5817107 | 5818597 | GTTGCAATTTCTATTAATCCCTATCAGGGATTGAAAC | |
| 8.0e−07 | 1234556 | 1237182 | GTTTTAATTAACAAAAATCCCTATCAGGGATTGAAAC | |
| 8.0e−07 | 3516819 | 3517367 | GTTTCAATCCCTGATAGGGATTTTTGTTAGTTAAAAC | |
| 8.0e−07 | 3517542 | 3518084 | GTTTCAATCCCTGATAGGGATTTTTGTTAGTTAAAAC | |
| 4e−07 | 5001036 | 5003639 | GTTTCAATCCCTAATAGGGATTATTTGAAATTTCAAC | |
| 4e−07 | 1101502 | 1103134 | GTTACAATTCACCCAAATCCCTATCAGGGATTGAAAC | |
| 4e−07 | 2021864 | 2022765 | GTTCCTATAAACTAAAATCCCTATCAGGGATTGAAAC | |
| 4e−07 | 2038085 | 2039287 | GTTCCTATAAACTAAAATCCCTATCAGGGATTGAAAC | |
| 4e−07 | 5067421 | 5070798 | GTTTCAATCCCTGATAGGGATTTAAGTTAATTGGAAC | |
| 4e−07 | 3375025 | 3376306 | GTTTCAATCCCTGATAGGGATTTAAGTTAATTGGAAC | |
| 3.0e−07 | 3338172 | 3341197 | GTTTCAATCCCTGATAGGGATTTTGATGAATTGCAAT | |
| 3.0e−07 | 1836813 | 1837723 | GTTTCTATTAACACAAATCCCTATCAGGGATTGAAAC | |
| 3.0e−07 | 807452 | 807558 | ATTGCAATTAACTAAAATCCCTATCAGGGATTGAAAC | |
| 3.0e−07 | 5764010 | 5766428 | ATTGCAATTAACTAAAATCCCTATCAGGGATTGAAAC | |
| 3.0e−07 | 5654133 | 5654384 | GTTAAAACCCTCTAAAATCCCTATCAGGGATTGAAAC | |
| 3.0e−07 | 2395670 | 2398703 | GTTTCAATCCCTGATAGGGATTTTAGAGGGTTTTAAC | |
| 1.0e−06 | 5227213 | 5229282 | GTTTCTATTAACACAAATCCCTATCAGGGATTGAAAG | |
| 4.0e−06 | 3575124 | 3575742 | GTTACAATTAAAATGAATCCCTATTAGGGATTGAAAC | |
| 4.0e−06 | 4821250 | 4823752 | GTTGCAACACCACATAATCCCTATTAGGGATTGAAAC |