| Literature DB >> 29951046 |
Richard Allen White Iii1,2,3,4,5, Hon L Wong4,6, Rendy Ruvindy4,6, Brett A Neilan4,6, Brendan P Burns4,6.
Abstract
Single stranded DNA viruses have been previously shown to populate the oceans on a global scale, and are endemic in microbialites of both marine and freshwater systems. We undertook for the first time direct viral metagenomic shotgun sequencing to explore the diversity of viruses in the modern stromatolites of Shark Bay Australia. The data indicate that Shark Bay marine stromatolites have similar diversity of ssDNA viruses to that of Highbourne Cay, Bahamas. ssDNA viruses in cluster uniquely in Shark Bay and Highbourne Cay, potentially due to enrichment by phi29-mediated amplification bias. Further, pyrosequencing data was assembled from the Shark Bay systems into two putative viral genomes that are related to Genomoviridae family of ssDNA viruses. In addition, the cellular fraction was shown to be enriched for antiviral defense genes including CRISPR-Cas, BREX (bacteriophage exclusion), and DISARM (defense island system associated with restriction-modification), a potentially novel finding for these systems. This is the first evidence for viruses in the Shark Bay stromatolites, and these viruses may play key roles in modulating microbial diversity as well as potentially impacting ecosystem function through infection and the recycling of key nutrients.Entities:
Keywords: BREX; CRISPR-Cas; Shark Bay; ssDNA viruses; stromatolites; viral defense; viral metagenomics
Year: 2018 PMID: 29951046 PMCID: PMC6008428 DOI: 10.3389/fmicb.2018.01223
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Metagenomic statistics including read analysis, assembly stats, and annotations for MetaVir2 and MG-RAST.
| Reads | Viral fraction | Cellular fraction | Assembly | Viral fraction |
|---|---|---|---|---|
| Number | 92298 | 73371 | Number | 504 |
| Total length (bp) | 39623558 | 31023655 | Total length (bp) | 149063 |
| Average (bp) | 429 | 423 | Average (bp) | 295 |
| GC% | 48% | 44% | N50 (bp) | 353 |
| Median (bp) | 172 | |||
| Largest (bp) | 4099 | |||
| Number | 62294 | 59805 | ||
| Artificial duplicate reads | 23699 | 8842 | Number | 49 |
| Total length (bp) | 28413896 | 26636206 | Total length (bp) | 57537 |
| Average (bp) | 456 | 445 | Average (bp) | 1174 |
| GC% | 47% | 44% | N50 (bp) | 1473 |
| Median (bp) | 924 | |||
| Largest (bp) | 4099 | |||
| Predicted protein features | 39127 | 50281 | ||
| Predicted rRNA features | 3321 | 3746 | ||
| Identified protein features | 2452 | 23704 | ||
| Identified rRNA features | 0 | 64 | Number | N/A |
| Identified functional categories | 2033 | 21025 | Total length (bp) | N/A |
| Failed QC (duplicates/length) | 30,004 (32.51%) | 13,566 (18.49%) | Average (bp) | N/A |
| Unknown | 1,000 (1.08%) | 197 (0.27%) | N50 (bp) | N/A |
| Predicted feature | 61,294 (66.41%) | 59,608 (81.24%) | Median (bp) | N/A |
| Unknown protein | 30,514 (49.78%) | 20,191 (33.87%) | Largest (bp) | N/A |
| Annotated protein | 30,780 (50.22%) | 38,473 (64.54%) | ||
| Ribosomal RNA | 0 (0.00%) | 944 (1.58%) | Number | N/A |
| Total length (bp) | N/A | |||
| Average (bp) | N/A | |||
| 50 on score | 6.55% | 10.62% | N50 (bp) | N/A |
| 9.04% | 15.50% | Median (bp) | N/A | |
| 7.42% | 12.57% | Largest (bp) | N/A | |
| 6.53% | 10.54% | |||
KEGG ontology (KO) EC numbers for the Shark Bay virome annotations reads.
| EC description | EC number | Abundance |
|---|---|---|
| DNA (cytosine-5-)-methyltransferase | 2.1.1.37 | 36 |
| Ribonucleoside-diphosphate reductase | 1.17.4.1 | 11 |
| DNA helicase | 3.6.4.12 | 8 |
| GDP-mannose 4,6-dehydratase | 4.2.1.47 | 5 |
| Riboflavin kinase | 2.7.1.26 2.7.7.2 | 5 |
| Methyltransferases | 2.1.1.- | 4 |
| Carbon-oxygen lyases/Hydro-lyases | 4.2.1.- | 3 |
| Spermidine dehydrogenase | 1.5.99.6 | 3 |
| DNA-directed DNA polymerase | 2.7.7.7 | 2 |
| Nicotinamidase | 3.5.1.19 3.5.1.- | 2 |
| Ribonucleoside-triphosphate reductase | 1.17.4.2 | 2 |
| Nucleotidyltransferases | 2.7.7.- | 2 |
| Thymidylate synthase | 2.1.1.148 | 2 |
| Sarcosine oxidase | 1.5.3.1 | 2 |
| Leucyl aminopeptidase | 3.4.11.1 | 2 |
| Xanthine dehydrogenase | 1.17.1.4 | 1 |
| Aspartate carbamoyltransferase | 2.1.3.2 | 1 |
| Deoxyuridine-triphosphatase | 3.6.1.23 | 1 |
| Histidine permease | 3.6.3.21 | 1 |
| Amidophosphoribosyltransferase | 2.4.2.14 | 1 |
| Indolepyruvate decarboxylase | 4.1.1.74 | 1 |
| NAD+ synthase (glutamine-hydrolysing) | 6.3.5.1 | 1 |
| Allantoicase | 3.5.3.4 | 1 |
| Peptidyl-dipeptidase A | 3.4.15.1 | 1 |
| Dihydrofolate reductase | 1.5.1.3 2.1.1.45 | 1 |
| GDP- | 1.1.1.271 | 1 |
| DNA-directed RNA polymerase | 2.7.7.6 | 1 |
| UDP- | 2.5.1.7 | 1 |
| Tryptophan synthase | 4.2.1.20 | 1 |
| 3-oxoacyl-[acyl-carrier-protein] reductase | 1.1.1.100 | 1 |
| Oxidoreductases with NAD+ or NADP+ as acceptors | 1.1.1.- | 1 |
| Protoporphyrinogen oxidase | 1.3.3.4 | 1 |
BLAST results against NCBI against long contigs and putative viral genomes in the Shark Bay viral fraction.
| Gene | Max score | Total score | Query cover % | Ident % | Accession | ||
|---|---|---|---|---|---|---|---|
| Uncultured virus | Viral replication-associated protein (Rep) | 172 | 172 | 56.00 | 1.00E-44 | 34.00 | AUM61732.1 |
| Uncultured virus | Viral replication-associated protein (Rep) | 171 | 171 | 57.00 | 4.00E-43 | 33.00 | AUM62051.1 |
| Sewage-associated circular DNA virus-30 | Viral replication-associated protein (Rep) | 166 | 166 | 52.00 | 2.00E-41 | 32.00 | YP_009117070.1 |
| Sewage-associated circular DNA virus-18 | Viral replication-associated protein (Rep) | 170 | 170 | 50.00 | 2.00E-43 | 31.00 | YP_009116898.1 |
| Uncultured virus | Viral replication-associated protein (Rep) | 169 | 169 | 47.00 | 1.00E-42 | 33.00 | AUM61781.1 |
| Uncultured virus | Viral replication-associated protein (Rep) | 164 | 164 | 49.00 | 2.00E-41 | 34.00 | AUM61982.1 |
| Ralstonia picketti | Hypothetical protein | 636 | 636 | 55.00 | 0 | 99.00 | WP_024972784.1 |
| Cellulophaga phage phi47:1 | Hypothetical protein CDPG_00080 | 629 | 629 | 54.00 | 0 | 99.00 | AGF91683.1 |
| Cellulophaga phage phi47:1 | Hypothetical protein CDPG_00081 | 276 | 276 | 27.00 | 3.00E-87 | 100.00 | AGF91684.1 |
| contig-1000007_1202 | |||||||
| Pseudanabaena sp. ’Roaring Creek | Hypothetical protein | 45.8 | 45.8 | 13.00 | 0.019 | 45.00 | WP_055077263.1 |
| Synechococcus sp. PCC 7502 | Hypothetical protein | 42 | 42 | 11.00 | 0.093 | 50.00 | WP_015169903.1 |
| Oscillatoriales cyanobacterium USR001 | Hypothetical protein BCD67_24715 | 39.3 | 39.3 | 15.00 | 2.1 | 31.00 | OCQ97517.1 |
| Nocardia transvalensis | Patatin | 41.2 | 41.2 | 22.00 | 3.7 | 34.00 | WP_040746262.1 |
| Uncultured prokaryote | Hypothetical protein | 48.1 | 48.1 | 11.00 | 0.037 | 27.00 | CRY97485.1 |
| Uncultured prokaryote | Hypothetical protein | 41.6 | 41.6 | 3.00 | 5.3 | 46.00 | CRY96835.1 |
| Recombinase family protein | 42.7 | 42.7 | 4.00 | 6.7 | 39.00 | WP_084599775.1 | |
| contig-2000010_3827 | |||||||
| Uncultured prokaryote | Hypothetical protein | 75.9 | 75.9 | 11.00 | 6.00E-11 | 34.00 | CRY96346.1 |
| Hypothetical protein | 72.4 | 72.4 | 4.00 | 2.00E-09 | 57.00 | OLA80278.1 | |
| Hypothetical protein | 69.7 | 69.7 | 4.00 | 7.00E-09 | 52.00 | WP_082813420.1 | |
| DNA ligase-associated DEXH box helicase | 42.4 | 42.4 | 18.00 | 4.5 | 29.00 | WP_076628122.1 | |
| MULTISPECIES: serine/threonine protein kinase | 42 | 42 | 19.00 | 7.1 | 28.00 | WP_103536509.1 | |
| Contig-9_1603 | |||||||
| No hits | |||||||
| Contig-8_1554 | |||||||
| No hits | |||||||
| Contig-7_1473 | |||||||
| No hits | |||||||
BLAST results against NCBI for phoH sequences in the Shark Bay viral fraction.
| phoH sequences | Gene name | Species | Max score | Total score | Query cover % | Ident % | Accession | |
|---|---|---|---|---|---|---|---|---|
| Sequence VIR56683 9-293 | PhoH family protein | 110 | 110 | 96.00 | 1.00E-26 | 56.00 | WP_102408772.1 | |
| PhoH family protein | 110 | 110 | 96.00 | 2.00E-26 | 58.00 | WP_006744458.1 | ||
| PhoH family protein | 110 | 110 | 97.00 | 2.00E-26 | 50.00 | WP_045087935.1 | ||
| Sequence VIR437-141 | hypothetical protein D478_16524 | 112 | 112 | 98.00 | 3.00E-30 | 56.00 | ELK40902.1 | |
| PhoH family protein | 119 | 119 | 98.00 | 1.00E-29 | 60.00 | WP_026896053.1 | ||
| ATPase | 107 | 107 | 98.00 | 3.00E-28 | 54.00 | GAE45877.1 | ||
| Sequence VIR04336 12-416 | PhoH family protein | 148 | 148 | 98.00 | 3.00E-40 | 52.00 | WP_075517747.1 | |
| PhoH family protein | 147 | 147 | 100.00 | 1.00E-39 | 52.00 | WP_026896053.1 | ||
| PhoH family protein | 145 | 145 | 99.00 | 3.00E-39 | 50.00 | WP_091657652.1 | ||