| Literature DB >> 30678352 |
Qingwei Yang1, Chen Gao2, Yong Jiang3,4,5, Min Wang6,7,8, Xinhao Zhou9, Hongbing Shao10, Zheng Gong11, Andrew McMinn12,13.
Abstract
Viruses are the most abundant biological entities in aquatic ecosystems and harbor an enormous amount of genetic diversity. Whereas their influence on marine ecosystems is widely acknowledged, current information about their diversity remains limited. We conducted a viral metagenomic analysis of water samples collected during the austral summer of 2016 from the South Scotia Ridge (SSR), near the Antarctic Peninsula. The taxonomic composition and diversity of the viral communities were investigated, and a functional assessment of the sequences was performed. Phylotypic analysis showed that most viruses belonged to the order Caudovirales, especially the family Podoviridae (41.92⁻48.7%), which is similar to the situation in the Pacific Ocean. Functional analysis revealed a relatively high frequency of phage-associated and metabolism genes. Phylogenetic analyses of phage TerL and Capsid_NCLDV (nucleocytoplasmic large DNA viruses) marker genes indicated that many sequences associated with Caudovirales and NCLDV were novel and distinct from known phage genomes. High Phaeocystis globosa virus virophage (Pgvv) signatures were found and complete and partial Pgvv-like were obtained, which influence host⁻virus interactions. Our study expands existing knowledge of viral communities and their diversities from the Antarctic region and provides basic data for further exploring polar microbiomes.Entities:
Keywords: Pgvv-like; South Scotia Ridge; diversity; viral community; virus
Mesh:
Year: 2019 PMID: 30678352 PMCID: PMC6410227 DOI: 10.3390/v11020095
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
The remaining number of quality-controlled reads and non-redundancy contigs. Q20: quality score of 20; Q30: quality score of 30.
| Process | SSR Viromes | |||
|---|---|---|---|---|
| D39s | DA4s | DA4b | ||
|
| Raw reads | 4,2547,324 | 43,791,908 | 43,371,374 |
| Cut adaptor | 40,670,620 (95.59%) | 41,883,752 (95.64%) | 41,353,540 (95.35%) | |
| Q20 >20% | 35,357,306 (83.10%) | 37,567,224 (85.79%) | 38,397,082 (88.53%) | |
| Q30 >30% | 35,334,370 (83.05%) | 36,385,270 (83.09%) | 38,203,624 (88.08%) | |
|
| All assembled contigs (>500 bp) | 2,418,081 | 3,699,559 | 1,693,019 |
| Non-redundancy contigs | 145,023 | 135,910 | 234,648 | |
|
| Mapped reads | 15,307,526 (43.32%) | 12,760,519 (35.07%) | 21,197,289 (55.49%) |
Figure 1Taxonomic assignment of metagenomic reads (a) Percentage of the sequence reads classified by the taxonomic grouping based on BLASTX similarity search with NCBI nr database (E-value < 10−3). Sequences with no hits with E-value > 10−3 were regarded as unidentified reads (“no annotation” category in the table and red in the pie graphs). “LUCA” (last universal common ancestor) (green) denotes reads that could not unambiguously be assigned to a domain of life. (b) Taxonomic assignment of metagenomic reads based on BLASTX similarity search with viral RefSeq database (E-value < 10−3). (c) Comparison of the annotation results of NCBI nr and viral RefSeq.
Classification of reads from viromes hitting viral sequences. ssDNA: single-stranded DNA; dsDNA: double-stranded DNA.
| Group | Order | Family | Relative Abundance (%) | ||
|---|---|---|---|---|---|
| D39s | DA4s | DA4b | |||
| dsDNA |
|
| 41.92 | 48.70 | 42.15 |
|
|
| 28.34 | 22.92 | 29.46 | |
|
|
| 11.92 | 13.38 | 14.08 | |
|
| unclassified | 3.56 | 2.99 | 2.90 | |
|
|
| 3.57 | 2.22 | 1.32 | |
| -- |
| 0.16 | 0.22 | 0.10 | |
| -- |
| 0.14 | 0.22 | 1.13 | |
| -- |
| 0.37 | 0.44 | 0.30 | |
| unclassified | 3.13 | 2.84 | 3.17 | ||
| ssDNA | -- |
| 0.10 | 0.13 | 0.12 |
| -- |
| 0.00 | 0.10 | 0.04 | |
| -- |
| 0.01 | 0.11 | 0.01 | |
| -- | unclassified | 0.09 | 0.05 | 0.05 | |
| virophage | -- | 2.00 | 0.58 | 0.19 | |
| others |
|
| 0.26 | 0.29 | 0.21 |
|
|
| 0.07 | 0.76 | 0.04 | |
| Unclassified phage/viruses | -- | 3.71 | 3.01 | 4.07 | |
Taxonomic characterization of South Scotia Ridge (SSR) viromes and twenty previously published viromes as determined by the indicated BLAST comparison to viral Refseq (E-value < 10−3).
| Virome | MetaVir | Total Sequenced | Viral | Relative Abundance (%) | |||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| dsDNA Viruses, No RNA Stage | RNA | ssDNA | Satellite | Virophage | |||||||||||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
| Others |
|
|
|
|
| Others | |||||||
| Antar Lake Spring | 10 | 41,322 | 20.51 | 0 | 0 | 0.05 | 0.06 | 0.22 | 0 | 0 | 0 | 0 | 0.01 | 0 | 0 | 0.03 | 0.14 | 22.00 | 0.27 | 0.03 | 3.45 | 0.22 | 23.39 | 6.82 | 0 |
| Antar Lake Summer | 11 | 38,475 | 22.07 | 0 | 0.01 | 0.32 | 0.30 | 0.72 | 0.03 | 0.01 | 0.38 | 0 | 1.57 | 0.02 | 0.09 | 0.20 | 0.40 | 61.96 | 0.49 | 0.11 | 1.31 | 0.93 | 15.52 | 3.20 | 0.19 |
| Lake Pavin | 6 | 649,290 | 25.04 | 0 | 0 | 1.60 | 1.29 | 6.73 | 0.01 | 0.01 | 0.01 | 0 | 0.06 | 0.01 | 0.02 | 0.85 | 28.04 | 6.02 | 0.07 | 0.12 | 1.79 | 0.06 | 17.07 | 0.93 | 0.25 |
| Lake Bourget | 7 | 593,084 | 37.46 | 0.01 | 0 | 2.30 | 2.98 | 11.56 | 0.02 | 0.01 | 0.01 | 0 | 0.14 | 0.02 | 0.06 | 1.86 | 0.28 | 1.77 | 0.03 | 0.01 | 60.35 | 0.32 | 1.99 | 0.36 | 0.20 |
| Tilapia Channel | 33 | 264,844 | 9.18 | 0.03 | 0.02 | 5.40 | 11.42 | 11.84 | 0.04 | 0.03 | 0.04 | 0 | 0.09 | 0.06 | 0 | 2.77 | 0.02 | 19.89 | 13.46 | 0.42 | 8.26 | 0.16 | 12.31 | 3.80 | 0.03 |
| Lough Neagh | 4925 | 2,295,055 | 24.65 | 0.14 | 0.08 | 10.96 | 32.59 | 36.71 | 0.11 | 0.08 | 0.02 | 0.06 | 0.38 | 0.05 | 0.31 | 16.11 | 0.32 | 0.01 | 0.03 | 0.16 | 0.04 | 0 | 0.39 | 0 | 0.13 |
| Jiulong River Estuary | 6305 | 498,957 | 31.40 | 0.05 | 0.06 | 14.67 | 32.36 | 34.80 | 0.12 | 0.04 | 0.11 | 0.02 | 0.79 | 0.16 | 0.49 | 13.58 | 0.27 | 0 | 0.01 | 0.11 | 0.01 | 0.01 | 0.22 | 0.31 | 0.14 |
| GS117 | 1479 | 480,375 | 38.80 | 0.04 | 0.03 | 16.50 | 35.59 | 27.05 | 0.06 | 0.04 | 0.06 | 0.01 | 0.53 | 0.06 | 0.44 | 17.59 | 0.17 | 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | 0.07 | 0.32 | 0.28 |
| M1CS | 1440 | 303,519 | 32.93 | 0.05 | 0.03 | 10.92 | 28.08 | 48.75 | 0.09 | 0.06 | 0.06 | 0.02 | 0.47 | 0.07 | 0.15 | 9.85 | 0.11 | 0.03 | 0.01 | 0.01 | 0 | 0.04 | 0.11 | 0.29 | 0.11 |
| Dunk Island | 1357 | 1,165,256 | 3.61 | 0.01 | 0.02 | 12.61 | 40.74 | 21.80 | 0.19 | 0.04 | 0.04 | 0.06 | 0.36 | 0.04 | 0.15 | 22.43 | 0.09 | 0 | 0.05 | 0.02 | 0 | 0 | 0.07 | 0.53 | 0.20 |
| Fitzroy Island | 1358 | 82,739 | 32.66 | 0.02 | 0.05 | 13.33 | 48.00 | 17.30 | 0.04 | 0.05 | 0.05 | 0.06 | 0.44 | 0.12 | 0.01 | 19.36 | 0.20 | 0 | 0.04 | 0.04 | 0 | 0 | 0.09 | 0 | 0.29 |
| LA26S | 1396 | 165,256 | 28.45 | 0.04 | 0.17 | 15.51 | 44.05 | 24.32 | 0.08 | 0.10 | 0.11 | 0.02 | 1.17 | 0.27 | 0.27 | 11.76 | 0.10 | 0 | 0.02 | 0.07 | 0 | 0 | 0.21 | 0.76 | 0.19 |
| D39s | -- | 17,667,185 | 11.97 | 0.09 | 0.06 | 28.33 | 41.90 | 11.92 | 0.03 | 0.33 | 0.16 | 0 | 3.57 | 0.05 | 0.14 | 7.04 | 0.04 | 0.01 | 0 | 0.07 | 0 | 0 | 0.10 | 0 | 2.00 |
| DA4s | -- | 19,101,812 | 5.09 | 0.06 | 0.22 | 22.91 | 48.68 | 13.37 | 0.05 | 0.41 | 0.22 | 0 | 2.21 | 0.06 | 0.22 | 6.33 | 0.10 | 0.11 | 0 | 0.10 | 0.10 | 0 | 0.07 | 0 | 0.58 |
| DA4b | -- | 18,192,635 | 13.87 | 0.10 | 0.05 | 29.46 | 42.14 | 14.08 | 0.02 | 0.26 | 0.10 | 0 | 1.32 | 0.01 | 1.13 | 6.44 | 0.05 | 0.01 | 0 | 0.09 | 0.04 | 0 | 0.07 | 0 | 0.19 |
| Sargasso Sea | 12 | 397,939 | 11.80 | 0.01 | 0 | 2.20 | 6.32 | 2.48 | 0 | 0.02 | 0.02 | 0 | 0.08 | 0.04 | 0 | 2.88 | 0.01 | 14.21 | 0 | 0 | 43.69 | 0.03 | 24.83 | 0.60 | 0.01 |
| Arctic Vir | 15 | 686,209 | 1.30 | 0.04 | 0.04 | 9.91 | 9.63 | 23.14 | 0.54 | 0.08 | 0.25 | 0.05 | 1.24 | 1.37 | 0.76 | 3.19 | 0.99 | 0.64 | 0.04 | 5.18 | 0 | 0 | 1.88 | 16.07 | 0.06 |
| Arctic Ocean | 1158 | 79,646 | 14.15 | 0 | 0 | 0.12 | 0.12 | 0.39 | 0 | 0 | 0.01 | 0 | 0.03 | 0.01 | 0.02 | 0.09 | 0.32 | 63.14 | 0.09 | 0.16 | 2.17 | 0.19 | 13.93 | 4.25 | 0 |
| Black Sea | 1155 | 78436 | 16.47 | 0 | 0 | 0.05 | 0.03 | 0.08 | 0 | 0 | 0 | 0 | 0.02 | 0 | 0.01 | 0.02 | 0.12 | 56.33 | 0.60 | 0.01 | 0.88 | 0.17 | 14.24 | 5.79 | 0 |
| Med Sea | 1161 | 65,340 | 16.13 | 0 | 0.01 | 2.29 | 0.82 | 2.49 | 0.04 | 0.05 | 0.02 | 0.01 | 0.16 | 0.04 | 1.58 | 0.35 | 0.29 | 21.05 | 0 | 0 | 59.09 | 0 | 2.96 | 7.51 | 0 |
| OMZst3viral10m | 897 | 128,441 | 15.56 | 0.01 | 0.01 | 2.90 | 4.00 | 2.12 | 0.01 | 0.02 | 0.04 | 0 | 0.40 | 0.03 | 0.06 | 1.07 | 0.08 | 60.79 | 0 | 0.03 | 6.49 | 0.13 | 14.89 | 1.88 | 0.16 |
| Antar hypolith | 2726 | 1,057,535 | 12.74 | 0.01 | 0.01 | 1.31 | 1.47 | 7.15 | 0.13 | 0.01 | 0 | 0.05 | 0.02 | 0.03 | 0.02 | 0.36 | 0.13 | 0.03 | 0 | 0.21 | 88.42 | 0 | 0.25 | 0.09 | 0 |
| Antar open soil | 2727 | 870,687 | 16.99 | 0.08 | 0.05 | 9.25 | 4.96 | 16.74 | 0.29 | 0.05 | 0.03 | 0.06 | 0.24 | 0.05 | 0.09 | 3.10 | 0.43 | 0.01 | 0 | 0.03 | 63.28 | 0 | 0.40 | 0.07 | 0 |
Figure 2Comparison viromes between the SSR area and other environmental viromes depending on BLAST-based comparison. Twenty environmental viromes were available on MetaVir2, obtained from different habitats including freshwater, seawater, deep-sea surface sediments, soil, and hypolith. Bray–Curtis dissimilarity matrices of BLAST hits were calculated from virome data and used to represent the relative distances between individual viromes (stress value 10.7%).
Figure 3The relative abundance of viral sequences (normalized with genome length) mostly at the family level in each different habitat virome. Point size indicates the value of relative abundance percentage.
The percentage of reads in other published viromes obtained from MetaVir with a significant similarity (BLASTN, E-value < 10−3) to the SSR viromes.
| Biome | Virome | MetaVir | Number of Reads | South Scotia Sea | ||
|---|---|---|---|---|---|---|
| D39s | DA4s | DA4b | ||||
| Antarctic seawater | D39s | - | 35,334,370 | 100% | 50.35% | 49.60% |
| Antarctic seawater | DA4s | - | 36,385,270 | 48.90% | 100% | 28.37% |
| Antarctic seawater | DA4b | - | 38,203,624 | 45.88% | 27.02% | 100% |
| Seawater | OMZst3viral10m | 897 | 128,441 | 6.50% | 6.86% | 10.69% |
| Seawater | GS117 | 1479 | 480,375 | 8.61% | 9.09% | 14.57% |
| Arctic seawater | Arctic Vir | 15 | 686,209 | 1.45% | 1.33% | 2.10% |
| Seawater | Sargasso Sea | 12 | 397,939 | 4.68% | 4.97% | 7.69% |
| POV seawater | Dunk Island | 1357 | 1,165,256 | 0.55% | 0.71% | 1.14% |
| POV seawater | Fitzroy Island | 1358 | 82,739 | 7.44% | 9.11% | 14.04% |
| POV seawater | LA26S | 1396 | 165,256 | 16.75% | 14.72% | 21.02% |
| POV seawater | M1CS | 1440 | 303,519 | 14.77% | 15.59% | 23.12% |
| Deep Ocean |
| 1158 | 79,646 | 1.64% | 2.17% | 4.22% |
| Deep Ocean |
| 1155 | 78,436 | 0.57% | 0.62% | 0.69% |
| Deep Ocean |
| 1161 | 65,340 | 0.86% | 0.89% | 2.04% |
| Freshwater | Lake Bourget | 7 | 593,084 | 0.94% | 0.87% | 1.66% |
| Freshwater | Lake Pavin | 6 | 649,290 | 0.25% | 0.23% | 0.44% |
| Antarctic freshwater |
| 10 | 41,322 | 0.07% | 0.06% | 0.16% |
| Antarctic freshwater |
| 11 | 38,475 | 0.41% | 0.43% | 1.87% |
| Freshwater | Lough Neagh | 4925 | 2,295,055 | 0.31% | 0.30% | 0.52% |
| Freshwater | Jiulong River Estuary | 6305 | 498,957 | 5.74% | 6.06% | 9.61% |
| Freshwater |
| 33 | 264,844 | 0.14% | 0.15% | 0.31% |
| Antarctic soil |
| 2727 | 870,687 | 0.43% | 0.42% | 0.59% |
| Antarctic hypolith |
| 2726 | 1,057,535 | 0.26% | 0.42% | 0.36% |
Figure 4Composition of predicted functional genes of the SSR contigs. The Coding Sequences (CDSs) were compared with the SEED database using subsystems in MG-RAST. The metabolic categorization is based on the sequences Best BLAST Hits in the SEED database curated subsystems (E-value < 10−5).
Figure 5Terminase phylogeny. A maximum-likelihood phylogenetic tree of Caudovirales terminase large-subunit domains (PF03237) is shown (1000 iterations, JTT + G model). Only bootstrap values >50% are indicated at the nodes of the tree, and bootstrap scores greater than 90% are indicated with a black dot. Average branch length distance of leaves less than 0.4 were collapsed and are shown as triangles. Reference sequences are marked (see color legend at the top). Abbreviations are as follows. HMO-2011: Puniceispirillum phage HMO-2011; HTVC010P: Pelagibacter phage HTVC010P; RM378: Rhodothermus phage RM378; P-RSM6: Cyanophage P-RSM6. The black dotted line, solid line, and bright blue line indicate the sequences obtained from this study.
Figure 6Capsid_NCLDV phylogeny. A maximum-likelihood phylogenetic tree drawn from the capsid_NCLDV (PF04451) and six virophage putative major capsid (MCP) protein multiple alignment is shown (1000 iterations, JTT + G model). Bootstrap scores greater than 90% are marked with black dots. Each MCP is associated with an abundance profile (right) that displays the relative abundance of the contig across the three SSR viromes (based on normalized coverage).
Figure 7Overview of genomic synteny and similarities between Pgvv-like group. The complete Pgvv reference is covered by a grey shadow. A color scale for percent identity (nucleic) is shown at the bottom right. The name, percent GC content (GC%), and length for each genome are indicated. Genes are colored according to their functional affiliation. Tlr 6F: Toll-like receptor 6 family; GIY: GIY-YIG family nuclease; MCP: major capsid protein; mCP: minor capsid protein; and Yrec (OLV11): OLV11-like tyrosine recombinase.