| Literature DB >> 33051444 |
Kira Moon1, Suhyun Kim1, Ilnam Kang2, Jang-Cheon Cho3.
Abstract
A high number of viral metagenomes have revealed countless genomes of putative bacteriophages that have not yet been identified due to limitations in bacteriophage cultures. However, most virome studies have been focused on marine or gut environments, thereby leaving the viral community structure of freshwater lakes unclear. Because the lakes located around the globe have independent ecosystems with unique characteristics, viral community structures are also distinctive but comparable. Here, we present data on viral metagenomes that were seasonally collected at a depth of 1 m from Lake Soyang, the largest freshwater reservoir in South Korea. Through shotgun metagenome sequencing using the Illumina MiSeq platform, 3.08 to 5.54-Gbps of reads per virome were obtained. To predict the viral genome sequences within Lake Soyang, contigs were constructed and 648 to 1,004 putative viral contigs were obtained per sample. We expect that both viral metagenome reads and viral contigs would contribute in comparing and understanding of viral communities among different freshwater lakes depending on seasonal changes.Entities:
Mesh:
Year: 2020 PMID: 33051444 PMCID: PMC7553992 DOI: 10.1038/s41597-020-00695-9
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1A map depicting the sampling site in Lake Soyang and an overview of the metagenome preparation. The red dot represents the sampling site.
Sequencing information of viral metagenomes from Lake Soyang.
| Sample | Accession no. | Base pair (Gbp) | % of SSU rRNAa | % of LSU rRNAa | % of Bacterial markersa | Scorea |
|---|---|---|---|---|---|---|
| ′14 Oct. | ERR2814725 | 3.20 | 0.0013 | 0.0192 | 0.0195 | 6.88079 |
| ′15 Jan. | ERR2814726 | 3.08 | 0.0004 | 0.0161 | 0.0074 | 8.23119 |
| ′15 Sept. | ERR2814753 | 5.54 | 0.0017 | 0.0330 | 0.0079 | 3.99835 |
| ′15 Nov. | ERR2814752 | 3.69 | 0.0040 | 0.0212 | 0.0440 | 6.23944 |
| ′16 Feb. | ERR2814750 | 3.25 | 0.0031 | 0.0159 | 0.0146 | 8.29454 |
| ′16 May | ERR2814751 | 3.18 | 0.0091 | 0.0047 | 0.0957 | 2.95373 |
aThe degree of bacterial gene contamination, as determined by the ratio of bacterial marker genes, was calculated using the ViromeQC program[28].
Fig. 2Taxonomic annotation of virome reads collected from Lake Soyang. The taxonomic prediction of virome reads is shown in the domain level (a). Only the virome reads that were able to be taxonomically classified by MG-RAST using the NCBI RefSeq database are shown here. The “others” shown here means reads that had a significant hit in RefSeq database but could not be assigned to a specific taxon. The reads that were annotated as viruses in (a) were further shown in family levels in (b).
Number of virome contigs assembled from Lake Soyang virome reads.
| Sample | IMG Accession no. | Assembled contigs | N50 (bp) | Assembled total bases | Length of longest contigs | Contigs ≥ 10-kb | Viral contigsa |
|---|---|---|---|---|---|---|---|
| ′14 Oct. | 3300007735 | 78,169 | 1,950 | 23,735,463 | 213,274 | 1,027 | 867 |
| ′15 Jan. | 3300007734 | 89,763 | 1,577 | 22,031,041 | 176,311 | 983 | 844 |
| ′15 Sept. | 3300011113 | 121,633 | 1,324 | 19,395,483 | 334,901 | 835 | 648 |
| ′15 Nov. | 3300011116 | 214,755 | 1,084 | 30,660,637 | 334,837 | 1,352 | 1,004 |
| ′16 Feb. | 3300011114 | 164,680 | 1,071 | 22,677,118 | 125,970 | 1,112 | 935 |
| ′16 May | 3300011115 | 140,964 | 1,266 | 24,544,035 | 215,674 | 1,171 | 867 |
aThe number of viral and prophage contigs were determined using the VirSorter program[22].
Physicochemical features of Lake Soyang water samples.
| Sample | Temp.(°C)a | Salinity (%)a | DO (mg/L)a | pHa | PO43− (mg/L)b | SiO2 (mg/L)b | NH3-N (mg/L)b | NO2–N (mg/L)b | NO3–N (mg/L)b |
|---|---|---|---|---|---|---|---|---|---|
| ′14 Oct.c | 19.49 | 0.00 | 8.49 | 6.18 | 0.0100 | 2.372 | 0.0100 | 0.0070 | 1.900 |
| ′15 Jan.c | 5.56 | 0.04 | 6.07 | 6.89 | 0.0100 | 2.614 | 0.0000 | 0.0060 | 1.800 |
| ′15 Sept.d | 25.64 | 0.05 | 8.29 | 8.43 | NDf | 1.5241 | 0.0337 | 0.0195 | 1.5331 |
| ′15 Nov.d | 16.55 | 0.04 | 6.92 | 7.88 | NDf | 0.8486 | 0.0267 | 0.0024 | 1.6485 |
| ′16 Feb.d | 4.97 | 0.15 | 7.54 | 7.42 | 0.0009 | 1.0927 | 0.0088 | 0.0014 | 1.5802 |
| ′16 Mayd | 14.01 | 0.06 | NAe | 7.02 | NDf | 2.2380 | 0.0167 | 0.0091 | 1.4776 |
aThe physical measurements of the water samples were measured and recorded on site using the YSI 556 MPS instrument.
bThe physicochemical values were measured in laboratory setting using either HACH spectrophotometer instrument or QuAAtro microflow analyzer.
cThe physicochemical values for these samples were measured using the HACH spectrophotometer instruments.
dThe physicochemical values for these samples were measured using the QuAAtro microflow analyzer.
eNot available.
fNot detected.
The Q scores of raw virome read collected from Lake Soyang.
| Sample | Base pair (Gbp) | Q20 (Gbp) | Q20 (%) | Q30 (Gbp) | Q30 (%) | GC content (%) |
|---|---|---|---|---|---|---|
| ′14 Oct. | 3.20 | 2.67 | 83.45 | 2.28 | 71.00 | 48.38 |
| ′15 Jan. | 3.08 | 2.52 | 81.64 | 2.12 | 68.64 | 49.24 |
| ′15 Sept. | 5.54 | 4.53 | 81.82 | 3.82 | 68.99 | 44.71 |
| ′15 Nov. | 3.69 | 3.10 | 83.94 | 2.72 | 73.78 | 46.85 |
| ′16 Feb. | 3.25 | 2.68 | 82.60 | 2.34 | 72.18 | 49.04 |
| ′16 May | 3.18 | 2.63 | 82.88 | 2.31 | 72.53 | 47.96 |
| Measurement(s) | Metagenome • DNA viral genome |
| Technology Type(s) | whole genome sequencing |
| Factor Type(s) | season |
| Sample Characteristic - Organism | unclassified bacterial viruses |
| Sample Characteristic - Environment | oligotrophic lake • freshwater lake biome |
| Sample Characteristic - Location | South Korea |