| Literature DB >> 26834726 |
Chaowen Gong1, Weijia Zhang1, Xuewen Zhou1, Hongming Wang1, Guowei Sun1, Jinzhou Xiao1, Yingjie Pan1, Shuling Yan2, Yongjie Wang1.
Abstract
Virophages are small double-stranded DNA viruses that are parasites of giant DNA viruses that infect unicellular eukaryotes. Here we identify a novel group of virophages, named Dishui Lake virophages (DSLVs) that were discovered in Dishui Lake (DSL): an artificial freshwater lake in Shanghai, China. Based on PCR and metagenomic analysis, the complete genome of DSLV1 was found to be circular and 28,788 base pairs in length, with a G+C content 43.2%, and 28 predicted open reading frames (ORFs). Fifteen of the DSLV1 ORFs have sequence similarity to known virophages. Two DSLV1 ORFs exhibited sequence similarity to that of prasinoviruses (Phycodnaviridae) and chloroviruses (Phycodnaviridae), respectively, suggesting horizontal gene transfer occurred between these large algal DNA viruses and DSLV1. 46 other virophages-related contigs were also obtained, including six homologous major capsid protein (MCP) gene. Phylogenetic analysis of these MCPs showed that DSLVs are closely related to OLV (Organic Lake virophage) and YSLVs (Yellowstone Lake virophages), especially to YSLV3, except for YSLV7. These results indicate that freshwater ecotopes are the hotbed for discovering novel virophages as well as understanding their diversity and properties.Entities:
Keywords: diversity; freshwater lake; genome; metagenomics; virophage
Year: 2016 PMID: 26834726 PMCID: PMC4722103 DOI: 10.3389/fmicb.2016.00005
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
The sampling sites and time.
| Dishui Lake | 121°55′27.00″ | 30°53′56.00″ | 2013/10 |
| Dazhi River | 121°46′12.78″ | 31°00′39.59″ | 2013/12 |
| Dianshan Lake | 120°54′08.76″ | 31°04′58.55″ | 2014/09 |
| Yangtze River | 121°46′5.16″ | 31°40′2.76″ | 2013/06 |
| Yangshan Harbor | 122°03′48.9″ | 30°37′6.42″ | 2012/11 |
| Gouqi Island | 122°45′57.78″ | 30°42′27.54″ | 2013/06 |
| Xi Lake | 120°09′22.30″ | 30°15′22.38″ | 2014/04 |
| Qiandao Lake | 118°55′36.84″ | 29°35′38.90″ | 2014/04 |
| Xuanwu Lake | 118°47′55.49″ | 32°04′37.53″ | 2013/12 |
| East Lake | 114°25′32.29″ | 30°33′06.00″ | 2014/06 |
Figure 1Map shows the sampling sites in East China. Sampling sites are indicated with names and dark dots. Dishui Lake is highlighted in blue. Numbers on horizontal and vertical axis represent longitude (Lng) and Latitude (Lat), respectively. The area of Shanghai is shaded in gray.
Eight pairs of virophage MCP genes specific primers designed in this study.
| Sputnik | 534 | CTACTACTGCTAGAATTACTGGTGT | ATGCTCCAGAAAGAATACCCTGT |
| Mavirus | 522 | ACACCCCCAGAACTCGATAC | ACAACCTAAACCGCGAGACA |
| ALM | 520 | TCCGAATGAACCGCCAATAGA | GTTTTGCGTTATGGTTCGGC |
| OLV | 479 | AAAGATGGTCCGGCTTCGAG | CTGATGCTAGAGTCGGCACG |
| YSLV1 | 426 | AGCCGTCGCAATAGTTCCAG | GAAGGTGGTTACGCTACCGA |
| YSLV2 | 417 | CACCTTTCGTATTTGGCGACC | AGCGGAAGTCGCTTATTCCT |
| YSLV3 | 627 | CGACCAAGACTTCCAGCCTC | CACAAGTCCCACTGAGTTGC |
| YSLV4 | 513 | CCATTTCTACCGACCCAGCA | GCACACGAGCGCAAATAAGA |
Thermal cycling programs of PCR for virophages.
| Sputnik | 94°C, 4 min | 94°C, 30 s | 54°C, 30 s | 72°C, 45 s | 72°C, 5min |
| Mavirus | |||||
| ALM YSLV1, 2, 3 | |||||
| OLV | 56°C, 30 s | ||||
| YSLV4 | 59°C, 30 s |
Primers for verification of DSLV1 genome.
| 1 | 417 | CTCCTTTTGCGAGGGGAACT | AAGAAACGATGCGAGGTGGT |
| 2 | 892 | CAAATCGCCTAAATGAATGTCCT | CATACCAGTCGCCAGTCCAA |
| 3 | 605 | GGGTAAAACCGCTGGGAGAG | CAGCGGTGTCAAACGCATTA |
| 4 | 948 | TTTCTGTTGCTTACGGGCGA | TGAGTGGAACTTGGAACGCA |
| 5 | 421 | GACCTATCGTCAGGGCAAGG | TAGGAGCGGAAGAAAAGGGG |
| 6 | 581 | ACTTTGGTGAATAGAGCGTTGA | GATAAGGCGTGAGGGTGCTT |
| 7 | 840 | TGCTTATGGCGGACAACCTT | CGGTTTGTGCGTCCAAATCA |
| 8 | 422 | TATACCTGCGTTGGTTGCGT | TAGGTGAGGTAGGTGAGGCA |
Information of the DSL metagenomic data sets.
| 1 | 18.46 | 35,371,138 | 6.50 | 23,505,862 | 6.43 | 23,151,200 |
| 2 | 10.82 | 20,766,464 | 7.59 | 17,580,296 | 7.58 | 17,507,722 |
| 3 | 3.97 | 7,974,858 | 2.53 | 6,239,280 | 2.52 | 6,202,911 |
| 4 | 8.56 | 17,264,916 | 5.96 | 14,037,596 | 5.95 | 13,964,065 |
| 5 | 6.97 | 15,136,922 | 5.61 | 12,272,954 | 5.60 | 12,216,490 |
| 6 | 0.90 | 1,833,418 | 0.59 | 1,407,000 | 0.58 | 1,387,258 |
| Total | 49.68 | 98,347,716 | 28.78 | 75,042,988 | 28.66 | 74,429,646 |
Figure 2Sequence alignment analysis of DSL metagenomic data sets. Red dots represent the recruited reads that shared significant sequence similarity with the virophage genomes. The numbers on the X-axis indicate the position and length (in base pairs) of the virophage genomes. The Y axis shows the percentage of sequence identity shared between the recruited reads and virophage genomic sequences.
Information of the assembled contigs.
| 1 | 1560 | ~28 | YSLV6 putative MCP | YSLV3 ORF19 |
| 2 | 1102 | ~3 | YSLV6 putative mCP | YSLV3 ORF18 |
| 3 | 476 | ~3 | YSLV6 putative MCP | YSLV3 ORF19 |
| 4 | 681 | ~11 | OLV OLV4 | YSLV3 ORF01 |
| 5 | 732 | ~7 | OLV OLV5 | YSLV3 ORF23 |
| 6 | 818 | ~6 | YSLV3 ORF22 | |
| 7 | 485 | ~8 | YSLV6 putative mCP | YSLV3 ORF18 |
| 8 | 680 | ~17 | OLV OLV12 | YSLV3 ORF06 |
| 9 | 563 | ~2.6 | OLV OLV12 | YSLV3 ORF06 |
| 10 | 528 | ~0.6 | YSLV3 ORF13 | |
| 11 | 460 | ~1.6 | YSLV6 putative primase-helicase | YSLV3 ORF11 |
| 12 | 532 | ~3.0 | YSLV6 putative MCP | YSLV3 ORF19 |
| 13 | 473 | ~1.3 | Clostridium perfringens DNA adenine methylase | YSLV3 ORF06 |
| 14 | 435 | ~2.6 | YSLV3 ORF19 | |
| 15 | 420 | ~3.5 | YSLV6 putative MCP | YSLV3 ORF19 |
| 16 | 391 | ~1.7 | OLV OLV4 | YSLV3 ORF01 |
| 17 | 723 | ~2.2 | YSLV3 ORF22 | |
| 18 | 639 | ~13.3 | YSLV5 ORF15 | YSLV3 ORF09 |
| 19 | 461 | ~1 | YSLV5 ORF11 | YSLV3 ORF03 |
| 20 | 445 | ~3.2 | YSLV5 putative cysteine protease | YSLV3 ORF05 |
| 21 | 440 | ~3.4 | YSLV6 putative mCP | YSLV3 ORF18 |
| 22 | 416 | ~3.6 | YSLV3 ORF18 | |
| 23 | 375 | ~3.5 | YSLV6 putative MCP | YSLV3 ORF19 |
| 24 | 316 | ~2.9 | YSLV5 putative MCP | YSLV3 ORF19 |
| 25 | 520 | ~1.2 | APMV helicase III/VV D5-type ATPase C-terminus | YSLV3 ORF11 |
| 26 | 390 | 0.39 | YSLV5 ORF15 | YSLV3 ORF09 |
| 27 | 302 | ~1.1 | OLV OLV4 | YSLV3 ORF01 |
| 28 | 236 | 0.7 | YSLV3 ORF12 | |
| 29 | 187 | ~2.4 | YSLV3 ORF13 | |
| 30 | 445 | 0.8 | YSLV6 putative packaging ATPase | YSLV3 ORF01 |
| 31 | 441 | 0.44 | YSLV6 putative MCP | YSLV3 ORF19 |
| 32 | 436 | ~2.7 | YSLV3 ORF16 | |
| 33 | 434 | 0.43 | OLV OLV5 | YSLV3 ORF23 |
| 34 | 434 | ~15 | OLV OLV4 | YSLV3 ORF01 |
| 35 | 433 | 0.43 | Hypothetical protein OLV OLV4 | YSLV3 ORF01 |
| 36 | 430 | 0.55 | OLV OLV4 | YSLV3 ORF01 |
| 37 | 429 | 0.85 | YSLV6 putative MCP | YSLV3 ORF19 |
| 38 | 355 | ~3.2 | YSLV3 ORF13 | |
| 39 | 332 | 0.6 | YSLV6 putative MCP | YSLV3 ORF19 |
| 40 | 295 | ~1.2 | Zamilon DNA packaging protein | YSLV3 ORF01 |
| 41 | 292 | ~5.2 | Sputnik V3 | YSLV3 ORF01 |
| 42 | 273 | 0.27 | YSLV6 ORF09 | YSLV3 ORF10 |
| 43 | 263 | 0.26 | YSLV3 ORF12 | |
| 44 | 250 | ~2.6 | YSLV3 ORF19 | |
| 45 | 230 | ~2.1 | YSLV3 ORF22 | |
| 46 | 210 | ~1.3 | YSLV5 putative primase-helicase | YSLV3 ORF11 |
| 47 | 151 | 0.78 | YSLV3 ORF12 |
Figure 3Circular map of the complete genome of DSLV1. Homologous genes shared between DSLV1 and YSLV3 are labeled in light blue; DSLV1 and giant algal viruses in red; DSLV1 and cellular organisms in orange; DSLV1 and PgVV (Phaeocystis globosa virus 16T virophage) in green. ORFans are marked in light gray. The interior blue line represents %G+C skew throughout the genome. Three gene clusters that have conserved synteny with other virophages are highlighted by the hashed rectangles. Red asterisks indicate the five conserved virophage genes. HEL, ATPase, PRO, MCP, and mCP.
Homologous genes present in virophages (modified from Zhou et al., .
| Putative FtsK-HerA family ATPase | 01(261) | 01(254) | 01(256) | 01(254) | 01(255) | 01(313) | 01(252) | 01(299) | 04(256) | 03(245) | 18(245) | 11(334) | 15(310) |
| Putative DNA helicase/primase/polymerase | 04(857) | 11(865) | 04(766) | 10(942) | 11(880) | 31(904) | 03(853) | 25(777) | 13(779) | 09(778) | 02(553) | 01(652) | |
| Putative GIY-YIG endonuclease | 03(81) | 12(167) | 09(225) | 16(113) | 24(129) | 14(114) | 08(81) | 06(165) | |||||
| Hypothetical protein | 06(311) | 09(310) | 10(308) | 04(344) | 14(326) | 15(315) | 11(296) | 08(318) | 11(298) | 10(168) | |||
| Putative cysteine protein | 07(654) | 05(172) | 23(190) | 12(195) | 16(191) | 19(70) | 10(187) | 04(175) | 07(190) | 09(175) | 12(188) | 10(175) | 16(189) |
| Putative major capsid protein | 15(575) | 19(578) | 25(623) | 15(584) | 22(617) | 21(614) | 04(583) | 20(585) | 09(576) | 20(595) | 06(609) | 08(553) | 18(606) |
| Putative minor capsid protein | 14(410) | 18(417) | 27(477), 26(866) | 14(400) | 21(394) | 20(479) | 06(377) | 26(383) | 08(389) | 18(167), 19(218) | 05(376) | 09(296) | 17(303) |
| Hypothetical protein | 03(110) | 28(104) | 20(101) | 26(227) | 11(853) | 18(98) | 02(123) | ||||||
| Hypothetical protein | 02(171) | 07(172) | |||||||||||
| Hypothetical protein | 09(184) | 07(143) | |||||||||||
| Hypothetical protein | 18(204) | 17(196) | 08(122) | ||||||||||
| Hypothetical protein | 28(276) | 23(275) | 21(404) | 34(325) | 06(361) | 29(312) | 05(290) | 21(438) | 07(442) | ||||
| Hypothetical protein | 06(278) | 25(421) | 07(116), 20(311) | 17(134) | 12(347) | ||||||||
| Hypothetical protein | 05(143) | 10(134) | 17(139) | 07(149) | 09(137) | ||||||||
| Hypothetical protein | 23(677) | 21(554) | 10(236) | 20(147) | 12(262) | 14(271) | |||||||
ORFs and their homologs predicted in DSLV1.
| 1 | 1 | 786 | 786 | 261 | Hypothetical protein YSLV3_ORF01 | YSLV3 | 8e−134 | 70 | 254 (1–254) | |
| 2 | 1649 | 783 | 867 | 288 | Hypothetical protein BpV2_168 | 2e−14 | 27 | 186 (37–222) | ||
| 3 | 1876 | 2121 | 246 | 81 | Hypothetical protein YSLV3_ORF12 | YSLV3 | 2e−12 | 39 | 74 (1–74) | |
| 4 | 2328 | 4901 | 2574 | 857 | Putative primase-helicase | YSLV3 | 3e−169 | 37 | 781 (49–829) | |
| Hypothetical protein MVEG_12362 | 1e−28 | 28 | 285 (475–759) | |||||||
| 5 | 4957 | 5388 | 432 | 143 | Hypothetical protein YSLV3_ORF10 | YSLV3 | 1e−19 | 36 | 129 (15–143) | |
| 6 | 5440 | 6375 | 936 | 311 | Hypothetical protein YSLV3_ORF09 | YSLV3 | 2e−123 | 58 | 307 (4–310) | |
| 7 | 6398 | 8362 | 1965 | 654 | Hypothetical protein YSLV3_ORF06 | YSLV3 | 4e−85 | 75 | 171 (265–435) | |
| Hypothetical protein YSLV3_ORF05 | YSLV3 | 3e−62 | 55 | 175 (478–652) | ||||||
| 8 | 9164 | 8493 | 672 | 233 | Hypothetical protein YSLV3_ORF13 | YSLV3 | 1e−69 | 50 | 210 (11–220) | |
| 9 | 9222 | 10679 | 1458 | 495 | Hypothetical protein YSLV3_ORF14 | YSLV3 | 7e−33 | 50 | 152 (1–152) | |
| Collagen triple helix repeat-containing protein | 4e−21 | 67 | 90 (265–354) | |||||||
| 10 | 10730 | 11299 | 570 | 189 | ||||||
| 11 | 11340 | 12761 | 1422 | 473 | ||||||
| 12 | 12807 | 14093 | 1287 | 428 | Hypothetical protein YSLV3_ORF16 | YSLV3 | 2e−25 | 37 | 204 (79–282) | |
| 2e−20 | 43 | 143 (1–143) | ||||||||
| 13 | 14150 | 14464 | 315 | 104 | Hypothetical protein YSLV3_ORF17 | YSLV3 | 3e−26 | 59 | 93 (12–104) | |
| 14 | 14858 | 16090 | 1233 | 410 | Putative minor capsid protein | YSLV3 | 1e−142 | 55 | 408 (2–409) | |
| 15 | 16147 | 17874 | 1728 | 575 | Putative major capsid protein | YSLV3 | 0 | 68 | 538 (1–538) | |
| 16 | 18575 | 18138 | 438 | 145 | Cysteine desulfurase | 1.4 | 26 | 104 (6–109) | ||
| 17 | 19079 | 19321 | 243 | 80 | ||||||
| 18 | 19886 | 19296 | 591 | 196 | ||||||
| 19 | 20100 | 20450 | 351 | 116 | ||||||
| 20 | 21490 | 20447 | 1044 | 347 | Serine protease | 0.010 | 29 | 131 (152–282) | ||
| 21 | 23639 | 21303 | 2337 | 778 | Hypothetical protein YSLV3_ORF22 | YSLV3 | 3e−40 | 37 | 246 (1–246) | |
| 1e−34 | 33 | 375 (402–776) | ||||||||
| 22 | 23704 | 23931 | 228 | 75 | ||||||
| 23 | 26100 | 24067 | 2034 | 677 | Hypothetical protein YSLV3_ORF21 | YSLV3 | 7e−57 | 33 | 650 (1–650) | |
| 9e−06 | 94 | 18 (657–674) | ||||||||
| 24 | 26396 | 26205 | 192 | 63 | ||||||
| 25 | 27017 | 26496 | 522 | 173 | Hypothetical protein NY2A_B677R | 2e−31 | 45 | 116 (58–173) | ||
| 26 | 27061 | 27339 | 279 | 92 | ||||||
| 27 | 27249 | 27860 | 612 | 203 | Hypothetical protein PGVV_00006 | 8e−05 | 33 | 85 (112–196) | ||
| Stress response protein NST1 | 5e−05 | 30 | 104 (88–191) | |||||||
| 28 | 27918 | 28748 | 831 | 276 | Hypothetical protein YSLV3_ORF23 | YSLV3 | 3e−123 | 63 | 275 (1–275) | |
Figure 4Whole genome alignment of DSLV1 and YSLV3. Five conserved genomic regions, shared between DSLV1 and YSLV3, are displayed with rectangles of different sizes and colors.
Figure 5Phylogenetic trees showing the relationship of DSLVs to other virophages. (A) A phylogenetic tree was reconstructed based on amino acid sequences of three conserved genes: MCP, Pro, and ATPase to compare DSLV1 to other known virophages. (B) Phylogenetic analysis of virophage MCP proteins, including all seven DSLV sequences identified in this study. Bootstrap values are indicated on each branch (100 iterations). DSLVs and YSLV3 are shaded in light blue. DSLVs and the closely related YSLVs and OLV are highlighted in blue. DSLVs are shown in bold. YSLV, Yellowstone Lake virophage; DSLV, Dishui Lake virophage; ALM, Ace Lake Mavirus; OLV, Organic Lake virophage. The accession numbers of the DSLV MCP gene sequences are as follows: KU245924 (MCP1), KU245925 (MCP2), KU245926 (MCP3), KU245927 (MCP4), KU245928 (MCP5), and KU245929 (MCP6).