Literature DB >> 35678577

Closed, Circular Genome Sequence of Aureococcus anophagefferens Virus, a Lytic Virus of a Brown Tide-Forming Alga.

Alexander R Truchon1, Eric R Gann1, Steven W Wilhelm1.   

Abstract

Here, we report the genomic sequence of Aureococcus anophagefferens virus, assembled into one circular contig from both Nanopore and Illumina reads. The genome is 381,717 bp long with a GC content of 29.1%, which includes an additional 5-kb region between the previously predicted polar ends of the reference genome.

Entities:  

Year:  2022        PMID: 35678577      PMCID: PMC9302175          DOI: 10.1128/mra.00282-22

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

We sequenced the genome of Aureococcus anophagefferens virus (AaV), a member of the family Mimiviridae, within the phylum Nucleocytoviricota. AaV has been propagated on its pelagophyte host, Aureococcus anophagefferens, since its isolation in the early 2000s and was first sequenced and assembled using Illumina reads in 2014 (1–3). Since then, the virus has been maintained via coculture with A. anophagefferens CCMP 1984. The original assembly was predicted to have terminal ends rich in leucine repeat-containing coding sequences (3, 4). As repetitive regions can lead to improper assemblies when exclusively employing short-read sequencing exclusively, we resequenced AaV using short- and long-read sequencing (5). Viral DNA was extracted from a lysed, xenic A. anophagefferens culture as described by Truchon et al. (6). Briefly, virions were concentrated using tangential flow filtration and ultracentrifugation to enrich for virus particles. Particles were digested in agarose CHEF plug molds (Bio-Rad, Hercules, CA, USA) with proteinase K before being run on a low-melting point agarose gel. High-molecular-weight DNA was excised from the gel and purified using a phenol-chloroform method (6). Long-read sequencing was performed using the Oxford Nanopore Technologies (ONT; Oxford, UK) platform. Genomic DNA libraries generated using the ligation sequencing kit (ONT) were sequenced on a MinION R9.4 flow cell (ONT), producing 284,000 reads that averaged 2,503 bp. Bases were called using the Guppy version 3.0.3 base caller using the config file dna_r9.4.1_450bps_fast.cfg (7). Adapter sequences were removed using Porechop version 0.2.4 (8), and the reads were trimmed with a quality score of 9 and a minimum length of 500 bp using NanoFilt version 2.7.1 (9). The reads were aligned to the AaV reference genome using BBMap version 38.90 (10) and used in the assembly, performed using Canu version 2.1 (11). DNA was also extracted for short-read sequencing by treating concentrated virions with proteinase K for 1 h at 37°C and extraction via standard phenol-chloroform methods (12). The DNA library was prepared and sequenced on a NextSeq 2000 instrument (Illumina, San Diego, CA, USA) by the Microbial Genome Sequencing Center, generating 12,378,846 reads in 150-bp paired-end format. The Illumina short reads were trimmed for quality using the default settings in CLC Genomics Workbench (Qiagen, Hilden, Germany) and mapped to the Canu-assembled contig using Bowtie 2 version 2.2.3 (13). The assembly was polished with the Illumina reads using Pilon version 1.23 (14), which generated a closed, circular contig of 381,717 bp. The quality and completeness were assessed using CheckV (15). Coding sequences and tRNAs were predicted using Prodigal version 2.6.3 (16) and tRNAscan-SE version 2.0 (17), respectively. A total of 384 coding sequences (CDS) were predicted. Functions were predicted from translated amino acid sequences using the eggNOG-mapper Web server (18). One novel CDS encodes for a putative DNA-dependent RNA polymerase subunit (Rpb2). Only 1 of the 11 RNA polymerase genes encoded by AaV is homologous to the novel subunit, though the two sequences have an amino acid identity below 30%. This supports the hypothesis that two copies of this gene arose from an ancestral duplication among mimiviruses of eukaryotic phototrophs (19). Among other changes to the genome are apparent duplications, gene elongations, and gene combinations missed during the initial assembly (Fig. 1).
FIG 1

Genomic maps (created in Circos [20]) comparing the original AaV assembly (i.e., 2013) to the updated assembly (i.e., 2022). The complete genome sequences of the two assemblies were aligned, and the coding densities were compared. The rings (from inner to outer) indicate the GC content (2013), minus-strand coding sequences (2013), genomic sequence (2013), plus-strand coding sequences (2013), minus-strand coding sequences (2022), genomic sequence (2022), plus-strand coding sequences, and GC content (2022). Coding sequences present in only one genome are highlighted in yellow, and tRNAs are represented on the genomic sequence (2022) by black bands. Novel and known Rpb2 coding sequences are marked with an asterisk.

Genomic maps (created in Circos [20]) comparing the original AaV assembly (i.e., 2013) to the updated assembly (i.e., 2022). The complete genome sequences of the two assemblies were aligned, and the coding densities were compared. The rings (from inner to outer) indicate the GC content (2013), minus-strand coding sequences (2013), genomic sequence (2013), plus-strand coding sequences (2013), minus-strand coding sequences (2022), genomic sequence (2022), plus-strand coding sequences, and GC content (2022). Coding sequences present in only one genome are highlighted in yellow, and tRNAs are represented on the genomic sequence (2022) by black bands. Novel and known Rpb2 coding sequences are marked with an asterisk.

Data availability.

The raw data and the assembled genome have been indexed at NCBI under the BioProject accession number PRJNA809211. The assembled genome has been assigned the GenBank accession number OM876856.1. The raw MinION and Illumina reads have been archived under the Sequence Read Archive accession numbers SRR16764708 to SRR16764709.
  16 in total

1.  Circos: an information aesthetic for comparative genomics.

Authors:  Martin Krzywinski; Jacqueline Schein; Inanç Birol; Joseph Connors; Randy Gascoyne; Doug Horsman; Steven J Jones; Marco A Marra
Journal:  Genome Res       Date:  2009-06-18       Impact factor: 9.043

2.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

3.  tRNAscan-SE: Searching for tRNA Genes in Genomic Sequences.

Authors:  Patricia P Chan; Todd M Lowe
Journal:  Methods Mol Biol       Date:  2019

4.  Genome of brown tide virus (AaV), the little giant of the Megaviridae, elucidates NCLDV genome expansion and host-virus coevolution.

Authors:  Mohammad Moniruzzaman; Gary R LeCleir; Christopher M Brown; Christopher J Gobler; Kay D Bidle; William H Wilson; Steven W Wilhelm
Journal:  Virology       Date:  2014-07-14       Impact factor: 3.616

5.  ISOLATION OF A NON-PHAGE-LIKE LYTIC VIRUS INFECTING AUREOCOCCUS ANOPHAGEFFERENS(1).

Authors:  Janet M Rowe; John R Dunlap; Christopher J Gobler; O Roger Anderson; Mary D Gastrich; Steven W Wilhelm
Journal:  J Phycol       Date:  2008-02       Impact factor: 2.923

6.  The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community.

Authors:  Miten Jain; Hugh E Olsen; Benedict Paten; Mark Akeson
Journal:  Genome Biol       Date:  2016-11-25       Impact factor: 13.583

7.  Completing bacterial genome assemblies with multiplex MinION sequencing.

Authors:  Ryan R Wick; Louise M Judd; Claire L Gorrie; Kathryn E Holt
Journal:  Microb Genom       Date:  2017-09-14

8.  Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.

Authors:  Sergey Koren; Brian P Walenz; Konstantin Berlin; Jason R Miller; Nicholas H Bergman; Adam M Phillippy
Journal:  Genome Res       Date:  2017-03-15       Impact factor: 9.043

9.  Performance of neural network basecalling tools for Oxford Nanopore sequencing.

Authors:  Ryan R Wick; Louise M Judd; Kathryn E Holt
Journal:  Genome Biol       Date:  2019-06-24       Impact factor: 13.583

10.  NanoPack: visualizing and processing long-read sequencing data.

Authors:  Wouter De Coster; Svenn D'Hert; Darrin T Schultz; Marc Cruts; Christine Van Broeckhoven
Journal:  Bioinformatics       Date:  2018-08-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.