Literature DB >> 30533722

Draft Genome Sequence of Escherichia coli Phage CMSTMSU, Isolated from Shrimp Farm Effluent Water.

Lelin Chinnadurai¹, Thirumalaikumar Eswaramoorthy¹, Abinaya Paramachandran¹, Sayan Paul², Rashmi Rathy², Arun Arumugaperumal², Sudhakar Sivasubramaniam², Citarasu Thavasimuthu¹.

Abstract

The Escherichia coli phage CMSTMSU was isolated from shrimp farm effluent water in Ramanathapuram, India. The phage exhibited lytic activity against both E. coli and the fish pathogen Pseudomonas aeruginosa. Here we report the draft genome sequence, assembly, and annotation of the isolated CMSTMSU phage. This genome resource can be used to utilize the phage as a crucial biocontrol agent in the fish aquaculture sector.

Entities: Chemical Disease Species

Year: 2018 PMID： 30533722 PMCID： PMC6256653 DOI： 10.1128/MRA.01034-18

Source DB: PubMed Journal: Microbiol Resour Announc ISSN： 2576-098X

ANNOUNCEMENT

Bacteriophages are viruses that infect bacteria. They are abundant in natural systems and are considered crucial factors in controlling bacterial populations (1). Phages also have the potential to regulate bacterial diseases of fish in aquatic environments by removing the fish pathogens (2). This study reports the genome sequence, assembly, and annotation of the Escherichia coli phage CMSTMSU. The phage was isolated from a wastewater sample obtained from a shrimp farm located in Ramanathapuram, India. It was detected with the soft agar overlay method using log-phase E. coli cells as the host. The isolated CMSTMSU phage also exhibited lytic activity against the fish pathogen Pseudomonas aeruginosa. The E. coli phage CMSTMSU was purified following the protocol reported by Mullan (see https://www.dairyscience.info/index.php/isolation-and-purification-of-bacteriophages.html). Then, the genomic DNA was extracted with the phenol-chloroform extraction method (3). The DNA library was prepared with the NEBNext Ultra II DNA library prep kit (New England Biolabs, USA). The whole-genome sequencing was performed with MinION Mk1b (Oxford Nanopore Technologies, UK) using the SpotON flow cell (FLO-MIN106) (4), and base calling was performed with Albacore version 2.1.3 at Genotypic Technology Pvt Ltd (Bangalore, India). We obtained 88,676 reads from the bar-coded library with the Nanopore sequencer with an average read length of 3.4 kb and an N50 length of 6,531 bp. The quality of the reads was analyzed with FastQC software version 0.11.5 (5). The base-called raw reads were used for de novo assembly with the Canu algorithm (6). The Canu assembly generated a single contig of 386.4 kb, which has a GC content of 35.6%. The contig underwent a BLAST search against the NCBI virus nonredundant (nr) database with the BLASTN algorithm with an E value threshold of 1E-5, and we found that it has an 83% sequence similarity with Escherichia phages PBECO 4, vB_Eco_slurp01, and 121Q. The draft genome of E. coli phage CMSTMSU was annotated with the RAST annotation server version 2.0 (http://rast.nmpdr.org) (7), GeneMarkS version 4.28 (http://exon.gatech.edu/GeneMark/genemarks.cgi) (8), and GLIMMER version 3.02 (https://ccb.jhu.edu/software/glimmer/) (9) gene prediction tools. The data obtained from the RAST annotation identified 767 protein-coding genes, and among them, 715 (91%) genes were identified from a BLAST search against the NCBI virus database with the BLASTP algorithm. The gene ontology (GO) and KEGG pathway annotations of the protein-coding genes were performed with the Blast2GO (https://www.blast2go.com/) functional annotation software (10). Of the 715 BLAST-annotated genes, 190 genes were assigned to 423 GO terms with ATP binding (45 genes) and nucleic acid phosphodiester bond hydrolysis (32 genes), and these were the most highly represented GO terms in the data set. We mapped 117 genes with 12 KEGG metabolic pathways, among which the pathways associated with purine metabolism (37 genes) and pyrimidine metabolism (26 genes) were the most dominant in the genome data set. Simultaneously, the annotations with the GeneMarkS and GLIMMER gene prediction tools predicted 891 and 938 protein-coding genes, respectively. Among these predicted genes, 599 genes were common to all three databases, whereas 115, 12, and 115 genes showed an overlap between RAST and GLIMMER, RAST and GeneMarkS, and GeneMarkS and GLIMMER, respectively. In addition, we identified 6 tRNA genes with a GC content range from 48.6% to 58.4% with the ARAGORN version 1.2.38 program (11). This genome draft sequence can be used as a potential resource to utilize the phage species as a biocontrol agent of antibiotics against fish pathogens.

Data availability.

The raw sequence reads have been submitted to the NCBI SRA under the accession number SRP158495, and the draft genome sequence of Escherichia coli phage CMSTMSU has been deposited in NCBI GenBank under the accession number MH494197.

8 in total

1. GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.

Authors: J Besemer; A Lomsadze; M Borodovsky
Journal: Nucleic Acids Res Date: 2001-06-15 Impact factor: 16.971

2. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences.

Authors: Dean Laslett; Bjorn Canback
Journal: Nucleic Acids Res Date: 2004-01-02 Impact factor: 16.971

3. Improved microbial gene identification with GLIMMER.

Authors: A L Delcher; D Harmon; S Kasif; O White; S L Salzberg
Journal: Nucleic Acids Res Date: 1999-12-01 Impact factor: 16.971

4. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research.

Authors: Ana Conesa; Stefan Götz; Juan Miguel García-Gómez; Javier Terol; Manuel Talón; Montserrat Robles
Journal: Bioinformatics Date: 2005-08-04 Impact factor: 6.937

5. Bacteriophages with potential for inactivation of fish pathogenic bacteria: survival, host specificity and effect on bacterial community structure.

Authors: Carla Pereira; Yolanda J Silva; Ana L Santos; Ângela Cunha; Newton C M Gomes; Adelaide Almeida
Journal: Mar Drugs Date: 2011-11-07 Impact factor: 6.085

6. Real-time selective sequencing using nanopore technology.

Authors: Matthew Loose; Sunir Malla; Michael Stout
Journal: Nat Methods Date: 2016-07-25 Impact factor: 28.547

7. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.

Authors: Sergey Koren; Brian P Walenz; Konstantin Berlin; Jason R Miller; Nicholas H Bergman; Adam M Phillippy
Journal: Genome Res Date: 2017-03-15 Impact factor: 9.043

8. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST).

Authors: Ross Overbeek; Robert Olson; Gordon D Pusch; Gary J Olsen; James J Davis; Terry Disz; Robert A Edwards; Svetlana Gerdes; Bruce Parrello; Maulik Shukla; Veronika Vonstein; Alice R Wattam; Fangfang Xia; Rick Stevens
Journal: Nucleic Acids Res Date: 2013-11-29 Impact factor: 16.971

8 in total

2 in total

1. Genome Sequence and Characterization of Coliphage vB_Eco_SLUR29.

Authors: Ibrahim Besler; Pavelas Sazinas; Christian Harrison; Lucy Gannon; Tamsin Redgwell; Slawomir Michniewski; Steven P Hooton; Jon L Hobman; Andrew Millard
Journal: Phage (New Rochelle) Date: 2020-02-25

2. Whole-genome sequence analysis of environmental Escherichia coli from the faeces of straw-necked ibis (Threskiornis spinicollis) nesting on inland wetlands.

Authors: Ethan R Wyrsch; Piklu Roy Chowdhury; Louise Wallis; Max L Cummins; Tiziana Zingali; Kate J Brandis; Steven P Djordjevic
Journal: Microb Genom Date: 2020-06-10

2 in total