Literature DB >> 32409538

Genome Assembly of Azotobacter chroococcum Strain W5, a Free-Living Diazotroph Isolated from India.

Annapurna Kannepalli1, Kedharnath Reddy Pengani2, Jeshima Khan Yasin3, Sangeeta Paul2, Swati Tyagi2, Govindasamy Venkadasamy2, Meenakshi Sharma2, Swarnalakshmi Karivaradharajan2.   

Abstract

Azotobacter chroococcum strain W5 (MTCC 25045) is an effective diazotrophic bacterium with plant growth-promoting traits. Here, we report the draft genome assembly of this biologically and agronomically evaluated A. chroococcum strain. The genome assembly in 55 contigs is 4,617,864 bp long, with a G+C content of 66.83%.
Copyright © 2020 Kannepalli et al.

Entities:  

Year:  2020        PMID: 32409538      PMCID: PMC7225537          DOI: 10.1128/MRA.00259-20

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Azotobacter spp. are Gram-negative, free-living, aerobic, soil-dwelling, oval or spherical bacteria that form thick-walled cysts (as a means of asexual reproduction under favorable conditions). There are seven known species of the genus. A Gram-negative, aerobic, nitrogen-fixing bacterium was isolated from the rhizosphere of wheat (Triticum aestivum L.) grown at Indian Agricultural Research Institute (IARI) research farms and identified as Azotobacter chroococcum based on 16S rRNA gene sequencing (GenBank accession no. MT299751). The bacterium grew luxuriously on N2-free mineral medium and produced brown pigment and plant growth-promoting hormones (1). This isolate was named Azotobacter chroococcum strain W5 (MTCC 25045). Inoculation with A. chroococcum strain W5 led to an increase in crop yield (2, 3). These findings prompted further efforts to investigate this bacterium for potential applications in agricultural and pharmaceutical industries. To identify the specific genomic features of this special strain, we attempted whole-genome sequencing. Genomic DNA (gDNA) was extracted as reported earlier (4) from a 72-h culture grown in Luria-Bertani medium that was inoculated with a single isolated colony using a standard isolation protocol as per the manufacturer’s instructions (Qiagen DNA extraction kit). The integrity and quality of the DNA were checked through a fluorescence-based quantitation assay as per the developer’s instructions (Qubit 3.0; Thermo Fisher). An ultrasonicator (Covaris) was used to shear the genomic DNA and to generate 350-bp-long fragments. DNA libraries (NEBNext Ultra DNA library prep kit; Illumina, USA) were generated following the manufacturer’s recommendations. The fragments were end polished, A tailed, and ligated to full-length adapters for Illumina sequencing. In the final preparation step, PCR products were purified (AMPure XP system). Libraries were analyzed for size distribution using a Bioanalyzer (Agilent 2100) (5) and quantified using real-time PCR (CFX96; Bio-Rad). Illumina paired-end sequencing (HiSeq 1500) resulted in 14,040,301 raw reads with a read length of 150 bases, totaling 4.2 GB of data output. The quality of the reads generated was assessed using FastQC v0.11.5 (6) (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Of the total raw reads generated, 14.24% were below a Phred score of 40. Raw reads with N (unidentified base) and Phred scores less than 30 were filtered out using Trimmomatic v0.39 (7, 8) before further analyses with Illumina adapter sequences (ATCACG) (9). The high-quality clean data of 3.6 Gb with 13,178,049 reads were subjected to genome assembly using SPAdes v3.11.1 with default settings (10). Table 1 displays the assembly details of Azotobacter chroococcum strain W5.
TABLE 1

Azotobacter chroococcum strain W5 sequencing and assembly metrics

Genome featureValue(s)
Sequence length (bp)4,617,864
No. of contigs55
G+C content (%)66.83
Shortest contig length (bp)248
Longest contig length (bp)476,631
No. of repeat regions1
Total no. of genes4,335
Total no. of CDSsa 4,273
Total no. of coding genes4,168
No. of tRNAs53
No. of rRNAs5 (3 5S, 1 16S, and 1 23S)
No. of ncRNAs4
No. of CRISPRs1
N50 (bp)257,772
Total no. of pseudogenes105

CDSs, coding DNA sequences.

Azotobacter chroococcum strain W5 sequencing and assembly metrics CDSs, coding DNA sequences. The smallest and largest contig sizes were 248 and 476,631 bases, respectively. Finally, 55 contigs with an N50 value of 257,772 bases were assembled into the genome of 4,617,864 bases, coding 4,335 genes and 105 pseudogenes. Furthermore, 53 tRNAs, 5 rRNAs, and 4 noncoding RNAs (ncRNAs) were predicted by using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v4.11 (11). Overall, the G+C content of the assembly was 66.83%. Siderophore- and bacteriocin-coding genes were predicted by antiSMASH v5.0.0 using default settings (12).

Data availability.

The complete genome sequence of A. chroococcum strain W5 has been deposited in DDBJ/ENA/GenBank under the accession no. JAAPAP000000000 (BioProject no. PRJNA610299, BioSample no. SAMN14292004, and SRA accession no. SRR11237782). The version described in this paper is the first version, JAAPAP010000000.
  9 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  Whole genome shotgun sequence of Bacillus paralicheniformis strain KMS 80, a rhizobacterial endophyte isolated from rice (Oryza sativa L.).

Authors:  Kannepalli Annapurna; Venkadasamy Govindasamy; Meenakshi Sharma; Arpita Ghosh; Surendra K Chikara
Journal:  3 Biotech       Date:  2018-04-20       Impact factor: 2.406

3.  Azotobacter Genomes: The Genome of Azotobacter chroococcum NCIMB 8003 (ATCC 4412).

Authors:  Robert L Robson; Robert Jones; R Moyra Robson; Ariel Schwartz; Toby H Richardson
Journal:  PLoS One       Date:  2015-06-10       Impact factor: 3.240

4.  Draft Genome Assembly of the Entomopathogenic Bacterium Photorhabdus luminescens subsp. sonorensis Caborca.

Authors:  Duy An Duong; Patricia Espinosa-Artiles; Rousel A Orozco; István Molnár; S Patricia Stock
Journal:  Microbiol Resour Announc       Date:  2019-09-05

5.  antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline.

Authors:  Kai Blin; Simon Shaw; Katharina Steinke; Rasmus Villebro; Nadine Ziemert; Sang Yup Lee; Marnix H Medema; Tilmann Weber
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

6.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

7.  Whole-Genome Sequence and Classification of 11 Endophytic Bacteria from Poison Ivy (Toxicodendron radicans).

Authors:  Phuong N Tran; Nicholas E H Tan; Yin Peng Lee; Han Ming Gan; Steven J Polter; Lucas K Dailey; André O Hudson; Michael A Savka
Journal:  Genome Announc       Date:  2015-11-19

8.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

9.  Draft Genome Sequence of Pseudomonas stutzeri Strain KMS 55, an Endophytic Diazotroph Isolated from Rice Roots.

Authors:  Kannepalli Annapurna; Venkadasamy Govindasamy; Meenakshi Sharma; Yuvika Rajrana; Karivaradharajan Swarnalakshmi; Sangeeta Paul; Arpita Ghosh; Surendra K Chikara
Journal:  Genome Announc       Date:  2017-10-05
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.