Literature DB >> 30229047

Draft genome sequence of Pantoea ananatis strain MHSD5 isolated from surface sterilized leaves of medicinal plant, Pellaea calomelanos obtained in South Africa.

Siphiwe Godfrey Mahlangu1, Mahloro Hope Serepa-Dlamini1.   

Abstract

Pantoea ananatis strain MHSD5 is a bacterial endophyte isolated from the surface sterilized leaves of Pellaea calomelanos, which is a medicinal plant obtained in Limpopo province of South Africa. We present here the draft genome sequence and annotation of P. ananatis strain MHSD5. The genome assembly was 4.6 Mb in size with an N50 of 550,557 bp. A total of 4,350 putative protein coding sequence genes were predicted with PGAAP. This is the first draft genome of a bacterial endophyte symbiotically associated with P. calomelanos. This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession PUEK00000000. The version described in this paper is version PUEK01000000.

Entities:  

Year:  2018        PMID: 30229047      PMCID: PMC6141256          DOI: 10.1016/j.dib.2018.06.039

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Value of the data

The first draft genome of bacterial endophyte symbiotically associated with Pellaea calomelanos. The whole genomic data provided information on genetic components of Pantoea ananatis strain MHSD5 involved in endophytic life style. Pantoea genus comprise species associated with various hosts, thus the genome information will advance genome comparison of various Pantoea species and further provide insights into the biology and evolution of the genus.

Data

The genus Pantoea has diverse species, which have been isolated from several environments such as aquatic and terrestrial environments [1]. Pantoea genus currently has twenty described species which have associations with humans, plants, insects, and animals [1]. Association of Pantoea species with various hosts can be parasitic, mutualistic or commensal [1], [2]. The genus consists of yellow-pigmented, gram-negative, rod-shaped bacteria in the Enterobacteriaceae family. Although most species have been reported to have pathogenic associations with humans, animals and plants [3], [4], few have been reported to be symbiotically associated with plants [5]. As plant endophytes, Pantoea bacteria have been reported to promote plant growth via a variety of mechanisms and produce bioactive compounds with antibiotic activities [6], [7], [8], [9]. Since Pantoea species have associations with different hosts in different environments, there is need for understanding genetic factors which allow this group of bacteria to successfully colonize various hosts. In addition, the availability of various genomes of Pantoea genus will promote whole genome comparison within this group and further our understanding of genetic factors that contribute to Pantoea species thriving in different environments and thus delineating their biology and evolution. We recently isolated Pantoea ananatis strain MHSD5 from surface sterilized leaves of Pellaea calomelanos, a medicinal plant utilized for treatment of chest colds, asthma, headaches, head colds as well as mouth and nasal ulcers [10]. Initial identification of the bacterium was confirmed by sequencing of its 16S rRNA gene, which was deposited in GenBank with accession number MF613651. This is the first study to isolate, identify and report on the draft whole genome sequence of bacterial endophyte associated with P. calomelanos. DNA sequencing was performed with Illumina MiSeq platform which generated 250 Mb data of reads. The genome assembly of P. ananatis strain MHSD5 produced 39 contigs, with N50 of 550,557 base pairs (bp), the largest contig with 1,441,770 bp. The genome of P. ananatis strain MHSD5 was 4,650,216 bp, with GC% content of 54.16%, which falls within the genome sizes (4.5–6.3 Mb) and GC% content (52–55%) ranges of some of the sequenced Pantoea species. Genome annotation was performed with Prokaryotic Genome Automatic Annotation Pipeline (PGAAP) and Rapid Annotation using Subsystem Technology server (RAST) [11], [12]. P. ananatis strain MHSD5 has 4,437 genes, among the identified genes 4,350 are protein coding sequence genes (CDS) and 119 are pseudogenes. The genome also has 3 rRNAs with five operons (5S, 16S and 23S) and 70 tRNAs genes. PGAAP and RAST annotation pipelines resulted in minor differences (Table 1). We registered the annotation results from PGAAP on GenBank. The RAST Subsystem Information (Fig. 1), shows the subsystem feature counts, with 92 genes involved in virulence, disease and defence; and carbohydrates having the most number of genes of 561 and 4 genes of secondary metabolism which are the lowest number of genes. Through the RAST sequence based comparison tool [13], we compared the assembled genomes of P. ananatis strain MHSD5 and Pantoea stewartii DC283 [14], using the latter as a reference genome as shown in Fig. 2(a) and (b). Although there were genome gaps on P. ananatis strain MHSD5 (Fig. 2(a)), it showed 70–99% similarity in protein sequences of both reverse and forward (bidirectional hits) to Pantoea stewartii DC283.
Table 1

Outcome comparison of Pantoea ananatis strain MHSD5 genome annotation using PGAAP and RAST.

Genomic featureaPGAAPRAST
Total number of genes44374397
Protein coding genes43504324
Number of RNAs8773
Contigs3939
N50550,557 bp550,557 bp
GC%54.16%54.2%

Only the PGAAP results were registered with GenBank.

Fig. 1

The subsystem distribution of Pantoea ananatis strain MHSD5 generated from RAST annotation server.

Fig. 2

(a) Pantoea ananatis strain MHSD5 genome compared to P. stewartii DC283, with the latter used as reference genome, (b) colour co-ordination similarity of the genome comparison in percentages. Bidirectional best hit refers to both forward and reverse hits.

Outcome comparison of Pantoea ananatis strain MHSD5 genome annotation using PGAAP and RAST. Only the PGAAP results were registered with GenBank. The subsystem distribution of Pantoea ananatis strain MHSD5 generated from RAST annotation server. (a) Pantoea ananatis strain MHSD5 genome compared to P. stewartii DC283, with the latter used as reference genome, (b) colour co-ordination similarity of the genome comparison in percentages. Bidirectional best hit refers to both forward and reverse hits. Genomics is crucial in identifying genes that are important for the bacterial endophyte to successfully penetrate, colonize and have symbiotic association with its plant host. In accordance to the life style of endophytes, we identified genes involved in nitrogen fixation, production of plant hormone indole acetic acid (IAA), production of antibiotics, toxins as well as toxin-antitoxin genes. Similar genes have been previously reported in other endophytic bacteria such as Azoarcus sp. strain BH72 as well as Gluconacetobacter diazotrophicus Pal5 [15], [16]. Some of the identified toxin and antitoxin genes are similar to Enterobacter sp. 638, a plant growth promoting endophyte which was fully sequenced and annotated [17].

Experimental design, materials and methods

Genome extraction and sequencing

P. ananatis strain MHSD5 was isolated from surface sterilized leaves of P. calomelanos obtained from Botlokwa, Limpopo Province, South Africa (23°29׳34.8"S 29°42׳11.2"E). Whole genomic DNA of Pantoea ananatis strain MHSD5 was extracted from nutrient agar pure colonies using Quick-DNA™ ZR fungal/bacterial DNA MiniPrep kit (Zymo Research, catalogue #D6005) according to manufacturers’ protocol. The extracted DNA was cleaned with ZR fungal/bacterial DNA clean and concentrator-5 (catalogue #D4003). The whole genome sequence was performed at a commercial service provider Inqaba Biotechnological Company PTY (LTD.), Pretoria, South Africa. Briefly, genomic DNA sample was fragmented using an ultrasonication approach (Covaris). The DNA library was prepared using the NEBNext Ultra™ II DNA Library Prep Kit for Illumina (New England BioLabs Inc.) according to the manufacturers’ protocol. The resulting fragments were size selected, end repaired and Illumina specific adapter sequences were ligated to each fragment. Following quantification, the samples were individually indexed and a second size selection step was performed (AMPure XP Bead-based), and sequenced on Illumina MiSeq platform, using a MiSeq v3 (600 cycle) kit. 250 Mb of data (2 × 300 bp long paired end reads) was produced.

Genome quality assessment, de novo assembly and annotation

Quality assessment of raw reads was performed by FastQC version 0.69. The reads were de novo assembled using Unicycler version 0.4.1.1 and the assembly was assessed by Quast version 4.6.3. The assembly generated 39 contigs with an N50 of 550,557 bp and 53× coverage. All the pre-annotation analysis were performed on Galaxy web platform (https://usegalaxy.org) [18]. The NCBI Prokaryotic Genome Automatic Annotation Pipeline (PGAAP) was used to perform initial annotation, which was followed by Rapid Annotation using Subsystem Technology (RAST).
Subject areaBiology
More specific subject areaPlant-microbe interaction, Bacteriology, Genomics, Bioinformatics
Type of dataTable, figure
How data was acquiredGenome sequencing: Illumina MiSeq at Inqaba Biotechnological Company, Pretoria, South Africa,
De novo sequence assembly: Web-based Galaxy Unicycler version 0.4.1.1, Bioinformatics approaches: NCBI Prokaryotic Genome Automatic Annotation Pipeline (PGAAP), Rapid Annotation using Subsystem Technology server (RAST).
Data formatAnalysed
Experimental factorsGenomic sequencing, assembly and annotation
Experimental featuresThe whole genome of Pantoea ananatis strain MHSD5 was sequenced on Illumina MiSeq sequencing platform. Read quality was assessed using Galaxy platform-FastQC version 0.69. De novo assembly of the reads were performed by Galaxy web platform-Unicycler version 0.4.1.1, and quality of assembly was assessed with Quast Genome assembly Quality version 4.6.3 (https://usegalaxy.org).
Data source locationP. ananatis strain MHSD5 was isolated from surface sterilized leaves of P. calomelanos obtained from Botlokwa, Limpopo Province, South Africa (23°29׳34.8"S 29°42׳11.2"E).
Data accessibilityGenome assembly,annotation and analysis of data are found in this article and the raw data together with NCBI PGAAP annotation were deposited at the NCBI repository:
https://www.ncbi.nlm.nih.gov/bioproject/PRJNA434382,
Bioproject ID: 434382, BioSample: SAMN08555277
This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession PUEK00000000. (http://www.ncbi.nlm.nih.gov/nuccore/PUEK00000000)
The genome annotation performed at RAST server are also given in this article.
  16 in total

1.  Rice endophyte Pantoea agglomerans YS19 promotes host plant growth and affects allocations of host photosynthates.

Authors:  Y Feng; D Shen; W Song
Journal:  J Appl Microbiol       Date:  2006-05       Impact factor: 3.772

2.  Complete genome of the mutualistic, N2-fixing grass endophyte Azoarcus sp. strain BH72.

Authors:  Andrea Krause; Adarsh Ramakumar; Daniela Bartels; Federico Battistoni; Thomas Bekel; Jens Boch; Melanie Böhm; Frauke Friedrich; Thomas Hurek; Lutz Krause; Burkhard Linke; Alice C McHardy; Abhijit Sarkar; Susanne Schneiker; Arshad Ali Syed; Rudolf Thauer; Frank-Jörg Vorhölter; Stefan Weidner; Alfred Pühler; Barbara Reinhold-Hurek; Olaf Kaiser; Alexander Goesmann
Journal:  Nat Biotechnol       Date:  2006-10-22       Impact factor: 54.908

Review 3.  Pantoea: insights into a highly versatile and diverse genus within the Enterobacteriaceae.

Authors:  Alyssa M Walterson; John Stavrinides
Journal:  FEMS Microbiol Rev       Date:  2015-06-24       Impact factor: 16.408

4.  Pantoea agglomerans strain EH318 produces two antibiotics that inhibit Erwinia amylovora in vitro.

Authors:  S A Wright; C H Zumoff; L Schneider; S V Beer
Journal:  Appl Environ Microbiol       Date:  2001-01       Impact factor: 4.792

5.  Isolation of Pantoea ananatis from sugarcane and characterization of its potential for plant growth promotion.

Authors:  J F da Silva; R R Barbosa; A N de Souza; O V da Motta; G N Teixeira; V S Carvalho; A L S R de Souza; G A de Souza Filho
Journal:  Genet Mol Res       Date:  2015-11-30

6.  Pantoea alhagi, a novel endophytic bacterium with ability to improve growth and drought tolerance in wheat.

Authors:  Chaoqiong Chen; Kaiyun Xin; Hao Liu; Juanli Cheng; Xihui Shen; Yao Wang; Lei Zhang
Journal:  Sci Rep       Date:  2017-01-27       Impact factor: 4.379

7.  Complete Genome Assembly of Pantoea stewartii subsp. stewartii DC283, a Corn Pathogen.

Authors:  Duy An Duong; Ann M Stevens; Roderick V Jensen
Journal:  Genome Announc       Date:  2017-06-01

8.  The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST).

Authors:  Ross Overbeek; Robert Olson; Gordon D Pusch; Gary J Olsen; James J Davis; Terry Disz; Robert A Edwards; Svetlana Gerdes; Bruce Parrello; Maulik Shukla; Veronika Vonstein; Alice R Wattam; Fangfang Xia; Rick Stevens
Journal:  Nucleic Acids Res       Date:  2013-11-29       Impact factor: 16.971

9.  Analysis of the Pantoea ananatis pan-genome reveals factors underlying its ability to colonize and interact with plant, insect and vertebrate hosts.

Authors:  Pieter De Maayer; Wai Yin Chan; Enrico Rubagotti; Stephanus N Venter; Ian K Toth; Paul R J Birch; Teresa A Coutinho
Journal:  BMC Genomics       Date:  2014-05-27       Impact factor: 3.969

10.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

View more
  1 in total

1.  Bacterial Microbiome in the Phyllo-Endosphere of Highly Specialized Rock Spleenwort.

Authors:  Valerie F Masocha; Hongmei Liu; Pingshan Zhan; Kaikai Wang; Ao Zeng; Sike Shen; Harald Schneider
Journal:  Front Plant Sci       Date:  2022-07-07       Impact factor: 6.627

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.