| Literature DB >> 30229047 |
Siphiwe Godfrey Mahlangu1, Mahloro Hope Serepa-Dlamini1.
Abstract
Pantoea ananatis strain MHSD5 is a bacterial endophyte isolated from the surface sterilized leaves of Pellaea calomelanos, which is a medicinal plant obtained in Limpopo province of South Africa. We present here the draft genome sequence and annotation of P. ananatis strain MHSD5. The genome assembly was 4.6 Mb in size with an N50 of 550,557 bp. A total of 4,350 putative protein coding sequence genes were predicted with PGAAP. This is the first draft genome of a bacterial endophyte symbiotically associated with P. calomelanos. This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession PUEK00000000. The version described in this paper is version PUEK01000000.Entities:
Year: 2018 PMID: 30229047 PMCID: PMC6141256 DOI: 10.1016/j.dib.2018.06.039
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Outcome comparison of Pantoea ananatis strain MHSD5 genome annotation using PGAAP and RAST.
| Total number of genes | 4437 | 4397 |
| Protein coding genes | 4350 | 4324 |
| Number of RNAs | 87 | 73 |
| Contigs | 39 | 39 |
| N50 | 550,557 bp | 550,557 bp |
| GC% | 54.16% | 54.2% |
Only the PGAAP results were registered with GenBank.
Fig. 1The subsystem distribution of Pantoea ananatis strain MHSD5 generated from RAST annotation server.
Fig. 2(a) Pantoea ananatis strain MHSD5 genome compared to P. stewartii DC283, with the latter used as reference genome, (b) colour co-ordination similarity of the genome comparison in percentages. Bidirectional best hit refers to both forward and reverse hits.
| Subject area | Biology |
| More specific subject area | Plant-microbe interaction, Bacteriology, Genomics, Bioinformatics |
| Type of data | Table, figure |
| How data was acquired | Genome sequencing: Illumina MiSeq at Inqaba Biotechnological Company, Pretoria, South Africa, |
| De novo sequence assembly: Web-based Galaxy Unicycler version 0.4.1.1, Bioinformatics approaches: NCBI Prokaryotic Genome Automatic Annotation Pipeline (PGAAP), Rapid Annotation using Subsystem Technology server (RAST). | |
| Data format | Analysed |
| Experimental factors | Genomic sequencing, assembly and annotation |
| Experimental features | The whole genome of |
| Data source location | |
| Data accessibility | Genome assembly,annotation and analysis of data are found in this article and the raw data together with NCBI PGAAP annotation were deposited at the NCBI repository: |
| Bioproject ID: 434382, BioSample: SAMN08555277 | |
| This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession | |
| The genome annotation performed at RAST server are also given in this article. |