| Literature DB >> 32426431 |
Muhammad Ramziuddin Zakaria1, Ming Quan Lam1, Sye Jinn Chen1, Mohamad Hamizan Abdul Karim1, Lili Tokiman2, Adibah Yahya1, Mohd Shahir Shamsir3, Chun Shiong Chong1.
Abstract
Mangrovimonas sp. strain CR14 is a halophilic bacterium affiliated with family Flavobacteriaceae which was successfully isolated from mangrove soil samples obtained from Tanjung Piai National Park, Johor. The whole genome of strain CR14 was sequenced on an Illumina HiSeq 2500 platform (2 × 150 bp paired end). Herein, we report the genome sequence of Mangrovimonas sp. strain CR14 in which its assembled genome consisted 20 contigs with a total size of 3,590,195 bp, 3209 coding sequences, and an average 36.08% G + C content. Genome annotation and gene mining revealed that this bacterium demonstrated proteolytic activity which could be potentially applied in detergent industry. This whole-genome shotgun data of Mangrovimonas sp. strain CR14 has been deposited at DDBJ/ENA/GenBank under the accession JAAFZY000000000. The version described in this paper is version JAAFZY010000000.Entities:
Keywords: Genome sequence; Illumina; Mangrovimonas; Proteolytic activity
Year: 2020 PMID: 32426431 PMCID: PMC7225383 DOI: 10.1016/j.dib.2020.105658
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
General genome statistics of Mangrovimonas sp. strain CR14.
| Category | Strain CR14 | |
|---|---|---|
| Number | Total percentage (%) | |
| Number of contigs | 20 | – |
| Genome size (bp) | 3590,195 | 100.00 |
| G +C content | 1295,342 | 36.08% |
| Total genes predicted | 3209 | 100.00 |
| Protein coding genes | 3152 | 98.22 |
| Non-coding RNA genes | 46 | 1.43 |
| rRNA genes | ||
| 5S rRNA | 1 | 0.03 |
| 16S rRNA | 1 | 0.03 |
| 23S rRNA | 1 | 0.03 |
| tRNA | 39 | 1.22 |
| ncRNA | 4 | 0.12 |
| Pseudogenes | 11 | 0.34 |
Fig. 1Mangrovimonas sp. strain CR14 positive hydrolysis on skim milk containing agar showing ability of this bacterium to produce extracellular proteolytic enzymes.
| Subject | Biology |
| Specific subject area | Microbiology and genomics |
| Type of data | • Genome sequence data in FASTA format |
| How data were acquired | Whole-genome sequencing using Illumina HiSeq 2500 (2 × 150 bp paired end) platform |
| Data format | Raw and assembled genome sequences |
| Parameters for data collection | Genomic DNA was extracted from a pure culture of |
| Description of data collection | Whole-genome sequencing, assembly and annotation |
| Data source location | |
| Data accessibility | This whole-genome shotgun data of |