Literature DB >> 26566423

High-quality draft genome sequence of Gracilimonas tropica CL-CB462(T) (DSM 19535(T)), isolated from a Synechococcus culture.

Dong Han Choi1, Chisang Ahn2, Gwang Il Jang2, Alla Lapidus3, James Han4, T B K Reddy4, Marcel Huntemann4, Amrita Pati5, Natalia Ivanova4, Victor Markowitz6, Manfred Rohde7, Brian Tindall8, Markus Göker8, Tanja Woyke4, Hans-Peter Klenk9, Nikos C Kyrpides10, Byung Cheol Cho2.   

Abstract

Gracilimonas tropica Choi et al. 2009 is a member of order Sphingobacteriales, class Sphingobacteriia. Three species of the genus Gracilimonas have been isolated from marine seawater or a salt mine and showed extremely halotolerant and mesophilic features, although close relatives are extremely halophilic or thermophilic. The type strain of the type species of Gracilimonas, G. tropica DSM19535(T), was isolated from a Synechococcus culture which was established from the tropical sea-surface water of the Pacific Ocean. The genome of the strain DSM19535(T) was sequenced through the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes project. Here, we describe the genomic features of the strain. The 3,831,242 bp long draft genome consists of 48 contigs with 3373 protein-coding and 53 RNA genes. The strain seems to adapt to phosphate limitation and requires amino acids from external environment. In addition, genomic analyses and pasteurization experiment suggested that G. tropica DSM19535(T) did not form spore.

Entities:  

Keywords:  GEBA; Genome; Gracilimonas tropica; Marine; Sphingobacteriia

Year:  2015        PMID: 26566423      PMCID: PMC4642740          DOI: 10.1186/s40793-015-0088-8

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

The genus was first established in 2009 [1], and at the time of writing this paper there are three species that comprise this genus, [1], [2], and [3]. They are Gram-negative, catalase- and oxidase-positive, aerobic and facultatively anaerobic and have rod-shaped cells (Fig. 1) [1-3]. In addition, they form endospores except [3]. CL-CB462T (=KCCM 90063 = DSM 19535), the type strain of the type species of the genus , was isolated from a culture which was established from the tropical sea-surface water of the Pacific Ocean [1]. Interestingly, the genus formed a robust clade together with extremely halophilic or thermophilic bacteria ( and , respectively). On the contrary, species show only extremely halotolerant and mesophilic features. Considering the phenotypic diversity within the clade, their comparative genomic analyses could provide a good clue to understand bacterial adaptation to extreme environments based on genomic context. Here we present a summary of the genomic features of DSM 19535, which is the first genome-sequenced type strain from the genus .
Fig. 1

Scanning electron microscopy image of Gracilimonas tropica DSM19535T

Scanning electron microscopy image of Gracilimonas tropica DSM19535T

Organism information

Classification and features

Phylogenetic analysis based on 16S rRNA gene sequence comparison revealed DSM19535 is classified into the genus (Fig. 2). The type strains which were most closely related to strain DSM19535 were YIM J14T with 16S rRNA sequence similarity of 96.9 %, and CL-KR2T with a similarity of 96.1 %. Strain DSM19535 is tolerant of high salinity (up to 20 %) with a growth occurring over the range of salinity of 1–20 % (w/v) sea salts (optimum 3–6 %) (Table 1). Growth occurs under either aerobic or facultatively anaerobic conditions. The optimum pH is 7.0–8.0 with a growth range of pH 6–10 (Table 1). The strain was auxotroph for isoleucine and methionine (Table 1). Despite the phylum is known to be as a non-spore forming group [4], the strain was reported to form endospores, together with [3]. However, strain DSM19535 could not be asserted to form spore by the genomic analysis (see ‘Insights from the genome sequence’).
Fig. 2

Neighbour-joining tree, based on 16S rRNA gene sequences, showing the phylogenetic position of G. tropica DSM 19535T. Bootstrap percentages >50 % (based on 1000 resampling) are shown at branching points. Solid circles indicate that the corresponding nodes are also recovered in the maximum-likelihood and maximum-parsimony trees. Prolixibacter bellariivorans F2T was used as an outgroup. Bar, 0.02 nucleotide substitutions per site

Table 1

Classification and general features of G. tropica DSM 19535T [38, 39]

MIGS IDPropertyTermEvidence codea
Current classificationDomain Bacteria TAS [40]
Phylum Bacteriodetes TAS [4]
Class Sphingobacteriia TAS [41]
Order Sphingobacteriales TAS [41]
Genus Gracilimonas TAS [1]
Species Gracilimonas tropica TAS [1]
Type strain CL-CB462T TAS [1]
Gram stainNegativeTAS [1]
Cell shapeRod-shapedTAS [1]
MotilityNon-motileTAS [1]
SporulationNon-sporulationIDA
Temperature range20–40 °CTAS [1]
Optimum temperature35 °CTAS [1]
Energy sourceHeterotrophTAS [1]
Auxotroph forL-isoleucine, L-methionineIDA
Carbon sourceGlucose, fructose, aspartateTAS [1]
MIGS-6HabitatMarine, aquaticTAS [1]
MIGS-6.2pH6–10TAS [1]
MIGS-6.3Salinity1–20 % (optimum: 3–6 %)TAS [1]
MIGS-22Oxygen requirementFacultativeTAS [1]
MIGS-15Biotic relationshipFree livingTAS [1]
MIGS-14PathogenicityUnknownNAS
MIGS-4Geographic locationTropical NW PacificTAS [1]
MIGS-5Sample collection2009TAS [1]
MIGS-4.1LatitudeUnknownNAS
MIGS-4.2LongitudeUnknownNAS
MIGS-4.4Altitude0 mNAS

a Evidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [42]

Neighbour-joining tree, based on 16S rRNA gene sequences, showing the phylogenetic position of G. tropica DSM 19535T. Bootstrap percentages >50 % (based on 1000 resampling) are shown at branching points. Solid circles indicate that the corresponding nodes are also recovered in the maximum-likelihood and maximum-parsimony trees. Prolixibacter bellariivorans F2T was used as an outgroup. Bar, 0.02 nucleotide substitutions per site Classification and general features of G. tropica DSM 19535T [38, 39] a Evidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [42] By phylogenetic analyses (Fig. 2), the genus formed a sister clade with the genus which shows mesophilic features [5, 6]. At an outer branch, the clade with and formed a robust clade with the moderate halophilic (Fig. 2). Moreover, at a deeper branch, the clade formed a robust association with a clade that includes the thermophilic genus [7] and the genus of extremely halophilic [8], despite the relatively low (ca. 80 %) similarities between the two clades. Thus, the phylogentically robust clade contains both extremophiles and non-extremophiles. Auxotrophy for amino acids was examined using a minimal medium (glucose, 2 g; pyruvate, 0.3 g; K2HPO4, 3 g; NaH2PO4, 1 g; NH4Cl, 1 g; MgSO4•7H2O, 0.3 g; 1 ml of Holden’s trace elements [9]; 1 ml of Balch’s vitamin solution [10]; 1 L of artificial seawater [11]) supplemented with 0.3 mM or 3 mM of all amino acids except a focal amino acid. The strain could not grow in minimal medium without supplementation of L-isoleucine and L-methionine. But, the strain did not require other amino acids (L-alanine, L-arginine, L-asparagine, L-aspartate, L-cysteine, L-glutamate, L-glutamine, glycine, L-histidine, L-lysine, L-phenylalanine, L-proline, L-serine, L-threonine, L-tryptophan, L-tyrosine and selenocysteine) for growth.

Genome sequencing information

Genome project history

A culture of DSM 19535 (strain CL-CB462T) was selected for sequencing on the basis of its phylogenetic position [12, 13], and is part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes project [14], a follow-up of the Genomic Encyclopedia of Bacteria and Archaea pilot project [15], which aims in increasing the sequencing coverage of key reference microbial genomes and to generate a large genomic basis for the discovery of genes encoding novel enzymes [16]. The one thousand microbial genomes-I is the first of the production phases of the Genomic Encyclopedia of Bacteria and Archaea: sequencing a myriad of type strains initiative [17] and a Genomic Standards Consortium project [18]. The genome project is deposited in the Genomes On Line Database [19] and the genome sequence is available from GenBank. Sequencing, finishing and annotation were performed by the DOE Joint Genome Institute (JGI) using state of the art sequencing technology [20]. A summary of the project information is presented in Table 2.
Table 2

Project information

MIGS IDPropertyTerm
MIGS-31Finishing qualityLevel 2: High Quality Draft
MIGS-28Libraries usedIllumina Std shotgun library
MIGS-29Sequencing platformsIllumina
MIGS-31.2Sequencing coverage421×
MIGS-30AssemblersVelvet v. 1.1.04, ALLPATHS v. R41043
MIGS-32Gene calling methodProdigal v2.5
NCBI project ID169,820
Genbank IDAQXG00000000
Genbank Date of ReleaseDecember 12, 2013
GOLD IDGp0013655
BIOPROJECTPRJNA169820
MIGS-13Source Material IdentifierDSM 19,535
Project relevanceGEBA-KMG, Tree of Life
Project information

Growth conditions and genomic DNA preparation

DSM 19535, was grown in DSMZ medium 514 (Bacto Marine Broth) [21] at 28 °C. Genomic DNA was isolated from about 0.5 g of cell paste using Jetflex Purification Kit (Epicentre MGP04100) following the standard protocol as recommended by the manufacturer with an additional protease K (50 μl; 21 mg/ml) digest for 60 min. at 58 °C followed by addition of 200 μl Protein Precipitation Buffer after protein precipitation and overnight incubation on ice [22]. DNA was quality controlled according to JGI guidelines and is available through the DNA Bank Network [23].

Genome sequencing and assembly

The draft genome was generated using Illumina technology [24]. An Illumina Std shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 14,058,618 reads totaling 2108.8 Mbp. All general aspects of library construction and sequencing performed at the JGI can be found at [25]. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts (Mingkun L, Copeland A, Han J. DUK, unpublished, 2011). Artifact filtered sequence data was then screened and trimmed according to the k–mers present in the dataset (Mingkun L. kmernorm, unpublished, 2011). High–depth k–mers, presumably derived from MDA amplification bias, cause problems in the assembly, especially if the k–mer depth varies in orders of magnitude for different regions of the genome. Reads with high k–mer coverage (>30 × average k–mer depth) were normalized to an average depth of 30×. Reads with an average kmer depth of less than 2× were removed. Following steps were then performed for assembly: (1) normalized Illumina reads were assembled using Velvet version 1.1.04 [26], (2) 1–3 Kbp simulated paired end reads were created from Velvet contigs using wgsim [27], (3) normalized Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r41043) [28]. Parameters for assembly steps were: 1) Velvet (velveth: 63 –shortPaired and velvetg: −very clean yes –exportFiltered yes –min contig lgth 500 –scaffolding no –cov cutoff 10), 2) wgsim (−e 0 –1 100 –2 100 –r 0 –R 0 –X 0), 3) Allpaths–LG (PrepareAllpathsInputs: PHRED 64 = 1 PLOIDY = 1 FRAG COVERAGE = 125 JUMP COVERAGE = 25 LONG JUMP COV = 50, RunAllpathsLG: THREADS = 8 RUN = std shredpairs TARGETS = standard VAPI WARN ONLY = True OVERWRITE = True). The final draft assembly contained 48 contigs in 48 scaffolds. The total size of the genome is 3.8 Mbp and the final assembly is based on 457.7 Mbp of Illumina data. Based on a presumed genome size of 5Mbp, the average coverage of the genome was 421 × .

Genome annotation

Genes were identified using Prodigal [29] as part of the DOE-JGI Annotation pipeline [30, 31] followed by a round of manual curation using the JGI GenePRIMP pipeline [32]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information non-redundant database, UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG, and InterPro databases. Additional gene prediction analysis and functional annotation was performed within the Integrated Microbial Genomes [33].

Genome properties

The genome of the strain is 3,831,242 bp long and comprises 48 contigs ranging 1177 to 783,752 bp, with an overall GC content of 42.9 % (Table 3). Of the 3426 genes predicted, 3373 were protein coding genes, and 53 were RNA genes. A total of 2413 genes (70.4 %) were assigned a putative function while the remaining ones were annotated as hypothetical or unknown proteins. The distribution of genes into COG functional categories is presented in Table 4. The properties and the statistics of the genome are summarized in Tables 3 and 4.
Table 3

Genome statistics

AttributeNumber% of Totala
Genome size (bp)3,831,242100.00
DNA coding (bp)3,482,09390.89
DNA G + C (bp)1,645,31942.94
DNA scaffolds48100.00
Total genes3426100.00
Protein-coding genes337398.45
RNA genes531.55
Pseudo genes0
Genes in internal clusters104230.41
Genes with function prediction241370.43
Genes assigned to COGs193156.36
Genes with Pfam domains255774.64
Genes with signal peptides38511.24
Genes with transmembrane helices91226.62
CRISPR repeats0

aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome

Table 4

Number of genes associated with general COG functional categories

CodeValue% ageDescription
J1386.56Translation, ribosomal structure and biogenesis
A00.00RNA processing and modification
K1125.32Transcription
L1075.09Replication, recombination and repair
B10.05Chromatin structure and dynamics
D241.14Cell cycle control, cell division, chromosome partitioning
V532.52Defense mechanisms
T964.56Signal transduction mechanisms
M1758.32Cell wall/membrane biogenesis
N170.81Cell motility
U442.09Intracellular trafficking and secretion
O934.42Posttranslational modification, protein turnover, chaperones
C1336.32Energy production and conversion
G954.52Carbohydrate transport and metabolism
E1788.46Amino acid transport and metabolism
F602.85Nucleotide transport and metabolism
H964.56Coenzyme transport and metabolism
I803.80Lipid transport and metabolism
P1034.90Inorganic ion transport and metabolism
Q532.52Secondary metabolites biosynthesis, transport and catabolism
R26312.50General function prediction only
S1838.79Function unknown
-149543.64Not in COGs

The total is based on total number of protein coding genes in the annotated genome

Genome statistics aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome Number of genes associated with general COG functional categories The total is based on total number of protein coding genes in the annotated genome

Insights from the genome sequence

Based on genomic analysis of the metabolic features, DSM19535 is predicted to be an auxotroph for L-lysine, L-phenylalanine, L-tyrosine, L-arginine, L-aspartic acid, L-isoleucine, L-proline, and L-methionine. In the auxotroph test, however, the strain was found to be auxotroph only for L-isoleucine and L-methionine (Table 1). This discrepancy might be due to missing annotations of essential genes by incomplete sequencing or presence of unknown genes related with transport and/or assimilation. In addition, despite selenocysteine was one of essential amino acids required for growth by the genomic analysis, the strain could grow in a medium without selenocysteine. Genome analysis also revealed that strain DSM19535 has a copper-containing nitrite reductase gene (nirK) homolog, suggesting that the strain may transform nitrite to nitric oxide (NO) under low oxygen or anoxic conditions. In addition, the strain contains DnrN (nitric oxide-dependent regulator) gene and this may protect cells from nitrosative stress [34]. However, the nitrate, nitric oxide and nitrous oxide reductases involved in denitrification were not found. The strain has an ATP-dependent glutamine synthetase and a NADPH-dependent glutamate-oxoglutarate amidotransferase, and thus can assimilate ammonia into glutamate and glutamine. In the strain, ammonium may be transported by an ammonium transport protein. Genes participating in phosphate metabolism were also identified in the genome of the strain DSM19535. Inorganic pyrophosphatase catalyzing the conversion of pyrophosphate to phosphate ion, and polyphosphate kinase catalyzing the formation of polyphosphate from ATP were found in the genome. The strain has several genes of Pho regulon (phoH, phoU, phoR and phoB) mediating an adaptive response to inorganic phosphate limitation but not high affinity phosphate binding protein and transporter (pstS and pstACB). In addition, the strain may hydrolyze phosphate groups from many types of organic molecules using alkaline phosphatase. In the previous study, DSM19535 was reported to be able to form spores [1]. The spore-formation is very unusual in the phylum [4]. Despite four and five proteins were annotated as stage II sporulation protein E (SpoIIE) and sporulation related domain, respectively, by search against the Pfam database, more than a hundred sporulation-related genes identified in 168T were absent from the genome of strain DSM19535. Further, the genes found in were also found in genomes of phylogenetically close but non-sporulating genera, DSM 17,893 and DSM2114. Therefore, further tests to examine spore-formation were conducted in this study. Consistent with the previous study, spore-like spherical cells were found after malachite green staining. However, after pasteurization at 60 °C for 10 and 20 min and 80, 90 and 100 °C for 10 min, re-growth of cells was never observed, suggesting that the coccoid cells may not be endospore. Actually, non-spore but spore-like spherical cells were also found in aging cultures of a variety of non-sporulating bacteria including belonging to the phylum , belonging to the class and belonging to the phylum [35-37]. The genomic analyses and pasteurization experiment convincingly suggested that the spore-like coccoid cells of DSM19535 are not endospores.

Conclusion

The genome of a member belonging to the genus in the phylum is reported here. In addition to detailed information of genome sequencing and annotation, genetic characteristics related with nitrogen and phosphorus utilization could be understood on the basis of genomic analyses. In addition, genomic analyses and pasteurization experiments suggested that DSM19535 does not form spores.
  34 in total

1.  GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes.

Authors:  Amrita Pati; Natalia N Ivanova; Natalia Mikhailova; Galina Ovchinnikova; Sean D Hooper; Athanasios Lykidis; Nikos C Kyrpides
Journal:  Nat Methods       Date:  2010-05-02       Impact factor: 28.547

2.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

3.  Salinibacter ruber gen. nov., sp. nov., a novel, extremely halophilic member of the Bacteria from saltern crystallizer ponds.

Authors:  Josefa Antón; Aharon Oren; Susana Benlloch; Francisco Rodríguez-Valera; Rudolf Amann; Ramón Rosselló-Mora
Journal:  Int J Syst Evol Microbiol       Date:  2002-03       Impact factor: 2.747

4.  Anaerophaga thermohalophila gen. nov., sp. nov., a moderately thermohalophilic, strictly anaerobic fermentative bacterium.

Authors:  Karin Denger; Rolf Warthmann; Wolfgang Ludwig; Bernhard Schink
Journal:  Int J Syst Evol Microbiol       Date:  2002-01       Impact factor: 2.747

5.  New approach to the cultivation of methanogenic bacteria: 2-mercaptoethanesulfonic acid (HS-CoM)-dependent growth of Methanobacterium ruminantium in a pressureized atmosphere.

Authors:  W E Balch; R S Wolfe
Journal:  Appl Environ Microbiol       Date:  1976-12       Impact factor: 4.792

6.  The Genomic Standards Consortium.

Authors:  Dawn Field; Linda Amaral-Zettler; Guy Cochrane; James R Cole; Peter Dawyndt; George M Garrity; Jack Gilbert; Frank Oliver Glöckner; Lynette Hirschman; Ilene Karsch-Mizrachi; Hans-Peter Klenk; Rob Knight; Renzo Kottmann; Nikos Kyrpides; Folker Meyer; Inigo San Gil; Susanna-Assunta Sansone; Lynn M Schriml; Peter Sterk; Tatiana Tatusova; David W Ussery; Owen White; John Wooley
Journal:  PLoS Biol       Date:  2011-06-21       Impact factor: 8.029

7.  Complete genome sequence and description of Salinispira pacifica gen. nov., sp. nov., a novel spirochaete isolated form a hypersaline microbial mat.

Authors:  Wajdi Ben Hania; Manon Joseph; Peter Schumann; Boyke Bunk; Anne Fiebig; Cathrin Spröer; Hans-Peter Klenk; Marie-Laure Fardeau; Stefan Spring
Journal:  Stand Genomic Sci       Date:  2015-02-09

8.  Genomic Encyclopedia of Type Strains, Phase I: The one thousand microbial genomes (KMG-I) project.

Authors:  Nikos C Kyrpides; Tanja Woyke; Jonathan A Eisen; George Garrity; Timothy G Lilburn; Brian J Beck; William B Whitman; Phil Hugenholtz; Hans-Peter Klenk
Journal:  Stand Genomic Sci       Date:  2013-12-17

9.  Gracilimonas rosea sp. nov., isolated from tropical seawater, and emended description of the genus Gracilimonas.

Authors:  Yirang Cho; Hyunwoo Chung; Gwang Il Jang; Dong Han Choi; Jae Hoon Noh; Byung Cheol Cho
Journal:  Int J Syst Evol Microbiol       Date:  2013-05-24       Impact factor: 2.747

10.  Improving microbial genome annotations in an integrated database context.

Authors:  I-Min A Chen; Victor M Markowitz; Ken Chu; Iain Anderson; Konstantinos Mavromatis; Nikos C Kyrpides; Natalia N Ivanova
Journal:  PLoS One       Date:  2013-02-12       Impact factor: 3.240

View more
  2 in total

1.  Dissecting Light Sensing and Metabolic Pathways on the Millimeter Scale in High-Altitude Modern Stromatolites.

Authors:  Daniel Gonzalo Alonso-Reyes; Fátima Silvina Galván; José Matías Irazoqui; Ariel Amadio; Diogo Tschoeke; Fabiano Thompson; Virginia Helena Albarracín; María Eugenia Farias
Journal:  Microb Ecol       Date:  2022-09-26       Impact factor: 4.192

2.  Draft genome sequence of Marinobacterium rhizophilum CL-YJ9T (DSM 18822T), isolated from the rhizosphere of the coastal tidal-flat plant Suaeda japonica.

Authors:  Dong Han Choi; Gwang Ii Jang; Alla Lapidus; Alex Copeland; T B K Reddy; Supratim Mukherjee; Marcel Huntemann; Neha Varghese; Natalia Ivanova; Manoj Pillay; Brian J Tindall; Markus Göker; Tanja Woyke; Hans-Peter Klenk; Nikos C Kyrpides; Byung Cheol Cho
Journal:  Stand Genomic Sci       Date:  2017-10-30
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.