Literature DB >> 28748024

Insights into Cedecea neteri strain M006 through complete genome sequence, a rare bacterium from aquatic environment.

Kok-Gan Chan1,2, Wen-Si Tan1.   

Abstract

Cedecea neteri M006 is a rare bacterium typically found as an environmental isolate from the tropical rainforest Sungai Tua waterfall (Gombak, Selangor, Malaysia). It is a Gram-reaction-negative, facultative anaerobic, bacillus. Here, we explore the features of Cedecea neteri M006, together with its genome sequence and annotation. The genome comprised 4,965,436 bp with 4447 protein-coding genes and 103 RNA genes.

Entities:  

Keywords:  Cedecea; Facultative anaerobic; Genome; Gram-negative

Year:  2017        PMID: 28748024      PMCID: PMC5522597          DOI: 10.1186/s40793-017-0255-1

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

The genus is an extremely rare member of the family [1]. The name was proposed in 1980 for a new genus formerly designated as CDC Enteric Group 15 [1, 2]. is characterized by positive lipase activity, resistance to colistin and cephalothin, and the inability to hydrolyze gelatin or DNA [3-5]. Discovery was from human sources where its natural environmental habitat remains unknown, constitutes a rare pathogen of rising importance [6]. To date, only a few species of have been identified: , and . All three species exhibit different behaviors in the human body. has been reported to be associated with scrotal abscess [7] and, most recently, to cause bacteraemia in patients with sigmoid colon cancer [8]. On the other hand, has mostly been reported to be involved in pneumonia cases [5, 9]. is associated with bacteremia in heart disease patients [4] and patients with systemic lupus erythematosus [10]. Strain M006 is a strain of and is an aquatic isolate from the Sungai Tua Waterfall, a Malaysian tropical rainforest waterfall (N 03 19.91′ E 101 42.15′). In this study, we present an overview of the classification and features of M006 as well as its genome sequence and annotation. There are a few aquatic isolates deposited in GenBank and strain M006 was one of the few isolates discovered from a waterfall which its genome feature has not been reported. Hence, here we firstly reported the genome information of M006 isolated from a waterfall environment.

Organisms Information

Classification and features

Strain M006 was categorized as a member of the genus by 16S rRNA phylogeny and phenotypic characteristics (Table 1). The EzTaxon database [11] was used as the preliminary 16S rRNA gene sequence-based identification. Strain M006 was most closely related to GTC 1717T (GenBank accession = AB086230) with a sequence similarity of 99.78%. Subsequent phylogenetic analysis was performed comparing the 16S rRNA gene sequences of strain M006 and related species (Fig. 1). The sequences were aligned and phylogenic trees were built using neighbor-joining (NJ) and maximum-likelihood (ML) methods implemented in MEGA version 5 [12].
Table 1

Classification and general features of Cedecea neteri M006 according to MIGS recommendations [14]

MIGS IDPropertyTermEvidence code
ClassificationDomain BacteriaTAS [22]
Phylum Proteobacteria TAS [23, 24]
Class Gammaproteobacteria TAS [2527]
Order unknown TAS [23]
Family Enterobacteriaceae TAS [2830]
Genus Cedecea TAS [4]
Species Cedecea neteri IDA
Strain: M006
Gram stainnegativeTAS [4, 10]
Cell shapebacillusTAS [4, 10]
MotilitymotileTAS [4]
SporulationNon-spore formingNAS
Temperature range4-28 °CIDA
Optimum temperature28 °CIDA
pH range; Optimume.g., 5.0-8.0; 7IDA
Carbon sourceD-sorbitol, Sucrose, D-xylose, malonateTAS [4]
MIGS-6HabitatwaterfallIDA
MIGS-6.3SalinityunknownIDA
MIGS-22Oxygen requirementFacultative anaerobicTAS [4, 10]
MIGS-15Biotic relationshipFree-livingTAS [4]
MIGS-14PathogenicityNon-pathogenIDA
MIGS-4Geographic locationSungai Tua Waterfall, MalaysiaIDA
MIGS-5Sample collection2013IDA
MIGS-4.1LatitudeN 03 19.91′IDA
MIGS-4.2LongitudeE 101 42.15′IDA
MIGS-4.4Altitude586 mIDA

Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [31]

Fig. 1

Phylogenetic tree highlighting the position of Cedecea neteri M006 relative to the type strains of other species within the genus of Cedecea. The strains and their corresponding GenBank accession numbers of 16S rRNA genes are indicated in parentheses. The sequences were aligned and the phylogenetic inferences were obtained using the maximum-likelihood method with MEGA version 5 [12]. The numbers at the nodes are the percentage of bootstrap values obtained by 500 replicates. Bar, 0.01 substitutions per nucleotide positions

Classification and general features of Cedecea neteri M006 according to MIGS recommendations [14] Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [31] Phylogenetic tree highlighting the position of Cedecea neteri M006 relative to the type strains of other species within the genus of Cedecea. The strains and their corresponding GenBank accession numbers of 16S rRNA genes are indicated in parentheses. The sequences were aligned and the phylogenetic inferences were obtained using the maximum-likelihood method with MEGA version 5 [12]. The numbers at the nodes are the percentage of bootstrap values obtained by 500 replicates. Bar, 0.01 substitutions per nucleotide positions M006 cells are Gram-negative, bacillus in shape (0.6-0.7 × 1.3-1.9 μm), are facultatively anaerobic and are motile with 5-9 peritrichous flagella. Colonies formed on nutrient agar are 1.5 mm in diameter and non-pigmented. Scanning electron micrograph pictures of nutrient broth grown cultures showed free-floating cells and clotted cells (Fig. 2). The carbon sources utilized by are D-sorbitol, sucrose, D-xylose and malonate. is reported to be unable to utilize dulcitol, adoitol, L-rhamnose, erythritol, glycerol and mucate. The optimal temperature for strain M006 is 28 °C.
Fig. 2

Scanning electron micrograph of Cedecea neteri M006. Scale bar 3.0 μm

Scanning electron micrograph of Cedecea neteri M006. Scale bar 3.0 μm M006 cells are Gram-negative, bacillus in shape, survive facultative anaerobically and are motile. The colonies formed on nutrient agar are 1.5 mm in diameter and are non-pigmented. The colony is whitish in color and the appearance is round with a smooth edge. Signaling molecules, known as N-acylhomoserine lactone, are produced for communication purposes in order to regulate physiological properties. The preliminary screening of strain M006 using the bacterial biosensor (CV026) showed the purple pigmentation indicative the presence of signaling molecules (Fig. 3).
Fig. 3

Preliminary screening for AHL. AHL screening of strain M006 with CV026. E. carotovora PNP22 and E. carotovora GS101 served as negative and positive controls respectively

Preliminary screening for AHL. AHL screening of strain M006 with CV026. E. carotovora PNP22 and E. carotovora GS101 served as negative and positive controls respectively

Genome sequencing information

Genome project history

Strain M006 was selected for the sequencing based on its phylogenetic position and the similarity of its 16S rRNA to other members of the genus , The genome project was deposited in the Genomes On-Line Database [13] and the genome sequence was deposited in GenBank (CP009458.1). A summary of the project and the Minimum Information about a Genome Sequence (MIGS) [14] are shown in Table 2.
Table 2

Genome sequencing project information

MIGS IDPropertyTerm
MIGS 31Finishing qualityComplete
MIGS-28Libraries usedPacBio
MIGS 29Sequencing platformsPacBio
MIGS 31.2Fold coverage74.34×
MIGS 30AssemblersHGAP V 2.1.1
MIGS 32Gene calling methodIMG-ER
Locus TagLH23
Genbank IDCP009458
Genbank Date of Release2014/10/22
GOLD IDGp0109502
BIOPROJECTPRJNA260769
MIGS 13Source List IdentifierM006
Project relevanceEnvironmental
Genome sequencing project information

Growth conditions and genomic DNA preparation

M006 was cultured aerobically on Luria-Bertani (LB) agar medium at 28 °C overnight (16-18 h). Genomic DNA was extracted using the MasterPure™ DNA Purification Kit (Epicentre Inc., Madison, WI, USA). The extracted genomic DNA was examined via a NanoDrop spectrophotometer (Thermo Scientific, Waltham, MA, USA) and a Qubit 2.0 fluorometer (Life Technologies, Carlsbad, CA, USA) for its quality.

Genome sequencing and assembly

The genome of strain M006 was sequenced at the microbiome lab, High Impact Research, University Malaya, using a Pacific Biosciences single-molecule real-time (PacBio SMRT) sequencer. The sequencing was carried out using P5 chemistry on two SMRT cells with a 20-kb prepared SMRTbell library [15]. De novo assembly of 41,094 reads using the hierarchical genome assembly process in the SMRT version 2.1.1 portal resulted with one contig of 3.96 Mb in size. The sequencing average coverage is 74.34 × and this genome has a GC content of 54.41%.

Genome annotation

After genome assembly, it was analyzed using Rapid Annotation using Subsystem Technology server databases (version 2.0) [16], which identified 4423 predicted coding sequences with a total of 103 RNA genes. The predicted open reading frames were annotated by searching clusters of orthologous groups [17] using the Integrated Microbial Genomes Expert Review [18]. The different groups of RNAs (rRNA and tRNA) were identified by using RNAmmer 1.2 [19] and tRNAscan-SE 1.23 [20] respectively. The additional gene prediction analysis and functional annotation were performed within IMG-ER platform.

Genome properties

The genome comprised a circular chromosome with a length of 4,965,436 bp and 54.41% G + C content (Fig. 4 and Table 3). It is composed of one contig and of the 4550 predicted genes, 4447 were protein-coding genes. The properties of and the statistics for the genome are summarized in Table 3. The distribution of genes into COG functional categories is presented in Table 4.
Fig. 4

Graphical circular map of the genome. Starting from the outermost circle and moving inwards, each ring of the circle contains information on a genome: tRNA/rRNA, genes on the reverse and forward strands, GC skew and GC ratio

Table 3

Genome statistics

AttributeValue% of total
Genome size (bp)4,965,436100
DNA coding (bp)4,350,83487.62
DNA G + C (bp)2,701,61654.41
DNA scaffolds1100
Total genes4550100
Protein coding genes444797.74
RNA genes1032.26
rRNA genes220.48
tRNA801.76
Pseudo genes240.53
Genes in paralog clusters346276.09
Genes with function prediction409189.91
Genes assignmed to COGs361179.36
Genes with Pfam peptides409590.00
Genes with signal peptides46610.24
Genes with transmembrane helices107923.71
CRISPR repeats00.00
Table 4

Number of genes associated with general COG functional categories

CodeValue% agea Description
J1894.70Translation, ribosomal structure and biogenesis
A10.02RNA processing and modification
K3959.82Transcription
L1333.31Replication, recombination and repair
B00.00Chromatin structure and dynamics
D320.80Cell cycle control, Cell division, chromosome partitioning
V471.17Defense mechanisms
T1814.50Signal transduction mechanisms
M2245.57Cell wall/membrane biogenesis
N1172.91Cell motility
U1052.61Intracellular trafficking and secretion
O1453.60Posttranslational modification, protein turnover, chaperones
C2315.74Energy production and conversion
G3629.00Carbohydrate transport and metabolism
E41210.24Amino acid transport and metabolism
F962.39Nucleotide transport and metabolism
H1583.93Coenzyme transport and metabolism
I1092.71Lipid transport and metabolism
P2666.61Inorganic ion transport and metabolism
Q751.86Secondary metabolites biosynthesis, transport and catabolism
R40910.16General function prediction only
S3378.37Function unknown
-93920.64Not in COGs

aThe total is based on the total number of protein coding genes in the annotated genome

Graphical circular map of the genome. Starting from the outermost circle and moving inwards, each ring of the circle contains information on a genome: tRNA/rRNA, genes on the reverse and forward strands, GC skew and GC ratio Genome statistics Number of genes associated with general COG functional categories aThe total is based on the total number of protein coding genes in the annotated genome

Insights from the genome sequence

RAST annotation allowed the insight of subsystem category distribution of strain M006. This category enabled the understanding of various functional roles such as protein classes, amino acid biosynthesis and metabolic pathways. There are 552 subsystems. The most abundant subsystem feature belonged to carbohydrate metabolism (n = 576; out of a total of 3760 subsystem feature counts), followed by amino acid and derivatives (n = 495) and protein metabolism (n = 299) (Fig. 5). One of the subsystem features grouped as regulation and cell signaling was focused to allow functional genes related to quorum sensing (QS) activity to be searched. The in-silico study identified the novel LuxIR homologue of , which was later designated as CneIR. The complete open reading frame of strain M006 cneI and cneR homologues were found and are 462 bp and 723 bp, respectively. The complete genome sequencing allows deeper understanding of the genetic makeup that may help in identifying the linkage of pathogenicity and virulence factors with its QS properties [15].
Fig. 5

RAST annotation of C. neteri strain M006. This annotation pipeline allows a view of the subsystem category distribution of C. neteri strain M006. Genes responsible for QS activity in this strain can be found in regulation and cell signaling subsystem (red arrow)

RAST annotation of C. neteri strain M006. This annotation pipeline allows a view of the subsystem category distribution of C. neteri strain M006. Genes responsible for QS activity in this strain can be found in regulation and cell signaling subsystem (red arrow) Currently, the availability of genomes of this genus is low. Only 5 complete genomes of strains including strain M006 and a draft genome of type strain NBRC 105707 are deposited in NCBI. A matrix and dendrogram were generated based on AAI calculation that provide estimation of the average amino acid identity using best hits (one-way AAI) and reciprocal best hits (two-way AAI) between several genomic datasets of proteins [21], type strain DSM 4568 was included in the analyses. From the analyses, we can see closer protein clustering between strain M004 and strain ND14a (Fig. 6). Some of the basic comparisons of the genomes are listed in Table 5.
Fig. 6

AAI calculation for 6 C. neteri strains and 1 C. davisae strain. Analyses of conserved genes in the core genome computed based on AAI calculator provided (a) an AAI matrix; and (b) AAI-based phylogenetic distance tree, clustered according to distance pattern. The AAI-distance tree was clustered based on BIONJ method

Table 5

Comparison of several strains of C. neteri

Organism/NameStrainSize (Mb)GC%GeneProtein
C. neteri M0064.9754.4047034531
ND024.3153.9040533884
ND14b5.0556.9044914295
ND14a4.6654.8044264215
SSMD044.8855.1046224416
NBRC 1057075.2054.1049444739
AAI calculation for 6 C. neteri strains and 1 C. davisae strain. Analyses of conserved genes in the core genome computed based on AAI calculator provided (a) an AAI matrix; and (b) AAI-based phylogenetic distance tree, clustered according to distance pattern. The AAI-distance tree was clustered based on BIONJ method Comparison of several strains of C. neteri

Conclusion

This study provides phenotypic and genomic insights into strain M006. It reports the isolation of from an aquatic environment for the first time. This study also revealed of the QS ability of .
  23 in total

1.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2005-05       Impact factor: 2.747

2.  Towards a genome-based taxonomy for prokaryotes.

Authors:  Konstantinos T Konstantinidis; James M Tiedje
Journal:  J Bacteriol       Date:  2005-09       Impact factor: 3.490

3.  List of new names and new combinations previously effectively, but not validly, published.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2006-01       Impact factor: 2.747

4.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

5.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

6.  The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification.

Authors:  T B K Reddy; Alex D Thomas; Dimitri Stamatis; Jon Bertsch; Michelle Isbandi; Jakob Jansson; Jyothi Mallajosyula; Ioanna Pagani; Elizabeth A Lobos; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2014-10-27       Impact factor: 16.971

7.  Bacteraemia with Cedecea neteri in a patient with systemic lupus erythematosus.

Authors:  A Aguilera; J Pascual; E Loza; J Lopez; G Garcia; F Liaño; C Quereda; J Ortuño
Journal:  Postgrad Med J       Date:  1995-03       Impact factor: 2.401

8.  Cedecea davisae isolated from scrotal abscess.

Authors:  B H Bae; S B Sureka
Journal:  J Urol       Date:  1983-07       Impact factor: 7.450

Review 9.  Bacteraemia due to Cedecea davisae in a patient with sigmoid colon cancer: a case report and brief review of the literature.

Authors:  Karolina Akinosoglou; Angelos Perperis; Dimitrios Siagris; Panagiota Goutou; Irida Spiliopoulou; Charalambos A Gogos; Markos Marangos
Journal:  Diagn Microbiol Infect Dis       Date:  2012-08-10       Impact factor: 2.803

10.  The minimum information about a genome sequence (MIGS) specification.

Authors:  Dawn Field; George Garrity; Tanya Gray; Norman Morrison; Jeremy Selengut; Peter Sterk; Tatiana Tatusova; Nicholas Thomson; Michael J Allen; Samuel V Angiuoli; Michael Ashburner; Nelson Axelrod; Sandra Baldauf; Stuart Ballard; Jeffrey Boore; Guy Cochrane; James Cole; Peter Dawyndt; Paul De Vos; Claude DePamphilis; Robert Edwards; Nadeem Faruque; Robert Feldman; Jack Gilbert; Paul Gilna; Frank Oliver Glöckner; Philip Goldstein; Robert Guralnick; Dan Haft; David Hancock; Henning Hermjakob; Christiane Hertz-Fowler; Phil Hugenholtz; Ian Joint; Leonid Kagan; Matthew Kane; Jessie Kennedy; George Kowalchuk; Renzo Kottmann; Eugene Kolker; Saul Kravitz; Nikos Kyrpides; Jim Leebens-Mack; Suzanna E Lewis; Kelvin Li; Allyson L Lister; Phillip Lord; Natalia Maltsev; Victor Markowitz; Jennifer Martiny; Barbara Methe; Ilene Mizrachi; Richard Moxon; Karen Nelson; Julian Parkhill; Lita Proctor; Owen White; Susanna-Assunta Sansone; Andrew Spiers; Robert Stevens; Paul Swift; Chris Taylor; Yoshio Tateno; Adrian Tett; Sarah Turner; David Ussery; Bob Vaughan; Naomi Ward; Trish Whetzel; Ingio San Gil; Gareth Wilson; Anil Wipat
Journal:  Nat Biotechnol       Date:  2008-05       Impact factor: 54.908

View more
  1 in total

1.  Urinary Catheter Colonization by Multidrug-Resistant Cedecea neteri in Patient with Benign Prostatic Hyperplasia.

Authors:  Peter S Ginn; Serina B Tart; Stephen M Sharkady; Dorothea K Thompson
Journal:  Case Rep Infect Dis       Date:  2018-07-11
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.