| Literature DB >> 35208867 |
Nurfatini Radzlin1,2, Amira Suriaty Yaakop3, Kian Mau Goh4, Kok Jun Liew4, Iffah Izzati Zakaria1, Ummirul Mukminin Kahar1.
Abstract
A halophilic marine bacterial strain, PS-C1, was isolated from Sekinchan beach in Selangor, Malaysia. The 16S rRNA gene sequence analysis indicated that strain PS-C1 was associated with the genus Celeribacter. To date, there have been no reports on enzymes from the genus Celeribacter. The present study reports on the cellular features of Celeribacter sp. PS-C1, its annotated genome sequence, and comparative genome analyses of Celeribacter glycoside hydrolase (GH) enzymes. The genome of strain PS-C1 has a size of 3.87 Mbp and a G+C content of 59.10%, and contains 3739 protein-coding genes. Detailed analysis using the Carbohydrate-Active enZYmes (CAZy) database revealed that Celeribacter genomes harboured at least 12 putative genes encoding industrially important GHs that are grouped as cellulases, β-glucanases, hemicellulases, and starch-degrading enzymes. Herein, the potential applications of these enzymes are discussed. Furthermore, the activities of two types of GHs (β-glucosidase and licheninase) in strain PS-C1 were demonstrated. These findings suggest that strain PS-C1 could be a reservoir of novel GH enzymes for lignocellulosic biomass degradation.Entities:
Keywords: Celeribacter; Rhodobacteraceae; carbohydrate-active enzymes; glycoside hydrolase; licheninase; lignocellulose biomass; marine bacteria; starch; β-glucan; β-glucosidase
Year: 2022 PMID: 35208867 PMCID: PMC8874975 DOI: 10.3390/microorganisms10020410
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Figure 1Phylogenetic tree based on 16S rRNA gene sequences showing the relationship between strain PS-C1 and representatives of Celeribacter spp. [19,20,21,22,23,24,25,26,27,28,29,30,31,32]. The 16S rRNA NCBI GenBank accession number for each strain is shown in brackets. The 16S rRNA sequences were aligned using ClustalW, and the tree was constructed using a neighbour-joining method with 1000 bootstrap replicates embedded in the MEGA11 software package [45]. The sequenced genomes and their respective information are indicated in blue. The scale bar represents 0.01 nucleotide substitution per site. Roseibacterium elongatum OCH323T was used as an out-group. T Type strain; a complete genome.
Comparison of morphology and biochemical characteristics between strain PS-C1 and type strains of the genus Celeribacter.
| Characteristic | PS-C1 a | EMB201T | ZXM137T | NH195T | DSM100434T | DSM26471T | P73T | DSM27375T | DY2-5T | KMM9024T | IMCC12053 T |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||||
| Cell size | 0.18–1.2 × | 0.8–1.2 × | 0.3 × | 0.5–1.0 × | 0.4–0.5 × | 0.4–0.9 × | 0.6–0.7 × | 0.6–1.0 × | 0.5–0.8 × | 0.6–0.8 × | 0.6–0.7 × |
| Catalase | + | + | + | + | + | + | + | + | + | + | + |
| Oxidase | + | + | + | + | + | − | + | + | ND | + | + |
| Motility | − | − | − | − | ND | − | − | − | − | − | − |
|
| |||||||||||
| Temperature range | 30–40 | 10–37 | 4–45 | 20–40 | 10–45 | 5–35 | 10–41 | 4–37 | 0–37 | 4–37 | 4–37 |
| pH range | 5.5–10.0 | 5.0–9.5 | 6.0–9.0 | 5.0–9.0 | 5.0–9.0 | 7.5–8.0 | 2.0–12.0 | 7.5–8.0 | 6.5–9.0 | 5.5–9.5 | 6.0–9.0 |
| NaCl range | 1.0–8.0 | 1.0–7.0 | 0.5–11.0 | 0.5–10.0 | 0–18.0 | 1.0–8.0 | 0.5–12.0 | 0–13.0 | 11.0 | 1.0–7.0 | 0.5–5.0 |
| Anaerobic growth | + | − | − | − | − | + | + | − | ND | − | − |
|
| |||||||||||
| Ampicillin (50 µg/mL) | + | nd | + | + | nd | nd | + | + | nd | + | nd |
| Penicillin G (50 µg/mL) | + | nd | + | nd | nd | nd | nd | + | nd | + | nd |
| Tetracycline hydrochloride (100 µg/mL) | + | nd | + | + | nd | + | + | nd | nd | + | nd |
| Kanamycin sulphate (100 µg/mL) | + | nd | + | + | nd | nd | + | + | nd | + | + |
| Chloramphenicol (100 µg/mL) | + | nd | nd | + | nd | + | nd | + | nd | + | + |
|
| |||||||||||
| Nitrate reduction | + | − | − | − | + | + | − | − | − | + | − |
| Citrate utilization | + | − | + | − | − | − | − | + | − | − | nd |
| Indole production | − | − | − | − | nd | − | − | − | − | − | − |
| Urease degradation | + | − | + | + | + | + | + | + | + | − | − |
| D-lactose degradation | − | nd | − | − | − | + | nd | + | − | − | − |
| Lipase activity | − | − | + | − | + | − | + | − | − | − | − |
| α-glucosidase activity | + | + | + | + | + | + | − | + | nd | + | + |
| β-glucosidase activity | + | + | + | + | + | + | + | + | − | nd | + |
|
| |||||||||||
| NCBI GenBank accession number | MW785752 | KP272156 | FJ436725 | KT852989 | KR349442 | FJ535354 | EU440950 | HM997022 | KF356415 | AB564595 | KF146343 |
| NCBI Blastn (%) | 100.00 | 99.38 | 99.04 | 98.36 | 97.01 | 96.94 | 96.72 | 96.37 | 95.10 | 94.46 | 94.27 |
| EzBioCloud pairwise alignment (%) b | 100.00 | 98.92 | 99.06 | 98.41 | 96.83 | 96.89 | 96.61 | 96.54 | 95.38 | 94.59 | 94.66 |
|
| This study | [ | [ | [ | [ | [ | [ | [ | [ | [ | [ |
PS-C1: Strain PS-C1; EMB201T: Celeribacter naphthalenivorans EMB201T; ZXM137T: Celeribacter halophilus ZXM137T; NH195T: Celeribacter ethanolicus NH195T; DSM100434T: Celeribacter persicus DSM100434T; DSM26471T: Celeribacter neptunius DSM26471T; P73T: Celeribacter indicus P73T; DSM27375T: Celeribacter baekdonensis DSM27375T; DY2-5T: Celeribacter manganoxidans DY2-5T; KMM9024T: Celeribacter arenosi KMM9024T; IMCC12053T: Celeribacter marinus IMCC12053T. Characteristics are scored as: +, positive; −, negative; and, not determined. a Complete API result for strain PS-C1 is listed in Table S1; b pairwise 16S rRNA sequence alignment for taxonomy using EzBioCloud 16S database of strain PS-C1 against all type strains of Celeribacter spp.; T type strain.
Figure 2A field emission scanning electron micrograph of strain PS-C1 at 24,000× magnification. Scale bar: 5 μm.
Genome attributes of strain PS-C1 according to Minimum Information about a Genome Sequence (MIGS) specification a.
| Attribute | Description |
|---|---|
|
| |
| Sequencing platform | Illumina NovaSeq |
| Assembly | SOAPdenovo v2.04 |
| Finishing strategy | High-quality draft |
| Genome coverage | 357 × |
| Genome quality | No contamination |
| Relevance | Industrial |
|
| |
| Genome size (bp) | 3,866,278 |
| G+C content (%) | 59.10% |
| Number of contigs | 40 |
| Longest contig length (bp) | 499,873 |
| N50 value | 302,457 |
| N90 value | 80,703 |
| L50 value | 5 |
| Total genes | 3818 |
| Pseudogenes | 25 |
| Noncoding RNA genes | 54 |
| tRNA | 48 |
| ncRNA | 3 |
| 5S rRNA | 1 |
| 16S rRNA | 1 |
| 23S rRNA | 1 |
| Protein-coding sequences | 3739 |
| With Pfam | 2657 |
| With signal peptide | 496 |
| With COGs | 3702 |
| Connected to KEGG pathways | 2218 |
| Putative hypothetical proteins | 462 |
|
| |
| GenBank | JAHXRW000000000 |
| BioProject | PRJNA716474 |
| BioSample | SAMN18354561 |
| Locus Tag | J5Y17 |
| GenBank | JAHXRW000000000 |
a The MIGS information for all Celeribacter genomes is listed in Table S2.
Genomic comparison of strain PS-C1 against all available genomes of Celeribacter spp. using Genome-to-Genome Distance Calculator (GGDC) and Average Nucleotide Identity (ANI).
| dDDH | Strain | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ANI | PS-C1 | EaN35-2 | HF31 | NH195T | TSPH2 | ZXM137T | G3M19 | DSM100434T | DSM26471T | ASW11-22 | P73T | B30 | DSM27375T | LH4 | IMCC12053T | |
|
|
| 100.00 |
| 34.60 | 23.80 | 23.70 | 22.60 | 22.60 | 22.50 | 22.40 | 21.10 | 19.80 | 19.70 | 19.50 | 19.40 | 18.20 |
|
|
| 100.00 | 41.10 | 24.90 | 24.40 | 23.10 | 23.00 | 22.80 | 23.00 | 20.90 | 20.30 | 19.90 | 19.70 | 19.80 | 18.90 | |
|
| 88.12 | 90.42 | 100.00 | 24.30 | 24.30 | 23.00 | 23.10 | 22.70 | 22.80 | 20.70 | 20.10 | 19.90 | 19.80 | 19.70 | 18.90 | |
|
| 81.39 | 82.05 | 81.67 | 100.00 | 82.10 | 22.00 | 21.80 | 30.50 | 24.10 | 21.00 | 21.10 | 20.60 | 20.60 | 19.50 | 18.70 | |
|
| 81.10 | 81.73 | 81.60 | 98.07 | 100.00 | 21.90 | 21.90 | 29.80 | 23.00 | 20.70 | 22.00 | 19.70 | 19.90 | 19.40 | 19.00 | |
|
| 80.31 | 80.37 | 80.55 | 86.07 | 85.62 | 100.00 | 61.30 | 21.70 | 22.00 | 20.20 | 19.90 | 19.40 | 19.60 | 19.70 | 19.00 | |
|
| 80.18 | 80.43 | 80.44 | 79.25 | 79.23 | 79.04 | 100.00 | 21.60 | 22.00 | 20.20 | 19.80 | 19.40 | 19.40 | 19.40 | 18.70 | |
|
| 79.95 | 80.33 | 80.45 | 79.06 | 78.97 | 78.98 | 95.22 | 100.00 | 23.50 | 20.30 | 21.20 | 20.70 | 21.00 | 19.20 | 18.30 | |
|
| 79.73 | 80.36 | 80.04 | 80.81 | 80.18 | 80.44 | 78.68 | 78.68 | 100.00 | 20.80 | 21.10 | 21.50 | 21.20 | 19.70 | 18.90 | |
|
| 77.70 | 77.64 | 77.11 | 77.44 | 77.29 | 77.14 | 76.23 | 76.37 | 77.11 | 100.00 | 19.30 | 18.70 | 18.40 | 18.60 | 18.20 | |
|
| 76.46 | 76.73 | 76.67 | 76.94 | 76.82 | 77.32 | 75.94 | 76.02 | 77.44 | 74.25 | 100.00 | 19.00 | 18.70 | 18.50 | 18.20 | |
|
| 76.41 | 76.80 | 76.77 | 76.94 | 76.53 | 77.18 | 75.82 | 75.89 | 77.44 | 74.51 | 93.67 | 100.00 | 53.90 | 40.80 | 18.70 | |
|
| 76.30 | 76.68 | 76.37 | 76.24 | 76.32 | 75.94 | 75.98 | 75.97 | 76.18 | 74.35 | 89.78 | 90.28 | 100.00 | 39.00 | 18.60 | |
|
| 76.23 | 76.91 | 76.99 | 77.90 | 78.92 | 78.27 | 75.99 | 75.93 | 77.48 | 75.43 | 74.53 | 74.90 | 74.60 | 100.00 | 72.10 | |
|
| 72.51 | 72.78 | 72.83 | 72.48 | 72.56 | 72.45 | 72.71 | 72.63 | 72.51 | 72.27 | 72.91 | 72.99 | 73.12 | 72.09 | 100.00 | |
The dDDH and ANI values (%) shared between the genomes are shown at blue and red diagonals, respectively; Bold values: closest relative to strain PS-C1; PS-C1: strain PS-C1; EaN35-2: Celeribacter naphthalenivorans EaN35-2; HF31: Celeribacter sp. HF31; NH195T: Celeribacter ethanolicus NH195T [27]; TSPH2: Celeribacter ethanolicus TSPH2 [30]; ZXM137T: Celeribacter halophilus ZXM137T; G3M19: Celeribacter halophilus G3M19; DSM100434T: Celeribacter persicus DSM100434T [26]; DSM26471T: Celeribacter neptunius DSM26471T [19]; ASW11-22: Celeribacter sp. ASW11-22; P73T: Celeribacter indicus P73T [16,24]; B30: Celeribacter baekdonensis B30; DSM27375T: Celeribacter baekdonensis DSM27375T; LH4: Celeribacter baekdonensis LH4 [32]; IMCC12053T: Celeribacter marinus IMCC12053T [25,31]; T type strain.
Cluster of Orthologous Group (COG) functional assignments of strain PS-C1 a.
| COG Class | COG Functional Categories | Strain PS-C1 | |
|---|---|---|---|
| Gene Count | Percentage (%) | ||
|
| |||
| C | Energy production and conversion | 254 | 6.79 |
| E | Amino acid transport and metabolism | 324 | 8.67 |
| F | Nucleotide transport and metabolism | 86 | 2.30 |
| G | Carbohydrate transport and metabolism | 219 | 5.86 |
| H | Coenzyme transport and metabolism | 126 | 3.37 |
| I | Lipid transport and metabolism | 125 | 3.34 |
| P | Inorganic ion transport and metabolism | 253 | 6.77 |
| Q | Secondary metabolites, biosynthesis, transport, and catabolism | 118 | 3.16 |
|
| |||
| D | Cell cycle control, cell division, and mitosis | 37 | 0.99 |
| M | Cell wall/membrane/envelope biogenesis | 339 | 9.07 |
| N | Cell motility | 7 | 0.19 |
| O | Post-translational modification, protein turnover, chaperone functions | 117 | 3.13 |
| T | Signal transduction mechanisms | 119 | 3.18 |
| U | Intracellular trafficking, secretion, and vesicular transport | 65 | 1.74 |
|
| |||
| J | Translation, ribosomal structure, and biogenesis | 187 | 5.00 |
| K | Transcription | 240 | 6.42 |
| L | Replication, recombination, and repair | 2 | 0.05 |
| V | Defense mechanisms | 34 | 0.91 |
| Z | Cytoskeleton | 1 | 0.03 |
|
| |||
| R | General function prediction only | 0 | 0.00 |
| S | Function unknown | 619 | 16.56 |
|
| 3702 | 87.51 | |
a The complete COG profiles for all Celeribacter genomes are listed in Table S4.
Figure 3Total number of CAZymes present in strain PS-C1 and 14 other known genomes of Celeribacter spp. GH: glycoside hydrolase; GT: glycoside transferase; AA: auxiliary activity; CE: carbohydrate esterase; a complete genome; T type strain.
List of several glycoside hydrolases (GHs) identified in Celeribacter genomes and their identity to the other genera.
| Category | Predicted function | Identity within | Identity to Other | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PS-C1 | P73T | TSPH2 | IMCC12053T | LH4 | DSM27375T | DSM26471T | DSM100434T | NH195T | ZXM137T | G3M19 | ASW11-22 | EaN35-2 | B30 | HF31 | |||
|
| |||||||||||||||||
| (GH1) | β-glucosidase | 73.53 | 100 | 73.76 | 75.28 | 75.57 | 75.79 | 71.69 | 74.89 | 73.76 | 75.11 | 75.11 | 71.72 | 74.66 | 76.47 | 74.66 | |
|
| |||||||||||||||||
| (GH8) | Endo-1,3(4)-β-glucanase | − | − | − | − | − | − | − | − | − | − | − | − | − | − | 100 | |
| (GH16) | Licheninase | 53.00 | 100 | − | − | − | − | − | − | − | − | 58.25 | − | 53.00 | − | 56.30 | |
| (GH26) | Endo-β-1,3-1,4- | − | − | 100 | − | 74.58 | − | 72.58 | 96.03 | 99.67 | − | − | − | − | − | - | |
|
| |||||||||||||||||
| (GH2) | β-mannosidase | − | − | 100 | − | 66.75 | 68.99 | − | 81.65 | 97.13 | − | − | − | − | 68.86 | − | |
| (GH26) | β-1,3-xylanase | 100 | − | − | − | − | − | − | − | − | − | − | − | − | − | 92.25 | |
| (GH36) | α-galactosidase | − | − | − | − | − | − | − | − | − | 100 | 94.49 | − | − | − | − | |
| (GH43) | β-xylosidase | − | − | − | 100 | − | − | − | − | − | − | − | − | − | − | − | |
| (GH51) | α-L-arabinofuranosidase | − | − | − | − | − | − | − | − | − | − | − | − | 100 | − | − | |
|
| |||||||||||||||||
| (GH13) | α-amylase | − | − | − | − | − | − | − | − | − | 100 | 98.97 | − | − | 24.15 | − | |
| (GH13) | Oligo-1,6-glucosidase | − | − | − | − | − | − | − | − | − | − | − | − | − | 100 | − | |
| (GH13) | α-glucosidase | 30.97 | 100 | 31.06 | 30.77 | 29.87 | 30.42 | 30.76 | 31.67 | 31.23 | 31.19 | 31.36 | 31.03 | 31.29 | 30.00 | 30.62 | |
PS-C1: strain PS-C1; EaN35-2: Celeribacter naphthalenivorans EaN35-2; HF31: Celeribacter sp. HF31; NH195T: Celeribacter ethanolicus NH195T [27]; TSPH2: Celeribacter ethanolicus TSPH2 [30]; ZXM137T: Celeribacter halophilus ZXM137T; G3M19: Celeribacter halophilus G3M19; DSM100434T: Celeribacter persicus DSM100434T [26]; DSM26471T: Celeribacter neptunius DSM26471T [19]; ASW11-22: Celeribacter sp. ASW11-22; P73T: Celeribacter indicus P73T [16,24]; B30: Celeribacter baekdonensis B30; DSM27375T: Celeribacter baekdonensis DSM27375T; LH4: Celeribacter baekdonensis LH4 [32]; IMCC12053T: Celeribacter marinus IMCC12053T [25,31]. Characteristics scored as: -, absence; T type strain. a The NCBI GenBank accession numbers for the respective GHs are listed in Table S5.
Figure 4(a) Schematic representation of domain organisation for β-glucosidase in strain PS-C1 (BglPS-C1); aa: amino acids. (b) Time courses of growth of strain PS-C1 and BglPS-C1 production by cultivation in the marine broth supplemented with 1.0% (w/v) cellobiose. p-Nitrophenyl-β-D-glucopyranoside (pNPG) was used as the enzymatic substrate. The values shown represent the mean ± standard error of triplicate analyses.
Figure 5(a) Schematic representation of domain arrangements for the licheninase from strain PS-C1 (LicPS-C1); aa: amino acids. (b) Time courses of strain PS-C1 growth and LicPS-C1 production by cultivation in the marine broth supplemented with 1% (w/v) cellobiose. The LicPS-C1 activities were detected using p-nitrophenyl-β-D-glucopyranoside (pNPG) and β-glucan as the enzymatic substrates. The values shown represent the mean ± standard error of triplicate analyses. (c) Analysis of reaction products produced by LicPS-C1 acting on β-glucan at different time intervals using ultra-high-performance liquid chromatography with an evaporative light-scattering detector (UHPLC-ELSD). The amount of total sugars produced at each time point is shown relative to that at 48 h. Dp1: glucose; Dp2: cellobiose; Dp3: cellotriose.