| Literature DB >> 18516288 |
Ronald M Weiner1, Larry E Taylor, Bernard Henrissat, Loren Hauser, Miriam Land, Pedro M Coutinho, Corinne Rancurel, Elizabeth H Saunders, Atkinson G Longmire, Haitao Zhang, Edward A Bayer, Harry J Gilbert, Frank Larimer, Igor B Zhulin, Nathan A Ekborg, Raphael Lamed, Paul M Richardson, Ilya Borovok, Steven Hutcheson.
Abstract
The marine bacterium Saccharophagus degradans strain 2-40 (Sde 2-40) is emerging as a vanguard of a recently discovered group of marine and estuarine bacteria that recycles complex polysaccharides. We report its complete genome sequence, analysis of which identifies an unusually large number of enzymes that degrade >10 complex polysaccharides. Not only is this an extraordinary range of catabolic capability, many of the enzymes exhibit unusual architecture including novel combinations of catalytic and substrate-binding modules. We hypothesize that many of these features are adaptations that facilitate depolymerization of complex polysaccharides in the marine environment. This is the first sequenced genome of a marine bacterium that can degrade plant cell walls, an important component of the carbon cycle that is not well-characterized in the marine environment.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18516288 PMCID: PMC2386152 DOI: 10.1371/journal.pgen.1000087
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
General features of the S. degradans 2-40 genome.
| Category | Number | % of Total |
| DNA, total number of bases | 5057531 | 100.00% |
| DNA coding number of bases | 4385202 | 86.71% |
| DNA G+C number of bases | 2317668 | 45.83% |
| DNA scaffolds | 1 | 100.00% |
| Genes total number | 4067 | 100.00% |
| Protein coding genes (including pseudogenes) | 4017 | 98.77% |
| RNA genes | 50 | 1.23% |
| rRNA genes | 6 | 0.15% |
| 5S rRNA | 2 | 0.05% |
| 16S rRNA | 2 | 0.05% |
| 23S rRNA | 2 | 0.05% |
| tRNA genes | 41 | 1.01% |
| Other RNA genes ( | 3 | 0.08% |
| Genes with function prediction | 2809 | 69.07% |
| Genes without function prediction | 1208 | 29.70% |
| Genes w/o function with similarity | 1206 | 29.65% |
| Genes w/o function w/o similarity | 2 | 0.05% |
| Pseudogenes | 9 | 0.22% |
| Genes assigned to enzymes | 403 | 9.91% |
| Genes connected to KEGG pathways | 404 | 9.93% |
| Genes not connected to KEGG pathways | 3663 | 90.07% |
| Genes in ortholog clusters | 3611 | 88.79% |
| Genes in paralog clusters | 456 | 11.21% |
| Genes in COGs | 2440 | 60.00% |
| Genes in Pfam | 2748 | 67.57% |
| Genes in InterPro | 2953 | 72.61% |
| Genes with IMG Terms | 327 | 8.04% |
| Genes in IMG Pathways | 172 | 4.23% |
GC percentage shown as count of G's and C's divided by a total number of G's, C's, A's, and T's. This is not necessarily synonymous with the total number of bases.
Includes genes encoding proteins, RNA genes and pseudogenes.
Pseudogenes may also be counted as protein coding or RNA genes, so is not additive under total gene count.
See
Figure 1Schematic circular representation of the S. degradans 2-40 closed genome sequence.
The nucleotide sequence of the 5.06 Mb Sde2-40 genome was determined by the United States Department of Energy Joint Genome Institute. Outer ring, sequence address in nucleotides. Next pair of rings, location of the identified 4009 gene models on each coding strand of the genome and predicted function of products: replication and repair (red), energy metabolism (green), carbon and carbohydrate metabolism (blue), lipid metabolism (cyan), transcription (magenta), translation (yellow), amino acid metabolism (orange), metabolism of cofactors and vitamins (pink), purine and pyrimidine metabolism (light red), signal transduction (lavender), cellular processes (sky blue), structural RNA's (pale green) miscellaneous functions (brown), conserved and unique hypothetic proteins (light or dark grey). Next pair of rings, location of predicted or known carbohydrase genes. cellulase (red), chitinase (green), hemicellulase (blue, includes xylanases, arabinofuranases, mixed function glucanases), pectinase (orange), carbohydrate binding module protein (black), alginase (purple), agarase (yellow). Black ring, deviation from the average %( G+C), Inner ring, GC Skew (G−C)/(G+C)
Membrane-bound one-component transcriptional factors encoded adjacent to genes for complex carbohydrate degradation.
| Transcription factor (SDE locus | Number of TM regions | Adjacent genes(SDE Locus |
| HTH_AraC (0324) | 2 | Cellulase (0325) |
| HTH_AraC (3613) | 2 | β-1,4-xylanase (3612) |
| HTH_AraC (2491) | 6–7 | Cellulase (2490) |
| HTH_AraC (2495) | 6–7 | 1,4- β -glucosidase (2497) |
| HTH_AraC (2928) | 6–7 | Cellulase (2929) |
| HTH_AraC (3858) | 6–7 | Cellulose-binding protein; putative (3859) |
| HTH_LytTR (3422) | 4 | Cellulase (3420) |
Gene number having the prefix “Sde”, for Saccharophagus degradans as assigned in Jun 15, 2005 genome assembly.
TM, abbreviation for predicted transmembrane regions.
Figure 2Schematic organization of a typical Glycosyl Hydrolase catalyzing endo-cleavage of a polysaccharide shown within a generic cellulase system pathway.
Carbohydrate-binding modules (CBMs) specifically target enzymes to their substrates, initiating and maintaining prolonged contact with the insoluble polysaccharide. The catalytic module may be a glycosyl hydrolase (GH) polysachharide lysase (PL), glycosyl transferase or an esterase. The flexible linker affords the catalytic module a certain freedom of movement, which presumably allows it to adjust to conformational variations in the substrate while the CBM maintains contact with the substrate. Enzymes, representative of a typical cellulase system, are depicted depolymerizing a single cellulose chain. Exo-acting cellobiohydrolases and endoglucanase synergistically degrade polymeric cellulose to cellobiose and cellodextrins, respectively. At least part of the synergism is believed to result from the activity of endoglucanases creating additional ends for exoglucanases to act upon. Cellodextrins (soluble cello-oligomers) may be further processed to glucose and cellobiose by cellodextrinases. Depending on the organism cellobiose may be cleaved extracellularly by β-glucosidases (cellobiases) and imported as glucose, or imported directly and cleaved in the cytoplasm. Import generally occurs through phosphotransferase transport systems, resulting in cytoplasmic Glucose-6-Phosphate (G6P) and phosphorylated cellobiose. Certain organisms, such as Clostridium thermocellum, are also capable of importing cellodextrins for cytoplasmic cleavage. Systems that degrade other complex polysaccharides (e.g. chitin) share many of the features depicted for cellulose degradation, i.e. endo- and exo-acting enzymes and polymer-specific CBMs; however, there are substrate-specific variations in enzymatic composition, to include enzymes dedicated to the removal of side-chains such as xylose and/or arabinose oligomers or substituent groups, which may include acetate, sulfate and methyl, among others.
Figure 3Oligimers of six major types of complex polysaccharides (Roman numerals), their component sugars and enzymes (Capital letters) that act on specific bonds within the molecules.
The figure is keyed to Table 5, Enzyme activity induced by sole carbon complex polysaccharides. Sugars are color coded and keyed in box.
Enzyme activity induced by sole carbon complex polysaccharides (CP).
| Growth medium sole Carbon Source substrate (inducer) | ||||||
| Activity vs. |
| Avicel | Xylan | β-glucan | Laminarin | Glucose |
| CP (see | I-VI | I | II | III | III | N/A |
| Avicel | 4.8 | 10.5 | 93 | 0 | 0 | 0 |
| CMC | 58.2 | 218 | 117.9 | 95.5 | 59.6 | 7.2 |
| PASC | 51.1 | 254.2 | 118.3 | 0 | 37.5 | 0 |
| Xylan | 29.8 | 111.1 | 267.2 | 0 | 0 | 0 |
| β-glucan | 33.7 | 157.6 | 169 | 203.2 | 159.3 | 0 |
| Laminarin | 28.8 | 72.5 | 101.4 | 164.3 | 295.1 | 50.2 |
| Carbohydrases required to degrade substrate(keyed to | cellulases (I:ABC) xylanases (II:ABCDE) β-glucanases (III:AB) mannanases (IV: AB) pectinases (V:ABC VI:ABCDE) | Cellulases (I:ABC) | xylanases (II:ABCDE) | β-1,3 and β-1,4-glucosidases (III:AB) | β-1,3-glucosidases (IIIA) | Glycolysis, TCA cycle |
Spartina alterniflora (saltmarsh cord grass), found in intertidal wetlands, has a cell wall with approx. 10% lignin, with the remainder being hemicellulose, cellulose, and pectin
Purified cellulose, ∼70% crystallinity
Birchwood xylan, glucuronoarabinoxylan
Barley β-glucan, mixed β-1,3- and β-1,4-glucan
Laminaria digitata laminarin, β-1,3-glucan
Carboxymethyl cellulose, 100% amorphous
Phosphoric acid swollen cellulose, intermediate crystallinity between Avicel and CMC
Genomically predicted laminarinases.
| Name | Locus Tag | Predicted function | Modules | amino acids | MW |
| Lam16A | 1393 | β-1,3-glucanase | GH16/CBM6/CBM6/TSP3/TSP3/TSP3/TSP3/COG3488 | 1,707 | 163.3 |
| Lam16B | 2927 | β-1,3-glucanase | GH16/CBM6/CBM6/EPR(56)/CBM32/CBM32 | 1,441 | 158.6 |
| Lam16C | 1444 | β-1,3-glucanase | GH16/CBM4/CBM32/CBM32 | 1,184 | 129.1 |
| Lam16D | 3021 | β-1,3-glucanase | GH16/CBM32/PSL(48)/TMR | 722 | 77.7 |
| Lam16E | 0652 | β-1,3-glucanase | CBM6/CBM6/GH16 | 569 | 61.4 |
| Lam16F | 3121 | β-1,3-glucanase | LPB/GH16 | 742 | 80.2 |
| Lam16G | 2832 | catalytic residues missing | LPB/GH16/CBM6/CBM6 | 877 | 94.2 |
| Lam81A | 2834 | β-1,3-glucanase | LPB/CAD/GH81/FN3/FN3 | 1,238 | 133.1 |
Gene number having the prefix “Sde”, for Saccharophagus degradans as assigned in Jun 15, 2005 genome assembly. (http://genome.ornl.gov/microbial/mdeg/15jun05/mdeg.html).
Predictions of function and module determination by CAZy ModO at AFMB-CNRS.
Module abbreviations: CAD, cadherin-like domain; CBM, carbohydrate binding module; COG3488, thiol-oxidoreductase like domain; EPR, glutamic acid-proline rich region; FN3, fibronectin type 3 module; GH, glycoside hydrolase; LPB, lipobox signature sequence; PSL, polyserine linker; TMR, transmembrane region; TSP3, thrombospondin type 3 repeat.
MW and amino acid count calculated using the protParam tool at http://us.expasy.org/tools/ based on DOE/JGI gene model amino acid sequence translations including any predicted signal peptide.
Substrate-specific induction of selected carbohydrase genes.
| Gene | Relative Transcript Levels after Growth on | |||||
| Agar | Alginate | Avicel | Chitin | Xylan | Glucose | |
|
| 284±46 | 4±1 | 2±0.5 | 22±3 | 88±22 | 0.5±0.1 |
|
| 1±0.5 | 31±1 | 0.5±0.1 | 2±0.5 | 10±0.5 | 1±0.1 |
|
| 6±2 | 3±0.5 | 778±200 | 201±57 | 158±50 | 0.5±0.1 |
|
| 1±0.5 | 4±1 | 3±0.5 | 1749±146 | 102±20 | 2±0.5 |
|
| 9±2 | 3±1 | 292±50 | 1057±10 | 1349±191 | 0.5±0.1 |
aga16B, Sde_1175; algF, Sde_2873; cel5H, Sde_3237; chi18A, Sde_1704; xyn11A, Sde_0701
S. degradans 2-40 was grown in minimal medium supplemented by 0.2% glucose to an OD600 of 0.33–0.35. The cells were harvested and transferred into fresh medium containing 0.2% xylan, chitin, alginic acid, Avicel, or glucose or 0.1% agar as indicated. After 4 hr, total RNA was extracted, converted to a cDNA copy and transcript levels relative to initial levels estimated by qRT-PCR using gene-specific primer pairs as described in the Materials and Methods. Transcript levels were normalized to guanylate kinase (Sde_3695) transcript levels.
Examples of carbohydrases and CBM proteins detected in S. degradans supernatants by mass spectrometry.
| Growth Substrate | Name | Predicted function | Modules | Locus Tag | amino acids | MW |
| Avicel | Cel5H | endocellulase | GH5/PSL(32)/CBM6/EPR(16) | 3237 | 630 | 66.9 |
| Cel5I | endocellulase | CBM2/PSL(33)/CBM10/PSL(58)/GH5 | 3420 | 725 | 77.2 | |
| Cel9B | endocellulase | GH9/PSL(54)/CBM10/PSL(50)/CBM2 | 0649 | 867 | 89.5 | |
| Ced3A | cellodextrinase | LPB/GH3 | 2497 | 1,072 | 116.0 | |
| Xyl3A | β-xylosidase | LPB/GH3 | 1487 | 893 | 97.6 | |
| Cbm2B | cbm only | CBM2/UNK(914) | 1183 | 1,042 | 112.1 | |
| Xylan | Xyn10E | β-xylanase | LPB/EPR(47)/GH10 | 0323 | 670 | 75.2 |
| Xyl3A | β-xylosidase | LPB/GH3 | 1487 | 893 | 97.6 | |
| Xyl31A | α-xylosidase | LPB/GH31 | 2500 | 973 | 110.2 | |
| Ced3A | cellodextrinase | LPB/GH3 | 2497 | 1,072 | 116.0 | |
| Ced3B | cellodextrinase | LPB/GH3 | 0245 | 862 | 92.9 | |
| Cep94B | cellodextrin phosphorylase | GH94 | 0906 | 788 | 88.7 | |
| Gly3D | β-glycosidase | CBM32/CBM32/CBM32/GH3/CBM32 | 0475 | 1,581 | 173.0 | |
| Cbm2C | cbm only | CBM2/PSL(58)/Y94/PSL(25)/UNK(577) | 0182 | 933 | 97.5 | |
| Cbm32A | cbm only | CBM32/CBM32/UNK(251) | 0478 | 1,028 | 111.9 |
Protein was detected in supernatants of cultures grown in the following growth substrates: Avicel (∼70% crystalline cellulose), xylan (Birchwood glucuronoarabinoxylan).
Predictions of function and module determination utilizing the routines used for the updates of the CAZy database (www.cazy.org/CAZY/).
Module abbreviations: CBM, carbohydrate binding module; UNK, unknown function; PSL, polyserine linker; LPB, lipobox signature sequence; GH, glycosyl hydrolase; EPR, glutamate-proline rich region.
Gene number having the prefix “Sde”, for Saccharophagus degradans as assigned in Jun 15, 2005 genome assembly
MW and amino acid count calculated using the protParam tool at http://us.expasy.org/tools/ based on DOE/JGI gene model amino acid sequence translations including any predicted signal peptide.