| Literature DB >> 22646801 |
Bareket Dassa1, Ilya Borovok, Raphael Lamed, Bernard Henrissat, Pedro Coutinho, Christopher L Hemme, Yue Huang, Jizhong Zhou, Edward A Bayer.
Abstract
BACKGROUND: Microbial degradation of plant cell walls and its conversion to sugars and other byproducts is a key step in the carbon cycle on Earth. In order to process heterogeneous plant-derived biomass, specialized anaerobic bacteria use an elaborate multi-enzyme cellulosome complex to synergistically deconstruct cellulosic substrates. The cellulosome was first discovered in the cellulolytic thermophile, Clostridium thermocellum, and much of our knowledge of this intriguing type of protein composite is based on the cellulosome of this environmentally and biotechnologically important bacterium. The recently sequenced genome of the cellulolytic mesophile, Acetivibrio cellulolyticus, allows detailed comparison of the cellulosomes of these two select cellulosome-producing bacteria.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22646801 PMCID: PMC3413522 DOI: 10.1186/1471-2164-13-210
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Modular architecture of the array of scaffoldins identified in the CD2 genome and their homologs from ATCC 27405. Putative A. cellulolyticus scaffoldins were identified bioinformatically (see Materials and Methods for their accession numbers). Binding specificities of the indicated (black spots) cohesin and dockerin modules were determined previously [17-19]. The sca gene cluster is framed in a shaded box. All proteins have an N-terminal signal peptide except for ScaI. Acronyms: GH9, family-9 glycoside hydrolase; CBM(n), carbohydrate-binding module (family number); Cu, Copper amine oxidase; FN3, Fibronectin type III domain; Peptidase, S8 subtilisin-like peptidase; PPC, bacterial pre-peptidase C-terminal domain; Rhs, Rhs repeat domain. Accession numbers of the A. cellulolyticus scaffoldins are: [GenBank: ZP_09464033-30 (ScaA-D), ZP_09465494 (ScaE), ZP_09464236 (ScaF), ZP_09464788 (ScaG), ZP_09462752 (ScaH), ZP_09463446 (ScaI), ZP_09462222 (ScaJ), ZP_09464725 (ScaK), ZP_09464968 (ScaL), ZP_09463433 (ScaM), ZP_09463827 (ScaN), ZP_09462124 (ScaO), ZP_09461865 (ScaP)]. Accession numbers of the C. thermocellum scaffoldins are: [GenBank: CAA47840 (CipA), YP_001039467 (OlpB), ABN54275 (Orf2p), YP_001039469 (OlpA), YP_001037164 (Cthe_0736), YP_001037732 (SdbA), YP_001036883 (OlpC) and YP_001037163 (Cthe_0735)]
Figure 2Relationship of all cohesin modules from and . Sequence-based dendrogram of cohesin modules from A. cellulolyticus (red) and C. thermocellum (blue). See scheme and key in Figure 1. Only significant bootstrap values are shown
Figure 3Sequence conservation pattern of dockerin modules. The two internal dockerin repeats of A. cellulolyticus (based on 137 sequences) and C. thermocellum (71 sequences) are represented by sequence logos. Positions of calcium binding residues are shown in cyan, and putative recognition residues are shown in yellow
Comparative distribution of dockerin-containing CAZyme modules invs.
| Genome-wide | 2 | 1 | 3 | 16 | 4 | 21 | 4 | 1 | 3 | 1 | 1 | 5 | 2 | 2 | 5 | 3 | — | 4 | 1 | 2 | — | 1 | 1 | 1 | 1 | 1 | 3 | 1 | 1 | 1 | 92 | |
| Dockerin-containing proteins | — | 1 | — | 12 | 3 | 19 | 4 | 1 | — | — | 1 | — | — | — | 4 | 4 | — | 4 | 1 | 1 | — | 1 | 1 | 1 | — | 1 | — | 1 | 1 | 1 | 62 | |
| Genome-wide | 2 | 1 | 2 | 10 | 1 | 16 | 6 | 1 | 2 | 1 | 2 | 4 | — | 2 | 3 | 2 | 1 | 6 | 1 | 2 | 1 | 1 | — | 1 | — | 1 | 3 | — | — | 1 | 73 | |
| Dockerin-containing proteins | — | 1 | — | 8 | 1 | 15 | 3 | 1 | — | — | 1 | 1 | — | — | 3 | 2 | 1 | 5 | 1 | 1 | — | 1 | — | 1 | — | 1 | — | — | — | 1 | 48 | |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |||||
| Genome-wide | 1 | 1 | 3 | | | | | | | | | | | | | | | | | | | | | | | | | | | | 5 | |
| Dockerin-containing proteins | 1 | 1 | 3 | | | | | | | | | | | | | | | | | | | | | | | | | | | | 5 | |
| Genome-wide | 2 | 1 | 1 | | | | | | | | | | | | | | | | | | | | | | | | | | | | 4 | |
| Dockerin-containing proteins | 2 | 1 | 1 | | | | | | | | | | | | | | | | | | | | | | | | | | | | 4 | |
| | | | | | | | | | | | | | | | | | | | | | ||||||||||||
| Genome-wide | 2 | 1 | 3 | 4 | 1 | — | 1 | 1 | 7 | — | | | | | | | | | | | | | | | | | | | | | 20 | |
| Dockerin-containing proteins | 2 | 1 | 3 | 1 | 1 | — | 1 | — | 4 (6) | — | | | | | | | | | | | | | | | | | | | | | 13 | |
| Genome-wide | 3 | 1 | 2 | 3 | — | 1 | 1 | 1 | 2 | 1 | | | | | | | | | | | | | | | | | | | | | 15 | |
| Dockerin-containing proteins | 3 | 1 | 1 (2) | 1 | — | — | 1 | — | 1 (2) | — | 8 | |||||||||||||||||||||
Numbers represent proteins which contain one or more modules of the different protein families (glycoside hydrolases, polysaccharide lyases and carbohydrate esterases) as were identified by CAZy. The number of proteins are compared between cellulosomal and non-cellulosomal (genome-wide) proteins. Data are provided for both species. Numbers of modules which appear more than once in the same protein are shown parenthetically.
Summary of protein modules in cellulosomal proteins
| 41 | 143 (5) | 74 | 46 | 53 | 13 | 36 | |
| 29 | 73 (3) | 51 | 25 | 47 | 9 | 3 | |
a Numbers in parenthesis indicate the number of X60-dockerin modular pairs in the given species.
b Catalytic modules, such as: GH, PL, CE, and CBMs according to CAZy (http://www.cazy.org/).
c Structural domains are defined in Pfam (http://pfam.sanger.ac.uk/), such as: FN3, GDSL, SNGH, CotH, TRX-like, Kelch-like, SLH, RshA, LRR, PKD.
d Others include Pfam domains such as peptidases, serpins, DUF303, DUF1565, DUF3237.
Genome-wide co-occurrence of CBMs together with either GH or CE modules invs.
| 3/1 | | 2/0 | | 0/1 | 1/0 | | | | | 1/1 | | | | | | | | 1/0 | |
| 19/10 | 0/2 | | | | | | | | 0/1 | | | | | 1/0 | | | | | |
| | | 2/1 | 2/2 | | | 4/5 | | | | | | | | | | | | | |
| | | 1/1 | | | | | | | | | | | | | | | | | |
| | | | | | | | | | | | 0/1 | | | | 2/1 | | | | |
| | 0/4 | | | | | | | | | | | | | | | | 0/1 | | |
| 1/0 | | | | | | | | | | | | | | | | 2/2 | | | |
| | | | | 0/1 | | | 1/0 | 1/0 | | | | 3/2 | | | | | | | |
| | | 1/1 | | | | | | | | | | | 0/1 | | | | | | |
| | | | | | | | | | | | | 0/2 | | | | | | | |
| | | 2/3 | | | 1/1 | | | | | | | | 0/3 | | | | | | |
| | | | | | | | | | 0/1 | | | | | 1/1 | | | | | |
| 1/1 | | | | | | | | | | | | | | | | | | | |
| | | 2/1 | | | | | | | | | | | | | | | | | |
| | | 1/1 | | | | | | | | | | | | | | | | | |
| | | 1/0 | | | 1/0 | | | | | | | | | | | | | 1/0 | |
| 4/2 |
The number of proteins with the combination of the specified modules is noted in the genomes of A. cellulolyticus (left) vs. C. thermocellum (right).
Multifunctional proteins invs.
| ZP_09464944 | Cthe_2972; | ||
| ZP_09465691 | Cthe_2179 | ||
| ZP_09463564 | Cthe_3141 | ||
| ZP_09465667 | | | |
| ZP_09463297 | CBM30- | Cthe_0624; | |
| ZP_09464730 | CBM22- | Cthe_0912; | |
| ZP_09465552 | Cthe_1472; | ||
| | | Cthe_2139 | |
| | | Cthe_0798 | |
| ZP_09464448 | | ||
| ZP_09465738 | | ||
| GT84- | ZP_09462312 | ||
Domain architectures and their corresponding accession numbers or names are listed above. Catalytic modules are marked in bold. GH, glycoside hydrolase; PL, polysaccharide lyase; CE, carbohydrate esterase; CBM, carbohydrate-binding module; GT, glycosyl transferase; Doc, dockerin; numbers indicate family of the indicated module.
Putative σand anti-σregulatory factors inCD2
| ZP_09464729 | ZP_09464728 | CBM3 | Cthe_0403 |
| ZP_09464331 | ZP_09464330 | CBM3 | Cthe_0058 |
| ZP_09466014 | ZP_09466013 | CBM3 | Cthe_0268 |
| ZP_09463653 | ZP_09463652 | CBM3 | Cthe_0058 |
| ZP_09461804 | ZP_09461805 | CBM42 | Cthe_1272 |
| ZP_09463236 | ZP_09463235 | PA14, CBM35 | Cthe_0315 |
| ZP_09464238 | ZP_09464237 | PA14, PA14 | Cthe_1272 |
| ZP_09464575 | ZP_09464574 | unknown | Cthe_0403 |
| ZP_09464240 | ZP_09464239 | S1/S6 peptidase | Cthe_0058 |
| ZP_09463889 | ZP_09463888 | unknown | Cthe_2521 |
| ZP_09466630 | ZP_09466631 | unknown | Cthe_2974 |
| ZP_09465751 | ZP_09465752 | unknown | Cthe_2974 |