| Literature DB >> 24710093 |
Jennifer L Edwards1, Darren L Smith2, John Connolly3, James E McDonald4, Michael J Cox5, Ian Joint6, Clive Edwards7, Alan J McCarthy8.
Abstract
Polysaccharides are an important source of organic carbon in the marine environment and degradation of the insoluble and globally abundant cellulose is a major component of the marine carbon cycle. Although a number of species of cultured bacteria are known to degrade crystalline cellulose, little is known of the polysaccharide hydrolases expressed by cellulose-degrading microbial communities, particularly in the marine environment. Next generation 454 Pyrosequencing was applied to analyze the microbial community that colonizes and degrades insoluble polysaccharides in situ in the Irish Sea. The bioinformatics tool MG-RAST was used to examine the randomly sampled data for taxonomic markers and functional genes, and showed that the community was dominated by members of the Gammaproteobacteria and Bacteroidetes. Furthermore, the identification of 211 gene sequences matched to a custom-made database comprising the members of nine glycoside hydrolase families revealed an extensive repertoire of functional genes predicted to be involved in cellulose utilization. This demonstrates that the use of an in situ cellulose baiting method yielded a marine microbial metagenome considerably enriched in functional genes involved in polysaccharide degradation. The research reported here is the first designed to specifically address the bacterial communities that colonize and degrade cellulose in the marine environment and to evaluate the glycoside hydrolase (cellulase and chitinase) gene repertoire of that community, in the absence of the biases associated with PCR-based molecular techniques.Entities:
Year: 2010 PMID: 24710093 PMCID: PMC3966224 DOI: 10.3390/genes1030371
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Identification of 16S rRNA genes in the Irish Sea cellulose biofilm DNA 454 pyrosequenced dataset by the Greengenes 16S rRNA gene database.
| Sequence ID | Alignment Length | E-value | % Identity | Bit Score | Fragment (Start - End) | Taxonomy Assignment | Best Hit ID |
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| contig16635 | 85 | 1.00E-40 | 100 | 168 | 1 - 85 | Sulfitobacter | 165917 |
| contig26360 | 150 | 2.51E-51 | 93 | 202 | 1 - 149 | Unclassified Rhodobacteraceae | 142124 |
| contig26572 | 162 | 7.94E-70 | 96 | 264 | 4 - 163 | Unclassified Rhodobacteraceae | 70710 |
| contig26707 | 130 | 2.51E-63 | 99 | 242 | 1 - 130 | Unclassified Rhodobacteraceae | 113926 |
| contig25574 | 235 | 2.51E-130 | 100 | 466 | 50 - 284 | Glaciecola | 108683 |
| contig26573 | 526 | 0.00E+00 | 99 | 1003 | 1 - 526 | Glaciecola | 80428 |
| contig26860 | 151 | 1.26E-80 | 100 | 299 | 1 - 151 | Glaciecola | 170839 |
| contig26820 | 696 | 0.00E+00 | 93 | 1015 | 6 - 701 | Teredinibacter | 144812 |
| contig00070 | 247 | 1.26E-81 | 93 | 305 | 738 - 983 | Cellvibrio | 98921 |
| contig12909 | 577 | 0.00E-00 | 93 | 858 | 209 - 784 | Unclassified Gammaproteobacteria | 151615 |
|
|
|
|
|
|
|
|
|
| contig00061 | 892 | 0.00E+00 | 90 | 1094 | 1988 - 2879 | Roseivirga | 102384 |
| contig26228 | 139 | 2.51E-66 | 98 | 252 | 1 - 138 | Flavobacterium | 154970 |
| contig26430 | 241 | 5.01E-130 | 99 | 464 | 1 - 241 | Ulvibacter | 80102 |
| contig26765 | 436 | 5.01E-140 | 89 | 500 | 60 - 495 | Unclassified Cytophagaceae | 2577 |
Best Hit is the ProkMSA Greengenes reference ID.
Figure 1Summary of Phylum level taxonomic assignment of 14,020 assembled contigs matching protein encoding genes (PEGs).
Figure 2Taxonomic assignment of 13,312 sequence contigs matching Protein encoding genes (PEGs) of the Proteobacteria, the Bacteroidetes. The percentage of contigs matching PEGs assigned at the Class level using the SEED MG-RAST (Metagenome Rapid Annotation Using Subsystem Technology are shown for (A) Proteobacteria (8847; 61% of assignments to the Bacteria), (B) Bacteroidetes (4465; 32% of assignments to Bacteria). Values are shown as percentages of the total contigs assigned to each class within the two Phyla.
Summary of the number of 454 sequencing contigs matching each GH family.
| CAZy family | Pfam ID | Number of hits |
|---|---|---|
| GH5 | PF00150 | 56 |
| GH6 | PF01341 | 5 |
| GH7 | PF00840 | 0 |
| GH8 | PF01270 | 40 |
| GH9 | PF00759 | 30 |
| GH12 | PF01670 | 1 |
| GH16 | PF00722 | 64 |
| GH18 | PF00704 | 10 |
| GH19 | PF00182 | 3 |
| GH45 | PF02015 | 0 |
| GH48 | PF02011 | 2 |
| GH61 | PF03443 | 0 |
Searches were performed using Blastx for all 26,859 assembled contigs against the customized library of downloaded GH families using an E value cut-off of 1 × 10-5 Pfam ID is the protein family [36], CAZy (carbohydrate active enzymes database family [10]. A comprehensive list of all hits, values can be found in Supplementary Table 1.
Figure 3Scanning electron micrographs of the microbial community colonizing cellulose ‘bait’ in situ in the Irish Sea. (A) A cellulose fiber heavily colonized by a biofilm matrix. Rod-shaped bacteria can clearly be seen occupying grooves or pits (arrow) providing visible evidence of degradation by these cellulolytic bacteria; (B) a close image (bar = 2 µm) of a hollowed out region of a cellulose fiber with abundant rod-shaped bacteria; (C) rod-shaped bacteria arranged in rows around the cellulose surface with grooves visible where bacterial cells have formed pits on the surface; (D) a closer view of a cellulose fiber where rod shaped bacteria are located in grooves on the surface; (E) surface of cotton string heavily colonized by biofilm matrix showing extensive degradation of the cellulose surface, in comparison to (F) uncolonized cotton string.