| Literature DB >> 27222666 |
Yu Xia1, Yubo Wang1, Yi Wang2, Francis Y L Chin3, Tong Zhang4.
Abstract
BACKGROUND: The Anaerolineae lineage of Chloroflexi had been identified as one of the core microbial populations in anaerobic digesters; however, the ecological role of the Anaerolineae remains uncertain due to the scarcity of isolates and annotated genome sequences. Our previous metatranscriptional analysis revealed this prevalent population that showed minimum involvement in the main pathways of cellulose hydrolysis and subsequent methanogenesis in the thermophilic cellulose fermentative consortium (TCF).Entities:
Keywords: Aggregation; Anaerolinales; Anaerolineae; Cellular adhesion; Cellulolytic capacity; Chloroflexi
Year: 2016 PMID: 27222666 PMCID: PMC4877987 DOI: 10.1186/s13068-016-0524-z
Source DB: PubMed Journal: Biotechnol Biofuels ISSN: 1754-6834 Impact factor: 6.040
Fig. 1Two-dimensional coverage plot of the scaffolds assembled from the short-term enrichment (SE) and long-term enrichment (LE) metagenomes. Size of the circle is scaled by the square root of the length of scaffold. And scaffolds are colored according to their consensus taxonomic annotation at Phylum level. Only scaffolds longer than 10 kb are depicted. Diagonal line in blue represents stable coverage between two metagenomes. Scaffolds clustered at close coverage represent potential genome bins and were labeled according to the bin name. The relative abundance of the five retrieved curated genome bins in the SE and LE metagenomes is listed in the table. The relative abundance of each genome bin is estimated as the number of reads mapped to the draft genome in percentage of the total number of reads in the metagenome
Fig. 2Phylogenetic analysis of the five draft genomes within Chloroflexi phylum. Neighbor-joining phylogenetic tree based on 16S rRNA gene. The tree is produced with neighbor-joining analysis based on ClustalW alignment. 16S rRNA gene sequences of Thermotogae are used to root the tree. Bootstrap value is obtained with the maximum-composite-likelihood methods based on 1000 replicates. Bootstrap values greater than 50 % are indicated at branch points. Branch labels are colored according to their categories (1) our five draft genomes in red; (2) complete genomes in purple; (3) isolated strains are in green
Fig. 3Phylogenetic analysis of the five draft genomes within Chloroflexi phylum. Maximum-likelihood tree based on concatenated alignment of 35 essential single-copy genes (ESCGs) conserved in a single-copy manner among five curated genomes retrieved and twenty-two finished genomes within Chloroflexi phylum. Default protein model of Phyml 3.1 is used to construct the tree with 100 bootstraps based on MUSCLE alignment. Bootstrap values greater than 50 % are indicated at branch points. Branch labels are colored according to their categories (1) our five draft genomes in red; (2) complete genomes in purple; (3) isolated strains are in green
Genomic information of the five Anaerolinea genomes retrieved from thermophilic cellulose-degrading metagenomes
| General information | TCF-2 | TCF-5 | TCF-12 | TCF-8 | TCF-13 |
|
|---|---|---|---|---|---|---|
| IMG Genome ID | 2561511051 | 2561511052 | 2561511055 | 2561511056 | 2561511053 | N/A |
| Total length (Mb) | 3.8 | 3.0 | 3.7 | 4.1 | 4.0 | 3.5 |
| Scaffolds (n) | 55 | 27 | 69 | 51 | 153 | 1 |
| Average sequence length (bp) | 71,706 | 11,837 | 56,472 | 84,553 | 27,370 | N/A |
| GC (%) | 54 | 55 | 53 | 64 | 65 | 54 |
| Estimated completeness based on ESCGa (%) | 99.1 | 98.1 | 100.0 | 99.1 | 100.0 | 99.1 |
| Estimated redundancy of ESCGs (%) | 3.8 | 1.9 | 3.7 | 2.8 | 3.7 | 2.8 |
| Genes | 3602 | 3005 | 3580 | 3681 | 3659 | 3224 |
| Protein-coding genes | 3550 | 2949 | 3523 | 3634 | 3609 | 3166 |
| rRNA genes (5S/16S/23S) | 2/2/0 | 1/1/1 | 1/1/1 | 3/1/0 | 1/1/0 | 2/2/2 |
| tRNA gene (n) | 43 | 45 | 48 | 41 | 43 | 50 |
| Genes with functional prediction (%) | 81.4 | 83.2 | 80.4 | 80.8 | 80.5 | 53.7 |
| Genes with transcription (%) | 27.8 | 35.4 | 18.1 | 1.8 | 2.8 | N/A |
| Duplicated AAb (n/ %) | 814 (22.9 %) | 734 (24.9 %) | 921 (26.1 %) | 932 (10.8 %) | 893 (24.7 %) | 757 (23.9 %) |
N/A data not available
a Completeness estimation based on 107 conserved single-copy genes, named as essential single-copy genes (ESCGs) of >95 % complete bacterial genomes [13]
b Putative duplication among amino acid (AA) sequences within each chromosome (based on BLASTP bitscore ≥70, similarity ≥30 over at least 70 % of the query length [66])
Fig. 4Glycoside hydrolase (GH) families involved in lignocellulose hydrolysis in five curated genomes of Anaerolineae. Relative abundance of GH families is measured in terms of RPKM-DNA. GH families showed transcriptional activities are indicated with asterisk. GH families are categorized according to the classification proposed by Pope et al. [67]
Fig. 5Arrangement of the pili cluster on the genome of the five curated genomes and A. thermophila UNI-1. The name and length of scaffold carrying the pili gene cluster is listed to the left of the gene arrangement. Genes within the pili cluster are colored according to their homologies that genes show bidirectional best blast match to each other are in the same color. PilA pilus assembly protein Flp/PilA, CpaB/E/F pilus assembly protein CpaB/E/F, TadB/C tight adherence protein B/C
Fig. 6Composition of the bacterial community during attachment. Only prevalent populations taking >1 % of the community are shown
Transcriptional activities of genes potentially involved in interspecies electron transfer for syntrophic metabolism of the five curated Anaerolineae genomes
| Interspecies electron transfer | ||||
|---|---|---|---|---|
| DIET | H2/Formate | |||
| PilAa | CytCa | Hyda | FDHa | |
| TCF-2 | 984.5 | N/A | 0.0 | 0.3 |
| TCF-5 | 261.5 | N/A | 1.6 | 0.2 |
| TCF-12 | 1651.6 | N/A | 0 | 0 |
| TCF-8 | 0 | N/A | 0 | 0 |
| TCF-13 | 0 | N/A | 0 | 0 |
MRPKM values were used to evaluate the transcriptional activities of different genes. For pilA gene, only the transcription of pilA assembly conserved in all the retrieved genomes as shown in Fig. 5 was included
N/A the target gene is absent from the curated genomes, pilA pilus assembly protein Flp/PilA, CytC c-type cytochrome, MtrC/OmcA family, Hyd hydrogenase, FDH formate dehydrogenase
aGene identification is based on the integration of COGs, TIGRFAMs, and KEGG KO orthology as listed in Additional file 1: Table S9