| Literature DB >> 32552739 |
Yong Wang1, Jiao-Mei Huang2,3, Ying-Li Zhou2,3, Alexandre Almeida4,5, Robert D Finn4, Antoine Danchin6,7, Li-Sheng He2.
Abstract
BACKGROUND: The metabolic capacity, stress response and evolution of uncultured environmental Tenericutes have remained elusive, since previous studies have been largely focused on pathogenic species. In this study, we expanded analyses on Tenericutes lineages that inhabit various environments using a collection of 840 genomes.Entities:
Keywords: Autotrophy; Bacilli; Environmental Tenericutes; Gut microbiome; Pathogen
Year: 2020 PMID: 32552739 PMCID: PMC7301438 DOI: 10.1186/s12864-020-06807-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Phylogenetic trees of Tenericutes. The maximum-likelihood phylogenetic trees were constructed by concatenated conserved proteins (a) and 16S rRNA genes (b). The bootstrap values (> 50) are denoted by the dots on the branches. The colors of the inner layer indicate the positions of the different environmental lineages and groups of Tenericutes in the trees. Sources of the environmental lineages are shown as shapes in different colors in the outer layer
Fig. 2Phylogenetic positions of Tenericutes families in Bacilli. Representative genomes from orders of Bacilli were used to construct the phylogenomics tree using concatenated conserved proteins by IQ-TREE and RAxML. The bootstrap values were shown as triangles (50–90) and dots (> 90) with a red color for the results of RAxML and deep blue for those of IQ-TREE, respectively. The red clades represent the orders of Tenericutes. The Bacilli genomes for Erysipelotrichales and the other orders in purple were selected from GTDB. RFN20, RF39, ML615 were environmental clades named in GTDB and were phylogenetically placed within the NCBI clades consisting of human gut lineages 1, 2 and bioreactor group, respectively
Fig. 3Distribution of genes and pathways in the Tenericutes lineages. Tenericutes lineages were grouped using an agglomerative hierarchical clustering on the basis of the distribution of COGs within each group. The color and size of each dot represent the percentage of genomes within each lineage that carries the gene. The functions of these genes are shown in Additional file 3: Table S1
Representative genomes of RFN20 and RF39. RF39 (HG1) was represented by HG1.1 and HG1.2 from the Tenericutes downloaded from NCBI; RFN20 (HG2) was represented by HG2.1 and HG2.2. S. azabuensis was a species in Erysipetrichales
| ID | HG1.1 | HG1.2 | HG2.1 | HG2.2 | |
|---|---|---|---|---|---|
| Accession | UQAI01000000 | UQAG01000000 | UPZX01000000 | UQBB01000000 | JNKU00000000 |
| Genome size (bp) | 1,690,546 | 1,911,898 | 1,525,481 | 1,699,832 | 2,411,783 |
| %GC | 30 | 29.5 | 30.1 | 30.4 | 37.1 |
| No.contigs | 109 | 71 | 31 | 16 | 94 |
| %Complete | 98.7 | 98.7 | 98.9 | 98.5 | 99.1 |
| %Contaminant | 0 | 0 | 0 | 0 | 0.9 |
| No. tRNA | 38 | 35 | 34 | 45 | 57 |
| No. rRNA | 0 | 2 | 1 | 0 | 10 |
| %Coding density | 92 | 90.8 | 92.5 | 91.6 | 89 |
| No. CDSs | 1548 | 1834 | 1488 | 1570 | 2424 |
Fig. 4Schematic metabolism of RFN20 and RF39. We depicted the metabolic models based on gene annotation results of four representative genomes of RFN20 and RF39 (see Table 1). Solid squares indicate presence of the genes responsible for a step or a pathway. The products depicted in the MEP/DOXP pathway are 1-deoxy-xylulose 5-P, 2-C-methyl-D-erythritol 4-P, 4-(Cytidine 5′-PP)-2-C-methyl-erythritol, 2-P-4-(cytidine 5′-PP)-2-C-methyl-erythritol, 2-C-methyl-erythritol 2,4-PP, 1-hydroxy-2-methyl-2-butenyl 4-PP, dimethylallyl-PP, isopentenyl-PP, and farnesyl-PP