Grace Zhang1, Beile Gao2, Mobolaji Adeolu1, Bijendra Khadka1, Radhey S Gupta1.
Abstract
The order Bifidobacteriales comprises a diverse variety of species found in the gastrointestinal tract of humans and other animals, some of which are opportunistic pathogens, whereas a number of others exhibit health-promoting effects. However, very few biochemical or molecular characteristics are currently known that are specific for the order Bifidobacteriales, or for specific clades within this order, and that distinguish them from other bacteria. This study reports the results of detailed comparative genomic and phylogenetic studies on 62 genome-sequenced species/strains from the order Bifidobacteriales. In a robust phylogenetic tree for the Bifidobacteriales constructed based on 614 core proteins, a number of well-resolved clades were observed, including a clade separating the Scardovia-related genera (Scardovia clade) from the genera Bifidobacterium and Gardnerella, as well as a number of previously reported clusters of Bifidobacterium spp. In parallel, our comparative analyses of protein sequences from the Bifidobacteriales genomes have identified numerous molecular markers that are specific for this group of bacteria. Of these markers, 32 conserved signature indels (CSIs) in widely distributed proteins and 10 conserved signature proteins (CSPs) are distinctive characteristics of all sequenced Bifidobacteriales species and provide novel and highly specific means for distinguishing these bacteria. In addition, multiple other molecular signatures are specific for the following clades of Bifidobacteriales: (i) 5 CSIs specific for a clade comprising the Scardovia-related genera; (ii) 3 CSIs and 2 CSPs specific for a clade consisting of the Bifidobacterium and Gardnerella spp.; (iii) multiple other signatures demarcating a number of clusters of the B. asteroides- and B. longum-related species.
The described molecular markers provide novel and reliable means for distinguishing the Bifidobacteriales and a number of their clades in molecular terms and for the classification of these bacteria. The Bifidobacteriales-specific CSIs, found in important proteins, are predicted to modify the cellular functions of the affected proteins. Hence, biochemical studies on the cellular functions of these CSIs could lead to the discovery of novel characteristics of either all Bifidobacteriales or specific groups of bacteria within this order. Some of the functions affected or modified by these genetic changes could also be important for the probiotic or pathogenic activities of the bifidobacteria.
Keywords: Bifidobacterium asteroides-clade; Scardovia-clade; conserved signature indels; conserved signature proteins; molecular signatures for bifidobacteria; phylogeny; taxonomy
Year: 2016 PMID: 27446019 PMCID: PMC4921777 DOI: 10.3389/fmicb.2016.00978
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1. A maximum-likelihood tree based on concatenated sequences of 614 core proteins from 62 genome-sequenced members of the order Bifidobacteriales. The tree was rooted at the midpoint and SH-like support values are indicated at nodes. A number of different clades/clusters that are consistently observed in phylogenetic trees are marked.
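The concatenation step behind a tree of this kind can be sketched as follows. This is a minimal illustration and not the authors' pipeline: the species labels and sequences are invented, and `concatenate_alignments` is a hypothetical helper. It assumes each protein has already been aligned separately, and it pads any species missing from one alignment with gap characters so that all rows keep the same length.

```python
from typing import Dict, List


def concatenate_alignments(
    alignments: List[Dict[str, str]], gap: str = "-"
) -> Dict[str, str]:
    """Join per-protein alignments (species -> aligned sequence) end to end."""
    # Collect every species label seen in any of the alignments.
    species = sorted({name for aln in alignments for name in aln})
    combined: Dict[str, List[str]] = {name: [] for name in species}
    for aln in alignments:
        lengths = {len(seq) for seq in aln.values()}
        if len(lengths) != 1:
            raise ValueError("sequences within one alignment must have equal length")
        width = lengths.pop()
        for name in species:
            # A species absent from this protein alignment gets a run of gaps.
            combined[name].append(aln.get(name, gap * width))
    return {name: "".join(parts) for name, parts in combined.items()}


# Toy input (hypothetical labels and sequences, for illustration only).
protein_a = {"sp1": "MKT-A", "sp2": "MKTLA", "sp3": "MRT-A"}
protein_b = {"sp1": "GGV", "sp2": "GAV"}
supermatrix = concatenate_alignments([protein_a, protein_b])
for name, seq in supermatrix.items():
    print(name, seq)
```

The resulting concatenated alignment is what a maximum-likelihood program would take as input; tree inference and midpoint rooting are outside the scope of this sketch.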
Table 1. Characteristics of conserved signature indels that are specific for the order Bifidobacteriales.
| Protein name | GenBank identifier | Figure | Indel characteristics | Indel region |
| Elongation factor Tu | 38606895 | Figure | 4 aa ins | 106–144 |
| DNA topoisomerase I | 489904111 | Supplementary Figure | 1 aa del | 31–80 |
| DNA polymerase sliding clamp subunit | 408500301 | Supplementary Figure | 1 aa ins | 79–118 |
| Beta-galactosidase | 504834401 | Supplementary Figure | 1–2 aa ins | 371–423 |
| Ketol-acid reductoisomerase | 651881972 | Supplementary Figure | 2 aa del | 242–284 |
| Serine-pyruvate aminotransferase | 489903803 | Supplementary Figure | 2 aa ins | 74–119 |
| 50S ribosomal protein L21 | 489922190 | Supplementary Figure | 1 aa ins | 42–82 |
| Methionine aminopeptidase | 547078960 | Supplementary Figure | 1 aa ins | 34–70 |
| Bifunctional acetaldehyde-CoA/alcohol dehydrogenase | 500062906 | Supplementary Figure | 1 aa ins | 534–574 |
| Bifunctional acetaldehyde-CoA/alcohol dehydrogenase | 500062906 | Supplementary Figure | 1 aa ins | 809–845 |
| Formate acetyltransferase | 500063439 | Supplementary Figure | 2 aa ins | 367–416 |
| ATP synthase F0 subunit A | 547078870 | Supplementary Figure | 1 aa ins | 131–163 |
| Peptide chain release factor 1 | 489924412 | Supplementary Figure | 2 aa ins | 197–237 |
| Arginine ABC transporter ATP-binding protein | 489905014 | Supplementary Figure | 1 aa del | 224–280 |
| Transketolase | 489905793 | Supplementary Figure | 4 aa ins | 338–388 |
| Histidine kinase | 547084095 | Supplementary Figure | 1 aa ins | 362–405 |
| DNA repair ATPase | 489905284 | Supplementary Figure | 3 aa ins | 353–394 |
| N-acetyl-gamma-glutamyl-phosphate reductase | 547072106 | Supplementary Figure | 1 aa ins | 10–60 |
| Arginine biosynthesis bifunctional protein ArgJ | 547072098 | Supplementary Figure | 1 aa ins | 1–42 |
| Excinuclease ABC subunit C | 494111998 | Supplementary Figure | 1 aa ins | 103–150 |
| Cysteine desulfurase | 500063210 | Supplementary Figure | 4 aa ins | 54–105 |
| 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase | 489906135 | Supplementary Figure | 1 aa ins | 58–81 |
| Argininosuccinate lyase | 547072080 | Supplementary Figure | 5 aa ins | 405–454 |
| CarD family transcriptional regulator | 500063173 | Supplementary Figure | 1 aa ins | 30–79 |
| Acetyltransferase GNAT family | 547074268 | Supplementary Figure | 1 aa ins | 112–152 |
| Acetyltransferase GNAT family | 547074268 | Supplementary Figure | 2 aa ins | 112–152 |
| Signal recognition particle protein | 489904236 | Supplementary Figure | 1 aa ins | 70–110 |
| 50S ribosomal protein L13 | 489923970 | Supplementary Figure | 1 aa del | 51–90 |
| DNA gyrase B subunit protein | 547082727 | Supplementary Figure | 2 aa del | 637–686 |
| Hemolysin III | 489923478 | Supplementary Figure | 1 aa del | 171–216 |
| Pseudouridine synthase | 547071034 | Supplementary Figure | 1 aa ins | 56–95 |
| Guanylate kinase | 500063064 | Supplementary Figure | 4 aa ins | 85–124 |
| D-alanine–D-alanine ligase | 493336643 | Supplementary Figure | 2–7 aa ins | 202–244 |
The indel region indicates the region of the protein where the described CSI is present.
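How an indel of a given size and region might be located in an alignment can be sketched as below. This is a simplified illustration under stated assumptions, not the method used in the study: `find_group_specific_indels` is a hypothetical helper, the sequences are invented, gaps are written as `-`, and the check for conserved flanking regions that defines a true CSI is omitted.

```python
from typing import Dict, List, Tuple


def find_group_specific_indels(
    ingroup: Dict[str, str], outgroup: Dict[str, str], gap: str = "-"
) -> List[Tuple[int, int, str]]:
    """Return (start, end, kind) for runs of alignment columns where every
    ingroup sequence has a residue and every outgroup sequence has a gap
    ('ins'), or the reverse ('del'). Positions are 1-based and inclusive."""
    if not ingroup or not outgroup:
        raise ValueError("both groups must contain at least one sequence")
    seqs = list(ingroup.values()) + list(outgroup.values())
    width = len(seqs[0])
    if any(len(s) != width for s in seqs):
        raise ValueError("all sequences must be aligned to the same length")

    def column_kind(i: int):
        in_gaps = [s[i] == gap for s in ingroup.values()]
        out_gaps = [s[i] == gap for s in outgroup.values()]
        if not any(in_gaps) and all(out_gaps):
            return "ins"
        if all(in_gaps) and not any(out_gaps):
            return "del"
        return None

    runs: List[Tuple[int, int, str]] = []
    start, current = 0, None
    # Iterate one step past the end so a run touching the last column is closed.
    for i in range(width + 1):
        kind = column_kind(i) if i < width else None
        if kind != current:
            if current is not None:
                runs.append((start + 1, i, current))
            start, current = i, kind
    return runs


# Toy alignment (hypothetical sequences): the ingroup carries a 4-residue insert.
ingroup = {"in1": "MKVLAGDETTPRW", "in2": "MKVLSGDETTPRW"}
outgroup = {"out1": "MKVLA----TPRW", "out2": "MRVLA----TPRW"}
indels = find_group_specific_indels(ingroup, outgroup)
print(indels)
```

In practice a candidate found this way would still need to be checked for conserved flanking sequence and for its distribution across a much broader set of outgroup species.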
Figure 2. Partial sequence alignment of the protein synthesis elongation factor Tu showing a 4 aa insertion in a conserved region that is specific for members of the order Bifidobacteriales. The dashes in this alignment, as in all other alignments, show identity with the amino acid on the top line. The GenBank identification numbers of the protein sequences are shown, and the topmost numbers indicate the position of this sequence in the species shown on the top line. Due to space constraints, sequence information for different subspecies is not shown. However, unless otherwise indicated, these CSIs are present in the sequenced subspecies of B. longum, B. animalis, B. pseudolongum, and B. thermacidophilum. Information for a large number of other CSIs that are also specific for the order Bifidobacteriales is presented in Table 1 and Supplementary Figures S2–S32.
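The display convention described in this caption, where a dash marks identity with the top line, can be reproduced with a short sketch. The function name `render_identity_dashes` and the sequences are hypothetical, and showing alignment gaps as spaces is an assumption made here so that gaps are not confused with identity dashes.

```python
from typing import Dict


def render_identity_dashes(
    reference: str, others: Dict[str, str], gap: str = "-"
) -> Dict[str, str]:
    """Rewrite each aligned sequence relative to the reference (top line):
    identical residues become '-', differing residues are kept, and
    alignment gaps are shown as spaces."""
    rendered: Dict[str, str] = {}
    for name, seq in others.items():
        if len(seq) != len(reference):
            raise ValueError(f"{name} is not aligned to the reference length")
        chars = []
        for ref_char, char in zip(reference, seq):
            if char == gap:
                chars.append(" ")   # gap in the input alignment
            elif char == ref_char:
                chars.append("-")   # identical to the top line
            else:
                chars.append(char)  # substitution, shown explicitly
        rendered[name] = "".join(chars)
    return rendered


# Toy alignment (hypothetical sequences, gaps written as '-').
top_line = "MKVLAGDETTPRW"
aligned = {"in2": "MKVLSGDETTPRW", "out1": "MKVLA----TPRW"}
display = render_identity_dashes(top_line, aligned)
print(f"{'top':5s}{top_line}")
for name, row in display.items():
    print(f"{name:5s}{row}")
```

Rendered this way, an insertion shared by one group stands out as a block of residues on the top line sitting above blank space in the sequences that lack it.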
Conserved signature proteins that are uniquely found in the .
| 73 | Unknown, hypothetical | ||
| 275 | Unknown, hypothetical | ||
| 336 | Unknown, hypothetical | ||
| 228 | Unknown, hypothetical | ||
| 399 | Unknown, hypothetical | ||
| 201 | Unknown, hypothetical | ||
| 121 | Unknown, hypothetical | ||
| 84 | Unknown, hypothetical | ||
| 76 | Unknown, hypothetical | ||
| 321 | Unknown, hypothetical | ||
| 222 | Unknown, hypothetical | ||
| 299 | Unknown, hypothetical | ||
| 260 | Unknown, hypothetical | ||
| 283 | Unknown, hypothetical | ||
| 189 | Unknown, hypothetical | ||
| 152 | Unknown, hypothetical | ||
| 116 | Unknown, hypothetical | ||
| 283 | Unknown, hypothetical | ||
| 190 | Unknown, hypothetical | ||
| 300 | Unknown, hypothetical |
The species that are part of the B. asteroides clusters I, II, and III are indicated in Figure 1.
Figure 3. Partial sequence alignment of DNA polymerase IV showing a 1 aa insertion that is specific for the . Information for other CSIs specific for this clade is presented in Table 3 and Supplementary Figures S33–S35.
Table 3. Characteristics of conserved signature indels distinguishing a number of subgroups within the order Bifidobacteriales.
| Protein name | GenBank identifier | Figure | Indel characteristics | Indel region | Specificity |
| DNA polymerase IV | 489904486 | Figure | 1 aa ins | 88–125 | |
| Ribosomal RNA small subunit methyltransferase E | 547081721 | Supplementary Figure | 3 aa del | 118–160 | |
| GTP-binding protein YchF | 547055080 | Supplementary Figure | 1 aa ins | 309–354 | |
| Cytochrome C | 500062679 | Supplementary Figure | 3 aa del | 730–765 | |
| Triosephosphate isomerase | 651360171 | Figure | 1 aa ins | 251–286 | Scardovia clade |
| FHA domain protein | 493335662 | Supplementary Figure | 1 aa ins | 37–67 | Scardovia clade |
| Glycosyl transferase | 648490110 | Supplementary Figure | 2 aa ins | 23–67 | Scardovia clade |
| PAC2 family protein | 294458767 | Supplementary Figure | 2 aa ins | 32–77 | Scardovia clade |
| Phosphate ABC transporter substrate-binding protein | 493336671 | Supplementary Figure | 2 aa ins | 167–206 | Scardovia clade |
| Phosphogluconate dehydrogenase | 497766884 | Figure | 1 aa ins | 360–401 | |
| PhoU family transcriptional regulator | 489926631 | Supplementary Figure | 2 aa del | 159–190 | |
| Cystathionine gamma-synthase | 494112910 | Supplementary Figure | 2 aa ins | 262–302 | |
| Transketolase | 489905793 | Supplementary Figure | 1 aa ins | 234–274 | |
| Purine biosynthesis protein purH | 658453400 | Figure | 1 aa ins | 247–278 | |
| Shikimate dehydrogenase | 658453363 | Supplementary Figure | 1 aa ins | 264–301 | |
| 5-methyltetrahydropteroyltriglutamate–homocysteine methyltransferase | 504834759 | Supplementary Figure | 1 aa ins | 336–369 | |
| ABC transporter substrate-binding protein | 504835116 | Supplementary Figure | 1 aa del | 253–286 | |
| 5'-methylthioadenosine nucleosidase | 504835309 | Figure | 3 aa ins | 1–33 | |
| Peptide ABC transporter ATP-binding protein | 504834913 | Supplementary Figure | 20 aa ins | 76–127 | |
| N-acetyl-gamma-glutamyl-phosphate reductase | 504834965 | Supplementary Figure | 1 aa ins | 34–74 |
The B. asteroides-related clusters I, II, and IV are demarcated in Figure 1.
Figure 4. Example of a 1 aa conserved signature indel in the protein triosephosphate isomerase that is specific for the Scardovia clade. Information for other CSIs specific for this clade is presented in Table 3 and Supplementary Figures S36–S39.
Figure 5. Partial sequence alignment of phosphogluconate dehydrogenase showing a 1 aa insertion that is specific for the .
Figure 6. Conserved signature indels that are specific for clusters of B. asteroides-related species. (A) Partial sequence alignment of the purine biosynthesis protein purH showing a 1 aa insertion that is specific for the B. asteroides cluster II species in the protein tree (Figure 1); (B) excerpt from a sequence alignment of the protein 5′-methylthioadenosine nucleosidase showing a 3 aa insertion that is specific for the B. asteroides-related cluster IV in the protein tree.
Figure 7. Surface representation of the homology model of elongation factor Tu (EF-Tu) from B. longum. The conserved 4 aa insert, which is located on the surface of EF-Tu, is shown in magenta. A superposition of the homology model of the B. longum homolog of EF-Tu (cyan) with the E. coli homolog of EF-Tu (PDB ID: 3U6K) (green) shows that the conserved 4 aa insert forms a surface loop on the protein.