| Literature DB >> 35456791 |
German A Ashniev1, Sergey N Petrov2, Stanislav N Iablokov1, Dmitry A Rodionov3.
Abstract
The human gut microbiota (HGM) have an impact on host health and disease. Amino acids are building blocks of proteins and peptides, also serving as precursors of many essential metabolites including nucleotides, cofactors, etc. Many HGM community members are unable to synthesize some amino acids (auxotrophs), while other members possess complete biosynthetic pathways for these nutrients (prototrophs). Metabolite exchange between auxotrophs and prototrophs affects microbial community structure. Previous studies of amino acid biosynthetic phenotypes were limited to model species or narrow taxonomic groups of bacteria. We analyzed over 2800 genomes representing 823 cultured HGM species with the aim to reconstruct biosynthetic pathways for proteinogenic amino acids. The genome context analysis of incomplete pathway variants allowed us to identify new potential enzyme variants in amino acid biosynthetic pathways. We further classified the studied organisms with respect to their pathway variants and inferred their prototrophic vs. auxotrophic phenotypes. A cross-species comparison was applied to assess the extent of conservation of the assigned phenotypes at distinct taxonomic levels. The obtained reference collection of binary metabolic phenotypes was used for predictive metabolic profiling of HGM samples from several large metagenomic datasets. The established approach for metabolic phenotype profiling will be useful for prediction of overall metabolic properties, interactions, and responses of HGM microbiomes as a function of dietary variations, dysbiosis and other perturbations.Entities:
Keywords: amino acid metabolism; human gut microbiome; metabolic reconstruction; metagenomics; non-orthologous displacements; phenotype
Year: 2022 PMID: 35456791 PMCID: PMC9026213 DOI: 10.3390/microorganisms10040740
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Figure 1Reconstructed amino acid biosynthesis pathways in HGM genomes. Biosynthetic enzymes are shown in black using solid black arrows. Amino acid uptake transporters are shown in red using dashed red arrows.
In silico reconstruction and phenotype prediction for amino acid biosynthesis in HGM reference genomes.
| Amino Acid | Pathway Signature and Biosynthetic a.a. Dependencies 1 | PV 2 | BP 3 | No. Gen. 4 | Growth Requirements 5 |
|---|---|---|---|---|---|
| Proline | ProA, ProB | P |
| 2275 | -- |
| - | A |
| 581 | Pro | |
| Threonine | Hom, ThrB, ThrC | P |
| 2483 | -- |
| Hom, ThrC | P* |
| 50 | (missing ThrB) | |
| - | A |
| 323 | Thr | |
| Glycine | [GlyA/SgaA + | P |
| 2182 | -- |
| - | A |
| 58 | Gly | |
| GlyA/SgaA ( | P1 |
| 395 | Ser, Gly | |
| GlyB ( | P2 |
| 25 | Thr, Gly | |
| [GlyA/SgaA ( | P3 |
| 23 | Thr, Ser, Gly | |
| Serine | SerA, SerC, SerB | P |
| 1863 | -- |
| SerA, SerC | P* |
| 450 | (missing SerB) | |
| - | A |
| 543 | Ser | |
| Leucine & Isoleucine & Valine | IlvA, IlvG, (IlvM), IlvC, IlvD, (IlvE), LeuA, LeuB, LeuC, LeuD | P1 |
| 1912 | -- |
| CimA, IlvG, (IlvM), IlvC, IlvD, (IlvE), LeuA, LeuB, LeuC, LeuD | P2 |
| 317 | -- | |
| IlvG, (IlvM), IlvC, IlvD, (IlvE), LeuA, LeuB, LeuC, LeuD | P* |
| 17 | (missing IlvA/CimA) | |
| IlvA, IlvG, (IlvM), IlvC, IlvD, (IlvE) | P3 |
| 60 | Leu | |
| LeuA, LeuB, LeuC, LeuD, IlvE | A* |
| 20 | Ile, Val | |
| - | A |
| 699 | Leu, Ile, Val | |
| Cysteine | CysE, CysK | P |
| 1919 | -- |
| - | A |
| 526 | Cys | |
| CysE, CysK ( | P1 |
| 411 | Ser, Cys | |
| Methionine | Hom, MetA, [CTBL, CTGS]/MetY, [MetH/MetE], (MetF), MetK | P |
| 2252 | -- |
| [MetH/MetE], (MetF), MetK | A1 |
| 83 | Met | |
| Hom, MetA, [CTBL, CTGS]/MetY, MetK | A2 |
| 25 | Met (missing MetH/E) | |
| MetK | A |
| 496 | Met | |
| Lysine | LysC, Asd, DapA, DapB, DapH, (PatA), DapeL, DapF, LysA | P1 |
| 860 | -- |
| LysC, Asd, DapA, DapB, DapD, (DapC), DapE, DapF, LysA | P2 |
| 1051 | -- | |
| LysC, Asd, DapA, DapB, DapL, DapF, LysA | P3 |
| 546 | -- | |
| LysC, Asd, DapA, DapB, Ddh, LysA | P4 |
| 643 | -- | |
| LysC, Asd, DapA, DapB, DapF, LysA | P* |
| 38 | (no amination pathway) | |
| LysA | A1 |
| 14 | Lys, DAP | |
| - | A |
| 248 | Lys | |
| Histidine | HisG, (HisZ), (HisE), HisI, HisA, HisH, HisF, HisB, HisD, (HisN), (HisC) | P |
| 2097 | -- |
| - | A |
| 759 | His | |
| Tyrosine & Phenylalanine | PheA, TyrA/TyrC, (AroH), (TyrB) | FY |
| 2257 | -- |
| PheA, (AroH), (TyrB) | FA |
| 40 | Tyr | |
| TyrA/TyrC, (AroH), (TyrB) | AY |
| 179 | Phe | |
| - | AA |
| 380 | Tyr, Phe | |
| Tryptophan | TrpA, TrpB, TrpC, (TrpD), TrpF, TrpEG | P |
| 1810 | -- |
| TrpA, TrpB, TrpC, (TrpD), TrpF, TrpEG ( | P1 |
| 140 | Ser | |
| TrpA, TrpB | A1 |
| 14 | Trp, indole precursors | |
| TrpA, TrpB, TrpC | A2 |
| 88 | Trp, indole precursors | |
| TrpA, TrpB, TrpC, TrpD, TrpF | A3 |
| 38 | Trp, anthranilate | |
| - | A |
| 766 | Trp | |
| Arginine | (ArgA/ArgJ), (ArgB), ArgC, ArgD, ArgF, (ArgE), ArgG, ArgH | P |
| 2061 | -- |
| ArgG, ArgH | A1 |
| 251 | Arg, citrulline | |
| ArgA, ArgB, ArgC, ArgD, (ArgF), (ArgE) | A2 |
| 5 | Arg | |
| - | A |
| 539 | Arg | |
| Chorismate | [AroG, AroB]/[AroA-II, AroB-II], AroD, AroE, AroK, AroA, AroC | P |
| 2525 | -- |
| AroD, AroE, AroK, AroA, AroC | P1 |
| 14 | (missing AroG/AroB) | |
| AroG, AroB, AroD, AroE, AroK, (AroC) | P2 |
| 37 | (missing AroA) | |
| AroG, AroB, AroD, AroE, AroA, AroC | P3 |
| 69 | (missing AroK) | |
| AroG, AroB, AroD, AroK, AroA, AroC | P4 |
| 30 | (missing AroE) | |
| AroK, AroA, AroC | As |
| 13 | Chorismate, shikimate | |
| - | A |
| 169 | Chorismate | |
| Aspartate & Asparagine | AspC, AsnA/AsnB, (GatABC) | DN |
| 1852 | -- |
| AspC, GatABC | DAG ^ |
| 899 | -- | |
| AspC | DA |
| 28 | Asn | |
| (AsnB/AsnA), (GatABC) | AA |
| 77 | Asp, Asn | |
| Glutamate & Glutamine | GltBD/Gdh, GlnA, (GatABC) | EQ |
| 2663 | -- |
| GltBD/Gdh, GatABC | EAG ^ |
| 17 | -- | |
| GltBD/Gdh | EA |
| 20 | Gln | |
| (GlnA), (GatABC) | AA |
| 156 | Gln, Glu |
1 Special characters used in pathway signatures: Parenthesis denote functional roles that are not required to be present corresponding to enzymes that were not detected in all prototrophs. ‘/’ denotes alternative enzymes with the same functional role (at least one of these is required to be present). Biosynthetic dependencies on other amino acids (biochemical precursors) are shown in red, where parenthesis denote a respective amino acid growth requirement. 2 Pathway Variants: Asterisk denotes incomplete pathways with one or two essential enzymes missing. ‘^’ denote the presence of GatABC amidotransferase (see text). 3 Binary Phenotypes: ‘0’ and ‘1’ correspond to auxotrophs and prototrophs, respectively. 4 Number of genomes possessing a pathway variant. 5 ‘--’ denotes no growth requirement in predicted prototrophs; comments in parenthesis describe missing biosynthetic enzymes or pathways. DAP, meso-2,6-diaminopimelate.
Novel amino acid biosynthesis enzymes predicted as non-orthologous gene displacements in HGM genomes.
| Pathway | Enzyme | Predicted Functional Role | Occurrence 1 | Evidence 2 | Example ID 3 |
|---|---|---|---|---|---|
| Arginine | ArgA2 | N-succinylglutamate synthase (EC 2.3.1.-) | 7.4% | CO, CL, CR, CF | Q8A1A5 |
| ArgA3 | N-acetylglutamate synthase (EC 2.3.1.1) | 0.8% | CO, CL | W3Y6L2 | |
| Serine | SerC2 | Phosphoserine aminotransferase (EC 2.6.1.52) | 4.0% | CO, CF, CL | Q2FXK2 |
| SerC3 | Phosphoserine aminotransferase (EC 2.6.1.52) | 6.1% | CF, CL | A5I0W7 | |
| SerB2 | Phosphoserine phosphatase (EC 3.1.3.3) | 3.6% | CO, CF | C4IFQ5 | |
| Threonine | ThrB2 | Homoserine kinase (EC 2.7.1.39) | 10.6% | CO, CL | Q5LHR7 |
| Lysine | DapF2 | Diaminopimelate epimerase (EC 5.1.1.7) | 4.4% | CO, CL, CR | W1W731 |
1 Percentage of HGM genomes possessing a non-orthologous enzyme. 2 Genome context evidence: CO, co-occurrence; CL, co-localization; CR, co-regulation; CF, common functional class. 3 Uniprot protein ID.
Figure 2Genomic context of new non-orthologous enzymes in amino acid biosynthetic pathways. The Venn diagrams show co-occurrence of 4 different variants of N-acetylglutamate synthase ArgA (A), and phosphoserine phosphatase SerB (B). Gene co-localization is shown for selected genomes only (C).
Figure 3(A) Distribution of amino acids producers among analyzed HGM strains. Colored bars show the average amino acid prototrophy of each genus; empty bars represent auxotrophy. The phylogenetic tree of HGM genera was obtained using concatenated ribosomal proteins as previously described [25]. Number of analyzed strains per genus is shown in the inner circle. (B) Distribution of the amino acid producers at the phylum level. The number of analyzed genomes in each phylum is shown.
Figure 4Distribution of Community Phenotype Indices (CPIs) for amino acid biosynthesis pathways in HGM samples. Samples from the AGP, UKT, and Hadza datasets are shown in blue, red, and green, respectively.
Figure 5Mutual dependence of median rCPI and rPAD values for each amino acid synthesis phenotype in three HGM datasets.
Pearson’s correlation coefficient for comparison of CPIs, relative phenotype abundance (RPA) and relative pathway abundance (MetaCyc) for amino acid synthesis pathways obtained by PICRUST2 pipeline on HGM samples from the AGP dataset.
| RPA vs. CPI | RPA vs. MetaCyc | MetCyc Pathway Name and Annotation |
|---|---|---|
| 0.86 | 0.54 | ARGSYNBSUB-PWY: L-arginine biosynthesis II (acetyl cycle) |
| 0.59 | ARGSYN-PWY: L-arginine biosynthesis I (via L-ornithine) | |
| 0.31 | PWY-5154: L-arginine biosynthesis III (via N-acetyl-L-citrulline) | |
| 0.60 | PWY-7400: L-arginine biosynthesis IV (archaebacteria) | |
| 0.81 | 0.92 | HISTSYN-PWY: L-histidine biosynthesis |
| 0.60 | 0.64 | ILEUSYN-PWY: L-isoleucine biosynthesis I (from threonine) |
| 0.71 | PWY-5101: L-isoleucine biosynthesis II | |
| 0.69 | PWY-5103: L-isoleucine biosynthesis III | |
| 0.56 | PWY-5104: L-isoleucine biosynthesis IV | |
| 0.67 | 0.70 | LEUSYN-PWY: L-leucine biosynthesis |
| 0.69 | 0.27 | DAPLYSINESYN-PWY: L-lysine biosynthesis I |
| −0.05 | PWY-2941: L-lysine biosynthesis II | |
| 0.71 | PWY-2942: L-lysine biosynthesis III | |
| 0.72 | PWY-5097: L-lysine biosynthesis VI | |
| 0.71 | −0.16 | HOMOSER-METSYN-PWY: L-methionine biosynthesis I |
| −0.17 | HSERMETANA-PWY: L-methionine biosynthesis III | |
| 0.74 | 0.86 | TRPSYN-PWY: L-tryptophan biosynthesis |
| 0.60 | 0.64 | VALSYN-PWY: L-valine biosynthesis |
Figure 6Distribution of metagenomic abundances for amino acid biosynthetic pathways in HGM samples from IBD dataset. Pathway abundances were calculated as a sum of TMM-normalized counts for two selected signature genes in each biosynthetic pathway (see Table S8).