Literature DB >> 35344283

Insights from shotgun metagenomics into bacterial species and metabolic pathways associated with NAFLD in obese youth.

Todd Testerman1, Zhongyao Li2, Brittany Galuppo2, Joerg Graf1, Nicola Santoro2,3.   

Abstract

Nonalcoholic fatty liver disease (NAFLD) is the most common form of liver disease and is often the precursor for more serious liver conditions such as nonalcoholic steatohepatitis and cirrhosis. Although the gut microbiome has been implicated in the development of NAFLD, the strong association of obesity with NAFLD and its effect on microbiome structure has made interpreting study outcomes difficult. In the present study, we examined the taxonomic and functional differences between the microbiomes of youth with obesity and with and without NAFLD. Shotgun metagenome sequencing was performed to profile the microbiomes of 36 subjects, half of whom were diagnosed with NAFLD using abdominal magnetic resonance imaging. Beta diversity analysis showed community-wide differences between the groups (p = 0.002). Specific taxonomic differences included increased relative abundances of the species Fusicatenibacter saccharivorans (p = 0.042), Romboutsia ilealis (p = 0.046), and Actinomyces sp. ICM47 (p = 0.0009), and a decrease of Bacteroides thetaiotamicron (p = 0.0002), in the NAFLD group as compared with the non-NAFLD group. At the phylum level, Bacteroidetes (p < 0.0001) was decreased in the NAFLD group. Functionally, branched-chain amino acid (p = 0.01343) and aromatic amino acid (p = 0.01343) synthesis pathways had increased relative abundances in the NAFLD group along with numerous energy use pathways, including pyruvate fermentation to acetate (p = 0.01318).
Conclusion: Community-wide differences were noted based on NAFLD status, and individual bacterial species along with specific metabolic pathways were identified as potential drivers of these differences. The results of the present study support the idea that the NAFLD phenotype displays a differentiated microbial and functional signature from the obesity phenotype.
© 2022 The Authors. Hepatology Communications published by Wiley Periodicals LLC on behalf of American Association for the Study of Liver Diseases.

Entities:  

Mesh:

Year:  2022        PMID: 35344283      PMCID: PMC9315112          DOI: 10.1002/hep4.1944

Source DB:  PubMed          Journal:  Hepatol Commun        ISSN: 2471-254X


INTRODUCTION

Nonalcoholic fatty liver disease (NAFLD) is the most common hepatic complication of obesity in children and adolescents, with an estimated 34% of obese youths affected by this disease.[ , , ] Moreover, data based on histological specimens have shown that inflammation and fibrosis may occur earlier in life in patients who develop NAFLD during childhood,[ ] making them more susceptible to develop liver failure at a younger age.[ ] Although NAFLD is a common complication of obesity in youth, the reason why some patients are susceptible to the disease while others never develop it remains unclear. Nutritional and genetic factors certainly play a key role in conveying susceptibility to NAFLD,[ , , ] but other factors may drive individuals to develop this condition. In particular, it has been suggested that the composition of the gut microbial community may play a pivotal role in this regard.[ , ] Several studies have shown associations between intrahepatic fat content and gut microbiota, including decreased microbial diversity in severe fatty liver disease and an enrichment of specific taxa, including the bacterial phyla Proteobacteria, Firmicutes and Bacteroidetes, and the genera Escherichia, Veillonella, Faecalibacterium, Eubacterium, Bacteroides, and Oscillospira.[ , , , , , ] The results of seminal studies performed in twin adults with NAFLD demonstrated that within the spectrum of NAFLD, a metagenomic signature can not only differentiate between subjects with differing degrees of fibrosis,[ ] but also between those with and without cirrhosis.[ ] Based on these and other findings, we previously assessed whether differences exist between the gut microbiomes of youths with and without NAFLD.[ ] We showed that youths with NAFLD have a different metagenomic profile than those without NAFLD.[ ] However, our initial analysis was limited by the use of sequence data from the V4 region of the 16S rRNA gene, which typically allows for only family‐level and genus‐level identification and no direct assessment of the physiological potential of the microbiome. In the present study, we followed up on our previous findings by shotgun sequencing and analyzing the metagenomes of obese youths with and without NAFLD. Within this group of subjects, we observed community‐wide differences in metagenome composition and identified specific species associated with these differences. We also identified metabolic gene pathways that were increased or decreased in subjects with NAFLD and the species contributing to these differences.

METHODS

Study cohort

Thirty‐eight obese youths (body mass index [BMI] ≥ 95th percentile) were included in the present study. The participants were recruited from the Yale Pediatric Obesity Clinic. Those participants who had known non‐NAFLD hepatic diseases, diabetes, or medication that would interfere with glucose production and liver function were excluded from the study. All participants underwent an oral glucose tolerance test and abdominal fast–magnetic resonance imaging (MRI) to assess abdominal body fat partitioning and intrahepatic fat content as previously described.[ , , ] Hepatic fat fraction (HFF) was assessed by abdominal MRI and used to categorize non‐NAFLD (HFF < 5.5%) and NAFLD (HFF ≥ 5.5%) groups. In addition, fasting blood samples to measure liver function and lipid profiles were obtained. The studies were conducted at the Yale Center for Clinical Investigation at 8 h after a 12‐hour overnight fast. A stool sample from each subject was also collected. Written parental informed consent and written child assent were obtained from all participants. Yale University Human Investigation Committee approved the study.

Statistical analysis

Differences were tested using chi‐square tests for categorical variables, two‐sample t‐tests for normally distributed, continuous variables, and Wilcoxon rank‐sum tests for nonnormally distributed, continuous variables, respectively. Subject characteristic differences were evaluated between the NAFLD and non‐NAFLD groups. Chi‐square testing was applied to the variables of gender (male/female), race (White or Caucasian/Black or African American/Hispanics/Others/Asian), and glucose tolerance (normal/impaired). The distributions of continuous variables were examined by histogram. The following normally distributed variables were examined using two sample t‐tests: visceral fat; BMI, body fat (%); fasting glucose; whole‐body insulin sensitivity index (WBISI); hemoglobin A1C; and total, high‐density lipoprotein, and low‐density lipoprotein cholesterol. Wilcoxon rank‐sum tests were used to test differences for the following variables: hepatic fat fraction (%), subcutaneous, deep subcutaneous, superficial subcutaneous, deep/superficial subcutaneous, age, BMI z‐score, 2‐h glucose, fasting insulin, insulinogenic index, disposition index, triglycerides, alanine aminotransferase (ALT), and aspartate aminotransferase (AST). Subject characteristics analyses were performed using SAS (version 9.4; SAS Institute Inc., Cary, NC, USA).

Shotgun metagenome sample preparation and sequencing

DNA was extracted using a MoBio PowerMag Soil 96‐well kit (MoBio Laboratories, Carlsbad, CA) from 0.25 g of fecal sample. DNA extracts were quantified using the Quant‐iT PicoGreen kit (Invitrogen, Thermo Fisher Scientific, Waltham, MA). Metagenome libraries were produced using a TruSeq PCR‐Free kit (Illumina, San Diego, CA), diluted, and then pooled for loading onto an Illumina HiSeq 2500. The libraries underwent 75 × 75 bp paired‐end sequencing.

Shotgun metagenome preprocessing

Raw paired‐end read data for 36 shotgun metagenomes were downloaded from Basespace (basespace.illumina.com) following demultiplexing. Libraries for two samples failed and were excluded from the study. The reads were then processed using KneadData (version 0.7.4, https://github.com/biobakery/kneaddata) with the default parameters to remove adapter sequences, quality filter and quality trim reads, and remove contaminating host‐associated reads. The median read depth following Quality Control was 46 million reads, and the range was 69 million reads.

Taxonomic profiling and comparison

MetaPhlAn (version 3.0)[ ] was used to taxonomically profile the gut microbial communities to the species level using the latest marker gene ChocoPhlAn database (release 2019.01). To investigate potential differences between the viromes of the two study groups, a virus profiling parameter was also included. Individual sample taxonomic profiles were merged using the merge_metaphlan_tables.py utility script, resulting in a single table with relative abundances provided for each taxon. Absolute counts for use in downstream analyses were recovered by multiplying the total read count for each sample by the relative abundances resulting from the merge table script. Species‐level counts were retrieved from this table and reformatted for import into R (version 4.1.0).[ ] An Operational taxonomic unit table, taxonomy table, and metadata table were imported into RStudio (version 1.4.1717) and read into the package phyloseq (version 1.36.0).[ ] Alpha and beta diversity calculations were performed using the packages phyloseq and microViz (version 0.7.7).[ ] For ecological distance metrics, samples were rarefied to 8,893,658 million reads (the lowest read count for a sample) before calculating distances and ordination. For compositional biplots, a centered log‐ratio transformation was performed as recommended before the generation of principal component analysis (PCA) biplots. Differential abundance testing was performed using the ANCOM statistical framework[ , ] implemented by the package ANCOM‐BC.[ ] Taxa were aggregated at the species, genus, family, order, class, and phylum levels using the microbiome package (version 1.14)[ ] before performing the ANCOM test on each level. Taxa had to appear in a minimum of 25% of samples to be included in ANCOM‐BC analyses.

Functional profiling and comparison

HUMAnN (version 3.0)[ ] was used to functionally profile the microbial communities. Paired‐end sequence files were first concatenated before running HUMAnN. The full ChocoPhlAn pangenome database (release 2019.01) was used for functional pathway abundance and coverage determination, whereas the UniRef90 database (release 2021.03)[ , ] was used for gene family abundance determination. The output pathway and gene family abundance files for each sample were normalized to relative abundances, and the resulting files were joined. Enriched pathways and gene families were identified using the R package MaAsLiN2 (version 1.6.0).[ ] Pathways and gene families achieving a corrected p‐value of 0.05 or less were classified as significantly increased within one of the two patient groups. Identified pathways and gene families were then plotted using the bar plot utility script in HUMAnN.

RESULTS

Population characteristics stratified by NAFLD status

The clinical and demographic characteristics of 36 study participants (18 NAFLD and 18 non‐NAFLD) were evaluated for differences with respect to NAFLD status (Table 1). Most of the subject characteristics were not significantly different between the two groups. Subjects with NAFLD had a higher BMI, BMI z‐score, fasting insulin level, alanine transaminase (ALT) level, and visceral and hepatic fat fractions, but a lower WBISI as compared to subjects without NAFLD. These all attained statistical significance (p < 0.05). Although not statistically significant, subjects with NAFLD had elevated fasting blood glucose, body fat percentage, total cholesterol, triglycerides, and AST as compared to the subjects without NAFLD.
TABLE 1

Characteristics of study population by NAFLD status (n = 36)

NAFLD status p
non‐NAFLD (n = 18)NAFLD (n = 18)
Clinical features
Age (years) a 12.22 ± 3.0212.54 ± 2.420.8628
Gender (M/F)8/10 (44.44%/55.56%)11/7 (61.11%/38.89%)0.3166
Race c 6/4/6/1/1 (33.33%/22.22%/33.33%/5.56%/5.56%)6/2/9/0/1 (33.33%/11.11%/50%/0%/5.56%)0.6868
Glucose tolerance (NGT/IGT) b 15/310/50.2660
BMI (kg/m2) b 30.26 ± 7.0435.21 ± 6.820.0386
BMI z‐score b 2.09 ± 0.482.46 ± 0.280.0204
Body fat (%) b 41.34 ± 10.2346.58 ± 7.860.1100
Glucose metabolism
Fasting glucose (mg/dl) b 89.75 ± 5.3794.53 ± 8.310.0549
Fasting insulin (uU/ml) b 21.69 ± 5.5850.53 ± 30.400.0017
2‐h glucose (mg/dl) b 118.11 ± 25.35130.80 ± 33.500.2249
Hemoglobin A1C (%) b 5.51 ± 0.185.56 ± 0.300.2346
WBISI b 2.25 ± 0.611.24 ± 0.810.0003
IGI b 4.25 ± 3.436.42 ± 4.380.0738
DI b 8.24 ± 4.478.03 ± 8.320.2985
Lipid profile
Total cholesterol (mg/dl) b 143.72 ± 28.39152.21 ± 24.650.3816
HDL cholesterol (mg/dL) b 43.94 ± 8.2144.29 ± 10.410.9180
LDL cholesterol (mg/dl) b 84.28 ± 27.1985.64 ± 19.650.8754
Triglycerides (mg/dl) b 77.17 ± 28.24111.14 ± 68.630.2950
Liver function
Alanine transaminase (U/l) b 18.33 ± 7.2740.14 ± 33.800.0463
Aspartate transaminase (U/l) b 20.83 ± 4.9932.06 ± 26.930.2313
Body fat composition
Visceral (cm2)52.61 ± 24.8677.24 ± 24.030.0047
Deep subcutaneous (cm2)184.98 ± 202.32171.37 ± 58.030.1977
Subcutaneous (cm2) b 467.91 ± 215.92560.69 ± 234.840.1798
Superficial subcutaneous (cm2)161.93 ± 90.37155.98 ± 76.290.9125
Deep/superficial subcutaneous1.14 ± 0.591.20 ± 0.300.1419
Hepatic fat fraction (%)1.23 ± 1.7420.86 ± 11.36< 0.0001

Chi‐square tests were used for categorical variables; two‐sample t‐tests were used for normally distributed continuous variables; and Wilcoxon rank test was used for nonnormally distributed continuous variables.

Abbreviations: BMI, body mass index; DI, disposition index; HDL cholesterol, high‐density lipoprotein cholesterol; IGI, insulinogenic index, IGT, impaired glucose tolerance; LDL cholesterol, low‐density lipoprotein cholesterol; NGT, normal glucose tolerance; WBISI, whole‐body insulin sensitivity index.

Mean ± SD.

3, 1, 3, 3, 3, 3, 3, 5, 3, 4, 4, 4, 4, 4, 4, 4, 1 missing values.

White or Caucasian/Black or African American/Hispanics/Others/Asian.

Characteristics of study population by NAFLD status (n = 36) Chi‐square tests were used for categorical variables; two‐sample t‐tests were used for normally distributed continuous variables; and Wilcoxon rank test was used for nonnormally distributed continuous variables. Abbreviations: BMI, body mass index; DI, disposition index; HDL cholesterol, high‐density lipoprotein cholesterol; IGI, insulinogenic index, IGT, impaired glucose tolerance; LDL cholesterol, low‐density lipoprotein cholesterol; NGT, normal glucose tolerance; WBISI, whole‐body insulin sensitivity index. Mean ± SD. 3, 1, 3, 3, 3, 3, 3, 5, 3, 4, 4, 4, 4, 4, 4, 4, 1 missing values. White or Caucasian/Black or African American/Hispanics/Others/Asian.

Gut microbial community profiles of 36 obese youths

Thirty‐six shotgun metagenomes of the gut microbial communities of 18 subjects with NAFLD and 18 without NAFLD were analyzed in the present study. The results presented in Figure 1 compare the alpha and beta diversity between these groups and profiles the microbial communities at the species and phylum levels. Shannon‐Weiner diversity index values were calculated for all samples and were slightly elevated in the NAFLD group (Figure 1A), although this difference was not significant (p = 0.14, t‐test). Bray‐Curtis dissimilarity values were calculated and plotted using a nonmetric multidimensional scaling ordination to compare overall gut microbial community structure at the species level (Figure 1B). Permutational multivariate analysis of variance (PERMANOVA) testing revealed that the subjects with NAFLD had significantly different clustering from the non‐NAFLD subjects (p adj = 0.001, R2 = 0.1006, PERMANOVA) and showed a significant difference in dispersion (p adj = 0.023, β‐dispersion). These results indicate a significant difference between the centroids of these groups at the species level, and this difference may be due to the tested factor (disease), variability within each group, or a combination of these two factors.[ ] Agglomeration at higher taxonomic levels (genus thru phylum) led to reduced and no longer significant dispersion differences while maintaining the significant clustering differences as measured by PERMANOVA testing (Figure S1).
FIGURE 1

Microbiomes of 36 obese youths. (A) Shannon alpha diversity values are displayed as boxplots with median, interquartile range, and outliers marked. (B) Bray‐Curtis nonmetric multidimensional scaling (NMDS) plot calculated at the species level using rarefied data. Phylum‐level (C) and species‐level (D) compositional bar plots are displayed with each horizontal bar representing an individual microbial community. Nonalcoholic fatty liver disease (NAFLD) status is indicated above each column of bar plots. “Other” indicates all remaining taxa are grouped within this category

Microbiomes of 36 obese youths. (A) Shannon alpha diversity values are displayed as boxplots with median, interquartile range, and outliers marked. (B) Bray‐Curtis nonmetric multidimensional scaling (NMDS) plot calculated at the species level using rarefied data. Phylum‐level (C) and species‐level (D) compositional bar plots are displayed with each horizontal bar representing an individual microbial community. Nonalcoholic fatty liver disease (NAFLD) status is indicated above each column of bar plots. “Other” indicates all remaining taxa are grouped within this category Between the NAFLD and non‐NAFLD groups at the phylum level (Figure 1C, Figure S2A), the mean relative abundance of Firmicutes (72.1% vs. 56.1%) and Bacteroidetes (6.7% vs. 33.2%) differed. Actinobacteria (18.8% vs. 7.9%) was the third most prevalent phylum for both groups. Verrucomicrobia (0.9% vs. 1.9%) and Proteobacteria (0.3% vs. 0.5%) were present as minor bacterial phyla along with the archaeal phylum Euryarcheota (0.8% vs. 0.4%) as well as viruses (0.2% vs. 0.1%). At the species level (Figure 1D, Figure S2B), the Firmicutes Faecalibacterium prausnitzii (9.4% vs. 12.7%), Eubacterium rectale (8.6% vs. 4.6%), Fusicatenibacter saccharivorans (6.0% vs. 1.6%), Ruminococcus bromii (3.9% vs. 3.3%), Anaerostipes hadrus (4.1% vs. 3.0%), Eubacterium sp. CAG 180 (2.4% vs. 2.7%), and Dorea longicatena (3.2% vs. 1.7%); the Bacteroidetes Prevotella copri (1.6% vs. 10.0%) and Bacteroides vulgatus (1.4% vs. 7.5%); and the Actinobacteria Bifidobacterium adolescentis (8.5% vs. 2.3%) represented the 10 most abundant species. The next 25 most abundant species belonged to the genera Eubacterium, Ruminococcus, Colinsella, Roseburia, Bifidobacterium, Bacteroides, Lactobacillus, Allistepes, Akkermansia, Dorea, Blautia, Coprococcus, and Parabacteroides.

Taxonomic differences between the NAFLD and non‐NAFLD groups

The differences between the gut microbial communities of individuals with NAFLD and without NAFLD were first compared using PCA biplots. PCA plotting presents a compositional approach in which results are generally more reproducible, and variance in the data is directly accounted for within the plot (as opposed to requiring separate statistical analysis as in principal coordinate plotting).[ ] Figure 2 shows the species‐level and phylum‐level PCA plots, with arrows representing the contribution of individual taxa to each principal coordinate (the length of the arrow represents the strength of the effect). Beneath each PCA biplot is an iris plot showing the taxonomic composition for each sample corresponding to the location on the PCA plot above. The percent of the variation explained in the PCA plot increased from the species level (PC1 12.2%, PC2 10.6%) to the phylum level (PC1 57.1%, PC2 23.0%).
FIGURE 2

Principal component analysis (PCA) biplots with taxa vectors and iris plots. (A) Species‐level PCA biplot with arrows indicating the abundance gradient for a particular species. The top 10 taxa variable contributors to the principal coordinates (PCs) are shown. (B) Phylum‐level PCA biplot with arrows indicating the abundance gradient for a particular phylum. The top 5 taxa variable contributors to the PCs are shown. (C) Species‐level iris plot showing the taxonomic composition of each sample with its position corresponding to its position in the associated biplot (above). The top 25 species are shown, and all remaining species are grouped into the “other” category. (D) Phylum‐level iris plot showing the taxonomic composition of each sample with its position corresponding to its position in the associated biplot (above). Orange circles on the periphery of the iris plots indicate individuals with NAFLD; blue circles indicate individuals without NAFLD individuals. Genera had to appear in 25% of samples to be included

Principal component analysis (PCA) biplots with taxa vectors and iris plots. (A) Species‐level PCA biplot with arrows indicating the abundance gradient for a particular species. The top 10 taxa variable contributors to the principal coordinates (PCs) are shown. (B) Phylum‐level PCA biplot with arrows indicating the abundance gradient for a particular phylum. The top 5 taxa variable contributors to the PCs are shown. (C) Species‐level iris plot showing the taxonomic composition of each sample with its position corresponding to its position in the associated biplot (above). The top 25 species are shown, and all remaining species are grouped into the “other” category. (D) Phylum‐level iris plot showing the taxonomic composition of each sample with its position corresponding to its position in the associated biplot (above). Orange circles on the periphery of the iris plots indicate individuals with NAFLD; blue circles indicate individuals without NAFLD individuals. Genera had to appear in 25% of samples to be included Species with a large effect size in the quadrant containing most of the non‐NAFLD samples included Alistepes putredinis, Odoribacter splanchnicus, Barnesiella intestinihominis, Parabacteroides merdae, Bacteroides thetaiotamicron, and Bacteroides fragilis (Figure 2A,C). None of the top 10 species shown were dramatically enriched in the individuals with NAFLD. At the phylum level, Bacteroidetes was enriched in the area of the samples from the non‐NAFLD group, while Actinobacteria and viruses were enriched in the area of the samples from the NAFLD group (Figure 2B,D). Proteobacteria and Verrucomicrobia were enriched in a subset of both individuals with and without NAFLD, whereas Firmicutes does not have a clear correlation with either group. The biplots suggested that specific taxa may correlate with the disease state of the subjects, and differential abundance testing was then used to probe this in a statistically meaningful way. Analysis of composition of microbiomes with bias correction (ANCOM‐BC) was used to determine whether any taxa differed significantly in the microbiomes from subjects with and without NAFLD. All taxonomic levels from species to phylum were tested, and the results are presented in Table 2. The phylum Bacteroidetes, class Bacteroidia, order Bacteroidales, family Bacteroidaceae, and genus Bacteroides were all significantly decreased in the NAFLD group. Three species were shown to have elevated relative abundances in the NAFLD group, including Fusicatenibacter saccharivorans, Romboutsia ilealis, and Actinomyces sp. ICM47. The B. thetaiotamicron species was shown to be decreased in the NAFLD group as compared with the non‐NAFLD group.
TABLE 2

ANCOM‐BC results

Species a Coefficient (beta) b SEMTest statistic (W) p Adjusted p‐value (q)
Actinomyces sp. ICM474.422670.988634.473537.69375E‐069.23250E‐04
Fusicatenibacter saccharivorans 1.985060.555593.572883.53079E‐044.16633E‐02
Romboutsia ilealis 4.160141.173043.546453.90464E‐044.56842E‐02
Bacteroides thetaiotaomicron −9.031641.88506−4.791161.65800E‐061.98000E‐04
Genus a
Bacteroides −3.024840.53084−5.698211.21073E‐088.47514E‐07
Flavonifractor −3.208680.80108−4.005446.19029E‐054.27130E‐03
Family a
Bacteroidaceae −2.854270.58112−4.911669.03083E‐073.07048E‐05
Odoribacteraceae −5.526031.38351−3.994226.49073E‐052.14194E‐03
Order a
Bacteroidales−2.673690.47544−5.623631.86985E‐083.55271E‐07
Class a
Bacteroidia−2.783690.44122−6.309112.80639E‐103.92895E‐09
Tissierellia−3.775611.30288−2.897903.75667E‐034.88367E‐02
Phylum a
Bacteroidetes−2.849220.41898−6.800411.04323E‐116.25940E‐11

Only taxa appearing in 25% of samples were included in each taxonomic level’s analysis.

Positive coefficients indicate taxa elevated in the NAFLD group, and negative coefficients indicate reduced in the NAFLD group.

ANCOM‐BC results Only taxa appearing in 25% of samples were included in each taxonomic level’s analysis. Positive coefficients indicate taxa elevated in the NAFLD group, and negative coefficients indicate reduced in the NAFLD group. The differentially abundant species identified using ANCOM‐BC were also plotted in Figure S3 as relative abundances in box plots. Certain species, including Bacteroides thetaiotaomicron and Fusicatenibacter saccharivorans, make up a larger portion of the average species consortium (> 0.1%). Other species, including Actinomyces sp. ICM47 and Romboutsia ilealis, amount to a smaller portion (< 0.1%) of the gut microbial community. The significant results from all other taxonomic levels are plotted in Figures S4–S7.

Functional differences between the NAFLD and non‐NAFLD groups

In addition to taxonomic differences, functional differences between the groups were also investigated. Following the functional classification of reads, their abundances were compared, and the top 50 statistically significant results are reported in Table 3. Of these pathways, 29 had an increased relative abundance in the NAFLD group, while 21 were decreased, among which many biosynthetic pathways were identified, including multiple amino acid synthesis pathways. Multiple lysine synthesis pathways had increased relative abundances along with methionine, isoleucine, ornithine, threonine, serine, tryptophan, arginine, aspartate, and glycine pathways. Superpathways for branched‐chain amino acid (BCAA) and aromatic amino acid (AAA) synthesis along with peptidoglycan synthesis were also observed at higher relative abundances in the NAFLD group. Another subset of pathways was decreased in the NAFLD group and included the ornithine (a distinct pathway from the aforementioned one), glutamine, glutamate, and isoleucine (a distinct pathway from the aforementioned one) synthesis pathways. Degradation, use, and energy generation pathways were also identified. Pyruvate fermentation to acetate and lactate, methanogenesis from acetate, lactose and galactose degradation, glycerol degradation, stachyose degradation, guanosine degradation, glycolysis, and the pentose phosphate pathway were increased in individuals with NAFLD. Two histidine degradation pathways as well as the urea and tricarboxylic acid cycles were decreased in the individuals with NAFLD. All significant results are presented in Table S1.
TABLE 3

Elevated metabolic pathways

PathwayClassCoefficient a SEMPrevalence b p q
COA PWY: coenzyme A biosynthesis I: prokaryoticBiosynthesis0.2570.052360.000020.01318
GLYCOGENSYNTH PWY: glycogen biosynthesis I: from ADP D GlucoseBiosynthesis0.7510.142360.000010.01318
PWY 4242: pantothenate and coenzyme A biosynthesis IIIBiosynthesis0.2850.056360.000010.01318
PWY 5100: pyruvate fermentation to acetate and lactate IIDegradation/use0.7470.149360.000020.01318
PWY 6471: peptidoglycan biosynthesis IV: Enterococcus faeciumBiosynthesis1.2650.249360.000010.01318
ARO PWY: chorismate biosynthesis IBiosynthesis0.3040.068360.000090.01343
BRANCHED CHAIN AA SYN PWY: superpathway of branched chain amino acid biosynthesisBiosynthesis0.2680.063360.000170.01343
COMPLETE ARO PWY: superpathway of aromatic amino acid biosynthesisBiosynthesis0.3090.070360.000090.01343
HSERMETANA PWY: L methionine biosynthesis IIIBiosynthesis0.4720.099360.000030.01343
LACTOSECAT PWY: lactose and galactose degradation IDegradation/use1.5920.379360.000180.01343
PWY 5097: L lysine biosynthesis VIBiosynthesis0.2160.051360.000170.01343
PWY 5103: L isoleucine biosynthesis IIIBiosynthesis0.3110.070360.000090.01343
PWY 6270: isoprene biosynthesis IBiosynthesis0.5620.121360.000050.01343
PWY 7221: guanosine ribonucleotides de novo biosynthesisBiosynthesis0.2370.051360.000050.01343
PWY 724: superpathway of L lysine: L threonine and L methionine biosynthesis IIBiosynthesis0.2620.058360.000060.01343
PWY 7357: thiamine phosphate formation from pyrithiamine and oxythiamine: yeastBiosynthesis0.4190.093360.000080.01343
PWY 7560: methylerythritol phosphate pathway IIBiosynthesis0.5970.129360.000050.01343
GLUTORN PWY: L ornithine biosynthesis IBiosynthesis0.3690.091360.000280.01710
METH ACETATE PWY: methanogenesis from acetateEnergy generation1.6900.421360.000310.01780
NONOXIPENT PWY: pentose phosphate pathway: non oxidative branch: IEnergy generation0.6060.154360.000380.02008
PWY 6527: stachyose degradationDegradation/use0.6770.176360.000510.02400
P4 PWY: superpathway of L lysine: L threonine and L methionine biosynthesis IBiosynthesis1.2690.331360.000510.02401
CALVIN PWY: Calvin Benson Bassham cycleBiosynthesis0.3580.094360.000570.02470
PWY 5188: tetrapyrrole biosynthesis I: from glutamateBiosynthesis0.7920.209360.000580.02470
PWY 6121: 5 aminoimidazole ribonucleotide biosynthesis IBiosynthesis0.2230.059360.000580.02470
DAPLYSINESYN PWY: L lysine biosynthesis IBiosynthesis1.0410.283360.000800.02734
PWY 6163: chorismate biosynthesis from 3 dehydroquinateBiosynthesis0.2550.069360.000780.02734
PWY 6606: guanosine nucleotides degradation IIDegradation/use0.8690.236360.000800.02734
SER GLYSYN PWY: superpathway of L serine and glycine biosynthesis IBiosynthesis0.4390.118360.000740.02734
ARGININE SYN4 PWY: L ornithine biosynthesis IIBiosynthesis−3.2350.636320.000010.01318
HISDEG PWY: L histidine degradation IDegradation/use−1.8760.383360.000020.01318
PWY 1269: CMP 3 deoxy D manno octulosonate biosynthesisBiosynthesis−1.8370.373360.000020.01318
PWY 5973: cis vaccenate biosynthesisBiosynthesis−1.4400.288360.000020.01318
PWY 7663: gondoate biosynthesis: anaerobicBiosynthesis−1.5560.300360.000010.01318
CITRULBIO PWY: L citrulline biosynthesisBiosynthesis−2.3670.540350.000110.01343
POLYISOPRENSYN PWY: polyisoprenoid biosynthesis: E: coliBiosynthesis−1.8750.443360.000170.01343
PWY 4984: urea cycleDegradation/use−2.4020.547350.000100.01343
PWY 5030: L histidine degradation IIIDegradation/use−1.7430.381360.000060.01343
PWY 5505: L glutamate and L glutamine biosynthesisBiosynthesis−2.0940.483350.000120.01343
PWY0 845: superpathway of pyridoxal 5: phosphate biosynthesis and salvageBiosynthesis−2.6210.560330.000040.01343
PYRIDOXSYN PWY: pyridoxal 5: phosphate biosynthesis IBiosynthesis−2.8920.602330.000030.01343
PWY 6859: all trans farnesol biosynthesisBiosynthesis−2.0370.493360.000220.01441
PWY 7539: 6 hydroxymethyl dihydropterin diphosphate biosynthesis III: ChlamydiaBiosynthesis−1.5510.376360.000230.01461
PWY 5104: L isoleucine biosynthesis IVBiosynthesis−2.2180.546330.000270.01663
PWY 6147: 6 hydroxymethyl dihydropterin diphosphate biosynthesis IBiosynthesis−1.5290.380360.000300.01751
PWY 7332: superpathway of UDP N acetylglucosamine derived O antigen building blocks biosynthesisBiosynthesis−2.2410.557170.000300.01751
PWY 7392: taxadiene biosynthesis: engineeredBiosynthesis−1.7810.448360.000350.01877
TCA: TCA cycle I: prokaryoticEnergy generation−0.8550.218360.000400.02066
NAD BIOSYNTHESIS II: NAD salvage pathway III: to nicotinamide ribosideBiosynthesis−2.1450.582330.000790.02734
PWY 7282: 4 amino 2 methyl 5 diphosphomethylpyrimidine biosynthesis IIBiosynthesis−1.4280.388350.000800.02734

Positive coefficients indicate an elevated pathway in the NAFLD group, and negative coefficients indicate a reduced pathway in the NAFLD group.

Prevalence indicates the number of samples in which this particular pathway was detected.

Elevated metabolic pathways Positive coefficients indicate an elevated pathway in the NAFLD group, and negative coefficients indicate a reduced pathway in the NAFLD group. Prevalence indicates the number of samples in which this particular pathway was detected. Additionally, the specific contributions of individual species to a particular metabolic pathway were calculated and then compared between groups to identify differentially abundant taxonomic contributions. Bar plots are provided in Figures S9–S14 for a subset consisting of pathways of interest. Eubacterium hallii and Fusicatenibacter saccharivorans were by far the two most common taxa contributing to increased pathways in the NAFLD group. Additionally, Blautia wexlerae, Blautia obeum, Streptococcus salivarius, Ruminococcus torques, Streptococcus parasanguinis, Coprococcus catus, Streptococcus thermophilus, Romboutsia ilealis, Roseburia sp. CAG 471, and Dorea formicigenerans were also increased in relative abundance in certain pathways. Reduced taxa contributing to decreased metabolic pathways in the NAFLD group included Bacteroides vulgatus, Bacteroides thetaiotaomicron, Bacteroides ovatus, Bacteroides stercoris, Parabacteroides distasonis, Bacteroides fragilis, Bacteroides xylanisolvens, Ruthenibacterium lactatiformans, and Parabacteroides merdae. All significant results are presented in Table S1.

DISCUSSION

In the present study, we used shotgun metagenomic sequencing to illuminate a number of taxonomic and functional differences between the gut microbial communities of obese youths with and without NAFLD. Previous studies sequenced relatively short variable regions of the 16S rRNA gene, which limits the taxonomic resolution capacity and lacks important functional information.[ ] The analysis of the shotgun metagenome data at the species level revealed that the composition of the microbiomes differed significantly between the NAFLD and non‐NAFLD groups (Figure 1). The high degree of variability of the human microbiome, especially at the species and genus levels, has been well documented.[ , ] This variability may explain the dispersion visualized in Figure 1B, particularly for the non‐diseased group. The reduction in variance at higher taxonomic levels suggests that a larger number of genera and species vary among the subjects without NAFLD than the subjects with NAFLD. The taxonomic contributors of the observed difference in beta diversity were analyzed using PCA biplots. Interestingly, a number of species appeared to correlate with the non‐NAFLD group, whereas the NAFLD group lacks obvious ones. This result indicates that a lack of certain microbial taxa and general dysbiosis may contribute to the pathogenesis of NAFLD as opposed to the presence of specific taxa being causative.[ ] The most commonly observed and significant taxonomic difference was the relatively decreased abundance of bacteria belonging to the phylum Bacteroidetes in subjects with NAFLD. Both the PCA biplot and ANCOM differential abundance testing identified this group as strongly decreased in individuals with NAFLD. This was also the finding of an earlier study that we performed on a larger cohort using 16S rRNA gene sequencing and that included the subjects in this study.[ ] A commonly cited overabundance of Proteobacteria[ ] in individuals with NAFLD was not observed in our data set, although a subset of both study groups possessed increased relative abundances of Proteobacteria. A number of species were identified as being differentially abundant, among which B. thetaiotamicron was shown to be significantly decreased in the NAFLD group and was identified as a major driver in PCA (Figure 2). B. thetaiotamicron has been shown to reduce diet‐induced body weight gain and adiposity in mice and was observed at lower abundances in obese human subjects compared with healthy ones.[ ] O. splanchnicus, a species within the family Odoribacteriaceae that was decreased in the NAFLD group, was shown to be associated with decreased incidence of NAFLD,[ ] cystic fibrosis,[ ] and inflammatory bowel disease[ ] in previous studies. This species was identified as a major driver in PCA, and its bacterial family was identified via ANCOM analysis. Fusicatenibacter saccharivorans, the lone species within a relatively newly recognized genus,[ ] comprised a fairly large portion of the NAFLD community. This species has been associated with a diet high in processed foods[ ] and is capable of fermenting a wide variety of saccharides, producing short‐chain fatty acids (SCFAs) as a result.[ ] The species Actinomyces sp. ICM47 is primarily associated with the human oral microbiome,[ ] and its overabundance in individuals with NAFLD could be partly due to increased saliva entering the gastrointestinal tract through more frequent eating or an increased salivary response (although this is only speculation). It is important to note that detecting DNA from specific bacterial species does not establish their living presence within a certain niche. The presence of oral microbial DNA in fecal samples is to be expected but does not necessarily indicate that a species is a gut resident. Functionally, an even more expansive list of features was shown to be differentially abundant between the two groups. BCAA (Figure S9) and AAA (Figure S10) biosynthetic genes were observed to be at higher relative abundances in the group with NAFLD, and previous studies have reported BCAAs and AAAs to be associated with NAFLD.[ , ] Interestingly, pathways for isoleucine (a BCAA) were shown to be increased in both groups. Isoleucine pathways I (Figure S11) and III (Figure S12) were increased in individuals with NAFLD, while pathway IV (Figure S13) was decreased. Pathways 1 and 3 use 2‐oxobutanoate and glutamate, respectively, whereas pathway 4 uses propanoate. Among the three aforementioned pathways, contributions by Eubacterium, Blautia, and Streptococcus spp. were noted as increased in individuals with NAFLD, while Bacteroides and Parabacteroides spp. were decreased. Although these shotgun data only detect the presence of certain genes and do not provide information on expression levels, it is possible that these pathways are active and lead to an increase in free isoleucine associated with NAFLD. The fermentation of pyruvate to lactate and acetate, a SCFA, is another interesting pathway that was observed at higher relative abundances in the NAFLD group compared with the non‐NAFLD group (Figure S14)—a difference primarily driven by the abundances of Fusicatenibacter saccharivorans and two Streptococcus species. Total SCFA content in the gut has been correlated with obesity status.[ ] The classification of both evaluated groups as obese may indicate that certain bacterial community members further exacerbate SCFA concentration, increasing the chances for more severe obesity and NAFLD progression. The increase or decrease in specific SCFAs (acetate, propionate, and butyrate) may also contribute to disease progression. Butyrate and acetate are the most and least potent anti‐inflammatory SCFAs, respectively.[ ] The gene potential for increased acetate production by F. saccharivorans and a decrease in butyrate production from a decreased abundance of F. prausnitzii (the most abundant species in these metagenomes and a known butyrate producer) (Figure S2B) may contribute to more severe gut inflammation in the NAFLD group.[ ] This study has some limitations, the main one being the relatively small sample size (n = 36) used, which could have led to a potentially high false discovery rate (FDR). To account for this issue, we made FDR‐related statistical adjustments and used minimum prevalence values to exclude taxa or features only present in a small subset of samples. Another consideration, particularly for the functional data, is that shotgun metagenomic data provide the functional potential of the community present rather than the expression levels of these genes. This distinction is important, as certain genes may be present but not actively transcribed, whereas others may be overexpressed. Although a metatranscriptomic analysis could provide a functional snapshot of the community, this approach has its own set of caveats, as the transcriptomes of fecal microbiome samples are not necessarily representative of the physiology in the colon. The significant differences noted for BMI and body fat percentage between the two groups may also present as a confounding variable. The NAFLD group had consistently higher BMI and percent body fat measurements, indicating more severe obesity, which has been shown to affect microbiome diversity.[ ] In the present study, we profiled a largely undercharacterized age group within the population,[ ] including thorough clinical phenotyping of the patient groups using MRI and shotgun deep sequencing of the gut microbial communities. In summary, we found that the two groups had significantly different microbial community compositions, and that these differences were at least partially driven by a specific subset of bacterial species and that biosynthetic pathways producing metabolites (BCAAs, AAAs, SCFAs) previously correlated with disease were again identified here. There is great potential for future studies to investigate these identified species as probiotics or drug targets as well as using correlated metabolite production as a biomarker for increased disease risk to inform clinical interventions. The comparison examined was also unique in that only obese youth with and without NAFLD were included, thereby controlling for the effect of obesity on the microbiome. Given NAFLD’s link with type 2 diabetes, cardiovascular disease, and advanced liver disease, as well as its increasing prevalence, NAFLD poses a major concern to public health and begs more effective strategies for prevention and treatment. While gut dysbiosis has been postulated as a potential contributor to the pathogenesis of NAFLD and NASH, more research is needed to elucidate how gut‐liver axis signaling influences hepatic fat accumulation and inflammation and how these mechanisms might be modulated for therapeutic benefit. Future studies that combine clinical, metabolomic, and microbiome data will be extremely important for prospective therapies for NAFLD and its more severe progressions, especially considering children and adolescents in high‐income countries are in the midst of an obesity epidemic.

CONFLICTS OF INTEREST

Nothing to report.

ETHICS APPROVAL

The Yale University Human Investigation committee approved the study.

CONSENT TO PARTICIPATE AND CONSENT FOR PUBLICATION

Written child assent and written parental permission were obtained from all participants involved in the study.

AVAILABILITY OF DATA AND MATERIAL

Data can be found in the NCBI SRA database under project ID PRJNA328258.

CODE AVAILABILITY

Code can be found at https://github.com/joerggraflab/Code‐for‐Testerman‐Li‐2021. Fig S1‐S14 Click here for additional data file. Table S1 Click here for additional data file.
  46 in total

1.  UniRef: comprehensive and non-redundant UniProt reference clusters.

Authors:  Baris E Suzek; Hongzhan Huang; Peter McGarvey; Raja Mazumder; Cathy H Wu
Journal:  Bioinformatics       Date:  2007-03-22       Impact factor: 6.937

Review 2.  Metabolic syndrome in pediatrics: old concepts revised, new concepts discussed.

Authors:  Ebe D'Adamo; Nicola Santoro; Sonia Caprio
Journal:  Pediatr Clin North Am       Date:  2011-10       Impact factor: 3.278

Review 3.  Pediatric fatty liver disease: role of ethnicity and genetics.

Authors:  Pierluigi Marzuillo; Emanuele Miraglia del Giudice; Nicola Santoro
Journal:  World J Gastroenterol       Date:  2014-06-21       Impact factor: 5.742

4.  Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery 3.

Authors:  Francesco Beghini; Lauren J McIver; Aitor Blanco-Míguez; Leonard Dubois; Francesco Asnicar; Sagun Maharjan; Ana Mailyan; Paolo Manghi; Matthias Scholz; Andrew Maltez Thomas; Mireia Valles-Colomer; George Weingart; Yancong Zhang; Moreno Zolfo; Curtis Huttenhower; Eric A Franzosa; Nicola Segata
Journal:  Elife       Date:  2021-05-04       Impact factor: 8.140

5.  Analysis of composition of microbiomes: a novel method for studying microbial composition.

Authors:  Siddhartha Mandal; Will Van Treuren; Richard A White; Merete Eggesbø; Rob Knight; Shyamal D Peddada
Journal:  Microb Ecol Health Dis       Date:  2015-05-29

6.  A Branched-Chain Amino Acid-Related Metabolic Signature Characterizes Obese Adolescents with Non-Alcoholic Fatty Liver Disease.

Authors:  Martina Goffredo; Nicola Santoro; Domenico Tricò; Cosimo Giannini; Ebe D'Adamo; Hongyu Zhao; Gang Peng; Xiaoqing Yu; Tukiet T Lam; Bridget Pierpont; Sonia Caprio; Raimund I Herzog
Journal:  Nutrients       Date:  2017-06-22       Impact factor: 5.717

7.  A gut microbiome signature for cirrhosis due to nonalcoholic fatty liver disease.

Authors:  Cyrielle Caussy; Anupriya Tripathi; Greg Humphrey; Shirin Bassirian; Seema Singh; Claire Faulkner; Ricki Bettencourt; Emily Rizo; Lisa Richards; Zhenjiang Z Xu; Michael R Downes; Ronald M Evans; David A Brenner; Claude B Sirlin; Rob Knight; Rohit Loomba
Journal:  Nat Commun       Date:  2019-03-29       Impact factor: 14.919

Review 8.  Gut Microbial Metabolism and Nonalcoholic Fatty Liver Disease.

Authors:  Suzanne R Sharpton; Germaine J M Yong; Norah A Terrault; Susan V Lynch
Journal:  Hepatol Commun       Date:  2018-12-03

9.  Novel Odoribacter splanchnicus Strain and Its Outer Membrane Vesicles Exert Immunoregulatory Effects in vitro.

Authors:  Kaisa Hiippala; Gonçalo Barreto; Claudia Burrello; Angelica Diaz-Basabe; Maiju Suutarinen; Veera Kainulainen; Jolene R Bowers; Darrin Lemmer; David M Engelthaler; Kari K Eklund; Federica Facciotti; Reetta Satokari
Journal:  Front Microbiol       Date:  2020-11-12       Impact factor: 5.640

10.  Insights from shotgun metagenomics into bacterial species and metabolic pathways associated with NAFLD in obese youth.

Authors:  Todd Testerman; Zhongyao Li; Brittany Galuppo; Joerg Graf; Nicola Santoro
Journal:  Hepatol Commun       Date:  2022-03-28
View more
  2 in total

1.  The Effects of Time-Restricted Eating on Metabolism and Gut Microbiota: A Real-Life Study.

Authors:  Ilario Ferrocino; Marianna Pellegrini; Chiara D'Eusebio; Ilaria Goitre; Valentina Ponzo; Maurizio Fadda; Rosalba Rosato; Giulio Mengozzi; Guglielmo Beccuti; Fabio Dario Merlo; Farnaz Rahimi; Isabella Comazzi; Luca Cocolin; Ezio Ghigo; Simona Bo
Journal:  Nutrients       Date:  2022-06-21       Impact factor: 6.706

2.  Insights from shotgun metagenomics into bacterial species and metabolic pathways associated with NAFLD in obese youth.

Authors:  Todd Testerman; Zhongyao Li; Brittany Galuppo; Joerg Graf; Nicola Santoro
Journal:  Hepatol Commun       Date:  2022-03-28
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.