Literature DB >> 34327846

Metabolomic profiling of stool of two-year old children from the INSIGHT study reveals links between butyrate and child weight outcomes.

Debmalya Nandy1, Sarah J C Craig2,3, Jingwei Cai4, Yuan Tian4, Ian M Paul3,5, Jennifer S Savage6,7, Michele E Marini7, Emily E Hohman7, Matthew L Reimherr1,3, Andrew D Patterson4,8, Kateryna D Makova2,3, Francesca Chiaromonte1,3,9.   

Abstract

BACKGROUND: Metabolomic analysis is commonly used to understand the biological underpinning of diseases such as obesity. However, our knowledge of gut metabolites related to weight outcomes in young children is currently limited.
OBJECTIVES: To (1) explore the relationships between metabolites and child weight outcomes, (2) determine the potential effect of covariates (e.g., child's diet, maternal health/habits during pregnancy, etc.) in the relationship between metabolites and child weight outcomes, and (3) explore the relationship between selected gut metabolites and gut microbiota abundance.
METHODS: Using 1 H-NMR, we quantified 30 metabolites from stool samples of 170 two-year-old children. To identify metabolites and covariates associated with children's weight outcomes (BMI [weight/height2 ], BMI z-score [BMI adjusted for age and sex], and growth index [weight/height]), we analysed the 1 H-NMR data, along with 20 covariates recorded on children and mothers, using LASSO and best subset selection regression techniques. Previously characterized microbiota community information from the same stool samples was used to determine associations between selected gut metabolites and gut microbiota.
RESULTS: At age 2 years, stool butyrate concentration had a significant positive association with child BMI (p-value = 3.58 × 10-4 ), BMI z-score (p-value = 3.47 × 10-4 ), and growth index (p-value = 7.73 × 10-4 ). Covariates such as maternal smoking during pregnancy are important to consider. Butyrate concentration was positively associated with the abundance of the bacterial genus Faecalibacterium (p-value = 9.61 × 10-3 ).
CONCLUSIONS: Stool butyrate concentration is positively associated with increased child weight outcomes and should be investigated further as a factor affecting childhood obesity.
© 2021 The Authors. Pediatric Obesity published by John Wiley & Sons Ltd on behalf of World Obesity Federation.

Entities:  

Keywords:  1-HNMR; butyrate; childhood obesity; metabolomics; weight outcomes at 2-years

Mesh:

Substances:

Year:  2021        PMID: 34327846      PMCID: PMC8647636          DOI: 10.1111/ijpo.12833

Source DB:  PubMed          Journal:  Pediatr Obes        ISSN: 2047-6302            Impact factor:   3.910


INTRODUCTION

The prevalence of childhood obesity in the United States has increased in the past several decades. , Based on reports from the Center for Disease Control and Prevention, 13.9% of preschool‐aged children (age 2–5 years) were classified as having obesity in 2015–2016. Importantly, childhood obesity is associated with increased risk of adult obesity and its comorbidities. Therefore, identifying early life risk factors, such as variation in microbial communities or their metabolic outputs, to characterize children most at risk of developing obesity is critical. The gut microbiota has been shown to be an important component of obesity aetiology. It has been over a decade since Turnbaugh's seminal work showing that microbiota from obese mice can stimulate the development of obesity in gnotobiotic mice. In humans, numerous studies have shown that the gut microbiota of adults and children with obesity significantly differ from those without obesity. , , , , Although there has not been a consensus on what the composition of an “obese microbiota” community looks like, most scientists agree that the microbiota communities of healthy and diseased individuals do differ. While the role of the gut microbiota in obesity has been broadly investigated, the microorganisms present in the gut explain only part of the picture. The metabolic capacity and outputs of this microbial community, which cannot be characterized through sequence based approaches, is the link between the microbiota and the human host. Metabolomics data generated by proton Nuclear Magnetic Resonance (1H‐NMR) spectroscopy have popularly been used to study these small molecules to detect important biomarkers for many diseases. , Faecal metabolome analysis in adults has shown that individuals with obesity have higher levels of short chain fatty acids (SCFAs, e.g., acetate, propionate, and butyrate), branched chain amino acids (e.g., leucine, isoleucine, and valine), and aromatic amino acids (phenylalanine, tryptophan, and tyrosine). , Higher levels of faecal SCFAs in individuals with obesity, in addition to differences in the gut microbial communities, have also been related to measurements of cardiometabolic disorders (e.g., inflammation, glycemia, and dyslipidemia) and gut permeability (LPS binding protein). These findings suggest that characterizing the metabolites in the gut is an important step for understanding their roles in the human physiology and risk of diseases. The characterization of both microbiota and metabolomic environments has mainly been conducted in adults and older children; although a few small studies have been conducted in younger children. Interestingly, the patterns seen in young children do not always match those seen in older children and adults. Furthermore, our knowledge of obesity‐related gut metabolites in young children is limited because the majority of studies in children focus on plasma metabolites. , Our study, which focuses on the gut metabolome of young children and its potential role in their weight outcomes, fills an important gap. The Intervention Nurses Start Infants Growing on Healthy Trajectories (INSIGHT) study is a clinical trial, in which first‐born children were randomized to a responsive parenting behavioural intervention or to a home safety control shortly after birth, and were followed longitudinally. This clinical trial collected data on growth (e.g., weight and height), behaviour (e.g., sleep, activity and temperament), home environment, diet, and medical history. Recently, Craig and colleagues examined the oral and gut microbiota of children from the INSIGHT study at age 2 years, and related these microbial communities to the children's growth trajectories (from birth through age 2). That study, surprisingly, found overall weak links between the gut microbiota and children's growth patterns. Here, we hypothesize that differences in the metabolic output of the gut microbiota community might show stronger associations with children's weight outcomes than the microbiota community itself—specifically, that children with higher weight outcomes at age 2 years will have metabolomic profiles that differ significantly from those of children with lower weight outcomes. In this article, we report metabolite concentrations from targeted 1H‐NMR , profiling of stool sampled from 170 children in the INSIGHT cohort at the age of approximately 2 years. Based upon the data from related prior studies, , we specifically targeted 30 major metabolites commonly found in human stool samples. Our first objective was to investigate the association of these metabolites with children's BMI (weight/height2), BMI z‐score (BMI adjusted for age and sex), and growth index (weight/height). Our second objective was to assess the potential effects of 20 other covariates collected from INSIGHT, including children's dietary intake and maternal health and habits during pregnancy, on the relationship between identified metabolites and children's weight outcomes. Our third objective was to investigate whether metabolites significantly associated with weight outcomes have any relationship to the microbial communities we have previously characterized. See Figure 1 for a diagrammatic representation of these objectives.
FIGURE 1

Schematic of objectives of the current study. The bold solid arrow represents our main objective: to investigate the relationship between gut metabolites and child weight outcomes (1). Lighter arrows represent related objectives: to investigate effects of covariates recorded on children and mothers as direct effects (solid) and through interactions with butyrate (dashed) on child weight outcomes (2). We also investigate the relationship between the gut microbiota and the gut metabolites (3)

Schematic of objectives of the current study. The bold solid arrow represents our main objective: to investigate the relationship between gut metabolites and child weight outcomes (1). Lighter arrows represent related objectives: to investigate effects of covariates recorded on children and mothers as direct effects (solid) and through interactions with butyrate (dashed) on child weight outcomes (2). We also investigate the relationship between the gut microbiota and the gut metabolites (3)

METHODS

Study population characterization and sample collection

Our sample included 170 of the 279 children enrolled in the INSIGHT study. These children are full‐term singletons born to primiparous mothers in Central Pennsylvania and are largely of European descent (Table S1). Stool samples were collected during a study visit occurring around the second birthday of each child. Specifically, the samples were collected by mothers from their child's diaper into sterile containers, placed into the home freezer and transported to the visit (on ice, wrapped in ice packs, in thermo‐envelopes). These samples were then stored at –80°C in the laboratory. During the same visit, weight and standing height of each child were measured by trained research nurses using an electronic scale (Seca 354) and a portable stadiometer (Shorr Productions, Olney, MD), respectively. For each child, we calculated BMI (weight in kilograms divided by squared height in meters), BMI z‐score (BMI adjusted for age and sex), and growth index (weight in kilograms divided by height in meters). Guided by previous literature, we considered a pool of covariates concerning children and mothers that could potentially contribute to children's weight outcomes. These included the child's sex (female/male; recorded from medial records) ; the child's exact age (in months) at the two‐year study visit (calculated from date of birth from the medical record and date of visit); the INSIGHT intervention study group (control/responsive parenting) ; the child's exposure to antibiotics between birth and 2 years (no/yes, determined from parental survey) ; maternal gestational weight gain status (below or at guidelines/exceeded guidelines, determined from medical records) ; delivery mode (vaginal/caesarean section; recorded from medical records) ; maternal gestational diabetes status (none/controlled with diet and exercise, determined from maternal survey) ; maternal smoking status during pregnancy (no/yes, determined from maternal survey) ; and the frequencies of 12 food group variables reflecting the qualitative pattern of the child's diet (determined by Infant Food Frequency Questionnaires completed by the parents). More details on these food group variables are provided in Table 1 and Table S2 and in Preprocessing of the Covariates section below.
TABLE 1

Summary of the maternal and child‐related covariates collected on the 170 INSIGHT children included within this study

Covariates n = 170
Child's sex (n = female)84 (49%)
Intervention group (n = intervention)88 (52%)
Antibiotic usage from birth through age 2 years (n = yes)110 (65%)
Maternal gestational weight gain status (n = exceeded guidelines)91 (54%)
Maternal smoking status during pregnancy (n = yes)13 (8%)
Mode of delivery (n = caesarean section)46 (27%)
Maternal gestational diabetes (n = controlled with diet and exercise)9 (5%)
Child's exact age at two‐year study visit (months) median (median absolute deviation)/range24.23 (0.27)/24.01‐25.49
Food groups a (per day consumption frequency) median (median absolute deviation)/range
Dairy [8]3.82 (1.7)/0.50‐12.42
Fried foods [3]0.32 (0.27)/0‐2.07
Fruit juice [1]0.36 (0.53)/0‐6
Fruits [18]2.99 (1.43)/0‐18.64
Grains [11]1.94 (0.74)/0‐6.86
Meats [11]1 (0.56)/0‐3.50
Mixed foods [10]0.85 (0.53)/0‐4.36
Non‐meat proteins [3]0.64 (0.42)/0‐3
Snacks [5]1 (0.53)/0‐5.07
Sweets [12]0.60 (0.59)/0‐3.29
Vegetables [19]2.14 (1.19)/0.28‐10.83
Water [1]3 (2.22)/0‐6

See Table S1 for the food items considered within each food group. Value in square brackets indicates the number of food items considered within that food group.

Summary of the maternal and child‐related covariates collected on the 170 INSIGHT children included within this study See Table S1 for the food items considered within each food group. Value in square brackets indicates the number of food items considered within that food group.

Metabolomic profiling

We performed 1H‐NMR profiling following the steps described by Tian and colleagues ; the raw 1H‐NMR spectra were generated in two batches. We used the Chenomx NMR Suite of tools (Alberta, Canada) to process the raw spectra (calibrating internal standard, correcting phase and baseline) and fit metabolite profiles following the steps described in Nichols and colleagues. Initially we targeted 35 metabolites, namely: 4‐pyridoxate, acetate, alanine, aspartate, butanone, butyrate, creatine, formate, fumarate, galactose, glucose, glutamate, glycine, histamine, histidine, hypoxanthine, isocitrate, isoleucine, lactate, leucine, methionine, niacinamide, O‐phosphocholine, phenylalanine, proline, propionate, pyruvate, succinate, trimethylamine N‐oxide, tryptophan, tyrosine, uracil, valine, xanthine, and xylose. Because the metabolite assignments of butanone, histamine, isocitrate, niacinamide, and trimethylamine could not be verified with 2D‐NMR spectra, we removed these five metabolites from further analyses. The 30 verified metabolites were profiled using the “batch fit” tool in Chenomx and then manually calibrated and quantitated for each sample. The profiling procedure described above introduced two sources of variation that we adjusted for in our analyses (see below): (i) batch effect (raw 1H‐NMR spectra were generated in two batches) and (ii) processor effect (metabolites were calibrated and quantitated using Chenomx software by D.N. and S.J.C.C.).

Statistical analyses

We performed the statistical analyses in two phases: Phase 1 included preprocessing, transforming, cleaning the data and then using the cleaned data to detect statistically significant metabolites associated, separately, with children's BMI, BMI z‐score, and growth index (covariates were not considered in this phase). To validate metabolites' roles after adjusting for the covariates and to investigate the effects of the latter, Phase 2 included analyses of the metabolites selected in Phase 1 along with the covariates and the metabolite‐covariate interactions. See Figure S1 for a visual representation of this workflow.

Data preprocessing

Preprocessing of the metabolite concentrations

First, we normalized the metabolite concentrations in each sample with respect to the total concentration of all the 30 metabolites, , , creating relative concentration, thus accounting also for the dry weight of the samples. We also mitigated the high skewness of the relative concentration values (Figure S2) using a logarithmic transformation. The histograms in Figure S3 indicate that the log‐transformed relative concentrations of the metabolites are fairly regularly distributed (they do not deviate strongly from the Gaussian distribution). Outliers were omitted from this analysis (Figure S4). We then adjusted the log‐transformed relative concentrations of the metabolites for potential effects of batch and data processor. For each metabolite, we fitted a two‐way ANOVA for log‐transformed relative concentrations on “batch” and “processor” (Table S3). The residuals from these ANOVA fits were used as “adjusted metabolite concentrations” in subsequent analyses. We centred and scaled each of these 30 adjusted metabolite concentrations separately to have zero mean and unit variance , , ; almost all of the transformed adjusted metabolite concentrations fairly resemble the Gaussian distribution (Figure S5).

Preprocessing of the covariates

The food group variables included as covariates in our Phase 2 analyses (Table 1) are derived from a modified food frequency questionnaire. The raw data from the questionnaire are the same as the data in our previous study ; however, we modified the grouping of the food items and created a new food group variable (“Mixed foods”) to more accurately represent the diet. We started with a total of 121 food items divided into 16 food groups (Table S2): beans (4 items), dairy (8 items), fat/oils (4 items), fried foods (3 items), fruit juice (1 item), fruits (18 items), grains (11 items), meats (11 items), mixed foods (10 items), non‐meat proteins (3 items), non‐sugar sweetened beverages (4 items), snacks (5 items), sugar‐sweetened beverages (7 items), sweets (12 items), vegetables (19 items), and water (1 item). For each food item, mothers reported how often their child had consumed it in the past week using response options of 0, 1, 2–3, 4–6 times per week, 1, 2, 3, 4–5 or 6 or more times per day. We converted all records to times per day, using the medians for options that spanned a range of values (see Figure S6). We then omitted four food groups (beans, fats/oils, non‐sugar sweetened beverages and sugar‐sweetened beverages) for which all items had zero median daily consumption frequencies. Thus, we had a total of 12 food groups included in the Phase 2 analyses (Table 1). Next, for each of the 12 retained food groups, we calculated the sum total of daily intake frequencies (summing over items within the groups; we replaced missing values with zeros). Finally, after adding 1 to the resulting frequencies, we log‐transformed them to mitigate right skewness (Figures S7 and S8), centred to zero mean and scaled to unit variance. We also log‐transformed the child's exact age in months at the time of the two‐year visit, centred to zero mean and scaled to unit variance. Seven categorical covariates were dummy‐coded as “0/1”: intervention group (safety‐control/parenting‐intervention), child's sex (female/male), child's antibiotic exposure between birth and 2 years (never/yes), maternal gestational weight gain status (below or at guidelines/exceeded guidelines), mode of delivery (caesarean section/vaginal), maternal diabetes mellitus status (none/controlled with diet and exercise), and maternal smoking status during pregnancy (no/yes). For the LASSO regression analysis in Phase 2, we centred and scaled each of the covariates separately, including the dummy‐coded ones, to zero mean and unit variance.

Regression analyses

In Phase 1, our regressions comprised 30 predictor variables (i.e., explanatory variables used in the linear regression models); namely, the 30 adjusted metabolite concentrations. We ran two alternative procedures: (a) LASSO (an L1‐penalized linear regression method which performs predictor selection) followed by a post‐selection least squares fit; and (b) Bayesian information criterion (BIC , ) best subset selection (an exhaustive search over all possible model sizes for linear regression which selects the “best” model as per the BIC criterion), also followed by a post‐selection least squares fit. For LASSO, the penalization parameter was tuned with 10‐fold cross‐validation repeated 100 times on 100 random fold partitions of the data. The selected tuning parameter value, which determines the number of predictors retained by the LASSO, corresponds to the minimum average (over the 100 repetitions) cross‐validation mean‐squared error. For the BIC best subset selection procedure, we examined all possible models with sizes ranging from 1 up to 30 predictors and selected the model with the minimum BIC. These analyses were completed separately treating each of three weight outcomes (BMI, BMI z‐score, and growth index) as the response variable. In Phase 2, our regressions comprised 41 predictor variables; namely, the butyrate adjusted metabolite concentration (selected in Phase 1; see below), 20 covariates, and the two‐way interactions of the butyrate adjusted metabolite concentration with each of the 20 covariates (a total of 20 interaction terms). Just as in Phase 1, we implemented (a) LASSO and (b) BIC best subset selection regression procedures, each followed by a post‐selection least squares fit. For LASSO, we tuned the penalization parameter using the same strategy as in Phase 1. For the BIC best subset selection, we again examined all possible models, with sizes ranging from 1 up to 41. Similar to Phase 1, these analyses were completed separately treating each of the three weight outcomes (BMI, BMI z‐score, and growth index) as the response variable. We performed these statistical analyses using the R software (version 4.0.2). The LASSO procedure with repeated 10‐fold cross‐validation tuning was implemented using the ipflasso CRAN package and the BIC best subset selection procedure was implemented using the leaps CRAN package. All codes are publicly available on GitHub: https://github.com/makovalab-psu/ChildhoodObesity_1HNMR-Metabolomics.

Microbiota association

The gut microbiota of the INSIGHT children (from the same stool samples used in this study) were characterized by our group in a previous study. The raw data are available on dbGaP (accession number phs001498.v1.p1). From these data, we selected all bacteria identified as a member of the Firmicutes phylum. Next, using information from Vital et al., we aggregated the abundances of bacteria identified as known or candidate butyrate producers; first as all butyrate producing bacteria, then as butyrate producing Firmicutes, and finally as individual genera within the butyrate producing Firmicutes. See Table S4 for a list of the bacterial groups included in this analysis. We used the statistical software R (version 3.6.1); correlations were interrogated using the rcorr() function within the Hmisc CRAN package and graphs were generated using the ggplot2 CRAN package.

RESULTS

Study population and children's weight gain indicators

We considered 170 children enrolled in the INSIGHT , study who provided stool specimens at the 2‐year study visit (see above). A variety of clinical, anthropometric and dietary information was collected on the children along with pregnancy health variables on their mothers (Table 1). We considered the children's height and weight collected during the same visit to compute our main weight indicator—BMI (ratio of weight to squared height, measured in kg/m2). BMI ranged from 13.9 to 20.7, with a median of 16.5. Using CDC guidelines, we determined that 4.84% of the children considered were underweight, 79.0% had normal‐weight, 11.8% were overweight and 4.30% had obesity (Figure S9). As additional weight outcomes, we computed the BMI z‐score (BMI adjusted for age and sex) and the growth index (ratio of weight to height, measured in kg/m). We used all these three outcomes to investigate the relationships among children's weight, metabolites, and other covariates. Figure S10 demonstrates the significant positive correlations between the pairs from these three outcomes over the range of values covered by our study (A) BMI versus GI: R  = 0.856; p‐value <2.20 × 10−16; (B) BMI z‐score versus GI: R  = 0.822; p‐value <2.20 × 10−16; (C) BMI z‐score versus BMI: R  = 0.983; p‐value <2.20 × 10−16.

Analysing metabolites: Butyrate is positively associated with child weight outcomes

The stool samples collected from 170 children were used to generate 1H‐NMR spectra to determine the concentrations for 30 metabolites (see Methods for details). To identify metabolites associated with weight outcomes, we regressed each weight outcome (response) on 30 adjusted metabolite concentrations (predictors; here defined to mean explanatory variables used in the linear regression models). To conduct each of the regression analyses, we used the LASSO procedure and, separately, the BIC best subset selection procedure, each followed by a post‐selection least squares fit (Phase 1 in Figure S1). Considering BMI as the weight outcome, LASSO retained only two metabolites: butyrate and glycine. Of these, only butyrate was significant in the post‐selection least squares fit (butyrate: p‐value = 0.0120, = 0.295; glycine: p‐value = 0.147, = −0.169). Similarly, using the BIC best subset procedure, butyrate was the only metabolite selected (p‐value = 3.58 × 10−4; = 0.375; R 2 = 0.0729, adjusted R 2 = 0.0677). To assess the out‐of‐sample predictive performance of butyrate for BMI, we considered 1000 independent 70%:30% random splits of our data into training and test sets (n total = 170, n training = 120, and n test = 50); the median R 2 and the median root mean squared prediction error across the 1000 test sets were 0.0699 and 1.35, respectively. Relatedly, BMI has a significant positive correlation with butyrate adjusted metabolite concentration (R  = 0.0729, p‐value = 3.58 × 10−4, Pearson correlation; Figure 2(A)). Moreover, we found (Figure 2(B)) a significantly lower mean butyrate adjusted metabolite concentration in children with normal weight (mean concentration = –0.0387) than in those with overweight (mean concentration = 0.208, p‐value = 0.0312; one‐sided, two‐sample t‐test) or obesity (mean concentration = 0.339, p‐value = 0.0176; one‐sided, two‐sample t‐test).
FIGURE 2

Relationship between butyrate adjusted metabolite concentration and BMI of 170 INSIGHT children. (A) A scatterplot of BMI against butyrate concentration with a simple regression trend line (R2 = 0.0729, p‐value = 3.58 × 10−4). The colours (horizontal bands) correspond to approximate BMI class cut‐offs (yellow = underweight, blue = normal weight, green = overweight, purple = obese) and match those in panel (B). (B) Boxplots of butyrate concentration by BMI class. The width of the boxes are proportional to the number of individuals within each class (underweight: 9, normal weight: 137, overweight: 16, obese: 8). a = children with overweight have greater average butyrate adjusted metabolite concentration than children with normal weight, p‐value = 0.0312 (one‐sided t‐test). b = children with obesity have greater average adjusted metabolite concentration than children with normal weight, p‐value = 0.0176 (one‐sided t test)

Relationship between butyrate adjusted metabolite concentration and BMI of 170 INSIGHT children. (A) A scatterplot of BMI against butyrate concentration with a simple regression trend line (R2 = 0.0729, p‐value = 3.58 × 10−4). The colours (horizontal bands) correspond to approximate BMI class cut‐offs (yellow = underweight, blue = normal weight, green = overweight, purple = obese) and match those in panel (B). (B) Boxplots of butyrate concentration by BMI class. The width of the boxes are proportional to the number of individuals within each class (underweight: 9, normal weight: 137, overweight: 16, obese: 8). a = children with overweight have greater average butyrate adjusted metabolite concentration than children with normal weight, p‐value = 0.0312 (one‐sided t‐test). b = children with obesity have greater average adjusted metabolite concentration than children with normal weight, p‐value = 0.0176 (one‐sided t test) Considering BMI z‐score (BMI adjusted for age and sex) in place of BMI as the weight outcome, LASSO selected butyrate, glycine, and lactate adjusted metabolite concentrations, but only butyrate was significant in the post‐selection least squares fit (butyrate: p‐value = 0.0143, = 0.204; glycine: p‐value = 0.532, = –0.0579; lactate: p‐value = 0.333, = –0.0853). Again, the BIC best subset procedure selected only butyrate, which was significant in the post‐selection least squares fit (p‐value = 3.47 × 10−4, = 0.263, R  = 0.0736, adjusted R  = 0.0680). The median out‐of‐sample R and the median root mean squared prediction error of butyrate for BMI z‐score were 0.0719 and 0.948, respectively. As an additional verification of butyrate's association with child's growth, we considered growth index as the third weight outcome (response) and obtained reassuringly similar results compared to those for BMI and BMI z‐score. LASSO retained acetate and butyrate adjusted metabolite concentrations, but only the latter was significant in the post‐selection least squares fit (butyrate: p‐value = 0.0108, = 0.284; acetate: p‐value = 0.212, = 0.138). The BIC best subset procedure selected again only butyrate, which was significant in the post‐selection least squares fit (p‐value = 7.73 × 10−4; = 0.342; R 2 = 0.0653, adjusted R 2 = 0.0597). As with the other outcomes, the median out‐of‐sample R 2 and the median root mean squared prediction error of butyrate for growth index were 0.0654 and 1.30, respectively.

Analysing metabolites and covariates

The relationship between metabolites in the stool and a child's growth outcome may need to be considered in conjunction with some of the covariates recorded on INSIGHT children and their mothers. To investigate this possibility, we considered 20 potentially relevant covariates (Table 1, see Methods for details). To incorporate these covariates into our analysis, we considered a regression with the child's growth outcome as the response, and predictor terms, which included the butyrate adjusted metabolite concentration (identified from the Phase 1 analysis), 20 covariates, and the interactions between butyrate adjusted metabolite concentration and each covariate. Using a similar regression approach as in Phase 1, we performed: (a) LASSO and (b) BIC best subset selection, each followed by a post‐selection least squares fit on the selected terms (see Phase 2 in Figure S1). When BMI is used as the outcome, the LASSO procedure retained two terms: butyrate adjusted metabolite concentration and maternal smoking status during pregnancy. The post‐selection least squares fit on these two terms produced an R 2 of 0.110, an adjusted R 2 of 0.0998, and a p‐value of 5.71 × 10−5; both terms with significant positive coefficient estimates ( = 0.405 and 1.01, respectively) (Table 2). The BIC best subset procedure selected a model with three terms, all significant in the post‐selection least squares fit (Table 2 and Figure S11): butyrate adjusted metabolite concentration and maternal smoking status during pregnancy—both with positive coefficient estimates ( = 0.406 and 1.04, respectively), and the interaction between butyrate adjusted metabolite concentration and the child's exact age—with a negative coefficient estimate ( = −0.245). The R 2 and the adjusted R 2 for this linear model were, respectively, 0.137 and 0.122, and the p‐value was 1.87 × 10−5.
TABLE 2

Significant predictors of BMI, BMI z‐score, and growth index identified in Phase 2 analyses through two different regression procedures: (1) LASSO and (2) BIC best subset selection, each followed by post‐selection least squares fits

BMIBMI z‐scoreGrowth index
LASSO + post‐selection fit a BIC best subset + post‐selection fit b LASSO + post‐selection fit c BIC best subset + post‐selection fit d LASSO + post‐selection fit e BIC best subset + post‐selection fit f
Butyrate adjusted metabolite concentration 0.405 (1.02 × 10 −4 ) g 0.406 (8.02 × 10 −5 ) g 0.263 (3.47 × 10 −4 ) g 0.298 (3.30 × 10 −5 ) g 0.361 (2.33 × 10 −4 ) g 0.340 (6.11 × 10 −4 ) g
Maternal smoking status during pregnancy (yes vs. no) 1.01 (9.01 × 10 −3 ) g 1.04 (6.33 × 10 −3 ) g 0.745 (4.91 × 10 −3 ) g 0.661 (0.0822)
Child's exact age at a 2‐year study visit: Butyrate adjusted metabolite concentration −0.245 (0.0241) −0.187 (0.0128) −0.184 (0.0844)
Per day meats consumption frequency0.185 (0.0630) 0.252 (0.0103) g
Maternal gestational weight gain status (exceeded vs. below/at guidelines)0.371 (0.0597) 0.497 (0.0115) g
Child sex (male vs. female)0.363 (0.0565)
Per day grains consumption frequency 0.157 (0.0252)

Note: Each cell reports the estimated coefficient with corresponding p‐value in parentheses. Significant terms (p‐values <0.05) are boldfaced.

For BMI, post LASSO selection (two terms) least squares fit: R 2 = 0.110, adjusted R 2 = 0.0998, and p‐value = 5.71 × 10−5.

For BMI, post BIC best subset selection (three terms) least squares fit: R 2 = 0.137, adjusted R 2 = 0.122, and p‐value = 1.87 × 10−5.

For BMI z‐score, post LASSO selection (one term) least squares fit: R 2 = 0.0736, adjusted R 2 = 0.0680, and p‐value = 3.47 × 10−4.

For BMI z‐score, post BIC best subset selection (four terms) least squares fit: R 2 = 0.164, adjusted R 2 = 0.144, and p‐value = 5.53 × 10−6.

For growth index, post LASSO selection (seven terms) least squares fit (six significant terms shown in the table): R 2 = 0.198, adjusted R 2 = 0.163, and p‐value = 6.55 × 10−6.

For growth index, post BIC best subset selection (three terms) least squares fit: R 2 = 0.134, adjusted R 2 = 0.118, and p‐value = 2.58 × 10−5.

The significance of these predictors would remain unaffected by Bonferroni's multiple testing p‐value adjustments at α = 0.05 level of significance.

Significant predictors of BMI, BMI z‐score, and growth index identified in Phase 2 analyses through two different regression procedures: (1) LASSO and (2) BIC best subset selection, each followed by post‐selection least squares fits Note: Each cell reports the estimated coefficient with corresponding p‐value in parentheses. Significant terms (p‐values <0.05) are boldfaced. For BMI, post LASSO selection (two terms) least squares fit: R 2 = 0.110, adjusted R 2 = 0.0998, and p‐value = 5.71 × 10−5. For BMI, post BIC best subset selection (three terms) least squares fit: R 2 = 0.137, adjusted R 2 = 0.122, and p‐value = 1.87 × 10−5. For BMI z‐score, post LASSO selection (one term) least squares fit: R 2 = 0.0736, adjusted R 2 = 0.0680, and p‐value = 3.47 × 10−4. For BMI z‐score, post BIC best subset selection (four terms) least squares fit: R 2 = 0.164, adjusted R 2 = 0.144, and p‐value = 5.53 × 10−6. For growth index, post LASSO selection (seven terms) least squares fit (six significant terms shown in the table): R 2 = 0.198, adjusted R 2 = 0.163, and p‐value = 6.55 × 10−6. For growth index, post BIC best subset selection (three terms) least squares fit: R 2 = 0.134, adjusted R 2 = 0.118, and p‐value = 2.58 × 10−5. The significance of these predictors would remain unaffected by Bonferroni's multiple testing p‐value adjustments at α = 0.05 level of significance. Next, we performed the selection procedure using BMI z‐score as the weight outcome. The LASSO retained only one term, butyrate adjusted metabolic concentration, which remained significant in the post‐selection least squares fit (Table 2; = 0.263, R 2 = 0.0736, adjusted R 2 = 0.0680, p‐value = 3.47 × 10−4). Using the BIC best subset procedure, a model with four terms was selected (Figure S12). These four terms remained significant in the post‐selection least squares fit (Table 2; R 2 = 0.164, adjusted R 2 = 0.144, p‐value = 5.53 × 10−6). Three terms (butyrate adjusted metabolite concentration, = 0.298; maternal smoking status during pregnancy, = 0.745; and child's per day grains consumption frequency, = 0.157) had positive coefficient estimates, and one term (child's exact age at two‐year visit interacting with butyrate adjusted metabolite concentration, = –0.187) had a negative coefficient estimate. Finally, we repeated the analysis using the same predictors, but using the growth index as the response. The LASSO procedure retained seven terms. The post‐selection least squares fit on these seven terms produced an R 2 of 0.198, an adjusted R 2 of 0.163, and a p‐value of 6.55 × 10−6. However, only one of the seven terms, butyrate adjusted metabolite concentration, was significant (p‐value = 2.33 × 10–4; Table 2) with a positive coefficient estimate ( = 0.361). The BIC best subset procedure selected a model with three terms, all with significant positive coefficient estimates in the post‐selection least squares fit—butyrate adjusted metabolite concentration ( = 0.340), child's daily meats consumption frequency (= 0.252), and maternal gestational weight gain status ( = 0.497) (Table 2 and Figure S13; R 2 = 0.134, adjusted R 2 = 0.118, p‐value = 2.58 × 10−5).

Butyrate concentration is associated with the abundance of butyrate synthesizing Firmicutes

Butyrate is a metabolic product of the microbiota. To connect our results to prior analyses of the gut microbiota that we had published using the same samples from children enrolled in INSIGHT, we tested for associations between bacterial groups and butyrate adjusted metabolite concentration. First, we considered the cumulative abundance (determined by classified normalized read counts from 16S rDNA gene sequencing) of all bacteria assigned to the Firmicutes phylum, as the majority of butyrate‐producing bacteria belong to this phylum. We found no significant association between this abundance and butyrate adjusted metabolite concentration (R  = 0.00391, p‐value = 0.428; Figure 3(A)). Second, we only considered bacteria that could synthesize butyrate, as identified by Vital and colleagues based on the presence of genes encoding enzymes in the butyrate synthesis pathway, and tested the association of their abundances with butyrate concentration. We found that the abundance of bacteria in this group did correlate positively with butyrate adjusted metabolite concentration (R  = 0.0315, p‐value = 0.0234; Figure 3(B)). Third, among these butyrate‐producing bacteria, we further focused on Firmicutes, which represent the majority of bacteria within this group. This increased the significance of the positive association with butyrate adjusted metabolite concentration (R  = 0.0341, p‐value = 0.0184; Figure 3(C)). Finally, among butyrate‐producing Firmicutes, those belonging to the genus Faecalibacterium seem to drive the observed relationship, with an even stronger association between their abundance and butyrate adjusted metabolite concentration (R  = 0.0412, p‐value = 9.61 × 10−3; Figure 3(D)). Thus, butyrate‐producing Firmicutes, and those in the Faecalibacterium genus in particular, appear to be important contributors to determining the stool butyrate adjusted metabolite concentration in our samples.
FIGURE 3

Relationship between the abundance of different groups of bacteria and butyrate adjusted metabolite concentration. (A) All bacteria classified as Firmicutes; (B) All bacteria identified in Vital et al. as having butyrate synthesis potential; (C) Firmicute bacteria with butyrate synthesis potential, as identified in Vital et al.; (D) Faecalibacterium (one outlier was removed). The black solid lines indicate the fitted linear regression models; the R2 and the p‐values correspond to Pearson's correlation

Relationship between the abundance of different groups of bacteria and butyrate adjusted metabolite concentration. (A) All bacteria classified as Firmicutes; (B) All bacteria identified in Vital et al. as having butyrate synthesis potential; (C) Firmicute bacteria with butyrate synthesis potential, as identified in Vital et al.; (D) Faecalibacterium (one outlier was removed). The black solid lines indicate the fitted linear regression models; the R2 and the p‐values correspond to Pearson's correlation

DISCUSSION

In this study, we found that stool butyrate concentration has a significant positive association with a child's weight outcomes at age 2 years. It is consistently selected as a significant predictor in all regression models for each one of the three child weight outcomes we considered as the response—BMI, BMI z‐score and growth index. Butyrate is a SCFA, a group of metabolic products formed by anaerobic microbial fermentation of non‐digestible carbohydrates (such as resistant starch, dietary fibre and other polysaccharides) in the host small intestine and colon. The SCFAs, acetate and propionate, are largely produced by the Bacteroidetes phylum, whereas butyrate is mainly the product of the Firmicutes phylum. SCFAs can influence host physiology by signalling through the free fatty acid receptors GPR41 and GPR43. When activated, these receptors lead to an increase in the satiety hormone PYY and in intestinal mobility. GPR41 activation also stimulates the expression of leptin in adipocytes and increases hepatic lipogenesis. Furthermore, butyrate and propionate play a role in decreasing appetite through the formation of the gut hormone GLP‐1. To date, the evidence concerning the association of butyrate with weight gain has been mixed. However, consistent with our findings, some studies have shown higher levels of butyrate in children and adults with obesity. , This could be due to a higher abundance of gut bacteria that ferment carbohydrates in individuals with obesity, which subsequently results in an increase in production of SCFAs. This increase in SCFAs can provide extra energy to the host that can be stored as lipids or glucose. Other studies have shown an anti‐inflammatory/anti‐obesity role for butyrate. , , This metabolite is likely to play an important role in gut homeostasis, possibly through the gut‐brain axis. , However, investigating the mechanism by which butyrate influences a child's weight is outside the scope of this study—a larger, longitudinal study collecting data on the host gut cell physiology and the utilization of metabolites (as in Cuesta‐Zuluaga et al. ) would be necessary to elucidate mechanistically the metabolome's effects on human health. This study provides evidence of an association between butyrate adjusted metabolite concentration and child weight outcomes at the age of 2 years. This brings us closer to more fully understanding factors, which could impact an individual's risk of developing obesity. As mentioned above, butyrate (and other SCFAs) could act as an intermediary between the gut microbiota and the host physiology. Since we have previously characterized the microbiota of these same samples, it seemed logical to examine the relationship between the microbiota and butyrate. When we combined the abundance of all bacteria categorized as Firmicutes, there was no significant association between bacterial abundance and butyrate concentration. However, although the majority of bacteria that synthesize butyrate belong to the Firmicutes phylum, not all Firmicutes produce butyrate. Instead, we grouped bacteria according to function (rather than according to phylogeny) based on the presence of butyrate synthesis pathway genes. This approach proved successful in identifying Faecalibacterium as potentially driving the association between the microbiota and butyrate concentration. These results are consistent with literature on Faecalibacterium as a butyrate producer , being found in higher concentrations in the gut of children with obesity when compared to children with normal weight. , , It is of course important to note that our study established statistical associations among butyrate, microbiota, and BMI—not causative links. We could be observing a compensatory increase in butyrate production to counter the effects of increased BMI or we could be observing a change in butyrate production as a result of a dysbiosis in the gut microbiota. Furthermore, we are also only investigating an association between butyrate concentration and microbiota abundance, which is different from a direct assessment of the microbiota's production of butyrate. There are other potential sources for butyrate in the gut that we cannot account for (e.g., archaea, fungal, dietary sources etc.). In order to properly unravel causality, a large, longitudinal study that records changes in microbiota community composition (including all micro‐organisms and not just bacteria), anthropometric measurements, physiological measurements of gut cell health, and relevant covariate data (e.g., diet, health history, etc.) is essential. We used three related outcomes in order to show stability in the results of the statistical analyses. For example, butyrate was selected as a significant variable regardless of the weight outcome considered. The selection of covariates was less consistent, although several of them were selected across multiple outcomes. One such strong finding was maternal smoking status during pregnancy, which had a significant positive relationship with BMI and BMI z‐score based on both regression procedures, and a marginally significant positive relationship with growth index based on one of the regression procedures. Many previous studies have documented a role of maternal smoking status during pregnancy, which was found to be positively associated with obesity risk for the child. , , , , , , , , However, there is no consensus on the underlying mechanism linking maternal smoking to a child's weight gain. Another strong covariate found to have a significant negative association with two weight outcomes (BMI and BMI z‐score) and a marginally significant for the third (growth index) was the interaction between a child's exact age (in months) and butyrate adjusted metabolite concentration. This negative statistical effect on weight outcome suggests that the positive statistical effect of butyrate adjusted metabolite concentration in relation to weight outcome decreases in older children (here, the term “statistical effect” refers to the estimated coefficient in the regression model). This could potentially be related to the plasticity of the gut microbiota. For instance, Rivière and colleagues found that the abundance of Clostridial clusters is low immediately after birth, and increases between the ages of 6 and 24 months. Not until around 6 years of age are these bacteria at a higher abundance where they remain through adulthood. Note that the range of children's ages in our study is very narrow (24.01–25.49 months, as recorded at the two‐year study visit when stool samples were collected)—yet these ~1.5 months could be sufficient for the gut microbiota to have partially evolved in composition, particularly if there is a concurrent evolution in diet, health, or behaviour. However, to the best of our knowledge, microbiota changes during this small amount of time have not been evaluated previously and should be considered in future studies. Finally, there are several covariates that are statistically significantly associated with just one weight outcome, such as, excessive maternal gestational weight gain, frequency of a child's per day meats consumption (significantly positive coefficient estimates in regressions with growth index), and frequency of a child's per day grains consumption (significant positive coefficient estimate in regressions with BMI z‐score). These variables have also been shown in the literature to be associated with child weight , , but in this study, their association with child's weight seems to be less compelling. However, these data could serve to support future hypotheses regarding the impacts of maternal factors and diet on children's weight trajectories.

Statistical considerations

The LASSO and the BIC best subset selection regression procedures produced non‐identical but consistent results for the three weight outcomes considered—BMI, BMI z‐score and growth index. Many other statistical techniques could be used for analysing the metabolomics data. Among the ones most frequently used in the field are Principal Components Analysis , and clustering, which are unsupervised techniques, and orthogonal partial least squares—regression/discriminant analysis (OPLS/OPLS‐DA ), which are supervised techniques. , , , The regression approach we employed in this study provides a simple and useful alternative. Its main advantage is its interpretability, as we are able to focus on the selection of individual predictor terms (metabolites, covariates, and interactions) instead of linear combinations, as produced by methods such as the OPLS. Additionally, as previously mentioned, the use of three weight outcomes, and of two regression procedures for each, helps us validate our selection of relevant variables and gauge the robustness of our conclusion. Variables that are selected across multiple outcomes and multiple methods display stable statistical signals. From a different perspective, considering multiple outcomes may also reveal associations that are outcome‐specific, and should be considered when conducting multi‐study meta‐analyses. Our analyses relied on the tuning of the penalization parameter in the LASSO and on the choice of a criterion for the best subset selection regression procedures. We tuned the LASSO penalization parameter minimizing a 10‐fold cross‐validation error, averaged over 100 repetitions of the cross‐validation random fold splits. For best subset selection, we used the BIC—but other criteria such as the Akaike Information Criterion , or Mallow's Cp , could also be used.

Limitations and future directions

The individuals enrolled in the INSIGHT study were recruited from Central Pennsylvania and are largely of European descent (see Table S1). Consequently, our results might not generalize to other populations. Furthermore, our sample size (n = 170) is rather modest—though similar to those of other metabolomic studies. , The sample size is one consideration for deciding between a targeted versus an untargeted metabolomics approach, as the dimension of the data increases by orders of magnitude in an untargeted metabolomics approach. In our case, with a modest sample size, a targeted metabolomics approach was more realistic. However, this does limit our ability to detect novel associations. It is possible that metabolites we did not query could impact child weight as well. A significant advantage of our cohort over some other cohorts studied is its deep characterization, that is, the wealth of information on children's and mothers' covariates that were collected and incorporated into our analyses. Nevertheless, for a more comprehensive understanding of the effect of the metabolome on children weight outcomes, our results should be replicated with larger cohorts of children belonging to different ethnicities, ideally also incorporating additional phenotypic and clinical covariates along with longitudinal information. The results that we discuss here are cross‐sectional (i.e., the data were collected at a single time point) and as such we can only report on associations between the metabolites and child weight. A longitudinal study would be able to address how changes in the metabolites relate to changes in a child's weight over time. While the present study focused on metabolites, multiple cross‐sectional “omics” data sets are available or are in the process of being generated for the INSIGHT children cohort—including data on microbiota, genetic and epigenetic variants. Moreover, longitudinal metabolomic and microbiomic profiles are being collected for the SIBSIGHT cohort , (second born siblings of the children within INSIGHT). This will allow us to leverage “multi‐omics” approaches to consolidate and expand our findings on childhood obesity. “Multi‐omics” approaches are indeed becoming increasingly popular to gain a comprehensive understanding of diseases—including obesity and other metabolic disorders. , , , , ,

CONFLICT OF INTERESTS

The authors declare no financial or other conflicts of interests.

AUTHOR CONTRIBUTIONS

Study conception and design (Sarah J. C. Craig and Kateryna D. Makova), data generation (Sarah J. C. Craig, Debmalya Nandy, Andrew D. Patterson, Jingwei Cai, and Yuan Tian), statistical analyses (Debmalya Nandy, Francesca Chiaromonte, Matthew L. Reimherr, and Sarah J. C. Craig), resources (Kateryna D. Makova, Francesca Chiaromonte, Ian M. Paul, Emily E. Hohman, Michele E. Marini, and Jennifer S. Savage), manuscript preparation (Sarah J. C. Craig, Debmalya Nandy, Matthew L. Reimherr, Francesca Chiaromonte, Kateryna D. Makova, and comments from all co‐authors). Data S1. Supporting information. Click here for additional data file.
  70 in total

1.  Faecalibacterium prausnitzii produces butyrate to decrease c-Myc-related metabolism and Th17 differentiation by inhibiting histone deacetylase 3.

Authors:  Mingming Zhang; Lixing Zhou; Yuming Wang; Robert Gregory Dorfman; Dehua Tang; Lei Xu; Yida Pan; Qian Zhou; Yang Li; Yuyao Yin; Shimin Zhao; Jianlin Wu; Chenggong Yu
Journal:  Int Immunol       Date:  2019-07-30       Impact factor: 4.823

2.  Targeted profiling: quantitative analysis of 1H NMR metabolomics data.

Authors:  Aalim M Weljie; Jack Newton; Pascal Mercier; Erin Carlson; Carolyn M Slupsky
Journal:  Anal Chem       Date:  2006-07-01       Impact factor: 6.986

3.  Does maternal smoking during pregnancy cause childhood overweight?

Authors:  Marius Widerøe; Torstein Vik; Geir Jacobsen; Leiv S Bakketeig
Journal:  Paediatr Perinat Epidemiol       Date:  2003-04       Impact factor: 3.980

Review 4.  The neuropharmacology of butyrate: The bread and butter of the microbiota-gut-brain axis?

Authors:  Roman M Stilling; Marcel van de Wouw; Gerard Clarke; Catherine Stanton; Timothy G Dinan; John F Cryan
Journal:  Neurochem Int       Date:  2016-06-23       Impact factor: 3.921

5.  Prevalence of Obesity and Severe Obesity in US Children, 1999-2016.

Authors:  Asheley Cockrell Skinner; Sophie N Ravanbakht; Joseph A Skelton; Eliana M Perrin; Sarah C Armstrong
Journal:  Pediatrics       Date:  2018-03       Impact factor: 7.124

Review 6.  Impact of the gut microbiota on inflammation, obesity, and metabolic disease.

Authors:  Claire L Boulangé; Ana Luisa Neves; Julien Chilloux; Jeremy K Nicholson; Marc-Emmanuel Dumas
Journal:  Genome Med       Date:  2016-04-20       Impact factor: 11.117

7.  1H NMR Metabolomics Identifies Underlying Inflammatory Pathology in Osteoarthritis and Rheumatoid Arthritis Synovial Joints.

Authors:  James R Anderson; Susama Chokesuwattanaskul; Marie M Phelan; Tim J M Welting; Lu-Yun Lian; Mandy J Peffers; Helen L Wright
Journal:  J Proteome Res       Date:  2018-10-03       Impact factor: 4.466

8.  Higher Fecal Short-Chain Fatty Acid Levels Are Associated with Gut Microbiome Dysbiosis, Obesity, Hypertension and Cardiometabolic Disease Risk Factors.

Authors:  Jacobo de la Cuesta-Zuluaga; Noel T Mueller; Rafael Álvarez-Quintero; Eliana P Velásquez-Mejía; Jelver A Sierra; Vanessa Corrales-Agudelo; Jenny A Carmona; José M Abad; Juan S Escobar
Journal:  Nutrients       Date:  2018-12-27       Impact factor: 5.717

Review 9.  Development of the Gut Microbiome in Children, and Lifetime Implications for Obesity and Cardiometabolic Disease.

Authors:  Anica I Mohammadkhah; Eoin B Simpson; Stephanie G Patterson; Jane F Ferguson
Journal:  Children (Basel)       Date:  2018-11-27

10.  Metabolomic profiling of stool of two-year old children from the INSIGHT study reveals links between butyrate and child weight outcomes.

Authors:  Debmalya Nandy; Sarah J C Craig; Jingwei Cai; Yuan Tian; Ian M Paul; Jennifer S Savage; Michele E Marini; Emily E Hohman; Matthew L Reimherr; Andrew D Patterson; Kateryna D Makova; Francesca Chiaromonte
Journal:  Pediatr Obes       Date:  2021-07-30       Impact factor: 3.910

View more
  1 in total

1.  Metabolomic profiling of stool of two-year old children from the INSIGHT study reveals links between butyrate and child weight outcomes.

Authors:  Debmalya Nandy; Sarah J C Craig; Jingwei Cai; Yuan Tian; Ian M Paul; Jennifer S Savage; Michele E Marini; Emily E Hohman; Matthew L Reimherr; Andrew D Patterson; Kateryna D Makova; Francesca Chiaromonte
Journal:  Pediatr Obes       Date:  2021-07-30       Impact factor: 3.910

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.