| Literature DB >> 35166340 |
Yile Chen1, Bing He1, Yu Liu1, Max T Aung2, Zaira Rosario-Pabón3, Carmen M Vélez-Vega3, Akram Alshawabkeh4, José F Cordero5, John D Meeker6, Lana X Garmire1.
Abstract
BACKGROUND: Preterm birth is defined by the onset of labor at a gestational age shorter than 37 weeks, and it can lead to premature birth and impose a threat to newborns' health. The Puerto Rico PROTECT cohort is a well-characterized prospective birth cohort that was designed to investigate environmental and social contributors to preterm birth in Puerto Rico, where preterm birth rates have been elevated in recent decades. To elucidate possible relationships between metabolites and preterm birth in this cohort, we conducted a nested case-control study to conduct untargeted metabolomic characterization of maternal plasma of 31 women who experienced preterm birth and 69 controls who underwent full-term labor at 24-28 gestational weeks.Entities:
Keywords: biomarkers; fatty acid; lipid; metabolic pathway; metabolomics; network; preterm
Mesh:
Substances:
Year: 2022 PMID: 35166340 PMCID: PMC8847704 DOI: 10.1093/gigascience/giac004
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Demographic and clinical characteristics in case and control groups
| Characteristic | Controls (n = 69) | Cases (n = 31) |
|
|---|---|---|---|
| Mean (SD) | |||
| Maternal age, y | 27.07 (5.91) | 24.84 (5.10) | 0.058 |
| BMI, kg/m2 | 25.55 (5.25) | 27.51 (6.92) | 0.165 |
| Gestational age, weeks | 39.20 (0.98) | 34.69 (2.08) | 1.28e−13 |
| Annual household income | 3.87 (2.12) | 2.87 (2.22) | 0.039 |
| No. | |||
| Baby sex | |||
| Female | 35 | 14 | 0.669 |
| Male | 34 | 17 | |
| Smoker | |||
| Yes | 12 | 2 | 0.215 |
| No | 57 | 29 | |
| Alcohol use | |||
| None during pregnancy | 32 | 19 | 0.294 |
| Drank before pregnancy | 32 | 9 | |
| Drank during pregnancy | 4 | 2 | |
| Unknown | 1 | 1 | |
| SGA | |||
| No | 58 | 25 | 0.567 |
| Yes | 10 | 6 | |
| Unknown | 1 | 0 | |
t-test for continuous variables and Fisher exact test for count data.
Income categories: 1 = <$4,999; 2 = $5,000–$9,999; 3 = $10,000–$19,999; 4 = $20,000–$29,999; 5 = $30,000–$39,999; 6 = $40,000–$49,999; 7 = $50,000–$74,999; 8 = $75,000–$99,999; 9 = $100,000–$199,999.
SGA: small for gestational age.
Figure 1:(A) Correlation matrix of the 10 phenotypic variables on the 100 samples (69 controls vs 31 preterm cases). (B) Partial least squares–discriminant analysis (PLS-DA) plot of the 100 samples using 333 metabolites. (C) Source of variation (SOV) analysis using 100 samples; 333 metabolites are used in the ANOVA model. (D) Heat map of correlations between 333 metabolites and 11 confounding factors. The rows represent the clinical factors, and the columns represent metabolites (point-biserial correlation for continuous and binary covariates; Pearson correlation for continuous covariates; Spearman correlation for continuous and ordinal covariates).
Figure 2:WGCNA network in all samples. (A) WGCNA network modules of metabolomics data from both preterm and control samples. Each node represents a lipid. Node color represents a module. (B) Module-trait associations.
Figure 3:Metabolites show significantly different levels in preterm and control samples. (A) Heat map of the 38 metabolites with a significant difference exclusively between preterm and control samples (P < 0.05). (B) Bar plots on the averaged normalized intensities in cases vs controls. (C) Bipartite graph of the significantly differentiated metabolites and the significantly altered metabolic pathways with which they are associated. Five pathways with a significant difference between preterm and control samples (P < 0.05) and 33 significantly differentiated metabolites engaged in these pathways are shown. Elliptical nodes: metabolites. Rectangular nodes: pathways from HMDB, PubChem, and KEGG databases. Node color: red, upregulated; blue, down-regulated. Node size: the absolute value of log fold change (logFC).
Figure 4:Classification model for preterm birth. (A) Comparison of 7 classification models using 17 metabolites on the hold-out testing. The dataset was randomly split into training data (80%) and testing data (20%) 10 times. The mean value (bars) and standard error (error bars) of the 10 repeats are shown for 3 performance metrics of the area under the receiver operating curve (AUC), F1 statistic, and balanced accuracy. The winning method RF in training data (left) was then applied to the testing data (right). (B) The heat map of correlation coefficients between the 17 metabolites and clinical variables. (C) The precision-recall curves of the RF model from (A) on classifying preterm, LGA (large for gestational age), income, and maternal age (≥35 y or not), respectively, using the same set of testing data as in (A). (D) Normalized variable importance scores for the 17 lipid markers in the RF model. The normalization is done on R by making the sum of importance scores be 100.
Figure 5:Predicted significant (P < 0.01) causality interactions between the 17 metabolites and preterm birth. Arrow indicates the causality interaction. Blue and red nodes are down- and upregulated metabolites, while the center one is preterm.
Figure 6:A proposed model of metabolite changes affecting preterm birth.