| Literature DB >> 27689004 |
Loic Yengo1, Abdelilah Arredouani2, Michel Marre3, Ronan Roussel3, Martine Vaxillaire1, Mario Falchi4, Abdelali Haoudi5, Jean Tichet6, Beverley Balkau7, Amélie Bonnefond1, Philippe Froguel8.
Abstract
OBJECTIVE: Characterizing specific metabolites in sub-clinical phases preceding the onset of type 2 diabetes to enable efficient preventive and personalized interventions. RESEARCH DESIGN AND METHODS: We developed predictive models of type 2 diabetes using two strategies. One strategy focused on the probability of incidence only and was based on logistic regression (MRS1); the other strategy accounted for the age at diagnosis of diabetes and was based on Cox regression (MRS2). We assessed 293 metabolites using non-targeted metabolomics in fasting plasma samples of 1,044 participants (including 231 incident cases over 9 years) used as training population; and fasting serum samples of 128 participants (64 incident cases versus 64 controls) used as validation population. We applied a LASSO-based variable selection aiming at maximizing the out-of-sample area under the receiver operating characteristic curve (AROC) and integrated AROC.Entities:
Keywords: High dimensional regression; LASSO; Metabolomics; Risk prediction; Type 2 diabetes
Year: 2016 PMID: 27689004 PMCID: PMC5034686 DOI: 10.1016/j.molmet.2016.08.011
Source DB: PubMed Journal: Mol Metab ISSN: 2212-8778 Impact factor: 7.422
Metabolites contributing to MRS1 and MRS2. The first nine metabolites contribute to both scores and the 15 others are specific to each score. Relative contributions ≥1/16 = 6.25% for MRS1 and ≥1/17 ≈ 5.88% for MRS2 are highlighted in bold font. Relative contribution ratios for the two scores that are above 2 are also highlighted in bold font. References are given for metabolites reported in the literature for associations with insulin resistance or prevalent and incident type 2 diabetes.
| Metabolites | Associated pathways | MRS1 | MRS2 | References | ||
|---|---|---|---|---|---|---|
| Regression coefficient | Relative contribution to the score | Regression coefficient | Relative contribution to the score | |||
| 1,5-Anhydroglucitol | Glycolysis, gluconeogenesis, Pyruvate Metabolism | −0.50 | −0.26 | |||
| 1-Linoleoyl-GPC | Lysolipid | −0.31 | 5.97% | −0.07 | 1.92% | |
| 1-Palmitoylglycerol | Monoacylglycerol | 0.16 | 3.10% | 0.25 | ||
| Cotinine | Tobacco Metabolite | 0.33 | 0.32 | |||
| γ-Glutamylphenylalanine | Gamma-glutamyl Amino Acid | 0.17 | 3.34% | 0.09 | 2.61% | |
| Glucose | Glycolysis, Gluconeogenesis, Pyruvate Metabolism | 1.03 | 0.51 | |||
| Isoleucine | Leucine, Isoleucine, Valine Metabolism | 0.28 | 5.39% | 0.27 | ||
| Mannose | Fructose, Mannose, Galactose Metabolism | 0.37 | 0.13 | 3.48% | ||
| Pro-hydroxy-pro | Urea cycle; Arginine, Proline Metabolism | −0.30 | 5.85% | −0.16 | 4.40% | |
| Fructose | Fructose, Mannose, Galactose Metabolism | 0.27 | 5.21% | |||
| γ-Glutamyltyrosine | Gamma-glutamyl Amino Acid | 0.29 | 5.59% | |||
| Isovalerylcarnitine | Leucine, Isoleucine, Valine Metabolism | 0.19 | 3.73% | |||
| Phenylalanine | Phenylalanine, Tyrosine Metabolism | 0.28 | 5.48% | |||
| Piperine | Food Component/Plant | 0.30 | 5.91% | |||
| Serine | Glycine, Serine, Threonine Metabolism | −0.31 | 6.08% | |||
| Tyrosine | Phenylalanine, Tyrosine Metabolism | −0.05 | 0.97% | |||
| 1-Stearoyl-GPI | Lysolipid | −0.26 | ||||
| 3-Hydroxyisobutyrate | Leucine, Isoleucine, Valine Metabolism | 0.15 | 4.03% | |||
| Dehydroisoandrosterone sulfate | Steroid | 0.30 | ||||
| γ-Glutamylvaline | Gamma-glutamyl Amino Acid | 0.12 | 3.35% | |||
| Glycine | Glycine, Serine, Threonine Metabolism | −0.13 | 3.45% | |||
| Palmitoyl sphingomyelin | Sphingolipid Metabolism | −0.14 | 3.93% | |||
| Stearoylcarnitine | Fatty Acid Metabolism (Acyl Carnitine) | −0.19 | 5.12% | |||
| Urea | Urea cycle; Arginine, Proline Metabolism | −0.31 | ||||
Association between MRS1/MRS2 (continuous score or categorized score) with incidence of type 2 diabetes measured with hazard and odds ratios; and with age at diagnosis.
| Training population | Validation population | |||||
|---|---|---|---|---|---|---|
| Hazard Ratio | Odds Ratio | Regression coefficient for association with | Hazard Ratio | Odds Ratio | Regression coefficient for association with | |
| Continuous MRS1 | 2.88 (2 × 10−16) | 8.44 | 0.08 year | 1.49 | 3.3 | 1 year |
| Categorized MRS1 | ||||||
| 1st tertile groups | 4.13 (6 × 10−4) | 1.78 (2 × 10−5) | 7.16 years (0.06) | 1.52 (0.30) | 1.95 (0.17) | 4.08 years (0.38) |
| Continuous MRS2 | 2.72 | 3.63 | −2.7 years | 1.63 | 1.78 | −3.75 years |
| Categorized MRS2 | ||||||
| 1st tertile groups | 3.35 (6 × 10−4) | 3.04 (2 × 10−4) | −1.12 years (0.67) | 1.97 (0.06) | 2.01 (0.13) | −0.81 year (0.83) |
Figure 1ROC Receiver operating characteristic (ROC) curves and area under these curves (AROC) statistics for three predictive models: Model 1 with clinical and biological risk factors only, Model 2 with MRS1 only, and Model 3 including clinical and biological risk factors + MRS1.
Discriminative performances of different models; model 1 including only classic clinical and biological risk factors; model 2 including MRS1 (when comparison is made using AUC) or MRS2 (when comparison is made using integrated AROC or iAROC) and model 3 including all risk factors + MRS1 or MRS2. MRS1 and MRS 2 were never added simulatneously in any models. When MRS1 or MRS2 were added in a model, impaired fasting glucose and current smoking status were not included as clinical and biological risk factors to avoid redundancy.
| Predictive models | Training population | Validation population | ||||
|---|---|---|---|---|---|---|
| AROC | iAROC | AROC | iAROC | |||
| Model 1: clinical and biological risk factors only | 83.7% | 60.5% | Model 1 | 61.2% | 52.5% | Model 1 |
| Model 2: MRS1/MRS2 only | 88.2% | 84.4% | Model 2 | 75.0% | 67.9% | Model 2 |
| Model 3: clinical, biological risk factors and MRS1/MRS 2 | 89.8% | 70.0% | Model 3 | 72.9% | 52.9% | Model 3 |