Arthur M Lee1, Jian Hu2, Yunwen Xu3, Alison G Abraham4, Rui Xiao2, Josef Coresh3, Casey Rebholz3, Jingsha Chen3, Eugene P Rhee5, Harold I Feldman2, Vasan S Ramachandran6, Paul L Kimmel7, Bradley A Warady8, Susan L Furth9,10, Michelle R Denburg9,10. 1. Division of Nephrology, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania leeam@chop.edu. 2. Center for Clinical Epidemiology and Biostatistics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania. 3. Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore Maryland. 4. School of Public Health, University of Colorado Denver, Denver, Colorado. 5. Department of Medicine, Massachusetts General Hospital, Harvard University, Boston, Massachusetts. 6. Department of Medicine, Boston University School of Medicine, Boston University School of Public Health, Boston University Center for Computing and Data Science, Boston, Massachusetts. 7. National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland. 8. Department of Pediatrics, Children's Mercy Hospital, University of Missouri-Kansas City School of Medicine, Kansas City, Missouri. 9. Division of Nephrology, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania. 10. Department of Pediatrics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania.
Abstract
BACKGROUND: Untargeted plasma metabolomic profiling combined with machine learning (ML) may lead to discovery of metabolic profiles that inform our understanding of pediatric CKD causes. We sought to identify metabolomic signatures in pediatric CKD based on diagnosis: FSGS, obstructive uropathy (OU), aplasia/dysplasia/hypoplasia (A/D/H), and reflux nephropathy (RN). METHODS: Untargeted metabolomic quantification (GC-MS/LC-MS, Metabolon) was performed on plasma from 702 Chronic Kidney Disease in Children study participants (n: FSGS=63, OU=122, A/D/H=109, and RN=86). Lasso regression was used for feature selection, adjusting for clinical covariates. Four methods were then applied to stratify significance: logistic regression, support vector machine, random forest, and extreme gradient boosting. ML training was performed on 80% total cohort subsets and validated on 20% holdout subsets. Important features were selected based on being significant in at least two of the four modeling approaches. We additionally performed pathway enrichment analysis to identify metabolic subpathways associated with CKD cause. RESULTS: ML models were evaluated on holdout subsets with receiver-operator and precision-recall area-under-the-curve, F1 score, and Matthews correlation coefficient. ML models outperformed no-skill prediction. Metabolomic profiles were identified based on cause. FSGS was associated with the sphingomyelin-ceramide axis. FSGS was also associated with individual plasmalogen metabolites and the subpathway. OU was associated with gut microbiome-derived histidine metabolites. CONCLUSION: ML models identified metabolomic signatures based on CKD cause. Using ML techniques in conjunction with traditional biostatistics, we demonstrated that sphingomyelin-ceramide and plasmalogen dysmetabolism are associated with FSGS and that gut microbiome-derived histidine metabolites are associated with OU.
BACKGROUND: Untargeted plasma metabolomic profiling combined with machine learning (ML) may lead to discovery of metabolic profiles that inform our understanding of pediatric CKD causes. We sought to identify metabolomic signatures in pediatric CKD based on diagnosis: FSGS, obstructive uropathy (OU), aplasia/dysplasia/hypoplasia (A/D/H), and reflux nephropathy (RN). METHODS: Untargeted metabolomic quantification (GC-MS/LC-MS, Metabolon) was performed on plasma from 702 Chronic Kidney Disease in Children study participants (n: FSGS=63, OU=122, A/D/H=109, and RN=86). Lasso regression was used for feature selection, adjusting for clinical covariates. Four methods were then applied to stratify significance: logistic regression, support vector machine, random forest, and extreme gradient boosting. ML training was performed on 80% total cohort subsets and validated on 20% holdout subsets. Important features were selected based on being significant in at least two of the four modeling approaches. We additionally performed pathway enrichment analysis to identify metabolic subpathways associated with CKD cause. RESULTS: ML models were evaluated on holdout subsets with receiver-operator and precision-recall area-under-the-curve, F1 score, and Matthews correlation coefficient. ML models outperformed no-skill prediction. Metabolomic profiles were identified based on cause. FSGS was associated with the sphingomyelin-ceramide axis. FSGS was also associated with individual plasmalogen metabolites and the subpathway. OU was associated with gut microbiome-derived histidine metabolites. CONCLUSION: ML models identified metabolomic signatures based on CKD cause. Using ML techniques in conjunction with traditional biostatistics, we demonstrated that sphingomyelin-ceramide and plasmalogen dysmetabolism are associated with FSGS and that gut microbiome-derived histidine metabolites are associated with OU.
Authors: Neal B Shah; Andrew S Allegretti; Sagar U Nigwekar; Sahir Kalim; Sophia Zhao; Benjamin Lelouvier; Florence Servant; Gloria Serena; Ravi Ishwar Thadhani; Dominic S Raj; Alessio Fasano Journal: Clin J Am Soc Nephrol Date: 2019-04-08 Impact factor: 8.237
Authors: Lloyd W Sumner; Alexander Amberg; Dave Barrett; Michael H Beale; Richard Beger; Clare A Daykin; Teresa W-M Fan; Oliver Fiehn; Royston Goodacre; Julian L Griffin; Thomas Hankemeier; Nigel Hardy; James Harnly; Richard Higashi; Joachim Kopka; Andrew N Lane; John C Lindon; Philip Marriott; Andrew W Nicholls; Michael D Reily; John J Thaden; Mark R Viant Journal: Metabolomics Date: 2007-09 Impact factor: 4.290
Authors: Shengyuan Luo; Elena V Feofanova; Adrienne Tin; Sarah Tung; Eugene P Rhee; Josef Coresh; Dan E Arking; Aditya Surapaneni; Pascal Schlosser; Yong Li; Anna Köttgen; Bing Yu; Morgan E Grams Journal: Kidney Int Date: 2021-04-08 Impact factor: 18.998
Authors: Eugene P Rhee; Sushrut S Waikar; Casey M Rebholz; Zihe Zheng; Regis Perichon; Clary B Clish; Anne M Evans; Julian Avila; Michelle R Denburg; Amanda Hyre Anderson; Ramachandran S Vasan; Harold I Feldman; Paul L Kimmel; Josef Coresh Journal: Clin J Am Soc Nephrol Date: 2018-12-20 Impact factor: 10.614
Authors: Michelle R Denburg; Yunwen Xu; Alison G Abraham; Josef Coresh; Jingsha Chen; Morgan E Grams; Harold I Feldman; Paul L Kimmel; Casey M Rebholz; Eugene P Rhee; Ramachandran S Vasan; Bradley A Warady; Susan L Furth Journal: Clin J Am Soc Nephrol Date: 2021-08 Impact factor: 10.614
Authors: Dan Burghelea; Tudor Moisoiu; Cristina Ivan; Alina Elec; Adriana Munteanu; Ștefania D Iancu; Anamaria Truta; Teodor Paul Kacso; Oana Antal; Carmen Socaciu; Florin Ioan Elec; Ina Maria Kacso Journal: Biomedicines Date: 2022-05-17