Literature DB >> 31592078

Development of a Plasma Screening Panel for Pediatric Nonalcoholic Fatty Liver Disease Using Metabolomics.

Richard D Khusial¹, Catherine E Cioffi², Shelley A Caltharp^3,4, Alyssa M Krasinskas⁴, Adina Alazraki^3,5, Jack Knight-Scott³, Rebecca Cleeton⁶, Eduardo Castillo-Leon⁶, Dean P Jones⁷, Bridget Pierpont⁸, Sonia Caprio⁸, Nicola Santoro⁸, Ayman Akil¹, Miriam B Vos^2,3,6.

Abstract

Nonalcoholic fatty liver disease (NAFLD) is the most common chronic liver disease in children, but diagnosis is challenging due to limited availability of noninvasive biomarkers. Machine learning applied to high-resolution metabolomics and clinical phenotype data offers a novel framework for developing a NAFLD screening panel in youth. Here, untargeted metabolomics by liquid chromatography-mass spectrometry was performed on plasma samples from a combined cross-sectional sample of children and adolescents ages 2-25 years old with NAFLD (n = 222) and without NAFLD (n = 337), confirmed by liver biopsy or magnetic resonance imaging. Anthropometrics, blood lipids, liver enzymes, and glucose and insulin metabolism were also assessed. A machine learning approach was applied to the metabolomics and clinical phenotype data sets, which were split into training and test sets, and included dimension reduction, feature selection, and classification model development. The selected metabolite features were the amino acids serine, leucine/isoleucine, and tryptophan; three putatively annotated compounds (dihydrothymine and two phospholipids); and two unknowns. The selected clinical phenotype variables were waist circumference, whole-body insulin sensitivity index (WBISI) based on the oral glucose tolerance test, and blood triglycerides. The highest performing classification model was random forest, which had an area under the receiver operating characteristic curve (AUROC) of 0.94, sensitivity of 73%, and specificity of 97% for detecting NAFLD cases. A second classification model was developed using the homeostasis model assessment of insulin resistance substituted for the WBISI. Similarly, the highest performing classification model was random forest, which had an AUROC of 0.92, sensitivity of 73%, and specificity of 94%.
Conclusion: The identified screening panel consisting of both metabolomics and clinical features has promising potential for screening for NAFLD in youth. Further development of this panel and independent validation testing in other cohorts are warranted.

Entities: Chemical Disease Gene Mutation Species

Year: 2019 PMID： 31592078 PMCID： PMC6771165 DOI： 10.1002/hep4.1417

Source DB: PubMed Journal: Hepatol Commun ISSN： 2471-254X

alanine aminotransferase aspartate aminotransferase area under the receiver operating curve body mass index correlation‐based feature search fasting plasma glucose high‐density lipoprotein homeostasis model assessment of insulin resistance high‐resolution metabolomics information gain low‐density lipoprotein lysophosphatidylcholine lysophosphatidylethanolamine mass‐to‐charge ratio magnetic resonance imaging nonalcoholic fatty liver disease oral glucose tolerance test phosphatidylcholine phosphatidylethanolamine retention time support vector machine whole‐body insulin sensitivity index waist circumference Nonalcoholic fatty liver disease (NAFLD) is the most common chronic liver disease in the United States, largely due to parallel rises in pediatric obesity.1, 2 The term NAFLD encompasses a range of disease severity from steatosis, defined as fat accumulation in at least 5% of hepatocytes, to nonalcoholic steatohepatitis, defined as steatosis with inflammation and hepatocyte ballooning. There are limited natural history data in children, but studies have shown that childhood NAFLD can result in cirrhosis, severe liver disease, and death.3, 4 NAFLD is also strongly associated with cardiometabolic disease risk factors, such as prediabetes and type 2 diabetes and atherogenic dyslipidemia.5, 6 Currently, NAFLD is a diagnosis of exclusion. Thus, there is a need for a sensitive and cost‐effective screening assay for pediatric NAFLD. Machine learning can be used to examine patterns within a data set’s attributes to extract important relationships relative to a targeted class.7, 8 In combination with high‐resolution metabolomics (HRM), which is capable of measuring thousands of small molecule metabolites in biofluids, machine learning offers a potential solution for uncovering a novel combination of metabolites that differentiate NAFLD from non‐NAFLD cases. Here, we applied machine learning to high‐resolution liquid chromatography–mass spectrometry metabolomics data as well as clinical phenotype data. Our objective was to identify a panel of variables useful as a screening tool for pediatric NAFLD.

Patients and Methods

Study Population

The combined cohort for this study was a total of 559 patients ages 2‐25 years old who were diagnosed with or without NAFLD by one or both of the gold standard methods (magnetic resonance‐based imaging or liver biopsy) and who were enrolled in one of the three following studies: the Emory University Pediatric Liver Biopsy Data Repository (n = 172), the SweetBev Trial (n = 39), and the Yale Pediatric NAFLD Cohort (n = 348). The Emory Pediatric Liver Biopsy Cohort was a cross‐sectional study of patients enrolled from Children’s Healthcare of Atlanta before undergoing a clinically indicated liver biopsy for suspected liver disease or disease monitoring. Exclusion criteria were fever in the past 2 weeks, renal disease/insufficiency, or pregnancy. Participants provided a fasting blood sample before liver biopsy, which was processed and stored for future research, and shared their liver biopsy slides, which were blindly scored according to the Nonalcoholic Steatohepatitis Clinical Research Network criteria by two pathologists.9 NAFLD diagnosis was defined as having steatosis >5% on liver biopsy combined with a clinical diagnosis of NAFLD. The Emory SweetBev Trial was conducted in Hispanic/Latino adolescents age 11‐18 years with overweight or obesity and tested the effect of a 4‐week reduction in dietary fructose intake on hepatic fat. Inclusion criteria were hepatic fat >10% and self‐reported habitual sugar‐sweetened beverage consumption. Exclusion criteria were currently attempting weight gain or loss, cirrhosis, renal disease/insufficiency, recent acute illness in the past 4 weeks, or pregnancy. More details of the sample are described elsewhere.10 For most participants (n = 29), plasma samples from baseline were used for metabolomics analysis in this study. For those without stored baseline samples, plasma samples collected at 2 weeks (n = 7) or 4 weeks (n = 3) were used. Hepatic fat was assessed by magnetic resonance spectroscopy (MRS) using described methods,11 and NAFLD diagnosis was defined as hepatic fat >5% based on MRS. The Yale Pediatric NAFLD Cohort is a multi‐ethnic cohort study of children and adolescents with overweight or obesity recruited from the Yale Pediatric Obesity Clinic in New Haven, CT. Exclusion criteria were known liver diseases (except for NAFLD), alcohol consumption, and use of medications that alter glucose, lipid, or amino acid metabolism. Although some subjects were followed prospectively, for this study only data and samples from each participant’s first study visit were used, which included a stored fasting blood sample for metabolomics analysis among other phenotype assessments.12 A fast‐gradient magnetic resonance imaging (MRI) was performed to assess hepatic fat using methods based on the two‐point Dixon,13 and NAFLD diagnosis was defined as having hepatic fat >5%. The Institutional Review Board at Yale University approved all study procedures and the Institutional Review Board at Emory University approved this secondary analysis of the data and samples. All participants provided written informed consent and assent for their samples and data to be used for future research. In the combined cohort, 222 participants were diagnosed with NAFLD and 337 were non‐NAFLD controls based on the biopsy or MRI criteria described above. Additional information for each parent study can be found in the Supporting Information.

HRM Analysis

HRM was performed on stored fasting plasma samples using described methods.14, 15 Briefly, plasma samples that had been frozen at −80°C since the time of collection were prepared and analyzed in batches of 40; each contained pooled human plasma (Q standard) at the start, middle, and end of each batch for quality control purposes. Samples were analyzed in triplicate using 10‐μL injections and hydrophilic interaction liquid chromatography with ultra‐high resolution mass spectrometry (Q‐Exactive HF Orbitrap, Thermo Scientific, San Jose, CA) and positive electrospray ionization. The mass spectrometer was operated at 120,000 resolution and in full‐scan mode in order to detect mass‐to‐charge ratios (m/z) ranging from 85 to 1,275. Raw data files were extracted and aligned by apLCMS with xMSanalyzer.16, 17 Spectral features were batch corrected by ComBat,18 filtered based on coefficient of variation, and averaged across triplicates based on Pearson’s correlation. The resulting data consisted of 13,013 metabolic features defined by an accurate mass m/z, retention time (RT), and ion abundance, which were then entered into further statistical analyses.

Metabolite Annotation and Identification

Metabolic features were computationally annotated using xMSannotator.19 This software performs accurate mass matching to common positive‐mode adducts in the Human Metabolome Database.20 For this study, an m/z tolerance of ±5 ppm and an RT threshold of 10 seconds were used to select matches. Additionally, xMSannotator uses a multilevel algorithm that assigns scores to matched metabolic feature, ranging from 0 (no confidence), to 3 (high confidence), based on adduct/isotope patterns, elemental or abundance ratio checks, and metabolic pathway‐level information. Features with no match or with a score of 0 or 1 in xMSannotator were level 4 “unknown” compounds according to criteria set by the Metabolomics Standards Initiative21; those with a score of 2 or greater in xMSannotator were at least level 2 “putatively annotated” compounds. Among these, annotations having m/z and RT of M+H adducts previously confirmed by comparing ion dissociation and elution time to reference standards22 were level 1 “identified” compounds.

Clinical Phenotype Assessments

Phenotype assessments collected for the combined cohort differed according to parent study and are summarized in the Supporting Information. For all participants, demographic information for age, sex, and race/ethnicity were collected by surveys or medical record review, and anthropometric measurements for height (cm) and weight (kg) were performed. Waist circumference (WC; cm) and hip circumference (cm) were also measured in the Emory Pediatric Liver Biopsy Cohort and the Yale Pediatric NAFLD Cohort, although only a subsample of the Emory Pediatric Liver Biopsy Cohort (n = 39) had these measurements because this was added to the protocol after enrollment started. For participants 2‐19 years, age‐ and sex‐adjusted body mass index (BMI) percentiles and z scores were calculated using the 2000 Centers for Disease Control and Prevention growth charts.23 For participants >19 years, we calculated BMI as weight (kg) divided by height in meters squared. Participants were grouped as follows: normal weight (BMI percentile, <85 or BMI, <25 kg/m2 for 2‐19 or 20‐25 years, respectively), overweight (BMI percentile, 85‐94 or BMI, 25‐29 kg/m2), and obese (BMI percentile, ≥95 or BMI, ≥30 kg/m2). Additionally, WC and hip circumference (cm) were measured for some participants (n = 377) in the Emory Pediatric Liver Biopsy Cohort and the Yale Pediatric NAFLD Cohort. The following laboratory tests were performed in all three parent studies by routine methods: liver enzymes alanine aminotransferase (ALT) and aspartate aminotransferase (AST), fasting plasma glucose (FPG), and a blood lipid panel, including total cholesterol, blood triglycerides, high‐density lipoprotein (HDL) cholesterol, and low‐density lipoprotein (LDL) cholesterol. Fasting insulin was also measured in the Emory SweetBev Trial and the Yale Pediatric NAFLD Cohort and used with FPG to calculate the homeostasis model assessment of insulin resistance (HOMA‐IR).24 Hemoglobin A1C was measured in the Yale Pediatric NAFLD Cohort. A standard oral glucose tolerance test (OGTT) was also performed in the Yale Pediatric NAFLD Cohort and used to calculate 1) the whole‐body insulin sensitivity index (WBISI) based on the Matsuda index, which has been validated against the euglycemic‐hyperinsulinemic clamp in children and adolescents25; 2) the insulinogenic index (IGI) based on the formula Δinsulin (0‐30 minutes)/Δglucose (0‐30 minutes); 3) the disposition index based on the product of IGI and the WBISI. Participants with an FPG and/or 2‐hour postprandial glucose level from the OGTT were characterized as having type 2 diabetes mellitus, impaired fasting glucose, and/or impaired glucose tolerance based on criteria from the American Diabetes Association.26 Additional details of the laboratory methods used for each substudy are in the Supporting Information.

Machine Learning‐Based Analysis

Demographic and clinical characteristics of the sample were summarized using counts and frequencies for categorical variables and means and SDs or medians and interquartile ranges for continuous variables (Table 1). The metabolomics data set was randomly divided into a training set (two thirds of data) and testing set (one third of data) for developing and testing the NAFLD classification model, using the approach in Fig. 1. Partial least squares discriminant analysis (PLS‐DA) was used to reduce data set dimensionality. Metabolic features with a variable importance of projection (VIP) >1 were retained.27 A two‐step feature selection was performed on remaining features. The first step was a correlation‐based feature search (CFS) during which metabolic features were evaluated for their classification potential by Pearson’s correlation. The second step was information gain (IG) with a wrapper‐method support vector machine (SVM), during which metabolic features were evaluated by the amount of information gained and its importance to the SVM algorithm. Retained metabolic features were entered into classification analysis using logistic regression, Naive Bayes, and random forest. Performance of the metabolomics‐only model by each method was evaluated in the test and training sets by area under the receiver operating curve (AUROC), sensitivity, and specificity.

Table 1

Sociodemographic and Health Characteristics of the Sample Stratified by NAFLD Status

	Non‐NAFLD (n = 337)		NAFLD (n = 222)		P value*
	N Obs	Estimate	N (%)	Estimate	P value*
Age, mean (SD)	337	13.7 (4.0)	222	13.6 (2.9)	0.687
Sex, count (%)
Male	337	132 (39.1)	222	135 (60.8)	<0.001
Race/ethnicity, count (%)
Non‐Hispanic black	337	132 (38.9)	222	25 (11.3)	<0.001
Non‐Hispanic white		128 (37.8)		70 (31.5)	0.16
Hispanic		66 (19.5)		121 (54.5)	<0.001
Asian/other		10 (2.9)		4 (1.8)	0.75
Weight status, count (%)
Normal/underweight	332	100 (29.4)	222	4 (1.8)	<0.001
Overweight		45 (13.6)		15 (6.8)	0.027
Obese		187 (57.0)		203 (91.4)	<0.001
Waist circumference, mean (SD)	237	96.0 (18.98)	140	104.3 (17.91)	<0.001
Hip circumference, mean (SD)	236	104.9 (16.46)	140	107.0 (16.93)	0.24
Fasting glucose (mg/dL), mean (SD)	328	87.9 (12.7)	220	88.3 (12.4)	0.69
2‐Hour glucose (mg/dL), mean (SD)	233	117.7 (29.7)	115	127.2 (33.9)	0.012
Fasting insulin (mg//dL), mean (SD)	238	27.8 (22.8)	138	37.9 (21.5)	<0.001
WBISI, mean (SD)	226	2.4 (1.54)	113	1.5 (0.79)	<0.001
Insulinogenic index, mean (SD)	218	4.7 (5.47)	112	5.6 (4.58)	0.15
Disposition index, mean (SD)	217	9.3 (16.64)	112	7.1 (6.10)	0.19
HOMA‐IR, mean (SD)	238	6.3 (5.38)	138	8.7 (5.33)	<0.001
HBA1C (%), mean (SD)	204	5.5 (0.28)	97	5.5 (0.34)	0.07
Diabetes diagnoses, count (%)
Type 2 diabetes	328	8 (2.7)	220	4 (2.3)	0.99
Impaired glucose tolerance	233	40 (17.2)	115	26 (22.6)	0.28
Impaired fasting glucose	328	26 (10.2)	220	24 (14.0)	0.69
ALT (U/L), median (IQR)	308	19 (28)	204	24 (27)	<0.001
AST (U/L), median (IQR)	307	43 (65.3)	203	32 (24.5)	<0.001
Total‐C (mg/dL), mean (SD)	304	158.4 (41.2)	203	163.8 (37.9)	0.13
Triglycerides (mg/dL), mean (SD)	304	89.3 (56.7)	203	130.9 (71.6)	<0.001
HDL‐C (mg/dL), mean (SD)	304	46.7 (13.8)	203	43.9 (10.9)	0.010
LDL‐C (mg/dL), mean (SD)	304	93.8 (38.0)	203	96.7 (32.3)	0.34

P values calculated using Student t tests or chi‐square tests. For ALT and AST, nonparametric Mann‐Whitney U test was performed due to non‐normality of the variables. Bold values indicate clinically important differences between groups.

Abbreviations: C, cholesterol, HBA1C, hemoglobin A1C; IQR, interquartile range.

Figure 1

Workflow of the machine learning‐based approach used for developing and testing the NAFLD screening panels. The proposed workflow consists of a dimension reduction technique, followed by a two‐step machine learning approach. After a potential subset of features is obtained, algorithm mapping with three classifiers is explored to identify an optimal model.

Sociodemographic and Health Characteristics of the Sample Stratified by NAFLD Status P values calculated using Student t tests or chi‐square tests. For ALT and AST, nonparametric Mann‐Whitney U test was performed due to non‐normality of the variables. Bold values indicate clinically important differences between groups. Abbreviations: C, cholesterol, HBA1C, hemoglobin A1C; IQR, interquartile range. Workflow of the machine learning‐based approach used for developing and testing the NAFLD screening panels. The proposed workflow consists of a dimension reduction technique, followed by a two‐step machine learning approach. After a potential subset of features is obtained, algorithm mapping with three classifiers is explored to identify an optimal model. For the development of a combined metabolomics–clinical model, the clinical phenotype data were also analyzed by a two‐step feature selection. Before analysis, missing continuous data were imputed using the median population value of the attribute per the respected class. The two‐step feature selection step was first performed on all 26 demographic and clinical phenotype variables. The retained variables from this analysis were then combined with the retained metabolic features from above to assess their classification performance using logistic regression, Naive Bayes, and random forest. Due to the more intensive nature of performing an OGTT, we also repeated the two‐step feature selection and classification analysis using the clinical phenotype data without the OGTT measurements. Machine learning analysis was conducted in the Waikato Environment for Knowledge Analysis (WEKA, version 3.8), and all other statistical analyses were performed with R statistical package software (version 3.4.2).

Results

Characteristics of the sample are in Table 1. The NAFLD group compared to the non‐NAFLD group had a higher percentage of male subjects (61% vs. 39%, respectively), Hispanics (55% vs. 20%, respectively), and individuals with obesity (92% vs. 57%) (all P < 0.05). The NAFLD group also had significantly higher WC, HOMA‐IR, ALT, AST, and blood triglycerides and lower WBISI and HDL‐cholesterol compared to the non‐NAFLD group (all P < 0.05).

Machine Learning Analysis: Metabolomics‐Only Data

PLS‐DA was implemented for dimension reduction of the 13,008 metabolic features, resulting in 28 components being retained, which explained 95% cumulative variance, and 279 metabolic features being selected with a VIP score ≥1. The 279 metabolic features were then filtered by a two‐step feature selection. As a result of CFS, 222 metabolic features were removed and 57 were retained. The retained metabolic features are summarized in Supporting Table S1. As a result of IG‐SVM, 46 metabolic features were removed and 11 were retained, which are summarized in Table 2. This included the M+H adducts for serine, dihydrothymine, leucine/isoleucine, and tryptophan; the M+H adduct of lysophosphoethanolamine (LysoPE) 20:0; and the M+Na adduct of lysophosphotiylcholine (LysoPC) 18:1. The C13 isotopes of these adducts for leucine/isoleucine, tryptophan, and LysoPC (18:1) were also retained as well as two unknown metabolic features (m/z, 196.8651 and RT, 57; m/z, 434.6894 and RT, 55.3).

Table 2

Annotated Identities for the 11 Metabolic Features From HRM Retained After Feature Selection

m/z	Time (Seconds)	Compound Name	HMDB ID	Formula	MSI Level*	Adduct	Difference in Expression (NAFLD vs. Control)
106.0499	81.1	Serine	HMDB00187	C3H7NO3	1	M+H	↓
129.0658	61.3	Dihydrothymine	HMDB00079	C5H8N2O2	2	M+H	↓
132.1019	44.9	Leucine/isoleucine	HMDB00172	C6H13NO2	1	M+H	↑
133.1052	45.9	Leucine/isoleucine	HMDB00172	C6H13NO2	1	(M+1)+H	↑
196.8651	57	‐	‐	‐	4	‐	↓
205.0972	45.8	Tryptophan	HMDB00929	C11N12N2O2	1	M+H	↑
206.1005	45.7	Tryptophan	HMDB00929	C11N12N2O2	1	(M+1)+H	↑
434.6894	55.3	‐	‐	‐	4	‐	↓
510.3552	37.6	LysoPE(20:0)	HMDB11481	C25H52NO7P	2	M+H	↓
544.3377	39	LysoPC(18:1)	HMDB02815	C26H52NO7P	2	M+Na	↓
545.3415	37.9	LysoPC(18:1)	HMDB02815	C26H52NO7P	2	(M+1)+Na	↓

Confidence levels assigned according to MSI criteria, whereby level 1 is identified compounds, level 2 is putatively annotated compounds, level 3 is putatively characterized compound class, and level 4 is unknown compounds.

Abbreviations: HMDB, Human Metabolome Database; MSI, Metabolomics Standard Initiative.

Annotated Identities for the 11 Metabolic Features From HRM Retained After Feature Selection Confidence levels assigned according to MSI criteria, whereby level 1 is identified compounds, level 2 is putatively annotated compounds, level 3 is putatively characterized compound class, and level 4 is unknown compounds. Abbreviations: HMDB, Human Metabolome Database; MSI, Metabolomics Standard Initiative. The 11 selected metabolic features were then entered into a classification analysis using logistic regression, Naive Bayes, and random forest, which enabled us to examine different patterns within the data set between metabolic features and class. The training and test set evaluation metrics for the metabolomics‐only model are shown in Table 3. The three classifiers resulted in similar AUROCs, ranging from 0.85 to 0.86 in the training set and 0.82 to 0.84 in the test set (Fig. 2A). Naive Bayes had the highest sensitivity at 77% (training set) and 73% (test set), and random forest had the highest specificity at 84% (training set) and 87% (test set). As a sensitivity analysis, we tested classifier performance without the two unknown metabolic features. This resulted in minor changes in their evaluation metrics in the test set, with AUROCs of 0.81‐0.82, highest sensitivity of 75% using Naive Bayes, and highest specificity of 89% using random forest (Supporting Table S2).

Table 3

Evaluation Metrics for the Metabolomics‐Only and the Combined Metabolomics–Clinical Models With and Without OGTT Variables Applied to the Training and Test Sets

Classifier	Metabolomics‐Only Model			Combined Model #1 (With OGTT Variables)			Combined Model #2 (Without OGTT Variables)
Classifier	AUROC	Sensitivity	Specificity	AUROC	Sensitivity	Specificity	AUROC	Sensitivity	Specificity
Training Set
Naive Bayes	0.847	77%	78%	0.880	82%	77%	0.866	81%	81%
Random forest	0.846	60%	84%	0.924	77%	90%	0.923	72%	91%
Logistic	0.860	68%	83%	0.875	75%	87%	0.876	73%	88%
Test Set
Naive Bayes	0.825	73%	78%	0.848	73%	75%	0.842	73%	76%
Random forest	0.818	60%	87%	0.944	73%	97%	0.922	73%	94%
Logistic	0.839	69%	81%	0.863	72%	83%	0.858	70%	81%

Bold values indicate the classifier with the highest AUROC for each model in the training or test set.

Abbreviations: AUROC, Area under the receiver operator curve; OGTT, Oral glucose tolerance test.

Figure 2

AUROC curves produced from random forest, logistic regression, and Naive Bayes classifiers. (A) Metabolomics‐only model, (B) combined metabolomics–clinical model with OGTT variables, (C) combined metabolomics–clinical model without OGTT variables.

Evaluation Metrics for the Metabolomics‐Only and the Combined Metabolomics–Clinical Models With and Without OGTT Variables Applied to the Training and Test Sets Bold values indicate the classifier with the highest AUROC for each model in the training or test set. Abbreviations: AUROC, Area under the receiver operator curve; OGTT, Oral glucose tolerance test. AUROC curves produced from random forest, logistic regression, and Naive Bayes classifiers. (A) Metabolomics‐only model, (B) combined metabolomics–clinical model with OGTT variables, (C) combined metabolomics–clinical model without OGTT variables.

Machine Learning Analysis: Combined Metabolomics–Clinical Data

The same two‐step feature selection analysis was applied to all 26 clinical phenotype variables. After CFS, 14 variables were retained, and after IG‐SVM, the following three variables were retained: WC, WBISI, and triglycerides. The training and test set evaluation metrics for the combined model, evaluated with the 11 metabolic features from above, are summarized in Table 3, and the AUROCs are shown in Fig. 2B. In the test set, random forest had the highest AUROC at 0.94 as well as the highest sensitivity at 73% and specificity at 97%. A second analysis was conducted using 22 clinical phenotype variables, excluding the OGTT‐based measurements. After CFS, 12 variables were retained, and after IG‐SVM, the following three variables were retained: WC, HOMA‐IR, and triglycerides. The training and test set evaluation metrics for this second combined model are summarized in Table 3. In the test set, random forest had the highest AUROC at 0.92 and specificity at 94%. Naive Bayes and random forest had the same sensitivity of 73%. The AUROCs for each of the models are shown in Fig. 2C. Similar to above, we performed sensitivity analyses without the two unknown features for these models; this resulted in only slight reductions in their evaluation metrics in the test set (Supporting Table S2).

Discussion

Current approaches to screen for and diagnose NAFLD are inefficient and cumbersome. In this study, we applied a machine learning approach to a combination of HRM and clinical phenotype data to identify a set of variables with potential for use as a NAFLD screening panel in children and adolescents. The metabolomics‐only‐based model, which was developed using the training data set and consisted of 11 metabolic features, achieved a sensitivity of 73% and specificity of 84% in the test set. Further, with the addition of clinical phenotype variables to the machine learning workflow and model, we were able to achieve an even higher specificity of 97% in the test set. The main feature of our machine learning approach was a two‐step feature selection consisting of CFS and IG‐SVM, which was applied to both the metabolomics and clinical phenotype data sets. CFS is a multivariate method of identifying subsets of features that are highly correlated with the class outcome and uncorrelated with each other.28 IG‐SVM was used to further analyze the remaining attributes by a univariate approach. IG is a filter method that eliminates irrelevant attributes in high‐dimensional data, whereas the SVM wrapper eliminates redundancy to decrease noise in the data. The IG‐SVM method has previously shown success for biomarker selection in high‐dimensional cancer gene data.29 Using these steps, 11 metabolic features were retained, of which five were identified as adducts of the amino acids serine, leucine/isoleucine, and tryptophan. Higher levels of branched chain and aromatic amino acids have been reported in children with NAFLD30, 31 and other cardiometabolic disease risk factors.32, 33, 34 The exact cause of these amino acid alterations remains an active area of research but may relate to higher protein consumption, impaired catabolism, and/or microbial amino acid metabolism. Additionally, adult studies have shown that serine levels are inversely associated with NAFLD severity,35, 36, 37, 38 possibly due to increased hepatic import/degradation for glutathione synthesis,39 although this relationship has not been extensively studied in youth. Another metabolic feature was putatively annotated as dihydrothymine, an intermediate in liver pyrimidine catabolism. Studies in animals have linked altered pyrimidine metabolism with increased hepatic lipid,40 providing plausibility for our finding in this study. The remaining features were annotated to two phospholipids, LysoPE (20:0) and LysoPC (18:1). Altered glycerophospholipid metabolism has been reported in both blood and liver tissue samples from patients with NAFLD41, 42, 43, 44, 45 and may relate to disruptions in the cytidine diphosphate‐choline pathway for PC synthesis or the PE to PC conversion pathway catalyzed by PE methyltransferase, both of which take place in the liver. Because there were multiple isomeric matches to these lysophospholipids, additional work is needed to confirm their identities as well as the identities of the two unknown metabolic features. The machine learning‐based approach was also applied to the clinical phenotype data available for the cohort. Using all 26 variables, WC, WBISI, and blood triglycerides had the greatest potential to identify NAFLD. This aligns with research showing a strong association of NAFLD with metabolic syndrome, which refers to a cluster of metabolic abnormalities that increase risk for future cardiometabolic disease including central adiposity, impaired glucose tolerance, and dyslipidemia.46, 47, 48 Due to the additional time and cost required for an OGTT, the feature selection steps were reapplied to a subset of clinical phenotype variables not including OGTT measurements, and this revealed that WC, HOMA‐IR, and blood triglycerides had the greatest capability to identify NAFLD. This was not surprising because although HOMA‐IR and WBISI are calculated using different information, i.e., fasting versus OGTT‐based glucose and insulin measurements, respectively, both are generally reflective of an individual’s degree of insulin resistance, which is often a co‐occurring morbidity with NAFLD and has been shown in prior studies to be predictive of NAFLD among children with obesity.49 After feature selection, we tested several classification algorithms to identify the optimal algorithm for our screening model. For the metabolomics‐only model, the AUROCs of the classifiers were relatively similar, all greater than 0.8, suggesting that all 11 metabolic features were valuable in classifying NAFLD cases. When combined with WC, WBISI, and blood triglycerides, the evaluation metrics for all three classifiers improved, with the random forest algorithm showing the highest specificity of 97%, sensitivity of 73%, and an AUROC of 0.94 on the test set. The performance of this combined model is comparable to the OWLiver Care test, which is a serum‐based panel developed from 467 adults with and without liver biopsy‐proven NAFLD that achieved a specificity of 78%, sensitivity of 98%, and AUROC of 0.90.35 Compared to this test, all three classification algorithms in our study achieved higher specificity than sensitivity, possibly because the control group had patients with other non‐NAFLD liver diseases. When HOMA‐IR was substituted for WBISI, the random forest algorithm again had the greatest performance with specificity of 94%, sensitivity of 73%, and AUROC of 0.92. Thus, the evaluation metrics for this second combined model were improved from the metabolomics‐only model but inferior to the first combined model with OGTT‐based WBISI. However, the drop in AUROC was only from 0.94 to 0.92, suggesting that while WBISI may be a slightly stronger predictor of NALFD, HOMA‐IR could provide a more practical alternative to WBISI with minimal information loss. This study was strengthened by its large and diverse sample of children, adolescents, and young adults recruited from two different geographic regions of the United States. All participants had a NAFLD diagnosis confirmed by one or both of the two most accurate methods, MRI/spectroscopy or liver biopsy, which increased our confidence in participant classifications. The use of an untargeted HRM assay allowed us to measure all possible metabolic features without an a priori hypothesis, offering greater potential to identify novel biomarkers. Despite the promising results, there are limitations to this work. Because different methods (imaging or biopsy) were used for the classification of some participants, this may impact the internal validity of the model. The model was constructed and tested on the same population. Although hold‐out testing was performed, further studies are warranted to validate these models independently. Another limitation is the degree of missing data for some clinical variables. To address this, imputation was used and may have resulted in biased results. However, excluding subjects due to missing data would have resulted in loss of power due to a smaller sample size. Lastly, while we were able to annotate and/or identify most of the retained metabolic features with confidence, “unknown” features still remained and require further investigation. However, in sensitivity analyses excluding these features, the performance metrics of the panel was relatively unchanged, supporting its utility without their validation. In conclusion, we were able to identify a panel of metabolites and clinical phenotype variables that offer potential as a screening tool for NAFLD by using HRM and clinical phenotype data combined with machine learning algorithms. In a clinic setting, this panel could be used as a first step in identifying children and adolescents who should be further evaluated for NAFLD diagnosis and/or staging using more invasive and expensive procedures. Further investigation to understand the biological significance of the chosen metabolic features as well as external validation of the proposed panel is warranted. Click here for additional data file.

46 in total

1. A comprehensive untargeted metabonomic analysis of human steatotic liver tissue by RP and HILIC chromatography coupled to mass spectrometry reveals important metabolic alterations.

Authors: Juan C García-Cañaveras; M Teresa Donato; José V Castell; Agustín Lahoz
Journal: J Proteome Res Date: 2011-09-06 Impact factor: 4.466

2. Adjusting batch effects in microarray expression data using empirical Bayes methods.

Authors: W Evan Johnson; Cheng Li; Ariel Rabinovic
Journal: Biostatistics Date: 2006-04-21 Impact factor: 5.899

Review 3. 2. Classification and Diagnosis of Diabetes.

Authors:
Journal: Diabetes Care Date: 2017-01 Impact factor: 19.112

4. Genome-scale metabolic modelling of hepatocytes reveals serine deficiency in patients with non-alcoholic fatty liver disease.

Authors: Adil Mardinoglu; Rasmus Agren; Caroline Kampf; Anna Asplund; Mathias Uhlen; Jens Nielsen
Journal: Nat Commun Date: 2014 Impact factor: 14.919

5. Design and validation of a histological scoring system for nonalcoholic fatty liver disease.

Authors: David E Kleiner; Elizabeth M Brunt; Mark Van Natta; Cynthia Behling; Melissa J Contos; Oscar W Cummings; Linda D Ferrell; Yao-Chang Liu; Michael S Torbenson; Aynur Unalp-Arida; Matthew Yeh; Arthur J McCullough; Arun J Sanyal
Journal: Hepatology Date: 2005-06 Impact factor: 17.425

6. A lipidomic analysis of nonalcoholic fatty liver disease.

Authors: Puneet Puri; Rebecca A Baillie; Michelle M Wiest; Faridoddin Mirshahi; Jayanta Choudhury; Onpan Cheung; Carol Sargeant; Melissa J Contos; Arun J Sanyal
Journal: Hepatology Date: 2007-10 Impact factor: 17.425

7. Proposed minimum reporting standards for chemical analysis Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI).

Authors: Lloyd W Sumner; Alexander Amberg; Dave Barrett; Michael H Beale; Richard Beger; Clare A Daykin; Teresa W-M Fan; Oliver Fiehn; Royston Goodacre; Julian L Griffin; Thomas Hankemeier; Nigel Hardy; James Harnly; Richard Higashi; Joachim Kopka; Andrew N Lane; John C Lindon; Philip Marriott; Andrew W Nicholls; Michael D Reily; John J Thaden; Mark R Viant
Journal: Metabolomics Date: 2007-09 Impact factor: 4.290

8. Amount of hepatic fat predicts cardiovascular risk independent of insulin resistance among Hispanic-American adolescents.

Authors: Ran Jin; Ngoc-Anh Le; Rebecca Cleeton; Xiaoyan Sun; Jessica Cruz Muños; James Otvos; Miriam B Vos
Journal: Lipids Health Dis Date: 2015-05-01 Impact factor: 3.876

9. xMSanalyzer: automated pipeline for improved feature detection and downstream analysis of large-scale, non-targeted metabolomics data.

Authors: Karan Uppal; Quinlyn A Soltow; Frederick H Strobel; W Stephen Pittard; Kim M Gernert; Tianwei Yu; Dean P Jones
Journal: BMC Bioinformatics Date: 2013-01-16 Impact factor: 3.169

10. Hybrid Method Based on Information Gain and Support Vector Machine for Gene Selection in Cancer Classification.

Authors: Lingyun Gao; Mingquan Ye; Xiaojie Lu; Daobin Huang
Journal: Genomics Proteomics Bioinformatics Date: 2017-12-12 Impact factor: 7.691

12 in total

Review 1. Metabolomic and Lipidomic Biomarkers for Premalignant Liver Disease Diagnosis and Therapy.

Authors: Diren Beyoğlu; Jeffrey R Idle
Journal: Metabolites Date: 2020-01-28

2. Sex-Specific Metabolite Biomarkers of NAFLD in Youth: A Prospective Study in the EPOCH Cohort.

Authors: Wei Perng; Ellen C Francis; Harry A Smith; John Carey; Dongqing Wang; Katerina M Kechris; Dana Dabelea
Journal: J Clin Endocrinol Metab Date: 2020-09-01 Impact factor: 5.958

3. Proceedings of the Summer Institute on Symptoms and Omics.

Authors: Irene Yang; Marcia Holstad
Journal: Biol Res Nurs Date: 2021-11-06 Impact factor: 2.522

Review 4. Perspectives on youth-onset nonalcoholic fatty liver disease.

Authors: Eduardo Castillo-Leon; Catherine E Cioffi; Miriam B Vos
Journal: Endocrinol Diabetes Metab Date: 2020-09-17

5. Plasma Metabolome Alterations Associated with Extrauterine Growth Restriction.

Authors: Danuta Dudzik; Isabel Iglesias Platas; Montserrat Izquierdo Renau; Carla Balcells Esponera; Beatriz Del Rey Hurtado de Mendoza; Carles Lerin; Marta Ramón-Krauel; Coral Barbas
Journal: Nutrients Date: 2020-04-23 Impact factor: 5.717

6. Comparison of the Lipidomic Signature of Fatty Liver in Children and Adults: A Cross-Sectional Study.

Authors: Jake P Mann; Benjamin Jenkins; Samuel Furse; Stuart G Snowden; Anna Alisi; Laura G Draijer; Kylie Karnebeek; Deirdre A Kelly; Bart G Koot; Antonella Mosca; Camilla Salvestrini; Indra van Mourik; Anita Vreugdenhil; Matthias Zilbauer; Albert Koulman
Journal: J Pediatr Gastroenterol Nutr Date: 2022-02-19 Impact factor: 3.288

7. Changes of microbial and metabolome of the equine hindgut during oligofructose-induced laminitis.

Authors: Maimaiti Tuniyazi; Junying He; Jian Guo; Shuang Li; Naisheng Zhang; Xiaoyu Hu; Yunhe Fu
Journal: BMC Vet Res Date: 2021-01-06 Impact factor: 2.741

Review 8. Non-Alcoholic Fatty Liver Disease in Obese Youth With Insulin Resistance and Type 2 Diabetes.

Authors: Serena Scapaticci; Ebe D'Adamo; Angelika Mohn; Francesco Chiarelli; Cosimo Giannini
Journal: Front Endocrinol (Lausanne) Date: 2021-04-06 Impact factor: 5.555

Review 9. Amino Acid-Related Metabolic Signature in Obese Children and Adolescents.

Authors: Nella Polidori; Eleonora Agata Grasso; Francesco Chiarelli; Cosimo Giannini
Journal: Nutrients Date: 2022-03-30 Impact factor: 5.717

10. Prenatal Exposure to Perfluoroalkyl Substances Associated With Increased Susceptibility to Liver Injury in Children.

Authors: Nikos Stratakis; David V Conti; Ran Jin; Katerina Margetaki; Damaskini Valvi; Alexandros P Siskos; Léa Maitre; Erika Garcia; Nerea Varo; Yinqi Zhao; Theano Roumeliotaki; Marina Vafeiadi; Jose Urquiza; Silvia Fernández-Barrés; Barbara Heude; Xavier Basagana; Maribel Casas; Serena Fossati; Regina Gražulevičienė; Sandra Andrušaitytė; Karan Uppal; Rosemary R C McEachan; Eleni Papadopoulou; Oliver Robinson; Line Småstuen Haug; John Wright; Miriam B Vos; Hector C Keun; Martine Vrijheid; Kiros T Berhane; Rob McConnell; Lida Chatzi
Journal: Hepatology Date: 2020-10-19 Impact factor: 17.425