| Literature DB >> 32870314 |
George N Ioannou1,2, Weijing Tang3, Lauren A Beste4, Monica A Tincopa5, Grace L Su5,6, Tony Van7, Elliot B Tapper5,6, Amit G Singal8, Ji Zhu3,7, Akbar K Waljee5,6,7.
Abstract
Importance: Deep learning, a family of machine learning models that use artificial neural networks, has achieved great success at predicting outcomes in nonmedical domains. Objective: To examine whether deep learning recurrent neural network (RNN) models that use raw longitudinal data extracted directly from electronic health records outperform conventional regression models in predicting the risk of developing hepatocellular carcinoma (HCC). Design, Setting, and Participants: This prognostic study included 48 151 patients with hepatitis C virus (HCV)-related cirrhosis in the national Veterans Health Administration who had at least 3 years of follow-up after the diagnosis of cirrhosis. Patients were identified by having at least 1 positive HCV RNA test between January 1, 2000, to January 1, 2016, and were followed up from the diagnosis of cirrhosis to January 1, 2019, for the development of incident HCC. A total of 3 models predicting HCC during a 3-year period were developed and compared, as follows: (1) logistic regression (LR) with cross-sectional inputs (cross-sectional LR); (2) LR with longitudinal inputs (longitudinal LR); and (3) RNN with longitudinal inputs. Data analysis was conducted from April 2018 to August 2020. Exposures: Development of HCC. Main Outcomes and Measures: Area under the receiver operating characteristic curve, area under the precision-recall curve, and Brier score.Entities:
Mesh:
Year: 2020 PMID: 32870314 PMCID: PMC7489819 DOI: 10.1001/jamanetworkopen.2020.15626
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Figure 1. Schematics of Case and Control Definitions and Models Developed to Predict HCC Development
A, Patients with hepatitis C virus infection who had a diagnosis of cirrhosis and at least 3 years of follow-up from the time of diagnosis of cirrhosis to their last follow-up visit in the Veterans Healthcare Administration (VHA) were identified. Patients who developed hepatocellular carcinoma (HCC) within 3 years of time t after the development of cirrhosis were designated cases, and those who did not were designated controls. All data available at or before time t were used as predictors of the development of cirrhosis within 3 years of time t. The first and third examples are for patients who developed HCC during follow-up; the second example is for a patient who did not develop HCC during follow-up. B, Schematic comparison of the 3 different models we developed to predict HCC development (ie, model 1, logistic regression using cross-sectional baseline data at time t; model 2, logistic regression using human-designed longitudinal data prior to time t; and model 3, recurrent neural networks using raw longitudinal data prior to time t). C, Model structure of longitudinal recurrent neural network under 1 representative splitting.
Characteristics of Controls and Cases Used in Model Building at the Sampled Visit
| Characteristic | Mean (SD) | |
|---|---|---|
| Samples of patients who did not develop HCC within 3 y (n = 42 245) | Samples of patients who developed HCC within 3 y (n = 10 738) | |
| Age at cirrhosis diagnosis, y | 56.9 (6.9) | 58.2 (6.6) |
| Men, No. (%) | 41 315 (97.8) | 10 633 (99.0) |
| Race/Ethnicity, No. (%) | ||
| White, non-Hispanic | 23 681 (56.1) | 5996 (55.8) |
| Black, non-Hispanic | 10 805 (25.6) | 2626 (24.5) |
| Hispanic, Asian, Pacific Island, AIAN, or other | 4956 (11.7) | 1323 (12.35) |
| Declined to answer or missing | 2803 (6.6) | 793 (7.4) |
| Genotype, No. (%) | ||
| 1 | 30 702 (72.7) | 7497 (69.8) |
| 2 | 3289 (7.8) | 637 (5.9) |
| 3 | 3256 (7.7) | 1212 (11.3) |
| ≥4 | 356 (0.8) | 93 (0.9) |
| Missing | 4642 (11.0) | 1299 (12.1) |
| Achieved SVR at time | 5680 (13.4) | 1192 (11.1) |
| Duration of cirrhosis at time | 1.97 (2.61) | 2.72 (3.30) |
| BMI at time | 28.7 (5.6) | 28.1 (5.4) |
| Laboratory test results at time | ||
| AST, U/L | 71.7 (47.2) | 87.1 (49.5) |
| ALT, U/L | 70.1 (71.0) | 74.9 (58.5) |
| Platelet count, ×103/μL | 141.1 (74.5) | 121.0 (68.3) |
| Bilirubin, mg/dL | 1.2 (1.5) | 1.5 (1.5) |
| INR | 1.2 (0.3) | 1.2 (0.3) |
| Creatinine, mg/dL | 1.1 (1.0) | 1.0 (0.7) |
| FIB-4 score | 5.1 (4.7) | 6.8 (5.2) |
| APRI score | 1.8 (2.0) | 2.5 (2.2) |
Abbreviations: AIAN, American Indian/Alaskan Native; ALT, alanine aminotransferase; AST, aspartate transaminase; APRI, AST to platelet ratio index; BMI, body mass index (calculated as weight in kilograms divided by height in meters squared); FIB-4, fibrosis-4; INR, international normalized ratio; SVR, sustained virologic response.
SI conversion factors: To convert ALT and AST to microkatals per liter, multiply by 0.0167; bilirubin to millimoles per liter, multiply by 17.104; creatinine to millimoles per liter, multiply by 88.4; and platelet count to ×109 per liter, multiply by 1.0.
Comparison of the Performance Characteristics of 3 Different Models Predicting the Development of Hepatocellular Carcinoma Within 3 Years in Patients With Hepatitis C Virus–Related Cirrhosis
| Performance characteristic | Mean (SD) | |||
|---|---|---|---|---|
| Cross-sectional LR model | Longitudinal LR model | RNN model | ||
| AUROC | 0.682 (0.007) | 0.689 (0.009) | 0.759 (0.009) | <.001 |
| Brier score | 0.150 (0.003) | 0.149 (0.003) | 0.136 (0.003) | <.001 |
| AUPRC | 0.345 (0.011) | 0.361 (0.009) | 0.479 (0.018) | <.001 |
| Proportion of samples who test positive at 90% sensitivity | 0.746 (0.008) | 0.736 (0.013) | 0.663 (0.012) | <.001 |
| Specificity at 90% sensitivity | 0.293 (0.010) | 0.305 (0.016) | 0.397 (0.014) | <.001 |
| Positive predictive value at 90% sensitivity | 0.243 (0.003) | 0.246 (0.003) | 0.273 (0.006) | <.001 |
| Negative predictive value at 90% sensitivity | 0.920 (0.006) | 0.923 (0.007) | 0.940 (0.003) | <.001 |
| Proportion of samples who test positive at 80% sensitivity | 0.601 (0.012) | 0.591 (0.017) | 0.514 (0.015) | <.001 |
| Specificity at 80% sensitivity | 0.449 (0.015) | 0.462 (0.021) | 0.558 (0.018) | <.001 |
| Positive predictive value at 80% sensitivity | 0.268 (0.007) | 0.27 (30.009) | 0.314 (0.009) | <.001 |
| Negative predictive value at 80% sensitivity | 0.898 (0.004) | 0.901 (0.005) | 0.916 (0.004) | <.001 |
| AUROC | 0.672 (0.030) | 0.705 (0.024) | 0.806 (0.025) | <.001 |
| Brier score | 0.139 (0.006) | 0.136 (0.006) | 0.117 (0.007) | <.001 |
| AUPRC | 0.333 (0.060) | 0.361 (0.050) | 0.519 (0.064) | <.001 |
| Proportion of samples who test positive at 90% sensitivity | 0.793 (0.041) | 0.702 (0.028) | 0.571 (0.052) | <.001 |
| Specificity at 90% sensitivity | 0.230 (0.050) | 0.340 (0.035) | 0.499 (0.064) | <.001 |
| Positive predictive value at 90% sensitivity | 0.205 (0.022) | 0.230 (0.020) | 0.285 (0.033) | <.001 |
| Negative predictive value at 90% sensitivity | 0.904 (0.013) | 0.933 (0.006) | 0.954 (0.005) | <.001 |
| Proportion of samples who test positive at 80% sensitivity | 0.628 (0.040) | 0.559 (0.035) | 0.429 (0.039) | <.001 |
| Specificity at 80% sensitivity | 0.409 (0.051) | 0.492 (0.043) | 0.651 (0.047) | <.001 |
| Positive predictive value at 80% sensitivity | 0.230 (0.026) | 0.257 (0.023) | 0.337 (0.036) | <.001 |
| Negative predictive value at 80% sensitivity | 0.898 (0.009) | 0.914 (0.009) | 0.934 (0.007) | <.001 |
Abbreviations: AUPRC, area under the precision-recall curve; AUROC, area under receiver operating characteristic curve; LR, logistic regression; RNN, recurrent neural network; SVR, sustain virologic response.
Figure 2. Receiver Operating Characteristic Curves for 3 Prediction Models
We developed 3 different models predicting the development of hepatocellular carcinoma within 3 years under 1 representative splitting (results based on the testing set) in all samples from patients with hepatitis C virus (HCV)–related cirrhosis and samples from patients who achieved sustained virologic response (SVR) with HCV-related cirrhosis.
Comparison of Predicted and Observed 3-Year HCC Risk in the Study Population Divided Into Tertiles According to Each Model Under 1 Representative Splitting
| Tertile | Cross-sectional LR model | Longitudinal LR model | RNN model | ||||||
|---|---|---|---|---|---|---|---|---|---|
| % | No. (%) | % | No. (%) | % | No. (%) | ||||
| Observed 3-y HCC risk | Predicted 3-y HCC risk | Observed 3-y HCC risk | Predicted 3-y HCC risk | Observed 3-y HCC risk | Predicted 3-y HCC risk | ||||
| First, low risk | 4 | 5 | 17661 (33.3) | 4 | 4 | 17661 (33.3) | 2 | 2 | 17661 (33.3) |
| Second, medium risk | 10 | 9 | 17661 (33.3) | 9 | 9 | 17661 (33.3) | 7 | 8 | 17661 (33.3) |
| Third, high risk | 18 | 18 | 17661 (33.3) | 18 | 19 | 17661 (33.3) | 24 | 25 | 17661 (33.3) |
Abbreviations: HCC, hepatocellular carcinoma; LR, logistic regression; RNN, recurrent neural network.
The observed 3-year HCC risk is the proportion of those who developed HCC within 3 years among samples in each group. The predicted 3-year HCC risk is the mean of probabilities that are returned by models for samples in each group.