| Literature DB >> 35272662 |
Brandon Oselio1, Amit G Singal2, Xuefei Zhang3, Tony Van4, Boang Liu3,5, Ji Zhu3,6, Akbar K Waljee7,8,9.
Abstract
BACKGROUND: Evaluation of new treatment policies is often costly and challenging in complex conditions, such as hepatitis C virus (HCV) treatment, or in limited-resource settings. We sought to identify hypothetical policies for HCV treatment that could best balance the prevention of cirrhosis while preserving resources (financial or otherwise).Entities:
Keywords: Cirrhosis; Hepatology; Machine learning; Prediction modeling; Reinforcement learning; Risk-based treatment; Treatment policy
Mesh:
Substances:
Year: 2022 PMID: 35272662 PMCID: PMC8913329 DOI: 10.1186/s12911-022-01789-7
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Modeling approach for reinforcement learning and off-policy evaluation. The historical cohort dataset consists of patients (1), whose state, i.e., longitudinal and demographic information is measured (2). Given these measurements, the risk to the patient progressing to cirrhosis is then evaluated (3). Finally, following the usual care treatment policy (4), a clinician makes a treatment decision (5) for the patient. The cycle then continues until the patient no longer returns for follow-up or the follow-up period concludes
Fig. 2Treatment probabilities as a function of APRI score for the (a) piecewise treatment policy and (b) logistic regression (APRI only) policy. The first treatment policy is a piecewise function where treatment probability increases with APRI score (a). The second treatment policy is a data-driven treatment policy using logistic regression with APRI as a single feature, and the outcome being a positive diagnosis of cirrhosis (b)
Fig. 3Three example traces for patients. Treatment decisions over time (a), SVR status (b), and risk score for development of cirrhosis and APRI/300, where 300 was chosen to place risk and APRI on similar scales (c) are displayed
Fig. 4Analysis of risk scores. a Comparison of risk scores in the dataset separated between treated and untreated measurement timepoints. As expected, the untreated timepoints have a higher risk score on average. b Median risk (with 50% percentile interval) striated across amount of time after treatment start date. As expected, risk continues to decrease after initial treatment
Expected risk of baseline and evaluation policies
| Risk score | 90% Bootstrap CI | Number of patients treated | High risk (n = 380) | Medium risk (n = 1530) | Low risk (n = 1882) | Untreated timepoints | |
|---|---|---|---|---|---|---|---|
| Policy 1: Piecewise Policy | 0.028 | (0.027, 0.033) | 2018.5 + − 17.8 | 307.2 + − 6.9 | 893.9 + − 13.8 | 817.4 + − 15.7 | 7040.8 + − 127.5 |
| Policy 2: Logistic Regression (APRI Only) | 0.026 | (0.024, 0.031) | 1914.4 + − 18.6 | 316.2 + − 4.7 | 919.6 + − 16.7 | 672.3 + − 12.4 | 7742.9 + − 141.9 |
| Policy 3: Logistic Regression (Full State) | 0.023 | (0.022, 0.029) | 1637 + − 15.8 | 311.2 + − 5.8 | 850.2 + − 9.7 | 475.6 + _ 8.8 | 8877.4 + − 97.9 |
| Policy 4: Risk-Based Policy | 0.016 | (0.016, 0.019) | 1843.7 + − 16.5 | 346.4 + − 1.4 | 1121.7 + − 13.8 | 361.0 + − 20.1 | 7968.4 + − 110.4 |
Fig. 5Risk sensitivity for k and . Sensitivity of the risk-based policy to and. As increases, risk also increases for a fixed (a), and as increases, decreases but stabilizes for a fixed (b)