| Literature DB >> 29567970 |
Ying Lin1, Shuai Huang2, Gregory E Simon3,4, Shan Liu2.
Abstract
Depression is a common mental illness with complex and heterogeneous progression dynamics. Risk grouping of depression treatment population based on their longitudinal patterns has the potential to enable cost-effective monitoring policy design. This paper establishes a rule-based method to identify a set of risk predictive patterns from person-level longitudinal disease measurements by integrating the data transformation, rule discovery and rule evaluation. We further extend the identified rules to create rule-based monitoring strategies to adaptively monitor individuals with different disease severities. We applied the rule-based method on an electronic health record (EHR) dataset of depression treatment population containing person-level longitudinal Patient Health Questionnaire (PHQ)-9 scores for assessing depression severity. 12 risk predictive rules are identified, and the rule-based prognostic model based on identified rules enables more accurate prediction of disease severity than other prognostic models including RuleFit, logistic regression and Support Vector Machine. Two rule-based monitoring strategies outperform the latest PHQ-9 based monitoring strategy by providing higher sensitivity and specificity. The rule-based method can lead to a better understanding of disease dynamics, achieving more accurate prognostics of disease progressions, personalizing follow-up intervals, and designing cost-effective monitoring of patients in clinical practice.Entities:
Mesh:
Year: 2018 PMID: 29567970 PMCID: PMC5864956 DOI: 10.1038/s41598-018-23326-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The rule-based analytic framework for longitudinal pattern discovery and adaptive monitoring.
(a) Individual-rule based monitoring strategies in the next 6 months. (b) Multiple-rules based monitoring strategies in the next 6 months.
|
| ||||||
|---|---|---|---|---|---|---|
|
|
| |||||
|
|
| |||||
|
| Monitor | Not monitor | ||||
|
| Not monitor | Monitor | ||||
|
| ||||||
|
|
|
|
|
|
| |
| Group 1 | unendorsed | unendorsed | Monitor | Not monitor | Not monitor | Monitor |
| Group 2 | endorsed | unendorsed | Monitor | Monitor | Monitor | Monitor |
| Group 3 | unendorsed | endorsed | Not monitor | Not monitor | Not monitor | Not monitor |
| Group 4 | endorsed | endorsed | Monitor | Not monitor | Monitor | Not monitor |
Note: the population is segmented into four groups including un-endorsing any of increasing risk rules and any of decreasing risk rules (Group 1), endorsing any of increasing risk rules but un-endorsing all decreasing risk rules (Group 2), un-endorsing all increasing risk rules but endorsing any of decreasing risk rules (Group 3), and endorsing all rules (Group 4) (percentage of patients in each group is presented in brackets). Four monitoring scenarios are considered: monitoring groups 1, 2 and 4 (Scenario 1); monitoring group 2 (Scenario 2); monitoring groups 2 and 4 (Scenario 3); monitoring groups 1 and 2 (Scenario 4).
12 top rules identified by the RuleFit model.
| Decreasing risk rules | Increasing risk rules | ||
|---|---|---|---|
| Rule 1 | Deepest increase between consecutive PHQ9 scores <7.50 & 75 percentile of PHQ9 score <14.62 | Rule 7 | Observing density >0.03 & Minimal PHQ9 score >8.50 |
| Rule 2 | 25 percentile of PHQ9 score <6.13 & Volatility of PHQ9 score <9.64 | Rule 8 | Minimal PHQ9 score >9.50 & Volatility of difference between nearby PHQ9 scores <4.75 |
| Rule 3 | 75 percentile of PHQ9 score <15.88 & Percentage of moderate depression <0.39 | Rule 9 | Latest PHQ9 score >17.50 & Volatility of PHQ9 score <7.33 |
| Rule 4 | Deepest decrease between consecutive PHQ9 scores >2.50 & 75 percentile of PHQ9 score <14.12 | Rule 10 | Minimal PHQ9 score >6.50 & 75 percentile of PHQ9 score >14.88 |
| Rule 5 | Sex is male & Mean of 9th question scores <0.71 & Percentage of moderately severe <0.38 | Rule 11 | Age <65 & Percentage of severe depression >0.23 |
| Rule 6 | Latest PHQ9 score <8.50 & Maximal PHQ9 score <16.50 | Rule 12 | Deepest decrease between consecutive PHQ9 scores <13.50 & Mean of PHQ9 scores >14.73 |
Note: the cut-off values of the variables in the rules were automatically determined by RuleFit for maximum statistical prediction power.
Figure 2Proportion of low-risk patients (average PHQ-9 score <10) in rule-endorsing group.
Figure 3The associations between rule endorsements and depression severity of 12 rules. Each curve represents the probabilities of endorsing a rule under various depression severity. The increasing risk rules are plotted in dash lines and the decreasing risk rules are plotted in solid lines.
Prediction accuracy of several methods on testing data.
| Model | Rule-based prognostic model | RuleFit | Logistic regression | SVM | |||
|---|---|---|---|---|---|---|---|
| All factors | Significant factors | All factors | Significant factors | All factors | Significant factors | ||
|
| 0.83 | 0.82 | 0.82 | 0.81 | 0.81 | 0.81 | 0.81 |
Figure 4Comparison of sensitivity and specificity in all monitoring strategies.
Monitoring outcomes of all strategies (N = 562 patients).
| Strategy | Sensitivity | Specificity |
|---|---|---|
| Rule 9 | 0.34 | 0.96 |
| Rule 8 | 0.38 | 0.94 |
| Rule 7 | 0.49 | 0.89 |
| Rule 12 | 0.56 | 0.85 |
| Rule 10 | 0.67 | 0.80 |
| Rule 11 | 0.67 | 0.78 |
| MultiRule 2 | 0.69 | 0.78 |
| MultiRule 4 | 0.73 | 0.77 |
| PHQ-9 Based | 0.77 | 0.67 |
| MultiRule 3 | 0.81 | 0.69 |
| MultiRule 1 | 0.84 | 0.68 |
| Rule 4 | 0.84 | 0.60 |
| Rule 1 | 0.85 | 0.56 |
| Rule 3 | 0.88 | 0.55 |
| Rule 2 | 0.91 | 0.50 |
| Rule 6 | 0.94 | 0.46 |
| Rule 5 | 0.90 | 0.28 |
| Status Quo (SQ) | 1.00 | 0.00 |