| Literature DB >> 35016654 |
Wenyan Tan1, Heng Weng2, Haicheng Lin1, Aihua Ou3, Zehui He3, Fujun Jia4.
Abstract
BACKGROUND: Based on more than 15 million follow-up records of 404,426 patients from Guangdong Mental Health Center over the past 10 years, this study aims to propose a disease risk analysis and prediction model to support chronic disease management and clinical research for schizophrenia patients.Entities:
Keywords: AutoAHP; Disease risk prediction; Intelligent information processing; Risk analysis for severe psychosis
Mesh:
Year: 2022 PMID: 35016654 PMCID: PMC8750858 DOI: 10.1186/s12911-022-01749-1
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1The visualization of the discretization method, where the red column indicates risk events, blue indicates no risk events, area indicates a population density distribution, and dotted line indicates a population ratio
Fig. 2The distributions of age and disease course, where the red line indicates risk event, blue line indicates no risk events
Fig. 3The overview of our AutoAHP framework for factor analysis
The categories of criteria and sub-criteria of all the risk factors
| Main criteria | Sub-criteria |
|---|---|
| Demography | Region, age, gender, education, disability grade, social support, economic status |
| Treatment | Adverse reaction, compliance, treatment, combination of drugs |
| Disease course | Number of hospitalizations, risk events, annual policy, duration |
Hyper-parameters of baseline methods
| Methods | Hyper-parameters |
|---|---|
| Random Forest | num of trees: 1000, num of attr consider at each split: 5 |
| Neural Network | Neurons of hidden layers: 100, activation: Relu, solver: Adam, regularization, learning rate: 0.001, iters: 200 |
| Logistic Regression | regularization type: ridge(L2), strength: |
| SGD | Loss function: logistic regression, regularization method: Elastic Net, |
| kNN | |
| SVM | RBF, Kernel: |
Fig. 4Risk events data by age cohort period index and APC canonical parameters as well as representation of follow-up data of schizophrenia patients
The final features and related indicators
| Info. gain | Gain ratio | Gini | ReliefF | FCBF | ||
|---|---|---|---|---|---|---|
| Referral times* | 0.034 | 0.024 | 0.021 | 2881.741 | 0.056 | 0 |
| Drug combination | 0.013 | 0.003 | 0.008 | 1037.934 | 0.112 | 0 |
| Compliance | 0.011 | 0.026 | nan | 892.802 | 0.02 | N/A |
| Adverse reactions times* | 0.003 | 0.005 | 0.002 | 339.393 | 0.001 | 0 |
| Suggest Referral | 0.036 | 0.07 | 0.019 | 168.791 | 0.038 | 0.05 |
| Diagnostic type | 0.004 | 0.005 | 0.003 | 141.892 | 0.052 | 0 |
| Social function | 0.006 | 0.008 | 0.004 | 136.586 | 0.024 | 0.007 |
| Poverty | 0.003 | 0.003 | 0.002 | 129.539 | − 0.002 | 0 |
| Targeted poverty alleviation | 0.003 | 0.003 | 0.002 | 129.539 | − 0.002 | 0 |
| Duration days* | 0.002 | 0.001 | 0.001 | 85.161 | 0.02 | 0 |
| Family guardianship subsidy | 0.006 | 0.01 | 0.004 | 60.961 | 0.006 | 0 |
| Auxiliary drug combination | 0.003 | 0.001 | 0.002 | 59.541 | − 0.005 | 0 |
| Disability rating | 0.002 | 0.001 | 0.001 | 38.923 | 0.128 | 0 |
| Hospitalization times | 0.001 | 0.001 | 0.001 | 36.31 | 0.027 | 0 |
| Region | 0.063 | 0.022 | 0.042 | 21.243 | 0.142 | 0.034 |
| Gender | 0 | 0 | 0 | 16.812 | 0.004 | 0 |
| Course of disease rating | 0.001 | 0.001 | 0.001 | 13.776 | 0.048 | 0 |
| Region | 0.001 | 0.002 | 0 | 10.812 | 0.032 | 0 |
| Drug combination number* | 0 | 0 | 0 | 9.131 | 0.011 | 0 |
| Age rating | 0.001 | 0.001 | 0 | 5.367 | 0.036 | 0 |
| Economic status | 0 | 0 | nan | 2.868 | 0.068 | N/A |
*denotes the variables that are discretized
Parameter tests of Cox regression
| Coefficients [95% CI] | z | |
|---|---|---|
| Adverse reactions times | 0.01 [0.01, 0.01] | 4.43* |
| Drug combination | 0.16 [0.12, 0.19] | 8.44* |
| Age rating | 0.09 [0.05, 0.13] | 4.42* |
| Diagnostic type | 0.61 [0.54, 0.68] | 17.12* |
| Duration days | 0.05 [0.01, 0.08] | 2.55* |
| Drug combination | − 0.06 [− 0.08, − 0.03] | − 4.03* |
| Gender: female | − 0.37 [− 0.41, − 0.33] | − 17.98* |
| Disability rating | − 0.06 [− 0.09, − 0.03] | − 4.56* |
| Social function | 0.25 [0.21, 0.28] | 13.85* |
| Compliance | 0.48 [0.44, 0.52] | 26.31* |
| Targeted poverty alleviation | 0.31 [0.27, 0.35] | 15.12* |
| Family guardianship subsidy | 0.39 [0.32, 0.45] | 12.29* |
*
Performance comparison of the AutoAHP framework against baseline methods
| Method | AUC | Accuracy | Precision | Recall | F1 |
|---|---|---|---|---|---|
| SVM | 0.480 | 0.589 | 0.619 | 0.589 | 0.602 |
| kNN | 0.537 | 0.727 | 0.639 | 0.727 | 0.657 |
| SGD | 0.564 | 0.722 | 0.686 | 0.722 | 0.697 |
| Logistic Regression | 0.722 | 0.758 | 0.719 | 0.758 | 0.700 |
| Naive Bayes | 0.712 | 0.763 | 0.732 | 0.763 | 0.728 |
| Neural Network | 0.900 | 0.881 | 0.878 | 0.881 | 0.879 |
| Random Forest | 0.945 | 0.921 | 0.919 | 0.921 | 0.919 |
| AutoAHP | 0.954 | 0.924 | 0.923 | 0.924 | 0.923 |
Fig. 5Disease treatment efficiency and risk distributions of Guangdong province for schizophrenia risk early warning. In the map, the higher the population risk score, the larger in node size. The worse clinical effect is, the darker red in node color. The greater crowd density, the brighter blue in cloud color