| Literature DB >> 35033059 |
Guanglei Yu1, Linlin Zhang2, Ying Zhang3, Jiaqi Zhou1, Tao Zhang1, Xuehua Bi4.
Abstract
BACKGROUND: The greatly accelerated development of information technology has conveniently provided adoption for risk stratification, which means more beneficial for both patients and clinicians. Risk stratification offers accurate individualized prevention and therapeutic decision making etc. Hospital discharge records (HDRs) routinely include accurate conclusions of diagnoses of the patients. For this reason, in this paper, we propose an improved model for risk stratification in a supervised fashion by exploring HDRs about coronary heart disease (CHD).Entities:
Keywords: Hospital discharge records; Risk stratification; Supervised latent Dirichlet allocation; Topic models
Mesh:
Year: 2022 PMID: 35033059 PMCID: PMC8760773 DOI: 10.1186/s12911-022-01747-3
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Symbols and notations
| Symbols | Notations |
|---|---|
| HDRs index | |
| Patient features | |
| Patient feature-value pairs | |
| Topic index | |
| Topic-feature multinomials of feature | |
| Topic-value multinomials of feature-value | |
| Topic assignments | |
| Response variables | |
| Class coefficients |
Fig. 1Probabilistic graphical model. The probabilistic graphical model representation of Hierarchical sLDA (left); the graphical model representation of variational distribution (right)
Fig. 2Original HDR. The original HDR in Chinese (left); The corresponding English version (right)
Fig. 3Features annotation of HDRs. The process of features annotation of HDRs
Summary statistics of datasets
| Number of patient records | Number of patient features | Number of patient feature-value pairs | ||
|---|---|---|---|---|
| 420 | 34 | 79 | ||
Fig. 4Comparison of performance. Comparison of over all classes based on fivefold cross validation: training time (left); testing time (middle); average accuracy (right)
Fig. 5Comparison of confusion matrices. Comparison of confusion matrices of topic ; multi-class sLDA (left); Hierarchical sLDA (right)
Comparison of macro-F1, Precision, Recall
| Macro-F1 (%) | Macro-Precision (%) | Macro-Recall (%) | |
|---|---|---|---|
| Hierarchical sLDA | 70.91 | 70.96 | 71.70 |
| Multi-class sLDA | 70.54 | 70.84 | 71.70 |
Risk factors of CHD extracted from Hierarchical sLDA
| ICM | AMI | SAP | UAP |
|---|---|---|---|
| ST segment-Abn. | Hb-Abn. | SBP-Abn. | Diabetes-Yes |
| CTA stenosis-Mild | Duration-10 min | HbA1c-Abn. | Antiplatelet Medication-Yes |
| Carotid Atherosclerosis-Multiple plaque group | ST segment-Elevation | Uric acid-Abn. | HbA1c-Abn. |
| Fasting blood glucose-Abn. | ST segment-Change | Duration-3~5 min | |
| Heart rate-Sinus velocity | Gender-Female | Lipid drug medication-Yes | Gender-Male |
| CTA lesions-Single | CTA lesions-Single | Cardiac B-Ultrasound -Abn. | Hypertension-Yes |
| HbA1c-Abn. | ACEI/ARB medication-Yes | Hb-Abn. | DBP-Abn. |
| Age-45–65 years | Age-45–65 years | CTA stenosis-Mild | WBC-Abn. |
| Uric acid-Abn. | Uric acid-Abn. | Diabetes-Yes | SBP- Abn. |
| SBP- Abn. | SBP-Abn. | Antiplatelet medication-Yes | Uric acid-Abn. |
Fig. 6Selection of optimal hyperparameter . Using from 0.1 to 1.5 with interval of 0.1, topic K from 10 to 70 with interval of 5, the 3-D representions show that we should optimize hyperparameter with the fitted curve. The left view of 3-D represention (left); the front view of 3-D represention (right)