| Literature DB >> 29065617 |
Xiao Xu1, Tao Jin1, Zhijie Wei1, Jianmin Wang1.
Abstract
Clinical pathways are widely used around the world for providing quality medical treatment and controlling healthcare cost. However, the expert-designed clinical pathways can hardly deal with the variances among hospitals and patients. It calls for more dynamic and adaptive process, which is derived from various clinical data. Topic-based clinical pathway mining is an effective approach to discover a concise process model. Through this approach, the latent topics found by latent Dirichlet allocation (LDA) represent the clinical goals. And process mining methods are used to extract the temporal relations between these topics. However, the topic quality is usually not desirable due to the low performance of the LDA in clinical data. In this paper, we incorporate topic assignment constraint and topic correlation limitation into the LDA to enhance the ability of discovering high-quality topics. Two real-world datasets are used to evaluate the proposed method. The results show that the topics discovered by our method are with higher coherence, informativeness, and coverage than the original LDA. These quality topics are suitable to represent the clinical goals. Also, we illustrate that our method is effective in generating a comprehensive topic-based clinical pathway model.Entities:
Mesh:
Year: 2017 PMID: 29065617 PMCID: PMC5474282 DOI: 10.1155/2017/5208072
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
The national clinical pathway of intracerebral hemorrhage released by the Ministry of Health of China.
| Stage | Order |
|---|---|
| Stage 1 | Long-term medical order: |
| (1) neurology nursing routine, (2) level I care, | |
| Temporary medical order: | |
| (1) blood, urine, and stool routine examination; | |
|
| |
| Stage 2 | Long-term medical order: |
| (1) neurology nursing routine, (2) level I care, | |
| Temporary medical order: | |
| (1) re-examination for abnormal laboratory values and (2) when necessary: re-examination CT | |
|
| |
| Stage 3 | Long-term medical order: |
| (1) neurology nursing routine, (2) level I care, | |
| Temporary medical order: | |
| (1) re-examination for abnormal laboratory values | |
|
| |
| Stage 4 | Long-term medical order: |
| (1) neurology nursing routine, (2) level II care, | |
| Temporary medical order: | |
| (1) re-examination for abnormal laboratory values; | |
|
| |
| Stage 5 | Long-term medical order: |
| (1) neurology nursing routine, (2) level II-III care, | |
| Temporary medical order: | |
| (1) re-examination for abnormal laboratory values | |
|
| |
| Stage 6 | Long-term medical order: |
| (1) discharge with drugs | |
Figure 1Two problems in applying LDA on clinical data: (a) the clinical activities in one day for a patient and (b) the three discovered topics.
Figure 2The illustrative process of our approach. The top part is the pipeline, middle part is the business domain, and bottom part is the corresponding algorithm domain (T: patient trace; a: clinical activity; day: clinical day; G: group; VOC: vocabulary).
Figure 3Graphical representation of (a) LDA and (b) CDG-LDA.
Meanings of the notations.
| Notation | Meaning |
|---|---|
|
| The set of clinical days |
|
| The set of clinical activities |
|
| The set of topics assigned to |
|
| The number of clinical days and topics |
|
| The number of unique clinical activities |
|
| The number of clinical activities in |
|
| The number of clinical activities that are assigned |
|
| The number of groups (unique clinical activities) |
|
| The number of clinical activities in |
|
| The |
|
| The topic for |
|
| The clinical activities of |
|
| The |
|
| The topic for |
|
| { |
|
| The number of clinical activities that are assigned |
|
| The number of clinical activities with value |
|
| The number of clinical activities that are assigned topic |
|
| The number of clinical activities with value |
|
| Dirichlet prior vector |
|
| Multinomial distribution over clinical activities |
|
| Multinomial distribution over clinical activities for |
|
| Multinomial distribution over topics for |
Algorithm 1Gibbs sampling of CDG-LDA.
Statistics of our datasets.
| Disease | Trace | Day | Voc | Avg | Min | Max |
|---|---|---|---|---|---|---|
| ICH | 240 | 3204 | 752 | 14 | 2 | 34 |
| IH | 33 | 241 | 447 | 6 | 2 | 10 |
Eight labeled topics of ICH by different methods.
| Number | Topic tag | LDA | Topic tag | CDG-LDA |
|---|---|---|---|---|
| 1 | Imaging | (1) Doppler echocardiography, (2) Doppler | Imaging | (1) Doppler echocardiography, |
|
| ||||
| 2 | Medication | (1) Aceglutamide, (2) sodium chloride, | Medication | (1) Pantoprazole, (2) potassium magnesium |
|
| ||||
| 3 | High-level | (1) Level I care, (2) ECG monitoring, | High-level | (1) Level I care, (2) ECG monitoring, |
|
| ||||
| 4 | Regular nursing | (1) Level II care, (2) hospital examining fee, | Regular | (1) Level II care, (2) venous transfusion, |
|
| ||||
| 5 | Admission | (1) ECG, (2) PLGA, (3) fibrous protein, | Admission | (1) Brain CT, (2) venous sampling, (3) ECG, |
|
| ||||
| 6 | Biochemical | (1) Urea assay, (2) creatinine assay, (3) uric | Biochemical | (1) Creatinine assay, (2) urea assay, |
|
| ||||
| 7 | Biochemical | (1) Serum a-L-glucosidase assay, (2) serum | Biochemical | (1) Serum 5′ nucleotidase assay, (2) serum |
|
| ||||
| 8 | Biochemical | (1) Serum aspartate aminotransferase assay, | Biochemical | (1) Serum aspartate aminotransferase assay, |
Five labeled topics of IH by different methods.
| Number | Topic tag | LDA | Topic tag | CDG-LDA |
|---|---|---|---|---|
| 1 | Biochemical | (1) Blood cell analysis, (2) glucose assay, | Biochemical | (1) Blood cell analysis, (2) glucose assay, |
|
| ||||
| 2 | Infectious | (1) Hepatitis A antibody assay, (2) treponema | Presurgical | (1) Skin preparation package, (2) skin |
|
| ||||
| 3 | Admission | (1) ECG, (2) ECG monitoring, (3) ABO | Admission | (1) ECG, (2) ECG monitoring, (3) urine |
|
| ||||
| 4 | Regular | (1) Level II care, (2) hospital examining fee, | Regular | (1) Level II care, (2) level III care, (3) dressing |
|
| ||||
| 5 | Surgery | (1) Level II care, (2) endotherm knife, | Surgery | (1) Mask, (2) endotherm knife, (3) surgery |
Topic coherence in terms of NKQM@N.
| Dataset | Method | NKQM@5 | NKQM@10 | NKQM@20 |
|---|---|---|---|---|
| ICH | LDA | 0.8211 | 0.8004 | 0.7907 |
| CDG-LDA | 0.8448 | 0.8498 | 0.8235 | |
|
| ||||
| IH | LDA | 0.7915 | 0.7830 | 0.7631 |
| CDG-LDA | 0.8316 | 0.8266 | 0.8116 | |
Figure 4Redundancy indicator RE on various topic number K and top size N.
Figure 5ICH process model.
Figure 6IH process model.