| Literature DB >> 28830417 |
T Delespierre1,2, P Denormandie3, A Bar-Hen4, L Josseran5.
Abstract
BACKGROUND: Korian is a private group specializing in medical accommodations for elderly and dependent people. A professional data warehouse (DWH) established in 2010 hosts all of the residents' data. Inside this information system (IS), clinical narratives (CNs) were used only by medical staff as a residents' care linking tool. The objective of this study was to show that, through qualitative and quantitative textual analysis of a relatively small physiotherapy and well-defined CN sample, it was possible to build a physiotherapy corpus and, through this process, generate a new body of knowledge by adding relevant information to describe the residents' care and lives.Entities:
Keywords: Data mining; Hierarchical clustering; Information extraction; Multiple component analysis; Named entity recognition; Nursing homes; Principal component analysis; SQL query; Text mining; Word cloud
Mesh:
Year: 2017 PMID: 28830417 PMCID: PMC5568397 DOI: 10.1186/s12911-017-0519-0
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1The experiment design with monitored (textual SQL and classification) and unsupervised (PCA, MCA, HC and textmining) techniques
Fig. 3Bar plot of words appearing at least 30 times in the physiotherapy sample corpus (stage 2)
Fig. 4Word cloud with words appearing at least 10 times in the physiotherapy sample corpus (stage 2)
The residents’ Pathos variable frequencies during their NH stays on 09/30/2013
| Number of medical histories per resident as of 09/30/2013 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
|
|
|
|
| pleuro-pulmonary | 896 | 111 | 8 | |||||
| infections | 941 | 69 | 5 | |||||
| dermatology | 954 | 58 | 3 | |||||
|
|
|
|
|
|
|
| ||
|
|
|
|
|
| ||||
| endocrine | 888 | 116 | 10 | 1 | ||||
| uro-nephrology | 873 | 119 | 22 | 1 | ||||
|
|
|
|
|
|
| |||
| Number of pathologies per resident as of 09/30/2013 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
|
|
| |
| pleuro-pulmonary | 881 | 131 | 2 | 1 | ||||
| infections | 977 | 37 | 1 | |||||
| dermatology | 924 | 87 | 4 | |||||
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
| |||
|
|
|
|
| |||||
|
|
|
|
|
| ||||
|
|
|
|
|
|
|
In italics all medical histories or pathologies with at least 200 residents (19.7%) afflicted at least once
The physiotherapy corpus built through the SQL process (stage 1) with selected one physio expression--one physio concept and their precision
| word | frequency | percentage | Precision a priori | Sentence precision | Precision a posteriori |
|---|---|---|---|---|---|
|
|
|
|
|
|
|
| autonomy | 560(524) | 13.82 | 93.57 | 80.56 | 98.75 |
| renewal | 518 | 12.79 | 100 | - | 100 |
| per_week | 476 | 11.75 | 100 | - | 100 |
| good | 383(327) | 9.45 | 85.37 | 94.64 | 99.22 |
| functional_recovery | 358 | 8.84 | 100 | - | 100 |
|
|
|
|
|
|
|
| walking | 302 | 8 | - | 100 | 100 |
| partial | 295(294) | 7.28 | 100 | 100 | 100 |
| antalgic | 222(208) | 5.43 | 100 | 100 | 100 |
| per_day | 209 | 5.16 | 100 | - | 100 |
| others | 194 | 4.79 | 100 | - | 100 |
| pain | 86 | 1.88 | - | 96.51 | 96.51 |
| balance | 84 | 2.07 | - | 98.81 | 98.81 |
| massage | 74 | 1.83 | - | 100 | 100 |
| participation | 51 | 1.26 | - | 96.8 | 96.8 |
| voluntary | 47 | 1.16 | - | 100 | 100 |
| motivation | 37 | 0.91 | - | 100 | 100 |
| stimulation | 33 | 0.81 | - | 100 | 100 |
| stopping_treatment | 21 | 0.51 | 100 | - | 100 |
| cognition | 15 | 0.37 | - | 100 | 100 |
| useless | 11 | 0.27 | 100 | - | 100 |
| modification_treatment | 9 | 0.22 | 100 | - | 100 |
N/A not applicable
Fig. 2The residents’ number of falls and ages frequencies of the physiotherapy sample
Fig. 5The PCA on medical histories, pathologies and number of falls defined as continuous variables
The word frequencies of the acute and falling groups’ physiotherapy corpuses computed with tm®
| Hospitalized or followed by a physician after falling (261 residents) | Fallen at least 15 times (58 residents) | ||||
|---|---|---|---|---|---|
| word | frequency | ratio | word | frequency | ratio |
| autonomy | 127 | 0.49 | autonomy | 29 | 0.50 |
| functional | 97 | 0.37 | walking | 23 | 0.40 |
| recovery | 96 | 0.37 | functional | 16 | 0.28 |
| walking | 64 | 0.25 | recovery | 16 | 0.28 |
| antalgic | 45 | 0.17 | mobilization | 12 | 0.21 |
| very | 42 | 0.16 | very | 11 | 0.19 |
| plus | 34 | 0.13 | help | 10 | 0.17 |
| patient | 28 | 0.11 | plus | 10 | 0.17 |
| work | 26 | 0.10 | always | 9 | 0.16 |
| always | 19 | 0.07 | work | 8 | 0.14 |
| limbs | 18 | 0.07 | massage | 7 | 0.12 |
| massage | 17 | 0.07 | physiotherapist | 6 | 0.10 |
| mobilization | 16 | 0.06 | limbs | 6 | 0.10 |
| troubles | 15 | 0.06 | going further | 6 | 0.10 |
| help | 13 | 0.05 | rachis | 6 | 0.10 |
| pain | 13 | 0.05 | re-education | 6 | 0.10 |
| exercises | 13 | 0.05 | good | 5 | 0.09 |
| sessions | 13 | 0.05 | wheelchair | 5 | 0.09 |
| friday | 13 | 0.05 | fracture | 5 | 0.09 |
| good | 12 | 0.05 | less | 5 | 0.09 |
| difficult | 12 | 0.05 | new | 5 | 0.09 |
Fig. 6HC + MCA with regions, departments, NH’s names, gender, medical histories and pathologies as categorical variables
Fig. 7The HCPC plot function of the 6 clusters in 3D and 2D