| Literature DB >> 36147556 |
M Dashtban1, Weizi Li1.
Abstract
The hospital outpatient non-attendance imposes a substantial financial burden on hospitals and roots in multiple diverse reasons. This research aims to build an advanced predictive model for predicting non-attendance regarding the whole spectrum of probable contributing factors to non-attendance that could be collated from heterogeneous sources including electronic patients records and external non-hospital data. We proposed a new non-attendance prediction model based on deep neural networks and machine learning models. The proposed approach works upon sparse stacked denoising autoencoders (SDAEs) to learn the underlying manifold of data and thereby compacting information and providing a better representation that can be utilised afterwards by other learning models as well. The proposed approach is evaluated over real hospital data and compared with several well-known and scalable machine learning models. The evaluation results reveal the proposed approach with softmax layer and logistic regression outperforms other methods in practice.Entities:
Keywords: No-show patients; deep learning; electronic patients records; health care; machine learning; outpatient appointment; prediction
Year: 2021 PMID: 36147556 PMCID: PMC9487947 DOI: 10.1080/20476965.2021.1924085
Source DB: PubMed Journal: Health Syst (Basingstoke) ISSN: 2047-6965
Figure 1.Research framework to develop non-attendance prediction model and evaluating performance gains from deep learning architecture.
Figure 2.Denoising Autoencoder Architecture.
Figure 3.Number of attendances vs non-attendances during 2015 to 2018 at monthly basis (note two stacked lines follow different scales given at vertical axis in right and left of the figure, respectively).
Brief description of variable groups for non-attendances prediction.
| CATEGORY | VARIABLE |
|---|---|
| Demographic | Age, Gender, Ethnicity, |
| Patient History | Multi-Comorbidities, Address stability |
| Appointment Characteristics | Follow-up or first-time appointment, GP referral time to appointment, care Speciality, Site |
| Time Variables | Day, month, year, time of the day |
| Patient Appointment History | Statistics on number/ratio of attended and non-attended within and out of 30 days, |
| Socioeconomics | Education decile, Index of Multiple Deprivation, Income Decile, Living Environment Rank, etc. |
| Weather | Temperature, Condition (e.g., rain/snow, etc.), Humidity |
| Admission History | Recent admission, length of Stay, Procedure, time interval |
Distribution of non-attendance over ethnicity group, care group and gender type.
| Attribute cluster | Levels | Non-attendance | total appointments% |
|---|---|---|---|
| British | 62.39% | 68.23% | |
| Non-British | 20.57% | 16.98% | |
| Not Known | 17.04% | 14.79% | |
| CG2 – Planned | 52.65% | 52.63% | |
| CG3 – Networked | 26.71% | 28.21% | |
| CG1 – Urgent/etc. | 18.97% | 19.17% | |
| Female | 55.40% | 57.6% | |
| Male | 44.69% | 42.4% |
Figure 4.Trend of non-attendance at different ages: (a) exhibits the number of non-attendance against the percentage of non-attendance at each age group, and (b) demonstrates the percentage of non-attendance vs the percentage of appointments.
Performance of different predictive methods (*numbers in bold represent top five F1-scores over 0.21).
| Method | Measures | ||
|---|---|---|---|
| Precision | Recall | F1-Score | |
| Logistic Regression (Hilbe, | 0.197 | 0.286 | 0.233 |
| SVM + Linear | 0.115 | 0.487 | 0.186 |
| SVM + Polynomial = 3 | 0.122 | 0.557 | 0.200 |
| SVM-RBF Kernel | 0.094 | 0.617 | 0.163 |
| KNN (best K = 50) | 0.059 | 0.926 | 0.111 |
| KNN (best K = 3) | 0.062 | 0.941 | 0.117 |
| Naïve Bayes (Kernel) | 0.146 | 0.424 | 0.217 |
| Naïve Bayes (Normal) | 0.200 | 0.143 | 0.167 |
| Bayesian Network Classifier | 0.175 | 0.272 | 0.213 |
| Decision Tree (Optimised, pruned, min leaf = 2) | 0.101 | 0.451 | 0.165 |
| Random Forest (optimised,2000 trees, 50 cycles, minleaf = 10) | 0.176 | 0.415 | 0.247 |
| Rotation Forest (K = 10) | 0.117 | 0.514 | 0.191 |
| Rotation Forest (K = 50) | 0.081 | 0.751 | 0.146 |
Performance of SDAE in combination of candidate classification algorithm Numbers in parentheses represent either the number of extracted features by SDAE, or the number of features in the last layer of SDAE.
| Method | Measures | | ||
|---|---|---|---|---|
| Precision | Recall | F1-Score | AUC | |
| SDAE (16) + Random Forest | 0.095 | 0.743 | 0.168 | 0.568 |
| SDAE (32) + Random Forest | 0.125 | 0.661 | 0.210 | 0.603 |
| SDAE (16) + Logistic Regression | 0.223 | 0.405 | ||
| SDAE (32) + Logistic Regression | 0.162 | 0.482 | 0.242 | 0.641 |
| SDAE (16) + SVM (polynomial 3) | 0.087 | 0.821 | 0.157 | 0.559 |
| SDAE (32) + SVM (polynomial 3) | 0.143 | 0.492 | 0.222 | 0.593 |
| SDAE-Softmax (16) | 0.188 | 0.601 | ||
| SDAE-Softmax (32) | 0.160 | 0.655 | 0.257 | 0.667 |
Figure 5.Variable group importance in attendance prediction.
Figure 6.Non-attendance prediction model integrated with the hospital appointment system.
Figure 7.Non-attendance application in hospital reporting system (with identifiable information removed).
Figure 8.Contact actions (DNA refers to “Do not attend”).
Distribution of non-attendance across care specialities and other variables.
| Attribute name | Category/value | Non-attendance | % Non-Attendance | # Appointments | % Appointments |
|---|---|---|---|---|---|
|
| African | 4350 | 1.46% | 37,805 | 1.01% |
| Any other Asian background | 8509 | 2.85% | 91,227 | 2.43% | |
| Any other Black background | 2208 | 0.74% | 18,991 | 0.51% | |
| Any other ethnic group | 5369 | 1.80% | 54,456 | 1.45% | |
| Any other mixed background | 1797 | 0.60% | 16,816 | 0.45% | |
| Any other white background | 17,815 | 5.96% | 201,197 | 5.37% | |
| Bangladeshi | 670 | 0.22% | 6887 | 0.18% | |
| British | 186,419 | 62.39% | 2,556,810 | 68.2% | |
| Caribbean | 3168 | 1.06% | 30,095 | 0.80% | |
| Chinese | 963 | 0.32% | 13,398 | 0.36% | |
| Indian | 6060 | 2.03% | 68,277 | 1.82% | |
| Irish | 1228 | 0.41% | 15,282 | 0.41% | |
| Not known | 3531 | 1.18% | 9201 | 0.25% | |
| Not stated | 45,502 | 15.23% | 540,797 | 14.4% | |
| Pakistani | 7168 | 2.40% | 62,762 | 1.67% | |
| White and Asian | 626 | 0.21% | 6502 | 0.17% | |
| White and Black African | 392 | 0.13% | 3503 | 0.09% | |
| White and Black Caribbean | 1162 | 0.39% | 9165 | 0.24% | |
| Others | 1875 | 0.63% | 4114 | 0.11% | |
|
| CG2 – Planned | 157,311 | 52.65% | 1,972,086 | 52.6% |
| CG3 – Networked | 79,817 | 26.71% | 1,056,996 | 28.21% | |
| CG1 – Urgent | 56,686 | 18.97% | 702,491 | 18.75% | |
| NULL | 4998 | 1.67% | 15,712 | 0.42% | |
|
| Accident and Emergency | 477 | 0.002% | 4732 | 0.001% |
| Allied Health Professional Episode | 18,811 | 0.063% | 268,796 | 0.071% | |
| Anaesthetics | 6804 | 0.023% | 87,880 | 0.023% | |
| Audiological Medicine | 11,759 | 0.039% | 213,210 | 0.056% | |
| Cardiology | 8968 | 0.030% | 167,115 | 0.044% | |
| Chemical Pathology | 283 | 0.001% | 1898 | 0.0005% | |
| Clinical Haematology | 7740 | 0.026% | 67,392 | 0.017% | |
| Clinical Oncology | 4726 | 0.016% | 101,404 | 0.027% | |
| Clinical Physiology | 0 | ~0 | 8 | ~0 | |
| Community Medicine | 0 | ~0 | 2 | ~0 | |
| Critical Care Medicine | 327 | 0.001% | 9909 | 0.0026% | |
| Dental Medicine Specialities | 1 | 0.000% | 10 | ~0 | |
| Dermatology | 10,573 | 0.035% | 181,812 | 0.048% | |
| Endocrinology | 8103 | 0.027% | 60,910 | 0.016% | |
| ENT | 14,710 | 0.049% | 160,874 | 0.043% | |
| Gastroenterology | 8533 | 0.029% | 89,036 | 0.024% | |
| General Medicine | 1322 | 0.004% | 12,534 | 0.003% | |
| General Surgery | 11,871 | 0.040% | 183,527 | 0.048% | |
| Genito-Urinary Medicine | 213 | 0.001% | 1835 | 0.0004% | |
| Geriatric Medicine | 1098 | 0.004% | 11,362 | 0.003% | |
| Gynaecology | 6547 | 0.022% | 116,086 | 0.031% | |
| Haematology | 60 | ~0 | 635 | 0.0002% | |
| Medical Oncology | 69 | ~0 | 1521 | 0.0004% | |
| Midwife Episode | 9681 | 0.032% | 61,306 | 0.0164% | |
| Nephrology | 4278 | 0.014% | 56,974 | 0.015% | |
| Neurology | 6971 | 0.023% | 72,550 | 0.019% | |
| Neurosurgery | 0 | ~0 | 2 | ~0 | |
| Nursing Episode | 38 | 0.032% | 65 | ~0 | |
| Obstetrics | 9470 | ~0 | 128,421 | 0.034% | |
| Obstetrics and Gynaecology | 17 | 0.215% | 197 | ~0 | |
| Ophthalmology | 64,260 | 0.008% | 730,747 | 0.195% | |
| Oral and Maxilla Facial Surgery | 2457 | 0.013% | 23,099 | 0.006% | |
| Oral Surgery | 3960 | 0.006% | 55,880 | 0.015% | |
| Orthodontics | 1783 | 0.001% | 20,289 | 0.005% | |
| Paediatric Cardiology | 255 | ~0 | 3627 | 0.001% | |
| Paediatric Surgery | 61 | 0.043% | 471 | 0.0001% | |
| Paediatrics | 12,728 | ~0 | 115,615 | 0.031% | |
| Palliative Medicine | 3 | 0.005% | 7 | ~0 | |
| Plastic Surgery | 1538 | ~0 | 27,379 | 0.007% | |
| Psychotherapy | 11 | ~0 | 352 | ~0 | |
| Radiology | 107 | 0.004% | 1232 | 0.0003% | |
| Rehabilitation | 1138 | 0.027% | 11,931 | 0.003% | |
| Respiratory Medicine | 8109 | 0.025% | 94,251 | 0.025% | |
| Rheumatology | 7403 | 0.075% | 94,660 | 0.025% | |
| Trauma and Orthopaedics | 22,281 | 0.015% | 344,663 | 0.092% | |
| Unknown | 4597 | 0.048% | 13,726 | 0.004% | |
| Urology | 14,257 | 0.001% | 145,304 | 0.039% | |
| Others | 414 | 0.002% | 2049 | 0.0005% | |
|
| Female | 165,373 | 55.40% | 2,160,125 | 57.65% |
| Male | 133,419 | 44.69% | 1,586,900 | 42.35% | |
|
| Average score of multiple deprivation in for patients with non-attendance = 14.81860624 | Average rank of multiple deprivation indexes for patients with non-attendance = 20,780.94586 | |||
a“~0” indicates the numbers close to zero, “#” denotes for “the number of”.