| Literature DB >> 32709646 |
Zain Hussain1, Syed Ahmar Shah2,3, Mome Mukherjee1,3, Aziz Sheikh1,3,4.
Abstract
INTRODUCTION: Most asthma attacks and subsequent deaths are potentially preventable. We aim to develop a prognostic tool for identifying patients at high risk of asthma attacks in primary care by leveraging advances in machine learning. METHODS AND ANALYSIS: Current prognostic tools use logistic regression to develop a risk scoring model for asthma attacks. We propose to build on this by systematically applying various well-known machine learning techniques to a large longitudinal deidentified primary care database, the Optimum Patient Care Research Database, and comparatively evaluate their performance with the existing logistic regression model and against each other. Machine learning algorithms vary in their predictive abilities based on the dataset and the approach to analysis employed. We will undertake feature selection, classification (both one-class and two-class classifiers) and performance evaluation. Patients who have had actively treated clinician-diagnosed asthma, aged 8-80 years and with 3 years of continuous data, from 2016 to 2018, will be selected. Risk factors will be obtained from the first year, while the next 2 years will form the outcome period, in which the primary endpoint will be the occurrence of an asthma attack. ETHICS AND DISSEMINATION: We have obtained approval from OPCRD's Anonymous Data Ethics Protocols and Transparency (ADEPT) Committee. We will seek ethics approval from The University of Edinburgh's Research Ethics Group (UREG). We aim to present our findings at scientific conferences and in peer-reviewed journals. © Author(s) (or their employer(s)) 2020. Re-use permitted under CC BY. Published by BMJ.Entities:
Keywords: asthma; epidemiology; health informatics; public health
Mesh:
Year: 2020 PMID: 32709646 PMCID: PMC7380838 DOI: 10.1136/bmjopen-2019-036099
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1Flowchart of proposed steps in the methodology. OPCRD, Optimum Patient Care Research Database.
Candidate predictors to be assessed for inclusion in models (adapted from Blakey et al5)
| Variable | Description |
| Sex | Male or female |
| Age | In years at the start of the 3-year study period |
| BMI | Last recorded, in kg/m2; categorised as underweight (<18.5), normal (18.5–24.9), overweight (25- |
| Ethnicity | Ethnicity information if available (white, black, Asian, South Asian Caribbean etc) |
| Smoking status | Last recorded, categorised as never smoker, current smoker or ex-smoker |
| Charlson comorbidity index | Score in the baseline year, categorised as 0, 1–4, 5–9, ≥10 |
| Comorbidities* | Recorded ever or active: eczema, allergic and non-allergic rhinitis, nasal polyps, anaphylaxis diagnosis, anxiety/depression diagnosis, diabetes (type 1 or 2), GERD, cardiovascular disease, ischaemic heart disease, heart failure, psoriasis |
| Comedications | In baseline year, prescription (yes/no) for paracetamol, NSAIDs, beta-blockers, statins |
| % predicted PEF | Recorded ever, expressed as percentage of predicted normal, categorised as unknown,<60%, |
| Blood eosinophil count | Last recorded, in 109 cell/L, categorised as ≤0.4 or >0.4 |
| BTS step† | |
| Step 1 | Inhaled SABA as needed |
| Step 2 | ICS or LTRA |
| Step 3 | Add LABA to ICS or use high-dose ICS (≥400 mg/day FP equivalent) |
| Step 4 | Add LTRA/Theo to (ICS+LABA) or add LABA/LTRA/Theo to high-dose ICS |
| Step 5 | Add OCS |
| Average daily dose of SABA/ICS | Cumulative dose of SABA/ICS prescribed in baseline year, expressed in mg/day albuterol or FP equivalent and divided by 365.25 |
| Prescribed daily ICS dose | Dose of ICS prescribed at last prescription of baseline year in mg/day, FP equivalents |
| ICS medication possession ratio | ICS refill rate during the baseline year: sum of number of days per pack (number of actuations per pack/number of actuations per day)/365.25 |
| ICS device type | In baseline year: categorised as no ICS, MDI, BAI or DPI |
| Spacer use with ICS pMDI | Recorded in baseline year (yes/no) |
| Oral corticosteroid use | Any maintenance prescription for corticosteroids in baseline year (yes/no) |
| Prior asthma education | Recorded ever (yes/no) |
| Primary care consults | Number of primary care consultations, categorised as 0, 1–5, 6–12, ≥13 |
| Primary care consults for asthma | Number of primary care consultations with an asthma-related Read code |
| Antibiotics with lower respiratory consult | Number of consultations that resulted in antibiotic prescription (included to capture asthma events that may have been misclassified as LRTI) |
| Acute respiratory events | Number of events in the baseline year, defined as asthma-related hospitalisation or ED attendance or an acute course of OCS or antibiotics prescription with lower respiratory consultation |
| Acute OCS courses | Number of acute courses of OCS in baseline year, categorised as 0, 1, ≥2 |
| Acute OCS courses with lower respiratory consult | Number of OCS courses with Read code for lower respiratory consultation in baseline year, categorised as 0, 1, ≥2 |
| Antibiotics courses | Number of antibiotics prescriptions with Read code for lower respiratory consultation in baseline year, categorised as 0, 1, ≥2 |
| Hospital attendance/admission | Number of asthma-related‡ ED, inpatient and outpatient attendance/admission in baseline year (as recorded in primary care data) |
| Asthma attacks | Number of asthma-related‡ hospital ED attendance, inpatient admission or acute OCS course |
| Eosinophil count | Blood eosinophil count (cells/L) categorised into high and not high (threshold of 0.35×109 cells/L) to define high/not high eosinophil count |
*Comorbidity recorded ‘ever’ was defined as a diagnostic Read code during the baseline year or at any time before baseline. ‘Active’ refers to those for which a diagnosis was recorded within the baseline year and/or a previous diagnosis was accompanied by a prescription for the comorbidity within the baseline year. ‘Rhinitis’ included allergic and non-allergic rhinitis.
†Based on the British guideline on the management of asthma (October 2014) for adults and children.25
‡Any patient with a lower respiratory Read code (asthma or LRTI code).
BAI, Breath-actuated inhaler; BMI, body mass index; BTS, British Thoracic Society; DPI, dry powder inhaler; ED, emergency department; FP, fluticasone propionate; GERD, gastro-oesophageal reflux disease; ICS, inhaled corticosteroids; LABA, long-acting beta antagonists; LRTI, lower respiratory tract infection; LTRA, leukotriene receptor antagonist; MDI, metered-dose inhaler; NSAIDs, non-steroidal anti-inflammatory drugs; OCS, oral corticosteroids; PEF, peak expiratory flow; SABA, short-acting b2 agonist; Theo, theophylline.
Figure 2Overview of the various classification algorithms that will be used to predict asthma attack. The methods are broadly divided into one-class classifier and two-class classifier. Our baseline reference method is logistic regression (shown in green).