| Literature DB >> 24736621 |
Alys Havard1, Louisa R Jorm1, Sanja Lujic1.
Abstract
Adjustment for the differing risk profiles of patients is essential to the use of administrative hospital data for epidemiological research. Smoking is an important factor to include in such adjustments, but the accuracy of the diagnostic codes denoting smoking in hospital records is unknown. The aims of this study were to measure the validity of current smoking and ever smoked status identified from diagnoses in hospital records using a range of algorithms, relative to self-reported smoking status; and to examine whether the misclassification of smoking identified through hospital data is differential or non-differential with respect to common exposures and outcomes. Data from the baseline questionnaire of the 45 and Up Study, completed by 267,153 residents of New South Wales (NSW), Australia, aged 45 years and older, were linked to the NSW Admitted Patient Data Collection. Patients who had been admitted to hospital for an overnight stay between 1 July 2005 and the date of completion of the questionnaire (1 January 2006 to 2 March 2009) were included. Smokers were identified by applying a range of algorithms to hospital admission histories, and compared against self-reported smoking in the questionnaire ('gold standard'). Sensitivities for current smoking ranged from 59% to 84%, while specificities were 94% to 98%. Sensitivities for ever smoked ranged from 45% to 74% and specificities were 93% to 97%. For the majority of algorithms, sensitivities and/or specificities differed significantly according to principal diagnosis, number of comorbidities, socioeconomic status, residential remoteness, Indigenous status, 28 day readmission and 365 day mortality. The identification of smoking through diagnoses in hospital data results in differential misclassification. Risk adjustment based on smoking identified from these data will yield potentially misleading results. Systematic capture of information about smoking in hospital records using a mandatory item would increase the utility of administrative data for epidemiological research.Entities:
Mesh:
Year: 2014 PMID: 24736621 PMCID: PMC3988140 DOI: 10.1371/journal.pone.0095029
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Algorithms for identifying smokers from hospital data.
| Algorithm | Identified as a smoker if: |
| Most recent separation | Smoking diagnosis was present in hospital record with a separation date prior to, but as close as possible to, the survey completion date |
| Most recent episode | Smoking diagnosis was present in any of the record(s) comprising the episode with a summary separation date prior to, but as close as possible to, the survey completion date |
| 1 year lookback | Smoking diagnosis was present in the most recent episode or in at least one hospital record with a separation date in the 365 days prior to the separation date of the most recent episode |
| 5 year lookback | Smoking diagnosis was present in the most recent episode or in at least one hospital record with a separation date in the 5 years (1826 days) prior to the separation date of the most recent episode. |
ICD-10-AM diagnosis codes relating to smoking.
| ICD-10-AM code | Description |
| F17.1 | Harmful use of tobacco. Assigned if the clinician has clearly documented a relationship between a particular condition(s) and smoking – even if the patient has ceased smoking |
| F17.2 | Tobacco dependence syndrome |
| Z72.0 | Tobacco use, current. Assigned if the patient has smoked any amount of tobacco within the last month |
| Z86.43 | Personal history of tobacco use disorder. Assigned if it is documented that the patient smoked any amount of tobacco in the past, but excluding the last month |
Characteristics of participants at the time of their most recent hospital admission.
| n | % | |
|
| ||
| Male | 31,302 | 49 |
| Female | 32.053 | 51 |
|
| ||
| ≤54 | 13,711 | 49 |
| 55–64 | 17,396 | 51 |
| 65–74 | 15,773 | 25 |
| 75–84 | 13,451 | 21 |
| 85+ | 3,024 | 5 |
|
| ||
| Decile 1–2 | 6,658 | 11 |
| Decile 2–4 | 10,962 | 17 |
| Decile 5–6 | 14,812 | 23 |
| Decile 7–8 | 11,474 | 18 |
| Decile 9–10 | 16,514 | 26 |
| Missing | 2,935 | 5 |
|
| ||
| Aboriginal or Torres Strait Islander | 268 | <1 |
| Not Aboriginal or Torres Strait Islander | 62,285 | 98 |
| Missing | 802 | 1 |
|
| ||
| Major city | 26,979 | 43 |
| Inner regional | 24,341 | 38 |
| Outer regional | 11,058 | 17 |
| Remote/very remote | 673 | 1 |
| Missing | 304 | <1 |
|
| ||
| Currently a regular smoker | 3,852 | 6 |
| Ever been a regular smoker | 27,905 | 44 |
|
| ||
| Infectious & parasitic | 598 | 1 |
| Neoplasms | 5,684 | 9 |
| Blood & immune mechanism | 502 | 1 |
| Endocrine, nutritional & metabolic | 1,302 | 2 |
| Mental & behavioural | 972 | 2 |
| Nervous system | 2,051 | 3 |
| Eye & adnexa | 2,289 | 4 |
| Ear & mastoid process | 440 | 1 |
| Circulatory | 7,580 | 12 |
| Respiratory | 2,893 | 5 |
| Digestive | 9,408 | 15 |
| Skin & subcutaneous | 1,055 | 2 |
| Musculoskeletal | 6,964 | 11 |
| Genitourinary | 4,932 | 8 |
| Pregnancy & childbirth | 56 | <1 |
| Congenital malformations | 86 | <1 |
| Symptoms & findings NEC | 5,447 | 9 |
| Injury & poisoning | 4,553 | 7 |
| Factors influencing health care | 6,503 | 10 |
| Missing | 40 | <1 |
|
| ||
| 0 | 56,255 | 89 |
| 1 | 4,017 | 6 |
| 2 | 1,761 | 3 |
| 3+ | 1,322 | 2 |
|
| ||
| No | 62,547 | 99 |
| Yes | 787 | 1 |
| Died | 21 | <1 |
|
| ||
| No | 62,281 | 99 |
| Yes | 674 | 1 |
Performance of algorithms for identifying current smoking and ever smoked status from APDC records.
| Algorithm | N | +ve45 Up | +veAPDC | +veboth | Sn %(95% CI) | Sp %(95% CI) | PPV %(95% CI) | Kappa | |
|
|
| 63082 | 4223 | 3425 | 2471 | 58.5 (56.9–60.2) | 98.4 (98.3–98.5) | 72.2 (70.8–73.5) | 0.62 |
|
| 3545 | 2533 | 60.0 (58.4–61.6) | 98.3 (98.2–98.4) | 71.5 (70.1–72.8) | 0.63 | |||
|
| 4745 | 3045 | 72.1 (70.8–73.4) | 97.1 (97.0–97.2) | 64.2 (62.7–65.6) | 0.65 | |||
|
| 6925 | 3531 | 83.6 (82.7–84.5) | 94.2 (94.0–94.4) | 51.0 (49.5–52.5) | 0.60 | |||
|
|
| 63353 | 27899 | 13549 | 12468 | 44.7 (43.9–45.5) | 97.0 (96.8–97.1) | 92.0 (91.7–92.3) | 0.44 |
|
| 14085 | 12939 | 46.4 (45.6–47.2) | 96.8 (96.6–97.0) | 91.9 (91.5–92.2) | 0.46 | |||
|
| 18569 | 16802 | 60.2 (59.5–60.9) | 95.0 (94.8–95.2) | 90.5 (90.1–90.8) | 0.57 | |||
|
| 23307 | 20644 | 74.0 (73.4–74.6) | 92.5 (92.2–92.8) | 88.6 (88.2–88.9) | 0.68 |
Whether the misclassification of smoking arising from application of the algorithms is differential between levels of common exposures and outcomes.
| SES | Residential remoteness | Indigenous status | Diagnosis | Number of comorbidities | 28 day readmission | 365 day mortality | ||
|
|
| ✓ | ✓ | ✓ | ✓ | ✓ | ||
|
| ✓ | ✓ | ✓ | ✓ | ✓ | |||
|
| ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
|
| ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
|
|
| ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | |
|
| ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
|
| ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
|
| ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
✓indicates the presence of differential misclassification, defined as a difference in sensitivity and/or specificity between at least two levels on the exposure/outcome, as indicated by non-overlapping 95% confidence intervals.