Improved incidence estimates from linked vs. stand-alone electronic health records.

Elizabeth R C Millett, Jennifer K Quint, Bianca L De Stavola, Liam Smeeth, Sara L Thomas.

Abstract

OBJECTIVE: Electronic health records are widely used for public health research, and linked data sources are increasingly available. The added value of using linked records over stand-alone data has not been quantified for common conditions such as community-acquired pneumonia (CAP).
STUDY DESIGN AND SETTING: Our cohort comprised English patients aged ≥65 years from the Clinical Practice Research Datalink, eligible for record linkage to Hospital Episode Statistics. Stand-alone general practice (GP) records were used to calculate CAP incidence over time using population-averaged Poisson regression. Incidence was then recalculated for the same patients using their linked GP-hospital admission data. Results of the two analyses were compared.
RESULTS: Over 900,000 patients were included in each analysis. Population-averaged CAP incidence was 39% higher using the linked data than stand-alone data. This difference grew over time from 7% in 1997 to 83% by 2010. An increasingly larger number of pneumonia events were recorded in the hospital admission data compared to the GP data over time.
CONCLUSION: Use of primary or secondary care data in isolation may not give accurate incidence estimates for important infections in older populations. Further work is needed to establish the extent of this finding in other diseases, age groups, and populations.
Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

Keywords:  Aged; Cohort; Data linkage; Electronic health records; England/epidemiology; Pneumonia

Year: 2016; PMID: 26776084; PMCID: PMC4922622; DOI: 10.1016/j.jclinepi.2016.01.005

Source DB: PubMed; Journal: J Clin Epidemiol; ISSN: 0895-4356; Impact factor: 6.437


Use of linked primary-secondary care health data provided markedly higher incidence estimates of community-acquired pneumonia compared to stand-alone general practice (GP) records for the same group of English older adults. Comparison of the data sources revealed diverging incidence estimates over time, rising from 7% higher in 1997/98 to 83% higher in 2010/11 when using the linked data compared to the stand-alone GP data. The benefits of the use of linked electronic health records (compared to single data sources) have been demonstrated for conditions such as cardiovascular diseases; this is the first article to demonstrate the benefits for an important, common infection. Use of primary or secondary care data in isolation may not give accurate estimates of burden of disease for important infections in older populations. Further work is needed to establish if this trend is seen in other infections and diseases.

Introduction

Electronic health records are extensively used in epidemiological research, because of their wide and detailed population coverage. It is increasingly possible to link electronic data sources to enhance available data. For example, linked primary and secondary care data provide more complete information on outcomes, enriched data on covariates such as patients' medical and therapeutic histories, and accurate timing of events such as hospitalizations. The value of linked over stand-alone data has been investigated for conditions such as cardiovascular events, asthma, diabetes, and upper gastrointestinal bleeding [1], [2], [3], [4]. However, the potential benefits of linked data for examining the burden of important infectious diseases are unclear. Community-acquired pneumonia (CAP) causes considerable morbidity among older individuals and can be treated in either primary or secondary care. Large-scale studies of CAP incidence trends have commonly used either stand-alone general practice (GP) records, potentially excluding patients who present to hospital if practices record hospitalized events suboptimally, or stand-alone hospital records which exclude cases treated in the community. Two recent studies used large linked GP and hospital data sets to assess disease burden of CAP but did not assess the added value of using the linked data [5], [6]. We thus investigated the utility of linked primary/secondary care data in better determining trends in CAP disease burden in England among those aged ≥65 years by comparing incidence of CAP derived from stand-alone primary care data with that from linked primary-secondary care data. Each analysis used essentially the same cohort of patients over the same time period, using the same analytical approach.

Methods

The Clinical Practice Research Datalink (CPRD) is a nationally representative UK primary care dataset, containing a range of information including Read-coded diagnoses [1]. Hospital Episode Statistics (HES) contain inpatient records with ICD10-coded diagnoses, including admission and discharge dates. CPRD and HES records are linked at the patient level for consenting English practices. By March 2011, CPRD contained >12 million patient records, with HES-linkage available for 65% of English CPRD practices (around 5% of the English population) [7]. Practices and patients joined CPRD throughout the study period, providing dynamic cohorts of patients.
To ensure comparability of the two data sources, a near-identical group of patients was used in both analyses. Patients included in the study were eligible for record linkage, were aged ≥65 years, and contributed ≥1 day of follow-up. Follow-up started at the latest of the study start date (April 1, 1997), the patient's 65th birthday, the date the practice met CPRD quality standards, or 28 weeks after patient registration (to exclude historical illnesses retrospectively reported) [6]. Follow-up ended at the earliest of the study end date (March 31, 2011), death, the practice's last data collection date, or the date the patient left the practice.
We have previously described in detail definitions for pneumonia illness episodes in CPRD and HES, using pneumonia and other lower respiratory tract infection records [6]. In brief, records for which pneumonia was recorded in CPRD (stand-alone and linked data) or as the admitting diagnosis (primary code of the first episode) in HES (linked data only) within 28 days of each other or of a record for lower respiratory tract infection were considered to be part of the same episode. The incident date of the episode was the date of the first of these pneumonia codes.
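The follow-up and episode rules above can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the argument names, and the record layout (a list of `(date, kind)` pairs with `kind` either `"pneumonia"` or `"lrti"`) are hypothetical. Two readings are also assumptions: that a patient needs an end date strictly after the start date to contribute follow-up, and that "within 28 days of each other" means each record is at most 28 days after the previous record in the episode.

```python
from datetime import date, timedelta

STUDY_START = date(1997, 4, 1)
STUDY_END = date(2011, 3, 31)

def follow_up_window(date_turned_65, registration_date, practice_standard_date,
                     last_collection_date, death_date=None, left_practice_date=None):
    """Return (start, end) of follow-up, or None if no follow-up is contributed."""
    # Start: the latest of the four candidate dates named in the Methods.
    start = max(STUDY_START, date_turned_65, practice_standard_date,
                registration_date + timedelta(weeks=28))
    # End: the earliest of the candidate dates that apply to this patient.
    candidates = (STUDY_END, death_date, last_collection_date, left_practice_date)
    end = min(d for d in candidates if d is not None)
    # Assumption: ">= 1 day of follow-up" is read as end strictly after start.
    return (start, end) if end > start else None

def pneumonia_episodes(records, gap_days=28):
    """Group (date, kind) records into episodes and return their incident dates."""
    groups, current = [], []
    for rec_date, kind in sorted(records):
        # Assumption: a gap of more than 28 days since the previous record
        # closes the current episode and starts a new one.
        if current and (rec_date - current[-1][0]).days > gap_days:
            groups.append(current)
            current = []
        current.append((rec_date, kind))
    if current:
        groups.append(current)
    episodes = []
    for group in groups:
        pneumonia_dates = [d for d, k in group if k == "pneumonia"]
        if pneumonia_dates:  # groups containing only LRTI records are not pneumonia episodes
            episodes.append({"incident_date": pneumonia_dates[0],
                             "last_record": group[-1][0]})
    return episodes
```

The `last_record` date is kept because the 28-day "not at risk" period described in the Methods runs from the last record in the episode.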
In both analyses, pneumonia illness episodes which started ≤14 days after a hospitalization were assumed to be hospital-acquired (HAP) and were excluded; episodes with no such hospitalization record were classed as community acquired. The method for defining hospitalizations, and thus distinguishing between CAP and HAP, differed between the two analyses. In the stand-alone CPRD data, hospitalization records were identified using Read codes and other relevant fields in the GP files. In the linked cohort, the 14-day period started at the discharge date of any hospital admission. Patients were not considered “at-risk” of pneumonia during any pneumonia episode (CAP or HAP) or for 28 days after the last record in the episode, and this time was excluded from the denominator in both cohorts. A key difference in the linked data analysis was the capacity to also exclude the duration of any hospital admission and the subsequent 14 days from person-time at risk of a community-acquired infection and thus obtain more accurate denominator data. This was not possible in the stand-alone data because hospital admission and discharge dates were not available.
Population-averaged Poisson models were used to calculate the incidence of CAP across clusters of CAP episodes per patient. Rates were calculated stratified by year, age group, and sex. The financial year structure (April 1–March 31) was used to assign respiratory pathogens circulating over winter months to the same year. In the linked data, whether patients had consulted with a GP (either face to face or by telephone) on the CAP incident date was examined using the “constype” field in the consultation file.
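Two of the date rules in this section, the 14-day window separating HAP from CAP in the linked cohort and the April-to-March financial year, can be sketched as follows. The function names and inputs are hypothetical and this is an illustration rather than the study's implementation. In particular, the sketch assumes that only discharge dates of admissions before the episode are passed in, and that an episode starting on the discharge date itself counts as hospital-acquired.

```python
from datetime import date

def is_hospital_acquired(incident_date, prior_discharge_dates, window_days=14):
    """True if the episode starts 0 to 14 days after any earlier discharge.

    Assumption: `prior_discharge_dates` excludes the admission in which the
    pneumonia itself was diagnosed.
    """
    return any(0 <= (incident_date - d).days <= window_days
               for d in prior_discharge_dates)

def financial_year(d):
    """Label the April 1 to March 31 year containing `d`, e.g. '1997/98'."""
    start_year = d.year if d.month >= 4 else d.year - 1
    return f"{start_year}/{(start_year + 1) % 100:02d}"

# A January 2011 episode falls in the same financial year as the preceding
# autumn, so one winter season is not split across two reporting years.
print(financial_year(date(2010, 11, 15)), financial_year(date(2011, 1, 20)))
```

Under this labelling the study period of April 1, 1997 to March 31, 2011 spans 14 financial years, from 1997/98 to 2010/11.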

Results

The study population included 917,852 patients in the stand-alone data from 351 practices across England. The linked analysis included 916,128 (>99.8%) of these patients who had ≥1 day of follow-up after additionally excluding person-time in hospital. In both analyses, 53% of patients were aged 65–69 years at start of follow-up and 56% were female. Using only GP records, we identified 31,575 CAP episodes during the study period. Using linked GP/hospital admission data, we identified 45,285 CAP episodes. In both analyses, >80% of patients had only one CAP episode during follow-up. Incidence estimates using linked data were higher than those using stand-alone data. Overall, incidence was 39% higher using the linked data, and the difference increased markedly over time from 7% higher (6.18 vs. 5.77/1,000 person-years) in 1997/98 to 83% higher (10.13 vs. 5.54/1,000 person-years) in 2010/11 (Fig. 1). Although rates of CAP rose with age in both data sources, the relative increase in CAP estimates using the linked compared to GP stand-alone data was comparable for each age group, so the disparity was not attributable to a specific age group (data not shown). Incidence was higher in men than women in both analyses, but the divergence between estimates was observed in both sexes.
Fig. 1. Population-averaged incidence of CAP among older adults by data source over time. Abbreviations: CAP, community-acquired pneumonia; CPRD, Clinical Practice Research Datalink; HES, Hospital Episode Statistics.
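The percentage differences quoted for the first and last study years follow directly from the reported rates. As a quick arithmetic check (the helper name `percent_higher` is introduced here for illustration; the rates are those given in the Results):

```python
def percent_higher(linked_rate, standalone_rate):
    """Percentage by which the linked estimate exceeds the stand-alone one."""
    return (linked_rate / standalone_rate - 1) * 100

# Rates per 1,000 person-years as reported in the Results: (linked, stand-alone).
reported = {
    "1997/98": (6.18, 5.77),
    "2010/11": (10.13, 5.54),
}
for year, (linked, standalone) in reported.items():
    print(year, f"{percent_higher(linked, standalone):.0f}% higher")
# prints:
# 1997/98 7% higher
# 2010/11 83% higher
```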

Because of the dynamic nature of the cohort, the number of patients contributing to each analysis increased over the study period, increasing the person-time included. However, the increase in person-time within each analysis was similar (91% increase in linked vs. 93% in stand-alone data), whereas the increase in CAP episodes was substantially larger in the linked data (147% vs. 52% in stand-alone). Between 1997 and 2010, the percentage of patients who had consulted with their GP on the day of the CAP diagnosis decreased from 82% to 43%. Over the same period, consultation with a GP for an LRTI in the 28 days before the CAP diagnosis decreased from 15% to 10%.

Discussion

Our investigation of incidence trends for a major infectious disease shows the benefits of using linked data. Use of primary care data alone yielded CAP incidence estimates that were 28% lower than estimates from linked primary/secondary care data. The divergence between estimates increased appreciably over the 14-year study period, and linked data estimates were 83% higher than those from stand-alone GP records by March 2011. In the linked data analysis, we could refine estimated person-time at risk of community-acquired infection by discounting the person-time patients were in hospital. However, it seems that the diverging estimates were attributable largely to the higher number of CAP episodes in the linked data. All pneumonias recorded in GP records are included in linked GP/hospital data, but pneumonias from hospital admissions are only included in stand-alone GP data if patients consulted their GP prehospitalization, or hospital diagnoses were retrospectively recorded by the patients' GP. Our analyses demonstrate that CAP identified in hospital is incompletely recorded by GPs, and this underrecording, coupled with the known increase in CAP hospitalizations in England over the study period, may explain the divergence we report [8]. Patients with CAP may have increasingly presented directly to Accident and Emergency Departments because of changes in GP service provision or perceived severity of illness, and the threshold for admission for these older patients may also have decreased. Both these scenarios are consistent with the larger increase in CAP episodes in the HES records and with decreasing consultations with a GP on the day of a CAP diagnosis. They also highlight that, for conditions that can be treated both in the community and in hospital, changes to health services and to patient and clinician behavior could all result in marked underestimation of disease burden if single data sources are used.
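The 28% figure at the start of this section and the 39% figure in the Results describe the same overall gap from opposite baselines. As a worked check, writing $I_L$ and $I_S$ for the overall linked and stand-alone incidence estimates (symbols introduced here for illustration, not taken from the article):

```latex
\[
\frac{I_L}{I_S} = 1.39
\quad\Longrightarrow\quad
1 - \frac{I_S}{I_L} = 1 - \frac{1}{1.39} \approx 0.28
\]
```

That is, a linked estimate 39% higher than the stand-alone one is equivalent to a stand-alone estimate about 28% lower than the linked one.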
Our analyses used large, nationally representative data sets containing ≥900,000 patients [9]. Overall validity of diagnoses in CPRD data has been shown to be high, although few studies have assessed the sensitivity of recording [10]. Over 99.8% of the same patients were included in both analyses, enabling examination of the differences in CAP estimates due to the data source and methodology used. We are unaware of other studies that have assessed the added value of using linked vs. stand-alone data within the same population for estimating the burden of any infectious disease.
The two data sources use different coding systems, and changes to coding practices over time within each source are a further consideration. For example, “tentative” pneumonia codes such as “Influenza or pneumonia” (available in the Read but not ICD10 coding system) were not included in this study. Patients assigned a tentative pneumonia code by their GP and subsequently hospitalized with CAP would have been included in the linked data but not in the stand-alone data. However, to have contributed to the disparity, GPs would have needed to use these tentative diagnoses increasingly over time. Alternatively, if hospital physicians increasingly diagnosed or labeled older patients as having pneumonia, this would contribute to the divergent trends. We have no evidence that this occurred, but a clear understanding of trends in coding practices is essential for interpreting findings from both stand-alone and linked data.
In conclusion, use of primary or secondary care data in isolation may underestimate disease incidence for certain conditions, particularly those that can be treated in either care setting. Additionally, incomplete recording of events in UK stand-alone GP data limits their use in studies of the burden of pneumonia in older adults. Further work is needed to establish if this trend is seen in other diseases and age groups.

References

Review 1.  Recent advances in the utility and use of the General Practice Research Database as an example of a UK Primary Care Data resource.

Authors:  Tim Williams; Tjeerd van Staa; Shivani Puri; Susan Eaton
Journal:  Ther Adv Drug Saf       Date:  2012-04

2.  Estimating the prevalence of diagnosed diabetes in a health district of Wales: the importance of using primary and secondary care sources of ascertainment with adjustment for death and migration.

Authors:  C L Morgan; C J Currie; N C Stott; M Smithers; C C Butler; J R Peters
Journal:  Diabet Med       Date:  2000-02       Impact factor: 4.359

3.  Searching multiple clinical information systems for longer time periods found more prevalent cases of asthma.

Authors:  William M Vollmer; Elizabeth A O'Connor; Michael Heumann; E Ann Frazier; Victor Breen; Jacqueline Villnave; A Sonia Buist
Journal:  J Clin Epidemiol       Date:  2004-04       Impact factor: 6.437

4.  Defining upper gastrointestinal bleeding from linked primary and secondary care data and the effect on occurrence and 28 day mortality.

Authors:  Colin John Crooks; Timothy Richard Card; Joe West
Journal:  BMC Health Serv Res       Date:  2012-11-13       Impact factor: 2.655

5.  Data Resource Profile: Clinical Practice Research Datalink (CPRD).

Authors:  Emily Herrett; Arlene M Gallagher; Krishnan Bhaskaran; Harriet Forbes; Rohini Mathur; Tjeerd van Staa; Liam Smeeth
Journal:  Int J Epidemiol       Date:  2015-06-06       Impact factor: 7.196

Review 6.  Validation and validity of diagnoses in the General Practice Research Database: a systematic review.

Authors:  Emily Herrett; Sara L Thomas; W Marieke Schoonen; Liam Smeeth; Andrew J Hall
Journal:  Br J Clin Pharmacol       Date:  2010-01       Impact factor: 4.335

7.  Is secondary preventive care improving? Observational study of 10-year trends in emergency admissions for conditions amenable to ambulatory care.

Authors:  Martin Bardsley; Ian Blunt; Sian Davies; Jennifer Dixon
Journal:  BMJ Open       Date:  2013-01-02       Impact factor: 2.692

8.  Incidence of community-acquired lower respiratory tract infections and pneumonia among older adults in the United Kingdom: a population-based study.

Authors:  Elizabeth R C Millett; Jennifer K Quint; Liam Smeeth; Rhian M Daniel; Sara L Thomas
Journal:  PLoS One       Date:  2013-09-11       Impact factor: 3.240

9.  Completeness and diagnostic validity of recording acute myocardial infarction events in primary care, hospital care, disease registry, and national mortality records: cohort study.

Authors:  Emily Herrett; Anoop Dinesh Shah; Rachael Boggon; Spiros Denaxas; Liam Smeeth; Tjeerd van Staa; Adam Timmis; Harry Hemingway
Journal:  BMJ       Date:  2013-05-20

10.  General practitioners' contribution to the management of community-acquired pneumonia in the Netherlands: a retrospective analysis of primary care, hospital, and national mortality databases with individual data linkage.

Authors:  Bianca Snijders; Wim van der Hoek; Irina Stirbu; Marianne A B van der Sande; Arianne B van Gageldonk-Lafeber
Journal:  Prim Care Respir J       Date:  2013-12