Literature DB >> 31933524

Sensitivity, specificity, positive and negative predictive values of identifying atrial fibrillation using administrative data: a systematic review and meta-analysis.

Ren Jie Robert Yao1, Jason G Andrade1, Marc W Deyell1, Heather Jackson2, Finlay A McAlister3, Nathaniel M Hawkins1.   

Abstract

INTRODUCTION: Atrial fibrillation (AF) is the commonest arrhythmia and a major cause of stroke and health care utilization. Researchers and administrators use electronic health data to assess disease burden, quality and variance in care, value of interventions and prognosis. We performed a systematic review and meta-analysis to assess the validity of AF case definitions in administrative databases.
METHODS: Medline was searched from 2000 to 2018. Extracted information included sensitivity, specificity, positive and negative predictive values (PPV and NPV) for various AF case definitions. Estimates were pooled using random-effects models due to significant heterogeneity between studies.
RESULTS: We identified 24 studies, including 21 from North America or Scandinavia. Hospital, ambulatory and mixed data sources were assessed in 10, 4 and 10 studies, respectively. Nine different AF case definitions were evaluated, most based on ICD-9 or 10 codes. Twenty-two studies assessed case definitions in patients diagnosed with AF and thus could generate PPV alone. Half the studies sampled unrestricted populations including a mix of those with and without AF to assess sensitivity. Only 13 studies included ECG confirmation as a gold standard. The pooled random effects estimates were: sensitivity 80% (95% CI 72-86%); specificity 98% (96-99%); PPV 88% (82-94%); NPV 97% (94-99%). Only 3 studies reported all accuracy parameters and included rhythm monitoring in the gold standard definition.
CONCLUSION: Relatively few studies examined sensitivity, and fewer still included rhythm monitoring in the gold standard comparison. Administrative data may fail to identify a significant proportion of patients with AF. This, in turn, may bias estimates of quality of care and prognosis.
© 2019 Yao et al.

Entities:  

Keywords:  accuracy; atrial fibrillation; registries; sensitivity; specificity; validation studies

Year:  2019        PMID: 31933524      PMCID: PMC6712502          DOI: 10.2147/CLEP.S206267

Source DB:  PubMed          Journal:  Clin Epidemiol        ISSN: 1179-1349            Impact factor:   4.790


Introduction

Atrial fibrillation (AF) increases risk of stroke, heart failure and death, and is one of the few cardiac conditions whose prevalence continues to rise.1,2 Most developed health systems collect reasons for hospital and ambulatory encounters for administration, service planning, quality improvement and reimbursement. Health services researchers use these administrative electronic databases to monitor the burden of disease, quality of care, and ascertain exposures or outcomes. The accuracy of AF identification is central to these applications. Sensitivity and specificity, though theoretically independent, typically trade-off and are inversely related.3 The “optimal” approach to identifying AF depends on the purpose. High sensitivity more completely captures a population, improves generalizability and is important when defining AF as an exposure. By contrast, high specificity ensures persons identified truly have AF and is central to adjudicating treatment uptake, which appears inappropriately low if patients with sinus rhythm are misclassified as having AF.4 Conceptually, the AF patient journey involves ambulatory and acute contacts dissociated in time and space, between which information flows by varying amounts and rates. Interrogating data sources over short time intervals or single environments may miss infrequent encounters. A previous systematic review examined the accuracy of AF detection, but was limited to ICD-9 codes only, non-contemporary electronic sources, North American cohorts and narrative synthesis without consideration of the impact of different health care settings (indeed the focus was largely on hospitalization data).5 We, therefore, undertook a systematic review to address these evidence gaps.

Methods

Participants, outcomes and study designs

Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines were followed (Table S1). We examined the accuracy of AF case definitions in electronic administrative health data, namely sensitivity (SN), specificity (SP), positive and negative predictive values (PPV and NPV). Inpatient, outpatient and mixed populations were included. All study designs were accepted. The study protocol was not published.
Table S1

PRISMA checklist

Section/topic#Checklist itemReport page #
Title
Title1Identify the report as a systematic review, meta-analysis or both.1
Abstract
Structured summary2Provide a structured summary including, as applicable: background; objectives; data sources; study eligibility criteria, participants and interventions; study appraisal and synthesis methods; results; limitations; conclusions and implications of key findings; systematic review registration number.2
Introduction
Rationale3Describe the rationale for the review in the context of what is already known.3
Objectives4Provide an explicit statement of questions being addressed with reference to participants, interventions, comparisons, outcomes and study design (PICOS).4
Methods
Protocol5Indicate if a review protocol exists, if and where it can be accessed (eg, Web address), and, if available, provide registration information including registration number.5
Eligibility6Specify study characteristics (eg, PICOS, length of follow-up) and report characteristics (eg, years considered, language, publication status) used as criteria for eligibility, giving rationale.5
Sources7Describe all information sources (eg, databases with dates of coverage, contact with study authors to identify additional studies) in the search and date last searched.5
Search8Present full electronic search strategy for at least one database, including any limits used, such that it could be repeated.5
Selection9State the process for selecting studies (ie, screening, eligibility, included in systematic review, and, if applicable, included in the meta-analysis).5
Collection10Describe method of data extraction from reports (eg, piloted forms, independently, in duplicate) and any processes for obtaining and confirming data from investigators.5
Data items11List and define all variables for which data were sought (eg, PICOS, funding sources) and any assumptions and simplifications made.5
Bias in studies12Describe methods used for assessing risk of bias of individual studies (including specification of whether this was done at the study or outcome level), and how this information is to be used in any data synthesis.N/A
Summary measures13State the principal summary measures (eg, risk ratio, difference in means).5
Synthesis14Describe the methods of handling data and combining results of studies, if done, including measures of consistency (eg, I2) for each meta-analysis.6
Bias across studies15Specify any assessment of risk of bias that may affect the cumulative evidence (eg, publication bias, selective reporting within studies).6
Additional analyses16Describe methods of additional analyses (eg, sensitivity or subgroup analyses, meta-regression), if done, indicating which were pre-specified.N/A
Results
Selection17Give numbers of studies screened, assessed for eligibility and included in the review, with reasons for exlusions at each stage, ideally with a flow diagram.Figure S1
Characteristics18For each study, present characteristics for which data were extracted (eg, study size, PICOS, follow-up period) and provide the citations.Table 1
Bias within studies19Present data on risk of bias of each study and, if available, any outcome level assessment (see item 12).N/A
Results20For all outcomes considered (benefits or harms), present, for each study: (a) simple summary data for each intervention group (b) effect estimates and confidence intervals, ideally with a forest plot.Figure 1
Synthesis21Present results of each meta-analysis done, including confidence intervals and measures of consistency.7,8
Bias across studies22Present results of any assessment of risk of bias across studies (see Item 15).N/A
Additional23Give results of additional analyses, if done (eg, sensitivity or subgroup analyses, meta-regression).N/A
Discussion
Summary24Summarize the main findings including the strength of evidence for each main outcome; consider their relevance to key groups (eg, health care providers, users and policy makers).8–10
Limitations25Discuss limitations at study and outcome level (eg, risk of bias), and at review level (eg, incomplete retrieval of identified research, reporting bias).11
Conclusion26Provide a general interpretation of the results in the context of other evidence, implications future research.11
Funding
Funding27Describe sources of funding for the systematic review and other support (eg, supply of data); role of funders for the systematic review.13

Search strategy and data collection

MEDLINE was searched from January 2000 to February 2018, limited to adult humans and English language, excluding case studies, reviews and conference abstracts. Search terms were determined by literature review and database query. The search strategy combined Medical Subject Headings (MeSH) terms and keywords in title and abstract to define three groups: atrial fibrillation (including atrial flutter (AFL) if not differentiated); administrative and electronic medical databases; and studies examining accuracy of AF identification within these records (Table S2). The search returned 1007 unique records. Manual bibliography searches identified an additional 31 publications (Figure S1). Titles and abstracts were screened for inclusion, and 302 full-text articles reviewed. Studies fulfilling the participant, outcomes and study design criteria were included. Variables of interest were decided a priori, expanded iteratively after pilot and collected in Microsoft Excel. The following information was extracted: bibliographic details, sample size, population characteristics, inclusion and exclusion criteria, codes and algorithms, AF confirmation gold standard and accuracy parameter outcomes.
Table S2

Search strategy

1) *atrial fibrillation
2) Atrial fibrillation.ti,ab.
3) or/1–2
4) *registries/ or *records as topic/ or *databases, factual/ or *database management systems/ or *epidemiologic studies/
5) (administrative or registr* or database* or claims or health maintenance organization or population-based).ti,ab.
6) or/4–5
7) *validation studies/ or *data accuracy/ or *predictive value of tests/
8) (sensitivity or specificity or predictive value or accuracy or abstract* or identif*).ti,ab.
9) or/7–8
10) 3 and 6 and 9
11) Limit 10 to humans
12) Limit 11 to english language
13) Limit 12 to yr =“2000-Current”
14) 13 not exp newborn/ not exp infant/ not exp child/ not exp adolescent/
15) 14 not (comment or editorial or note or letter or interview or lectures or personal narratives or biography or autobiography or addresses or patient education handout or interactive tutorial or news or newspaper article or historical article or webcasts or video-audio media or portraits or twin study or retraction of publication or retracted publication or published erratum or duplicate publication or case reports or legal cases or guideline or conference abstract or English abstract or clinical conference or congresses or meta-analysis or randomized controlled trial or clinical trial or clinical trial, phase I or clinical trial, phase II or clinical trial, phase III or clinical trial, phase IV or controlled clinical trial).pt.
16) 15 not catheter ablation/ not transcatheter aortic valve replacement/ not septal occluder device/ not antibodies, monoclonal/
17) 16 not (ablation or pulmonary vein or Amplat* or watchman or pacemaker or defibrillator or single-chamber or dual-chamber or resynchronization or gene or genes or genet* or ibrutinib).ti,ab.

Data synthesis

Weighted averages of sensitivity, specificity, positive and negative predictive values were calculated using the DerSimonian-Laird random effects model.6 Forest plots of estimates with 95% confidence intervals (CI) were generated. Publication bias was assessed through visual inspection of funnel plots and the Begg-Mazumdar rank correlation test for asymmetry.7,8 Heterogeneity was tested with visual forest plot inspection, Cochrane Q, I and Tau statistics.9 Estimates with significant heterogeneity (I>90%) were examined manually and formally for moderating effects including country, publication year and reference standard, none of which were significant. The leave-one-out method was used to determine if the results were sensitive to the inclusion of extreme values from specific studies.10

Results

Study characteristics

Twenty-four studies were identified (Table 1). Most originated from countries with established administrative databases that are often interrogated by health services researchers, including 10 from the United States, 3 from Canada and 8 from Sweden or Denmark. The populations were heterogeneous, including general unselected, stroke and post-operative cohorts. Hospital, ambulatory and mixed data sources were assessed in 10, 4 and 10 studies, respectively. Only 3 studies outside Scandinavia examined mixed populations.4,11,12 One Canadian study included administrative data from emergency departments separate from hospitalizations.11
Table 1

Study characteristics

ValidatedPrevalent/incidentTime period defining incidentYearCountryPopulation sampledAF (%)Data sourceExclusion
Mixed data
Frost 0728Prevalent80–02DenmarkStroke + AFNational patient registryStroke, valvular
Norberg 1329Prevalent04–10SwedenGeneral3.0National patient register
Baturova 1415Prevalent01–11SwedenStroke28.2National patient register
Tu 1611Prevalent11CanadaGeneral2.6Administrative databases
Navar-Boggan 154Prevalent11–12USIn/outpatientDuke University Health SystemAFL alone
Sung 1612Prevalent08–10TaiwanStroke10.1Taiwan National Health Insurance
Sundboll 1630Incident10–12DenmarkGeneralNational patient registry
Rix 1231IncidentStart records93–09DenmarkGeneral6.1National patient registry
Frost 0532IncidentStart records93–99DenmarkGeneral1.2National patient registryPrior cancer
Smith 1033IncidentNot specified91–05SwedenGeneral1.3National hospital/death registry
Ambulatory data
Brophy 0434Prevalent97–01USGeneralVeteran Affairs clinicsValvular disease
Borzecki 0413Prevalent98–99USHypertension6Veteran Affairs clinics 10 sites
Go 01/0016,18Prevalent96–97USAF1.0HMO Kaiser PermanentePrior AF ECG
Ruigomez 05/0214,35IncidentStart records96UKAF0.3General Practice Research Database
Hospital data
Quon 0436Prevalent96–97CanadaGeneral9.4Discharge Abstract Database
Kokotailo 0520Prevalent00–01CanadaStrokeDischarge Abstract Database
Thigpen 1519Prevalent06–10USStroke + AFAdministrative databasesProsthetic valve
Shen 0837Prevalent95–00USGeneralHMO Kaiser PermanenteValvular disease
Shireman 0438Prevalent98–99USAF warfarinMedicare discharges, 750/state
Chotchaisuwatana22Prevalent07–08ThailandAFThailand community hospitals
Munkholm 1539IncidentUnspecified11–13DenmarkCardiac surgery36.4Western Denmark Heart Registry
Walkey 1140IncidentUnspecified07–08USSepsis5.9California Inpatient Database
Alonso 0917IncidentStart study87–04USGeneral7.0ARIC study
Hravnak 0141IncidentUnspecified96–98USCABG31.9Medical Archive Pittsburgh

Abbreviations: AF, atrial fibrillation; AFL, atrial flutter; ARIC, Atherosclerosis Risk in Communities study; CABG, coronary artery bypass grafting; ECG, electrocardiogram; HMO, health maintenance organization; UK, United Kingdom; US, United States.

Study characteristics Abbreviations: AF, atrial fibrillation; AFL, atrial flutter; ARIC, Atherosclerosis Risk in Communities study; CABG, coronary artery bypass grafting; ECG, electrocardiogram; HMO, health maintenance organization; UK, United Kingdom; US, United States.

Coding and case definition algorithms

Most reports investigated International Classification of Diseases codes: ICD8 (427.93 AF, 427.94 AFL), ICD9 (4 digit code 427.3 and more explicit 427.31 AF and 427.32 AFL), ICD10 (I48). Overall, 9 different combinations of codes were studied (Table 2). The impact of coding position (primary versus secondary diagnosis in hospitalization data) was never examined. Four studies compared the accuracy of two versus one encounter coded as AF in ambulatory data sources within a single year.4,11–13 This consistently increased specificity but decreased sensitivity. A single study compared 2 versus 1 year for case ascertainment in Veterans Affairs outpatient records, finding greater sensitivity with only slightly reduced specificity.13 Overall in that study, 2 diagnoses over 2 years were optimal for detecting AF.13 Only one study from Canada examined more complex algorithms including cardioversion codes and pharmacy dispensations for antiarrhythmic drugs.11
Table 2

Characteristic and validation of atrial fibrillation in studies

ValidatedICD code(s) *Number of encounters required to diagnose(and over what time period)AF confirmation gold standard
Mixed data
Frost 0728427.93–94, I481Chart, ECG, telemetry, event recorder
Norberg 1329I481Chart, ECG, Holter, CIED
Baturova 1415427D, I481Chart, ECG
Tu 1611427.3x, I48427.x physician bill1 Hosp/ED or 1MD + cardioversion or 1 rhythm control or 1 MD + OAC (1 year)Primary care chart (physician notes and/or ECG)
Navar-Boggan 154427.311 inpatient or 2 outpatient/ED visits (1 year)Chart, ECG
Sung 1612427.311 inpatient or 2 outpatient visits (2 years)Database registry
Sundboll 1630427.93–94, I481Chart
Rix 1231427.93–94, I481Chart, ECG, electrophysiologist review
Frost 0532427.93–94, I481Chart, ECG, telemetry or Holter
Smith 1033427.92, 427D, I481Chart, ECG, electrophysiologist review
Ambulatory data
Brophy 0434427.3, 427.311Chart, ECG
Borzecki 0413427.3≥1 or ≥2 (1 or 2 years)Chart
Go 01/0016,18427.311ECG
Ruigomez 05/0214,35OXMIS codes1Questionnaire to general practitioner
Hospital data
Quon 0436427.3x1 (inpatient)Chart
Kokotailo 0520Not stated ICD91 (inpatient)Chart, ECG, Holter
Thigpen 1519427.311 (inpatient)ECG
Shen 0837427.3x 1st 3 codes1 (inpatient)Chart, ECG
Shireman 0438427.311 (inpatient)Chart
Chotchaisuwatana22I481 (inpatient)Chart
Munkholm 1539Not stated1 (inpatient)Chart
Walkey 1140427.3x1 (inpatient)Chart
Alonso 0917427.311 (inpatient)Chart, ECG
Hravnak 0141427.311 (inpatient)Text search, prescriptions

Abbreviations: CIED, cardiac implantable electronic device; ECG, electrocardiogram; ED, emergency department; ICD, International Classification of Diseases; MD, doctor of medicine; OAC, oral anticoagulation.

Characteristic and validation of atrial fibrillation in studies Abbreviations: CIED, cardiac implantable electronic device; ECG, electrocardiogram; ED, emergency department; ICD, International Classification of Diseases; MD, doctor of medicine; OAC, oral anticoagulation.

Characteristics of AF

Prevalent and incident AF were assessed in two-thirds and one-thirds of studies, respectively (Table 1). Incident cases were typically defined by exclusion of prior AF diagnoses since the records began, or methods were not specified. The incidence and prevalence of AF varied markedly depending on the population studied, from 0.3% to 55%.14,15 The incidence and prevalence was highest in studies following cardiac surgery (32–36%) and stroke (10–28%), lower in general hospitalizations (7–9%) and lowest in unselected outpatients (0.3–1%). No study distinguished between persistent and paroxysmal AF. Three studies reported from 7.0% to 20.4% of the coded AF to be transient.4,16,17 Two defined transient as a single episode without recurrence,4,16 while two added precipitants including cardiac surgery and/or hyperthyroidism.16,17

Gold standard for diagnosis of AF

With the exception of two studies, medical chart review was considered the gold standard by which history of AF was classified (Table 2). Of these, 13 studies specifically included ECG review, of which 2 employed ECG alone for confirmation of AF.18,19 No prospective protocols or frequencies for ECG were reported. A median of 11 ECGs per patient with AF was noted in a Swedish outpatient setting.15 Only 4 studies mentioned use of longer term rhythm monitoring such as Holter, although these results may also have been available in medical record review.

Sensitivity and specificity

Half the studies (n=12) sampled an unrestricted population including those without AF to assess sensitivity of case definitions and these ranged from 57% to 93%, median 81% (Table 3). The pooled random effects estimate for sensitivity was 80% (95% CI 72–86%) with significant heterogeneity (Q 439, I 97.7%, Tau2 0.08). One-third of the studies (n=8) reported specificity. Estimates were consistently high in ambulatory, hospitalized and mixed populations, ranging from 91% to 99%, median 99% (Table 3). The pooled random effects estimate for specificity was 98% (95% CI 96–99%).
Table 3

Codes and accuracy for identifying atrial fibrillation

ValidatedCoding positionn SampleAFTrue positiveFalse positivePPVNot AFTrue negativeFalse negativeNPVSNSP
Mixed data
Frost 07281°/2°174174172299
Norberg 1329227421962119779615593
Baturova 14151°/2°666188152368148244632928393
Tu 1611750021815563717308724537998199
Navar-Boggan 1543003002871396
Sung 161264696664741927158185626177977397
Sundboll 16301°/2°979792595
Rix 12312842842622293 All65 ED94 IP/OP
Frost 0532116116112497
Smith 103310010097397
Ambulatory data
Brophy 04343366261974778
Borzecki 0413117669591086≥1/1110310931499≥1/180≥1/167≥2/186≥1/274≥2/299≥1/199≥2/197≥1/299≥2/2
Go 01/0016,185050391178
Ruigomez 05/0214,3518881888176312593
Hospital data
Quon 0436 Prev1200102966941098108117988599
Kokotailo 052013718171941161154978199
Thigpen 1519Any17061706148921787
Shen 0837Any10010096496
Shireman 04381°/2°38,92438,92427,67411,25071
Chotchaisuwatana221931931692488
Munkholm 153913814583788083878798125867591
Quon 0436 Incident120076186119311867994699
Walkey 11401631631471690
Alonso 0917154616913534801377135126988498
Hravnak 014126014811257

Abbreviations: AF, atrial fibrillation; NPV, negative predictive value; PPV, positive predictive value; SN, sensitivity; SP, specificity; ED, Emergency department population; IP, inpatient population; OP, outpatient population.

Codes and accuracy for identifying atrial fibrillation Abbreviations: AF, atrial fibrillation; NPV, negative predictive value; PPV, positive predictive value; SN, sensitivity; SP, specificity; ED, Emergency department population; IP, inpatient population; OP, outpatient population.

Positive and negative predictive value

Positive predictive value was reported in nearly all studies (n=22), and was the only parameter reported in half the studies (n=12). The PPV ranged from 71% to 99%, median 93% (Table 3). The pooled random effects estimate was 88% (95% CI 82–94%) with significant heterogeneity (Q 4997, I2 99.6%, Tau2 0.02) (Figure 1). The pooled estimate was similar in ambulatory, hospitalized and mixed populations (respectively: 87% (78–96%); 87% (79–95%); 90% (85–94%)). Negative predictive value was reported in 8 studies. Estimates were consistently high, ranging from 86% to 99%, median 98% (Table 3). The pooled random effects estimate was 97% (95% CI 94–99%).
Figure 1

Positive predictive values of atrial fibrillation (AF) algorithms stratified by population type.

Positive predictive values of atrial fibrillation (AF) algorithms stratified by population type.

Discussion

This analysis reports several key findings. The overall specificity and NPV of an AF diagnosis using the ICD case definitions was high, 98% and 97%, respectively. The sensitivity and PPV were lower though reasonable, 80% and 88%, respectively. Only half the studies sampled patients with and without an assigned diagnosis of AF to determine the sensitivity of the case definitions and thus the proportion potentially missed by using administrative data. Half the studies confirmed AF using electrocardiography as the gold standard, while the remainder employed medical record review, alternative databases (like primary care EMRs) and/or patient questionnaires. Only 3 studies reported all accuracy parameters and included rhythm monitoring in the gold standard definition.15,17,20

Sensitivity

High sensitivity improves case finding as it more completely captures a population, increases the estimated incidence and prevalence and enhances generalizability. This is particularly relevant when estimating the burden of disease and to reduce bias when studying health inequalities. Sensitivity is also important when defining AF as an exposure. Misclassification of exposure (eg, AF) as non-exposure (eg, no AF) attenuates the association with outcomes such as stroke.21 By contrast, sensitivity is less of a concern when defining AF as an outcome, for example in pharmacovigilance studies. In these circumstances, estimates of relative risk are not biased providing misclassification occurs to the same degree in exposed and non-exposed patients. Sensitivity is reduced when cases are missed and AF is misclassified as normal (ie, false negatives). This occurs in two circumstances. First, when recording or coding is incorrect. Second, when correctly recorded and coded diagnoses are missed in time or space. Examining shorter time frames may miss infrequent encounters, as evidenced in the Veterans Affairs study where sensitivity increased using a 2 versus 1 year period for case ascertainment.13 Information also flows by varying amounts and rates through health systems. Although the median time from AF on ECG to diagnosis in the Swedish Patient Register was 16 days, this time lapse exceeded 6 months in one-third of patients.15 Sensitivity may be viewed from different perspectives: local, horizontal level of care (eg, primary care), vertical (eg, health maintenance organization) or global (entire health care system). Examining a single health care setting may miss encounters meeting the AF case definition in another. For example, hospitalization data alone misses patients managed entirely in the community, causing under-estimates of prevalence rates and over-estimates of adverse outcome rates. There were insufficient studies to accurately compare sensitivity between ambulatory, hospital and mixed populations. However, one of the mixed population studies did compare the accuracy of coding between primary care, secondary care or both together. In that study from Ontario, the sensitivity was 45%, 39% and 75% for hospitalization, emergency department or outpatient data sources alone, respectively, and 83% combining the three sources.11 The true population incidence and prevalence may also be influenced by access to rhythm monitoring (ECG, Holter, event or loop recorder), reporting standards (eg, training, quality assurance) and information transfer (eg, interface to electronic medical record). These factors are potentially more challenging in community than in hospital settings, particularly relevant to measuring inequalities, and difficult to quantify. None of the included studies described these aspects of access.

Positive predictive value

Since sensitivity and specificity are typically inversely related, higher sensitivity reduces specificity, which increases false positives and lowers PPV. The impact on PPV is magnified for diseases with a relatively low prevalence such as AF. A high PPV ensures persons identified truly have AF (fewer false positives). This is central to adjudicating treatment uptake, which will appear inappropriately withheld if patients with sinus rhythm are misclassified as having AF, unless OAC is prescribed for an alternate reason.4 A PPV value exceeding 85–90% suggested adequate for research purposes.19,20 The reasons for false positives were rarely explored.22 Potential scenarios include: 1) miscoding eg, allergic rhinitis was written as “AR” and coded as AF;22 2) rhythm misinterpretation such as atrial tachycardia; 3) misreporting if based on medical history alone; and 4) AF defined by an intervention shared with other conditions eg, cardioversion. PPV is also highly dependent on disease prevalence: as many studies focused on older or high-risk individuals they may overestimate the true PPV for that case definition if applied in a younger population. Oral anticoagulation is the only treatment to improve survival in patients with AF, and thus a key quality indicator. Although the overall PPV was high (88%), the specificity and PPV to identify AF requiring anticoagulation (as opposed to any AF) could be lower for several reasons. First, up to 10% of incident AF is isolated with a defined precipitant, low recurrence, and may not require anticoagulation.4,16,23 Only three studies reported or excluded such patients.4,16,17 Second, anticoagulation adjudication requires accurate coding of embolic and bleeding risk factors, which like AF exhibit high specificity but are under-reported.12,22 More subjective bleeding risks such as frailty and falls are particularly difficult to quantify, although a recently described frailty score based on administrative data (the Hospital Frailty Risk Score) has been described.24 Finally, patient preferences are major drivers of anticoagulation decisions but are never captured in administrative databases.

Atrial fibrillation phenotype and coding considerations

The disease spectrum (permanent, persistent, paroxysmal, isolated unprovoked or provoked episodes) was rarely reported yet also impacts accuracy of AF detection. Permanent or persistent AF is associated with greater comorbidity and hence health care encounters during which arrhythmia is continuously present. By contrast, isolated or paroxysmal AF may be under-represented by health care encounters. Treatment including rate versus rhythm control and anticoagulation also varies based on symptoms, AF duration, risk of recurrence and thromboembolism.25 The accuracy of administrative data to identify AF requiring anticoagulation is thus further lessened by limited phenotypic characterization.4 The AF phenotype may also impact the “gold standard” for diagnosing AF, whereby paroxysmal AF is missed by ECG alone but detected by chart review. In the only study examining this issue, ECG review did not improve sensitivity of AF detection over diagnosis codes alone.4 Most developed health systems collect reasons for hospital and ambulatory encounters for administration, service planning, quality improvement and reimbursement. A single primary or most responsible diagnosis is typically assigned, while conditions complicating or prolonging stay are coded in multiple secondary positions, sometimes further categorized as pre-existing or de novo disease. Differences in coding accuracy, treatment and prognosis are reported between primary and secondary positions for conditions such as heart failure.26,27 To our knowledge, such differences have not been explored in patients with AF, and no study identified by our search compared coding positions.

Strengths and limitations

Several strengths and limitations merit consideration. Our analysis is contemporary, included varied health systems, ICD-8 to ICD-10 codes, and both ambulatory and hospital populations. However, most studies originated from North America or Scandinavia, and examined ICD codes in administrative data sources. This potentially limits generalizability to other health care systems. There was significant heterogeneity in terms of population, prevalence of AF and reported accuracy parameters. Most studies assessed accuracy in restricted cohorts as opposed to the broader population.

Directions for future research

Health service researchers and administrators may interpret administrative data using either our pooled estimates or locally relevant studies from among those identified. Jurisdictions would ideally conduct nationally representative validation studies to provide estimates specific to their populations and data sources. These should examine existing codes and test new case definition algorithms in all data sources with differences in coding practices and diagnostic accuracy (eg, hospitalization, emergency department, ambulatory primary and secondary care), and in scenarios with varying disease prevalence. Though challenging and costly, random sampling of representative populations is essential to define sensitivity, enhance generalizability and reduce bias when studying inequality. To understand the true disease burden, algorithms should combine primary and secondary care data sources. More complex algorithms utilizing advanced analytics such as natural language processing and machine learning to mine free-text medical records merit investigation. Potential avenues include integrating corroboratory data such as medications and procedures, and temporal and spatial coding patterns. Future work should investigate the optimal gold standard including rhythm monitoring, electronic data sources and chart review. The reasons for false positives and negatives need to be explored in detail, as does the impact of AF phenotype and coding position. Finally, the accuracy of embolic and bleeding risk factor case definitions requires further validation in order to adjudicate appropriateness of anticoagulation management choices.

Conclusion

The overall accuracy of AF identification was reasonable for system planning and surveillance of prevalence, quality and outcomes. However, there is a marked disconnect between the volume of publications in these domains, and those examining the underpinning data. Sensitivity and PPV were the least accurate parameters with greatest uncertainty in terms of evidence and interpretation. This potentially underestimates the burden of disease and may bias estimates of outcomes and treatment quality. The optimal AF case definition should consider the purpose of the study and the data sources available. Health service administrators, researchers and clinicians should be mindful of these factors, and work together to refine our use of electronic data.
  40 in total

1.  Assessing accuracy of diagnosis-type indicators for flagging complications in administrative data.

Authors:  Hude Quan; Gerry A Parsons; William A Ghali
Journal:  J Clin Epidemiol       Date:  2004-04       Impact factor: 6.437

Review 2.  The clinical profile and pathophysiology of atrial fibrillation: relationships among clinical features, epidemiology, and mechanisms.

Authors:  Jason Andrade; Paul Khairy; Dobromir Dobrev; Stanley Nattel
Journal:  Circ Res       Date:  2014-04-25       Impact factor: 17.367

3.  Identifying Patients With Atrial Fibrillation in Administrative Data.

Authors:  Karen Tu; Robby Nieuwlaat; Stephanie Y Cheng; Laura Wing; Noah Ivers; Clare L Atzema; Jeff S Healey; Paul Dorian
Journal:  Can J Cardiol       Date:  2016-06-23       Impact factor: 5.223

4.  Meta-analysis in clinical trials.

Authors:  R DerSimonian; N Laird
Journal:  Control Clin Trials       Date:  1986-09

5.  Validation of electronic medical database in patients with atrial fibrillation in community hospitals.

Authors:  Somjit Chotchaisuwatana; Arom Jedsadayanmata; Nathorn Chaiyakunapruk; Ketechan Jampachaisri
Journal:  J Med Assoc Thai       Date:  2011-06

6.  Incidence of chronic atrial fibrillation in general practice and its treatment pattern.

Authors:  Ana Ruigómez; Saga Johansson; Mari Ann Wallander; Luis Alberto García Rodríguez
Journal:  J Clin Epidemiol       Date:  2002-04       Impact factor: 6.437

7.  Combined anticoagulant-antiplatelet use and major bleeding events in elderly atrial fibrillation patients.

Authors:  Theresa I Shireman; Patricia A Howard; Timothy F Kresowik; Edward F Ellerbeck
Journal:  Stroke       Date:  2004-08-26       Impact factor: 7.914

8.  Atrial fibrillation in the Malmö Diet and Cancer study: a study of occurrence, risk factors and diagnostic validity.

Authors:  J Gustav Smith; Pyotr G Platonov; Bo Hedblad; Gunnar Engström; Olle Melander
Journal:  Eur J Epidemiol       Date:  2010-02       Impact factor: 8.082

9.  Is the diagnostic coding position of acute heart failure related to mortality? A report from the Euro Heart Failure Survey-1.

Authors:  Ahmad Shoaib; Mohamed Farag; Mansoor Nasir; Joseph John; Sanjay Gupta; Pierpaolo Pellicori; Renjith Antony; Rashida Perveen; Alan Rigby; Kevin M Goode; Ashraf Yassin; Andrew L Clark; John G F Cleland
Journal:  Eur J Heart Fail       Date:  2016-03-16       Impact factor: 15.534

10.  Positive predictive value of cardiovascular diagnoses in the Danish National Patient Registry: a validation study.

Authors:  Jens Sundbøll; Kasper Adelborg; Troels Munch; Trine Frøslev; Henrik Toft Sørensen; Hans Erik Bøtker; Morten Schmidt
Journal:  BMJ Open       Date:  2016-11-18       Impact factor: 2.692

View more
  8 in total

1.  COVID-RDNet: A novel coronavirus pneumonia classification model using the mixed dataset by CT and X-rays images.

Authors:  Lingling Fang; Xin Wang
Journal:  Biocybern Biomed Eng       Date:  2022-08-05       Impact factor: 5.687

2.  Do routine hospital data accurately record comorbidity in advanced kidney disease populations? A record linkage cohort study.

Authors:  Rommel Ravanan; Dominic Taylor; Ailish Nimmo; Retha Steenkamp
Journal:  BMC Nephrol       Date:  2021-03-17       Impact factor: 2.388

3.  A Population-Based Study of Unexplained/Lone Atrial Fibrillation: Temporal Trends, Management, and Outcomes.

Authors:  Thomas M Roston; Sunjidatul Islam; Nathaniel M Hawkins; Zachary W Laksman; Shubhayan Sanatani; Andrew D Krahn; Roopinder Sandhu; Padma Kaul
Journal:  CJC Open       Date:  2021-09-14

4.  Geographic and temporal variation in the treatment and outcomes of atrial fibrillation: a population-based analysis of national quality indicators.

Authors:  Derek S Chew; Flora Au; Yuan Xu; Braden J Manns; Marcello Tonelli; Stephen B Wilton; Brenda Hemmelgarn; Shiying Kong; Derek V Exner; Amity E Quinn
Journal:  CMAJ Open       Date:  2022-08-02

5.  Use of oral anticoagulants among individuals with cancer and atrial fibrillation in the United States, 2010-2016.

Authors:  Shirin Ardeshirrouhanifard; Huijun An; Ravi K Goyal; Mukaila A Raji; Jodi B Segal; G Caleb Alexander; Hemalkumar B Mehta
Journal:  Pharmacotherapy       Date:  2022-04-14       Impact factor: 6.251

6.  Nationwide Study of Sex Differences in Incident Heart Failure in Newly Diagnosed Nonvalvular Atrial Fibrillation.

Authors:  Haran Yogasundaram; Sunjidatul Islam; Douglas C Dover; Nathaniel M Hawkins; Justin Ezekowitz; Padma Kaul; Roopinder K Sandhu
Journal:  CJC Open       Date:  2022-05-06

7.  Administrative Data in Cardiovascular Research-A Comparison of Polish National Health Fund and CRAFT Registry Data.

Authors:  Cezary Maciejewski; Krzysztof Ozierański; Mikołaj Basza; Piotr Lodziński; Andrzej Śliwczyński; Leszek Kraj; Maciej Janusz Krajsman; Jefte Prado Paulino; Agata Tymińska; Grzegorz Opolski; Andrzej Cacko; Marcin Grabowski; Paweł Balsam
Journal:  Int J Environ Res Public Health       Date:  2022-09-22       Impact factor: 4.614

Review 8.  Validity of Acute Cardiovascular Outcome Diagnoses Recorded in European Electronic Health Records: A Systematic Review.

Authors:  Jennifer Davidson; Amitava Banerjee; Rutendo Muzambi; Liam Smeeth; Charlotte Warren-Gash
Journal:  Clin Epidemiol       Date:  2020-10-14       Impact factor: 4.790

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.