Literature DB >> 31933524

Sensitivity, specificity, positive and negative predictive values of identifying atrial fibrillation using administrative data: a systematic review and meta-analysis.

Ren Jie Robert Yao¹, Jason G Andrade¹, Marc W Deyell¹, Heather Jackson², Finlay A McAlister³, Nathaniel M Hawkins¹.

Abstract

INTRODUCTION: Atrial fibrillation (AF) is the commonest arrhythmia and a major cause of stroke and health care utilization. Researchers and administrators use electronic health data to assess disease burden, quality and variance in care, value of interventions and prognosis. We performed a systematic review and meta-analysis to assess the validity of AF case definitions in administrative databases.
METHODS: Medline was searched from 2000 to 2018. Extracted information included sensitivity, specificity, positive and negative predictive values (PPV and NPV) for various AF case definitions. Estimates were pooled using random-effects models due to significant heterogeneity between studies.
RESULTS: We identified 24 studies, including 21 from North America or Scandinavia. Hospital, ambulatory and mixed data sources were assessed in 10, 4 and 10 studies, respectively. Nine different AF case definitions were evaluated, most based on ICD-9 or 10 codes. Twenty-two studies assessed case definitions in patients diagnosed with AF and thus could generate PPV alone. Half the studies sampled unrestricted populations including a mix of those with and without AF to assess sensitivity. Only 13 studies included ECG confirmation as a gold standard. The pooled random effects estimates were: sensitivity 80% (95% CI 72-86%); specificity 98% (96-99%); PPV 88% (82-94%); NPV 97% (94-99%). Only 3 studies reported all accuracy parameters and included rhythm monitoring in the gold standard definition.
CONCLUSION: Relatively few studies examined sensitivity, and fewer still included rhythm monitoring in the gold standard comparison. Administrative data may fail to identify a significant proportion of patients with AF. This, in turn, may bias estimates of quality of care and prognosis.

Entities: Chemical

Keywords: accuracy; atrial fibrillation; registries; sensitivity; specificity; validation studies

Year: 2019 PMID： 31933524 PMCID： PMC6712502 DOI： 10.2147/CLEP.S206267

Source DB: PubMed Journal: Clin Epidemiol ISSN： 1179-1349 Impact factor: 4.790

Introduction

Atrial fibrillation (AF) increases risk of stroke, heart failure and death, and is one of the few cardiac conditions whose prevalence continues to rise.1,2 Most developed health systems collect reasons for hospital and ambulatory encounters for administration, service planning, quality improvement and reimbursement. Health services researchers use these administrative electronic databases to monitor the burden of disease, quality of care, and ascertain exposures or outcomes. The accuracy of AF identification is central to these applications. Sensitivity and specificity, though theoretically independent, typically trade-off and are inversely related.3 The “optimal” approach to identifying AF depends on the purpose. High sensitivity more completely captures a population, improves generalizability and is important when defining AF as an exposure. By contrast, high specificity ensures persons identified truly have AF and is central to adjudicating treatment uptake, which appears inappropriately low if patients with sinus rhythm are misclassified as having AF.4 Conceptually, the AF patient journey involves ambulatory and acute contacts dissociated in time and space, between which information flows by varying amounts and rates. Interrogating data sources over short time intervals or single environments may miss infrequent encounters. A previous systematic review examined the accuracy of AF detection, but was limited to ICD-9 codes only, non-contemporary electronic sources, North American cohorts and narrative synthesis without consideration of the impact of different health care settings (indeed the focus was largely on hospitalization data).5 We, therefore, undertook a systematic review to address these evidence gaps.

Methods

Participants, outcomes and study designs

Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines were followed (Table S1). We examined the accuracy of AF case definitions in electronic administrative health data, namely sensitivity (SN), specificity (SP), positive and negative predictive values (PPV and NPV). Inpatient, outpatient and mixed populations were included. All study designs were accepted. The study protocol was not published.

Table S1

PRISMA checklist

Section/topic	#	Checklist item	Report page #
Title
Title	1	Identify the report as a systematic review, meta-analysis or both.	1
Abstract
Structured summary	2	Provide a structured summary including, as applicable: background; objectives; data sources; study eligibility criteria, participants and interventions; study appraisal and synthesis methods; results; limitations; conclusions and implications of key findings; systematic review registration number.	2
Introduction
Rationale	3	Describe the rationale for the review in the context of what is already known.	3
Objectives	4	Provide an explicit statement of questions being addressed with reference to participants, interventions, comparisons, outcomes and study design (PICOS).	4
Methods
Protocol	5	Indicate if a review protocol exists, if and where it can be accessed (eg, Web address), and, if available, provide registration information including registration number.	5
Eligibility	6	Specify study characteristics (eg, PICOS, length of follow-up) and report characteristics (eg, years considered, language, publication status) used as criteria for eligibility, giving rationale.	5
Sources	7	Describe all information sources (eg, databases with dates of coverage, contact with study authors to identify additional studies) in the search and date last searched.	5
Search	8	Present full electronic search strategy for at least one database, including any limits used, such that it could be repeated.	5
Selection	9	State the process for selecting studies (ie, screening, eligibility, included in systematic review, and, if applicable, included in the meta-analysis).	5
Collection	10	Describe method of data extraction from reports (eg, piloted forms, independently, in duplicate) and any processes for obtaining and confirming data from investigators.	5
Data items	11	List and define all variables for which data were sought (eg, PICOS, funding sources) and any assumptions and simplifications made.	5
Bias in studies	12	Describe methods used for assessing risk of bias of individual studies (including specification of whether this was done at the study or outcome level), and how this information is to be used in any data synthesis.	N/A
Summary measures	13	State the principal summary measures (eg, risk ratio, difference in means).	5
Synthesis	14	Describe the methods of handling data and combining results of studies, if done, including measures of consistency (eg, I²) for each meta-analysis.	6
Bias across studies	15	Specify any assessment of risk of bias that may affect the cumulative evidence (eg, publication bias, selective reporting within studies).	6
Additional analyses	16	Describe methods of additional analyses (eg, sensitivity or subgroup analyses, meta-regression), if done, indicating which were pre-specified.	N/A
Results
Selection	17	Give numbers of studies screened, assessed for eligibility and included in the review, with reasons for exlusions at each stage, ideally with a flow diagram.	Figure S1
Characteristics	18	For each study, present characteristics for which data were extracted (eg, study size, PICOS, follow-up period) and provide the citations.	Table 1
Bias within studies	19	Present data on risk of bias of each study and, if available, any outcome level assessment (see item 12).	N/A
Results	20	For all outcomes considered (benefits or harms), present, for each study: (a) simple summary data for each intervention group (b) effect estimates and confidence intervals, ideally with a forest plot.	Figure 1
Synthesis	21	Present results of each meta-analysis done, including confidence intervals and measures of consistency.	7,8
Bias across studies	22	Present results of any assessment of risk of bias across studies (see Item 15).	N/A
Additional	23	Give results of additional analyses, if done (eg, sensitivity or subgroup analyses, meta-regression).	N/A
Discussion
Summary	24	Summarize the main findings including the strength of evidence for each main outcome; consider their relevance to key groups (eg, health care providers, users and policy makers).	8–10
Limitations	25	Discuss limitations at study and outcome level (eg, risk of bias), and at review level (eg, incomplete retrieval of identified research, reporting bias).	11
Conclusion	26	Provide a general interpretation of the results in the context of other evidence, implications future research.	11
Funding
Funding	27	Describe sources of funding for the systematic review and other support (eg, supply of data); role of funders for the systematic review.	13

Search strategy and data collection

MEDLINE was searched from January 2000 to February 2018, limited to adult humans and English language, excluding case studies, reviews and conference abstracts. Search terms were determined by literature review and database query. The search strategy combined Medical Subject Headings (MeSH) terms and keywords in title and abstract to define three groups: atrial fibrillation (including atrial flutter (AFL) if not differentiated); administrative and electronic medical databases; and studies examining accuracy of AF identification within these records (Table S2). The search returned 1007 unique records. Manual bibliography searches identified an additional 31 publications (Figure S1). Titles and abstracts were screened for inclusion, and 302 full-text articles reviewed. Studies fulfilling the participant, outcomes and study design criteria were included. Variables of interest were decided a priori, expanded iteratively after pilot and collected in Microsoft Excel. The following information was extracted: bibliographic details, sample size, population characteristics, inclusion and exclusion criteria, codes and algorithms, AF confirmation gold standard and accuracy parameter outcomes.

Table S2

Search strategy

1) *atrial fibrillation

2) Atrial fibrillation.ti,ab.

3) or/1–2

4) *registries/ or *records as topic/ or *databases, factual/ or *database management systems/ or *epidemiologic studies/

5) (administrative or registr* or database* or claims or health maintenance organization or population-based).ti,ab.

6) or/4–5

7) *validation studies/ or *data accuracy/ or *predictive value of tests/

8) (sensitivity or specificity or predictive value or accuracy or abstract* or identif*).ti,ab.

9) or/7–8

10) 3 and 6 and 9

11) Limit 10 to humans

12) Limit 11 to english language

13) Limit 12 to yr =“2000-Current”

14) 13 not exp newborn/ not exp infant/ not exp child/ not exp adolescent/

15) 14 not (comment or editorial or note or letter or interview or lectures or personal narratives or biography or autobiography or addresses or patient education handout or interactive tutorial or news or newspaper article or historical article or webcasts or video-audio media or portraits or twin study or retraction of publication or retracted publication or published erratum or duplicate publication or case reports or legal cases or guideline or conference abstract or English abstract or clinical conference or congresses or meta-analysis or randomized controlled trial or clinical trial or clinical trial, phase I or clinical trial, phase II or clinical trial, phase III or clinical trial, phase IV or controlled clinical trial).pt.

16) 15 not catheter ablation/ not transcatheter aortic valve replacement/ not septal occluder device/ not antibodies, monoclonal/

17) 16 not (ablation or pulmonary vein or Amplat* or watchman or pacemaker or defibrillator or single-chamber or dual-chamber or resynchronization or gene or genes or genet* or ibrutinib).ti,ab.

Data synthesis

Weighted averages of sensitivity, specificity, positive and negative predictive values were calculated using the DerSimonian-Laird random effects model.6 Forest plots of estimates with 95% confidence intervals (CI) were generated. Publication bias was assessed through visual inspection of funnel plots and the Begg-Mazumdar rank correlation test for asymmetry.7,8 Heterogeneity was tested with visual forest plot inspection, Cochrane Q, I and Tau statistics.9 Estimates with significant heterogeneity (I>90%) were examined manually and formally for moderating effects including country, publication year and reference standard, none of which were significant. The leave-one-out method was used to determine if the results were sensitive to the inclusion of extreme values from specific studies.10

Results

Study characteristics

Twenty-four studies were identified (Table 1). Most originated from countries with established administrative databases that are often interrogated by health services researchers, including 10 from the United States, 3 from Canada and 8 from Sweden or Denmark. The populations were heterogeneous, including general unselected, stroke and post-operative cohorts. Hospital, ambulatory and mixed data sources were assessed in 10, 4 and 10 studies, respectively. Only 3 studies outside Scandinavia examined mixed populations.4,11,12 One Canadian study included administrative data from emergency departments separate from hospitalizations.11

Table 1

Study characteristics

Validated	Prevalent/incident	Time period defining incident	Year	Country	Population sampled	AF (%)	Data source	Exclusion
Mixed data
Frost 0728	Prevalent	–	80–02	Denmark	Stroke + AF	–	National patient registry	Stroke, valvular
Norberg 1329	Prevalent	–	04–10	Sweden	General	3.0	National patient register	–
Baturova 1415	Prevalent	–	01–11	Sweden	Stroke	28.2	National patient register	–
Tu 1611	Prevalent	–	11	Canada	General	2.6	Administrative databases	–
Navar-Boggan 154	Prevalent	–	11–12	US	In/outpatient	–	Duke University Health System	AFL alone
Sung 1612	Prevalent	–	08–10	Taiwan	Stroke	10.1	Taiwan National Health Insurance	–
Sundboll 1630	Incident	–	10–12	Denmark	General	–	National patient registry	–
Rix 1231	Incident	Start records	93–09	Denmark	General	6.1	National patient registry	–
Frost 0532	Incident	Start records	93–99	Denmark	General	1.2	National patient registry	Prior cancer
Smith 1033	Incident	Not specified	91–05	Sweden	General	1.3	National hospital/death registry	–
Ambulatory data
Brophy 0434	Prevalent	–	97–01	US	General	–	Veteran Affairs clinics	Valvular disease
Borzecki 0413	Prevalent	–	98–99	US	Hypertension	6	Veteran Affairs clinics 10 sites	–
Go 01/0016,18	Prevalent	–	96–97	US	AF	1.0	HMO Kaiser Permanente	Prior AF ECG
Ruigomez 05/0214,35	Incident	Start records	96	UK	AF	0.3	General Practice Research Database	–
Hospital data
Quon 0436	Prevalent	–	96–97	Canada	General	9.4	Discharge Abstract Database	–
Kokotailo 0520	Prevalent	–	00–01	Canada	Stroke	–	Discharge Abstract Database	–
Thigpen 1519	Prevalent	–	06–10	US	Stroke + AF	–	Administrative databases	Prosthetic valve
Shen 0837	Prevalent	–	95–00	US	General	–	HMO Kaiser Permanente	Valvular disease
Shireman 0438	Prevalent	–	98–99	US	AF warfarin	–	Medicare discharges, 750/state
Chotchaisuwatana22	Prevalent	–	07–08	Thailand	AF	–	Thailand community hospitals	–
Munkholm 1539	Incident	Unspecified	11–13	Denmark	Cardiac surgery	36.4	Western Denmark Heart Registry	–
Walkey 1140	Incident	Unspecified	07–08	US	Sepsis	5.9	California Inpatient Database	–
Alonso 0917	Incident	Start study	87–04	US	General	7.0	ARIC study	–
Hravnak 0141	Incident	Unspecified	96–98	US	CABG	31.9	Medical Archive Pittsburgh	–

Abbreviations: AF, atrial fibrillation; AFL, atrial flutter; ARIC, Atherosclerosis Risk in Communities study; CABG, coronary artery bypass grafting; ECG, electrocardiogram; HMO, health maintenance organization; UK, United Kingdom; US, United States.

Study characteristics Abbreviations: AF, atrial fibrillation; AFL, atrial flutter; ARIC, Atherosclerosis Risk in Communities study; CABG, coronary artery bypass grafting; ECG, electrocardiogram; HMO, health maintenance organization; UK, United Kingdom; US, United States.

Coding and case definition algorithms

Most reports investigated International Classification of Diseases codes: ICD8 (427.93 AF, 427.94 AFL), ICD9 (4 digit code 427.3 and more explicit 427.31 AF and 427.32 AFL), ICD10 (I48). Overall, 9 different combinations of codes were studied (Table 2). The impact of coding position (primary versus secondary diagnosis in hospitalization data) was never examined. Four studies compared the accuracy of two versus one encounter coded as AF in ambulatory data sources within a single year.4,11–13 This consistently increased specificity but decreased sensitivity. A single study compared 2 versus 1 year for case ascertainment in Veterans Affairs outpatient records, finding greater sensitivity with only slightly reduced specificity.13 Overall in that study, 2 diagnoses over 2 years were optimal for detecting AF.13 Only one study from Canada examined more complex algorithms including cardioversion codes and pharmacy dispensations for antiarrhythmic drugs.11

Table 2

Characteristic and validation of atrial fibrillation in studies

Validated	ICD code(s) *	Number of encounters required to diagnose(and over what time period)	AF confirmation gold standard
Mixed data
Frost 0728	427.93–94, I48	1	Chart, ECG, telemetry, event recorder
Norberg 1329	I48	1	Chart, ECG, Holter, CIED
Baturova 1415	427D, I48	1	Chart, ECG
Tu 1611	427.3x, I48427.x physician bill	1 Hosp/ED or 1MD + cardioversion or 1 rhythm control or 1 MD + OAC (1 year)	Primary care chart (physician notes and/or ECG)
Navar-Boggan 154	427.31	1 inpatient or 2 outpatient/ED visits (1 year)	Chart, ECG
Sung 1612	427.31	1 inpatient or 2 outpatient visits (2 years)	Database registry
Sundboll 1630	427.93–94, I48	1	Chart
Rix 1231	427.93–94, I48	1	Chart, ECG, electrophysiologist review
Frost 0532	427.93–94, I48	1	Chart, ECG, telemetry or Holter
Smith 1033	427.92, 427D, I48	1	Chart, ECG, electrophysiologist review
Ambulatory data
Brophy 0434	427.3, 427.31	1	Chart, ECG
Borzecki 0413	427.3	≥1 or ≥2 (1 or 2 years)	Chart
Go 01/0016,18	427.31	1	ECG
Ruigomez 05/0214,35	OXMIS codes	1	Questionnaire to general practitioner
Hospital data
Quon 0436	427.3x	1 (inpatient)	Chart
Kokotailo 0520	Not stated ICD9	1 (inpatient)	Chart, ECG, Holter
Thigpen 1519	427.31	1 (inpatient)	ECG
Shen 0837	427.3x 1st 3 codes	1 (inpatient)	Chart, ECG
Shireman 0438	427.31	1 (inpatient)	Chart
Chotchaisuwatana22	I48	1 (inpatient)	Chart
Munkholm 1539	Not stated	1 (inpatient)	Chart
Walkey 1140	427.3x	1 (inpatient)	Chart
Alonso 0917	427.31	1 (inpatient)	Chart, ECG
Hravnak 0141	427.31	1 (inpatient)	Text search, prescriptions

Abbreviations: CIED, cardiac implantable electronic device; ECG, electrocardiogram; ED, emergency department; ICD, International Classification of Diseases; MD, doctor of medicine; OAC, oral anticoagulation.

Characteristic and validation of atrial fibrillation in studies Abbreviations: CIED, cardiac implantable electronic device; ECG, electrocardiogram; ED, emergency department; ICD, International Classification of Diseases; MD, doctor of medicine; OAC, oral anticoagulation.

Characteristics of AF

Prevalent and incident AF were assessed in two-thirds and one-thirds of studies, respectively (Table 1). Incident cases were typically defined by exclusion of prior AF diagnoses since the records began, or methods were not specified. The incidence and prevalence of AF varied markedly depending on the population studied, from 0.3% to 55%.14,15 The incidence and prevalence was highest in studies following cardiac surgery (32–36%) and stroke (10–28%), lower in general hospitalizations (7–9%) and lowest in unselected outpatients (0.3–1%). No study distinguished between persistent and paroxysmal AF. Three studies reported from 7.0% to 20.4% of the coded AF to be transient.4,16,17 Two defined transient as a single episode without recurrence,4,16 while two added precipitants including cardiac surgery and/or hyperthyroidism.16,17

Gold standard for diagnosis of AF

With the exception of two studies, medical chart review was considered the gold standard by which history of AF was classified (Table 2). Of these, 13 studies specifically included ECG review, of which 2 employed ECG alone for confirmation of AF.18,19 No prospective protocols or frequencies for ECG were reported. A median of 11 ECGs per patient with AF was noted in a Swedish outpatient setting.15 Only 4 studies mentioned use of longer term rhythm monitoring such as Holter, although these results may also have been available in medical record review.

Sensitivity and specificity

Half the studies (n=12) sampled an unrestricted population including those without AF to assess sensitivity of case definitions and these ranged from 57% to 93%, median 81% (Table 3). The pooled random effects estimate for sensitivity was 80% (95% CI 72–86%) with significant heterogeneity (Q 439, I 97.7%, Tau2 0.08). One-third of the studies (n=8) reported specificity. Estimates were consistently high in ambulatory, hospitalized and mixed populations, ranging from 91% to 99%, median 99% (Table 3). The pooled random effects estimate for specificity was 98% (95% CI 96–99%).

Table 3

Codes and accuracy for identifying atrial fibrillation

Validated	Coding position	n Sample	AF	True positive	False positive	PPV	Not AF	True negative	False negative	NPV	SN	SP
Mixed data
Frost 0728	1°/2°	174	174	172	2	99	–	–	–	–	–	–
Norberg 1329	–	2274	2196	2119	77	96	–	–	155	–	93	–
Baturova 1415	1°/2°	666	188	152	36	81	482	446	32	92	83	93
Tu 1611	–	7500	218	155	63	71	7308	7245	37	99	81	99
Navar-Boggan 154	–	300	300	287	13	96	–	–	–	–	–	–
Sung 1612	–	6469	666	474	192	71	5818	5626	177	97	73	97
Sundboll 1630	1°/2°	97	97	92	5	95	–	–	–	–	–	–
Rix 1231	–	284	284	262	22	93 All65 ED94 IP/OP	–	–	–	–	–	–
Frost 0532	–	116	116	112	4	97	–	–	–	–	–	–
Smith 1033	–	100	100	97	3	97	–	–	–	–	–	–
Ambulatory data
Brophy 0434	–	3366	–	2619	–	–	–	–	747	–	78	–
Borzecki 0413	–	1176	69	59	10	86≥1/1	1103	1093	14	99≥1/1	80≥1/167≥2/186≥1/274≥2/2	99≥1/199≥2/197≥1/299≥2/2
Go 01/0016,18	–	50	50	39	11	78	–	–	–	–	–	–
Ruigomez 05/0214,35	–	1888	1888	1763	125	93	–	–	–	–	–	–
Hospital data
Quon 0436 Prev	–	1200	102	96	6	94	1098	1081	17	98	85	99
Kokotailo 0520	–	137	18	17	1	94	116	115	4	97	81	99
Thigpen 1519	Any	1706	1706	1489	217	87	–	–	–	–	–	–
Shen 0837	Any	100	100	96	4	96	–	–	–	–	–	–
Shireman 0438	1°/2°	38,924	38,924	27,674	11,250	71	–	–	–	–	–	–
Chotchaisuwatana22	–	193	193	169	24	88	–	–	–	–	–	–
Munkholm 1539	–	1381	458	378	80	83	878	798	125	86	75	91
Quon 0436 Incident	–	1200	7	6	1	86	1193	1186	7	99	46	99
Walkey 1140	–	163	163	147	16	90	–	–	–	–	–	–
Alonso 0917	1°	1546	169	135	34	80	1377	1351	26	98	84	98
Hravnak 0141	–	260	–	148	–	–	–	–	112	–	57	–

Abbreviations: AF, atrial fibrillation; NPV, negative predictive value; PPV, positive predictive value; SN, sensitivity; SP, specificity; ED, Emergency department population; IP, inpatient population; OP, outpatient population.

Codes and accuracy for identifying atrial fibrillation Abbreviations: AF, atrial fibrillation; NPV, negative predictive value; PPV, positive predictive value; SN, sensitivity; SP, specificity; ED, Emergency department population; IP, inpatient population; OP, outpatient population.

Positive and negative predictive value

Positive predictive value was reported in nearly all studies (n=22), and was the only parameter reported in half the studies (n=12). The PPV ranged from 71% to 99%, median 93% (Table 3). The pooled random effects estimate was 88% (95% CI 82–94%) with significant heterogeneity (Q 4997, I2 99.6%, Tau2 0.02) (Figure 1). The pooled estimate was similar in ambulatory, hospitalized and mixed populations (respectively: 87% (78–96%); 87% (79–95%); 90% (85–94%)). Negative predictive value was reported in 8 studies. Estimates were consistently high, ranging from 86% to 99%, median 98% (Table 3). The pooled random effects estimate was 97% (95% CI 94–99%).

Figure 1

Positive predictive values of atrial fibrillation (AF) algorithms stratified by population type.

Discussion

This analysis reports several key findings. The overall specificity and NPV of an AF diagnosis using the ICD case definitions was high, 98% and 97%, respectively. The sensitivity and PPV were lower though reasonable, 80% and 88%, respectively. Only half the studies sampled patients with and without an assigned diagnosis of AF to determine the sensitivity of the case definitions and thus the proportion potentially missed by using administrative data. Half the studies confirmed AF using electrocardiography as the gold standard, while the remainder employed medical record review, alternative databases (like primary care EMRs) and/or patient questionnaires. Only 3 studies reported all accuracy parameters and included rhythm monitoring in the gold standard definition.15,17,20

Sensitivity

High sensitivity improves case finding as it more completely captures a population, increases the estimated incidence and prevalence and enhances generalizability. This is particularly relevant when estimating the burden of disease and to reduce bias when studying health inequalities. Sensitivity is also important when defining AF as an exposure. Misclassification of exposure (eg, AF) as non-exposure (eg, no AF) attenuates the association with outcomes such as stroke.21 By contrast, sensitivity is less of a concern when defining AF as an outcome, for example in pharmacovigilance studies. In these circumstances, estimates of relative risk are not biased providing misclassification occurs to the same degree in exposed and non-exposed patients. Sensitivity is reduced when cases are missed and AF is misclassified as normal (ie, false negatives). This occurs in two circumstances. First, when recording or coding is incorrect. Second, when correctly recorded and coded diagnoses are missed in time or space. Examining shorter time frames may miss infrequent encounters, as evidenced in the Veterans Affairs study where sensitivity increased using a 2 versus 1 year period for case ascertainment.13 Information also flows by varying amounts and rates through health systems. Although the median time from AF on ECG to diagnosis in the Swedish Patient Register was 16 days, this time lapse exceeded 6 months in one-third of patients.15 Sensitivity may be viewed from different perspectives: local, horizontal level of care (eg, primary care), vertical (eg, health maintenance organization) or global (entire health care system). Examining a single health care setting may miss encounters meeting the AF case definition in another. For example, hospitalization data alone misses patients managed entirely in the community, causing under-estimates of prevalence rates and over-estimates of adverse outcome rates. There were insufficient studies to accurately compare sensitivity between ambulatory, hospital and mixed populations. However, one of the mixed population studies did compare the accuracy of coding between primary care, secondary care or both together. In that study from Ontario, the sensitivity was 45%, 39% and 75% for hospitalization, emergency department or outpatient data sources alone, respectively, and 83% combining the three sources.11 The true population incidence and prevalence may also be influenced by access to rhythm monitoring (ECG, Holter, event or loop recorder), reporting standards (eg, training, quality assurance) and information transfer (eg, interface to electronic medical record). These factors are potentially more challenging in community than in hospital settings, particularly relevant to measuring inequalities, and difficult to quantify. None of the included studies described these aspects of access.

Positive predictive value

Since sensitivity and specificity are typically inversely related, higher sensitivity reduces specificity, which increases false positives and lowers PPV. The impact on PPV is magnified for diseases with a relatively low prevalence such as AF. A high PPV ensures persons identified truly have AF (fewer false positives). This is central to adjudicating treatment uptake, which will appear inappropriately withheld if patients with sinus rhythm are misclassified as having AF, unless OAC is prescribed for an alternate reason.4 A PPV value exceeding 85–90% suggested adequate for research purposes.19,20 The reasons for false positives were rarely explored.22 Potential scenarios include: 1) miscoding eg, allergic rhinitis was written as “AR” and coded as AF;22 2) rhythm misinterpretation such as atrial tachycardia; 3) misreporting if based on medical history alone; and 4) AF defined by an intervention shared with other conditions eg, cardioversion. PPV is also highly dependent on disease prevalence: as many studies focused on older or high-risk individuals they may overestimate the true PPV for that case definition if applied in a younger population. Oral anticoagulation is the only treatment to improve survival in patients with AF, and thus a key quality indicator. Although the overall PPV was high (88%), the specificity and PPV to identify AF requiring anticoagulation (as opposed to any AF) could be lower for several reasons. First, up to 10% of incident AF is isolated with a defined precipitant, low recurrence, and may not require anticoagulation.4,16,23 Only three studies reported or excluded such patients.4,16,17 Second, anticoagulation adjudication requires accurate coding of embolic and bleeding risk factors, which like AF exhibit high specificity but are under-reported.12,22 More subjective bleeding risks such as frailty and falls are particularly difficult to quantify, although a recently described frailty score based on administrative data (the Hospital Frailty Risk Score) has been described.24 Finally, patient preferences are major drivers of anticoagulation decisions but are never captured in administrative databases.

Atrial fibrillation phenotype and coding considerations

The disease spectrum (permanent, persistent, paroxysmal, isolated unprovoked or provoked episodes) was rarely reported yet also impacts accuracy of AF detection. Permanent or persistent AF is associated with greater comorbidity and hence health care encounters during which arrhythmia is continuously present. By contrast, isolated or paroxysmal AF may be under-represented by health care encounters. Treatment including rate versus rhythm control and anticoagulation also varies based on symptoms, AF duration, risk of recurrence and thromboembolism.25 The accuracy of administrative data to identify AF requiring anticoagulation is thus further lessened by limited phenotypic characterization.4 The AF phenotype may also impact the “gold standard” for diagnosing AF, whereby paroxysmal AF is missed by ECG alone but detected by chart review. In the only study examining this issue, ECG review did not improve sensitivity of AF detection over diagnosis codes alone.4 Most developed health systems collect reasons for hospital and ambulatory encounters for administration, service planning, quality improvement and reimbursement. A single primary or most responsible diagnosis is typically assigned, while conditions complicating or prolonging stay are coded in multiple secondary positions, sometimes further categorized as pre-existing or de novo disease. Differences in coding accuracy, treatment and prognosis are reported between primary and secondary positions for conditions such as heart failure.26,27 To our knowledge, such differences have not been explored in patients with AF, and no study identified by our search compared coding positions.

Strengths and limitations

Several strengths and limitations merit consideration. Our analysis is contemporary, included varied health systems, ICD-8 to ICD-10 codes, and both ambulatory and hospital populations. However, most studies originated from North America or Scandinavia, and examined ICD codes in administrative data sources. This potentially limits generalizability to other health care systems. There was significant heterogeneity in terms of population, prevalence of AF and reported accuracy parameters. Most studies assessed accuracy in restricted cohorts as opposed to the broader population.

Directions for future research

Health service researchers and administrators may interpret administrative data using either our pooled estimates or locally relevant studies from among those identified. Jurisdictions would ideally conduct nationally representative validation studies to provide estimates specific to their populations and data sources. These should examine existing codes and test new case definition algorithms in all data sources with differences in coding practices and diagnostic accuracy (eg, hospitalization, emergency department, ambulatory primary and secondary care), and in scenarios with varying disease prevalence. Though challenging and costly, random sampling of representative populations is essential to define sensitivity, enhance generalizability and reduce bias when studying inequality. To understand the true disease burden, algorithms should combine primary and secondary care data sources. More complex algorithms utilizing advanced analytics such as natural language processing and machine learning to mine free-text medical records merit investigation. Potential avenues include integrating corroboratory data such as medications and procedures, and temporal and spatial coding patterns. Future work should investigate the optimal gold standard including rhythm monitoring, electronic data sources and chart review. The reasons for false positives and negatives need to be explored in detail, as does the impact of AF phenotype and coding position. Finally, the accuracy of embolic and bleeding risk factor case definitions requires further validation in order to adjudicate appropriateness of anticoagulation management choices.

Conclusion

The overall accuracy of AF identification was reasonable for system planning and surveillance of prevalence, quality and outcomes. However, there is a marked disconnect between the volume of publications in these domains, and those examining the underpinning data. Sensitivity and PPV were the least accurate parameters with greatest uncertainty in terms of evidence and interpretation. This potentially underestimates the burden of disease and may bias estimates of outcomes and treatment quality. The optimal AF case definition should consider the purpose of the study and the data sources available. Health service administrators, researchers and clinicians should be mindful of these factors, and work together to refine our use of electronic data.

40 in total

1. Assessing accuracy of diagnosis-type indicators for flagging complications in administrative data.

Authors: Hude Quan; Gerry A Parsons; William A Ghali
Journal: J Clin Epidemiol Date: 2004-04 Impact factor: 6.437

Review 2. The clinical profile and pathophysiology of atrial fibrillation: relationships among clinical features, epidemiology, and mechanisms.

Authors: Jason Andrade; Paul Khairy; Dobromir Dobrev; Stanley Nattel
Journal: Circ Res Date: 2014-04-25 Impact factor: 17.367

3. Identifying Patients With Atrial Fibrillation in Administrative Data.

Authors: Karen Tu; Robby Nieuwlaat; Stephanie Y Cheng; Laura Wing; Noah Ivers; Clare L Atzema; Jeff S Healey; Paul Dorian
Journal: Can J Cardiol Date: 2016-06-23 Impact factor: 5.223

4. Meta-analysis in clinical trials.

Authors: R DerSimonian; N Laird
Journal: Control Clin Trials Date: 1986-09

5. Validation of electronic medical database in patients with atrial fibrillation in community hospitals.

Authors: Somjit Chotchaisuwatana; Arom Jedsadayanmata; Nathorn Chaiyakunapruk; Ketechan Jampachaisri
Journal: J Med Assoc Thai Date: 2011-06

6. Incidence of chronic atrial fibrillation in general practice and its treatment pattern.

Authors: Ana Ruigómez; Saga Johansson; Mari Ann Wallander; Luis Alberto García Rodríguez
Journal: J Clin Epidemiol Date: 2002-04 Impact factor: 6.437

7. Combined anticoagulant-antiplatelet use and major bleeding events in elderly atrial fibrillation patients.

Authors: Theresa I Shireman; Patricia A Howard; Timothy F Kresowik; Edward F Ellerbeck
Journal: Stroke Date: 2004-08-26 Impact factor: 7.914

8. Atrial fibrillation in the Malmö Diet and Cancer study: a study of occurrence, risk factors and diagnostic validity.

Authors: J Gustav Smith; Pyotr G Platonov; Bo Hedblad; Gunnar Engström; Olle Melander
Journal: Eur J Epidemiol Date: 2010-02 Impact factor: 8.082

9. Is the diagnostic coding position of acute heart failure related to mortality? A report from the Euro Heart Failure Survey-1.

Authors: Ahmad Shoaib; Mohamed Farag; Mansoor Nasir; Joseph John; Sanjay Gupta; Pierpaolo Pellicori; Renjith Antony; Rashida Perveen; Alan Rigby; Kevin M Goode; Ashraf Yassin; Andrew L Clark; John G F Cleland
Journal: Eur J Heart Fail Date: 2016-03-16 Impact factor: 15.534

10. Positive predictive value of cardiovascular diagnoses in the Danish National Patient Registry: a validation study.

Authors: Jens Sundbøll; Kasper Adelborg; Troels Munch; Trine Frøslev; Henrik Toft Sørensen; Hans Erik Bøtker; Morten Schmidt
Journal: BMJ Open Date: 2016-11-18 Impact factor: 2.692

8 in total

1. COVID-RDNet: A novel coronavirus pneumonia classification model using the mixed dataset by CT and X-rays images.

Authors: Lingling Fang; Xin Wang
Journal: Biocybern Biomed Eng Date: 2022-08-05 Impact factor: 5.687

2. Do routine hospital data accurately record comorbidity in advanced kidney disease populations? A record linkage cohort study.

Authors: Rommel Ravanan; Dominic Taylor; Ailish Nimmo; Retha Steenkamp
Journal: BMC Nephrol Date: 2021-03-17 Impact factor: 2.388

3. A Population-Based Study of Unexplained/Lone Atrial Fibrillation: Temporal Trends, Management, and Outcomes.

Authors: Thomas M Roston; Sunjidatul Islam; Nathaniel M Hawkins; Zachary W Laksman; Shubhayan Sanatani; Andrew D Krahn; Roopinder Sandhu; Padma Kaul
Journal: CJC Open Date: 2021-09-14

4. Geographic and temporal variation in the treatment and outcomes of atrial fibrillation: a population-based analysis of national quality indicators.

Authors: Derek S Chew; Flora Au; Yuan Xu; Braden J Manns; Marcello Tonelli; Stephen B Wilton; Brenda Hemmelgarn; Shiying Kong; Derek V Exner; Amity E Quinn
Journal: CMAJ Open Date: 2022-08-02

5. Use of oral anticoagulants among individuals with cancer and atrial fibrillation in the United States, 2010-2016.

Authors: Shirin Ardeshirrouhanifard; Huijun An; Ravi K Goyal; Mukaila A Raji; Jodi B Segal; G Caleb Alexander; Hemalkumar B Mehta
Journal: Pharmacotherapy Date: 2022-04-14 Impact factor: 6.251

6. Nationwide Study of Sex Differences in Incident Heart Failure in Newly Diagnosed Nonvalvular Atrial Fibrillation.

Authors: Haran Yogasundaram; Sunjidatul Islam; Douglas C Dover; Nathaniel M Hawkins; Justin Ezekowitz; Padma Kaul; Roopinder K Sandhu
Journal: CJC Open Date: 2022-05-06

7. Administrative Data in Cardiovascular Research-A Comparison of Polish National Health Fund and CRAFT Registry Data.

Authors: Cezary Maciejewski; Krzysztof Ozierański; Mikołaj Basza; Piotr Lodziński; Andrzej Śliwczyński; Leszek Kraj; Maciej Janusz Krajsman; Jefte Prado Paulino; Agata Tymińska; Grzegorz Opolski; Andrzej Cacko; Marcin Grabowski; Paweł Balsam
Journal: Int J Environ Res Public Health Date: 2022-09-22 Impact factor: 4.614

Review 8. Validity of Acute Cardiovascular Outcome Diagnoses Recorded in European Electronic Health Records: A Systematic Review.

Authors: Jennifer Davidson; Amitava Banerjee; Rutendo Muzambi; Liam Smeeth; Charlotte Warren-Gash
Journal: Clin Epidemiol Date: 2020-10-14 Impact factor: 4.790

8 in total