Literature DB >> 19777060

Selection of medical diagnostic codes for analysis of electronic patient records. Application to stroke in a primary care database.

Martin C Gulliford1, Judith Charlton, Mark Ashworth, Anthony G Rudd, Andre Michael Toschke.   

Abstract

BACKGROUND: Electronic patient records from primary care databases are increasingly used in public health and health services research but methods used to identify cases with disease are not well described. This study aimed to evaluate the relevance of different codes for the identification of acute stroke in a primary care database, and to evaluate trends in the use of different codes over time.
METHODS: Data were obtained from the General Practice Research Database from 1997 to 2006. All subjects had a minimum of 24 months of up-to-standard record before the first recorded stroke diagnosis. Initially, we identified stroke cases using a supplemented version of the set of codes for prevalent stroke used by the Office for National Statistics in Key health statistics from general practice 1998 (ONS codes). The ONS codes were then independently reviewed by four raters and a restricted set of 121 codes for 'acute stroke' was identified but the kappa statistic was low at 0.23.
RESULTS: Initial extraction of data using the ONS codes gave 48,239 cases of stroke from 1997 to 2006. Application of the restricted set of codes reduced this to 39,424 cases. There were 2,288 cases whose index medical codes were for 'stroke annual review' and 3,112 for 'stroke monitoring'. The frequency of stroke review and monitoring codes as index codes increased from 9 per year in 1997 to 1,612 in 2004, 1,530 in 2005 and 1,424 in 2006. The one year mortality of cases with the restricted set of codes was 29.1% but for 'stroke annual review,' 4.6% and for 'stroke monitoring codes', 5.7%.
CONCLUSION: In the analysis of electronic patient records, different medical codes for a single condition may have varying clinical and prognostic significance; utilisation of different medical codes may change over time; researchers with differing clinical or epidemiological experience may have differing interpretations of the relevance of particular codes. There is a need for greater transparency in the selection of sets of codes for different conditions, for the reporting of sensitivity analyses using different sets of codes, as well as sharing of code sets among researchers.

Entities:  

Mesh:

Year:  2009        PMID: 19777060      PMCID: PMC2744876          DOI: 10.1371/journal.pone.0007168

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Electronic patient records from primary care databases are increasingly used in public health and health services research with applications in disease epidemiology, drug utilisation and pharmacoepidemiology [1]. Identification of patients with specific diseases for study from electronic patient records typically requires the use of selected medical codes but the process of selecting codes has not been well described. In an early publication from the General Practice Research Database (GPRD), the Office for National Statistics (ONS) published sets of codes for a range of diseases and conditions included in the publication Key Health Statistics from General Practice [2]. This example has not often been followed by researchers, even though the selection of appropriate sets of medical codes is recognised to be a difficult yet important component of research using electronic patient records. This paper therefore describes the procedure we used to select a set of codes for diagnosis of stroke in primary care. Stroke is a long-term condition of public health importance that may be investigated in primary care databases. However, stroke represents an acute illness, around the time of initial occurrence or recurrence, as well as a long-term condition which may require secondary preventive medical care, rehabilitation and supportive care. One of the first studies to report on the epidemiology of stroke in a primary care database was the UK Office for National Statistics report Key Health Statistics in General Practice [2]. This publication reported on the prevalence of stroke in the UK General Practice Research Database (GPRD). Subjects with stroke were identified using a set of 192 medical codes that included diagnostic codes for acute stroke, as well as prevalent stroke and cerebrovascular disease. In a GPRD-based study we used the ONS codes with a 2 year stroke code free run-in period to identify incident stroke events [3]. However, in a study of trends in case fatality, we identified important trends in the use of different medical codes for stroke over time. In the present analysis, therefore, we aimed to re-evaluate the set of codes described by ONS to determine whether a subset of codes for acute stroke could be identified. The methods used for evaluation included: i) reclassification of codes by four independent raters; ii) evaluation of trends over time in the use of medical codes for stroke; and iii) evaluation of one year mortality for groups of subjects with different index medical codes for stroke. We also aimed to evaluate trends over time in the utilisation of different codes.

Methods

General Practice Research Database

The General Practice Research Database (GPRD) is a large database comprising the electronic patient records from approximately 5% of UK general practices. A number of studies have evaluated the validity of diagnoses recorded in GPRD with generally satisfactory results [4], [5]. Data were collected into the GPRD from 1987 up to the present. Clinical entries in the GPRD were coded initially using the Oxford Medical Information Systems (OXMIS) codes and later using READ codes, with the date of transition from Oxmis to READ coding systems varying for different practices in the database [1].

Data extraction

Initially, we used the set of diagnostic codes for prevalent stroke that was used by the Office for National Statistics in the publication Key Health Statistics from General Practice [2]. This set of codes does not include transient ischaemic attacks. There are 192 codes in the set used by ONS to define prevalent stroke. These were supplemented with codes from the READ and GPRD medical code dictionaries, giving a set of 202 codes. These codes were used for initial data extraction. The baseline population consisted of all up-to-standard patients (UTS) in GPRD from 1st January 1987 to 31st December 2006 inclusive. In the GPRD data are classified as ‘up-to-standard’ when they are judged to be of a high enough quality to be used for research. Cases were all patients with a first recorded stroke code during the period 1st January 1997 to 31st December 2006 inclusive. The date of stroke diagnosis was defined as the first date on which a stroke medical code was recorded. Cases were selected if they had at least 24 months of up-to-standard follow-up, prior to the date of diagnosis of stroke. This process yielded a sample of 48,239 subjects.

Data analysis

Three methods were used to evaluate the medical codes and the extracted data.

i) reclassification of codes by four independent observers

The list of 202 Read and Oxmis codes was independently reviewed by four different medically qualified raters. These included a public health/health services researcher, an epidemiologist, a stroke physician and a general practitioner. Each rater reviewed each code and determined, based on their clinical judgement, whether use of the code was likely to indicate an acute stroke event. Raters were compared using kappa statistics. For each code, the ratings were summed to give a score ranging from zero, with no raters agreeing that the code indicated acute stroke, to four, with all raters agreeing that the code was consistent with acute stroke. Based on consensus, codes rated 3 or 4 were classed as ‘acute stroke’. A rating of three affirmatives was used as a cut-point because one rater did not classify subarachnoid haemorrhage as stroke. Codes that were excluded through this process included codes for long-term stroke management including sequelae of stroke, extracranial atherosclerotic disease, disorders of the intracranial venous sinuses and other conditions. Three codes for stroke annual review or stroke monitoring were also classified into stroke annual review codes, including “Stroke/CVA annual review” (662e.00), and stroke monitoring codes, including “stroke monitoring” (662M.00) and “haemorrhagic stroke monitoring” (662o.00).

ii) evaluation of trends over time in the use of medical codes for stroke

We evaluated trends over time in the use of different sets of codes including those affirmed by 0, 1, 2, 3 or 4 raters. We analysed the medical codes recorded on the stroke index date, that is the date of the earliest reported stroke code in the medical record. Where more than one stroke code was recorded on the index date then the code with the highest rating was selected for analysis. We also used the sets of codes resulting from reclassification – ‘acute stroke’, ‘stroke annual review’ and ‘stroke monitoring’. We estimated incidence rates for stroke for different groups of codes. Rates were estimated by study year, standardising by age and sex to the European Standard Population for reference.

iii) evaluation of one year mortality for groups of subjects with different types of first medical code for stroke

We compared mortality within one year of stroke incidence for the same sets of codes outlined above. One year case fatality was estimated from a failure function in order to allow for censoring of cases that transferred out or reached the end of the study before one year follow-up was completed.

Ethics

The study was implemented utilising an anonymised dataset from the General Practice Research Database. The study protocol was approved by the Independent Scientific Advisory Committee (ISAC) of the Medicines and Healthcare products Regulatory Agency (MHRA) (ISAC Protocol No. 07_027R).

Results

Review of stroke codes

When the ONS codes were rated by four reviewers there was poor agreement concerning whether the codes were consistent with acute stroke. Table 1 shows pairwise values for percent agreement and kappa. While there was moderate or good agreement between raters one and three, one and four and between two and three, there was only weak agreement between raters one and two, two and four and three and four (Table 1). The value for kappa for all four raters was low at 0.23 suggesting no overall agreement. However, more than 80% of new stroke events were associated with codes that either three or four raters agreed were consistent with acute stroke. The number of raters who indicated a code was consistent with acute stroke and numbers of codes, were as follows: zero affirmative ratings, four codes; one affirmative rating, 36 codes; two affirmatives, 41 codes; three affirmatives, 41 codes; four affirmatives, 80 codes. Thus each of the raters agreed that codes for cerebral infarction, cerebral haemorrhage, cerebral embolism and ‘cerebral vascular accident’ were consistent with acute stroke. Of the 41 codes that only three raters affirmed as consistent with acute stroke, 16 were for subarachnoid haemorrhage which is included in the WHO definition of stroke but not in the definition of stroke used in the Quality and Outcomes Framework of the UK general practice contract. A further 12 codes were for hemiplegia or paralysis, which is a common presenting feature of acute stroke. Of the codes which two or fewer raters considered to be indicative of acute stroke, the majority of codes were for intracranial or extracranial arterial disease without mention of infarction, diseases of the cerebral venous sinuses, or for sequelae of stroke. Raters showed considerable disagreement over these codes. Based on these findings, we considered codes that were affirmed by three or more raters were consistent with acute stroke. This set of codes for ‘acute stroke’ included 121 codes.
Table 1

Agreement between four raters.

Rater 1 Agreement %Rater 1 KappaRater 2 Agreement %Rater 2 KappaRater 3 Agreement %Rater 3 Kappa
Rater 2 64.40.12----
Rater 3 76.20.4276.20.30--
Rater 4 77.70.5448.0−0.1061.90.20

Figures are observed percent agreement and kappa statistic for independent rating of 202 codes by each of four raters.

Figures are observed percent agreement and kappa statistic for independent rating of 202 codes by each of four raters.

Utilisation of stroke codes

Initial extraction of data using the ONS codes gave 48,239 cases of stroke from 1997 to 2006. The 48,239 subjects had a total of 70,288 stroke medical codes recorded on the stroke index date. The codes used and their relative frequencies are shown in the Table 2. There were only 17 codes that each accounted for more than 1% of subjects, and just two codes accounted for nearly half of subjects.
Table 2

READ and OXMIS terms for stroke, results of rating for acute stroke and relative frequency among 70,288 index stroke codes recorded in 48,239 subjects.

TypeCodeTermSum of 4 ratingsFrequency in 70,288 index codesPercent of index codes
READG66..00Stroke and cerebrovascular accident unspecified42135430.38
READG66..11CVA unspecified41542621.95
OXMIS4369ACVA (cerebrovascular accident)428594.07
READG66..13CVA - Cerebrovascular accident unspecified422543.21
READG64z.00Cerebral infarction NOS422243.16
READG61..11CVA - cerebrovascular accid due to intracerebral haemorrhage413341.90
READG61..00Intracerebral haemorrhage412511.78
READG64z.12Cerebellar infarction47621.08
OXMIS4369BStroke45250.75
READG64..13Stroke due to cerebral arterial occlusion44720.67
READG640.00Cerebral thrombosis44020.57
READG667.00Left sided CVA43930.56
READG64..12Infarction - cerebral43510.50
READG668.00Right sided CVA43020.43
READG61..12Stroke due to intracerebral haemorrhage42580.37
READG66..12Stroke unspecified42050.29
READG664.00Cerebellar stroke syndrome42000.28
READG663.00Brain stem stroke syndrome41900.27
READG613.00Cerebellar haemorrhage41790.25
READG64z300Right sided cerebral infarction41320.19
READG64z200Left sided cerebral infarction41170.17
READG641.00Cerebral embolism4970.14
READG61z.00Intracerebral haemorrhage NOS4930.13
READG63y000Cerebral infarct due to thrombosis of precerebral arteries4620.09
READG64z000Brainstem infarction4610.09
READG640000Cerebral infarction due to thrombosis of cerebral arteries4480.07
OXMIS3479AGCerebellar infarction4420.06
READG6X..00Cerebrl infarctn due/unspcf occlusn or sten/cerebrl artrs4410.06
READG64z400Infarction of basal ganglia4400.06
READG665.00Pure motor lacunar syndrome4320.05
READG63y100Cerebral infarction due to embolism of precerebral arteries4310.04
READG614.00Pontine haemorrhage4280.04
READG62z.00Intracranial haemorrhage NOS4220.03
OXMIS4339NFInfarct cerebral4170.02
READG61X000Left sided intracerebral haemorrhage, unspecified4170.02
READG61X100Right sided intracerebral haemorrhage, unspecified4170.02
OXMIS4319CEHaemorrhage intracerebral4150.02
READG6W..00Cereb infarct due unsp occlus/stenos precerebr arteries4140.02
READG611.00Internal capsule haemorrhage4130.02
READG641000Cerebral infarction due to embolism of cerebral arteries4110.02
OXMIS4319Haemorrhage cerebral4100.01
READG676000Cereb infarct due cerebral venous thrombosis, nonpyogenic4100.01
READG666.00Pure sensory lacunar syndrome4100.01
READG62..00Other and unspecified intracranial haemorrhage480.01
OXMIS4339PLParietal lobe infarct470.01
READG64z.11Brainstem infarction NOS450.01
READGyu6400[X]Other cerebral infarction450.01
OXMIS4369ARCerebrovascular accident right440.01
OXMIS4339FLInfarct frontal lobe430.00
OXMIS4369ALCerebrovascular accident left430.00
OXMIS4339BTInfarct brain stem430.00
READG641.11Cerebral embolus430.00
READG610.00Cortical haemorrhage430.00
OXMIS4349Embolism cerebral420.00
OXMIS4339TLTemporal lobe infarct420.00
READG612.00Basal nucleus haemorrhage420.00
READG616.00External capsule haemorrhage420.00
OXMIS4339WLWallenberg's syndrome420.00
READG63..11Infarction - precerebral420.00
READGyu6200[X]Other intracerebral haemorrhage420.00
OXMIS4319NAHaematoma brain nontraumatic410.00
OXMIS4319NGHaemorrhage brain nontraumatic410.00
OXMIS4339PPInfarcts cerebral septic410.00
OXMIS4339Thrombosis cerebral410.00
READG61X.00Intracerebral haemorrhage in hemisphere, unspecified410.00
OXMIS4369CRAccident cerebral400.00
OXMIS4370Cerebral ischaemia hypertensive400.00
OXMIS4360ACerebrovascular accident with hypertensi400.00
OXMIS4340Embolism cerebral with hypertension400.00
OXMIS4340CREmbolism intracranial with hypertension400.00
OXMIS4310Haemorrhage intracerebral with hypertens400.00
OXMIS344 BFHemiplegia flaccid400.00
OXMIS4360HPHypertensive hemiplegia400.00
OXMIS4339PPontine infarct400.00
OXMIS4360BStroke with hypertension400.00
OXMIS4369BNSyndrome stroke400.00
READG618.00Intracerebral haemorrhage, multiple localized400.00
READGyu6G00[X]Cereb infarct due unsp occlus/stenos precerebr arteries400.00
READGyu6300[X]Cerebrl infarctn due/unspcf occlusn or sten/cerebrl artrs400.00
READGyu6F00[X]Intracerebral haemorrhage in hemisphere, unspecified400.00
READG64..11CVA - cerebral artery occlusion323843.39
READG60..00Subarachnoid haemorrhage321012.99
READF22..11Hemiparesis317492.49
READF22..00Hemiplegia38301.18
READF22z.00Hemiplegia NOS31750.25
OXMIS4309Subarachnoid haemorrhage31470.21
READF222.00Left hemiplegia3960.14
READF223.00Right hemiplegia3920.13
READG600.00Ruptured berry aneurysm3430.06
READG602.00Subarachnoid haemorrhage from middle cerebral artery3420.06
READ2833O/E - hemiplegia3370.05
READG60X.00Subarachnoid haemorrh from intracranial artery, unspecif3340.05
READG617.00Intracerebral haemorrhage, intraventricular3320.05
READG60z.00Subarachnoid haemorrhage NOS3300.04
READG64z111Lateral medullary syndrome3180.03
OXMIS344 BLHemiplegia left3170.02
OXMIS4369VAVascular accident3100.01
OXMIS344 BRHemiplegia right3100.01
READG604.00Subarachnoid haemorrhage from posterior communicating artery390.01
OXMIS4319CRHaemorrhage intracranial380.01
READG603.00Subarachnoid haemorrhage from anterior communicating artery380.01
READG605.00Subarachnoid haemorrhage from basilar artery370.01
OXMIS4380Cerebrovascular disease with hypertensio350.01
OXMIS4379Ischaemia cerebral350.01
OXMIS344 HFHemiparesis facial320.00
OXMIS4330Cerebral thrombosis with hypertension320.00
OXMIS4359CCCaroticovertebral ischaemia310.00
OXMIS4349CREmbolism intracranial310.00
READG615.00Bulbar haemorrhage310.00
READG606.00Subarachnoid haemorrhage from vertebral artery310.00
READGyu6100[X]Other subarachnoid haemorrhage310.00
OXMIS4339BCClot blood brain300.00
OXMIS4380HPHemiplegia with hypertension300.00
OXMIS344Paralysis cerebral300.00
OXMIS344 BParalysis hemiplegia300.00
OXMIS4359PRProlonged residual ischaemic neuro defec300.00
OXMIS4300Subarachnoid haemorrhage with hypertensi300.00
OXMIS4339WAThrombosis posterior cerebellar artery300.00
READG601.00Subarachnoid haemorrhage from carotid siphon and bifurcation300.00
READGyu6E00[X]Subarachnoid haemorrh from intracranial artery, unspecif300.00
READGyu6000[X]Subarachnoid haemorrhage from other intracranial arteries300.00
READ14A7.11H/o: CVA27701.10
READ14A7.12H/O: stroke27551.07
OXMIS4389Cerebrovascular disease21680.24
READ14A7.00H/O: CVA/stroke2700.10
READG68X.00Sequelae of stroke,not specfd as h'morrhage or infarction2540.08
READF221.00Spastic hemiplegia2310.04
READF053.00Thrombophlebitis of central nervous system venous sinuses2160.02
READF051.00Thrombosis of central nervous system venous sinuses2140.02
READG620.00Extradural haemorrhage - nontraumatic2130.02
READG630.00Basilar artery occlusion2120.02
READG683.00Sequelae of cerebral infarction2120.02
READF220.00Flaccid hemiplegia2120.02
OXMIS442 CAAneurysm cerebral artery280.01
READF051000Thrombosis cavernous sinus270.01
READG631.12Thrombosis, carotid artery270.01
READF051300Thrombosis transverse sinus260.01
READF051200Thrombosis lateral sinus240.01
READF050000Embolism cavernous sinus230.00
READ662o.00Haemorrhagic stroke monitoring230.00
READG681.00Sequelae of intracerebral haemorrhage220.00
READF051z00Thrombosis of central nervous system venous sinus NOS220.00
OXMIS442 CCerebral arteriovenous aneurysm210.00
READF050100Embolism superior longitudinal sinus210.00
READG63y.00Other precerebral artery occlusion210.00
OXMIS4379AMAneurysm cerebrovascular arterioscleroti200.00
OXMIS366 CTChoroidal thrombosis200.00
OXMIS4309MMeningeal haemorrhage200.00
OXMIS4389PPPoststroke paralysis200.00
OXMIS4389PNSyndrome poststroke200.00
OXMIS4379TAThromboangiitis cerebral200.00
OXMIS4329TThrombosis carotid artery200.00
OXMIS321 CVThrombosis cavernous sinus200.00
OXMIS321 LTThrombosis lateral sinus200.00
OXMIS321 TLThrombosis superior sagittal sinus200.00
READF050z00Embolism central nervous system venous sinus NOS200.00
READF050200Embolism lateral sinus200.00
READF050.00Embolism of central nervous system venous sinus200.00
READF050300Embolism transverse sinus200.00
READF050.11Embolus of central nervous system venous sinus200.00
READF051100Thrombosis of superior longitudinal sinus200.00
READGyu6C00[X]Sequelae of stroke,not specfd as h'morrhage or infarction200.00
READ662M.00Stroke monitoring131454.47
READ662e.00Stroke/CVA annual review123453.34
READG64..00Cerebral arterial occlusion118532.64
READF26y000Hemiplegic migraine14930.70
READG621.00Subdural haemorrhage - nontraumatic12110.30
READG631.00Carotid artery occlusion11540.22
OXMIS3460MHMigraine hemiplegic1570.08
OXMIS4319LNSubdural haematoma/haemorrhage nontrauma1490.07
READG671z00Generalised ischaemic cerebrovascular disease NOS1170.02
READG63..00Precerebral arterial occlusion1110.02
READG632.00Vertebral artery occlusion1100.01
READG65z100Intermittent cerebral ischaemia170.01
READG677300Occlusion and stenosis of cerebellar arteries140.01
READG677000Occlusion and stenosis of middle cerebral artery140.01
READF05..00Phlebitis and thrombophlebitis of intracranial sinuses130.00
OXMIS3479CCerebral anoxia120.00
OXMIS344 SHSpastic hemiplegia120.00
READG65z000Impending cerebral ischaemia120.00
READG677200Occlusion and stenosis of posterior cerebral artery120.00
READ14AK.00H/O: Stroke in last year110.00
READG680.00Sequelae of subarachnoid haemorrhage110.00
OXMIS4379ANCerebral aneurysm arteriosclerotic100.00
OXMIS7876PHHemiplegic gait100.00
OXMIS4389PLLate effects CVA100.00
OXMIS321 CEThrombosis cerebral sinus100.00
READG671000Acute cerebrovascular insufficiency NOS100.00
READG633.00Multiple and bilateral precerebral arterial occlusion100.00
READG677100Occlusion and stenosis of anterior cerebral artery100.00
READG677400Occlusion+stenosis of multiple and bilat cerebral arteries100.00
READF052z00Phlebitis of central nervous system venous sinus NOS100.00
READF052.00Phlebitis of central nervous system venous sinuses100.00
READG63z.00Precerebral artery occlusion NOS100.00
READF053z00Thrombophlebitis of central nervous system venous sinus NOS100.00
READGyu6600[X]Occlusion and stenosis of other cerebral arteries100.00
READGyu6500[X]Occlusion and stenosis of other precerebral arteries100.00
READGyu6B00[X]Sequelae of other nontraumatic intracranial haemorrhage100.00
READE030400Acute confusional state, of cerebrovascular origin0340.05
READE031400Subacute confusional state, of cerebrovascular origin0230.03
READG677.00Occlusion/stenosis cerebral arts not result cerebral infarct040.01
OXMIS344 BTTRANSIENT HEMIPLEGIA000.00

(There were 19,497 subjects with two codes recorded on the index date and 2,552 subjects with three or more codes recorded on the index date)

(There were 19,497 subjects with two codes recorded on the index date and 2,552 subjects with three or more codes recorded on the index date) The distribution of different sets of codes, as index codes, within this sample is shown in Table 3. Codes that received zero affirmative ratings were rarely utilised as index codes accounting for 56 (0.1%) subjects out of the entire sample. Codes that received one affirmative rating accounted for 14.7% of the patient sample, but the number of cases identified through these codes increased from 121 per year in 1997 to 1,861 in 2004, 1,730 in 2005, and 1,618 in 2006. Of 7,091 first occurrences of these codes 5,209 (73%) were in 2004 or later. This increase was accounted for by increased utilisation of codes for stroke annual review or stroke monitoring. Overall, there were 3,112 (6.5%) of cases initially diagnosed through codes for ‘stroke monitoring’ and 2,288 (4.7%) diagnosed through codes for ‘stroke annual review’. Codes affirmed by two raters accounted for 3.5% of the patient sample. Codes that received three or four affirmative ratings accounted for 81.7% of the patient sample. However, these codes accounted for 93% of the patient sample in 1997 but 68% in 2006. There were a further 860 subjects with valid acute stroke codes ever recorded, affirmed by three or four raters, in whom the valid acute stroke code was not an index code. There were 293/860 cases in whom the valid acute stroke code was recorded within 30 days of the index stroke code.
Table 3

Number of new stroke cases by study year and type of codes used for case selection.

ONS codes (202)Zero affirmatives (4)One affirmative (36)Two affirmatives (41)Three affirmatives (41)Four affirmatives (80)‘Stroke monitoring’ (2)‘Stroke annual review’ (1)‘Acute stroke’ (121)
1997 3,40331211273402,812903,152
1998 3,81221531564123,0891913,501
1999 4,06521571674333,3063743,739
2000 4,22862161514673,3885313,855
2001 4,706122192005123,7638654,275
2002 5,04253502015543,93218554,486
2003 5,371106662016823,812384454,494
2004 6,30781,8611687493,5218857274,270
2005 5,85341,7301496883,2827018293,970
2006 5,45241,6181485803,1027536713,682
Total 48,239567,0911,6685,41734,0073,1122,28839,424
Row % 0.114.73.511.270.56.54.781.7

Figures in parentheses are numbers of codes in set.

Figures in parentheses are numbers of codes in set. Table 4 shows age and sex standardised incidence rates for stroke by study year based on different sets of codes. The incidence of stroke using all codes remained approximately constant during the earlier part of this study period. However, the EU-standardised incidence of stroke increased from 1.13 per 1,000 in 2002 to 1.19 in 2003 and 1.36 in 2004 before declining to 1.15 in 2006. The incidence of stroke associated with stroke monitoring and stroke annual review codes showed a large increase over the period, with a steep increase between 2002 and 2004. The incidence of ‘acute stroke’ remained approximately constant initially but later declined from 1.00 per 1,000 in 2003 to 0.76 in 2006.
Table 4

Incidence of first stroke by study year and type of codes used for case selection.

ONS codes (202)Zero affirmatives (4)One affirmative (36)Two affirmatives (41)Three affirmatives (41)Four affirmatives (80)‘Stroke monitoring’ (2)‘Stroke annual review’ (1)‘Acute stroke’ (121)
1997 1.120.000.050.040.140.890.000.001.03
1998 1.160.000.060.050.150.910.010.001.05
1999 1.100.000.050.050.140.860.010.001.00
2000 1.050.000.060.040.140.810.010.000.95
2001 1.080.000.060.040.140.830.020.000.97
2002 1.130.000.090.050.150.850.040.001.00
2003 1.190.000.160.040.180.810.080.011.00
2004 1.360.000.410.040.190.730.190.160.91
2005 1.230.000.370.030.170.660.150.170.83
2006 1.150.000.350.030.140.630.160.140.76

Figures are rates per 1,000 years standardised by age and sex to the European standard population.

Figures are rates per 1,000 years standardised by age and sex to the European standard population. Table 5 shows the one year case fatality of groups of subjects with different index codes. Data for 394 cases in which the first stroke code was after the death date were excluded. The mortality of 47,845 subjects with stroke identified using the ONS codes was 25.5% within one year following the initial stroke event. The case fatality of subjects classified as having ‘acute stroke’ was 29.1%. The case fatality of subjects identified through ‘stroke annual review’ codes was 4.6% and for ‘stroke monitoring’ codes 5.7%. Codes that were not rated in the affirmative by any rater were infrequently utilised but were associated with unexpectedly high mortality. The reasons for this finding are unclear.
Table 5

One year mortality by type of codes used for stroke selection.

NumberDeaths within 12 monthsOne year case fatalitya (%) (95% confidence interval)
ONS codes 47,84511,87625.5 (25.1 to 25.9)
Zero affirmatives 562343.1 (31.1 to 57.5)
One affirmative 7,0734056.2 (5.6 to 6.8)
Two affirmatives 1,66636222.6 (20.6 to 24.7)
Three affirmatives 5,3691,21823.3 (22.1 to 24.4)
Four affirmatives 33,6819,86830.0 (29.5 to 30.5)
‘Stroke monitoring’ 3,1051595.7 (4.9 to 6.6)
‘Stroke annual review’ 944.6 (3.8 to 5.7)
‘Acute stroke’ 39,05011,08629.1 (28.6 to 29.6)

estimated from failure function allowing for censoring.

estimated from failure function allowing for censoring.

Discussion

Main findings

This paper has presented an analysis of the codes for prevalent stroke used by the Office for National Statistics. Our initial approach to the evaluation of strokes in GPRD was to identify subjects with a first-ever recorded code from the ONS set, with at least a period of 24 stroke record free months available before the index diagnosis. This study yielded plausible results consistent with the literature. However, the ONS code set was designed to identify cases with prevalent stroke. In order to identify a possible set of codes for acute stroke, we used a combination of methods. Although the raters disagreed concerning the significance of codes that were not clearly associated with cerebral haemorrhage or infarction, by restricting attention to those codes for which three or four raters agreed that acute stroke was probable, we were able to achieve consensus concerning the selection of codes. Within the set of codes for ‘acute stroke’ there was disagreement concerning the inclusion of subarachnoid haemorrhage but this is compatible with inconsistent definitions that are in clinical use. The Quality and Outcomes Framework (QOF) for the UK General Practice contract does not include subarachnoid haemorrhage [6] and this was reflected in the rating performed by the general practitioner. Read codes eligible for QOF include those for cerebral haemorrhage, cerebral infarction, cerebral embolism as well as stroke and ‘CVA’ (Cerebral Vascular Accident). All QOF eligible codes were included within the set of codes for ‘acute stroke’ in this research, supporting the validity of the latter. There have evidently been major selective increases in the use of specific codes subsequent to the introduction of the QOF. The use of codes for stroke annual review and stroke monitoring has greatly increased in frequency and peaked shortly after the inclusion in the QOF with a subsequent plateau. These codes were sometimes used as incident stroke codes, with no previous mention of stroke in at least the preceding 24 months. Cases identified in this way were associated with low mortality. If associated with acute stroke events they might represent mild cases possibly explaining the low mortality. Unfortunately, we cannot directly examine if the use of codes might be associated with the severity of the stroke event due to lack of information on stroke severity. However, subjects with stroke may be admitted directly to hospital and their first contact with the general practice may be for review at some later date. The stroke onset date may therefore be imprecisely recorded in primary care, perhaps often being recorded after the true stroke onset date. A potential bias in the recording of stroke dates might also explain the low mortality among patients with those codes as first stroke codes. Additional validation of stroke diagnosis dates through analysis of information on prescribing, utilisation of secondary care services or test results may be important when the precise diagnosis dates are required. The use of codes for monitoring and review may be compatible with the analysis of certain aspects of stroke research in the general practice record since a re-analysis of our earlier research excluding these codes and only considering the 121 codes identified in this paper did not yield different results [3]. Nevertheless, it will be advisable to be cautious in utilising these codes for studies of stroke prognosis and especially for studies of secular trends in stroke outcomes.

Comparison with other studies

Several studies in the General Practice Research Database (GPRD) have evaluated the epidemiology of stroke [2], [7]–[16]. Analytical studies in GPRD that employed stroke occurrence as an outcome have generally adopted a more restrictive approach to definition. An incident stroke is usually defined as one that is associated with the first-ever recorded stroke code. A minimum period of study time free of recorded stroke codes, usually between one and three years, is often used to precede the first stroke record. Some studies have also excluded subjects ever diagnosed with multiple sclerosis or cancer, as stroke may be diagnosed with less certainty in these subjects [7], [8]. Some studies have also used evidence of relevant health care utilisation including hospital admissions, diagnostic tests and drug prescribing as contributing to the confirmation of stroke diagnoses [7], [8]. These latter approaches are demanding to implement and results may be dependent on subjects gaining access to specialist care, as well as there being adequate recording of relevant measures within the primary care record as well as the specialist record. Studies of stroke epidemiology, requiring the analysis of large numbers of stroke records require approaches that can be more readily implemented.

Strengths and limitations

Our study has the strengths of a very large sample that was drawn from a large number of UK general practices. In common with all studies that use electronic patient records, the data are subject to variation in clinical practice, as well as clinical uncertainty in diagnosis. There will always be a proportion of cases in whom the diagnosis may be uncertain. We utilised four independent raters with differing specialist expertise to evaluate diagnostic codes. We also evaluated different sets of codes for their impact on stroke incidence and stroke case fatality. A limitation of this research is the lack of a reference method to establish stroke diagnoses. This would require evaluation of electronic records against clinical records, with access to specialist records for details of scan results in most cases. In spite of this limitation we believe that the methods adopted in this study may provide a useful approach to the selection of codes for analysis of electronic patient records in future studies.

Conclusions

This study raises several questions concerning the selection of medical codes to provide case definitions in research that utilises electronic patient records. Firstly, utilisation of different medical codes may change over time. Secondly, different medical codes for a single condition may have varying clinical and prognostic significance. Thirdly, researchers with differing clinical or epidemiological experience may have differing interpretations of the relevance of particular codes. Since these issues may sometimes have an impact on the results of a study, there is a need for greater transparency in the selection of sets of codes for different conditions, for the reporting of sensitivity analyses using different sets of codes, as well as sharing of code sets among researchers.
  13 in total

1.  Use of the General Practice Research Database (GPRD) for respiratory epidemiology: a comparison with the 4th Morbidity Survey in General Practice (MSGP4).

Authors:  A Hansell; J Hollowell; T Nichols; R McNiece; D Strachan
Journal:  Thorax       Date:  1999-05       Impact factor: 9.139

2.  Cyclooxygenase-2 selective nonsteroidal anti-inflammatory drugs and the risk of ischemic stroke: a nested case-control study.

Authors:  Frank Andersohn; René Schade; Samy Suissa; Edeltraut Garbe
Journal:  Stroke       Date:  2006-05-25       Impact factor: 7.914

3.  Hormone replacement therapy use and the risk of stroke.

Authors:  Christel Renoux; Sophie Dell'aniello; Edeltraut Garbe; Samy Suissa
Journal:  Maturitas       Date:  2008-11-08       Impact factor: 4.342

4.  The General Practice Research Database: quality of morbidity data.

Authors:  J Hollowell
Journal:  Popul Trends       Date:  1997

5.  Late-onset seizures as a predictor of subsequent stroke.

Authors:  Paul Cleary; Simon Shorvon; Raymond Tallis
Journal:  Lancet       Date:  2004-04-10       Impact factor: 79.321

6.  The General Practice Research Database. Scientific and Ethical Advisory Group.

Authors:  D H Lawson; V Sherman; J Hollowell
Journal:  QJM       Date:  1998-06

7.  Chronic atrial fibrillation: Incidence, prevalence, and prediction of stroke using the Congestive heart failure, Hypertension, Age >75, Diabetes mellitus, and prior Stroke or transient ischemic attack (CHADS2) risk stratification scheme.

Authors:  Stephan Rietbrock; Emma Heeley; Jonathan Plumb; Tjeerd van Staa
Journal:  Am Heart J       Date:  2008-07       Impact factor: 4.749

8.  Migraine and the risk of stroke, TIA, or death in the UK (CME).

Authors:  Claudia Becker; Gunnar P Brobert; Per M Almqvist; Saga Johansson; Susan S Jick; Christoph R Meier
Journal:  Headache       Date:  2007 Nov-Dec       Impact factor: 5.887

9.  Triptans in migraine: the risks of stroke, cardiovascular disease, and death in practice.

Authors:  Gillian C Hall; Martin M Brown; Jingping Mo; Kenneth D MacRae
Journal:  Neurology       Date:  2004-02-24       Impact factor: 9.910

10.  Exposure to antipsychotics and risk of stroke: self controlled case series study.

Authors:  Ian J Douglas; Liam Smeeth
Journal:  BMJ       Date:  2008-08-28
View more
  35 in total

Review 1.  Recent advances in the utility and use of the General Practice Research Database as an example of a UK Primary Care Data resource.

Authors:  Tim Williams; Tjeerd van Staa; Shivani Puri; Susan Eaton
Journal:  Ther Adv Drug Saf       Date:  2012-04

Review 2.  Case-finding for common mental disorders in primary care using routinely collected data: a systematic review.

Authors:  Harriet Larvin; Emily Peckham; Stephanie L Prady
Journal:  Soc Psychiatry Psychiatr Epidemiol       Date:  2019-07-12       Impact factor: 4.328

3.  Declining 1-year case-fatality of stroke and increasing coverage of vascular risk management: population-based cohort study.

Authors:  Martin C Gulliford; Judith Charlton; Anthony Rudd; Charles D Wolfe; André Michael Toschke
Journal:  J Neurol Neurosurg Psychiatry       Date:  2010-02-22       Impact factor: 13.654

4.  Signs of the 2009 influenza pandemic in the New York-Presbyterian Hospital electronic health records.

Authors:  Hossein Khiabanian; Antony B Holmes; Brendan J Kelly; Mrinalini Gururaj; George Hripcsak; Raul Rabadan
Journal:  PLoS One       Date:  2010-09-09       Impact factor: 3.240

5.  Inception and deprescribing of statins in people aged over 80 years: cohort study.

Authors:  Martin Gulliford; Rathi Ravindrarajah; Shota Hamada; Stephen Jackson; Judith Charlton
Journal:  Age Ageing       Date:  2017-11-01       Impact factor: 10.668

6.  Psoriasis and the Risk of Major Cardiovascular Events: Cohort Study Using the Clinical Practice Research Datalink.

Authors:  Rosa Parisi; Martin K Rutter; Mark Lunt; Helen S Young; Deborah P M Symmons; Christopher E M Griffiths; Darren M Ashcroft
Journal:  J Invest Dermatol       Date:  2015-03-05       Impact factor: 8.551

7.  Why do electronic health records reveal oral anticoagulant prescription after haemorrhagic stroke?

Authors:  Alice S Forster; Alex Dregan; Tjeerd P van Staa; Lisa McDermott; Gerard McCann; Charles D A Wolfe; Anthony Rudd; Martin C Gulliford
Journal:  Br J Clin Pharmacol       Date:  2015-06       Impact factor: 4.335

8.  Utility of electronic patient records in primary care for stroke secondary prevention trials.

Authors:  Alex Dregan; Michael A Toschke; Charles D Wolfe; Anthony Rudd; Mark Ashworth; Martin C Gulliford
Journal:  BMC Public Health       Date:  2011-02-07       Impact factor: 3.295

Review 9.  Concept libraries for automatic electronic health record based phenotyping: A review.

Authors:  Zahra A Almowil; Shang-Ming Zhou; Sinead Brophy
Journal:  Int J Popul Data Sci       Date:  2021-06-16

10.  Optimising use of electronic health records to describe the presentation of rheumatoid arthritis in primary care: a strategy for developing code lists.

Authors:  Amanda Nicholson; Elizabeth Ford; Kevin A Davies; Helen E Smith; Greta Rait; A Rosemary Tate; Irene Petersen; Jackie Cassell
Journal:  PLoS One       Date:  2013-02-22       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.