Literature DB >> 27940627

Automatic identification of type 2 diabetes, hypertension, ischaemic heart disease, heart failure and their levels of severity from Italian General Practitioners' electronic medical records: a validation study.

Rosa Gini1,2, Martijn J Schuemie3,4, Giampiero Mazzaglia5, Francesco Lapi5, Paolo Francesconi1, Alessandro Pasqua5, Elisa Bianchini5, Carmelo Montalbano6, Giuseppe Roberto1, Valentina Barletta1, Iacopo Cricelli6, Claudio Cricelli7, Giulia Dal Co8, Mariadonata Bellentani8, Miriam Sturkenboom2, Niek Klazinga9.   

Abstract

OBJECTIVES: The Italian project MATRICE aimed to assess how well cases of type 2 diabetes (T2DM), hypertension, ischaemic heart disease (IHD) and heart failure (HF) and their levels of severity can be automatically extracted from the Health Search/CSD Longitudinal Patient Database (HSD). From the medical records of the general practitioners (GP) who volunteered to participate, cases were extracted by algorithms based on diagnosis codes, keywords, drug prescriptions and results of diagnostic tests. A random sample of identified cases was validated by interviewing their GPs.
SETTING: HSD is a database of primary care medical records. A panel of 12 GPs participated in this validation study. PARTICIPANTS: 300 patients were sampled for each disease, except for HF, where 243 patients were assessed. OUTCOME MEASURES: The positive predictive value (PPV) was assessed for the presence/absence of each condition against the GP's response to the questionnaire, and Cohen's κ was calculated for agreement on the severity level.
RESULTS: The PPV was 100% (99% to 100%) for T2DM and hypertension, 98% (96% to 100%) for IHD and 55% (49% to 61%) for HF. Cohen's kappa for agreement on the severity level was 0.70 for T2DM and 0.69 for hypertension and IHD.
CONCLUSIONS: This study shows that individuals with T2DM, hypertension or IHD can be validly identified in HSD by automated identification algorithms. Automatic queries for levels of severity of the same diseases compare well with the corresponding clinical definitions, but some misclassification occurs. For HF, further research is needed to refine the current algorithm. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.

Entities:  

Keywords:  DIABETES & ENDOCRINOLOGY; PRIMARY CARE; Validation study; electronic medical records

Mesh:

Year:  2016        PMID: 27940627      PMCID: PMC5168667          DOI: 10.1136/bmjopen-2016-012413

Source DB:  PubMed          Journal:  BMJ Open        ISSN: 2044-6055            Impact factor:   2.692


This is the first validation study specifically performed in the Italian General Practitioners' (GP) medical records database Health Search. The positive predictive value of automatic case-finding algorithms for four chronic diseases and their levels of severity was estimated, using manual assessment of the GPs themselves as a gold standard. A total of 12 GPs contributed to the study and 300 cases were assessed for type 2 diabetes, hypertension and ischaemic heart disease, 243 for heart failure. This study is part of a national Italian project funded by the Ministry of Health and aimed to validate information systems available in the country to monitor quality of healthcare for chronic diseases. Participation of the GPs was voluntary, and sensitivity could not be assessed.

Introduction

Italy is facing an increasing burden of chronic health conditions due to ageing of the population. To provide adequate and fair healthcare across regions, Italy was advised by the Organisation for Economic Co-operation and Development to develop a set of standards around the processes and outcomes of primary care and to develop a national quality governance model to support regions in delivering care of uniform quality across the country.1 The Italian National Agency for Regional Healthcare Services started the MATRICE Project in 2011. MATRICE was aimed at developing tools to compare quality of healthcare across Italian regions of four chronic diseases: type 2 diabetes (T2DM), hypertension, ischaemic heart disease (IHD) and heart failure (HF). One of the objectives was to assess the validity of routine care data to monitor quality of healthcare supply.2 3 The Health Search IMS Health Longitudinal Patient Database (HSD) is a longitudinal primary care medical record database that was set up by members of the Italian College of General Practitioners (SIMG). More than 900 physicians, uniformly distributed across Italy, share their de-identified clinical records in the HSD. These data are extensively used for epidemiological and public health research.4 5 6 7 8 9 10 11 12 The HSD database is very similar to other Primary Care databases: for instance, in the UK, the Clinical Practice Research Datalink (CPRD, formerly GPRD),13 The Health Information Network (THIN)14 and QResearch;15 in Canada, the Canadian Primary Care Sentinel Surveillance Network (CPCSSN);16 in the Netherlands, the Integrated Primary Care Information database (IPCI);17 in Spain, the database Base de datos para la Investigación Farmacoepidemiológica en Atención Primaria (BIFAP).18 A common feature of these countries is that the GP serves a well-defined population and is the gatekeeper to secondary care. In this type of medical records, every visit is recorded and all diagnoses, prescriptions and measurements are recorded as part of a general practitioner's daily practice. Moreover, information from specialist referrals is reported back to the general practitioner (GP) and stored in the same medical records. The medical records replace the paper records that once existed and may be considered a rather comprehensive list of health problems requiring care. Owing to its longitudinal, population-based nature, this type of databases serves well for many research purposes, such as estimate of burden of disease, pharmacoepidemiologic and health services research. Since information is collected primarily for the provision of care, the quality of coding diagnoses may not always be accurate and also this varies by type of disease.19 20 Moreover, the same code may be used for different clinical definitions of a disease, when diagnostic standards are not uniform across healthcare communities or change over time. As a consequence, case-finding algorithms that retrieve participants from such data sources using diagnostic codes may unintentionally retrieve participants whose clinical condition does not correspond to the one intended for a study. On the other hand, GPs may record clinical conditions as a free text note, and a retrieval strategy using only coded diagnoses may miss some cases. As a result, algorithms using other sources, such as results from laboratory tests, have been developed to query this type of data sources.21 Disease-specific validation studies of case-finding algorithms in GP medical record databases have been performed in CPRD,22 23 CPCSSN,20 24 IPCI and HSD itself.25 26 A panel of experts in the MATRICE Project established precise clinical definitions of T2DM, hypertension, IHD and HF, and identified levels of severity of the conditions that had to be distinguished for the purpose of monitoring healthcare quality. The aim of this study was to estimate the positive predictive value (PPV) of case-finding algorithms to detect such conditions and levels of severity from the GP medical records collected by HSD, against a gold standard based on manual comparison with the clinical definitions chosen by the MATRICE panel.

Methods

Setting

Italy has a tax-based, universal coverage national health system. Every Italian resident is entitled to choose a GP, although parents might instead opt for a specialist paediatrician for their children, up to the age of 15. Therefore, each resident from the age of 16 onwards is specifically registered with a GP. GPs are the ‘gatekeepers’ of the system, meaning that patients can only access secondary care within the healthcare system on referral of their GP.27 1 Secondary care is accessed either free of charge or on a small copayment. During their daily practice, GPs record all clinical findings, diagnoses and prescriptions in their electronic medical records. GPs participating in HSD all use the same software, which requires that each prescription is associated with a specific disease code. A disease code may be labelled as ‘suspect’ when further clinical ascertainment is needed. Results from laboratory tests may be recorded as well. Moreover, free-text fields are available in the software to collect clinical notes on diagnoses, signs, symptoms and referral letters from specialists or from hospitals. Every 6 months, GPs send their data to a central repository, after anonymisation. The central repository performs quality controls, like estimation of prevalence of common diseases, and selects GPs whose data prove to be accurate.4 Currently, data of 700 of 900 GPs, uniformly distributed across Italy, are considered accurate according to data quality checking.12

Clinical definition of the diseases and of their levels of severity

A panel of cardiologists, diabetologists, epidemiologists and experts in organisation of primary care services participating in the MATRICE Project first established clinical definitions of the four diseases and of their levels of severity. The levels of severity were selected according to whether national and international clinical guidelines contained specific indications for treatment or diagnostic follow-up in the patients with that condition. For instance, a patient with IHD after an episode of acute myocardial infarction (AMI) has an indication for treatment with β-blockers;28 hence, history of AMI is a relevant level of severity for IHD. The detailed definitions of the diseases and of the levels of severity are depicted in table 1. For T2DM, the clinical definition was at least two abnormal measurements among fasting plasma glucose, or 2-hour plasma glucose after a load of glucose, or glycated haemoglobin; or just one abnormal measurement of plasma glucose if symptoms of hyperglycaemia were observed. Four levels of severity of the disease were identified, according to the presence/absence of indication for insulin and the presence/absence of complications or organ damage. For hypertension, diagnostic criteria were two abnormal measurements for systolic and diastolic blood pressure confirmed either by a Holter blood pressure measurement or by home blood pressure monitoring. Three levels of severity were identified: no organ damage or diabetes or stroke; organ damage or diabetes or stroke without HF; hypertension with HF. For IHD, the clinical definition referred to symptoms (angina pain) or to a history of AMI or to a bioimaging observation of coronary ischaemia. Five levels of severity of IHD were identified: the most severe was HF, and among those free from HF the presence/absence of history of AMI and the presence/absence of percutaneous transluminal coronary angioplasty (PTCA) classified the four levels. HF was also identified as a condition on its own, and the clinical definition was stage C or D of the classification by the American College of Cardiology and of the American Heart Association.29
Table 1

Clinical definition of diseases and levels of severity

Clinical definitionLevels of severity
Type 2 diabetes mellitus
Syndrome diagnosed on the basis of the following criteria outlined in a first test and confirmed with a second test in an adult, non-pregnant patient, without typical symptoms of the disease: Fasting plasma glucose ≥126 mg/dL (no caloric intake for at least 8 hours), or 2-hour plasma glucose ≥200 mg/dL during an oral glucose tolerance test (OGTT) after a load of 75 g glucose, or glycated haemoglobin ≥6.5%.Or, in a patient with classic symptoms of hyperglycaemia or hyperglycaemic crisis, a random plasma glucose of plasma glucose ≥200 mg/dL (regardless of food intake).Level 1: Clinical definition of the disease, no indication for insulin therapy and no complications listed in level 3
Level 2: Clinical definition of the disease, indication for insulin therapy and absence of complications listed in level 3.
Level 3: Clinical definition of the disease, no indication for insulin therapy and one of the following: (1) arterial stenosis (coronary, carotid, peripheral arteries of lower extremities), angina pectoris, MI, TIA, ischaemic stroke of atherosclerotic origin, intermittent claudication, diabetic foot ulcer, lower limb amputation, (2) retinopathy, (3) incipient diabetic nephropathy (microalbuminuria) or overt (albuminuria or GFR abnormal)/dialysis
Level 4: As in level 3, except that insulin is indicated
Hypertension
Syndrome characterised by arterial systolic blood pressure above 140 mm Hg and/or diastolic blood pressure above 90 mm Hg in at least two measurements (patient at rest) confirmed by Holter blood pressure measurements or by home blood pressure monitoring (two measurements in the morning and two in the evening for 7 days) and then calculating the average of all measurements after discarding those of the first day (as recommended by the ESH guidelines)Level 1: Clinical definition of the disease, absence of organ damage and of diabetes
Level 2: Clinical definition of the disease, no HF in level at least C of the ACC/AHA classification and at least one of the following conditions: type 2 diabetes; hypertrophy (ECG or Echo), dilation or left ventricular asynergy (Echo); hypertensive retinopathy; GFR abnormal; microalbuminuria or proteinuria; atherosclerotic plaques in carotid arteries; atherosclerotic peripheral arterial occlusive disease; angina pectoris; coronary revascularisation; AMI; TIA or ischaemic stroke due to atherosclerosis; hypertensive encephalopathy; abdominal aortic aneurysm; aortic dissection; cerebral haemorrhage
Level 3: Clinical definition of the disease and HF in stage C or D of the ACC/AHA classification.
Ischaemic heart disease
Clinical syndrome characterised by typical angina chest pain, and/or transient myocardial ischaemia verified by stress ECG or imaging, and/or significant coronary arteries occlusion verified with angiography, or history of previous AMI.Level 1: Clinical definition of the disease, no evidence of previous AMI nor PTCA, no evidence of HF in stage C or D of the ACC/AHA classification.
Level 2: Evidence of previous PTCA, no evidence of previous AMI, no evidence of HF in stage C or D of the ACC/AHA classification.
Level 3: Evidence of previous AMI, no evidence of previous PTCA, no evidence of HF in stage C or D of the ACC/AHA classification.
Level 4: Evidence of previous PTCA and AMI, no evidence of HF in stage C or D of the ACC/AHA classification.
Level 5: Clinical definition of the disease and evidence of HF in stage C or D of the ACC/AHA classification.
Heart failure
Stage C or D of the ACC/AHA classification: syndrome characterised by the presence of symptoms and signs, current or prior dyspnoea and/or fatigue and/or fluid retention (peripheral oedema and/or pulmonary stasis), and the presence of structural heart disease (left ventricular systolic dysfunction with EF <50% and/or left ventricular diastolic dysfunction and/or right ventricular dysfunction) detected by echocardiographyNone

ACC/AHA: American Cardiology Association and American Heart Association; AMI, acute myocardial infarction; EF, ejection fraction; HF, heart failure; GFR, glomerular filtration rate; PTCA, percutaneous transluminal coronary angioplasty.

Clinical definition of diseases and levels of severity ACC/AHA: American Cardiology Association and American Heart Association; AMI, acute myocardial infarction; EF, ejection fraction; HF, heart failure; GFR, glomerular filtration rate; PTCA, percutaneous transluminal coronary angioplasty.

Case identification in the primary care medical records

A panel comprising epidemiologists from HSD and GPs belonging to SIMG, with expertise in the clinical areas of interest, developed ad hoc algorithms to identify from the GP medical records cases matching the clinical definitions of the MATRICE Project. In each algorithm, the inclusion and exclusion criteria of the clinical definition were mapped to a list of ICD9CM codes or, when deemed necessary, to free text and to conditions on diagnostic tests.

Case validation

An invitation to participate in the validation study was circulated by SIMG to a sample of GPs, and participation was voluntary. A data collection plugin for the medical record software was developed by HSD and installed in the computers of the GPs who accepted to participate in the study. For each disease, the plugin applied the algorithm to the whole list of active patients on the date of data collection and selected a random sample of 25 patients from the resulting list of cases. The plugin then showed the names of the patients in the sample to the GP for assessment. On the same screen, the clinical definitions of the disease and of its levels of severity were presented. The GP was aware that the cases were selected for a specific condition among T2DM, hypertension, IHD and HF, but was blinded to the level of severity. The GP had the choice of indicating that the patient was not affected by the disease (false positive: FP) or that the patient was affected but the level of severity could not be assessed on the basis of the information available to the GP (not staged: NS), or to assign a level. In the process, the GP was free to access the patient's medical record. Figure 1 shows a screenshot of the data collection plugin (patients listed are not real).
Figure 1

Screenshot of the data collection plugin. The GP participating in the study was asked to fill in the four tables, one per disease. In the table, ‘S.P.’ meant ‘without the disease’ and ‘N.S.’ meant that the patient had the disease but the GP was not able to assess the level of severity. The clinical definition of the condition and of each level of severity could be browsed in the upper part of the screen, while, whenever the pointer stopped on a cell in the table, a tooltip suggested the label of the corresponding level. The patients listed in the figure are not real. GP, general practitioner.

Screenshot of the data collection plugin. The GP participating in the study was asked to fill in the four tables, one per disease. In the table, ‘S.P.’ meant ‘without the disease’ and ‘N.S.’ meant that the patient had the disease but the GP was not able to assess the level of severity. The clinical definition of the condition and of each level of severity could be browsed in the upper part of the screen, while, whenever the pointer stopped on a cell in the table, a tooltip suggested the label of the corresponding level. The patients listed in the figure are not real. GP, general practitioner. When the GP had completed the manual assessment of the four samples of patients, the plugin applied the algorithms for the level of severity, linked the new columns to the data set resulting from the questionnaire, anonymised the final data set and transmitted it to HSD for statistical analysis. Data collection was performed in July 2013.

Statistical analysis

The formula to compute the PPV, for presence/absence of the condition and for each of the levels of severity, was and 95% CIs were estimated. For the three diseases where levels of severity had been validated (T2DM, hypertension and IHD), Cohen's κ was computed to the categorical distribution of levels of severity. Cohen's kappa discounts from observed concordance (the percentage of participants who are classified in the same level by the algorithms and the GP), the expected concordance (the percentage of participants who would be classified the same if assignment had been performed randomly) by means of the following formula Cohen's κ provides an overall measure of agreement about levels of severity. Analysis was performed on the whole sample of patients, irrespectively on the GP. Analysis was performed using Stata V.12.

Results

Algorithms for primary care medical records

The algorithms detecting the diseases and levels of severity from the primary care medical records are listed in table 2. Each algorithm consists of a sequence of rules, each acting as an inclusion criterion, a refinement criterion (linked to the inclusion criterion with the logical connector AND) or a refinement criterion (linked to the inclusion criterion with the logical connectors AND NOT). Each rule is itself composed by subqueries (represented in the table by keywords in round parentheses), and subjects matching at least one of the subqueries are included in the rule, that is, the subqueries are linked to each other with the logical connector OR. No specific temporal sequence between subqueries is requested. Every subquery selects records matching a specific list of codes, free text keywords and/or diagnostic test levels, which are listed in supplementary material 1. All the subqueries are applied to the whole set of longitudinal observations of the patients up to the index date, except when specified otherwise.
Table 2

Algorithms to detect diseases and levels of severity

Algorithm for the diseaseAlgorithms for levels of severity
Type 2 diabetes mellitus
[(DM) OR (GFR) OR (DM TESTS)] AND NOT (DM1)Algorithm for the diseaseAND (DM UNCOMPLICATED)AND NOT [(INSULIN) OR (DM2 CHRONIC COMPLICATIONS)OR (DM2 ASYMPTOMATIC COMPLICATIONS) OR (CVD)]
Algorithm for the diseaseAND (DM UNCOMPLICATED)AND (INSULIN)AND NOT [(DM2 CHRONIC COMPLICATIONS) OR (DM2 ASYMPTOMATIC COMPLICATIONS) OR (CVD)]
Algorithm for the diseaseAND [(DM CHRONIC COMPLICATIONS) OR (DM ASYMPTOMATIC COMPLICATIONS) OR (CVD)]AND NOT (INSULIN)
Algorithm for the disease AND [(DM CHRONIC COMPLICATIONS) OR (DM ASYMPTOMATIC COMPLICATIONS) OR (CVD)]AND (INSULIN)
Hypertension
(HYPERTENSION) OR (BP) Algorithm for the diseaseAND NOT[LEVEL 2 OR LEVEL 3]
Algorithm for the diseaseAND [(DM2) OR (HYPERTENSION COMPLICATIONS) OR (HYPERTENSIVE RETINOPATHY) OR (CHRONIC KIDNEY DISEASE) OR (ATHEROSCLEROTIC ARTERIOPATHY) OR (CAROTID ATHEROSCLEROSIS) OR (HYPERTENSIVE ENCEPHALOPATHY) OR (AORTIC ANEURYSM)]AND NOT [(HF) OR (LVEF)]
Algorithm for the diseaseAND [(HF) OR (LVEF) OR (VENTRICULAR DYSFUNCTION)]
Ischaemic heart disease
(CHD)Algorithm for the diseaseAND (CHD NO AMI)AND NOT [(AMI) OR (PAST AMI) OR (CORONARY REVASCULARIZATION ICD9CM) OR (CORONARY REVASCULARIZATION FREE TEXT) OR (HF) OR (LVEF) OR (VENTRICULAR DYSFUNCTION)]
Algorithm for the diseaseAND [(CORONARY REVASCULARIZATION ICD9CM) OR (CORONARY REVASCULARIZATION FREE TEXT)]AND NOT [(AMI) OR (PAST AMI) OR (HF) OR (LVEF) OR (VENTRICULAR DYSFUNCTION)]
Algorithm for the diseaseAND [(AMI) OR (PAST AMI)]AND NOT [(CORONARY REVASCULARIZATION ICD9CM) OR (CORONARY REVASCULARIZATION FREE TEXT) OR (HF) OR (LVEF) OR (VENTRICULAR DYSFUNCTION)]
Algorithm for the diseaseAND [(AMI) OR (PAST AMI) OR (CORONARY REVASCULARIZATION ICD9CM) OR (CORONARY REVASCULARIZATION FREE TEXT)]AND NOT [(HF) OR (LVEF) OR (VENTRICULAR DYSFUNCTION)]
Algorithm for the diseaseAND [(HF) OR (LVEF)]
Heart failure
(HF) OR (LVEF) OR (VENTRICULAR DYSFUNCTION) None

The algorithms are described by means of subqueries, represented by keywords in upper case between parentheses, whose details are in online supplementary table 1. In particular, measurements are as follows: glomerular filtration rate (GFR) 140 mm Hg at least twice ever or diastolic blood pressure (DBP) >90 mm Hg at least twice ever; Left ventricular ejection fraction (LVEF) <50%; DM TESTS: plasma glucose >200 mg/dL after an oral load of 75 g glucose or fasting plasma glucose ≥126 mg/dL or glycated haemoglobin ≥6.5%. In all subqueries, records labelled with ‘suspect’ were excluded, except in the case of the subquery (acute myocardial infarction, AMI) used to detect levels of severity 3 and 4 of ischaemic heart disease (IHD).

Algorithms to detect diseases and levels of severity The algorithms are described by means of subqueries, represented by keywords in upper case between parentheses, whose details are in online supplementary table 1. In particular, measurements are as follows: glomerular filtration rate (GFR) min/1.73 m2; systolic blood pressure (SBP) >140 mm Hg at least twice ever or diastolic blood pressure (DBP) >90 mm Hg at least twice ever; Left ventricular ejection fraction (LVEF) <50%; DM TESTS: plasma glucose >200 mg/dL after an oral load of 75 g glucose or fasting plasma glucose ≥126 mg/dL or glycated haemoglobin ≥6.5%. In all subqueries, records labelled with ‘suspect’ were excluded, except in the case of the subquery (acute myocardial infarction, AMI) used to detect levels of severity 3 and 4 of ischaemic heart disease (IHD). In all subqueries, records labelled with ‘suspect’ were excluded, except in the case of the subquery detecting patients with AMI, used to detect levels of severity 3 and 4 in patients with IHD.

Validation

A panel of 12 GPs participated in the validation study. A total of 300 patients were identified and validated for each disease, except for HF, where due to low prevalence of the condition only 243 patients were included. The PPV of the algorithms were 100% (CI 99% to 100%) for T2DM and hypertension and 98% (CI 96% to 100%) for IHD. For HF, PPV was 55% (CI 49% to 61%). For T2DM, the second and fourth levels of severity had very high PPV (88% and 93%, respectively). Around 20% of patients without indication for insulin (first and third levels of severity) had their presence of complications misclassified. Both possibilities took place: patients with complications were identified as being free from them, and vice versa. Overall, Cohen's kappa was 0.70, indicating a good level of agreement (table 3).
Table 3

Results of the validation

LevelDescriptionAutomatically assignedTrue positivesTrue level of false positivesPPV (95% CI)Cohen's K
Type 2 diabetes mellitus (N=300, true positives overall: 300)
1Diabetes with no evidence of organ damage nor complications and no indication for insulin therapy129101Level 3: 280.78 (0.70 to 0.85)0.70
2Diabetes with indication for insulin but no evidence of organ damage nor complications1715Level 4: 20.88 (0.64 to 0.99)
3Diabetes with evidence of organ damage or complications and no indication for insulin127104Level 1: 230.82 (0.74 to 0.88)
4Diabetes with evidence of organ damage or complications and with indication for insulin2725Level 3: 20.93 (0.76 to 0.99)
Hypertension (N=300, true positives overall: 300)
1Hypertension with no organ damage nor diabetes138119Level 2: 18Level 3: 10.86 (0.79 to 0.91)0.69
2Hypertension with organ damage and/or stroke and/or diabetes, no HF155127Level 1: 280.82 (0.75 to 0.88)
3Hypertension and HF76Level 2: 10.86 (0.42 to 0.99)
Ischaemic heart disease (N=300, true positives overall: 298)
1IHD, no AMI, no HF, no PTCA5654No IHD: 20.96 (0.88 to 0.99)0.71
2IHD with PTCA, no AMI, no HF10Level 5: 10
3IHD with AMI, no PTCA, no HF195152Level 1: 430.78 (0.71 to 0.84)
4IHD with PTCA and AMI, no HF00
5IHD with HF4845Level 3: 30.94 (0.83 to 0.99)
Heart failure (N=243, true positives: 134)

AMI, acute myocardial infarction; IHD, ischaemic heart disease; HF, heart failure; PPV, positive predictive value; PTCA, percutaneous transluminal coronary angioplasty.

Results of the validation AMI, acute myocardial infarction; IHD, ischaemic heart disease; HF, heart failure; PPV, positive predictive value; PTCA, percutaneous transluminal coronary angioplasty. Among hypertensive patients, every level of severity was misclassified in <20% of patients, and in the case of the middle level (organ damage and/or diabetes, no HF) patients were in fact almost all less severe with respect to the level they were automatically assigned to. Cohen's κ was 0.69, showing good agreement (table 3). In the case of IHD, the first and fifth levels of severity had excellent PPV, while AMI was incorrectly identified by the algorithm in 22% of cases. The two levels of severity characterised by the presence of PTCA were never manually indicated by the GPs, and the automatic algorithm was in almost perfect agreement in both cases. Overall, Cohen's κ was 0.69, showing good agreement (table 3).

Discussion

This study shows that almost all of the automatically detected cases of T2DM, hypertension and IHD, but only the 55% of cases of HF were true cases as assessed by the GP, on the basis of their own records and personal knowledge. Automatic classification of levels of severity of T2DM, hypertension and IHD was acceptable although less accurate. In the case of IHD, mild cases were misclassified as severe, while in T2DM and hypertension both possibilities took place: severe patients were automatically classified as mild and vice versa. Our results provide guidance on the interpretation of results of studies using those algorithms to define variables in medical records or HSD, for instance to monitor quality of healthcare. The excellent PPV of some algorithms is not unexpected: algorithms to detect diabetes (irrespective of type) and hypertension, as well as other chronic conditions, had similarly high validity in the CPCSSN.20 24 The low PPV that we observed in the case of HF is unsurprising as well. Indeed, HF is a syndrome, and several different clinical definitions of HF have been used among clinicians in the recent past, such as the Framingham and European Society of Cardiology criteria. It has been shown that the changing definition has a noticeable impact on the epidemiology of the condition.30 Our definition comprises two of the four stages of the classification of the American College of Cardiology and of the American Heart Association, which has been itself revised repeatedly in recent years.29 It is most likely that the GPs in the sample themselves adopt a different definition to diagnose HF with respect to the one proposed by the MATRICE panel. Further research could investigate whether a clinical definition modelled on the Italian guidelines may be more easily identified in primary care medical records. Finally, poor performance of ICD9CM codes in identifying a specific clinical definition of HF has been consistently reported in the literature.31 The results we observed for levels of severity are probably due to reasons which are more specific to HSD. In the case of T2DM, some misclassification occurred between level 1 and level 3, that is, patients with or without complications but without indication for insulin use. Adding rules that explore free text comments to the diagnostic codes may improve those algorithms. In the algorithm developed for this study, patients with IHD labelled with ‘suspect’ AMI were included in the ‘AMI’ level of severity, because in a previous study specific algorithms to identify AMI had been observed to have low sensitivity in HSD.25 As a result, our sensitive algorithm did indeed capture all the cases of AMI in patients with IHD; however, 22% of patients had not had an AMI, so this strategy needs to be reconsidered for future research. Remarkably, GPs were not aware of a single case of PTCA in their patients, as confirmed by automatic querying of their records. The absence of PTCA could be due to the fact that this procedure is only performed in hospital. In Italy, hospitals do not provide GPs with discharge letters; rather patients themselves must describe to the GP what happened during an inpatient care episode, and the PTCA procedure may be communicated inadequately to the GP. Strategies to improve communication between hospital and primary care should be implemented in Italy for the purpose of improving quality of primary care medical records, as well as to improve healthcare for patients with severe cardiovascular conditions.

Limitations

The sample of GPs was self-selected and may have been composed of those with more accurate data recording attitudes. In particular, two GPs in the validation sample also participated in the panel that created the algorithms. The PPV of the algorithms may be lower in the general group of GPs contributing to HSD. Providing a direct estimate of sensitivity of the case-finding algorithms was not an aim of this study, because it would have been unfeasible for GPs to assess a large enough sample of their patients. Indeed, the number of patients needed to estimate the sensitivity of a test is bigger with less prevalent conditions: for instance, to estimate sensitivity of a case-finding algorithm for T2DM with a marginal error of 5%, even assuming 10% prevalence (an overestimation, according to common estimates in Italy32) and 95% sensitivity would have required an additional sample of more than 700 participants; to obtain a valid estimate of sensitivity for HF, which has much lower prevalence and a lower expected sensitivity, GPs would have needed to assess thousands of participants.33

Conclusions

This study shows that participants with T2DM, hypertension or IHD can be validly identified in HSD by automated identification algorithms. Automatic queries for levels of severity of the same diseases compare well with the corresponding clinical definitions, but some misclassification occurs. For HF, further research is needed to refine the current algorithm.
  28 in total

1.  The UK General Practice Research Database.

Authors:  T Walley; A Mantgani
Journal:  Lancet       Date:  1997-10-11       Impact factor: 79.321

Review 2.  Quality of morbidity coding in general practice computerized medical records: a systematic review.

Authors:  Kelvin Jordan; Mark Porcheret; Peter Croft
Journal:  Fam Pract       Date:  2004-08       Impact factor: 2.267

3.  Canadian Primary Care Sentinel Surveillance Network: a developing resource for family medicine and public health.

Authors:  Richard V Birtwhistle
Journal:  Can Fam Physician       Date:  2011-10       Impact factor: 3.275

4.  2012 ACCF/AHA/ACP/AATS/PCNA/SCAI/STS guideline for the diagnosis and management of patients with stable ischemic heart disease: executive summary: a report of the American College of Cardiology Foundation/American Heart Association task force on practice guidelines, and the American College of Physicians, American Association for Thoracic Surgery, Preventive Cardiovascular Nurses Association, Society for Cardiovascular Angiography and Interventions, and Society of Thoracic Surgeons.

Authors:  Stephan D Fihn; Julius M Gardin; Jonathan Abrams; Kathleen Berra; James C Blankenship; Apostolos P Dallas; Pamela S Douglas; Joanne M Foody; Thomas C Gerber; Alan L Hinderliter; Spencer B King; Paul D Kligfield; Harlan M Krumholz; Raymond Y K Kwong; Michael J Lim; Jane A Linderbaum; Michael J Mack; Mark A Munger; Richard L Prager; Joseph F Sabik; Leslee J Shaw; Joanna D Sikkema; Craig R Smith; Sidney C Smith; John A Spertus; Sankey V Williams
Journal:  Circulation       Date:  2012-11-19       Impact factor: 29.690

5.  Postmarketing surveillance based on electronic patient records: the IPCI project.

Authors:  A E Vlug; J van der Lei; B M Mosseveld; M A van Wijk; P D van der Linden; M C Sturkenboom; J H van Bemmel
Journal:  Methods Inf Med       Date:  1999-12       Impact factor: 2.176

6.  Validation studies of the health improvement network (THIN) database for pharmacoepidemiology research.

Authors:  James D Lewis; Rita Schinnar; Warren B Bilker; Xingmei Wang; Brian L Strom
Journal:  Pharmacoepidemiol Drug Saf       Date:  2007-04       Impact factor: 2.890

7.  Prevalence of comorbidities in patients with chronic obstructive pulmonary disease.

Authors:  Mario Cazzola; Germano Bettoncelli; Emiliano Sessa; Claudio Cricelli; Gianluca Biscione
Journal:  Respiration       Date:  2010-02-04       Impact factor: 3.580

8.  Chronic kidney disease and cardiovascular risk in hypertensive type 2 diabetics: a primary care perspective.

Authors:  Maura Ravera; Giuseppe Noberasco; Michela Re; Alessandro Filippi; Anna Maria Gallina; Ursula Weiss; Rossella Cannavò; Giambattista Ravera; Claudio Cricelli; Giacomo Deferrari
Journal:  Nephrol Dial Transplant       Date:  2008-12-10       Impact factor: 5.992

9.  Asthma and COPD in an Italian adult population: role of BMI considering the smoking habit.

Authors:  Mario Cazzola; Luigino Calzetta; Davide Lauro; Germano Bettoncelli; Claudio Cricelli; Nicola Di Daniele; Paola Rogliani
Journal:  Respir Med       Date:  2013-05-20       Impact factor: 3.415

10.  Adherence to antihypertensive medications and cardiovascular morbidity among newly diagnosed hypertensive patients.

Authors:  Giampiero Mazzaglia; Ettore Ambrosioni; Marianna Alacqua; Alessandro Filippi; Emiliano Sessa; Vincenzo Immordino; Claudio Borghi; Ovidio Brignoli; Achille P Caputi; Claudio Cricelli; Lorenzo G Mantovani
Journal:  Circulation       Date:  2009-10-05       Impact factor: 29.690

View more
  12 in total

Review 1.  The Role of European Healthcare Databases for Post-Marketing Drug Effectiveness, Safety and Value Evaluation: Where Does Italy Stand?

Authors:  Gianluca Trifirò; Rosa Gini; Francesco Barone-Adesi; Ettore Beghi; Anna Cantarutti; Annalisa Capuano; Carla Carnovale; Antonio Clavenna; Mirosa Dellagiovanna; Carmen Ferrajolo; Matteo Franchi; Ylenia Ingrasciotta; Ursula Kirchmayer; Francesco Lapi; Roberto Leone; Olivia Leoni; Ersilia Lucenteforte; Ugo Moretti; Alessandro Mugelli; Luigi Naldi; Elisabetta Poluzzi; Concita Rafaniello; Federico Rea; Janet Sultana; Mauro Tettamanti; Giuseppe Traversa; Alfredo Vannacci; Lorenzo Mantovani; Giovanni Corrao
Journal:  Drug Saf       Date:  2019-03       Impact factor: 5.606

2.  Epidemiology of acromegaly in Italy: analysis from a large longitudinal primary care database.

Authors:  Federico Gatto; Gianluca Trifirò; Francesco Lapi; Francesco Cocchiara; Claudia Campana; Carlotta Dell'Aquila; Carmen Ferrajolo; Marica Arvigo; Claudio Cricelli; Massimo Giusti; Diego Ferone
Journal:  Endocrine       Date:  2018-05-24       Impact factor: 3.633

3.  Measurement error and misclassification in electronic medical records: methods to mitigate bias.

Authors:  Jessica C Young; Mitchell M Conover; Michele Jonsson Funk
Journal:  Curr Epidemiol Rep       Date:  2018-09-10

4.  Health Data for Public Health: Towards New Ways of Combining Data Sources to Support Research Efforts in Europe.

Authors:  A Burgun; E Bernal-Delgado; W Kuchinke; T van Staa; J Cunningham; E Lettieri; C Mazzali; D Oksen; F Estupiñan; A Barone; G Chène
Journal:  Yearb Med Inform       Date:  2017-09-11

5.  Monitoring compliance with standards of care for chronic diseases using healthcare administrative databases in Italy: Strengths and limitations.

Authors:  Rosa Gini; Martijn J Schuemie; Alessandro Pasqua; Emanuele Carlini; Francesco Profili; Iacopo Cricelli; Patrizio Dazzi; Valentina Barletta; Paolo Francesconi; Francesco Lapi; Andrea Donatini; Giulia Dal Co; Modesta Visca; Mariadonata Bellentani; Miriam Sturkenboom; Niek Klazinga
Journal:  PLoS One       Date:  2017-12-12       Impact factor: 3.240

6.  Development and validation of the DIabetes Severity SCOre (DISSCO) in 139 626 individuals with type 2 diabetes: a retrospective cohort study.

Authors:  Salwa S Zghebi; Mamas A Mamas; Darren M Ashcroft; Chris Salisbury; Christian D Mallen; Carolyn A Chew-Graham; David Reeves; Harm Van Marwijk; Nadeem Qureshi; Stephen Weng; Tim Holt; Iain Buchan; Niels Peek; Sally Giles; Martin K Rutter; Evangelos Kontopantelis
Journal:  BMJ Open Diabetes Res Care       Date:  2020-05

7.  Patterns and trends of utilization of incretin-based medicines between 2008 and 2014 in three Italian geographic areas.

Authors:  Giuseppe Roberto; Francesco Barone-Adesi; Francesco Giorgianni; Valeria Pizzimenti; Carmen Ferrajolo; Michele Tari; Claudia Bartolini; Roberto Da Cas; Marina Maggini; Stefania Spila-Alegiani; Paolo Francesconi; Gianluca Trifirò; Elisabetta Poluzzi; Fabio Baccetti; Rosa Gini
Journal:  BMC Endocr Disord       Date:  2019-02-07       Impact factor: 2.763

8.  Assessing the severity of cardiovascular disease in 213 088 patients with coronary heart disease: a retrospective cohort study.

Authors:  Salwa S Zghebi; Mamas A Mamas; Darren M Ashcroft; Martin K Rutter; Harm VanMarwijk; Chris Salisbury; Christian D Mallen; Caroline A Chew-Graham; Nadeem Qureshi; Stephen F Weng; Tim Holt; Iain Buchan; Niels Peek; Sally Giles; David Reeves; Evangelos Kontopantelis
Journal:  Open Heart       Date:  2021-04

9.  Using electronic health records to quantify and stratify the severity of type 2 diabetes in primary care in England: rationale and cohort study design.

Authors:  Salwa S Zghebi; Martin K Rutter; Darren M Ashcroft; Chris Salisbury; Christian Mallen; Carolyn A Chew-Graham; David Reeves; Harm van Marwijk; Nadeem Qureshi; Stephen Weng; Niels Peek; Claire Planner; Magdalena Nowakowska; Mamas Mamas; Evangelos Kontopantelis
Journal:  BMJ Open       Date:  2018-06-30       Impact factor: 2.692

Review 10.  Validity of Acute Cardiovascular Outcome Diagnoses Recorded in European Electronic Health Records: A Systematic Review.

Authors:  Jennifer Davidson; Amitava Banerjee; Rutendo Muzambi; Liam Smeeth; Charlotte Warren-Gash
Journal:  Clin Epidemiol       Date:  2020-10-14       Impact factor: 4.790

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.