| Literature DB >> 20078607 |
Emily Herrett1, Sara L Thomas, W Marieke Schoonen, Liam Smeeth, Andrew J Hall.
Abstract
AIMS: To investigate the range of methods used to validate diagnoses in the General Practice Research Database (GPRD), to summarize findings and to assess the quality of these validations.Entities:
Mesh:
Year: 2010 PMID: 20078607 PMCID: PMC2805870 DOI: 10.1111/j.1365-2125.2009.03537.x
Source DB: PubMed Journal: Br J Clin Pharmacol ISSN: 0306-5251 Impact factor: 4.335
Methods used in validations of diagnoses in the General Practice Research Database (GPRD)
| Internal | 1 | Diagnostic algorithm | A diagnosis was validated by the presence of codes indicating specific symptoms/signs, prescriptions for disease-specific drugs and/or confirmatory test results | Andersohn [ |
| 2 | Manual review of anonymized free text on computerized records | The complete computer records (including the anonymized free text) for individuals with a diagnosis were assessed for confirmatory evidence of disease status | Yang [ | |
| 3 | Sensitivity analysis | In an analytical study, a comparison of the measure of effect using a broad set of disease/therapeutic codes with that of a restricted set more likely to represent true cases | Gupta [ | |
| External | 4 | Questionnaire to GP | A questionnaire investigating various aspects of the computerized diagnosis was sent to the GP | Garcia Rodriguez [ |
| 5 | Record request to GP | Request to the GP to provide anonymized copies of paper medical records, hospital discharge summaries or death certificates. Obtained copies were examined to validate the diagnosis, using further diagnostic criteria | Hall [ | |
| 6 | Comparison of rates | Measures of disease incidence, prevalence or patterns (e.g. time trends) from GPRD data were compared with a non-GPRD, UK-based data source | Hollowell [ | |
Figure 1Stream diagram of article search, retrieval and review process. GPRD, General Practice Research Database; BCDSP, Boston Collaborative Drug Surveillance Program; ISPE, International Society of Pharmacoepidemiology; PDS, Pharmacoepidemiology Drug Safety; HSQ, Health Statistics Quarterly
Methods of validation and median percentage of cases confirmed using each method
| Internal | ||
| Manual review of computerized records or diagnostic algorithm | 31 | 86.2 (33–100) |
| Sensitivity analysis | 23 | n/a |
| External | ||
| Comparison of rates | 143 | n/a |
| Additional information from GP | 160 | 88.6 (24–100) |
| Questionnaire to GP only | 43 | 91.7 (26–93) |
| Questionnaire and record request to GP | 39 | 90.0 (41–100) |
| Record request to GP only | 78 | 82.7 (24–100) |
| Total | 357 | 88.6 (24–100) |
It was a requirement for inclusion in our review for each study (with the exception of comparison of rates and sensitivity analyses) to provide a numerical estimate of the proportion of cases confirmed.
Proportion of cases confirmed, by disease group, and number of validations in each disease category, by validation method
| Infectious and parasitic | 0 | 1 | 1 | 0 | 0 | 15 | 17 | 93.00 | (n/a) |
| Neoplasms | 7 | 0 | 5 | 2 | 5 | 7 | 26 | 95.25 | (74–100) |
| Blood and blood-forming organs | 0 | 0 | 0 | 1 | 3 | 1 | 5 | 57.61 | (31–89) |
| Endocrine, nutritional and metabolic | 4 | 0 | 0 | 5 | 1 | 7 | 17 | 87.70 | (53–100) |
| Mental and behavioural disorders | 1 | 0 | 1 | 5 | 8 | 10 | 20 | 83.00 | (52–100) |
| Nervous system | 1 | 4 | 1 | 1 | 6 | 12 | 25 | 81.00 | (39–100) |
| Eye and adnexa | 0 | 0 | 0 | 2 | 3 | 3 | 8 | 89.47 | (75–97) |
| Ear and mastoid process | 0 | 0 | 0 | 0 | 0 | 2 | 2 | n/a | |
| Circulatory system | 10 | 3 | 7 | 10 | 20 | 12 | 55 | 85.30 | (48–100) |
| Respiratory system | 2 | 0 | 3 | 2 | 2 | 25 | 34 | 88.00 | (26–100) |
| Digestive system | 1 | 6 | 13 | 1 | 19 | 12 | 46 | 87.35 | (24–100) |
| Skin and subcutaneous tissue | 1 | 1 | 5 | 1 | 0 | 5 | 13 | 94.55 | (82–100) |
| Musculoskeletal system and connective tissue | 2 | 6 | 2 | 1 | 4 | 10 | 19 | 80.00 | (33–97) |
| Genitourinary system | 0 | 1 | 0 | 4 | 1 | 1 | 7 | 91.00 | (28–100) |
| Pregnancy, childbirth and the puerperium | 0 | 0 | 0 | 0 | 0 | 0 | 0 | n/a | |
| Perinatal period | 0 | 0 | 0 | 0 | 0 | 0 | 0 | n/a | |
| Congenital | 0 | 0 | 3 | 0 | 0 | 2 | 2 | 93.50 | (71–100) |
| Injury and poisoning | 1 | 1 | 2 | 2 | 3 | 3 | 12 | 89.52 | (73–100) |
| External causes of morbidity and mortality | 0 | 0 | 0 | 1 | 0 | 8 | 9 | 100.00 | (n/a) |
| Other | 1 | 0 | 0 | 1 | 3 | 8 | 10 | 90.00 | (45–100) |
Includes multiple disease groups, ill-defined conditions, miscellaneous, stillbirth and mortality.
Based on GP record request, algorithm and manual review.
Investigated the validity of a composite outcome combining more than one diagnosis.
Quality of the validation studies*
| Internal | |||||||||||||||||
| Manual review of computerized records or diagnostic algorithm | 31 | 1268 | (50–78 172) | 27 | (87) | 314 | (57–61 097) | 30 | (97) | 89.6 | (0.12–100) | 27 | (87) | n/a | n/a | ||
| Sensitivity analysis | 23 | 4732 | (21–36 702) | 19 | (83) | n/a | n/a | n/a | n/a | ||||||||
| External | |||||||||||||||||
| Comparison of rates | 143 | 1984 | (5–200 000) | 70 | (49) | n/a | n/a | n/a | n/a | ||||||||
| Additional information from GP | 160 | 226 | (1–51 688) | 139 | (87) | 100 | (1–3 010) | 153 | (96) | 68.2 | (0.15–100) | 138 | (86) | 90.4 | (55–100) | 134 | (83) |
| Questionnaire to GP only | 43 | 1562 | (10–51 688) | 34 | (79) | 159 | (10–2 040) | 40 | (93) | 48.6 | (0.39–100) | 34 | (79) | 92.5 | (55–100) | 31 | (72) |
| Questionnaire and record request to GP | 39 | 51.5 | (10–24 131) | 36 | (92) | 40 | (10–795) | 38 | (97) | 78.9 | (0.41–100) | 36 | (92) | 90.4 | (68–100) | 35 | (88) |
| Record request to GP only | 78 | 199 | (1–22 195) | 69 | (88) | 88 | (1–3 010) | 75 | (96) | 72.3 | (0.15–100) | 68 | (87) | 90.0 | (56–100) | 68 | (88) |
For example, in an external validation of glaucoma [29], 24 131 glaucoma patients were identified based on the study inclusion criteria (column A). Of these patients, 100 were chosen to undergo validation by questionnaire and record request (column B). This means that the proportion of cases chosen to undergo validation was just 0.41% of the total (column C). Of these 100 cases, the GP responded in 95, giving a response rate of 95% (column D).
Figure 2Measures of validity of categorical data. Sensitivity: A/(A+C); specificity: D/(B+D); positive predictive value: A/(A+B); negative predictive value: D/(C+D)
Figure 3Stream diagram showing the information from General Practice Research Database (GPRD) validation studies that could be made available to researchers