| Literature DB >> 25187016 |
Amy Matcho1, Patrick Ryan, Daniel Fife, Christian Reich.
Abstract
BACKGROUND: The unique structure and coding of the Clinical Practice Research Datalink (CPRD) presents challenges for epidemiologic analysis and for comparisons with other databases. To address this limitation we sought to transform CPRD into the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM).Entities:
Mesh:
Year: 2014 PMID: 25187016 PMCID: PMC4206771 DOI: 10.1007/s40264-014-0214-3
Source DB: PubMed Journal: Drug Saf ISSN: 0114-5916 Impact factor: 5.606
Populated OMOP CDM tables, descriptions and CPRD source data tables in the CPRD CDM conversion
| CDM table name | Description | CPRD raw data tables |
|---|---|---|
| Person | Demographic information about a person | Patient |
| Drug exposure | Association between a person and a drug at a specific time | Therapy, immunisation, clinical, referral, test |
| Drug era | Association between a person and a drug over a specific time period | Therapy, immunisation, clinical, referral, test |
| Condition occurrence | A diagnosis or condition that has been recorded about a person at a certain time | Clinical, referral, test |
| Condition era | A diagnosis or condition over a period of time | Clinical, referral, test |
| Observation period | Time intervals during which healthcare information, such as drugs, conditions, and other clinical observations, may be available | Patient, practice |
| Observation | Observations are clinical facts, such as laboratory tests, signs/symptoms, which are not captured within other CDM tables | Clinical, referral, test, additional |
| Procedure occurrence | Procedures carried out on the person | Clinical, referral, test |
| Visit occurrence | Visits for healthcare services of the person | Consultation |
| Death | Time and cause of death of the person | Patient |
| Provider | Information about healthcare providers | Staff |
| Care site | Information about the site of care | Practice |
CDM common data model, CPRD clinical practice research datalink, OMOP observational medical outcomes partnership
Fig. 1Proportion of mapped terms and database records for the CPRD OMOP CDM domains. CDM common data model, CPRD clinical practice research datalink, OMOP observational medical outcomes partnership
Top 10 conditions and procedures with mapped concept description and prevalence in the CPRD OMOP CDM
| Domain | Source code/description | SNOMED-CT | Prevalence (patients with source code/all patients in database) |
|---|---|---|---|
| Condition | H05z.00 Upper respiratory infection NOS | Acute upper respiratory infection | 12.3 |
| Condition | H05z.11 Upper respiratory tract infection NOS | Acute upper respiratory infection | 9.6 |
| Condition | H03..00 Acute tonsillitis | Acute tonsillitis | 8.2 |
| Condition | H06z011 Chest infection | Lower respiratory tract infection | 8.2 |
| Condition | H06z000 Chest infection NOS | Lower respiratory tract infection | 6.9 |
| Condition | N131.00 Cervicalgia: pain in neck | Nonspecific pain in the neck region | 6.7 |
| Condition | N142.11 Low back pain | Low back pain | 6.6 |
| Condition | N245.17 Shoulder pain | Shoulder pain | 6.2 |
| Condition | H33..00 Asthma | Hyperreactive airway disease | 5.7 |
| Condition | G20..00 Essential hypertension | Essential hypertension | 5.5 |
| Procedure | 535..00 Standard chest X-ray | Plain chest X-ray | 8.4 |
| Procedure | 3395.00 Peak expiratory flow rate: PEFR/PFR | PEFR: peak expiratory flow rate | 6.7 |
| Procedure | 7305011 Syringe ear to remove wax | Syringing ear to remove wax | 5.9 |
| Procedure | 81H..00 Dressing of wound | Dressing of wound | 5.6 |
| Procedure | 6637.00 Inhaler technique observed | Inhaler technique observed | 4.8 |
| Procedure | 662..12 Hypertension monitoring | Hypertension monitoring | 4.5 |
| Procedure | 8C1B.00 Nursing care blood sample taken | Blood sample taken | 4.3 |
| Procedure | 7L17200 Blood withdrawal for testing | Taking blood sample | 4 |
| Procedure | 7L17.00 Blood withdrawal | Phlebotomy | 3.7 |
| Procedure | 663..11 Asthma monitoring | Asthma monitoring | 3.3 |
CDM common data model, CPRD clinical practice research datalink, OMOP observational medical outcomes partnership
Top 10 observations and drug exposures with mapped concept description and prevalence in the CPRD OMOP CDM
| Domain | Source code/description | SNOMED/LOINC concept description | Prevalence (patients with source code/all patients in database) |
|---|---|---|---|
| Observation | Lifestyle smoking status smoke/drink status | PhenX measure: tobacco: smoking status | 64.5 |
| Observation | Examination findings blood pressure systolic | Diastolic blood pressure | 60.3 |
| Observation | Examination findings blood pressure diastolic | Systolic blood pressure | 60.3 |
| Observation | Examination findings weight in kilograms | Body weight measured | 59.6 |
| Observation | 22A..00 O/E: weight | O/E: weight | 59.5 |
| Observation | 246..00 O/E: blood pressure reading | O/E: blood pressure reading | 58.4 |
| Observation | Examination findings weight BMI | Body mass index (BMI) [ratio] | 50.7 |
| Observation | 9N31.00 Telephone encounter | Telephone encounter | 38.5 |
| Observation | Serum creatinine | Creatinine [mass/volume] in serum or plasma | 35.9 |
| Observation | 44J3.00 Serum creatinine | Creatinine measurement, serum | 35.2 |
| Drug exposure | FLU data not entered 65E.00 influenza vaccination | Influenza virus vaccine | 19.4 |
| Drug exposure | 58932020 Paracetamol 500-mg tablets | Acetaminophen 500-mg oral tablet | 11.3 |
| Drug exposure | 69782020 Omeprazole 20-mg gastro-resistant capsules | Omeprazole 20-mg enteric-coated capsule | 10.4 |
| Drug exposure | 52994020 Aspirin 75-mg dispersible tablets | Aspirin 75-mg disintegrating tablet | 7.1 |
| Drug exposure | 58976020 Bendroflumethiazide 2.5-mg tablets | Bendroflumethiazide 2.5-mg oral tablet | 6.3 |
| Drug exposure | 72489020 Simvastatin 40-mg tablets | Simvastatin 40-mg oral tablet | 5.2 |
| Drug exposure | 59420020 Furosemide 40-mg tablets | Furosemide 40-mg oral tablet | 3.9 |
| Drug exposure | 72488020 Simvastatin 20-mg tablet | Simvastatin 20-mg oral tablet | 3.7 |
| Drug exposure | 60153020 Atenolol 50-mg tablet | Atenolol 50-mg oral tablet | 3.2 |
| Drug Exposure | 55991020 Levothyroxine sodium 100-μg tablet | Levothyroxine sodium 1-mg oral tablet | 1.9 |
CDM common data model, CPRD clinical practice research datalink, OMOP observational medical outcomes partnership
Characteristics of cases and controls for replication study
| Cases | Controls | ||||||
|---|---|---|---|---|---|---|---|
| Original study ( | Raw data study ( | CDM study ( | Original study ( | Raw data study ( | CDM study ( | Sensitivity study ( | |
| Age (years) | |||||||
| <40 | 91 (2.8) | 67 (1.9) | 67 (1.9) | 367 (2.8) | 259 (2.0) | 260 (2.0) | 252 (2.0) |
| 40–49 | 417 (12.6) | 373 (10.8) | 373 (10.8) | 1,656 (12.6) | 1,441 (10.9) | 1,443 (11.0) | 1,423 (11.0) |
| 50–59 | 830 (25.0) | 876 (25.3) | 875 (25.3) | 3,314 (25.2) | 3,330 (25.3) | 3,326 (25.3) | 3,276 (25.4) |
| 60–69 | 1,227 (37.0) | 1,263 (36.5) | 1,261 (36.5) | 4,832 (36.8) | 4,836 (36.7) | 4,830 (36.7) | 4,732 (36.7) |
| 70–75 | 750 (22.6) | 879 (25.4) | 878 (25.4) | 2,970 (22.6) | 3,304 (25.1) | 3,300 (25.1) | 3,208 (24.9) |
| Male | 2,452 (74.0) | 2,546 (73.6) | 2,543 (73.6) | 9,715 (73.9) | 9,692 (73.6) | 9,685 (73.6) | 9,463 (73.4) |
| Female | 863 (26.0) | 912 (26.4) | 911 (26.4) | 3,424 (26.1) | 3,478 (26.4) | 3,474 (26.4) | 3,428 (26.6) |
| Smoking status | |||||||
| Non | 1,079 (32.6) | 924 (26.7) | 921 (26.7) | 6,204 (47.2) | 4,475 (34.0) | 4,510 (34.3) | 5,157 (40.0) |
| Current | 1,100 (33.2) | 963 (27.8) | 964 (27.9) | 2,574 (19.6) | 2,074 (15.7) | 2,112 (16.0) | 2,286 (17.7) |
| Ex | 376 (11.3) | 291 (8.4) | 290 (8.4) | 1,353 (10.3) | 904 (6.9) | 888 (6.7) | 1,069 (8.3) |
| Unknown | 760 (22.9) | 1,280 (37.0) | 1,279 (37.0) | 3,008 (22.9) | 5,717 (43.4) | 5,649 (42.9) | 4,379 (34.0) |
| Body mass index (kg/m2) | |||||||
| <25 | 885 (26.7) | 740 (21.4) | 738 (21.4) | 4,240 (32.3) | 2,753 (20.9) | 2,861 (21.7) | 3,216 (24.9) |
| 25–29 | 1,100 (33.2) | 798 (23.1) | 798 (23.1) | 4,004 (30.5) | 2,874 (21.8) | 2,892 (22.0) | 3,275 (25.4) |
| ≥30 | 387 (11.7) | 318 (9.2) | 318 (9.2) | 1,208 (9.2) | 905 (6.9) | 880 (6.7) | 1,021 (7.9) |
| Unknown | 943 (28.4) | 1,602 (46.3) | 1,600 (46.3) | 3,687 (28.0) | 6,638 (50.4) | 6,526 (49.6) | 5,379 (41.7) |
Risk of developing first-time acute myocardial infarction adjusted for body mass index, smoking status, aspirin use and hormone replacement therapy
| Original study adjusted odds ratio | Raw data study adjusted odds ratio | CDM study adjusted odds ratio | Sensitivity study adjusted odds ratio | |
|---|---|---|---|---|
| Smoking status | ||||
| Current | 2.7 (2.4–2.9) | 2.3 (2.0–2.6) | 2.4 (2.1–2.7) | 2.4 (2.1–2.7) |
| Ex | 1.6 (1.4–1.9) | 1.6 (1.3–1.9) | 1.5 (1.3–1.9) | 1.5 (1.3–1.9) |
| Unknown | 1.5 (1.3–1.7) | 1.2 (1.0–1.3) | 1.2 (1.0–1.4) | 1.6 (1.4–1.8) |
| Body mass index (kg/m2) | ||||
| 25–29.9 | 1.4 (1.3–1.6) | 1.1 (1.0–1.3) | 1.1 (1.0–1.3) | 1.2 (1.0–1.3) |
| ≥30 | 1.7 (1.4–1.9) | 1.5 (1.2–1.9) | 1.6 (1.2–2.0) | 1.6 (1.3–2.0) |
| Unknown | 1.2 (1.1–1.4) | 1.1 (1.0–1.3) | 1.2 (1.0–1.4) | 1.3 (1.1–1.4) |
Current, recent past and past exposure to NSAIDs, stratified by duration of NSAID therapy (number of prescriptions) and ingredient exposure in current users
| NSAID category/number of prescriptions | Cases | Controls | |||||
|---|---|---|---|---|---|---|---|
| Original study ( | Raw data study ( | CDM study ( | Original study ( | Raw data study ( | CDM study (N = 13,159) | Sensitivity study ( | |
| Non-users | 1,502 (45.3) | 1,966 (56.9) | 1,981 (57.4) | 6,236 (47.5) | 8,260 (62.7) | 8,269 (62.8) | 7,099 (55.1) |
| Current | 242 (7.3) | 206 (6.0) | 187 (5.4) | 825 (6.3) | 560 (4.3) | 584 (4.4) | 683 (5.3) |
| Current 1–4 | 34 (1) | 42 (1.2) | 41 (1.2) | 111 (0.8) | 121 (0.9) | 129 (1.0) | 175 (1.4) |
| Current 5–9 | 45 (1.4) | 30 (0.9) | 27 (0.8) | 157 (1.2) | 83 (0.6) | 85 (0.6) | 105 (0.8) |
| Current 10–19 | 36 (1.1) | 34 (1.0) | 31 (0.9) | 145 (1.1) | 120 (0.9) | 98 (0.7) | 140 (1.1) |
| Current 20–29 | 38 (1.1) | 36 (1.0) | 29 (0.8) | 119 (0.9) | 88 (0.7) | 86 (0.7) | 97 (0.8) |
| Current 30+ | 89 (2.7) | 64 (1.9) | 59 (1.7) | 293 (2.2) | 148 (1.1) | 186 (1.4) | 166 (1.3) |
| Recent past | 118 (3.6) | 116 (3.4) | 105 (3.0) | 377 (2.9) | 330 (2.5) | 258 (2.0) | 389 (3.0) |
| Recent past 1–4 | 25 (0.8) | 37 (1.1) | 36 (1.0) | 105 (0.8) | 126 (1.0) | 134 (1.0) | 173 (1.3) |
| Recent past 5–9 | 21 (0.6) | 16 (0.5) | 14 (0.4) | 95 (0.7) | 72 (0.5) | 48 (0.4) | 74 (0.6) |
| Recent past 10–19 | 23 (0.7) | 24 (0.7) | 23 (0.7) | 77 (0.6) | 65 (0.5) | 42 (0.3) | 57 (0.4) |
| Recent past 20–29 | 14 (0.4) | 16 (0.5) | 15 (0.4) | 44 (0.3) | 24 (0.2) | 16 (0.1) | 44 (0.3) |
| Recent past 30+ | 35 (1.1) | 23 (0.7) | 17 (0.5) | 56 (0.4) | 43 (0.3) | 18 (0.1) | 41 (0.3) |
| Past | 1,453 (43.8) | 1,170 (33.8) | 1,181 (34.2) | 5,701 (43.4) | 4,020 (30.5) | 4,048 (30.8) | 4,720 (36.6) |
| Past 1–4 | 984 (29.7) | 915 (26.5) | 921 (26.7) | 4,002 (30.5) | 3,242 (24.6) | 3,275 (24.9) | 3,753 (29.1) |
| Past 5–9 | 311 (9.4) | 136 (3.9) | 132 (3.8) | 1,190 (9.1) | 482 (3.7) | 449 (3.4) | 607 (4.7) |
| Past 10–19 | 91 (2.7) | 77 (2.2) | 76 (2.2) | 352 (2.7) | 193 (1.5) | 219 (1.7) | 243 (1.9) |
| Past 20–29 | 26 (0.8) | 25 (0.7) | 24 (0.7) | 82 (0.6) | 61 (0.5) | 55 (0.4) | 60 (0.5) |
| Past 30+ | 41 (1.2) | 17 (0.5) | 28 (0.8) | 75 (0.6) | 42 (0.3) | 50 (0.4) | 57 (0.4) |
| Ibuprofen | 60 (24.8) | 68 (33.0) | 64 (34.2) | 204 (24.7) | 144 (25.7) | 173 (29.6) | 189 (27.7) |
| Diclofenac | 97 (40.1) | 68 (33.0) | 62 (33.1) | 277 (33.6) | 173 (30.9) | 179 (30.7) | 214 (31.3) |
| Piroxicam | 10 (4.1) | 9 (4.4) | 9 (4.8) | 28 (3.4) | 34 (6.1) | 41 (7.0) | 33 (4.8) |
| Ketoprofen | 15 (6.2) | 5 (2.4) | 3 (1.6) | 48 (5.8) | 32 (5.7) | 26 (4.5) | 35 (5.1) |
| Indomethacin | 15 (6.2) | 21 (10.0) | 20 (10.7) | 56 (6.8) | 44 (7.9) | 58 (9.9) | 55 (8.1) |
| Naproxen | 19 (7.9) | 14 (6.8) | 13 (7.0) | 105 (12.7) | 69 (12.3) | 58 (9.9) | 78 (11.4) |
Risk of first-time acute myocardial infarction associated with current, recent past and past NSAID therapy, duration and ingredient in current users
| NSAID category/number prescriptions | Original study adjusted odds ratio | Raw data study adjusted odds ratio | CDM study adjusted odds ratio | Sensitivity study adjusted odds ratio |
|---|---|---|---|---|
| Current | 1.17 (0.99–1.37) | 1.51 (1.24–1.83) | 1.29 (1.06–1.57) | 1.07 (0.89– 1.29) |
| Current 1–4 | 1.30 (0.87–1.93) | 1.32 (0.87–2.01) | 1.15 (0.77– 1.74) | 0.80 (0.54– 1.18) |
| Current 5–9 | 1.10 (0.78–1.54) | 1.44 (0.87–2.40) | 1.32 (0.79– 2.21) | 1.03 (0.64– 1.67) |
| Current 10–19 | 0.97 (0.66–1.42) | 1.40 (0.89–2.19) | 1.31 (0.83– 2.07) | 1.05 (0.69– 1.62) |
| Current 20–29 | 1.31 (0.89–1.91) | 1.58 (0.99–2.54) | 1.45 (0.88–2.40) | 1.17 (0.75–1.82) |
| Current 30+ | 1.21 (0.94–1.55) | 1.71 (1.20–2.45) | 1.30 (0.91– 1.85) | 1.28 (0.90– 1.83) |
| Recent past | 1.26 (1.01–1.57) | 1.41 (1.09–1.82) | 1.63 (1.24– 2.16) | 1.20 (0.94– 1.55) |
| Recent past 1–4 | 0.95 (0.61–1.48) | 1.24 (0.80–1.93) | 1.21 (0.79– 1.87) | 0.77 (0.51– 1.16) |
| Recent past 5–9 | 0.90 (0.55–1.46) | 0.95 (0.51–1.78) | 1.26 (0.60– 2.68) | 1.10 (0.56– 2.18) |
| Recent past 10–19 | 1.13 (0.70–1.83) | 1.36 (0.77–2.41) | 1.66 (0.91–3.03) | 1.88 (1.05–3.37) |
| Recent past 20–29 | 1.33 (0.72–2.46) | 2.35 (1.11–5.01) | 3.79 (1.61–8.90) | 1.65 (0.82, 3.32) |
| Recent past 30+ | 2.71 (1.75–4.22) | 2.09 (1.12–3.91) | 3.25 (1.43–7.36) | 2.00 (1.07–3.73) |
| Past | 1.04 (0.96–1.13) | 1.21 (1.10–1.32) | 1.17 (1.07–1.28) | 0.89 (0.82–0.97) |
| Past 1–4 | 1.02 (0.93–1.12) | 1.18 (1.07–1.30) | 1.14 (1.04–1.26) | 0.89 (0.81–0.98) |
| Past 5–9 | 1.06 (0.92–1.22) | 1.07 (0.85–1.35) | 1.07 (0.84–1.35) | 0.75 (0.59–0.94) |
| Past 10–19 | 1.00 (0.78–1.28) | 1.82 (1.31–2.53) | 1.26 (0.92–1.74) | 1.15 (0.84–1.58) |
| Past 20–29 | 1.26 (0.80–2.01) | 1.13 (0.62–2.06) | 1.73 (0.94– 3.19) | 1.42 (0.77–2.61) |
| Past 30+ | 2.33 (1.57–3.46) | 1.57 (0.81–3.02) | 2.44 (1.34–4.45) | 1.07 (0.55–2.07) |
| Ibuprofen | 1.17 (0.87–1.58) | 1.73 (1.23–2.44) | 1.47 (1.04–2.08) | 1.16 (0.84–1.61) |
| Diclofenac | 1.38 (1.08–1.77) | 1.65 (1.17–2.33) | 1.53 (1.08–2.16) | 1.11 (0.80–1.53) |
| Piroxicam | 1.65 (0.78–3.49) | 0.93 (0.38– 2.24) | 0.76 (0.33–1.72) | 0.73 (0.29–1.80) |
| Ketoprofen | 1.39 (0.77–2.51) | 0.64 (0.20– 1.99) | 0.59 (0.16–2.19) | 0.55 (0.16–1.95) |
| Indomethacin | 1.03 (0.58–1.85) | 1.96 (1.04– 3.72) | 1.29 (0.70–2.37) | 1.68 (0.91–3.09) |
| Naproxen | 0.68 (0.42–1.13) | 0.81 (0.41– 1.61) | 0.69 (0.35–1.37) | 0.59 (0.31–1.14) |
| A transformation of the Clinical Practice Research Datalink (CPRD) into the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) was performed |
| Quality assessments indicated that source code mappings, Read code domain classification, imputation algorithm for drug duration and special handling of lifestyle/clinical data were accurate and acceptable data loss occurred across CDM domains |
| A case–control replication study was performed on the CPRD raw data and the CPRD CDM and results between the raw data and CDM study agreed for conditions, demographics and lifestyle data. There was slight nonsteroidal anti-inflammatory drug exposure data loss caused by unmapped drugs |