| Literature DB >> 27631769 |
Sophie V Eastwood1, Rohini Mathur2, Mark Atkinson3, Sinead Brophy3, Cathie Sudlow4,5, Robin Flaig4,5, Simon de Lusignan6, Naomi Allen7,5, Nishi Chaturvedi1.
Abstract
OBJECTIVES: UK Biobank is a UK-wide cohort of 502,655 people aged 40-69, recruited from National Health Service registrants between 2006-10, with healthcare data linkage. Type 2 diabetes is a key exposure and outcome. We developed algorithms to define prevalent and incident diabetes for UK Biobank. The algorithms will be implemented by UK Biobank and their results made available to researchers on request.Entities:
Mesh:
Year: 2016 PMID: 27631769 PMCID: PMC5025160 DOI: 10.1371/journal.pone.0162388
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Data sources used in the development of UK Biobank diabetes prevalence and incidence algorithms.
Solid arrows indicate established linkages, dotted arrows indicate anticipated linkages. 1data used to derive prevalence algorithms, 2data used to test prevalence algorithms, 3data used to derive incidence algorithms, 4data used to test incidence algorithms. HES = Hospital episode statistics, SMR01 = Scottish morbidity record, SAIL = Secure anonymised information linkage databank, PEDW = patient episode database for Wales, CPRD = clinical practice research datalink.
Baseline characteristics of UK Biobank (all), Welsh UK Biobank participants (with UK Biobank, primary and secondary care data linkage), and CPRD (primary and secondary care data linkage).
| UKB cohort | Linked Welsh UKB sub-cohort | CPRD | |
|---|---|---|---|
| 502, 665 | 12,228 | 1,101,101 | |
| 273,468 (54) | 6,580 (54) | 544,585 (50) | |
| 57±8 | 53 ±8.2 | 53±8.5 | |
| 168,307 (34) | 4,938 (44) | 573,301 (52) | |
| 472,831 (95) | - | 727,758 (66) | |
| 8,067(2) | - | 17,228 (2) | |
| 8,066 (2) | - | 10,645 (1) | |
| 10,907 (2) | - | 11,653 (1) | |
| - | 333,817 (30) |
Diabetes-related baseline self-report variables available in UK Biobank, N = 502,665.
| Diagnoses | Diabetes diagnosed by doctor | 26,408 (5.3) |
| Gestational diabetes only | 1,072 (0.4) | |
| Diabetes diagnosed by doctor (excluding gestational only) | 25,336 (5.1) | |
| Age diabetes diagnosed, years | 54 (46–60) | |
| Medication | On insulin | 5,613 (1.1) |
| Started insulin within one year diagnosis of diabetes | 3,034 (0.6) | |
| Diagnoses | Non-specified diabetes | 21,738 (4.3) |
| Age at non-specified diabetes diagnosis, years | 55(47–61) | |
| Gestational diabetes | 285 (0.10) | |
| Age at gestational diabetes diagnosis, years | 37 (30–44) | |
| Type 1 diabetes | 428 (0.09) | |
| Age at type 1 diabetes diagnosis, years | 30 (20–43) | |
| Type 2 diabetes | 3,367 (0.67) | |
| Age at type 2 diabetes diagnosis, years | 56 (49–61) | |
| Medications | Insulin | 5,317 (20.0) |
| Metformin | 14,657 (55.5) | |
| Sulphonylureas | 5,596 (21.1) | |
| Meglitinides | 135 (0.51) | |
| Glitazones | 1,997 (7.6) | |
| Any non-metformin oral anti-diabetic drug | 6,809 (25.8) | |
| Any diabetes medication | 19,045 (72.1) | |
| Complications | Diabetic nephropathy | 20 (0.08) |
| Age at diabetic nephropathy, years | 55 (40–63) | |
| Diabetic neuropathy/ ulcers | 153 (0.6) | |
| Age at diabetic neuropathy, years | 54 (46–60) | |
| Diabetic eye disease | 1,172 (4.4) | |
| Age at diabetic eye disease, years | 54 (46–61) | |
| Any diabetes complication | 1,293 (4.9) | |
Data are n (%), n (% women) for gestational diabetes variables or median (IQR).
Fig 2(a) Prevalence algorithm 1: Distinction between diabetes presence or absence, and initial sorting of diabetes type using baseline UK Biobank assessment data. See S1 appendix for rationale and further data for each step. (b) Prevalence algorithm 2: Finalising type 1 diabetes diagnosis and classification into probable and possible categories. See S1 appendix for rationale and further data for each step.(c) Prevalence algorithm 3: Finalising type 2 diabetes diagnosis and classification into probable and possible categories. See S1 appendix for rationale and further data for each step.(d) Final diabetes diagnostic status in UKB.
Cross-tabulation of final diabetes status from UK Biobank prevalence algorithms against primary care diabetes data for linked Welsh UK Biobank participants.
| Classification of diabetes status of UKB population according to UKB data | Diabetes unlikely | Uncertain diabetes status | Gestational diabetes | Probable type 1 diabetes | Possible type 1 diabetes | Probable type 2 diabetes | Possible type 2 diabetes | |
|---|---|---|---|---|---|---|---|---|
| 11560(95) | 0 | 23(0.2) | 36(0.3) | 9(0.1) | 513(4.2) | 87(0.7) | ||
| 10 | 0 | 2 | 29 | 8 | 386 | 70 | ||
| 0 | 0 | 0 | 22 | 2 | 3 | 15 | ||
| 0 | 0 | 0 | 27 | 3 | 5 | 21 | ||
| 9 | 0 | 1 | 6 | 2 | 378 | 55 | ||
| 9 | 0 | 1 | 7 | 4 | 383 | 58 | ||
| 3 | 0 | 0 | 20 | 4 | 163 | 43 | ||
| 0 | 0 | 2 | 0 | 0 | 0 | 1 | ||
| 0 | 0 | 0 | 20 | 2 | 1 | 11 | ||
| 9 | 0 | 0 | 0 | 3 | 379 | 48 | ||
| 55 (44–61) | 34 (27–39) | 30 (23–38) | 40 (33–46) | 56 (50–60) | 50 (43–55) | |||
| 1 | 0 | 0 | 28 | 4 | 49 | 62 | ||
| 0 | 0 | 0 | 20 | 1 | 1 | 15 | ||
| 6 | 0 | 1 | 9 | 4 | 347 | 55 | ||
| 3 | 0 | 1 | 1 | 0 | 97 | 2 | ||
| 0 | 0 | 0 | 8 | 1 | 1 | 12 | ||
| 4 | 0 | 0 | 0 | 3 | 260 | 42 | ||
| 0 | 0 | 0 | 28 | 2 | 1 | 24 | ||
| 8 | 0 | 1 | 27 | 6 | 374 | 67 | ||
| 7 | 0 | 1 | 22 | 6 | 356 | 63 | ||
Dates of relevant primary care codes precede the UK Biobank assessment date for each patient.
aCodes classified as Type 1 diabetes (definite), insulin dependent diabetes (probable type 1 diabetes) or juvenile onset diabetes (possible type 1 diabetes).
bCodes classified as Type 2 diabetes (definite), non-insulin dependent diabetes (probable type 2 diabetes) or adult onset diabetes (possible type 2 diabetes).
cType 1 diabetes codes, no type 2 diabetes codes (there may be other or non-specific codes).
dType 2 diabetes codes but no type 1 diabetes codes (there may be other or non-specific codes).
Comparison of final diabetes status from prevalence algorithms in UK Biobank versus diabetes diagnoses in secondary care data at baseline.
| Diabetes unlikely | Possible gestational diabetes | Probable type 1 diabetes | Possible type 1 diabetes | Probable type 2 diabetes | Possible type 2 diabetes | |
|---|---|---|---|---|---|---|
| UK Biobank cohort (n = 502,665) | 476,191 | 794 | 1,487 | 350 | 20,570 | 3,273 |
| Hospital admissions codes present prior to baseline assessment date (% of all those in UKB) | 284,780 (60) | 539 (68) | 1,163 (78) | 289 (83) | 14,659 (71) | 2,606 (80) |
| Diabetes diagnosis in hospital admissions data at baseline in any position: | ||||||
| 642 (0.2) | 13 (2) | 963 (83) | 242 (84) | 6,430 (44) | 1,866 (72) | |
| 39 (6) | 1 (8) | 734 (76) | 98 (41) | 115 (2) | 508 (27) | |
| 507 (80) | 10 (77) | 128 (14) | 119 (49) | 6,146 (96) | 1,292 (69) | |
| 120 (19) | 2 (15) | 100 (10) | 23 (10) | 491 (8) | 175 (9) |
Data are n (%). Derived from hospital admissions (available from 1997 onwards), date of first diabetes code used.
Fig 3a. Diabetes incidence algorithms for primary care data, run in CPRD. b. Diabetes incidence algorithm for secondary care data, run in UK Biobank-held in-patient data. *Includes categories:probable type 1 diabetes, probable type 2 diabetes,. **ICD-10: E10, E11, E13, E14. Includes main or secondary diagnostic codes for in-patient data.
Characteristics of those with incident diabetes from 1st January 2006 to 1st January 2015, comparing those identified in primary care (with or without secondary care diagnosis) and those identified in secondary care alone in the CPRD database.
| Denominator = 1,048,972 free from diabetes in CPRD on Jan 1st 2006 | Incident Type 2 diabetes diagnostic code (C10F) in primary care data (from | No incident diabetes diagnostic code (C10F) in primary care data (n = 1,003,941) | |
|---|---|---|---|
| Incident Secondary care diabetes diagnostic code | No secondary care diabetes diagnostic code at any time | Incident secondary care diabetes diagnostic code | |
| 18,440 | 26,591 | 7,519 | |
| 57 (8) | 55 (8) | 56 (8) | |
| 10,795 (59) | 15,796 (59) | 4,318 (57) | |
| 2,334 (13) | 1,072 (4) | 409 (5) | |
| 13,908 (75) | 17,613 (66) | 496 (7) | |
| 7,437 (40) | 6,675 (25) | 253 (3) | |
| 14,750 (80) | 18,055 (68) | 898 (12) | |
| 16,510 (90) | 23,191 (88) | 994 (22) | |
| 10,799 (59) | 12,394 (47) | 249 (3) | |
| 3,697 (20) | 3,951 (15) | 170 (2) | |
| 1,249 (7) | 982 (4) | 177 (2) | |
| 229 (1) | 88 (0.3) | 105 (1.4) | |
| 32 (6) | 32 (6) | 30 (6) | |
| 142 (19) | 142 (19) | 138 (19) | |
| 14,812 (80) | 19,119 (72) | 4,020 (53) | |
| 15,639 (85) | 20,980 (79) | 3,103 (41) | |
| 3,725 (20) | 2,855 (11) | 1,190 (16) | |
| 139 (1) | 81 (0.3) | 36 (0.5) | |
| 1,019 (6) | 683 (3) | 298 (4) | |
| 4,723 (26) | 4,981 (19) | 1,713 (23) | |
Data are mean±SD or n (%), extracted from primary care records on/ as close as possible after 1st January 2006.
*Co-morbidities and diabetic complications are at any time.
Fig 4Flow of participants identified with diabetes in UK Biobank.
*or mid-point of last consultation/episode without diabetes diagnosis (UK Biobank inception if not available) and 1st diabetes diagnosis dates.