| Literature DB >> 30608539 |
Laurentius C J Slobbe1,2, Koen Füssenich1,3, Albert Wong1, Hendriek C Boshuizen1,4, Markus M J Nielen5, Johan J Polder1,2, Talitha L Feenstra1,3, Hans A M van Oers1,2.
Abstract
BACKGROUND: Aggregated claims data on medication are often used as a proxy for the prevalence of diseases, especially chronic diseases. However, linkage between medication and diagnosis tend to be theory based and not very precise. Modelling disease probability at an individual level using individual level data may yield more accurate results.Entities:
Year: 2019 PMID: 30608539 PMCID: PMC6660107 DOI: 10.1093/eurpub/cky270
Source DB: PubMed Journal: Eur J Public Health ISSN: 1101-1262 Impact factor: 3.367
Pharmaceutical utilization in dataset
| Persons with at least one recorded episode for each chronic disease in 2008–2010 | Total number of pharmaceutical groups utilized in 2010 | Average number of pharmaceutical groups utilized per person | |
|---|---|---|---|
| Persons without disease | 184 826 | 328 385 |
|
| Persons with 1 chronic disease | 60 065 | 235 032 |
|
| Persons with 2 chronic diseases | 20 090 | 125 335 |
|
| Persons with 3 chronic diseases | 7609 | 63 064 |
|
| Persons with 4 chronic diseases | 2697 | 27 190 |
|
| Persons with 5 or more chronic diseases | 1436 | 17 219 |
|
| Total | 276 723 | 796 225 |
|
| Percentage with at least one chronic disease: | 33.2% | ||
| Percentage study population with multiple diseases: | 11.5% |
Legend: Training set population has been divided into six strata, based on the number of chronic diseases present. First column presents stratum. Second column gives population size. Third column gives total number of pharmaceutical groups utilized. Pharmaceutical groups have been defined in terms of an ATC 4 position code: A01A, A02A, etc. Last column gives average utilization in stratum.
Model outcome AUC with confidence interval, ordered by mean AUC
| Disease | AUC (95% conf. interval) | Prevalence in training set per 10 000 persons |
|---|---|---|
| Parkinson’s disease |
| 15 |
| Diabetes mellitus |
| 421 |
| Osteoporosis |
| 103 |
| Heart failure |
| 82 |
| Chronic obstructive pulmonary disease |
| 209 |
| Chronic enteritis/colitis ulcerosa |
| 31 |
| HIV/AIDS |
| 4 |
| Asthma |
| 424 |
| Epilepsy |
| 41 |
| Coronary heart disease |
| 255 |
| Visual disorder |
| 191 |
| Schizophrenia |
| 10 |
| Rheumatoid arthritis |
| 66 |
| Dementia |
| 28 |
| Congenital neurological anomaly |
| 3 |
| Multiple sclerosis |
| 9 |
| Cancer |
| 264 |
| Chronic alcohol abuse |
| 45 |
| Depressive disorder |
| 253 |
| Stroke (including TIA) |
| 137 |
| Congenital cardiovascular anomaly |
| 7 |
| Chronic back or neck disorder |
| 432 |
| Osteoarthritis |
| 282 |
| Anxiety disorder, neurosis, PTSS |
| 154 |
| Mental retardation |
| 13 |
| Hearing disorder |
| 62 |
| Anorexia |
| 8 |
| Gastric or duodenal ulcer |
| 25 |
| Tuberculosis |
| 2 |
Legend: First column gives name of chronic disease. Second column gives model outcome of RF-analysis as AUC with 95% confidence interval, in order of decreasing AUC. Third column states prevalence of chronic disease or condition in the training set. (n = 276 723).
Predictors of chronic diseases in Random Forest analysis
| Disease | ATC4 groups with strongest relation with disease in RF-analysis | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| [1]= strongest relation, [10]= weakest relation | ||||||||||
| [1] | [2] | [3] | [4] | [5] | [6] | [7] | [8] | [9] | [10] | |
| Parkinson's disease | N04B | N04A | Birthyear | L04A | A07E | C10A | C09B | N05A | C03C | N06D |
| Diabetes mellitus | A10A | A10B | C10A | B01A | H04A | C09A | C09B | C08C | C03C | C07A |
| Osteoporosis | M05B | A12A | A11C | L04A | C10A | B01A | C09D | D01A | C09C | D06A |
| Heart failure | C03C | C03D | A12B | C01A | C08D | C08C | C01D | C03A | C10A | C07B |
| Chronic obstructive pulmonary disease | R03A | R03B | R06A | A01A | R01A | N06B | J01F | R05D | A06A | J01C |
| Chronic enteritis/colitis ulcerosa | A07E | L04A | L01B | B03B | A06A | J01M | A11C | A07D | N02A | M01A |
| HIV/AIDS | J05A | J01E | J04A | J01F | N01B | J07B | D06B | A02B | J01C | J01A |
| Asthmaa | R03A | R03B | R03C | Birthyear | H02A | A07A | S01G | R01A | R06A | R05D |
| Epilepsy | N03A | N05B | Birthyear | A03F | N05A | B01A | N06A | D04A | D11A | N05C |
| Coronary heart disease | C01D | C08D | C03C | C01B | C03A | C09A | D06A | C01E | C09C | B03A |
| Visual disorder | S01E | S01B | Birthyear | S01C | A10B | S01F | D02A | S01A | A10A | S01X |
| Schizophreniaa | N05A | N05B | N05C | N04A | N06A | N06B | Birthyear | N03A | A06A | N07C |
| Rheumatoid arthritis | L04A | P01B | A07E | B03B | N02A | L01B | H02A | D02B | M05B | D06A |
| Dementiaa | N06D | Birthyear | N05A | A12A | C03C | N03A | M05B | Y | D03D | C09D |
| Congenital neurological anomaly | M03B | G04B | N03A | J01X | D07X | N05A | J01E | A12A | D01A | N05B |
| Multiple sclerosis | L03A | M03B | G04B | N03A | N06A | N04B | B03B | S01A | J01X | C03C |
| Cancer | Birthyear | L02B | H03A | Y | D06A | A04A | Gender | L02A | G03C | A12A |
| Chronic alcohol abuse | N07B | N05A | Birthyear | N05B | G04C | A02B | N06A | M04A | A10B | N05C |
| Depressive disorder | N06A | N05A | N05B | N06B | N05C | N07B | N03A | A03F | A11C | G04B |
| Stroke (including TIA) | B01A | V03A | C01D | C01B | C07A | Birthyear | C08C | C01A | S01C | S01E |
| Congenital cardiovascular anomaly | B01A | C07A | J01C | N03A | C09A | D06A | R03B | Y | D02A | S02C |
| Chronic back or neck disorder | N02A | M01A | A02B | A06A | N02B | C05A | S02C | N03A | H02A | R05D |
| Osteoarthritis | Birthyear | B01A | Gender | M04A | C10A | N02B | C10B | C03A | S01E | N02A |
| Anxiety disorder, neurosis, PTSS | N06A | N05B | N05A | C07A | N01B | N05C | A03F | A06A | N03A | D05A |
| Mental retardation | N05A | N03A | D02A | D06A | A06A | N05B | Y | D10A | S01F | N01B |
| Hearing disorder | Birthyear | L02A | B02A | S01X | D05A | G04C | H02A | A10B | C08D | C07B |
| Anorexia | Gender | G03A | A06A | A01A | Y | J01X | G03H | R05D | G01A | A12B |
| Gastric or duodenal ulcer | A02B | D05A | G04B | A07A | A03A | A03F | M01A | A11C | D06B | A06A |
| Tuberculosis | J04A | D02A | D07X | C01B | J01A | C08C | C03C | S01C | S02C | C03E |
Legend: First column gives name of chronic disease. The next 10 columns list the predictors used in the final RF-model, in order of decreasing importance. To facilitate comparison, diseases are presented in the same order as in table 2.
For these diseases, comparison is possible with pharmaceutical groups listed in risk adjustment compulsory insurance. The shaded groups are also used for the detection of these diseases by Dutch insurers.
These diseases have been compared with ATC-groups mentioned in the relevant Dutch treatment guidelines. The shaded groups are included in these guidelines.
Figure 1Example of comparison between Dutch population prevalence for ages 30–80 estimated from model applied to drug utilization data and estimation based on training set for COPD, male