Literature DB >> 31471439

Soft clustering using real-world data for the identification of multimorbidity patterns in an elderly population: cross-sectional study in a Mediterranean population.

Concepción Violán1,2, Quintí Foguet-Boreu1,3,4, Sergio Fernández-Bertolín5,2, Marina Guisado-Clavero5,2, Margarita Cabrera-Bean6, Francesc Formiga7, Jose Maria Valderas5,8, Albert Roso-Llorach5,2.   

Abstract

OBJECTIVES: The aim of this study was to identify, with soft clustering methods, multimorbidity patterns in the electronic health records of a population ≥65 years, and to analyse such patterns in accordance with the different prevalence cut-off points applied. Fuzzy cluster analysis allows individuals to be linked simultaneously to multiple clusters and is more consistent with clinical experience than other approaches frequently found in the literature.
DESIGN: A cross-sectional study was conducted based on data from electronic health records.
SETTING: 284 primary healthcare centres in Catalonia, Spain (2012). PARTICIPANTS: 916 619 eligible individuals were included (women: 57.7%). PRIMARY AND SECONDARY OUTCOME MEASURES: We extracted data on demographics, International Classification of Diseases version 10 chronic diagnoses, prescribed drugs and socioeconomic status for patients aged ≥65. Following principal component analysis of categorical and continuous variables for dimensionality reduction, machine learning techniques were applied for the identification of disease clusters in a fuzzy c-means analysis. Sensitivity analyses, with different prevalence cut-off points for chronic diseases, were also conducted. Solutions were evaluated from clinical consistency and significance criteria.
RESULTS: Multimorbidity was present in 93.1%. Eight clusters were identified with a varying number of disease values: nervous and digestive; respiratory, circulatory and nervous; circulatory and digestive; mental, nervous and digestive, female dominant; mental, digestive and blood, female oldest-old dominant; nervous, musculoskeletal and circulatory, female dominant; genitourinary, mental and musculoskeletal, male dominant; and non-specified, youngest-old dominant. Nuclear diseases were identified for each cluster independently of the prevalence cut-off point considered.
CONCLUSIONS: Multimorbidity patterns were obtained using fuzzy c-means cluster analysis. They are clinically meaningful clusters which support the development of tailored approaches to multimorbidity management and further research. © Author(s) (or their employer(s)) 2019. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.

Entities:  

Keywords:  chronic conditions; cluster analysis; epidemiology; multimorbidity

Year:  2019        PMID: 31471439      PMCID: PMC6719769          DOI: 10.1136/bmjopen-2019-029594

Source DB:  PubMed          Journal:  BMJ Open        ISSN: 2044-6055            Impact factor:   2.692


Studies focus on diseases rather than individuals as the unit of analysis in assessing multimorbidity patterns (hard clustering forces each individual to belong to a single cluster, whereas soft clustering allows elements to be simultaneously classified into multiple cluster). Reliable and valid identification of disease clusters is needed for the development of evidence-based clinical practice guidelines and pathways of care for patients that correspond to the wide spectrum of diseases in patients with multimorbidity. Soft clustering analysis allows for diseases to be linked simultaneously to multiple clusters and is more consistent with clinical experience than other approaches frequently found in the literature. The different cut-off points (prevalence filters) applied to obtain multimorbidity patterns permitted the identification of common nuclear diseases which remained independent of their prevalence. The literature provides support for the aetiopathophysiological and epidemiological associations between conditions forming part of the same cluster.

Introduction

The term multimorbidity widely refers to the existence of numerous medical conditions in a single individual.1 In many regions of the world there is evidence that a substantial, and probably growing, proportion of the adult population is affected by multiple chronic conditions. Moreover, the association of multimorbidity with increasing age leading to a two-fold prevalence in the final decades of life has been proven.2 Multimorbidity has been estimated to be at around 62% between 65 and 74 years, and around 81.5% after 85 years.3 Its true extent is, however, difficult to gauge as there is no agreed definition or classification system.4–7 Most of the published literature focuses on diseases rather than individuals as the unit of analysis in assessing multimorbidity patterns.8 Orienting the analysis of multimorbidity patterns at an individual level, and not of disease, could have crucial implications for patients. In the current context of limited evidence on interventions for unselected patients with multimorbidity, such an approach–would allow better understanding of population groups, and facilitate the development and implementation of strategies aimed at prevention, diagnosis, treatment and prognosis. It would also elicit essential information for the development of clinical guidelines, pathways of care, and lead to better understanding of the nature and range of the required health services.9 10 Cluster analysis involves assigning individuals so that the items (diseases) in the same cluster are as similar as possible, while individuals belonging to different clusters are as dissimilar as possible. The identification of clusters is based on similarity measures and their choice may depend on the data or the purpose of the analysis.11 12 Hard clustering forces each element to belong to a single cluster, whereas soft clustering (also referred to as fuzzy clustering) allows elements to be simultaneously classified into multiple clusters. Empirical evidence is needed on how both established and novel techniques influence the identification of multimorbidity patterns. A recent systematic review recommended that future epidemiological studies cover a broad selection of health conditions in order to avoid missing potentially key nosological associations and enhance external validity. When many conditions are considered, the clustering of individuals based on morbidity data will encounter high-dimensional issues. This is particularly important when a clustering-based approach is adopted to assess the impact of multimorbidity on individual health outcomes and health service uses.2 8 13–15 The identification of multimorbidity patterns seems to be implicitly dependent on the prevalence of the included diseases.2 8 16 17 However, to the best of our knowledge no previous study has analysed the identification of multimorbidity patterns explicitly based on the prevalence of the diseases. The aim of this study was to identify, with soft clustering methods, multimorbidity patterns in the electronic health records of a population ≥65 years, and to analyse such patterns in accordance with the different prevalence cut-off points applied.

Methods

Study population

A cross-sectional analysis was carried out in Catalonia (Spain), a Mediterranean region of 7 515 398 inhabitants (2012). The Catalan Health Institute provides universal coverage and operates 284 primary healthcare centres (PHC).

Data sources

Since 2006 the Information System for Research in Primary Care (SIDIAP) database includes anonymised longitudinal electronic health records from primary and secondary care which gather information on demographics, diagnoses, prescriptions and socioeconomic status.18 In our study the inclusion criteria were individuals aged 65–99 years on 31 December 2011 with at least one PHC visit since 2012. Only participants who survived until 31 December 2012 (index date) were included in the analysis.

Variables

Diseases were coded in the SIDIAP using the International Classification of Diseases version 10. An operational definition of multimorbidity was the simultaneous presence of more than one of the selected 60 chronic diseases previously identified by the Swedish National study of Aging and Care in Kungsholmen (SNAC-K).19 Additional variables included in the study were sociodemographics (age, sex, socioeconomic status (MEDEA index20)), clinical variables (including number of chronic diseases and invoiced drugs) and use of health services (number of visits to family physicians, nurses and emergency services).

Statistical analysis

Descriptive statistics were used to summarise overall information. Disease prevalence was computed for all the included population. Descriptive analyses were stratified by the presence of multimorbidity. Comparison was performed using Student’s t-test or Mann-Whitney test for continuous variables and χ2 test for categorical ones. In order to obtain the most representative clusters all patients were included irrespective of whether they presented multimorbidity or not. Sex and age variables, together with chronic diseases selected by prevalence, were included in the analysis. The number of features to be considered varied from the 62 original ones (no prevalence filtering applied) to 54 and 49, for a 1% and 2% prevalence threshold, respectively. Due to the large number of diseases, a principal component analysis for categorical and continuous data (PCAmix) was implemented to reduce complexity. With this technique both continuous and dichotomous variables were simultaneously processed through the application of Multiple Correspondence Analysis to the binary variables and PCA to the continuous ones. Using Karlis-Saporta-Spinaki criterion to select the optimal number of dimensions to retain, the data set of 49 features per individual per 2% prevalence cut-off was transformed to a new dimensionally reduced data set of 13 continuous features per individual, which concentrated most of the variability of the newly transformed data set.21 Once the transformed data set was obtained, clusters of chronic conditions at baseline were identified using the fuzzy c-means clustering algorithm.22 This Machine learning technique forces every individual to belong to every cluster in accordance with its characteristics and by assigning a membership degree factor in (0, 1) to each individual with respect to each pattern. This provides the flexibility enabling patients to belong to more than one multimorbidity pattern.23 The main parameters in this clustering procedure were the number of clusters and a fuzziness parameter, denoted m, which ranged from just above 1 to infinity. High m values produce a fuzzy set of clusters, so that individuals are equally distributed across clusters, whereas lower ones generate non-overlapped clusters. Further details on the stability and validation techniques applied to obtain the best fuzzy c-means parameters and the set of centroids are presented in online supplementary additional file 1. To describe the multimorbidity patterns, frequencies and percentages of diseases (P) in each cluster were calculated. Observed/expected ratios (O/E ratios) were calculated by dividing disease prevalence in the cluster by disease prevalence in the overall population. As the membership of each individual to any of the clusters was given by a membership degree factor, and not as a binary variable, the observed disease prevalence (O) in a cluster was computed as the sum of the disease membership degree factors corresponding to all individuals suffering the disease. Exclusivity, defined as the proportion of patients with the disease included in the cluster over the total number of patients with the disease, was also calculated. Further details on how these ratios were computed using the membership factors are given in online supplementary additional file 1. A disease was considered to be part of a multimorbidity cluster when O/E ratio was ≥2 or exclusivity value ≥25%.24 Clusters names were also defined taking into account the dominant gender or age in the cluster compared with the overall sample distribution. We conducted a sensitivity analysis by modifying the prevalence threshold for disease inclusion in the cluster analysis. For chronic diseases we considered as alternatives no filtering, and ≥1% and ≥2% filters among the included population. In order to conform to the Karlis-Saporta-Spinaki rule, a different number of dimensions of the transformed data set were retained to construct the clusters for every prevalence cut-off: 13 dimensions for the 2% prevalence, 14 dimensions for the 1% prevalence and 17 dimensions with no filtering. The content of each cluster was compared across filtering approaches in terms of diseases associated with that cluster, characteristics of the included population and cluster size. Clinical evaluation of the consistency and significance of these solutions was also conducted. The analyses were carried out using R V.3.3.1 (R Foundation for Statistical Computing, Vienna, Austria). The significance level was set at 0.05.

Patient and public involvement

Patients were not involved in the study based on anonymised data.

Results

In this study, 916 619 individuals were included (women: 57.7%; mean age: 75.4 (SD: 7.4)), and 853 085 (93.1%) of them met multimorbidity criteria (figure 1).
Figure 1

Study population flow chart. *See 60 chronic diseases group defined in Swedish National study of Aging and Care in Kungsholmen (SNAC-K).25

Study population flow chart. *See 60 chronic diseases group defined in Swedish National study of Aging and Care in Kungsholmen (SNAC-K).25 Participants’ characteristics are summarised in table 1. Statistically significant differences were present between the multimorbidity and non-multimorbidity groups for all the variables included in the analysis (table 1).
Table 1

Characteristics of study participants aged 65–94 years stratified by multimorbidity and non-multimorbidity (n=916 619, Catalonia, 2012)

Variables*Multimorbidity(n=853 085)Non-multimorbidity(n=63 534)All(n=916 619)
Sex, women, n (%)496 294 (58.2)32 837 (51.7)529 131 (57.7)
Age, mean (SD)75.6 (7.4)73.2 (7.3)75.4 (7.4)
Age (categories), n (%)
 (65, 70)225 514 (26.4)26 664 (42.0)252 178 (27.5)
 (70, 80)370 356 (43.4)24 230 (38.1)394 586 (43.0)
 (80, 90)224 143 (26.3)10 601 (16.7)234 744 (25.6)
 ≥9033 072 (3.9)2039 (3.2)35 111 (3.8)
MEDEA index†
 Q1130 894 (16.5)13 897 (23.4)144 791 (17.0)
 Q2126 537 (16.0)9894 (16.6)136 431 (16.0)
 Q3129 246 (16.3)8976 (15.1)138 222 (16.2)
 Q4125 322 (15.8)7666 (12.9)132 988 (15.6)
 Q5110 916 (14.0)5967 (10.0)116 883 (13.7)
 Rural169 190 (21.4)13 059 (22.0)182 249 (21.4)
Number of chronic diseases, median (IQR)6.0 (4.0; 8.0)1.0 (0.0; 1.0)6.0 (4.0; 8.0)
Number of chronic diseases (categories), n (%)
 00 (0.0)25 380 (39.9)25 380 (2.8)
 10 (0.0)38 154 (60.1)38 154 (4.2)
 (2, 5)268 836 (31.5)0 (0.0)268 836 (29.3)
 (5, 10)463 709 (54.4)0 (0.0)463 709 (50.6)
 ≥10120 540 (14.1)0 (0.0)120 540 (13.2)
Number of drugs, median (IQR)5.0 (3.0; 8.0)0.0 (0.0; 1.0)5.0 (2.0; 8.0)
Number of drugs (categories)
 072 557 (8.5)40 811 (64.2)113 368 (12.4)
 148 704 (5.7)8378 (13.2)57 082 (6.2)
 (2, 5)247 095 (29.0)11 572 (18.2)258 667 (28.2)
 (5, 10)360 030 (42.2)2651 (4.2)362 681 (39.6)
 ≥10124 699 (14.6)122 (0.2)124 821 (13.6)
Number of visits, median (IQR)10.0 (6.0; 17.0)1.0 (0.0; 4.0)9.0 (5.0; 16.0)
Number of visits 2012 (categories), n (%)
 024 543 (2.9)23 402 (36.8)47 945 (5.2)
 124 281 (2.8)9603 (15.1)33 884 (3.7)
 (2, 5)114 198 (13.4)16 241 (25.6)130 439 (14.2)
 (5, 10)239 181 (28.0)10 168 (16.0)249 349 (27.2)
 ≥10450 882 (52.9)4120 (6.5)455 002 (49.6)

*All comparisons between variables in multimorbidity and non-multimorbidity showed p<0.001.

†MEDEA index goes from 1 (least deprived) to 5 (most deprived), in this variable n=851 564.

Characteristics of study participants aged 65–94 years stratified by multimorbidity and non-multimorbidity (n=916 619, Catalonia, 2012) *All comparisons between variables in multimorbidity and non-multimorbidity showed p<0.001. †MEDEA index goes from 1 (least deprived) to 5 (most deprived), in this variable n=851 564. Among the 60 SNAC-K chronic diseases, the most prevalent were: hypertension (71.0%), dyslipidaemia (50.9%), osteoarthritis and other degenerative joint diseases (32.8%), obesity (28.7%), diabetes (25.1%) and anaemia (18.3%) (table 2).
Table 2

Prevalence of the 60 chronic diseases included in the study in individuals aged 65–94 years (n=916 619, Catalonia, 2012)

RankChronic conditionsFrequencyPercentage (%)All diseases included1%2%
1Hypertension650 89971
2Dyslipidaemia466 58550.9
3Osteoarthritis and other degenerative joint diseases300 80332.8
4Obesity262 88828.7
5Diabetes230 46025.1
6Anaemia167 57718.3
7Cataract and other lens diseases156 62217.1
8Chronic kidney diseases153 75616.8
9Prostate diseases153 63516.8
10Osteoporosis151 84716.6
11Depression and mood diseases148 75116.2
12Solid neoplasms137 04515
13Colitis and related diseases131 51214.4
14Venous and lymphatic diseases126 99713.9
15Other musculoskeletal and joint diseases124 76513.6
16Dorsopathies124 60313.6
17Neurotic, stress-related and somatoform diseases123 39513.5
18COPD, emphysema, chronic bronchitis109 60312
19Ischaemic heart disease95 43410.4
20Deafness, hearing impairment90 2619.9
21Sleep disorders88 7399.7
22Thyroid diseases88 4459.7
23Other genitourinary diseases85 4689.3
24Cerebrovascular disease80 2648.8
25Atrial fibrillation80 2478.8
26Oesophagus, stomach and duodenum diseases80 0438.7
27Heart failure74 0778.1
28Other eye diseases68 9397.5
29Glaucoma66 1627.2
30Inflammatory arthropathies62 4506.8
31Dementia59 2136.5
32Cardiac valve diseases52 1005.7
33Peripheral neuropathy49 1275.4
34Other psychiatric and behavioural diseases46 8415.1
35Asthma43 6634.8
36Allergy40 3944.4
37Autoimmune diseases39 3504.3
38Ear, nose, throat diseases38 7524.2
39Peripheral vascular disease30 6743.4
40Other neurological diseases28 5413.1
41Chronic pancreas, biliary tract and gall bladder diseases27 3213
42Migraine and facial pain syndromes25 9992.8
43Bradycardias and conduction diseases25 4762.8
44Chronic liver diseases22 6332.5
45Other digestive diseases22 0222.4
46Parkinson and parkinsonism20 8332.3
47Other metabolic diseases18 9972.1
48Other cardiovascular diseases16 8331.8
49Other skin diseases15 3631.7
50Chronic ulcer of the skin13 8691.5
51Blood and blood-forming organ diseases13 5751.5
52Other respiratory diseases99741.1
53Epilepsy89811
54Haematological neoplasms81740.9
55Chronic infectious diseases66470.7
56Inflammatory bowel diseases55490.6
57Schizophrenia and delusional diseases47920.5
58Blindness, visual impairment47720.5
59Multiple sclerosis5760.1
60Chromosomal abnormalities770

In the last three columns, list of diseases was included by prevalence cut-off (All, 1%, 2%).

COPD, chronic obstructive pulmonary disease.

Prevalence of the 60 chronic diseases included in the study in individuals aged 65–94 years (n=916 619, Catalonia, 2012) In the last three columns, list of diseases was included by prevalence cut-off (All, 1%, 2%). COPD, chronic obstructive pulmonary disease. Eight multimorbidity patterns were identified using fuzzy c-means algorithm with fuzziness parameter of m=1.1, after computing different validation indices to obtain the optimal number of clusters (online supplementary additional file 1). This number was the same for the three different prevalence thresholds: no filtering, and ≥1% and ≥2% filters. The cluster formed by the most prevalent diseases was designated non-specified, youngest-old dominant (O/E ratio <2 and exclusivity <20). The remaining seven clusters were specific: nervous and digestive; respiratory, circulatory and nervous; circulatory and digestive; mental, nervous and digestive, female dominant; mental, digestive and blood, female oldest-old dominant; nervous, musculoskeletal and circulatory, female dominant; and genitourinary, mental and musculoskeletal, male dominant (table 3). Table 3 shows the results, considering a 2% prevalence filter, for each pattern based on the 15 diseases with the higher O/E ratios.
Table 3

Most frequent 15 diseases found in multimorbidity patterns in individuals aged 65–94 years (n=916 619, Catalonia, 2012)

PatternDiseaseOO/E ratioEXPatternDiseaseOO/E ratioEX
1Nervous and digestive(n=40 037)Parkinson and parkinsonism38.71774.32Respiratory, circulatory and nervous(n=50 639)Asthma34.57.240
Other neurological diseases49.515.969.4Peripheral vascular disease13.94.222.9
Chronic liver diseases13.25.423.4Parkinson and parkinsonism8.53.820.8
Chronic pancreas, biliary tract and gall bladder diseases7.92.711.6Other neurological diseases11.73.720.7
Dementia14.72.39.9COPD, emphysema, chronic bronchitis312.614.3
Other digestive diseases4.828.7Allergy10.82.413.5
Cerebrovascular disease16.91.98.4Heart failure16.6211.3
Colitis and related diseases24.11.77.3Ischaemic heart disease21.1211.2
Other metabolic diseases3.41.77.2Other eye diseases141.910.3
Depression and mood diseases251.56.7Autoimmune diseases7.21.79.3
Anaemia26.11.46.2Other psychiatric and behavioural diseases8.51.79.2
Oesophagus, stomach and duodenum diseases11.31.35.6Ear, nose, throat diseases7.11.79.2
Sleep disorders12.41.35.6Anaemia30.41.79.2
Other eye diseases9.61.35.6Peripheral neuropathy8.81.69.1
Dorsopathies171.25.4Cerebrovascular disease14.31.69
3Circulatory and digestive(n=67 492)Heart failure51.46.446.94Mental, nervous and digestive, female dominant(n=94 453)Neurotic, stress-related and somatoform diseases64.94.849.7
Cardiac valve diseases34.2644.3Depression and mood diseases66.44.142.1
Atrial fibrillation47.35.439.8Migraine and facial pain syndromes8.22.929.6
Bradycardias and conduction diseases13.54.935.9Sleep disorders19220.2
Ischaemic heart disease33.73.223.8Oesophagus, stomach and duodenum diseases14.91.717.6
Chronic pancreas, biliary tract and gall bladder diseases82.719.7Osteoporosis281.717.4
Chronic liver diseases6.12.518.2Thyroid diseases161.717.1
Chronic kidney diseases35.92.115.8Colitis and related diseases23.71.717
Anaemia38.62.115.5Other genitourinary diseases14.41.515.9
Cerebrovascular disease18.32.115.4Ear, nose, throat diseases6.21.515.2
COPD, emphysema, chronic bronchitis23.6214.5Venous and lymphatic diseases19.91.414.8
Other digestive diseases4.61.914Allergy6.11.414.3
Peripheral vascular disease6.11.813.3Osteoarthritis and other degenerative joint diseases451.414.1
Other metabolic diseases3.21.511.3Dorsopathies181.313.7
Dementia9.51.510.9Cardiac valve diseases7.41.313.5
5Mental, digestive and blood, female oldest-old dominant(n=106 845)Dementia21.83.439.46Nervous, musculoskeletal and circulatory, female dominant(n=145 074)Peripheral neuropathy12.42.336.6
Other digestive diseases5.82.428.1Other musculoskeletal and joint diseases261.930.2
Anaemia38.52.124.6Venous and lymphatic diseases26.41.930.2
Chronic kidney diseases33.3223.1Dorsopathies25.31.929.4
Colitis and related diseases26.21.821.3Obesity511.828.2
Cerebrovascular disease14.81.719.7Other genitourinary diseases161.727.2
Osteoporosis261.618.3Osteoarthritis and other degenerative joint diseases551.726.5
Cataract and other lens diseases25.91.517.7Osteoporosis24.81.523.7
Deafness, hearing impairment141.416.5Other eye diseases10.71.422.4
Venous and lymphatic diseases19.51.416.4Cataract and other lens diseases22.51.320.8
Osteoarthritis and other degenerative joint diseases45.51.416.2Thyroid diseases12.61.320.7
Depression and mood diseases22.51.416.1Glaucoma9.21.320.1
Other genitourinary diseases12.31.315.4Diabetes31.31.219.7
Other eye diseases9.91.315.4Ear, nose, throat diseases5.21.219.5
Sleep disorders12.41.314.9Dyslipidaemia62.71.219.5
7Genitourinary, mental and musculoskeletal, male dominant(n=173 746)Prostate diseases54.73.361.88Non-specified, youngest-old dominant(n=238 333)Dyslipidaemia38.40.819.6
Other psychiatric and behavioural diseases11.12.241.2Thyroid diseases7.30.819.6
Inflammatory arthropathies12.41.834.5Osteoporosis12.20.719.2
COPD, emphysema, chronic bronchitis20.51.732.5Hypertension47.60.717.4
Solid neoplasms21.81.527.7Glaucoma4.40.616
Peripheral vascular disease4.71.426.7Solid neoplasms9.10.615.7
Ischaemic heart disease13.71.325Migraine and facial pain syndromes1.70.615.7
Diabetes31.81.324Autoimmunediseases2.20.513.4
Ear, nose, throat diseases5.31.323.7Other metabolic diseases1.10.513.3
Deafness, hearing impairment11.61.222.3Allergy2.20.513
Allergy4.81.120.5Chronic liver diseases1.20.512.8
Hypertension75.81.120.2Other genitourinary diseases4.50.512.7
Glaucoma7.5119.6Oesophagus, stomach and duodenum diseases4.10.512.2
Autoimmune diseases4.4119.4Other psychiatric and behavioural diseases2.40.512
Obesity29119.2Diabetes10.80.411.2

COPD, chronic obstructive pulmonary disease; EX, exclusivity; O, disease prevalence in the cluster; O/E ratio, observed/expected ratio.

Most frequent 15 diseases found in multimorbidity patterns in individuals aged 65–94 years (n=916 619, Catalonia, 2012) COPD, chronic obstructive pulmonary disease; EX, exclusivity; O, disease prevalence in the cluster; O/E ratio, observed/expected ratio. Women were more represented than men in almost all clusters, from 52.7% for respiratory, circulatory and neurological to 83.6% for mental, nervous and digestive, female dominant. The exception was genitourinary, mental and musculoskeletal, male dominant in which men made up 90.9% due to the presence of male reproductive system diseases (table 4).
Table 4

Variables characterising each cluster in baseline study for 2% prevalence cut-off point (n=916 619)

1Nervous and digestive2Respiratory, circulatory and nervous3Circulatory and digestive4Mental, nervous and digestive, female dominant5Mental, digestive and blood, female oldest-old dominant6Nervous, musculoskeletal and circulatory, female dominant7Genitourinary, mental and musculoskeletal, male dominant8Non-specified, youngest-old dominantAll
Number of people40 03750 63967 49294 453106 845145 074173 746238 333916 619
Multimorbidity, n (%)39 776 (99.3)50 513 (99.8)67 443 (99.9)94 442 (100.0)106 696 (99.9)144 869 (99.9)171 983 (99.0)177 363 (74.4)853 085 (93.1)
Polypharmacy, n (%)28 484 (71.1)38 869 (76.8)54 658 (81.0)64 154 (67.9)71 830 (67.2)86 317 (59.5)90 603 (52.1)52 588 (22.1)487 502 (53.1)
Women, n (%)22 628 (56.5)26 690 (52.7)38 023 (56.3)78 922 (83.6)85 735 (80.2)113 629 (78.3)15 730 (9.1)147 773 (62.0)529 131 (57.7)
Men, n (%)17 409 (43.5)23 949 (47.3)29 469 (43.7)15 531 (16.4)21 110 (19.8)31 445 (21.7)158 016 (90.9)90 560 (38.0)387 488 (42.3)
Age (categories), n (%)
 (65, 70)7188 (18.0)10 400 (20.5)7233 (10.7)28 305 (30.0)12 036 (11.3)38 829 (26.8)52 003 (29.9)96 184 (40.4)252 178 (27.5)
 (70, 80)17 804 (44.5)22 743 (44.9)24 724 (36.6)40 577 (43.0)33 624 (31.5)70 643 (48.7)84 037 (48.4)100 435 (42.1)394 586 (43.0)
 (80, 90)13 460 (33.6)15 568 (30.7)29 908 (44.3)22 638 (24.0)48 453 (45.3)32 714 (22.6)34 785 (20.0)37 217 (15.6)234 744 (25.6)
 (90, 99)1587 (4.0)1927 (3.8)5628 (8.3)2934 (3.1)12 732 (11.9)2888 (2.0)2920 (1.7)4497 (1.9)35 111 (3.8)
MEDEA index*
 R7831 (21.8)9300 (20.2)13 718 (23.2)17 266 (19.7)22 183 (23.0)27 401 (20.0)35 145 (21.5)49 405 (21.9)182 249 (21.4)
 U16010 (16.7)6890 (15.0)9537 (16.1)15 027 (17.2)16 556 (17.2)19 599 (14.3)25 656 (15.7)45 516 (20.2)144 791 (17.0)
 U25690 (15.8)7134 (15.5)9140 (15.4)14 335 (16.4)15 272 (15.8)21 379 (15.6)25 951 (15.9)37 530 (16.6)136 431 (16.0)
 U35941 (16.5)7520 (16.4)9187 (15.5)14 223 (16.3)15 421 (16.0)23 261 (16.9)26 908 (16.5)35 761 (15.8)138 222 (16.2)
 U45540 (15.4)7686 (16.7)9016 (15.2)14 012 (16.0)14 272 (14.8)23 780 (17.3)26 526 (16.2)32 157 (14.2)132 988 (15.6)
 U54982 (13.8)7421 (16.2)8638 (14.6)12 652 (14.5)12 699 (13.2)21 923 (16.0)23 064 (14.1)25 506 (11.3)116 883 (13.7)
Number of chronic diseases, median (IQR)8.0 (6.0; 10.0)8.0 (6.0; 10.0)8.0 (7.0; 11.0)7.0 (6.0; 9.0)7.0 (5.0; 9.0)6.0 (5.0; 8.0)5.0 (4.0; 7.0)3.0 (3.0; 4.0)6.0 (4.0; 8.0)
Number of chronic diseases (categories), n (%)
 00 (0.0)0 (0.0)0 (0.0)0 (0.0)1 (0.0)0 (0.0)235 (0.1)25 144 (10.5)25 380 (2.8)
 1262 (0.7)125 (0.2)49 (0.1)11.0 (0.0)149 (0.1)204 (0.1)1528 (0.9)35 826 (15.0)38 154 (4.2)
 (2, 5)5409 (13.5)4507 (8.9)4275 (6.3)8781 (9.3)14 601 (13.7)22 400 (15.4)57 561 (33.1)151 302 (63.5)268 836 (29.3)
 (5, 10)23 502 (58.7)30 257 (59.8)37 910 (56.2)62 490 (66.2)73 427 (68.7)105 620 (72.8)104 915 (60.4)25 588 (10.7)463 709 (50.6)
 ≥1010 864 (27.1)15 749 (31.1)25 259 (37.4)231 715 (24.5)18 668 (17.5)16 850 (11.6)9506 (5.5)473 (0.2)120 540 (13.2)
Number of drugs, median (IQR)7.0 (4.0; 9.0)7.0 (5.0; 10.0)8.0 (5.0; 11.0)6.0 (4.0; 9.0)6.0 (4.0; 9.0)5.0 (3.0; 8.0)5.0 (3.0; 7.0)2.0 (0.0; 4.0)5.0 (2.0; 8.0)
Number of drugs (categories)
 02576 (6.4)2491 (4.9)3349 (5.0)5636 (6.0)7037 (6.6)8330 (5.7)13 389 (7.7)70 561 (29.6)113 368 (12.4)
 11212 (3.0)1072 (2.1)1015 (1.5)2939 (3.1)3390 (3.2)6772 (4.7)11 440 (6.6)29 242 (12.3)57 082 (6.2)
 (2, 5)7766 (19.4)8207 (16.2)8471 (12.6)21 725 (23.0)24 587 (23.0)43 656 (30.1)58 314 (33.6)85 942 (36.1)258 667 (28.2)
 (5, 10)18 510 (46.2)23 597 (46.6)31 850 (47.2)46 022 (48.7)52 653 (49.3)68 193 (47.0)73 694 (42.4)48 161 (20.2)362 681 (39.6)
 ≥109973 (24.9)15 272 (30.2)22 808 (33.8)18 132 (19.2)19 177 (17.9)18 123 (12.5)16 909 (9.7)4427 (1.9)124 821 (13.6)
Number of visits 2012, median (IQR)12.0 (7.0; 20.0)14.0 (8.0; 22.0)18.0 (9.0; 30.0)11.0 (6.0; 19.0)12.0 (7.0; 19.0)11.0 (7.0; 17.0)9.0 (5.0; 15.0)5.0 (2.0; 9.0)9.0 (5.0; 16.0)
Number of visits 2012 (categories), n (%)
 0976 (2.4)871 (1.7)1143 (1.7)2219 (2.3)2515 (2.4)2410.3 (1.7)4137 (2.4)33 673 (14.1)47 945 (5.2)
 1874 (2.2)754 (1.5)929 (1.4)2055 (2.2)2238 (2.1)2412.4 (1.7)4685 (2.7)19 938 (8.4)33 884 (3.7)
 (2, 5)4000 (10.0)3918 (7.7)4329 (6.4)10 589 (11.2)11 018 (10.3)14 943.7 (10.3)24 319 (14.0)57 322 (24.1)130 439 (14.2)
 (5, 10)9158 (22.9)10 774 (21.3)10 883 (16.1)24 504 (25.9)27 003 (25.3)42 180.7 (29.1)54 212 (31.2)70 634 (29.6)249 349 (27.2)
 ≥1025 030 (62.5)34 322 (67.8)50 209 (74.4)55 085 (58.3)64 071 (60.0)83 126.5 (57.3)86 393 (49.7)56 766 (23.8)455 002 (49.6)

For the sake of simplicity, all numbers in the table were rounded to its closest natural number.

*MEDEA index goes from 1 (least deprived) to 5 (most deprived), in this variable n=851 564.

Variables characterising each cluster in baseline study for 2% prevalence cut-off point (n=916 619) For the sake of simplicity, all numbers in the table were rounded to its closest natural number. *MEDEA index goes from 1 (least deprived) to 5 (most deprived), in this variable n=851 564. The highest O/E ratio and exclusivity value were observed in nervous and digestive for Parkinson, parkinsonism and other neurological diseases (17.0% and 74.3%, and 15.9% and 69.4%, respectively). The lowest values were found in non-specified, youngest-old dominant. Clusters 1–3 presented the highest median number of visits with circulatory and digestive being associated with the greatest number of visits over a 1 year period (median 18 visits), and the non-specified, youngest-old dominant pattern presenting the lowest median number of visits which was equal to 5 (table 4). Online supplementary additional file 2 shows tables of variables characterising each cluster in baseline study for 1% and for no prevalence cut-off points. Multimorbidity patterns varied according to requirements for minimal prevalence of selected conditions in the population. As an example, figure 2 depicts the composition of cluster 1 according to prevalence levels of disease, and the other clusters are shown in online supplementary additional file 3. Disease prevalence varied more greatly in the less populated patterns (eg, non-specified, youngest-old dominant) (online supplementary additional file 3). Nevertheless, there was a group that remained in some clusters across all prevalence levels, for instance, some in neurological and digestive (Parkinson and parkinsonism, other neurological diseases, chronic liver diseases, chronic pancreas, biliary tract and gall bladder diseases) formed part of the cluster regardless of changes in cut-off prevalence (online supplementary additional file 3). The selected level of prevalence resulted in changes in O/E ratios, with some of them doubling their values.
Figure 2

Composition of cluster 1 (nervous and digestive) in individuals aged 65–94 years according to disease levels of prevalence (n=916 619, Catalonia, 2012). COPD, chronic obstructive pulmonary disease; O/E ratio, observed/expected ratio.

Composition of cluster 1 (nervous and digestive) in individuals aged 65–94 years according to disease levels of prevalence (n=916 619, Catalonia, 2012). COPD, chronic obstructive pulmonary disease; O/E ratio, observed/expected ratio.

Discussion

The soft clustering method we employed identified eight multimorbidity patterns, regardless of the prevalence selected. The non-specified, youngest-old dominant cluster included the largest number of individuals and those who presented the smallest multimorbidity prevalence. In this pattern diseases did not exhibit an association higher than chance because values of the O/E ratio and exclusivity were less than 2% and 20%, respectively. This suggests that such patients during their lives could change group. Two clusters presenting gender dominance were observed: nervous, musculoskeletal and circulatory, female dominant was predominately made up of women >70 years, while genitourinary, mental and musculoskeletal, male dominant was mostly formed of men of the same age. Such patterns represent 61% of the elderly participants included in the study. The rest had fewer individuals and some diseases were over-represented such as Parkinson and parkinsonism in nervous and digestive, and asthma in respiratory, circulatory and nervous. We observed that some diseases with O/E ratios ≥2 were consistently associated with each other as part of the same clusters (for instance, nervous and digestive; respiratory, circulatory and nervous; circulatory and digestive; and mental, nervous and digestive, female dominant) regardless of the prevalence threshold that had been set. They can be considered core components of those clusters. Further research is needed to establish the role of these conditions from a longitudinal perspective.

Comparison with the literature

Comparison with other studies is hindered by variations in methods, data sources and structures, populations and diseases studied. Nevertheless, there are similarities with other authors. The non-specified pattern is the one most replicated in the literature, for example, Prados-Torres et al who employed an exploratory factor analysis25 and our group with k-means.24 Specifically, although the age range and the exclusivity threshold in our previous study were different, the hard clustering method provided clusters that overlap with some of the patterns obtained in this study, since both clustering results were predominantly defined by the O/E ratio (≥2) criteria. However, the soft approach allows a more flexible distribution of the individual and diseases. Recent research has provided support for physiopathological and genetic associations that explain the observed multimorbidity patterns. For instance, neurological and digestive included chronic liver disease which has been linked to Parkinson through the accumulation of toxic substances in the brain (ammonia and manganese) and neuroinflammation.26 A higher risk of Parkinson among patients with chronic hepatitis C virus has also been reported (OR: 1.35),27 in addition to associations between digestive diseases and neurodegenerative ones (eg, Parkinson and Alzheimer) through the microbiome-gut-brain axis.27 A possible link between microbiota and digestive diseases such as chronic pancreatitis and pancreatic cancer has also been suggested.28 29 For the respiratory, circulatory and neurological cluster there is evidence of an association between chronic bronchial pathology, particularly asthma and chronic obstructive pulmonary disease (COPD), and the risk of cardiovascular events.30 Longitudinal studies have observed an increased risk of developing Parkinson among individuals suffering from asthma and/or COPD.31 32 The association between asthma and allergy is known, and its coexistence defines a specific phenotype. For the circulatory and digestive cluster, non-alcoholic fatty liver disease has been associated with the development of atrial fibrillation,33 and hepatitis C infection with an increase in the risk of developing cardiovascular and cerebrovascular events.34 In addition, anaemia has been associated with advanced stages of chronic renal diseases and erythropoietin deficiency.35 Iron deficiency anaemia has been associated with an increased risk of stroke36 through thromboembolic phenomena secondary to reactive thrombocytosis. Chronic kidney disease produces auricle injuries (dilatation, fibrosis) and systemic inflammation, both of which can favour the onset and maintenance of atrial fibrillation.37

Strengths and limitations

A major strength of this study is that it has employed a large, high-quality database made up of primary care records representative of the Catalan population aged ≥65 years.18 Patterns of multimorbidity have been studied based on the whole eligible sample. This approach is epidemiologically robust as the prevalence of diseases has been estimated on the whole sample rather than limited to patients with multimorbidity.2 Another strength is that individuals rather than diseases have been considered as the unit of analysis.8 24 Such an approach permits a more realistic and rational monitoring of participants than cohort studies in order to analyse multimorbidity patterns along time. Moreover, the use of different prevalence cut-offs to obtain multimorbidity patterns has allowed the identification of nuclear diseases. We selected the higher prevalence (2%) because the patterns obtained had more clinical representativeness. The inclusion of all the potential diagnoses may have signified a greater complexity that would have hindered both the interpretation of findings and comparison with other studies. Compared with hierarchical clustering, fuzzy c-means cluster analysis is less susceptible to: outliers in the data, choice of distance measure and the inclusion of inappropriate or irrelevant variables.38 Nevertheless, some disadvantages of the method are that different solutions for each set of seed points can occur and there is no guarantee of optimal clustering.11 To minimise this shortcoming, we carried out 100 cluster realisations with different seeds to finally use the average result of all of them. In addition, the method is not efficient when a large number of potential cluster solutions are to be considered.38 To address this limitation, we computed the optimal number of clusters using analytical indexes (online supplementary additional file 1). Other limitations need to be taken into account. The dimensional reduction method performed in this work to reduce data complexity was PCAmix. Such methods can produce low percentages of variation on principal axes and make it difficult to choose the number of dimensions to retain. In order to decide on the most suitable number of dimensions we applied the Karlis-Saporta-Spinaki rule27 which resulted in a 13-dimensional space for the 2% prevalence cut-off. Furthermore, the feasibility of developing clinical practice guidelines in accordance with these patterns might prove difficult due to the dimension of the diseases included in each pattern. Nonetheless, new clinical practice guidelines should consider the diseases that are over-represented (O/E ratio ≥2).

Implications for practice, policy and research

Soft clustering methods offer a new methodological approach to understanding the relationships between specific diseases in individuals. This is an essential step in improving the care of patients and health systems. Analysing multimorbidity patterns permits the identification of patient subgroups with different associated diseases. Our analysis focuses on groups of patients as opposed to diseases. In this case, a disease is present in all patterns (clusters), but in different degrees. In this context, the O/E ratios are used to measure which diseases are over-represented in each cluster and to lead the clinical practice guidelines. The inclusion of varying cut-off points (prevalence filters) of the diseases that form the multimorbidity patterns allowed us to identify common nuclear diseases that remained independent from the prevalence that build such patterns. It is noteworthy that 60% of the population ≥65 years was included in multimorbidity patterns made up of the most prevalent diseases. The rest of the population was grouped into five more specific patterns which permitted their better management. While clinical guidelines are currently aimed at covering the management of the diseases found in the non-specified, youngest-old dominant cluster, there is a lack of information regarding the associated diseases in the other patterns. The challenge will be to refocus healthcare policy from that based on individual diseases, with the accompanying consequences (increased risk of functional decline, poorer quality of life, greater use of services, polypharmacy and increased mortality), to a multimorbidity orientation.39 Further investigation on this topic is called for with particular focus on five major issues. First, the genetic study of these patterns will help the identification of risk subgroups. Second, research is needed on the life style and environmental factors (diet, physical exercise, toxics) associated with such patterns. Third, longitudinal studies should be performed to establish the onset order of the core diseases. Fourth, alternative approaches to handle covariates in cluster analysis should be addressed in future analysis plan. Recently, a new method that allows the covariates to be incorporated into the membership factor to model individual probabilities of cluster membership has been proposed.40 And fifth, the characteristics of the diseases in the same cluster and their potential implication on the quality of primary care should be ascertained in greater detail. Our findings suggest non-hierarchical cluster analysis identified multimorbidity patterns and phenotypes of certain subgroups of patients that were more consistent with clinical practice.
  12 in total

1.  Five-year trajectories of multimorbidity patterns in an elderly Mediterranean population using Hidden Markov Models.

Authors:  Concepción Violán; Sergio Fernández-Bertolín; Marina Guisado-Clavero; Quintí Foguet-Boreu; Jose M Valderas; Josep Vidal Manzano; Albert Roso-Llorach; Margarita Cabrera-Bean
Journal:  Sci Rep       Date:  2020-10-09       Impact factor: 4.379

2.  Divergences on expected pneumonia cases during the COVID-19 epidemic in Catalonia: a time-series analysis of primary care electronic health records covering about 6 million people.

Authors:  Ermengol Coma; Leonardo Méndez-Boo; Núria Mora; Carolina Guiriguet; Mència Benítez; Francesc Fina; Mireia Fàbregas; Elisabet Balló; Francisa Ramos; Manuel Medina; Josep M Argimon
Journal:  BMC Infect Dis       Date:  2021-03-20       Impact factor: 3.090

Review 3.  Definitions and Prevalence of Multimorbidity in Large Database Studies: A Scoping Review.

Authors:  Ying Pin Chua; Ying Xie; Poay Sian Sabrina Lee; Eng Sing Lee
Journal:  Int J Environ Res Public Health       Date:  2021-02-09       Impact factor: 3.390

4.  Identifying multimorbidity patterns of non-communicable diseases in paediatric inpatients: a cross-sectional study in Shanghai, China.

Authors:  Ning Chen; Liang Zhou; Jiaoling Huang; Wenya Yu; Chen Chen; Hua Jin; Xiaoxiao Shi; Zhaohu Yu; Qian Liu; Yan Yang; Zhaoxin Wang; Jianwei Shi
Journal:  BMJ Open       Date:  2021-04-01       Impact factor: 2.692

5.  Multimorbidity patterns of chronic conditions and geriatric syndromes in older patients from the MoPIM multicentre cohort study.

Authors:  Marisa Baré; Susana Herranz; Albert Roso-Llorach; Rosa Jordana; Concepción Violán; Marina Lleal; Pere Roura-Poch; Marta Arellano; Rafael Estrada; Gloria Julia Nazco
Journal:  BMJ Open       Date:  2021-11-15       Impact factor: 2.692

6.  Analysis of multimorbidity networks associated with different factors in Northeast China: a cross-sectional analysis.

Authors:  Jianxing Yu; Yingying Li; Zhou Zheng; Huanhuan Jia; Peng Cao; Yuzhen Qiangba; Xihe Yu
Journal:  BMJ Open       Date:  2021-11-03       Impact factor: 2.692

7.  Identifying multimorbidity profiles associated with COVID-19 severity in chronic patients using network analysis in the PRECOVID Study.

Authors:  Jonás Carmona-Pírez; Antonio Gimeno-Miguel; Kevin Bliek-Bueno; Beatriz Poblador-Plou; Jesús Díez-Manglano; Ignatios Ioakeim-Skoufa; Francisca González-Rubio; Antonio Poncel-Falcó; Alexandra Prados-Torres; Luis A Gimeno-Feliu
Journal:  Sci Rep       Date:  2022-02-18       Impact factor: 4.379

Review 8.  Theoretical explanations for socioeconomic inequalities in multimorbidity: a scoping review.

Authors:  Ludmila Fleitas Alfonzo; Tania King; Emily You; Diana Contreras-Suarez; Syafiqah Zulkelfi; Ankur Singh
Journal:  BMJ Open       Date:  2022-02-23       Impact factor: 2.692

9.  Multimorbidity Profiles and Infection Severity in COVID-19 Population Using Network Analysis in the Andalusian Health Population Database.

Authors:  Jonás Carmona-Pírez; Ignatios Ioakeim-Skoufa; Antonio Gimeno-Miguel; Beatriz Poblador-Plou; Francisca González-Rubio; Dolores Muñoyerro-Muñiz; Juliana Rodríguez-Herrera; Juan Antonio Goicoechea-Salazar; Alexandra Prados-Torres; Román Villegas-Portero
Journal:  Int J Environ Res Public Health       Date:  2022-03-23       Impact factor: 3.390

10.  Multimorbidity clustering of the emergency department patient flow: Impact analysis of new unscheduled care clinics.

Authors:  Adrien Wartelle; Farah Mourad-Chehade; Farouk Yalaoui; Hélène Questiaux; Thomas Monneret; Ghislain Soliveau; Jan Chrusciel; Antoine Duclos; David Laplanche; Stéphane Sanchez
Journal:  PLoS One       Date:  2022-01-31       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.