| Literature DB >> 35144239 |
Lexin Zhou1, Nekane Romero-García1, Juan Martínez-Miranda2, J Alberto Conejero3, Juan M García-Gómez1, Carlos Sáez1.
Abstract
BACKGROUND: The COVID-19 pandemic has led to an unprecedented global health care challenge for both medical institutions and researchers. Recognizing different COVID-19 subphenotypes-the division of populations of patients into more meaningful subgroups driven by clinical features-and their severity characterization may assist clinicians during the clinical course, the vaccination process, research efforts, the surveillance system, and the allocation of limited resources.Entities:
Keywords: COVID-19; Mexico; characterization; clustering; epidemiology; observational; subphenotypes
Mesh:
Year: 2022 PMID: 35144239 PMCID: PMC9098229 DOI: 10.2196/30032
Source DB: PubMed Journal: JMIR Public Health Surveill ISSN: 2369-2960
Figure 1Data set preprocessing flowchart for data in Mexico from January 13, 2020 to September 30, 2020.
List of variables contained in the data set for the study cases; they were originally coded in Spanish and translated into English by the authors for this work. Section 4 in Multimedia Appendix 1 provides the description of the original data set.
| Variable | Description | Type (value/format) |
| Sex | Sex of the person (defined in the metadata published by the Mexican government) | Discrete (Male, Female) |
| Age | Age in years at the time of the admission | Numerical integer |
| Pregnant | Presence of pregnancy | Discrete (Yes, No) |
| Obesity | Presence of obesity | Discrete (Yes, No) |
| Smoke | Presence of smoking habit | Discrete (Yes, No) |
| Pneumonia | Presence of pneumonia | Discrete (Yes, No) |
| Diabetes | Presence of diabetes | Discrete (Yes, No) |
| COPDa | Presence of chronic obstructive pulmonary disease | Discrete (Yes, No) |
| Asthma | Presence of asthma | Discrete (Yes, No) |
| INMUSUPRb | Presence of immunosuppression | Discrete (Yes, No) |
| Hypertension | Presence of hypertension | Discrete (Yes, No) |
| CKDc | Presence of chronic kidney disease | Discrete (Yes, No) |
| Cardiovascular | Presence of cardiovascular | Discrete (Yes, No) |
| Other disease | Presence of other diseases | Discrete (Yes, No) |
| Hospitalized | Whether a patient was hospitalized or ambulant | Discrete (Yes, No) |
| Intubated | Whether a patient was intubated | Discrete (Yes, No) |
| ICUd | Whether a patient had been in an intensive care unit | Discrete (Yes, No) |
| Other case contact | Whether a patient was detected to have contact with other coronavirus cases | Discrete (Yes, No) |
| Result_lab | Coronavirus test result | Discrete (Positive SARS-CoV-2, Non-Positive SARS-CoV-2, Pending, Inadequate result, Not Applied) |
| Admission_date | The date when a patient attended the care unit (not necessarily hospitalized) | Date (dd/mm/yyyy) |
| Symptoms_date | The date of symptom onset | Date (dd/mm/yyyy) |
| Death_date | The date of death | Date (dd/mm/yyyy) |
| Entity_um | The state where a patient received attention from a medical unit | Discrete |
| TCIe | The type of institution in the National Health System that provided medical care | Discretef |
| Outcomeg | Death result of the patient (we used this to calculate mortality and recovery rate) | Discrete (Deceased, Non-Deceased) |
| Survival>15daysg | Whether a patient survived more than 15 days from symptoms onset | Discrete (Yes, No) |
| Survival>30daysg | Whether a patient survived more than 30 days from symptoms onset | Discrete (Yes, No) |
| Survival>15days_deceasedg | Whether a deceased patient survived more than 15 days from symptom onset | Discrete (Yes, No) |
| Survival>30days_deceasedg | Whether a deceased patient survived more than 30 days from symptom onset | Discrete (Yes, No) |
| From Symptom to Hospital daysg | The days that it took between symptom onset and hospitalization | Numerical integer |
aCOPD: chronic obstructive pulmonary disease.
bINMUSUPR, immunosuppression.
cCKD: chronic kidney disease.
dICU, intensive care unit.
eTCI: type of clinical institution.
fIMSS (Mexican Institute of Social Security), SSA (Secretariat of Health), ISSSTE (Institute for Social Security and Services for State Workers), PRIVATE, PEMEX (Mexican Petroleum Institution), STATE, SEMAR (Secretariat of the Navy), SEDENA (Secretariat of the National Defense), IMSS-BIENESTAR, UNIVERSITARY, MUNICIPAL, RED CROSS, DIF (National System for Integral Family Development).
gVariables that were created by combining or transforming other variables in the original data set.
Figure 2Research methodology flowchart. LOESS: locally estimated scatterplot smoothing; MCA: multiple correspondence analysis; PCA: principal component analysis.
Figure 3Principal component analysis (PCA) of the 56 age-sex clusters, meta-clustering results, and locally estimated scatterplot smoothing (LOESS)–based delineations for 7 severity ranges: (A) PCA from 56 age-sex stratified clusters; (B) scatterplot of the 11 meta-clusters (MCs) defined from the 56 clusters; (C) LOESS scatterplot for mortality; (D) LOESS scatterplot for intensive care unit (ICU) admission; (E) LOESS scatterplot for intubation; (F) LOESS scatterplot for survival at 15 days among deceased patients; (G) LOESS scatterplot for hospitalization; and (H) LOESS scatterplot for days from symptom onset to hospitalization. All the scatter plots share coordinates. Each subgroup is denoted using the following abbreviation: [AgeGroup][Sex][ClusterID].
Figure 4Heat map showing the quantified characteristics among 56 age-sex–specific clusters of the 11 meta-clusters (MCs) of data collected in Mexico between January 13, 2020 and September 30, 2020.; the size of each cluster (n) was categorized into 6 ranges. CKD: chronic kidney disease; COPD: chronic obstructive pulmonary disease; ICU: intensive care unit; RR: recovery rate.
Distribution of age, features, and comorbidities with the quantitative description of demographic features, treatment, and epidemiological characteristics among the 11 meta-clusters (MCs) based on the arithmetic mean presuming that each age-sex cluster is representative of its population; thus, the size (n) of each age-sex cluster was ignored.
| Characteristics | MC1 | MC2 | MC3 | MC4 | MC5 | MC6 | MC7 | MC8 | MC9 | MC10 | MC11 | |
| Age-sex clusters (total n=56), n | 8 | 6 | 8 | 8 | 3 | 7 | 4 | 3 | 5 | 3 | 1 | |
| Patients in the MC (total n=778,892), n | 407,005 | 13,826 | 11,3537 | 11,1950 | 42280 | 21642 | 9239 | 9687 | 40557 | 7777 | 1192 | |
|
| ||||||||||||
|
| Age (years), mean | 43.4 | 18 | 39.8 | 44.8 | 46.4 | 56.3 | 65.3 | 68.7 | 66.8 | 68.2 | 76.4 |
|
| Female sex, % | 50 | 50 | 50 | 50 | 33.33 | 42.86 | 50 | 33.33 | 60 | 66.67 | 0 |
|
| ||||||||||||
|
| <18 | 25 | 66.67 | 12.5 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 18-49 | 25 | 33.33 | 50 | 25 | 66.67 | 28.57 | 0 | 0 | 0 | 0 | 0 |
|
| 50-64 | 25 | 0 | 37.5 | 25 | 33.33 | 42.86 | 50 | 33.33 | 40 | 33.33 | 0 |
|
| >64 | 25 | 0 | 0 | 25 | 0 | 28.57 | 50 | 66.67 | 60 | 66.67 | 100 |
| Pregnancy (yes), % | 0.49 | 1.28 | 0.3 | 0.8 | 0.33 | 0.26 | 0.01 | 0 | 0.01 | 0 | 0 | |
|
| ||||||||||||
|
| Obesity | 0.44 | 11.78 | 59.88 | 12.01 | 75.54 | 18.89 | 20.15 | 19.05 | 25.94 | 50.51 | 23.99 |
|
| Smoker | 0 | 9.67 | 34.09 | 0.8 | 10.77 | 8.1 | 4.38 | 38.03 | 0.22 | 42.02 | 76.85 |
|
| ||||||||||||
|
| Diabetes | 0 | 4.42 | 4.5 | 39.06 | 57.14 | 35.62 | 76.44 | 20.45 | 95 | 61.23 | 31.96 |
|
| COPDa | 0 | 4.51 | 0 | 0.73 | 0 | 5.1 | 2.03 | 43.91 | 2.36 | 37.46 | 91.86 |
|
| Asthma | 0.37 | 3.2 | 18.17 | 1.15 | 2.03 | 2.69 | 0.49 | 25.72 | 0.08 | 19.79 | 19.63 |
|
| INMUSUPRb | 0 | 13.03 | 0.1 | 1.4 | 0 | 40.38 | 0 | 0.91 | 0 | 0.03 | 0 |
|
| Hypertension | 0 | 9.13 | 7.59 | 41.15 | 68.79 | 46.79 | 83.71 | 34.38 | 96.33 | 77.86 | 52.94 |
|
| Other disease | 0 | 38.32 | 0.3 | 1.22 | 0 | 48.63 | 1.85 | 1.73 | 0 | 0.82 | 0 |
|
| Cardiovascular | 0 | 17.52 | 0.1 | 2.46 | 2.17 | 14.25 | 21.64 | 4.73 | 5.52 | 26.51 | 27.77 |
|
| CKDc | 0 | 4.27 | 0 | 3.87 | 0.22 | 31.84 | 81.67 | 1.04 | 1.92 | 1.28 | 1.01 |
|
| ||||||||||||
|
| Hospitalized | 19.87 | 46.08 | 14.15 | 42.22 | 44.91 | 58.56 | 70.72 | 57.17 | 60.8 | 60.11 | 70.47 |
|
| ICUd | 1.59 | 9.82 | 1.23 | 4.48 | 5.06 | 4.01 | 4.87 | 4.81 | 5.56 | 5.24 | 5.62 |
|
| Intubated | 3.44 | 9.03 | 2.18 | 7.9 | 8.46 | 12.12 | 13.38 | 11.5 | 12.13 | 12.42 | 12.84 |
| Pneumonia, % | 12.36 | 37 | 9.08 | 37.18 | 41.52 | 42.44 | 52.14 | 43.55 | 48.1 | 46.8 | 53.61 | |
| Recovery, % | 90.27 | 91.37 | 95.22 | 82.81 | 81.3 | 66.94 | 53.96 | 66.43 | 64.95 | 64.42 | 55.96 | |
| Survival >15 days, % | 93.46 | 93.73 | 97.01 | 88.39 | 87.27 | 76.34 | 65.37 | 77.1 | 75.26 | 75.34 | 67.28 | |
| Survival >30 days, % | 90.74 | 91.8 | 95.5 | 83.74 | 82.14 | 68.26 | 55.71 | 68.2 | 66.33 | 65.88 | 56.96 | |
| Survival >15 days (deceased), % | 30.76 | 28.64 | 36.21 | 31.09 | 31.59 | 28.46 | 24.8 | 31.7 | 29.73 | 31.01 | 25.71 | |
| Survival >30 days (deceased), % | 6.61 | 4.64 | 5.93 | 5.79 | 4.52 | 4.2 | 3.82 | 5.26 | 4.04 | 4.24 | 2.29 | |
| Time from symptom onset to hospitalization (days), mean | 3.78 | 3.2 | 4.87 | 4.37 | 5.21 | 4.48 | 4.3 | 4.85 | 4.92 | 4.94 | 4.82 | |
| Other case contact, % | 45.84 | 40.23 | 51.18 | 36.6 | 36.04 | 27.39 | 20.9 | 27.88 | 27.56 | 28 | 20.89 | |
aCOPD: chronic obstructive pulmonary disease.
bINMUSUPR: immunosuppression.
cCKD: chronic kidney disease.
dICU: intensive care unit.
Figure 5Main features of the 11 resultant meta-clusters, sorted by recovery, in addition to the thresholds for the different severity outcomes and input variable categories; based on data collected in Mexico between January 13, 2020 and September 30, 2020. COPD: chronic obstructive pulmonary disease; ICU: intensive care unit; INMUSUPR: immunosuppression.
Figure 6Heat maps of the probability distribution of the (A) 56 age-sex specific clusters and (B) 11 meta-clusters (MCs) for each Mexican state where patients received treatment or medical attention, using data collected in Mexico between January 13, 2020 and September 30, 2020. Rows represent the clusters, and columns represent the states and are arranged according to a hierarchical clustering of their values. We compared the clusters' distribution within each age range to circumvent any correlation or association with comorbidities and habits.
Figure 7Heat maps of the probability distribution of the (A) 56 age-sex specific clusters and (B) 11 MCs for each type of clinical institution (TCI), using data collected in Mexico between January 13, 2020 and September 30, 2020. Rows represent the clusters, and columns represent the TCIs and are arranged according to a hierarchical clustering of their values. We compared the clusters' distribution within each age range to circumvent any correlation or association with comorbidities and habits. DIF: National System for Integral Family Development; IMSS: Mexican Institute of Social Security; ISSSTE: Institute for Social Security and Services for State Workers; PEMEX: Mexican Petroleum Institution; SEDENA: Secretariat of the National Defense; SEMAR: Secretariat of the Navy; SSA: Secretariat of Health.