| Literature DB >> 36016170 |
Athina Bikaki1, Michael Machiorlatti2, Loren Cliff Clark2, Candace A Robledo2, Ioannis A Kakadiaris1.
Abstract
Hispanic communities have been disproportionately affected by economic disparities. These inequalities have put Hispanics at an increased risk for preventable health conditions. In addition, the CDC reports Hispanics to have 1.5× COVID-19 infection rates and low vaccination rates. This study aims to identify the driving factors for COVID-19 vaccine hesitancy of Hispanic survey participants in the Rio Grande Valley. Our analysis used machine learning methods to identify significant associations between medical, economic, and social factors impacting the uptake and willingness to receive the COVID-19 vaccine. A combination of three classification methods (i.e., logistic regression, decision trees, and support vector machines) was used to classify observations based on the value of the targeted responses received and extract a robust subset of factors. Our analysis revealed different medical, economic, and social associations that correlate to other target population groups (i.e., males and females). According to the analysis performed on males, the Matthews correlation coefficient (MCC) value was 0.972. An MCC score of 0.805 was achieved by analyzing females, while the analysis of males and females achieved 0.797. Specifically, several medical, economic factors, and sociodemographic characteristics are more prevalent in vaccine-hesitant groups, such as asthma, hypertension, mental health problems, financial strain due to COVID-19, gender, lack of health insurance plans, and limited test availability.Entities:
Keywords: COVID-19; decision trees; ensemble voting classification; feature selection; high-risk Hispanic population; logistic regression; multiple imputation; support vector machines; vaccine hesitancy
Year: 2022 PMID: 36016170 PMCID: PMC9413740 DOI: 10.3390/vaccines10081282
Source DB: PubMed Journal: Vaccines (Basel) ISSN: 2076-393X
Number of Responses per category.
| Class | n |
|---|---|
| Consented | 307 |
| Total with no missing data | 296 |
| Total who answered for vaccination | 239 |
| Number of Hispanics who answered for vaccination | 220 |
| Number of Hispanic Males and Females at Hidalgo | 190 |
Selected questions are grouped into medical, economic, and sociodemographic factors.
| 1 | Medical | |
| 1.1. | Mental health issues (e.g., depression, anxiety, ADHD) (Y/N) | |
| 1.2. | Hypertension (Y/N) | |
| 1.3. | Diabetes (Y/N) | |
| 1.4. | Asthma (Y/N) | |
| 2 | Economic | |
| 2.1. | Income Class | |
| 2.2. | Difficulty paying for food | |
| 2.3. | Difficulty paying rent or mortgage | |
| 2.4. | Difficulty paying for medical care | |
| 2.5. | Difficulty paying for utility bills | |
| 2.6. | Difficulty paying for transportation or car payments | |
| 2.7. | Difficulty paying for credit card bills | |
| 2.8. | Helping family with money due to unemployment | |
| 2.9. | Family helping you with money due to unemployment | |
| 2.10. | Family or friends moved in with you due to unemployment | |
| 2.11. | Getting food from a food bank | |
| 2.12. | Asking for payment relief for some of your bills | |
| 2.13. | How would you describe the money situation in your household right now? | |
| 3 | Social | |
| 3.1. | Gender | |
| 3.2. | Age Class | |
| 3.3. | Education Level | |
| 3.4. | Language | |
| 3.5. | Civil status | |
| 3.6. | Household size | |
| 3.7. | Insurance Status (Y/N) | |
| 3.8. | Employment Status | |
| 3.9. | COVID-19 tests availability |
Figure 1The grouping of responses to the question “How likely are you to get a COVID-19 vaccine when it becomes available?”.
Variable and rate of missingness.
| Variable(s) | Rate of Missingness |
|---|---|
| Age Class | 1 (0.6%) |
| Education Level, size of household (HH Size), Mental Health | 2 (1.1%) |
| Hypertension, FS1, FS3, FS4, FS8, FS10, FS11, HH Money | 3 (1.6%) |
| Insurance Status, FS2, FS5, FS7 | 4 (2.1%) |
| FS9 | 5 (2.6%) |
| Hypercholesteremia (HCL) | 6 (3.2%) |
| FS6 | 9 (4.7%) |
| Test Availability | 22 (11.6%) |
| Income Class | 28 (14.7%) |
Simple descriptive statistics n (%) for all covariates (n = 190). We denote financial strain as FS, household size as HH; missing data are present on insurance status, HH Size, educational status, mental health, hypertension, HCL, income, FS-FS11, and HH Money. See Table 1 for the exact number of missing data. The complete case and imputed value are the same if no missing data are present.
| Variable | Class | Overall | Complete Case | Imputed Data | ||||
|---|---|---|---|---|---|---|---|---|
| Vaccine Hesitancy | Vaccine Hesitancy | |||||||
| Willing | Hesitant | Willing | Hesitant | |||||
| Gender | M | 63 (33.2) | 52 (31.7) | 11 (42.3) | 0.2861 | 52 (31.7) | 11 (42.3) | 0.2861 |
| F | 127 (66.8) | 112 (68.3) | 15 (57.7) | 112 (68.3) | 15 (57.7) | |||
| Language | Spanish | 30 (15.8) | 26 (15.9) | 4 (15.4) | 0.9514 | 26 (15.9) | 4 (15.4) | 0.9514 |
| English | 160 (84.2 | 138 (84.1) | 22 (84.6) | 138 (84.1) | 22 (84.6) | |||
| Age Class | 18–34 | 79 (41.8) | 68 (41.7) | 11 (42.3) | 0.044 | 69 (42.1) | 11 (42.3) | 0.0435 |
| 35–54 | 71 (37.6) | 57 (35.0) | 14 (53.9) | 57 (34.8) | 14 (53.9) | |||
| 55+ | 39 (20.6) | 38 (23.3) | 1 (3.9) | 38 (23.2) | 1 (3.9) | |||
| Marriage Status | Widowed/Separated/Divorced/Single | 94 (49.5) | 84 (51.2) | 10 (38.5) | 0.2267 | 84 (51.2) | 10 (38.5) | 0.2267 |
| Married/Couple | 96 (50.5) | 80 (48.8) | 16 (61.5) | 80 (48.8) | 16 (61.5) | |||
| Insurance Status | No | 52 (28) | 40 (25.0) | 12 (46.2) | 0.0258 | 41 (25.2) | 12 (46.2) | 0.0279 |
| Yes | 134 (72) | 120 (75.0) | 14 (53.9) | 123 (74.8) | 14 (53.9) | |||
| Test available | Easy/Very Easy | 141 (83.9) | 115 (81.0) | 26 (100) | 0.0069 | 132 (80.4) | 26 (100) | 0.0149 |
| Hard/Very Hard | 27 (16.1) | 27 (19.0) | - | 32 (19.6) | - | |||
| HH Size | 1 | 14 (7.5) | 13 (8.0) | 1 (3.9) | 0.2735 | 13 (8.0) | 1 (3.9) | 0.2783 |
| 2 | 48 (25.5) | 44 (27.2) | 4 (15.4) | 44 (27.1) | 4 (15.4) | |||
| 3+ | 126 (67) | 105 (64.8) | 21 (80.8) | 107 (64.9) | 21 (80.8) | |||
| Educational Status | LT HS | 20 (10.6) | 18 (11.0) | 2 (8.0) | 0.6807 | 18 (11.0) | 3 (10.0) | 0.6872 |
| HS/GED | 56 (29.8) | 46 (28.2) | 10 (40.0) | 46 (28.2) | 10 (40.0) | |||
| Some College AA/AS | 49 (26.1) | 43 (26.4) | 6 (24.0) | 43 (26.3) | 6 (23.1) | |||
| BA/BS or Higher | 63 (33.5) | 56 (34.4) | 7 (28.0) | 56 (34.4) | 7 (26.9) | |||
| Mental Health | No | 151 (80.3) | 126 (77.8) | 25 (96.2) | 0.0287 | 126 (77.1) | 25 (96.2) | 0.0247 |
| Yes | 37 (19.7) | 36 (22.2) | 1 (3.9) | 38 (22.9) | 1 (3.9) | |||
| Hypertension | No | 133 (71.1) | 111 (68.5) | 22 (88.0) | 0.0454 | 113 (68.9) | 23 (88.5) | 0.0399 |
| Yes | 54 (28.9) | 51 (31.5) | 3 (12.0) | 51 (31.1) | 3 (11.5) | |||
| HCL | No | 139 (75.5) | 117 (73.6) | 22 (88.0) | 0.1191 | 121 (73.8) | 22 (85.8) | 0.1991 |
| Yes | 45 (24.5) | 42 (26.4) | 3 (12.0) | 43 (26.2) | 4 (14.2) | |||
| Diabetes | No | 142 (74.7) | 123 (75) | 19 (73.08) | 0.8339 | 123 (75.0) | 19 (73.1) | 0.8339 |
| Yes | 48 (25.3) | 41 (25) | 7 (26.92) | 41 (25.0) | 7 (26.9) | |||
| Asthma | No | 168 (88.4) | 145 (88.41) | 23 (88.46) | 0.9945 | 145 (88.4) | 23 (88.5) | 0.9945 |
| Yes | 22 (11.6) | 19 (11.59) | 3 (11.54) | 19 (11.6) | 3 (11.5) | |||
| Income Class | 0 to USD 39,999 | 97 (59.9) | 85 (62.04) | 12 (48) | 0.0517 | 102 (62.2) | 12 (47.3) | 0.0451 |
| USD 40,000 to USD 69,999 | 44 (27.2) | 32 (23.36) | 12 (48) | 39 (23.6) | 13 (48.5) | |||
| USD 70,000 to USD 99,999 | 12 (7.4) | 12 (8.76) | - | 15 (8.8) | - | |||
| USD 100k+ | 9 (5.6) | 8 (5.84) | 1 (4) | 9 (5.4) | 1 (4.2) | |||
| FS1 | Somewhat hard/Not Hard | 169 (90.4) | 147 (89.63) | 22 (95.65) | 0.3595 | 147 (89.6) | 24 (93.9) | 0.4075 |
| Pay for food | Hard/Very Hard/Cannot Afford | 18 (9.6) | 17 (10.37) | 1 (4.35) | 17 (10.4) | 2 (6.2) | ||
| FS2 | Somewhat hard/Not Hard | 166 (89.3) | 144 (88.89) | 22 (91.67) | 0.6818 | 146 (88.7) | 24 (92.3) | 0.5854 |
| Pay for rent/mortgage | Hard/Very Hard/Cannot Afford | 20 (10.8) | 18 (11.11) | 2 (8.33) | 19 (11.3) | 2 (7.7) | ||
| FS3 | Somewhat hard/Not Hard | 161 (86.1) | 139 (85.28) | 22 (91.67) | 0.3982 | 140 (85.3) | 23 (89.6) | 0.5604 |
| Pay for medical care | Hard/Very Hard/Cannot Afford | 26 (13.9) | 24 (14.72) | 2 (8.33) | 24 (14.7) | 3 (10.4) | ||
| FS4 | Somewhat hard/Not Hard | 160 (85.6) | 138 (84.66) | 22 (91.67) | 0.3621 | 138 (84.2) | 23 (88.1) | 0.6373 |
| Pay for utility bills | Hard/Very Hard/Cannot Afford | 27 (14.4) | 25 (15.34) | 2 (8.33) | 26 (15.9) | 3.1 (11.9) | ||
| FS5 | Somewhat hard/Not Hard | 167 (89.8) | 144 (88.89) | 23 (95.83) | 0.2945 | 146 (89.0) | 24 (91.9) | 0.6368 |
| Pay for transportation/car payments | Hard/Very Hard/Cannot Afford | 19 (10.2) | 18 (11.11) | 1 (4.17) | 18 (11.0) | 2 (8.1) | ||
| FS6 | Somewhat hard/Not Hard | 157 (86.7) | 136 (86.62) | 21 (87.5) | 0.9062 | 140 (85.4) | 22 (85.4) | 0.6861 |
| Pay for credit card bills | Hard/Very Hard/Cannot Afford | 24 (13.3) | 21 (13.38) | 3 (12.5) | 24 (14.6) | 4 (14.6) | ||
| FS7 | No | 147 (79) | 127 (77.91) | 20 (86.96) | 0.3186 | 128 (78.1) | 22 (85.8) | 0.4111 |
| Helping family with money due to unemployment | Yes | 39 (21) | 36 (22.09) | 3 (13.04) | 36 (22.0) | 4 (14.2) | ||
| FS8 | No | 167 (89.3) | 145 (88.96) | 22 (91.67) | 0.6884 | 146 (89.0) | 24 (91.9) | 0.6454 |
| Family helping you with money due to unemployment | Yes | 20 (10.7) | 18 (11.04) | 2 (8.33) | 18 (11.0) | 2 (8.1) | ||
| FS9 | No | 170 (91.9) | 149 (91.41) | 21 (95.45) | 0.5143 | 150 (91.4) | 24 (93.1) | 0.4845 |
| Family or friends moved in with you due to unemployment | Yes | 15 (8.1) | 14 (8.59) | 1 (4.55) | 14 (8.6) | 2 (6.9) | ||
| FS10 | No | 168 (89.8) | 144 (88.34) | 24 (100) | 0.0776 | 144 (87.9) | 26 (100) | 0.0605 |
| Getting food from a food bank | Yes | 19 (10.2) | 19 (11.66) | - | 20 (12.1) | - | ||
| FS11 | No | 170 (90.9) | 147 (90.18) | 23 (95.83) | 0.3688 | 148 (90.2) | 24 (91.9) | 0.5960 |
| Asking for payment relief for some of your bills | Yes | 17 (9.1) | 16 (9.82) | 1 (4.17) | 16 (9.8) | 2 (8.1) | ||
| HH Money | Have to Cut Back/Cannot Make Ends Meet | 47 (25.1) | 39 (23.93) | 8 (33.33) | 0.3213 | 39 (23.9) | 9 (34.6) | 0.2740 |
| Comfortable/Enough But No Extra | 140 (74.9) | 124 (76.07) | 16 (66.67) | 125 (76.1) | 17 (65.4) | |||
| Vaccine | Willing | 164 (86.3) | N/A | N/A | N/A | N/A | N/A | |
| Hesitant | 26 (13.7) | N/A | N/A | N/A | N/A | N/A | ||
Figure 2The system architecture we followed used stacking RFE wrapped with LR, DT, and SVM.
Selected transformations on the answers to specific questions.
| Question | Survey Answers | Transformed Answers |
|---|---|---|
| 2.1 |
10.000 or less 10.000–19.999 20.000–29.999 30.000–39.999 40.000–49.999 50.000–59.999 60.000–69.999 70.000–79.999 80.000–89.999 90.000–99.999 Over 100.000 |
0–39.999 {1,2,3,4} 40.000–59.999 {5,6} Over 60.000 {7,8,9,10,11} |
| 2.2, |
Very hard Hard Somewhat hard Not very hard I cannot afford this anymore |
Hard {1,2,5} Somewhat hard {3,4} |
| 2.13 |
Comfortable with extra Enough but no extra Have to cut back Cannot make ends meet |
Somewhat comfortable {1,2} Have to cut back {3} Cannot make ends meet {4} |
| 3.1 | Free text (age) |
18–34 35–54 Over 55 |
| 3.2 |
Less than high school Some high school High school graduate or GED. Associate’s or technical degree Bachelor’s degree Graduate degree No Answer |
Up to high school studies {1,2,3} Undergraduate/Graduate studies {4,5,6} |
| 3.4 |
Married Widowed Separated Divorced Single, never married A member of an unmarried couple |
Not married {2,3,4,5} Married {1,6} |
| 3.5 |
1 2 3 4 5 6 or more |
1 {1} 2 {2} 3 or more {3,4,5,6} |
| 3.8 |
Very easy Easy Very hard Hard |
Easy {1,2} Hard {3,4} |
Binary logistic regression results–backwards selection. This model correctly predicts 63.89% of hesitant or willing on average over the ten imputation models using gender, mental health, hypertension, diabetes, and response to household money. * p < 0.10; ** 0.01 < p < 0.05.
| Variable | Class | Est (95% CI) | OR (95% CI) | |
|---|---|---|---|---|
| Intercept | - | 1.38 (−2.14, −0.61) | - | 0.0004 |
| Gender | F v. M | −0.76 (−1.68, 0.16) | 0.47 (0.19, 1.18) | 0.1063 |
| MH | Yes vs. No | −1.95 (−4.03, 0.14) | 0.14 (0.02, 1.15) | 0.0671 * |
| HT | Yes vs. No | −1.79 (−3.31, −0.28) | 0.17 (0.04, 0.76) | 0.0205 ** |
| Diabetes | Yes vs. No | 1.07 (−0.10, 2.25) | 2.93 (0.90, 9.49) | 0.0732 * |
| HH Money | Have to Cut Back/Cannot Make End vs. | 1.09 (0.06, 2.12) | 2.98 (1.07, 8.32) | 0.0373 ** |
Classification methods evaluation for our methodology (RFE) and traditional binary logistic regression (BLR). For each case, the common factors and best classification scores are bolded.
| Gender | Method | Factors | TP | FN | TN | FP | Sens | Spec | MCC | |
|---|---|---|---|---|---|---|---|---|---|---|
| RFE | M + F | LR | Asthma, FS3, FS9, FS10, Gender, | 177 | 83 | 1245 | 395 | 0.681 | 0.759 | 0.665 |
| RFE | M + F | DT | Asthma, FS3, FS9, FS10, Gender, | 250 | 10 | 1345 | 295 |
|
|
|
| RFE | M + F | SVM | Asthma, FS3, FS9, FS10, Gender, | 242 | 18 | 836 | 804 | 0.931 | 0.510 | 0.652 |
| RFE | M | DT | 100 | 10 | 520 | 0 |
|
|
| |
| RFE | F | DT | Age Class, Educational Status, FS1, FS3, FS5, FS6, FS7, FS10, HH Size, | 149 | 1 | 938 | 182 |
|
|
|
| BLR | M + F | LR | 150 | 110 | 1121 | 519 | 0.577 | 0.684 | 0.594 | |
| BLR | M + F | DT | 220 | 40 | 1312 | 328 |
|
|
| |
| BLR | M + F | SVM | 135 | 125 | 1185 | 455 | 0.519 | 0.723 | 0.590 |