Douglas S Lee, Chloe X Wang, Finlay A McAlister, Shihao Ma, Anna Chu, Paula A Rochon, Padma Kaul, Peter C Austin, Xuesong Wang, Sunil V Kalmady, Jacob A Udell, Michael J Schull, Barry B Rubin, Bo Wang.
Abstract
BACKGROUND: SARS-CoV-2 infection rates are high among residents of long-term care (LTC) homes. We used machine learning to identify resident and community characteristics predictive of SARS-CoV-2 infection.
Keywords: Artificial intelligence; Elderly; Long-term care; Machine Learning; Population health; Quality of care; Respiratory infection; SARS-CoV-2
Year: 2022 PMID: 35072145 PMCID: PMC8763396 DOI: 10.1016/j.lana.2021.100146
Source DB: PubMed Journal: Lancet Reg Health Am ISSN: 2667-193X
Figure 1. Study flow diagram.
Baseline characteristics of study cohort (late phase, N = 31,238).
| Characteristic | Units or range | Median (IQR) or n (%) |
|---|---|---|
| Age | Years | 86 (78, 91) |
| Male | % | 9,555 (31%) |
| Education secondary school or less | Yes | 12,412 (40%) |
| | No | 5194 (17%) |
| | Unknown | 13,632 (44%) |
| Community size | 1,500,000+ | 7291 (23%) |
| | 500,000 to 1,499,999 | 7005 (22%) |
| | 100,000 to 499,999 | 7966 (26%) |
| | 10,000 to 99,999 | 4721 (15%) |
| | ≤ 10,000 | 4144 (13%) |
| Census subdivision population density | Density per km² | 480 (146, 1428) |
| Postal code population size | Count | 129 (88, 215) |
| Dissemination area population size | Count | 664 (523, 1017) |
| Dissemination area population density | Density per km² | 1986 (710, 3718) |
| Regional sex distribution | Male population size | 45.7 (42.5, 48.8) |
| | Female population size | 54.1 (51.2, 57.5) |
| Age-stratified distribution, men | % who are 0 to 44 years | 21.8 (16.6, 27.0) |
| | % who are 45 to 64 years | 11.9 (9.8, 14.1) |
| | % who are ≥ 65 years | 11.3 (7.7, 15.3) |
| Age-stratified distribution, women | % who are 0 to 44 years | 21.7 (16.5, 26.3) |
| | % who are 45 to 64 years | 13.2 (11.0, 15.3) |
| | % who are ≥ 65 years | 17.4 (9.8, 27.6) |
| Regional family units, % of population | % Never married | 23.1 (18.5, 28.2) |
| | % Married or common-law | 50.5 (42.8, 59.7) |
| | % Separated or divorced | 9.1 (7.1, 12.1) |
| | % Widowed | 13.5 (5.9, 21.7) |
| Ontario marginalization index | Ethnic concentration score | -0.49 (-0.81, 0.19) |
| | Deprivation factor score | -0.21 (-0.68, 0.52) |
| Visible minorities | % | 9.9 (2.9, 24.9) |
| Occupied private dwellings that are apartments in buildings and other attached dwellings | % | 54.2 (20.2, 97.2) |
| Post-secondary education | % with none | 81.7 (66.1, 99.3) |
| Languages spoken | English but not French | 90.6 (84.1, 93.3) |
| | English and French | 7.3 (5.0, 12.4) |
| | French but not English | 0.0 (0.0, 0.0) |
| | Neither English nor French | 0.7 (0.0, 2.0) |
| Non-immigrants | % | 80.8 (66.4, 91.1) |
| % of immigrants who immigrated in specified years | Prior to 1981 | 7.4 (4.4, 12.2) |
| | 1981 to 1990 | 2.1 (0.0, 4.0) |
| | 1991 to 2000 | 2.4 (0.0, 6.2) |
| | 2001 to 2010 | 2.5 (0.0, 6.4) |
| | 2011 to 2016 | 0.0 (0.0, 3.0) |
| Non-permanent residents | % | 0.0 (0.0, 1.7) |
| % Employed in DA by sex | Men | 60.0 (51.7, 68.8) |
| | Women | 53.4 (44.6, 62.0) |
| Bed size | # of beds | 152 (104, 197) |
| Time in residence | Days | 608 (262, 1211) |
| Payment source | Government | 27,531 (88%) |
| | Private | 393 (1%) |
| | Self | 22,949 (73%) |
| | Unknown | 1,823 (6%) |
| COPD duration, if present | Years | 9.7 (4.7, 17.5) |
| Coronary revascularization, if received | Time since last procedure | 11.6 (7.1, 16.5) |
| Dementia duration, if present | Years | 5.0 (2.8, 8.2) |
| ED visits with a respiratory diagnosis | # of visits in prior year | 0 (0, 0) |
| ED visits any reason | # of visits in prior year | 0 (0, 1) |
| ED visits with transfer to residential care | # of visits in prior year | 0 (0, 0) |
| Frailty | Hospital Frailty Risk Score | -1 (-1, 5.9) |
| Hypertension duration, if present | Years | 19.9 (13.6, 26.4) |
| Heart failure duration, if present | Years | 5.7 (2.7, 11.3) |
| Hospitalizations for respiratory infection | # of hospitalizations since 2000 | 0 (0, 0) |
| Primary care physician visits | # in prior year | 12 (11, 13) |
| ADL scale – long form | Range: 0–28 | 19 (14, 22) |
| ADL self-performance hierarchy | Range: 0=independent to 6=total dependence | 4 (3, 5) |
| Aggressive behavior scale | Range: 0=low to 12=high | 0 (0, 2) |
| CHESS (Changes in Health, End-stage disease, Symptoms & Signs) | Range: 0=no instability to 5=highest instability | 0 (0, 1) |
| Cognitive performance scale | Range: 0=lowest risk to 6=highest risk | 3 (2, 4) |
| Dehydration risk | Present | 2661 (9%) |
| Depression rating scale | Range: 0–14 | 1 (0, 3) |
| Index of social engagement | Range: 0=low level of social engagement, 6=high level | 3 (2, 4) |
| Pressure ulcer risk scale | Range: 0=lowest risk to 8=highest risk | 2 (1, 3) |
| Stage 2+ pressure ulcer | Protocol initiated=Yes | 1576 (5%) |
| Undernutrition | Medium or high risk | 5195 (17%) |
| Urinary incontinence | Present | 21,298 (68%) |
| Bilirubin | µmol/L | 8 (5, 11) |
| Calcium, total | mmol/L | 2.30 (2.22, 2.39) |
| GFR | mL/min/1.73 m² | 65 (49, 80) |
| Hemoglobin | g/L | 122 (111, 132) |
| High density lipoprotein | mmol/L | 1.22 (0.99, 1.52) |
| Iron saturation | Percent | 0.22 (0.16, 0.30) |
| Lymphocyte count | ×10⁹/L | 1.6 (1.2, 2.1) |
| Platelet count | ×10⁹/L | 231 (188, 281) |
| Serum albumin | g/L | 38 (35, 41) |
| Serum potassium level | mmol/L | 4.3 (4.0, 4.6) |
| Magnesium | mmol/L | 0.84 (0.77, 0.90) |
| Thyroid stimulating hormone | mIU/L | 2.03 (1.31, 3.04) |
| Thyroxine (Free T4) | pmol/L | 14.0 (12.0, 16.0) |
| Total cholesterol | mmol/L | 4.01 (3.29, 4.87) |
| Total:HDL cholesterol ratio | | 3.20 (2.58, 4.00) |
| Total iron binding capacity | µmol/L | 51.8 (44.2, 59.0) |
| Urate | µmol/L | 324 (265, 393) |
ADL = activities of daily living, COPD = chronic obstructive pulmonary disease, ED = emergency department, GFR = glomerular filtration rate, HDL = high density lipoprotein
Community size unknown for 111 individuals (0.4%). Missing laboratory data varied from 0.2% for eGFR to 92.5% for ionized calcium.
Payment source: a resident may have more than one source of funding.
Top 50 features associated with SARS-CoV-2 infection in late wave 1*.
| # | Characteristic and description | Type of feature | Spearman's rank correlation | Relationship with SARS-CoV-2 infection |
|---|---|---|---|---|
| 1 | Community size | Community | -0.8142 | Larger community size correlated with higher infection risk |
| 2 | Census subdivision population density | Community | 0.8917 | Higher population density correlated with higher infection risk |
| 3 | Number of long-term care beds at the home | LTC-related | 0.1993 | More beds correlated with higher infection risk |
| 4 | Percent never married† | Community | 0.8335 | Higher % never married correlated with higher infection risk |
| 5 | Percent visible minority population† | Community | 0.8962 | Higher % visible minority correlated with higher infection risk |
| 6 | Female employment rate† | Community | 0.2031 | Higher % women employed correlated with higher infection risk |
| 7 | Percent male 45 to 64 years† | Community | 0.8170 | Higher % men in age group correlated with higher infection risk |
| 8 | Hospital frailty risk score | Comorbidity | 0.7948 | Increased frailty correlated with higher infection risk |
| 9 | Total female population size† | Community | -0.8909 | Greater female population correlated with lower infection risk |
| 10 | Percent immigrated 1981 to 1990† | Community | 0.8960 | Higher % immigrant from 1981 to 1990 correlated with higher infection risk |
| 11 | Percent non-permanent residents† | Community | 0.7115 | Higher % non-permanent residents correlated with higher infection risk |
| 12 | Dissemination area population size† | Community | 0.2995 | Larger population size in DA correlated with higher infection risk |
| 13 | Percent who speak French and not English† | Community | 0.0160 | Higher % French-only speakers correlated with higher infection risk |
| 14 | Male employment rate† | Community | 0.7607 | Higher employment rate correlated with higher infection risk |
| 15 | Percent immigrated prior to 1981† | Community | 0.4582 | Higher % immigrant prior to 1981 correlated with higher infection risk |
| 16 | Postal code population size | Community | 0.5528 | Larger population in postal code region correlated with higher infection risk |
| 17 | ONMARG ethnic concentration score† | Community | -0.3618 | Lower ethnic concentration correlated with higher infection risk |
| 18 | ONMARG deprivation factor score† | Community | 0.6639 | Greater community deprivation correlated with higher infection risk |
| 19 | Dissemination area population density† | Community | 0.7251 | Higher population density in DA correlated with higher infection risk |
| 20 | Percent who speak English and French† | Community | 0.7156 | Higher % speakers of English and French correlated with higher infection risk |
| 21 | Percent of occupied private dwellings that are apartments in buildings and other attached dwellings† | Community | -0.7036 | Higher % dwellings that are apartments in buildings and other attached dwellings correlated with lower infection risk |
| 22 | Total male population size† | Community | 0.7493 | Higher % male population correlated with higher infection risk |
| 23 | Surname-based Chinese ethnicity | Demographic | -0.3654 | Chinese ethnicity by surname correlated with lower infection risk |
| 24 | Secondary school education only | Demographic | -0.8454 | Less than secondary education correlated with higher infection risk |
| 25 | Percent female 0 to 44 years† | Community | -0.8770 | Higher % women in age group correlated with lower infection risk |
| 26 | Percent who speak English and not French† | Community | -0.0173 | Higher % speakers of English correlated with lower infection risk |
| 27 | Percent who speak neither English nor French† | Community | -0.6941 | Higher % non-English, non-French speakers correlated with lower infection risk |
| 28 | Duration of hypertension | Comorbidity | -0.7811 | Longer duration of hypertension correlated with lower infection risk |
| 29 | Percent female 45 to 64 years† | Community | 0.8562 | Higher % women in age group correlated with higher infection risk |
| 30 | Percent without post-secondary education† | Community | 0.6004 | Higher % without post-secondary education correlated with higher infection risk |
| 31 | Number of days in long-term care | LTC-related | -0.9633 | More days in long-term care correlated with lower infection risk |
| 32 | Magnesium | Laboratory | 0.1939 | Higher magnesium level correlated with higher infection risk |
| 33 | Percent immigrated between 2001 and 2010† | Community | 0.5002 | Higher % immigrant from 2001 to 2010 correlated with higher infection risk |
| 34 | Percent of population male 65+ years† | Community | -0.1600 | Higher % men in age group correlated with lower infection risk |
| 35 | Thyroxine (Free T4) | Laboratory | -0.9267 | Higher thyroxine correlated with lower infection risk |
| 36 | Bilirubin | Laboratory | 0.7387 | Higher bilirubin correlated with lower infection risk |
| 37 | ED visits resulting in transfer to residential care | Comorbidity | 0.5886 | Higher # of ED visits in prior year correlated with higher infection risk |
| 38 | Total calcium | Laboratory | -0.9539 | Lower calcium correlated with higher infection risk |
| 39 | Depression Rating Scale | Functional | -0.5050 | Lower depression score correlated with higher infection risk |
| 40 | Percent immigrated 2011 to 2016† | Community | -0.8057 | Higher % immigrant from 2011 to 2016 correlated with higher infection risk |
| 41 | Percent widowed† | Community | -0.7014 | Higher % widowed correlated with lower infection risk |
| 42 | Thyroid stimulating hormone | Laboratory | 0.4098 | Higher TSH correlated with higher infection risk |
| 43 | eGFR | Laboratory | 0.5391 | Higher eGFR correlated with higher infection risk |
| 44 | Percent separated or divorced† | Community | 0.7892 | Higher % separated or divorced correlated with higher infection risk |
| 45 | Percent non-immigrants† | Community | -0.3616 | Higher % non-immigrants correlated with lower infection risk |
| 46 | Number of prior family doctor visits | Comorbidity | 0.5159 | More visits in year prior to index date correlated with higher infection risk |
| 47 | Pressure Ulcer Risk Scale (PURS) | Functional | 0.8601 | Higher pressure ulcer risk correlated with higher infection risk |
| 48 | Duration of dementia | Comorbidity | 0.4323 | Longer duration of dementia correlated with higher infection risk |
| 49 | Number of ED visits in prior year | Comorbidity | 0.7707 | Higher # of ED visits in prior year correlated with higher infection risk |
| 50 | Percent married or common-law | Community | 0.0868 | Higher % married or common law correlated with higher infection risk |
ED = emergency department, LTC = Long-term care, ONMARG = Ontario Marginalization Index
* All features have a p-value < 0.001 except percent of population with knowledge of French and not English (p = 0.011).
† Based on the 2016 Canadian Census and at the dissemination area (DA) level. DAs are small, relatively stable geographic areas with population sizes of 400 to 700 people, bounded by the road network and determined from Statistics Canada's Census of Population Program.
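As an illustration of the statistic reported in the table above, the following minimal sketch computes Spearman's rank correlation as the Pearson correlation of average ranks. The function names `average_ranks` and `spearman` and all data values are hypothetical and are not taken from the study; the sketch also makes no assumption about which quantity the authors correlated against each feature.

```python
from math import sqrt

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sqrt(sum((a - mx) ** 2 for a in rx))
    sy = sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)

# Hypothetical example: population density of five areas and an
# illustrative risk value for each (not study data).
density = [120, 450, 900, 1500, 3200]
risk = [0.01, 0.03, 0.02, 0.08, 0.15]
rho = spearman(density, risk)
print(round(rho, 4))
```

A positive value means the two quantities tend to rise together in rank order, and a negative value means one tends to fall as the other rises.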
Performance of ML models for SARS-CoV-2 infection in long-term care residents in late wave 1 in Ontario
| Population | C-index (95% CI) | Sensitivity (95% CI) | Specificity (95% CI) |
|---|---|---|---|
| Tested residents only | 0.934 (0.915, 0.951) | 0.887 (0.854, 0.915) | 0.869 (0.857, 0.878) |
| Tested + untested residents | 0.934 (0.918, 0.949) | 0.865 (0.831, 0.908) | 0.862 (0.854, 0.868) |
Untested residents were considered to be test negative; c-index represents discrimination of SARS-CoV-2 positive vs. test negative or not tested
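The three metrics in the table above can be sketched as follows for a binary outcome. This is a minimal illustration only: the labels, scores, and the 0.5 threshold are hypothetical, the function names are not from the study, and the authors' actual model, threshold, and confidence-interval method are not reproduced. For a binary outcome the c-index equals the proportion of positive/negative pairs in which the positive case receives the higher score, with ties counted as one half. Following the table footnote, an untested resident would be coded as 0 in the "tested + untested" analysis.

```python
def c_index(labels, scores):
    """Proportion of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    concordant = 0.0
    for p in pos:
        for q in neg:
            if p > q:
                concordant += 1.0
            elif p == q:
                concordant += 0.5
    return concordant / (len(pos) * len(neg))

def sensitivity_specificity(labels, scores, threshold):
    """Classify score >= threshold as positive and return (sensitivity, specificity)."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical data: 1 = test positive, 0 = test negative (or untested).
labels = [1, 1, 1, 0, 0, 0, 0, 0]
scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2, 0.2, 0.1]

c = c_index(labels, scores)
sens, spec = sensitivity_specificity(labels, scores, threshold=0.5)
print(round(c, 3), round(sens, 3), round(spec, 3))
```

Unlike the c-index, sensitivity and specificity depend on the chosen threshold, so the two are best read together.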
Positivity for SARS-CoV-2 based on risk quartiles from full model (all features + Ontario Laboratories Information System [OLIS]) in late wave 1
| Test positivity (full models) | Total N | Positives, n (%) | Odds ratio (95% CI) | p-value |
|---|---|---|---|---|
| Tested residents only | | | | |
| Risk quartile 1 (lowest) | 1483 | <6 | Reference | NA |
| Risk quartile 2 | 1448 | 5–9 | 2.40 | 0.221 |
| Risk quartile 3 | 1728 | 17 (1.0%) | 4.90 | 0.006 |
| Risk quartile 4 | 1594 | 300 (18.8%) | 114.26 | < 0.001 |
| Tested + untested residents | | | | |
| Risk quartile 1 (lowest) | 2174 | ≤6 | Reference | NA |
| Risk quartile 2 | 2333 | 3–7 | 2.80 | 0.291 |
| Risk quartile 3 | 2527 | 20 (0.8%) | 8.66 | < 0.001 |
| Risk quartile 4 | 2416 | 299 (12.4%) | 153.30 | < 0.001 |
NA = not applicable
Untested residents were considered to be test negative; odds ratios represent odds of being SARS-CoV-2 test positive vs. test negative or not tested
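As a worked check on how an odds ratio is formed from counts like those in the table above, the sketch below compares risk quartile 4 with risk quartile 3. It assumes the tabulated odds ratios are unadjusted, which the table does not state. Quartile 3 is used as the comparator here only because the quartile 1 and 2 counts are suppressed; the table itself uses quartile 1 as the reference. The function names are hypothetical.

```python
def odds(positives, total):
    """Odds of a positive result: positives divided by non-positives."""
    return positives / (total - positives)

def odds_ratio(pos_a, total_a, pos_b, total_b):
    """Odds in group A relative to odds in group B."""
    return odds(pos_a, total_a) / odds(pos_b, total_b)

# Tested residents only: quartile 4 (300 of 1594) vs quartile 3 (17 of 1728).
or_tested = odds_ratio(300, 1594, 17, 1728)

# Tested + untested residents: quartile 4 (299 of 2416) vs quartile 3 (20 of 2527).
or_all = odds_ratio(299, 2416, 20, 2527)

print(round(or_tested, 2), round(or_all, 2))
```

Under that assumption the results, about 23.3 and 17.7, are close to the ratios of the tabulated odds ratios for quartile 4 and quartile 3 (114.26 / 4.90 ≈ 23.3 and 153.30 / 8.66 ≈ 17.7).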
Figure 2. SARS-CoV-2 positivity rates by long-term care home during late wave 1.
Figure 3. Proportion of residents in the highest risk quartile by long-term care home in late wave 1.
Figure 4a. Long-term care home positivity rates versus proportion of residents in the two highest quartiles during early wave 1.
Figure 4b. Long-term care home positivity rates versus proportion of residents in the two highest quartiles during late wave 1.