Literature DB >> 33738316

A Risk Score to Predict Clostridioides difficile Infection.

Laurie Aukes1, Bruce Fireman1, Edwin Lewis1, Julius Timbol1, John Hansen1, Holly Yu2, Bing Cai2, Elisa Gonzalez2, Jody Lawrence2, Nicola P Klein1.   

Abstract

BACKGROUND: Clostridioides difficile infection (CDI) is a major cause of severe diarrhea. In this retrospective study, we identified CDI risk factors by comparing demographic and clinical characteristics for Kaiser Permanente Northern California members ≥18 years old with and without laboratory-confirmed incident CDI.
METHODS: We included these risk factors in logistic regression models to develop 2 risk scores that predict future CDI after an Index Date for Risk Score Assessment (IDRSA), marking the beginning of a period for which we estimated CDI risk.
RESULTS: During May 2011 to July 2014, we included 9986 CDI cases and 2 230 354 members without CDI. The CDI cases tended to be older, female, white race, and have more hospitalizations, emergency department and office visits, skilled nursing facility stays, antibiotic and proton pump inhibitor use, and specific comorbidities. Using hospital discharge as the IDRSA, our risk score model yielded excellent performance in predicting the likelihood of developing CDI in the subsequent 31-365 days (C-statistic of 0.848). Using a random date as the IDRSA, our model also predicted CDI risk in the subsequent 31-365 days reasonably well (C-statistic 0.722).
CONCLUSIONS: These results can be used to identify high-risk populations for enrollment in C difficile vaccine trials and facilitate study feasibility regarding sample size and time to completion.
© The Author(s) 2021. Published by Oxford University Press on behalf of Infectious Diseases Society of America.

Entities:  

Keywords:  Clostridioides difficile risk score model

Year:  2021        PMID: 33738316      PMCID: PMC7953654          DOI: 10.1093/ofid/ofab052

Source DB:  PubMed          Journal:  Open Forum Infect Dis        ISSN: 2328-8957            Impact factor:   3.835


Clostridioides difficile infection (CDI) accounts for a large portion of nosocomial morbidity and mortality. In 2011, CDI caused an estimated half a million infections in the United States with approximately 29 000 associated deaths [1]. In general, the incidence and virulence of CDI have been increasing. Hospitalizations due to CDI increased from 8.8 per 1000 nonpregnant adults in 2004 to 13.7 in 2013 [2]. Antibiotics, hospitalization, and older age are important risk factors [3, 4] which have been associated with this increase; however, CDI also occurs without these risk factors (eg, pregnant women, emergency department [ED], outpatient, and nonhealthcare settings) [4]. In contrast, limited information exists regarding the burden of CDI in specific populations such as individuals with underlying medical conditions and those receiving care in the ED and outpatient settings. The clinical importance of C difficile has resulted in it being targeted for vaccine development [5-8]. However, predicting who is most likely to develop CDI within the next year and be eligible for C difficile vaccine clinical studies remains a challenge. A risk score model that incorporates important risk factors from inpatient and outpatient settings to predict incident CDI could potentially identify such individuals. Although risk models have been previously proposed, they have had limited generalizability due to small samples, inclusion of data from only 1 healthcare setting (eg, inpatient) or age group (eg, ≥65 years), and/or were tailored to specific medical facilities [9-17]. To date, no risk score has incorporated into 1 model the many potential CDI risk factors often available in an electronic medical record (EMR). We aimed to build CDI risk score models that may be more generalizable to other populations and healthcare settings that could be used to target high-risk individuals for studying preventive measures including C difficile vaccination. To build this risk score, we identified risk factors associated with CDI at Kaiser Permanente Northern California (KPNC) and predicted the risk of CDI.

METHODS

Setting

Kaiser Permanente Northern California delivers integrated healthcare services to members who receive almost all of their care at KPNC facilities, which includes 65 medical clinics and 27 hospitals. At the time of the study, this included 3.2 million members, including 2.45 million adults aged ≥18 years. Kaiser Permanente Northern California’s EMR captures all healthcare encounters including diagnoses, medications, laboratory tests, and any CDI event. Kaiser Permanente Northern California’s single centralized laboratory conducted C difficile stool testing by 2-stage procedure: (1) enzyme immunoassay for C difficile antigen, glutamate dehydrogenase (Alere, Waltham, Massachusetts); and (2) positive and equivocal enzyme immunoassay tests confirmed using polymerase chain reaction (PCR) to detect C difficile toxin B gene sequences (Cepheid, Sunnyvale, California). Only freshly passed stool specimens (no cathartic or enema) with liquid or loose stool is acceptable; formed stool samples are rejected. Kaiser Permanente Northern California has region-wide standardized infection control policies; however, our approximately 100 hospitals and clinics may each have facility-specific CDI risk and/or protective factors (eg, outbreaks, member social/economic make-up, facility size/layout, and staff practices) potentially mirroring variations in other healthcare settings and adding to the generalizability of our risk scores.

Study Population and Design

This retrospective study was conducted in 2 phases using KPNC’s EMR from May 2010 to July 2014. In Phase 1, we ascertained potential risk factors for CDI, and, in Phase 2, we developed CDI risk scores. Using the following overall approach, the unit of study for both phases was an index date (Figure 1). In Phase 1, among all KPNC members aged ≥18 years between May 2011 and July 2014, we identified all incident CDI cases. We used the remaining KPNC population for comparison. We profiled for potential CDI risk factors by comparing demographic and clinical characteristics between those with and without CDI in the 1 year before an index date. The index date for this comparison was either the CDI diagnosis date or a randomly chosen date for the remaining population without CDI. We required 1 year of membership before the CDI diagnosis date or random date to collect characteristics.
Figure 1.

Study Design. Phase 1 example demonstrates that for an index date of 8/1/2011, we looked back one year to 8/1/2010 to identify and characterize potential risk factors for Clostridioides difficile infection (CDI) at the population level. Phase 2 example demonstrates that for an Index Date for Risk Score Assessment (IDRSA) of 6/1/2013, we looked back one year to 6/1/2012 to collect individual risk factors for CDI (as identified in Phase 1) and then looked forward one year to 6/1/2014 to predict risk for CDI during that year.

Study Design. Phase 1 example demonstrates that for an index date of 8/1/2011, we looked back one year to 8/1/2010 to identify and characterize potential risk factors for Clostridioides difficile infection (CDI) at the population level. Phase 2 example demonstrates that for an Index Date for Risk Score Assessment (IDRSA) of 6/1/2013, we looked back one year to 6/1/2012 to collect individual risk factors for CDI (as identified in Phase 1) and then looked forward one year to 6/1/2014 to predict risk for CDI during that year. Individuals were categorized as having incident CDI (CDI cases) or having no record of CDI (comparison population) as follows. (1) For CDI cases, we defined an incident CDI case as having a C difficile PCR-positive test during the study period without history of laboratory-confirmed C difficile during the 1 year before the CDI diagnosis date (the first C difficile PCR-positive test on study). If a subject had a distant history of prior CDI, the CDI event during the study period must have occurred at least 1 year after a prior positive PCR test to be considered incident CDI. (2) In the comparison population, we included all individuals without a positive PCR test for C difficile during the 1 year before a random date. We used the SAS uniform random number generator function to select a random date during the study period for each individual without CDI to act as an index date for risk factor comparison. Thus, for both CDI cases and comparison population, we ensured no CDI episodes occurred in the 1 year before the diagnosis or random date. In Phase 2, we used the results from Phase 1 to inform the development of risk score models to predict CDI by anchoring the models on an Index Date for Risk Score Assessment (IDRSA). Using risk factors (identified in Phase 1) present for each individual in the year before the IDRSA, we created risk scores to predict CDI in the year after the IDRSA. Utilizing a subset of the study period and population, we developed models to evaluate 2 different IDRSAs: (1) a hospital discharge IDRSA model (anchored on hospital discharge dates) included only members aged ≥18 years who had a hospitalization between May 2011 and July 2013; (2) a random IDRSA model (anchored on randomly selected dates, discussed further below) included all members ≥50 years between May 2011 and July 2013. This age group was selected for the random model because those aged 50+ are more likely to develop CDI and be the target population for a C difficile vaccine efficacy study. For each model, we required membership at least 1 year before the IDRSA (to collect risk factors) and 1 year after the IDRSA (to look for future CDI).

Patient Consent

This study was determined exempt by the KPNC Institutional Review Board. Waivers of informed consent and Health Insurance Portability and Accountability Act (HIPAA) were granted. Kaiser Permanente Northern California’s Institutional Review Board approved this study.

Statistical Analysis

In Phase 1, we compared all incident CDI cases with the comparison population. We examined covariates potentially related to CDI including demographics, comorbid medical conditions (using Healthcare Cost and Utilization Project and International Classification of Diseases, Ninth Revision, codes [Supplementary Table 1]), healthcare utilization (hospitalization, outpatient, ED, and long-term care), medications (antibiotics, proton-pump inhibitors, and immunocompromising medications), and Charlson comorbidity scores [18]. We assessed all risk factors for their presence during different baseline periods (eg, 12 weeks or 1 year) before the diagnosis (CDI cases) or random date (comparison population). We calculated CDI incidence rates per 100 000 person-years and 2-sided 95% confidence intervals using exact Poisson methods. Incidence rates were calculated overall and for subgroups of interest defined by demographics and comorbidities. In Phase 2, we used a hospital discharge date or random date as our IDRSA to mark the beginning of the period for which we estimated risk for CDI. We used logistic regression to estimate the risk of developing CDI in relation to the risk factors identified during the baseline period (eg, 12 weeks or 1 year) before the IDRSA. We used 2 different IDRSAs to create models to predict the future risk of CDI in the 31- to 365-day period starting at each IDRSA. The primary risk score model was anchored to hospital discharge date as the IDRSA because we envisioned that hospital discharge could be a touch point to identify high-risk populations for recruitment into C difficile vaccine trials. Using a cross-validation strategy, we developed the hospital discharge model using a random 70% of the data and then evaluated it with the remaining 30% [19]. To achieve a broader representation of the general population and make decisions about people not in the hospital, we created a simplified secondary risk score model in which a randomly selected date (using the same SAS random number function discussed above) was used as the IDRSA. The rationale for using a random date as the IDRSA was to simulate the real-world setting whereby a nonhospitalized individual’s eligibility for inclusion in C difficile vaccine trial could be evaluated based on the presence of risk factors. Each model was fit to datasets created specifically for each IDRSA. (1) The hospital discharge IDRSA dataset included demographic and hospitalization variables identified as risk factors in Phase 1 (eg, age, diagnoses, medications) for all hospitalized individuals (CDI cases and comparison population) aged 18+ years; individuals not hospitalized did not contribute. To choose which risk factors to use, we started with approximately 200 possible risk factors and used a forward selection process consisting of an automated stepwise hierarchy with built-in P value threshold of ≤0.15 for each risk factor to be kept in the model; 100 risk factors remained that met the threshold. (2) The random IDRSA dataset included demographic and risk factor variables for all individuals (CDI cases and comparison population) aged ≥50 years. We chose approximately 20 risk factors based on known CDI risk factors (eg, age, healthcare utilization, antibiotics, medical history) that would be more generalizable and practical for use at a clinical site, including sites with less extensive EMR. For each cohort—the cohort of hospitalized patients aged ≥18 years on their discharge date (hospital discharge IDRSA) and the cohort of members aged ≥50 years on a randomly selected date (random IDRSA)—we used logistic regression to examine the risk of CDI in relation to risk factors and demographics. For each cohort, a logistic regression model was fitted to yield a predicted probability of CDI risk for each person during each follow-up period. Each fitted model also yielded an estimate of the odds ratio (and corresponding relative risk) associated with each risk factor. We reported 95% Wald confidence intervals for the odds ratios and corresponding 2-sided P values. From both risk score models, we excluded the first 30 days after IDRSA to theoretically allow for time for vaccination and development of immunity. Therefore, we conducted a supplemental analysis to create a separate random date IDRSA model with follow-up of 1–30 days after IDRSA to evaluate the number of CDI cases that might occur early and thus not be included in the models with longer follow-up. We used SAS software, version 9.2 (SAS Institute) for all analyses.

RESULTS

Phase 1 Results: Risk Factors

From May 2011 through July 2014, we identified 9986 incident CDI cases and 2 230 354 in the comparison population, for a CDI incidence rate of 141 per 100 000 person-years among members aged ≥18. Individuals with CDI were more likely to be older (aged ≥65 years, 59% vs 21%), female (61% vs 53%), and white race (70% vs 53%) than those in the comparison population (Table 1).
Table 1.

Incidence of Clostridioides difficile Infection vs Comparison by Demographics, Individuals Aged 18+ Years, Kaiser Permanente Northern California, May 2011 to July 2014

CovariateNumber of CDI Cases%CDI CasesNon-CDI Population%Non-CDI Population Person-Years (PY) in Total PopulationCDI Incidence per 100 000 PY (95% CI)
Total9986 100.02 230 354100.07 079 474141 (138–144)
Age Category
 18–49 Years169216.91 146 76851.4%3 629 13447 (44–49)
 50–64 Years235923.6627 21228.1%1 989 444119 (114–124)
 65–74 Years203720.4261 72611.7%833 491244 (234–255)
 75–84 Years230023.0139 6326.3%448 505513 (492–534)
 85+ Years198 16.055 0162.5%178 900893 (850–928)
 All ≥50 Years8294 83.11 083 58648.6%3 450 341240 (235–246)
 All ≥65 Years5935 59.4456 37420.5%1 460 896406 (396–417)
Gender
 Female6038 60.51 182 33853.0%3 755 268161 (157–165)
 Male3948 39.51 048 01647.0%3 324 206119 (115–123)
Race
 White6973 69.81 177 71952.8%3 743 627186 (182–191)
 Black917 9.2159 6627.2%507 430181 (169–193)
 Asian877 8.8381 11917.1%1 207 10773 (68–78)
 Hispanic922 9.2267 03912.0%846 757109 (102–116)

Abbreviations: CDI, Clostridioides difficile infection; CI, confidence interval.

Incidence of Clostridioides difficile Infection vs Comparison by Demographics, Individuals Aged 18+ Years, Kaiser Permanente Northern California, May 2011 to July 2014 Abbreviations: CDI, Clostridioides difficile infection; CI, confidence interval. During the year before incident CDI, a greater percentage of people with CDI than without were hospitalized (69% vs 10%), had an ED visit (51% vs 14%), skilled nursing stay (25% vs 0.6%), nursing home stay (4% vs 0.1%), or at least 10 outpatient visits (53% vs 16%), used a proton pump inhibitor (36% vs 7%), had a Charlson comorbidity index score of 3 (which indicates moderate risk of death) (11% vs 2%), and had multiple medical conditions, such as pneumonia (21% vs 1%), chronic kidney disease (26% vs 4%), coronary artery disease (22% vs 3%), congestive heart failure (22% vs 2%), urinary tract infection (28% vs 3%), diabetes (23% vs 6%), and peripheral vascular disease (9% vs 1%), and were prescribed an antibiotic in the prior 12 weeks (81% vs 11%) (Table 2).
Table 2.

Selected Risk Factors Within 1 Year (Except as Noteda) Before Diagnosis or Random Date, Clostridioides difficile Infection vs Comparison, Individuals Aged 18+ Years, Kaiser Permanente Northern California, May 2011 to July 2014

Risk FactoraCDI CasesNon-CDI PopulationRatio (%CDI Cases/%Non-CDI)CDI Incidence per 100 000 PY (95% CI)
N = 9986%N = 2 230 354%
≥1 inpatient hospitalization685668.7227 21310.26.7927 (905–949)
≥2 inpatient hospitalization416641.762 8602.814.91967 (1908–2028)
Emergency room visit511951.3302 06913.53.8527 (513–542)
>10 outpatient visits521953.3290 76315.93.4559 (543–573)
Skilled nursing facility stay252925.312 3730.646.15371 (5163–5584)
Custodial care facility stay3683.724120.133.54189 (3772–4640)
Systemic antibiotic use (within 12 weeks)803980.5236 47610.67.61040 (1018–1064)
Prescription systemic proton pump inhibitor use354135.5148 7006.75.3736 (712–761)
Congestive heart failure219722.034 8421.613.81877 (1799–1957)
Chronic kidney disease (less severe)256425.798 2194.45.8805 (774–837)
Severe chronic kidney disease (on dialysis)6366.453090.232.03386 (3128–3659)
Diabetes mellitus234123.4129 0475.84.0564 (541–587)
Chronic obstructive pulmonary disease175017.570 5043.25.5767 (731–803)
Peripheral vascular disease8848.919 7480.99.91356 (1268–1448)
Coronary artery disease223122.374 3643.36.8922 (884–961)
Acute myocardial infarction4934.958820.316.32447 (2236–2673)
Liver disease144714.545 9552.16.9966 (917–1017)
Pneumonia212421.329 1361.316.42150 (2060–2244)
Urinary tract infection 278627.957 3722.610.71466 (1412–1521)
Charlson comorbidity score of 3107310.748 7992.24.9681 (641–723)

Abbreviations: CDI, Clostridioides difficile infection; CI, confidence interval; PY, person-years.

aNumber of patients with that risk factor (eg, comorbid condition, medication, healthcare use) within 1 year before CDI diagnosis date (first positive C difficile toxin text) or random date except for antibiotic use that was within 12 weeks before diagnosis or random date.

Selected Risk Factors Within 1 Year (Except as Noteda) Before Diagnosis or Random Date, Clostridioides difficile Infection vs Comparison, Individuals Aged 18+ Years, Kaiser Permanente Northern California, May 2011 to July 2014 Abbreviations: CDI, Clostridioides difficile infection; CI, confidence interval; PY, person-years. aNumber of patients with that risk factor (eg, comorbid condition, medication, healthcare use) within 1 year before CDI diagnosis date (first positive C difficile toxin text) or random date except for antibiotic use that was within 12 weeks before diagnosis or random date.

Phase 2 Results: Risk Scores

Among the 104 518 hospital discharges in the validation set (representing 30% of all hospital discharges), the hospital discharge IDRSA model yielded excellent performance in predicting the likelihood of developing CDI during the 31–365 days after hospital discharge in the 30% of data withheld for validation. A C-statistic of 0.848 was generated as the measure of fit for the model (Supplementary Figure 1). From the model validation results based on risk factors included in the hospital discharge IDRSA model (Table 3 and Supplementary Table 2), we can see the number of hospitalized individuals aged ≥18 years expected to develop CDI by risk score threshold or range as demonstrated by the following examples. (1) Higher risk scores of ≥0.30 will target 1.1% of the hospital discharges (1146 of 104 518) and 11.1% of all posthospital CDI cases (379 of 3423) (Table 3). (2) Alternatively, lower risk scores of ≥0.05 will target 11.9% of hospital discharges (12 423 of 104 518) and 57.2% of all posthospital CDI cases (1957 of 3423) (Table 3), which would provide better representation of hospitalized adults for enrollment in large Phase 3 C difficile vaccine efficacy studies. (3) In the 0.30 to 0.35 risk score range, we predict 35.2% will get CDI (Supplementary Table 2).
Table 3.

Validation Model Risk Scores for Developing a Clostridioides difficile Infection 31 to 365 Days After a Hospital Discharge Index Date for Risk Score Assessment for Individuals Aged 18+ Years Using Various Risk Score Thresholds, Kaiser Permanente Northern California, May 2011 to July 2014

Risk Score ThresholdTotal Number of Eligible Hospital Discharges (% of All Hospital Discharges)Number Who Had CDI in 31–365 Days After Hospital Discharge IDRSAPercentage of Hospital Discharges in This Row Who Had CDI (Col 3/Col 2)Percentage of Total CDI Cases (n = 3423)
Risk ≥0.00104 518 (100)34233.3%100%
 Risk ≥0.00583 989 (80.4)33724.0%98.5%
 Risk ≥0.0154 248 (51.9)31885.9%93.1%
 Risk ≥0.0231 403 (30)28339.0%82.8%
 Risk ≥0.0321 328 (20.4)249111.7%72.8%
 Risk ≥0.0415 825 (15.1)219713.9%64.2%
Risk ≥0.0512 423 (11.9)195715.8%57.2%
Risk ≥0.105486 (5.2)120522.0%35.2%
Risk ≥0.153203 (3.1)84426.4%24.7%
Risk ≥0.202129 (2)64730.4%18.9%
Risk ≥0.251514 (1.4)48331.9%14.1%
Risk ≥0.301146 (1.1)37933.1%11.1%

Abbreviations: CDI, Clostridioides difficile infection; Col, column; IDRSA, Index Date for Risk Score Assessment.

Validation Model Risk Scores for Developing a Clostridioides difficile Infection 31 to 365 Days After a Hospital Discharge Index Date for Risk Score Assessment for Individuals Aged 18+ Years Using Various Risk Score Thresholds, Kaiser Permanente Northern California, May 2011 to July 2014 Abbreviations: CDI, Clostridioides difficile infection; Col, column; IDRSA, Index Date for Risk Score Assessment. Our random IDRSA model also predicted CDI reasonably well in the 31- to 365-day follow-up period (C–statistic 0.722) (Supplementary Figure 2). From the model validation results based on the random date IDRSA model (Table 4 and Supplementary Table 3), we can see the number of individuals aged ≥50 years expected to get CDI by risk score threshold or range demonstrated by the following examples. (1) Risk scores ≥0.05 will target 0.07% of individuals (707 of 972 172) and 1.6% of all CDI cases (30 of 1918) (Table 4). (2) Alternatively, lower risk scores of ≥0.01 will target 2.1% of a random population aged ≥50 years (20 676 of 972 172) and 18.8% of all CDI cases (360 of 1918) (Table 4), which may be more feasible for enrollment of nonhospitalized individuals in C difficile vaccine trials. (3) In the 0.05 to 0.10 risk score range, we predict 4.2% will get CDI (Supplementary Table 3).
Table 4.

Validation Model Risk Scores for Developing a Clostridioides difficile Infection 31 to 365 Days After a Random Index Date for Risk Score Assessment (IDRSA) for Individuals Aged 50+ Years Using Various Thresholds, Kaiser Permanente Northern California May 2011 to July 2014

Risk Score ThresholdTotal Number of Eligible Individuals N = 972 172 (% of total)Number Who Had CDI in 31–365 days After Random IDRSAPercent of Total Population in This Row Who Had CDI (Col 3/Col 2)Percent of Total CDI Cases (n = 1918)
Risk score ≥0.00972 17219180.2%100%
 Risk ≥0.00555 726 (5.7)6381.1%33.3%
 Risk ≥0.0120 676 (2.1)3601.7%18.8%
 Risk ≥0.026550 (0.7)1672.6%8.7%
 Risk ≥0.032870 (0.3)983.4%5.1%
 Risk ≥0.041352 (0.1)513.8%2.7%
Risk score ≥0.05707 (0.07)304.2%1.6%
Risk score ≥0.1037 (0.004)25.4%0.1%
Risk score ≥0.1123 (0.002)28.7%0.1%
Risk score ≥0.128 (0.0008)112.5%0.1%
Risk score ≥0.133 (0.0003)00.0%0.0%

Abbreviations: CDI, Clostridioides difficile infection; Col, column; IDRSA, Index Date for Risk Score Assessment.

Validation Model Risk Scores for Developing a Clostridioides difficile Infection 31 to 365 Days After a Random Index Date for Risk Score Assessment (IDRSA) for Individuals Aged 50+ Years Using Various Thresholds, Kaiser Permanente Northern California May 2011 to July 2014 Abbreviations: CDI, Clostridioides difficile infection; Col, column; IDRSA, Index Date for Risk Score Assessment. Supplemental analyses limited to 1–30 days after the random IDRSA model revealed that of the 157 CDI cases that occurred during this period, only 1 had a risk score threshold of developing CDI in the first 30 days above 0.05%, indicating that our model was not overly affected by excluding cases during the 1–30 days after a random date. The potential risk factors included in each risk score model, along with coefficients generated from logistic regression (Supplementary Tables 4 and 5), can be used to calculate a risk score for individuals. For example, for each of the 19 risk factors in the simplified model noted in Supplementary Table 5, individuals would be assigned “1’s” for present risk factors and “0’s” for absent risk factors. Each “1” and “0” would be multiplied by the coefficient provided. All coefficients (including the baseline intercept) would then be added together and the sum exponentiated to yield the probability that an individual with this risk score recruited for a clinical trial would have CDI in 31–365 days following a random date.

DISCUSSION

In this large study consisting of approximately 2.2 million KPNC members and 9986 CDI cases, we identified CDI risk factors, which we used to estimate the risk of developing CDI 31 to 365 days after either a hospital discharge or a random date. The hospital discharge IDRSA model was excellent in predicting future CDI (C-statistic 0.848), whereas the random IDRSA model predicted CDI reasonably well (C-statistic 0.722). Although this study newly identified that having at least 10 outpatient visits in the past year was a risk factor for CDI (53% in CDI cases vs 16% in the comparison), most factors in our study, such as older age, hospitalization (acute and long-term care), ED visits, outpatient visits, antibiotic use, proton pump inhibitor use, and specific comorbidities, have been reported previously [1, 3, 10, 16, 20], which confirms the importance of these CDI risk factors and lends credibility to our prediction models. Overall, our model was able to identify persons who, at the time of discharge from the hospital, were at high risk for developing CDI during the subsequent year. Our risk score can be interpreted as the probability of developing CDI during the follow-up interval, in a practice setting similar to KPNC during 2011–2014. A risk score that easily identifies individuals at high risk for CDI would be useful for C difficile vaccine trials. The ability of such a tool to enrich the study population for individuals more likely to have CDI within the next year would allow a vaccine efficacy study to reach the required endpoints for conclusion in a meaningful timeframe with a feasible sample size. These tools could also be used to target individuals at high risk for CDI who may benefit from precautionary measures such as avoidance of unnecessary antibiotics and proton-pump inhibitors and isolation procedures for infection control. Other risk score models for CDI have been developed, but most were limited in scope by focusing on only hospital-acquired and/or institution-specific CDI [9, 11–15, 17] and/or had small samples (eg, <300 CDI cases vs ~10 000 cases in our study) [9, 11, 13, 17]. Kuntz et al [10] predicted risk of CDI after outpatient visits, but their study was also limited in size (620 CDI cases). A study comparable to ours by Zilberberg et al [16] collected risk factors from a large Medicare sample to develop a simplified model that assigned risk based on 22 predictors. Their model also performed well in predicting future CDI (C-statistic 0.864). Although their study was more geographically diverse than ours (US Medicare claims vs Northern California), our study included a larger age range (≥18 years in our study vs ≥65) and a more racially/ethnically diverse population (70% white in our study vs 90%). Study strengths include KPNC’s large membership size, which increased power and confidence in the data, and our ability to access the complete medical record for all individuals in the study. Kaiser Permanente Northern California’s integrated EMR allowed us to both identify all individuals with relevant risk factors and to follow them over time for subsequent development of CDI. We also created a simplified model using a random date that could be applicable to different healthcare settings. This study has limitations. First, CDI risk at KPNC may differ from elsewhere due to different hospital and geographic settings. Second, the usefulness of our findings may be limited in settings where extensive data are unavailable. Third, although KPNC’s CDI testing procedure implemented in 2011 was in use during the study period, data from 2010 used to confirm no prior CDI may have been misclassified due to different testing procedures used earlier. Fourth, our ability to capture complete data for KPNC members in long-term care facilities outside of KPNC was limited and may have missed CDI if no CDI care was received at KPNC. Fifth, these are predictive models rather than causal models, and important causal factors may be omitted arbitrarily by our variable selection processes. The importance of a variable in our predictive models may not reflect its importance in causing CDI. Sixth, our validation dataset was selected randomly rather than temporally so we did not examine how a predictive algorithm developed from earlier data predicts CDI in a later time period. Seventh, we did not make use of newer machine learning methods for variable selection and cross-validation (to address overfitting). Finally, although previous CDI is a well known risk factor for recurrence [1, 21], our study focused on incident CDI and not recurrent CDI. One goal of our study was to create the best possible risk score model for predicting future CDI that could be used to plan for C difficile vaccine clinical efficacy trials. Our complex model, which incorporates many risk factors, demonstrates that we could identify 57.2% of hospitalized individuals expected to get CDI by focusing on just hospitalized individuals with risk scores above 0.05 (approximately 12% of all hospitalizations). Creating similar risk scores based on many risk factors may be practical in other healthcare settings that also have extensive EMRs. A secondary goal was to create a simplified risk score model that can be scored with readily available risk factors in a wide range of clinical settings without extensive EMRs. We also expanded the population for this model to include individuals in the outpatient setting because the incidence of CDI in this population has been increasing. Although a simplified model such as the random IDRSA model sacrifices predictive power and does not predict future CDI with the same accuracy as the hospital discharge IDRSA model, this simplified version more easily identifies high-risk individuals based on the most important risk factors. Further, this simplified risk score model has the advantage of greater flexibility in situations such as recruiting for a clinical trial (for example, sending a recruitment letter to potential subjects before a planned visit).

CONCLUSIONS

In conclusion, we have confirmed important risk factors of CDI. We used these and other results to inform modeling and create 2 types of predictive risk scores—one focused on a hospital setting and one that includes the outpatient/community setting. These models can be used to plan for and recruit subjects into future C difficile vaccine clinical efficacy trials.

Supplementary Data

Supplementary materials are available at Open Forum Infectious Diseases online. Consisting of data provided by the authors to benefit the reader, the posted materials are not copyedited and are the sole responsibility of the authors, so questions or comments should be addressed to the corresponding author. Click here for additional data file.
  17 in total

1.  Adapting a clinical comorbidity index for use with ICD-9-CM administrative databases.

Authors:  R A Deyo; D C Cherkin; M A Ciol
Journal:  J Clin Epidemiol       Date:  1992-06       Impact factor: 6.437

Review 2.  Clostridium difficile Infection.

Authors:  Christopher L Knight; Christina M Surawicz
Journal:  Med Clin North Am       Date:  2013-04-24       Impact factor: 5.456

3.  Development of vaccine for Clostridium difficile infection using membrane fraction of nontoxigenic Clostridium difficile.

Authors:  Mitsutoshi Senoh; Masaaki Iwaki; Akihiko Yamamoto; Haru Kato; Tadashi Fukuda; Keigo Shibayama
Journal:  Microb Pathog       Date:  2018-06-27       Impact factor: 3.738

4.  A study in transfer learning: leveraging data from multiple hospitals to enhance hospital-specific predictions.

Authors:  Jenna Wiens; John Guttag; Eric Horvitz
Journal:  J Am Med Inform Assoc       Date:  2014-01-30       Impact factor: 4.497

5.  Predicting the risk of Clostridium difficile infection following an outpatient visit: development and external validation of a pragmatic, prognostic risk score.

Authors:  J L Kuntz; E S Johnson; M A Raebel; R W Platt; A F Petrik; X Yang; M L Thorp; S J Spindel; N Neil; D H Smith
Journal:  Clin Microbiol Infect       Date:  2014-11-11       Impact factor: 8.067

6.  Risk Factors for Community-Associated Clostridium difficile Infection in Adults: A Case-Control Study.

Authors:  Alice Y Guh; Susan Hocevar Adkins; Qunna Li; Sandra N Bulens; Monica M Farley; Zirka Smith; Stacy M Holzbauer; Tory Whitten; Erin C Phipps; Emily B Hancock; Ghinwa Dumyati; Cathleen Concannon; Marion A Kainer; Brenda Rue; Carol Lyons; Danyel M Olson; Lucy Wilson; Rebecca Perlmutter; Lisa G Winston; Erin Parker; Wendy Bamberg; Zintars G Beldavs; Valerie Ocampo; Maria Karlsson; Dale N Gerding; L Clifford McDonald
Journal:  Open Forum Infect Dis       Date:  2017-10-26       Impact factor: 3.835

7.  Predictors of first recurrence of Clostridium difficile infection: implications for initial management.

Authors:  David W Eyre; A Sarah Walker; David Wyllie; Kate E Dingle; David Griffiths; John Finney; Lily O'Connor; Alison Vaughan; Derrick W Crook; Mark H Wilcox; Timothy E A Peto
Journal:  Clin Infect Dis       Date:  2012-08       Impact factor: 9.079

8.  Learning Data-Driven Patient Risk Stratification Models for Clostridium difficile.

Authors:  Jenna Wiens; Wayne N Campbell; Ella S Franklin; John V Guttag; Eric Horvitz
Journal:  Open Forum Infect Dis       Date:  2014-07-15       Impact factor: 3.835

9.  A Multi-Center Prospective Derivation and Validation of a Clinical Prediction Tool for Severe Clostridium difficile Infection.

Authors:  Xi Na; Alan J Martin; Saurabh Sethi; Lorraine Kyne; Kevin W Garey; Sarah W Flores; Mary Hu; Dhara N Shah; Kelsey Shields; Daniel A Leffler; Ciarán P Kelly
Journal:  PLoS One       Date:  2015-04-23       Impact factor: 3.240

10.  Molecular features of lipoprotein CD0873: A potential vaccine against the human pathogen Clostridioides difficile.

Authors:  William J Bradshaw; Jean-François Bruxelle; Andrea Kovacs-Simon; Nicholas J Harmer; Claire Janoir; Severine Péchiné; K Ravi Acharya; Stephen L Michell
Journal:  J Biol Chem       Date:  2019-08-16       Impact factor: 5.157

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.