| Literature DB >> 34316397 |
Pranav S Pandit1, Deniece R Williams2, Paul Rossitto2, John M Adaska3, Richard Pereira4, Terry W Lehenbauer2,4, Barbara A Byrne5, Xunde Li6, Edward R Atwill6, Sharif S Aly2,4.
Abstract
BACKGROUND: Understanding the effects of herd management practices on the prevalence of multidrug-resistant pathogenic Salmonella and commensals Enterococcus spp. and Escherichia coli in dairy cattle is key in reducing antibacterial resistant infections in humans originating from food animals. Our objective was to explore the herd and cow level features associated with the multi-drug resistant, and resistance phenotypes shared between Salmonella, E. coli and Enterococcus spp. using machine learning algorithms.Entities:
Keywords: Antimicrobial resistance; Cull cows; Dairy cattle; Decision tree classification; Enterococcus; Escherichia coli; Gradient boosting; Random forest; Salmonella
Year: 2021 PMID: 34316397 PMCID: PMC8288115 DOI: 10.7717/peerj.11732
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Hypertuning of model parameters and validation.
| Model parameters | Output definition based on resistance for number of antimicrobial classes | Combined resistance in | Resistance in antimicrobial classes in either | ||||
|---|---|---|---|---|---|---|---|
| Parameter explanation | Parameter values tested | Best performing model parameters | |||||
| Criterion | The function to measure the quality of a split | gini, entropy | gini | entropy | entropy | gini | gini |
| Splitter | The strategy used to choose the split at each node. | best, random | best | best | best | random | random |
| Maximum depth | The maximum depth of the tree | 10, 20, 30, 40, 45, 50, 70 | 10 | 45 | 50 | 45 | 10 |
| Minimum split | The minimum number of samples required to split an internal node | 2, 3, 4, 6 | 6 | 6 | 6 | 3 | 3 |
| Maximum features | The number of features to consider when looking for the best split | auto, sqrt | auto | auto | auto | sqrt | sqrt |
| Minimum leaf | The minimum number of samples required to be at a leaf node | 1, 3, 4, 6, 7, 8 | 4 | 4 | 6 | 1 | 7 |
| Model performance for holdout dataset | |||||||
| Number of samples in holdout dataset | 19 | 73 | 73 | 73 | 73 | ||
| Precision | Positive predictive value | 0.72 | 0.65 | 0.51 | 0.44 | 0.47 | |
| Recall | Sensitivity | 0.79 | 0.68 | 0.52 | 0.44 | 0.47 | |
| F1-score | Harmonic mean of PPV and sensitivity | 0.75 | 0.66 | 0.50 | 0.44 | 0.47 | |
| Bootstrap | Whether bootstrap samples are used when building trees | True, False | FALSE | TRUE | TRUE | TRUE | TRUE |
| Criterion | The function to measure the quality of a split | gini, entropy | gini | entropy | entropy | entropy | entropy |
| Maximum depth | The maximum depth of the tree | 10, 20, 30, 40, 45, 50, 70 | 30 | 50 | 30 | 40 | 10 |
| Maximum features | The number of features to consider when looking for the best split | auto, sqrt | sqrt | sqrt | sqrt | auto | auto |
| Minimum leaf | The minimum number of samples required to be at a leaf node | 1, 3, 4, 6, 7, 8 | 1 | 8 | 4 | 8 | 8 |
| Minimum split | The minimum number of samples required to split an internal node | 2, 3, 4, 6 | 3 | 3 | 2 | 4 | 4 |
| Number of estimators | The number of trees in the forest | 100, 200, 300, 500 | 200 | 100 | 100 | 100 | 100 |
| Model performance for holdout dataset | |||||||
| Number of samples in holdout dataset | 19 | 73 | 73 | 73 | 73 | ||
| Precision | Positive predictive value | 0.66 | 0.62 | 0.46 | 0.47 | 0.41 | |
| Recall | Sensitivity | 0.74 | 0.67 | 0.47 | 0.48 | 0.41 | |
| F1-score | Harmonic mean of PPV and sensitivity | 0.69 | 0.61 | 0.46 | 0.47 | 0.38 | |
| Column sample | Subsample ratio of columns when constructing each tree | 0.2, 0.1,0.15, 0.4, 0.7 | 0.1 | 0.2 | 0.2 | 0.4 | 0.1 |
| Gamma | Minimum loss reduction required to make a further partition on a leaf node of the tree | 0.0, 0.1, 0.2, 0.4, 0.45, 0.5, 0.6, 0.7 | 0.1 | 0.1 | 0.2 | 0.6 | 0.5 |
| Learning rate | Boosting learning rate | 0.001, 0.002, 0.005, 0.008, 0.01, 0.02, 0.05, 0.10, 0.25, 0.5 | 0.25 | 0.5 | 0.008 | 0.5 | 0.05 |
| Maximum depth | Maximum tree depth for base learners | 3, 4, 7, 8, 9, 10, 15, 20 | 3 | 8 | 8 | 7 | 7 |
| Minimum child weight | Minimum sum of instance weight (hessian) needed in a child | 1, 3, 5, 7 | 3 | 1 | 1 | 1 | 7 |
| Number of estimators | Number of boosting rounds | 3, 5, 10, 30, 40, 50, 100 | 5 | 30 | 10 | 3 | 30 |
| Objective | Learning task, binary, multiple, etc. | multi:softprob | multi:softprob | multi:softprob | multi:softprob | multi:softprob | multi:softprob |
| Model performance for holdout dataset | |||||||
| Number of samples in holdout dataset | 19 | 73 | 73 | 73 | 73 | ||
| Precision | Positive predictive value | 0.61 | 0.59 | 0.51 | 0.49 | 0.46 | |
| Recall | Sensitivity | 0.68 | 0.6 | 0.51 | 0.48 | 0.44 | |
| F1-score | Harmonic mean of PPV and sensitivity | 0.65 | 0.59 | 0.51 | 0.48 | 0.38 | |
Note:
Classification algorithms trained and tested to predict multidrug resistance phenotypes from bacterial isolates for bacterial species and groups of bacteria. Parenthesis (n) sample size of the number of cows for each model. For each bacterial and bacterial group model, hyper-tuning of the decision tree classifier, random forest, and XGBoost models is presented. The table shows parameters tuned, values tested for tuning models, best model parameters, and the performance of the selected model in terms of precision, recall and F1-score for the holdout dataset.
Predictive features.
| Cow related features | Herd related features |
|---|---|
| Low milk production cull (LowMilkCull) | Milking herd size (HerdSize) |
| Reproduction cull (ReproCull) | Milk production level (RollingHerdAvg) |
| Lameness cull (LameCull) | Holstein Breed (Holstein) |
| Mastitis cull (MastitisCull) | Jersey Breed (Jersey) |
| Other reasons cull (OtherCull) | Percent culled monthly (CullPctMonth) |
| Antimicrobial Drug Use for cull condition (AMD) | Times culled monthly (CullTimesMonth) |
| Anti-inflammatory treatment for condition (Ani-Inf) | Main cull reason disease |
| No-Treatment for condition (No-Treatment) | Percent culled sold for beef (PctCullBeef) |
| Other treatment for condition (Other) | Percent carcasses condemned (PctCullCondemned) |
| Percent culled injected within 2~3 weeks (PctInject) | |
| Veterinarian gives sick cow treatments (VetTreats) | |
| Dairy manager gives sick cow treatments (ManagerTreats) | |
| Staff gives sick cow treatments (StaffTreats) | |
| Prevent Residue by avoiding specific drugs (ResiduePrevent) | |
| Chalk on cows to track drug withdrawal (Chalk4Withdrawal) | |
| Keep drug inventory (Inventory) | |
| Penicillin | |
| Ceftiofur | |
| Tetracycline | |
| Antibiotics used separately (SeparateUse) | |
| Antibiotics combinations used (CombinationUse) | |
| Track antibiotic dose used (TrackAntibioticDose) | |
| Track antibiotic route used (TrackAntibioticRoute) | |
| Familiarity with ELDU (FamiliarELDU) | |
| Frequency of ELDU (FreqELDU) | |
| No ELDU (NoELDU) | |
| Number of cull cows culled today (NumberCulled) | |
| Use of | |
| Sampling Season |
Note:
Dairy cattle and herd related features used as predictors in classification models.
Resistance phenotypes detected in Enterococcus spp. isolates.
| Resistance phenotypes observed in | Number of cows (total | The proportion of cows (%) | 95% CI |
|---|---|---|---|
| Nitrofuran antibacterial | 17 | 10.83 | [5.967–15.689] |
| Macrolides | 15 | 9.55 | [4.956–14.152] |
| Nitrofuran antibacterial, Macrolides | 15 | 9.55 | [4.956–14.152] |
| Oxazolidinones, Nitrofuran antibacterial, Macrolides | 9 | 5.73 | [2.096–9.369] |
| Oxazolidinones, Nitrofuran antibacterial | 8 | 5.1 | [1.656–8.535] |
| Tetracyclines | 7 | 4.46 | [1.23–7.687] |
| Oxazolidinones | 6 | 3.82 | [0.823–6.821] |
| Tetracyclines, Nitrofuran antibacterial | 6 | 3.82 | [0.823–6.821] |
| Tetracyclines, Nitrofuran antibacterial, Macrolides | 5 | 3.18 | [0.438–5.931] |
| Streptogramin, Oxazolidinones, Nitrofuran antibacterial, Macrolides | 4 | 2.55 | [0.083–5.013] |
| Tetracyclines, Amphenicols, Oxazolidinones, Nitrofuran antibacterial | 4 | 2.55 | [0.083–5.013] |
| Streptogramin, Nitrofuran antibacterial | 4 | 2.55 | [0.083–5.013] |
| Tetracyclines, Amphenicols, Nitrofuran antibacterial, Macrolides | 3 | 1.91 | [0.0–4.052] |
| Amphenicols, Oxazolidinones, Nitrofuran antibacterial, Macrolides | 3 | 1.91 | [0.0–4.052] |
| Tetracyclines, Macrolides | 3 | 1.91 | [0.0–4.052] |
| Amphenicols, Oxazolidinones, Nitrofuran antibacterial | 3 | 1.91 | [0.0–4.052] |
| Oxazolidinones, Macrolides | 3 | 1.91 | [0.0–4.052] |
| Amphenicols, Streptogramin, Oxazolidinones, Nitrofuran antibacterial, Macrolides | 3 | 1.91 | [0.0–4.052] |
| Streptogramin | 2 | 1.27 | [0.0–3.028] |
| Tetracyclines, Oxazolidinones | 2 | 1.27 | [0.0–3.028] |
| Tetracyclines, Amphenicols, Nitrofuran antibacterial | 2 | 1.27 | [0.0–3.028] |
| Tetracyclines, Amphenicols, Macrolides, Streptogramin, Oxazolidinones, Nitrofuran antibacterial, Macrolides | 2 | 1.27 | [0.0–3.028] |
| Amphenicols, Streptogramin, Oxazolidinones, Nitrofuran antibacterial | 2 | 1.27 | [0.0–3.028] |
| Tetracyclines,Oxazolidinones, Nitrofuran antibacterial, Macrolides | 2 | 1.27 | [0.0–3.028] |
| Other single isolates of MDR phenotypes | 23 | 14.72 | [9.118–20.180] |
| Other single isolates of AMR phenotypes | 4 | 2.55 | [0.083–5.013] |
Notes:
Other single isolates of amr phenotypes from Enterococcus sp. isolates: (1) amphenicols, nitrofuran antibacterial (2) streptogramin, oxazolidinones (3) amphenicols, macrolides (4) macrolides, oxazolidinone.
Other single isolates of MDR phenotypes from Enterococcus spp. isolates: (1) tetracyclines, amphenicols, oxazolidinones, nitrofuran antibacterial, macrolides (2) oxazolidinones, nitrofuran antibacterial, macrolides, glycopeptides (3) tetracyclines, amphenicols, oxazolidinones, macrolides (4) streptogramin, oxazolidinones, nitrofuran antibacterial (5) tetracyclines, streptogramin, nitrofuran antibacterial, macrolides (6) tetracyclines, amphenicols, streptogramin, oxazolidinones, nitrofuran antibacterial, macrolides (7) tetracyclines, amphenicols, macrolides, streptogramin, oxazolidinones, macrolides (8) amphenicols, macrolides, streptogramin, oxazolidinones, nitrofuran antibacterial (9) macrolides, streptogramin, oxazolidinones, nitrofuran antibacterials, macrolides (10) amphenicols, nitrofuran antibacterial, macrolides (11) oxazolidinones, nitrofuran antibacterial, glycopeptides (12) amphenicols, oxazolidinones, macrolides (13) tetracyclines, macrolides, oxazolidinones, nitrofuran antibacterial, macrolides (14) tetracyclines, streptogramin, nitrofuran antibacterial (15) tetracyclines, macrolides, nitrofuran antibacterial, macrolides (16) amphenicols, streptogramin, macrolides (17) tetracyclines, streptogramin, oxazolidinones (18) streptogramin, nitrofuran antibacterial, macrolides (19) tetracyclines, macrolides, oxazolidinones, nitrofuran antibacterial (20) tetracyclines, oxazolidinones, nitrofuran antibacterial (21) tetracyclines, amphenicols, macrolides, oxazolidinones, nitrofuran antibacterial (22) tetracyclines, amphenicols, macrolides, nitrofuran antibacterial (23) amphenicols, macrolides, nitrofuran antibacterial.
Phenotypes that are multi-drug resistant.
Resistant phenotypes detected in E. coli isolates.
| Resistant phenotypes observed in | Number of cows (total | The proportion of cows (%) | 95% CI |
|---|---|---|---|
| Tetracyclines | 25 | 30.86 | [20.805–40.924] |
| Aminoglycosides | 11 | 13.58 | [6.12–21.041] |
| Cephalosporins | 9 | 11.11 | [4.267–17.955] |
| Aminoglycosides, Tetracyclines | 7 | 8.64 | [2.523–14.761] |
| Folate pathway antagonist | 5 | 6.17 | [0.932–11.414] |
| Amphenicols | 5 | 6.17 | [0.932–11.414] |
| Tetracyclines, Cephalosporins | 2 | 2.47 | [0.0–5.849] |
| Aminoglycosides, Tetracyclines, Amphenicols | 2 | 2.47 | [0.0–5.849] |
| Other single isolates of AMR phenotypes | 5 | 6.17 | [0.932–11.374] |
| Other single isolates of MDR phenotypes | 5 | 6.17 | [0.932–11.414] |
Notes:
Other single isolates of AMR phenotypes from E. coli: (1) quinolones, aminoglycosides (2) amphenicols, tetracyclines (3) tetracyclines, folate pathway antagonist (6) cephalosporins, fluoroquinolones (7) amphenicols, folate pathway antagonist.
Other single isolates of mdr phenotypes from E. coli: (1) amphenicols, folate pathway antagonist, aminoglycosides (2) amphenicols, tetracyclines, cephalosporins, aminoglycosides (3) amphenicols, tetracyclines, folate pathway antagonist (4) amphenicols, tetracyclines, folate pathway antagonist, aminoglycosides (5) tetracyclines, cephalosporins, fluoroquinolones, quinolones, aminoglycosides.
Phenotypes that are multi-drug resistant.
Figure 1Seasonal variation in the prevalence of multidrug antimicrobial resistance.
Seasonal variation in the prevalence of multidrug antimicrobial resistance (MDR; resistance to three or more drug classes), and antimicrobial resistance (AMR; resistance to one or two drug classes only) in Salmonella (A), E. coli (B) and Enterococcus spp. (C) isolates from six California dairy herds. Orange and green dashed lines show the annual average prevalence of MDR and AMR in all six herds respectively. The proportion of cows that did not show any resistance are the inverse of the sum of MDR and AMR proportions and not shown in the figure. Point estimates and single standard error deviation are represented by circles and whiskers respectively.
Tetracycline antimicrobial resistance phenotype shared (highlighted in grey) between commensal bacteria (Enterococcus spp., E. coli) and Salmonella.
| Resistance Phenotypes observed | ||||
|---|---|---|---|---|
| Tetracyclines | Nitrofuran antibacterial | Aminoglycosides, Tetracyclines | 4 | AMR |
| Tetracyclines | Macrolides, Oxazolidinones, Nitrofuran antibacterial | Tetracyclines, Amphenicols | 4 | MDR |
| Tetracyclines, Penicillins | Macrolides, Nitrofuran antibacterial | Aminoglycosides, Tetracyclines, Folate pathway antagonist, Amphenicols | 4 | MDR |
| Tetracyclines, Folate pathway antagonist | Macrolides, Amphenicols | Tetracyclines | 4 | AMR |
| Tetracyclines | Tetracyclines, Nitrofuran antibacterial | Tetracyclines, Folate pathway antagonist, Amphenicols | 4 | MDR |
| Tetracyclines | Streptogramin, Oxazolidinones | Tetracyclines | 6 | AMR |
Notes:
Phenotypes that are multi-drug phenotypes.
Each row represents bacterial resistance phenotypes of bacterial isolates from a single culled dairy cow.
Figure 2Optimum decision tree to classify cows shedding multi-drug resistant (MDR), antimicrobial-resistant (AMR), and non-resistant Salmonella, Enterococcus spp., E. coli based on management practices observed in Californian dairy herds.
Nodes are represented by stacked histograms depicting distribution samples in the data (AMR, MDR, no resistance), followed by optimum decision point (pointer on the x-axis). Arrows on the left and right indicate lesser and greater than the decision point respectively. Final nodes are represented by pie chart with distribution of samples. Factor acronym definitions are described in Table 2.
Figure 3Top ten herd management practices based on variable importance (Gini coefficient) in classifying cows shedding multi-drug resistant (MDR), antimicrobial-resistant (AMR), and non-resistant for Salmonella, Enterococcus spp., E. coli in Cal.
Factor acronym definitions described in Table 2.
Figure 4Mean SHAP values depicting the impact of herd management practices on predicting multi-drug resistant phenotype in either Salmonella, Enterococcus spp., and E. coli shed in dairy cows for Gradient boosting classification (XGboost) model.
Factor acronym definitions described in Table 2.
Figure 5Partial dependence indicating the association of top six predictive herd management practices in classifying cows as multi-drug resistant phenotype in either Salmonella, Enterococcus spp. and E. coli shed in dairy cows for Gradient boost.
Partial dependence plots are generated for values presented in the data resulting in the non-linear x-axis. Blue shaded region and error bars represents standard deviation of partial dependence (n = 238).
Figure 6Mean SHAP values depicting the impact of herd management practices on predicting multi-drug resistant phenotype in Salmonella shed in dairy cows for Gradient boosting classification (XGboost) model.
Factor acronym definitions described in Table 2.
Figure 7Partial dependence indicating the association of top-six predictive herd management practices in classifying cows as multi-drug resistant phenotype in Salmonella shed in dairy cows for Gradient boosting classification (XGboost) model.
Partial dependence plots are generated for values presented in the data resulting in the non-linear x-axis. Blue shaded region and error bars represent the standard deviation of partial dependence (n = 58).