| Literature DB >> 33907241 |
Imogen Schofield1, David C Brodbelt2, Noel Kennedy2, Stijn J M Niessen3,4, David B Church3, Rebecca F Geddes3, Dan G O'Neill2.
Abstract
Cushing's syndrome is an endocrine disease in dogs that negatively impacts upon the quality-of-life of affected animals. Cushing's syndrome can be a challenging diagnosis to confirm, therefore new methods to aid diagnosis are warranted. Four machine-learning algorithms were applied to predict a future diagnosis of Cushing's syndrome, using structured clinical data from the VetCompass programme in the UK. Dogs suspected of having Cushing's syndrome were included in the analysis and classified based on their final reported diagnosis within their clinical records. Demographic and clinical features available at the point of first suspicion by the attending veterinarian were included within the models. The machine-learning methods were able to classify the recorded Cushing's syndrome diagnoses, with good predictive performance. The LASSO penalised regression model indicated the best overall performance when applied to the test set with an AUROC = 0.85 (95% CI 0.80-0.89), sensitivity = 0.71, specificity = 0.82, PPV = 0.75 and NPV = 0.78. The findings of our study indicate that machine-learning methods could predict the future diagnosis of a practicing veterinarian. New approaches using these methods could support clinical decision-making and contribute to improved diagnosis of Cushing's syndrome in dogs.Entities:
Year: 2021 PMID: 33907241 PMCID: PMC8079424 DOI: 10.1038/s41598-021-88440-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Descriptive statistics and univariable associations of features included in machine-learning prediction of the diagnosis of Cushing’s syndrome in dogs attending primary-care veterinary practices in the UK (Cases, n = 398; non-cases: n = 541).
| Variable | Category | Cases (%) | Non-cases (%) | |
|---|---|---|---|---|
| Age at first suspicion (median, IQR, years) | – | 10.8 (IQR 9.0–12.5) | 10.2 (IQR 8.2–12.1 | 0.004 |
| Weight at first suspicion (median, IQR, kg) | – | 11.4 kg (IQR 8.8–20.0) | 13.2 kg (IQR 9.3–25.1) | 0.008 |
| Weight change in last 12 months (10% change) | Loss | 41 (10.3) | 70 (12.9) | 0.41 |
| Gain | 32 (8.0) | 47 (8.7) | ||
| No change | 325 (81.7) | 424 (78.4) | ||
| Sex-neuter | Female entire | 58 (14.6) | 39 (7.2) | 0.001 |
| Female neutered | 154 (38.7) | 236 (43.6) | ||
| Male entire | 53 (13.3) | 61 (11.3) | ||
| Male neutered | 133 (33.4) | 205 (37.9) | ||
| Breed | Beagle | 1 (0.3) | 11 (2.0) | < 0.001 |
| Bichon frise | 32 (8.0) | 24 (4.4) | ||
| Border collie | 5 (1.3) | 12 (2.2) | ||
| Border terrier | 23 (5.8) | 11 (2.0) | ||
| Boxer | 6 (1.5) | 7 (1.3) | ||
| Cavalier King Charles spaniel | 7 (1.8) | 11 (2.0) | ||
| Cocker spaniel | 5 (1.3) | 20 (3.7) | ||
| Crossbreed | 90 (22.6) | 114 (21.1) | ||
| Jack Russell terrier | 39 (9.8) | 39 (7.2) | ||
| Labrador retriever | 6 (1.5) | 39 (7.2) | ||
| Lhasa apso | 7 (1.8) | 4 (0.7) | ||
| Poodle | 5 (1.3) | 8 (1.5) | ||
| Schnauzer | 6 (1.5) | 24 (4.4) | ||
| Shih tzu | 19 (4.8) | 3 (0.6) | ||
| Staffordshire bull terrier | 29 (7.3) | 26 (4.8) | ||
| West Highland white terrier | 13 (3.3) | 46 (8.5) | ||
| Yorkshire terrier | 20 (5.0) | 20 (3.7) | ||
| Other purebreed | 85 (21.4) | 122 (22.6) | ||
| Polydipsia | Yes | 279 (70.1) | 261 (48.2) | < 0.001 |
| No | 119 (29.9) | 280 (51.8) | ||
| Polyuria | Yes | 234 (58.8) | 195 (36.0) | < 0.001 |
| No | 164 (41.2) | 346 (64.0) | ||
| Polyphagia | Yes | 98 (24.6) | 77 (14.2) | < 0.001 |
| No | 300 (75.4) | 464 (85.8) | ||
| Vomiting | Yes | 19 (4.8) | 59 (10.9) | 0.001 |
| No | 379 (95.2) | 482 (89.1) | ||
| Diarrhoea | Yes | 26 (6.5) | 57 (10.5) | 0.03 |
| No | 372 (93.5) | 484 (89.5) | ||
| Potbelly/hepatomegaly | Yes | 197 (49.5) | 116 (21.4) | < 0.001 |
| No | 201 (50.5) | 425 (78.6) | ||
| Thin/dry skin | Yes | 96 (24.1) | 100 (18.5) | 0.04 |
| No | 302 (75.9) | 441 (81.5) | ||
| Alopecia | Yes | 118 (29.7) | 81 (15.0) | < 0.001 |
| No | 280 (70.3) | 460 (85.0) | ||
| Pruritus | Yes | 15 (3.8) | 45 (8.3) | 0.005 |
| No | 383 (96.2) | 496 (91.7) | ||
| Muscle wastage | Yes | 54 (13.6) | 45 (8.3) | 0.01 |
| No | 344 (86.4) | 496 (91.7) | ||
| Lethargy | Yes | 73 (18.3) | 112 (20.7) | 0.37 |
| No | 325 (81.7) | 429 (79.3) | ||
| Panting | Yes | 80 (20.1) | 99 (18.3) | 0.49 |
| No | 318 (79.9) | 442 (81.7) | ||
| Neurological signs | Yes | 18 (4.5) | 31 (5.7) | 0.41 |
| No | 380 (95.5) | 510 (94.3) | ||
| Hospitalised in previous 12 months | Yes | 55 (13.8) | 81 (15.0) | 0.62 |
| No | 343 (86.2) | 460 (85.0) | ||
| Raised ALKP activity | Yes | 211 (53.0) | 263 (48.6) | 0.001 |
| No | 14 (3.5) | 55 (10.2) | ||
| Unknown | 173 (43.5) | 223 (41.2) | ||
| Raised ALT activity | Yes | 163 (41.0) | 179 (33.1) | < 0.001 |
| No | 28 (7.0) | 98 (18.1) | ||
| Unknown | 207 (52.0) | 264 (48.8) |
Least Absolute Shrinkage and Selection Operator (LASSO) prediction model for a diagnosis of Cushing’s syndrome, applied to dogs attending primary-care veterinary practices in the UK (Cases, n = 259; non-cases: n = 367). Coefficients were estimated following application of a penalty (lambda = 0.014) during tenfold cross validation. Coefficients marked with a full-stop indicate coefficients that have been shrunk to zero and therefore removed from the model.
| Feature | Coefficient |
|---|---|
| Age at first suspicion (years) | 0.77 |
| Weight at first suspicion (kg) | . |
| Beagle | −0.66 |
| Bichon frise | 0.31 |
| Border collie | 0.09 |
| Border terrier | 0.22 |
| Boxer | . |
| Cavalier King Charles spaniel | 0.18 |
| Cocker spaniel | −0.11 |
| Crossbreed | . |
| Jack Russell terrier | . |
| Labrador retriever | −1.33 |
| Lhasa apso | 0.29 |
| Poodle | . |
| Other purebreed | . |
| Schnauzer | −0.56 |
| Shih tzu | 1.12 |
| Staffordshire bull terrier | −0.29 |
| West Highland white terrier | −1.17 |
| Female neutered | . |
| Female entire | 0.52 |
| Male entire | . |
| Weight loss | −0.25 |
| Weight gain | . |
| Polydipsia absent | −0.38 |
| Polyuria present | 0.64 |
| Polyphagia present | 0.06 |
| Vomiting present | −0.46 |
| Diarrhoea present | −0.35 |
| Potbelly present | 0.75 |
| Thin/dry skin present | 0.12 |
| Alopecia present | 0.93 |
| Pruritus present | −0.12 |
| Muscle wastage present | 0.12 |
| Lethargy present | . |
| Panting present | −0.10 |
| Neurological signs present | . |
| ALKP not elevated | −1.04 |
| ALT not elevated | −0.57 |
| (Intercept) | −0.93 |
Training dataset and independent test dataset performance metrics of four machine-learning models for predicting a diagnosis of Cushing’s syndrome in dogs attending primary-care practice in the UK (training dataset: cases n = 259, non-cases: n = 367; testing dataset: cases = 139, non-cases = 174; dataset prevalence = 0.44). AUROC, Area under the receiver-operating characteristic curve; PPV, positive predictive value; NPV, negative predictive value; LASSO, least absolute shrinkage and selection operator; RF, random forest; SVM, support vector machine; RBF, radial basis function.
| LASSO | RF | Linear SVM | RBF SVM | |
|---|---|---|---|---|
| AUROC (95% Confidence interval) | 0.83 (0.80–0.86) | 0.77 (0.73–0.81) | 0.83 (0.80–0.87) | 0.84 (0.81–0.87) |
| Sensitivity | 0.66 | 0.54 | 0.72 | 0.67 |
| Specificity | 0.86 | 0.84 | 0.83 | 0.86 |
| PPV | 0.77 | 0.71 | 0.75 | 0.77 |
| NPV | 0.78 | 0.72 | 0.81 | 0.79 |
| Accuracy (95% Confidence interval) | 0.77 (0.74–0.81) | 0.72 (0.68–0.75) | 0.78 (0.75–0.82) | 0.78 (0.75–0.81) |
| Kappa | 0.53 | 0.40 | 0.55 | 0.54 |
| AUROC (95% Confidence interval) | 0.85 (0.80–0.89) | 0.74 (0.68–0.79) | 0.73 (0.67–0.78) | 0.72 (0.66–0.78) |
| Sensitivity | 0.71 | 0.49 | 0.63 | 0.58 |
| Specificity | 0.82 | 0.83 | 0.73 | 0.74 |
| PPV | 0.75 | 0.70 | 0.65 | 0.64 |
| NPV | 0.78 | 0.67 | 0.71 | 0.69 |
| Accuracy (95% Confidence interval) | 0.77 (0.72–0.81) | 0.68 (0.63–0.73) | 0.68 (0.63–0.73) | 0.67 (0.61–0.72) |
| Kappa | 0.52 | 0.33 | 0.36 | 0.32 |
Figure 1Receiver operating characteristic curve for the final prediction models for a diagnosis of Cushing’s syndrome evaluated in an independent test dataset, applied to dogs attending primary-care veterinary practices in the UK (n = 313; cases = 139 and non-cases = 174). LASSO, least absolute shrinkage and selection operator; RF, random forest; SVM, support vector machine; RBF, radial basis function.
Figure 2Calibration plots of the final prediction models for a diagnosis of Cushing’s syndrome, applied to dogs attending primary-care veterinary practices in the UK (n = 313; cases = 139 and non-cases = 174). The plot describes the mean observed proportions of dogs with a diagnosis of Cushing’s compared to the mean predicted probabilities, by deciles of predictions. The 45 degree line denotes perfect calibration.
Confusion matrix for the Least Absolute Shrinkage and Selection Operator (LASSO) model test dataset predictions of Cushing’s syndrome in dogs attending primary-care practice in the UK (n = 313; cases = 139 and non-cases = 174).
| Observed | ||
|---|---|---|
| Prediction | Cushing’s +ve | Cushing’s −ve |
| Cushing’s +ve | 98 | 32 |
| Cushing’s −ve | 41 | 142 |