| Literature DB >> 35351003 |
Frank Lien1, Huang-Shen Lin2, You-Ting Wu3, Tzong-Shi Chiueh4,5,6.
Abstract
BACKGROUND: Biomarkers, such as leukocyte count, C-reactive protein (CRP), and procalcitonin (PCT), have been commonly used to predict the occurrence of life-threatening bacteremia and provide prognostic information, given the need for prompt intervention. However, such diagnosis methods require much time and money. Therefore, we propose a method with a high prediction capability using machine learning (ML) models based on complete blood count (CBC) and differential leukocyte count (DC) and compare its performance with traditional CRP or PCT biomarker methods and those of models incorporating CRP or PCT biomarkers.Entities:
Keywords: Bacteremia; Blood count; Differential count; Machine learning; Procalcitonin
Mesh:
Substances:
Year: 2022 PMID: 35351003 PMCID: PMC8962279 DOI: 10.1186/s12879-022-07223-7
Source DB: PubMed Journal: BMC Infect Dis ISSN: 1471-2334 Impact factor: 3.090
Fig. 1Flowchart of research samples. aBlood culture. bMachine learning
The three research groups: CBC/DCa, CRP&CBC/DCb, and PCT&CBC/DCc
| Groups | CBC/DC | CRP&CBC/DC | PCT&CBC/DC |
|---|---|---|---|
| Number of cases | |||
| Training and validation set | |||
| 2014 | 54,729 | 44,846 | 2434 |
| 2015 | 55,982 | 46,775 | 2898 |
| 2016 | 61,438 | 53,959 | 2924 |
| 2017 | 59,843 | 53,784 | 4246 |
| 2018 | 58,433 | 53,645 | 4531 |
| Testing set | |||
| 2019 | 60,350 | 55,794 | 6879e |
| Age | 48.7 ± 30.0 | 46.8 ± 30.7 | 53.3 ± 27.6 |
| Male | 54.6% | 54.3% | 58.3% |
| Female | 45.4% | 45.7% | 41.7% |
aComplete blood count/differential leukocyte count
bC-reactive protein and complete blood count/differential leukocyte count
cProcalcitonin and complete blood count/differential leukocyte count
eOf the 6879 cases, only 3070 inpatients’ data were used for prediction
dNumbers in bold represent the total number of cases
Bacteremia prediction capability indicated with AUCsa of biomarkers (CRP/PCT) and models (random forest/logistic regression)
| Methods/Group | CBC/DCb | CRP&CBC/DCc | PCT&CBC/DCd | |||||
|---|---|---|---|---|---|---|---|---|
| Cross-validation | Testing | Cross-validation | Testing | Cross-validation | Testing | |||
| Used biomarker | – | – | CRP | 0.692 ± 0.017 | 0.699 | PCT | 0.748 ± 0.021 | 0.731 |
| MLe models | ||||||||
| Random forest | 0.792 ± 0.010 | 0.802 | CRP excludedf | 0.797 ± 0.010 | 0.806 | PCT excludedh | 0.759 ± 0.022 | 0.767 |
| Includedg | 0.806 ± 0.011 | 0.814 | Included | 0.777 ± 0.018 | 0.767 | |||
| Logistic regression | 0.763 ± 0.009 | 0.772 | Excluded | 0.769 ± 0.009 | 0.775 | Excluded | 0.735 ± 0.030 | 0.734 |
| Included | 0.784 ± 0.011 | 0.790 | Included | 0.761 ± 0.024 | 0.745 | |||
aAreas under the ROC curve
bComplete blood count/differential leukocyte count
cC-reactive protein and complete blood count/differential leukocyte count
dProcalcitonin and complete blood count/differential leukocyte count
eMachine learning
fTrained and validated based on CBC/DC data of CRP&CBC/DC group (n = 253,009)
gTrained and validated based on CBC/DC and CRP data of CRP&CBC/DC group
hTrained and validated based on CBC/DC data of PCT&CBC/DC group (n = 17,033)
Fig. 2Testing set correlation between models. (A: excluding/including CRPa , r = 0.851, p < 0.005; B: excluding/including PCTb. (r = 0.851, p < 0.005); (, r = 0.951, p < 0.005). aC-reactive protein. bProcalcitonin
Comparative prediction capability for prognosis through PCTa level and random forest model based on CBC/DCb
| Blood cultures | Prediction methods | |||
|---|---|---|---|---|
| Random forest | PCT level | |||
| With pathogenic bacterial isolates | ||||
| Group | Positive | Negative | Positive | Negative |
| N | 1888 | 1098 | 1536 | 1450 |
| All-cause in-hospital mortality | 45.10% | 14.50% | 47.40% | 19.40% |
| Overall median survival (days) | 27 | 58 | 25 | 58 |
| With possible contaminations | ||||
| Group | Positive | Negative | Positive | Negative |
| N | 1604 | 1067 | 1285 | 1386 |
| All-cause in-hospital mortality | 42.50% | 14.40% | 44.50% | 19.00% |
| Overall median survival (days) | 29 | 58 | 28 | 58 |
aProcalcitonin
bComplete blood count/differential leukocyte count
Fig. 3Feature importance of final A CBC/DCa, B CBC/DC&CRPb, and C CBC/DC&PCTc models. aComplete blood count/differential leukocyte count. bComplete blood count/differential leukocyte count and C-reactive protein. cComplete blood count/differential leukocyte count and procalcitonin