| Literature DB >> 31619240 |
Zenghua Ren1, Yudan Hu1, Ling Xu2.
Abstract
BACKGROUND: The differential diagnosis of tuberculous pleural effusion (TPE) is challenging. In recent years, artificial intelligence (AI) machine learning algorithms have started being used to an increasing extent in disease diagnosis due to the high level of efficiency, objectivity, and accuracy that they offer.Entities:
Keywords: Artificial intelligence; Diagnostic model; Machine learning algorithm; Tuberculous pleural effusion
Mesh:
Year: 2019 PMID: 31619240 PMCID: PMC6796452 DOI: 10.1186/s12931-019-1197-5
Source DB: PubMed Journal: Respir Res ISSN: 1465-9921
Fig. 1Workflow for constructing diagnostic models using MLAs
Comparison of clinical and laboratory findings between TPE and non-TPE patients
| TPE( | non-TPE( | ||
|---|---|---|---|
| Gender | |||
| Male | 131(68.2) | 157(62.5) | 0.214 |
| Female | 61(31.8) | 94(37.5) | |
| Age(years) | 36.5(24.3, 59.0) | 67.0(56.0, 77.0) | 0.000 |
| Has a history of smoking | 53(27.6) | 96(38.2) | 0.019 |
| Fever> 37.5 °C | 127(66.1) | 57(22.7) | 0.000 |
| Cough | 151(78.6) | 205(81.7) | 0.011 |
| Sputum | 86(44.8) | 143(57.0) | 0.427 |
| Bloody sputum | 2(1.0) | 12(4.8) | 0.526 |
| Chest tightness | 73(38.0) | 117(46.6) | 0.070 |
| Chest pain | 106(55.2) | 103(41.0) | 0.003 |
| Anorexia | 68(35.4) | 98(39.0) | 0.000 |
| Fatigue | 57(29.7) | 24(9.6) | 0.000 |
| Night sweats | 60(31.3) | 12(4.8) | 0.000 |
| Weight loss | 32(16.7) | 40(15.9) | 0.700 |
| In blood: | |||
| WBC(× 109/L) | 5.8(4.9, 7.3) | 7.6(5.8, 9.9) | 0.000 |
| N% | 66.4(60.0, 72.4) | 71.3(65.8, 77.3) | 0.000 |
| L% | 19.8(15.4, 24.2) | 18.0(12.4, 23.4) | 0.009 |
| M% | 10.5(8.5, 13.1) | 7.7(6.0, 9.2) | 0.000 |
| HB(g/L) | 129.0(119.0, 139.0) | 130.0(116.5, 140.5) | 0.564 |
| PLT(×109/L) | 281.0(227.3, 342.5) | 250.0(210.0, 301.8) | 0.000 |
| ESR(mm/h) | 59.0(43.0, 85.0) | 36.0(19.0, 69.0) | 0.000 |
| CRP(mg/L) | 58.8(29.8, 101.0) | 26.6(6.7, 76.2) | 0.000 |
| LDH(U/L) | 191.5(158.3, 225.8) | 208.0(171.0, 280.0) | 0.044 |
| ADA(U/L) | 9.0(7.0, 12.0) | 7.0(5.0, 11.0) | 0.022 |
| CEA(ng/mL) | 1.4(0.9, 2.2) | 5.9(2.1, 29.1) | 0.000 |
| In pleural fluid: | |||
| Bloody effusion | 2(1.0) | 40(15.9) | 0.000 |
| Positive Rivalta test | 189(98.4) | 212(84.5) | 0.000 |
| WBC(× 106/L) | 1200.0(427.3, 2560.0) | 432.5(135.0, 1200.0) | 0.000 |
| RBC(× 106/L) | 1600.0(800.0, 3160.0) | 1600.0(720.0, 4420.0) | 0.218 |
| N% | 10.0(4.0, 21.0) | 19.0(6.0, 50.0) | 0.000 |
| L% | 86.5(70.0, 92.0) | 70.0(41.0, 86.0) | 0.000 |
| Total protein(g/L) | 52.0(49.0, 55.0) | 47.0(42.0, 52.0) | 0.000 |
| Glucose(mmol/L) | 5.2(4.4, 6.5) | 6.1(4.4, 7.2) | 0.004 |
| Chloride(mmol/L) | 104.0(100.3, 106.0) | 104.0(101.0, 107.0) | 0.089 |
| LDH(U/L) | 415.0(265.0, 609.0) | 264.0(167.0, 460.0) | 0.000 |
| ADA(U/L) | 26.0(21.0, 40.0) | 7.0(5.0, 13.0) | 0.000 |
| CEA(ng/mL) | 1.0(0.6, 1.6) | 30.0(1.9, 170.0) | 0.000 |
Data in the table are expressed either as a frequency (percentage) or a median (interquartile range)
TPE tuberculous pleural effusion, non-TPE non-tuberculous pleural effusion (including parapneumonic pleural effusion and malignant pleural effusion), WBC white blood cells, RBC red blood cells, N neutrophils, L lymphocytes, M monocytes, HB hemoglobin, PLT platelets, ESR erythrocyte sedimentation rate, CRP C-reactive protein, LDH lactate dehydrogenase, ADA adenosine deaminase, CEA carcinoembryonic antigen
Performances of the four algorithmic models and pfADA for diagnosing TPE
| AUC | Sensitivity | Specificity | PPV | NPV | PLR | NLR | Accuracy | |
|---|---|---|---|---|---|---|---|---|
| pfADA | 0.890 | 85.4% | 84.1% | 80.4% | 88.3% | 5.37 | 0.17 | 84.7% |
| Logistic regression | 0.876 | 80.5% | 84.8% | 80.2% | 85.2% | 5.47 | 0.23 | 82.9% |
| KNN | 0.895 | 78.6% | 86.6% | 82.3% | 84.0% | 6.28 | 0.24 | 83.2% |
| SVM | 0.918 | 83.2% | 85.9% | 82.3% | 86.6% | 6.23 | 0.20 | 80.4% |
| RF | 0.971 | 89.1% | 93.6% | 91.3% | 91.5% | 14.97 | 0.12 | 91.6% |
TPE tuberculous pleural effusion, pfADA pleural fluid adenosine deaminase, KNN k-nearest neighbor, SVM support vector machine, RF random forest, AUC area under the curve, PPV positive predictive value, NPV negative predictive value, PLR positive likelihood ratio, NLR negative likelihood ratio
Fig. 2Sensitivity, specificity, accuracy, PPV, NPV curves for pfADA and the four algorithmic models. PPV: positive predictive value; NPV: negative predictive value; pfADA: pleural fluid adenosine deaminase; Logistic: logistic regression; KNN: k-nearest neighbor; SVM: support vector machine; RF: random forest
Fig. 3The PLR and NLR curves for pfADA and the four algorithmic models. PLR: positive likelihood ratio; NLR: negative likelihood ratio; pfADA: pleural fluid adenosine deaminase, Logistic: logistic regression; KNN: k-nearest neighbor; SVM: support vector machine; RF: random forest
Fig. 4The impacts of the first 12 features on the accuracy of the RF model. pfADA: pleural fluid adenosine deaminase; pfCEA: pleural fluid carcinoembryonic antigen; WBC: white blood cells; M: monocyte; L: lymphocyte; N: neutrophil; pfLDH: pleural fluid lactate dehydrogenase
Performance of the new RF model for diagnosing TPE
| AUC | Sensitivity | Specificity | PPV | NPV | PLR | NLR | Accuracy | |
|---|---|---|---|---|---|---|---|---|
| RF | 0.965 | 90.6% | 92.3% | 80.9% | 93.0% | 13.1 | 0.1 | 91.5% |
RF random forest, TPE tuberculous pleural effusion, AUC area under the curve, PPV positive predictive value, NPV negative predictive value, PLR positive likelihood ratio, NLR negative likelihood ratio