| Literature DB >> 34735491 |
Alberto Garcia-Zamalloa1,2, Diego Vicente3,4, Rafael Arnay5, Arantzazu Arrospide6,7,8,9, Jorge Taboada10, Iván Castilla-Rodríguez5,9, Urko Aguirre8,9,11, Nekane Múgica12, Ladislao Aldama12, Borja Aguinagalde13, Montserrat Jimenez14, Edurne Bikuña14, Miren Begoña Basauri15, Marta Alonso3, Emilio Perez-Trallero3.
Abstract
OBJECTIVE: To analyze the performance of adenosine deaminase in pleural fluid combined with other parameters routinely measured in clinical practice and assisted by machine learning algorithms for the diagnosis of pleural tuberculosis in a low prevalence setting, and secondly, to identify effusions that are non-tuberculous and most likely malignant. PATIENTS AND METHODS: We prospectively analyzed 230 consecutive patients diagnosed with lymphocytic exudative pleural effusion from March 2013 to June 2020. Diagnosis according to the composite reference standard was achieved in all cases. Pre-test probability of pleural tuberculosis was 3.8% throughout the study period. Parameters included were: levels of adenosine deaminase, pH, glucose, proteins, and lactate dehydrogenase, red and white cell counts and lymphocyte percentage in pleural fluid, as well as age. We tested six different machine learning-based classifiers to categorize the patients. Two different classifications were performed: a) tuberculous/non-tuberculous and b) tuberculous/malignant/other.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34735491 PMCID: PMC8568264 DOI: 10.1371/journal.pone.0259203
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Patient selection process.
Fig 2Classification pipeline.
Classification pipeline.
Characteristics of patients and pleural fluid samples by diagnosis.
| Tuberculous | Malignant | Parapneumonic / Other effusions | |||||
|---|---|---|---|---|---|---|---|
| N | % | N | % | N | % | p-value | |
| Total | 44 | 19.1 | 124 | 53.9 | 62 | 27 | |
| Sex | 0.21 | ||||||
| Male | 31 | 70.4 | 70 | 56.4 | 34 | 54.8 | |
| Female | 13 | 29.6 | 54 | 43.6 | 28 | 45.2 | |
| Nationality | <0.001 | ||||||
| Spanish | 32 | 72.7 | 121 | 97.6 | 58 | 93.6 | |
| Other | 12 | 27.3 | 3 | 2.4 | 4 | 6.4 | |
| Median | Median | Median | |||||
| Age, years | 54.8 | 22.2 | 68.1 | 12.7 | 68.8 | 18.5 | 0.001 |
| ADA, U/l | 72 | 57.3–81 | 22 | 18–27.5 | 23 | 19–28 | <0.001 |
| pH | 7.4 | 7.4–7.5 | 7.4 | 7.4–7.5 | 7.5 | 7.4–7.5 | 0.08 |
| Glucose | 84.5 | 66–96 | 101 | 84–120.5 | 110.5 | 92–128 | <0.001 |
| Cell no. | 2315 | 1338.5–4126 | 1760.5 | 1140–2905 | 1715.5 | 618–3056 | 0.06 |
| MNC | 93 | 80–97.8 | 91.5 | 79.3–96 | 75 | 59–86 | <0.001 |
| PMNC | 7 | 2.2–20 | 8.5 | 4.05–20.75 | 25.5 | 14–41 | <0.001 |
| RBC | 3650 | 1805–10000 | 8000 | 2840–29300 | 6950 | 2700–40000 | 0.04 |
| LDH, U/l | 444.5 | 266.5–651.5 | 403 | 212–623 | 212.5 | 165–372 | <0.001 |
| Proteins | 5 | 4.6–5.3 | 4.4 | 3.8–4.8 | 4.3 | 3.5–4.9 | <0.001 |
| LDH / ADA | 6.1 | 4.2–10.7 | 15.5 | 9–28.4 | 10.1 | 7.3–15.9 | <0.001 |
ADA: adenosine deaminase; Cell no: number of cells/mm3; MNC: percentage of mononuclear cells in pleural fluid; PMNC: percentage of polymorphonuclear cells in pleural fluid; RBC: number of red blood cells/mm3. LDH: lactate dehydrogenase.
a Value shown as Mean and Standard deviation.
Fig 3Receiver operating characteristic curves using validation predictions.
The dots correspond to the points that maximize the Youden index.
Fig 4Precision-recall curves using validation predictions.
Precision-recall curve for each method using the validation predictions. The dots correspond to the points that maximize the F1 score.
Accuracy (Acc), sensitivity (SEN), specificity (SPF) and F1 score (F1) of all the classifiers in the validation stage, using three different thresholds (T): 0.5, the one that maximizes the Youden index on the receiver operating characteristic curve and the one that maximizes the F1 score on the precision-recall curve, and their confidence intervals at 95% level.
| Method | T | AUC | Acc (95% CI) | SEN (95% CI) | SPF (95% CI) | F1 (95% CI) |
|---|---|---|---|---|---|---|
|
| ||||||
| Logit | 0.5 | 0.97 | 0.92 (0.87,0.95) | 0.71 (0.54, 0.85) | 0.97 (0.92,0.99) | 0.77 (0.65, 0.87) |
| SVC | 0.5 | 0.98 | 0.97 (0.93,0.99) | 0.89 (0.73,0.97) | 0.99 (0.95,1.00) | 0.91 (0.83, 0.97) |
| DT | 0.5 | 0.97 | 0.92 (0.88, 0.96) | 0.83 (0.66,0.93) | 0.95 (0.90,0.98) | 0.81 (0.69, 0.90) |
| KNN | 0.5 | 0.94 | 0.91 (0.86,0.95) | 0.66 (0.48, 0.81) | 0.97 (0.93,0.99) | 0.74 (0.60, 0.86) |
| RF | 0.5 | 0.98 | 0.94 (0.90,0.97) | 0.83 (0.66,0.93) | 0.97 (0.92,0.99) | 0.84 (0.73, 0.92) |
| MLP | 0.5 | 0.98 | 0.96 (0.92,0.98) | 0.89 (0.73,0.97) | 0.97 (0.93,0.99) | 0.89 (0.80, 0.96) |
|
| ||||||
| Logit | 0.26 | 0.97 | 0.93 (0.89, 0.97) | 1.00 (0.90,1) | 0.92 (0.86,0.96) | 0.85 (0.76, 0.92) |
| SVC | 0.29 | 0.98 | 0.96 (0.92, 0.98) | 0.94 (0.81,0.99) | 0.97 (0.92,0.99) | 0.90 (0.82, 0.97) |
| DT | 0.25 | 0.97 | 0.93 (0.89, 0.97) | 0.94 (0.81,0.99) | 0.93 (0.88,0.97) | 0.85 (0.75, 0.92) |
| KNN | 0.30 | 0.94 | 0.93 (0.89,0.97) | 0.86 (0.70,0.95) | 0.95 (0.91,0.98) | 0.83 (0.72, 0.92) |
| RF | 0.19 | 0.98 | 0.92 (0.88,0.96) | 0.97 (0.85,1.00) | 0.91 (0.86,0.95) | 0.83 (0.73, 0.91) |
| MLP | 0.17 | 0.98 | 0.96 (0.92,0.98) | 0.94 (0.81,0.99) | 0.97 (0.92,0.99) | 0.90 (0.82, 0.97) |
|
| ||||||
| Logit | 0.28 | 0.97 | 0.95 (0.90,0.97) | 0.97 (0.85,1.00) | 0.94 (0.89,0.97) | 0.87 (0.78, 0.94) |
| SVC | 0.35 | 0.98 | 0.97 (0.93,0.99) | 0.91 (0.77,0.98) | 0.98 (0.94,1.00) | 0.91 (0.84, 0.97) |
| DT | 0.40 | 0.97 | 0.94 (0.90,0.97) | 0.91 (0.77,0.98) | 0.95 (0.90,0.98) | 0.85 (0.76, 0.93) |
| KNN | 0.30 | 0.94 | 0.93 (0.89,0.97) | 0.86 (0.70,0.95) | 0.95 (0.91,0.98) | 0.83 (0.72, 0.92) |
| RF | 0.43 | 0.98 | 0.95 (0.91,0.98) | 0.91 (0.77, 0.98) | 0.96 (0.91,0.99) | 0.88 (0.78,0.95) |
| MLP | 0.17 | 0.98 | 0.96 (0.92,0.98) | 0.94 (0.81, 0.99) | 0.97 (0.92,0.99) | 0.90 (0.82, 0.97) |
Threshold (T), area under the curve (AUC), accuracy (Acc), sensitivity (SEN), specificity(SPF) and F1 score (F1) of all the classifiers in the test stage, using the best thresholds found in the validation stage with their confidence intervals at 95% level.
| Method | T | AUC | Acc (95% CI) | SEN (95% CI) | SPF (95% CI) | F1 (95% CI) |
|---|---|---|---|---|---|---|
| Logit | 0.28 | 0.98 | 0.96 (0.85,0.99) | 1.00 (0.66,1.00) | 0.95 (0.82,0.99) | 0.90 (0.71,1.00) |
| SVC | 0.35 | 0.96 | 0.93 (0.82,0.99) | 0.89 (0.52,1.00) | 0.95 (0.82,0.99) | 0.84 (0.60,1.00) |
| DT | 0.40 | 0.96 | 0.96 (0.85,0.99) | 1.00 (0.66,1.00) | 0.95 (0.82,0.99) | 0.90 (0.71,1.00) |
| KNN | 0.30 | 0.96 | 0.91 (0.79,0.98) | 0.78 (0.40,0.97) | 0.95 (0.82,0.99) | 0.78 (0.48,0.96) |
| RF | 0.43 | 0.98 | 0.93 (0.82,0.99) | 0.89 (0.52,1.00) | 0.95 (0.82,0.99) | 0.84 (0.63,1.00) |
| MLP | 0.17 | 0.98 | 0.93 (0.82,0.99) | 0.89 (0.52,1.00) | 0.95 (0.82,0.99) | 0.84 (0.59,1.00) |
Fig 5Post-test probability of TPE after positive (top) or negative (bottom) results of 1) ADA>40 U/l plus implicit lymphocyte percentage >50% alone or 2) in addition to age and routine pleural fluid parameters included in the machine learning algorithms; for different pre-test probabilities of disease.
Comparative table of Bayesian probabilities of test parameters used: ADA > 40 U/l (plus lymphocyte percentage > 50%) versus the whole set of variables included in the machine learning algorithms.
PPV: positive predictive value. NPV: negative predictive value.
|
|
| |||
| Sensitivity | 98% | 91% | ||
| Specificity | 93% | 98% | ||
| Pre-test probability | PPV | NPV | PPV | NPV |
| 5% | 42.4% | 99.9% | 70.5% | 99.5% |
| 10% | 60.8% | 99.7% | 83.5% | 99.0% |
| 15% | 71.2% | 99.6% | 88.9% | 98.4% |
| 20% | 77.8% | 99.4% | 91.9% | 97.8% |
| 30% | 85.7% | 99.0% | 95.1% | 96.2% |
| 40% | 90.3% | 98.4% | 96.8% | 94.2% |
| 50% | 93.3% | 97.6% | 97.8% | 91.6% |