| Literature DB >> 36083977 |
Qi Liu1, Jinyang Du2, Yuge Li3, Guiyuan Peng3, Xuefang Wang1, Yong Zhong1, Ruxu Du1.
Abstract
Nasopharyngeal carcinoma (NPC) is one of the most common types of cancers in South China and Southeast Asia. Clinical data has shown that early detection is essential for improving treatment effectiveness and survival rate. Unfortunately, because the early symptoms of NPC are rather minor and similar to that of diseases such as Chronic Rhinosinusitis (CRS), early detection is a challenge. This paper proposes using machine learning methods to detect NPC using routine medical test data, namely Random Forest (RF), Support Vector Machine (SVM), and Artificial Neural Network (ANN), k-Nearest-Neighbor (KNN) and Logistic Regression (LR). We collected a dataset containing 523 newly diagnosed NPC patients before treatment, 501 newly diagnosed CRS patients before treatment as well as 600 healthy controls. The routine medical test data including age, gender, blood test features, liver function test features, and urine sediment test features. For comparison, we also used data from Epstein-Barr Virus (EBV) antibody tests, which is a specialized test not included among routine medical tests. In our first test, all four methods were tested on classifying NPC vs CRS vs controls; RF gives the best overall performance. Using only routine medical test data, it gives an accuracy of 83.1%, outperforming LR by 12%. In our second test, using only routine medical test data, when classifying NPC vs non-NPC (i.e. CRS or controls), RF achieves an accuracy of 88.2%. In our third test, when classifying NPC vs. controls, RF using only routine test data achieves an accuracy significantly better than RF using only EBV antibody data. Finally, in our last test, RF trained with NPC vs controls, using routine test data only, continued to perform well on an entirely separate dataset. This is a promising result because preliminary NPC detection using routine medical data is easy and inexpensive to implement. We believe this approach will play an important role in the detection and treatment of NPC in the future.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36083977 PMCID: PMC9462828 DOI: 10.1371/journal.pone.0274263
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1The MRI images.
(a) a Stage II NPC patient (b) a CRS patient.
The subjects’ features.
| NPC Patients | CRS | Controls | ||
|---|---|---|---|---|
|
| ||||
|
| 1.28e-8 | |||
| Male | 403 (77.1%) | 307(61.3%) | 376 (62.7%) | |
| Female | 120 (22.9%) | 194(38.7%) | 224 (37.3%) | |
|
| 2.69e-19 | |||
| 19–40 years | 118 (22.6%) | 203(40.5%) | 239 (39.8%) | |
| 41–60 years | 273 (52.2%) | 217(43.3%) | 299 (49.8%) | |
| >60 years | 132 (25.2%) | 81(16.2%) | 62 (10.3%) | |
|
| ||||
| WBC | 7.01±2.11 | 7.04±2.29 | 6.14±1.17 | 2.25e-18 |
| NEUT | 4.47±1.89 | 4.45±1.96 | 3.51±0.92 | 8.92e-28 |
| LYM | 1.77±0.64 | 2.08±2.06 | 2.06±0.45 | 2.22e-5 |
| MONO | 0.50±0.20 | 0.42±0.18 | 0.37±0.10 | 4.99e-40 |
| EOSIN | 0.21±0.25 | 0.28±2.06 | 0.16±0.10 | 0.21 |
| BASO | 0.03±0.02 | 0.06±0.31 | 0.03±0.01 | 0.014 |
| RBC | 4.79±1.78 | 5.36±8.35 | 4.92±0.36 | 0.129 |
| Hb | 137.38±17.20 | 141.87±17.78 | 150.17±11.04 | 1.05e-41 |
| PLT | 262.92±70.55 | 253.22±61.74 | 240.59±45.51 | 2.87e-9 |
|
| ||||
| LAP | 30.30±48.27 | 24.88±3.54 | 24.46±3.55 | 0.001 |
| ADA | 9.96±4.20 | 8.83±2.59 | 8.96±2.63 | 3.45e-9 |
| PA | 257.59±57.14 | 296.50±44.13 | 296.17±43.12 | 3.66e-48 |
| ALT | 22.41±29.28 | 20.31±13.79 | 20.28±8.65 | 0.112 |
| AST | 20.86±17.22 | 18.82±6.79 | 20.29±4.07 | 0.008 |
| GGT | 34.13±56.40 | 27.27±19.83 | 23.42±10.74 | 1.27e-6 |
| ALP | 78.20±38.49 | 73.55±23.34 | 67.38±14.13 | 1.30e-10 |
| TBIL | 9.90±4.07 | 10.56±10.10 | 11.10±3.45 | 0.008 |
| DBIL | 3.82±1.59 | 3.98±3.49 | 3.94±1.10 | 0.487 |
| IBIL | 6.09±2.85 | 6.58±9.58 | 7.14±2.50 | 0.009 |
| TBA | 4.49±6.70 | 3.39±5.05 | 2.79±1.64 | 2.64e-8 |
| Urea | 5.31±13.92 | 5.30±10.75 | 4.77±1.01 | 0.582 |
| Cr | 82.48±40.83 | 72.43±16.99 | 79.07±11.74 | 3.50e-9 |
|
| ||||
| SG | 1.02±0.04 | 1.02±0.01 | 1.02±0.01 | 0.006 |
| PH | 5.89±0.64 | 6.04±0.61 | 6.13±0.68 | 5.94e-9 |
| LEU | 13.43±69.38 | 17.94±77.31 | 6.08±42.30 | 0.008 |
| PRO | 0.02±0.23 | 0.08±1.54 | 0.00±0.06 | 0.275 |
| NTT | 0 | 0 | 0 | \ |
| U_WBC | 10.41±87.01 | 7.06±39.93 | 1.49±1.32 | 0.020 |
| Crystal | 5.24±24.10 | 2.43±11.89 | 0.40±2.81 | 8.92e-7 |
In the table, the following abbreviations are used: WBC—white blood cell count; NEUT—Neutrophil count; LYM—Lymphocyte count; MONO—Monocyte count; EOSIN—Eosinophil count; BASO—Basophil count; RBC—Red blood cell count; Hb—Hemoglobin determination; PLT—Platelet count; LAP—Leucine aminopeptidase; ADA—Adenosine deaminase; PA—Prealbumin; ALT—Alanine aminotransferase; AST—Aspartate aminotransferase; GGT— γ-glutamyl transferase; ALP—Alkaline phosphatase; TBIL—Total bilirubin; DBIL—Direct bilirubin; IBIL—Indirect bilirubin; TBA—Total bile acid; Urea—Urea; Cr—Creatinine; SG—Urine Specific Gravity; PH—Urine Ph; LEU—Urinary leukocyte esterase; PRO—Urine protein; NTT—Urine nitrite; U_WBC—Urine white blood cell count; Crystal—Crystal count.
Fig 2The confusion matrices.
The performances of the five machine learning methods.
| Method | Precision | Recall | Accuracy | AUC |
|---|---|---|---|---|
| (95%CI) | (95%CI) | (95%CI) | (95%CI) | |
| RF | 83.3% | 82.4% | 83.1% | 0.954 |
| (81.5~85.1) % | (76.9~87.9) % | (80.8~85.4) % | (0.947~0.961) | |
| LR | 70.8% | 70.4% | 71.2% | 0.874 |
| (67.6~74.0) % | (63.8~77.1) % | (69.2~73.2) % | (0.865~0.884) | |
| SVM | 70.6% | 70.2% | 70.9% | 0.875 |
| (67.5~73.6) % | (64.0~76.4) % | (69.9~72.0) % | (0.867~0.883) | |
| ANN | 70.2% | 69.3% | 70.5% | 0.868 |
| (68.4~71.9) % | (59.1~79.4) % | (69.7~71.3) % | (0.865~0.871) | |
| KNN | 70.5% | 68.7% | 70.0% | 0.864 |
| (67.4~73.6) % | (59.7~77.7) % | (68.5~71.5) % | (0.855~0.873) |
The performance of RF.
| Precision | Recall | Accuracy | AUC | |
|---|---|---|---|---|
| (95%CI) | (95%CI) | (95%CI) | (95%CI) | |
| Healthy (Label 0) | 84.2% | 94.1% | ||
| (81.3~87.0) % | (91.9~96.4) % | |||
| NPC (Label 1) | 80.2% | 83.4% | ||
| (77.2~83.1) % | (78.7~88.0) % | |||
| CRS (Label 2) | 85.5% | 69.7% | ||
| (83.6~87.4) % | (67.7~71.7) % | |||
| Average | 83.3% | 82.4% | 83.1% | 0.954 |
| (81.5~85.1) % | (76.9~87.9) % | (80.8~85.4) % | (0.947~0.961) |
Fig 3Feature importance ranking from RF.
Performance of five methods: NPC vs mix of 50% CRS and 50% controls.
| Method | Precision | Recall | Accuracy | AUC |
|---|---|---|---|---|
| (95%CI) | (95%CI) | (95%CI) | (95%CI) | |
| RF | 87.9% | 84.7% | 88.2% | 0.942 |
| (86.6~89.2) % | (77.9~91.5) % | (86.7~89.7) % | (0.931~0.954) | |
| LR | 83.9% | 81.7% | 85.3% | 0.907 |
| (81.2~86.6) % | (74.8~88.6) % | (83.7~86.9) % | (0.891~0.922) | |
| SVM | 83.9% | 81.4% | 85.2% | 0.905 |
| (81.1~86.7) % | (74.2~88.5) % | (83.3~87.1) % | (0.889~0.922) | |
| ANN | 80.5% | 78.7% | 82.4% | 0.886 |
| (76.5~84.4) % | (71.2~86.3) % | (81.9~82.9) % | (0.872~0.901) | |
| KNN | 83.3% | 74.6% | 82.1% | 0.888 |
| (80.8~85.8) % | (60.9~88.3) % | (80.7~83.5) % | (0.882~0.895) |
Performance of RF: NPC vs CRS.
| Precision | Recall | Accuracy | AUC | |
|---|---|---|---|---|
| (95%CI) | (95%CI) | (95%CI) | (95%CI) | |
| CRS | 85.3% | 81.0% | ||
| (81.6~89.1) % | (78.5~83.5) % | |||
| NPC | 82.7% | 86.4% | ||
| (81.2~84.2) % | (82.1~90.7) % | |||
| Average | 84.0% | 83.7% | 83.8% | 0.920 |
| (81.9~86.1) % | (80.8~86.7) % | (81.8~85.7) % | (0.906~0.933) |
Fig 4Feature importance ranking when classifying NPC vs. a mixture of 50% CRS and 50% controls.
Fig 5Feature importance ranking when classifying NPC vs. CRS.
Top importance rankings given by RF.
| NPC vs mixed | NPC vs CRS | |||
|---|---|---|---|---|
| Feature index | Weight | Feature index | Weight | |
| 1 | PA | 0.175 | PA | 0.187 |
| 2 | LAP | 0.101 | LAP | 0.129 |
| 3 | MONO | 0.088 | ADA | 0.074 |
| 4 | ADA | 0.053 | MONO | 0.059 |
| 5 | Hb | 0.052 | Cr | 0.047 |
| 6 | Crystal | 0.045 | SG | 0.040 |
| 7 | LYM | 0.044 | LYM | 0.039 |
| 8 | ALP | 0.038 | Age | 0.039 |
The performance of RF in detecting NPC.
| Use routine medical test only | Use EBV only | Use both | |
|---|---|---|---|
| Accuracy (95%CI) | 95.0% (93.8~96.2)% | 90.4% (87.5~93.2)% | 96.9% (95.1~98.7)% |
| Sensitivity (95%CI) | 93.3% (90.3~96.3)% | 89.8% (86.9~92.8)% | 96.9% (94.5~99.3)% |
| Specificity (95%CI) | 96.5% (94.7~98.4)% | 90.8% (87.9~93.8)% | 96.8% (95.5~98.2)% |
| Youden index (95%CI) | 89.8% (87.2~92.4)% | 80.7% (74.9~86.4)% | 93.8% (90.1~97.5)% |
| AUC (95%CI) | 0.986 (0.983~0.989) | 0.928 (0.905~0.952) | 0.990 (0.982~0.998) |
| MCC (95%CI) | 0.901 (0.877~0.924) | 0.807 (0.749~0.864) | 0.937 (0.900~0.974) |
Statistics of misclassified subjects.
| Use routine medical data only | Use EBV only | Use both | ||
|---|---|---|---|---|
| False Positive | 21 (3.5%) | 55 (9.2%) | 19 (3.2%) | |
| False Negative | Stage II | 3 (7.5%) | 3 (7.5%) | 2 (5.0%) |
| Stage III | 18 (7.0%) | 30 (11.7%) | 9 (3.5%) | |
| Stage IV | 14 (6.2%) | 20 (8.8%) | 5 (2.2%) | |
| Total | 56 (5.0%) | 108 (9.6%) | 35 (3.1%) | |
Performance of RF on the secondary dataset.
| Precision | Recall | Accuracy | AUC | |
|---|---|---|---|---|
| (95%CI) | (95%CI) | (95%CI) | (95%CI) | |
| Controls | 88.2% | 96.8% | ||
| (87.8~88.6) % | (95.7~97.9) % | |||
| NPC | 96.5% | 87.1% | ||
| (95.4~97.7) % | (86.5~87.7) % | |||
| Average | 92.4% | 91.9% | 91.9% | 0.975 |
| (89.6~95.2) % | (88.7~95.2) % | (91.5~92.3) % | (0.971~0.979) |