| Literature DB >> 36231500 |
Dorian Culié1,2, Renaud Schiappa2, Sara Contu2, Boris Scheller1,2, Agathe Villarme1, Olivier Dassonville1, Gilles Poissonnet1, Alexandre Bozec1,2, Emmanuel Chamorey2.
Abstract
The selection of patients for the constitution of a cohort is a major issue for clinical research (prospective studies and retrospective studies in real life). Our objective was to validate in real life conditions the use of a Deep Learning process based on a neural network, for the classification of patients according to the pathology involved in a head and neck surgery department. 24,434 Electronic Health Records (EHR) from the first visit between 2000 and 2020 were extracted. More than 6000 EHR were manually classified in ten groups of interest according to the reason for consultation with a clinical relevance. A convolutional neural network (TensorFlow, previously reported by Hsu et al.) was then used to predict the group of patients based on their pathology, using two levels of classification based on clinically relevant criteria. On the first and second level of classification, macro-average performances were: 0.95, 0.83, 0.85, 0.97, 0.84 and 0.93, 0.76, 0.83, 0.96, 0.79 for accuracy, recall, precision, specificity and F1-score versus accuracy, recall and precision of 0.580, 580 and 0.582 for Hsu et al., respectively. We validated this model to predict the pathology involved and to constitute clinically relevant cohorts in a tertiary hospital. This model did not require a preprocessing stage, was used in French and showed equivalent or better performances than other already published techniques.Entities:
Keywords: cohort constitution; neural network; patient classification
Mesh:
Year: 2022 PMID: 36231500 PMCID: PMC9564535 DOI: 10.3390/ijerph191912200
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Figure 1Flow chart of the two levels of classification.
Distribution of patients according to groups in the initial dataset.
| Group | Number of Patients (%) |
|---|---|
| Thyroid and parathyroid pathology | 2509 (38.92) |
| Salivary gland pathology | 283 (4.39) |
| Head and neck skin pathology | 841 (13.04) |
| Oral cavity | 618 (9.58) |
| Hypopharynx/larynx | 659 (10.22) |
| Oropharynx | 431 (6.68) |
| Nasopharynx | 38 (0.58) |
| Isolated cervical lymph nodes | 363 (5.63) |
| Nasal cavity and sinuses | 66 (1.02) |
| Other | 638 (9.89) |
|
|
|
Algorithm performance for first classification level.
| Groups | UADT | Thyroid and Parathyroid Pathology | Head and Neck Skin Pathology | Other | Salivary Gland Pathology | Macro-Average |
|---|---|---|---|---|---|---|
| 438 (33.4) | 538 (41.0) | 174 (13.3) | 111 (8.5) | 51 (3.9) | 1312 (100) | |
|
| 0.91 | 0.96 | 0.98 | 0.93 | 0.98 | 0.95 |
|
| 0.89 | 0.96 | 0.92 | 0.58 | 0.78 | 0.83 |
|
| 0.84 | 0.95 | 0.93 | 0.70 | 0.84 | 0.85 |
|
| 0.92 | 0.97 | 0.99 | 0.97 | 0.99 | 0.97 |
|
| 0.87 | 0.96 | 0.93 | 0.63 | 0.81 | 0.84 |
UADT: upper aerodigestive tract.
Figure 2Confusion matrix of first level classification. UADT: upper aerodigestive tract.
Algorithm performance for second classification level.
| Groups | Isolated Cervical Lymph Nodes | Oral Cavity | Oro-Pharynx | Hypopharynx/Larynx | Nasal Cavity and Sinuses | Naso-Pharynx | Macro-Average |
|---|---|---|---|---|---|---|---|
| 71 (16.3) | 120 (27.6) | 88 (22.2) | 142 (32.6) | 8 (1.8) | 6 (1.4) | 435 (100) | |
|
| 0.93 | 0.90 | 0.88 | 0.91 | 0.99 | 0.99 | 0.93 |
|
| 0.77 | 0.81 | 0.70 | 0.91 | 0.73 | 0.63 | 0.76 |
|
| 0.77 | 0.84 | 0.73 | 0.82 | 1 | 0.83 | 0.83 |
|
| 0.96 | 0.94 | 0.93 | 0.92 | 1 | 1 | 0.96 |
|
| 0.77 | 0.82 | 0.72 | 0.86 | 0.84 | 0.71 | 0.79 |
Figure 3Confusion matrix of second level classification. UADT: upper aerodigestive tract.