| Literature DB >> 36201136 |
Michele Fraccaroli1, Arnaud Nguembang Fadja2, Alice Bizzarri3, Giulia Mazzuchelli3, Evelina Lamma3.
Abstract
Recently, Artificial Intelligence (AI) and Machine Learning (ML) have been successfully applied to many domains of interest including medical diagnosis. Due to the availability of a large quantity of data, it is possible to build reliable AI systems that assist humans in making decisions. The recent Covid-19 pandemic quickly spread over the world causing serious health problems and severe economic and social damage. Computer scientists are actively working together with doctors on different ML models to diagnose Covid-19 patients using Computed Tomography (CT) scans and clinical data. In this work, we propose a neural-symbolic system that predicts if a Covid-19 patient arriving at the hospital will end in a critical condition. The proposed system relies on Deep 3D Convolutional Neural Networks (3D-CNNs) for analyzing lung CT scans of Covid-19 patients, Decision Trees (DTs) for predicting if a Covid-19 patient will eventually pass away by analyzing its clinical data, and a neural system that integrates the previous ones using Hierarchical Probabilistic Logic Programs (HPLPs). Predicting if a Covid-19 patient will end in a critical condition is useful for managing the limited number of intensive care at the hospital. Moreover, knowing early that a Covid-19 patient could end in serious conditions allows doctors to gain early knowledge on patients and provide special treatment to those predicted to finish in critical conditions. The proposed system, entitled Neural HPLP, obtains good performance in terms of area under the receiver operating characteristic and precision curves with values of about 0.96 for both metrics. Therefore, with Neural HPLP, it is possible not only to efficiently predict if Covid-19 patients will end in severe conditions but also possible to provide an explanation of the prediction. This makes Neural HPLP explainable, interpretable, and reliable. Graphical abstract Representation of Neural HPLP. From top to bottom, the two different types of data collected from the same patient and used in this project are represented. This data feeds the two different machine learning systems and the integration of the two systems using Hierarchical Probabilistic Logic Program.Entities:
Keywords: Covid-19; Decision Trees; Deep Learning; Hierarchical Probabilistic Logic Program; Severity
Year: 2022 PMID: 36201136 PMCID: PMC9540054 DOI: 10.1007/s11517-022-02674-1
Source DB: PubMed Journal: Med Biol Eng Comput ISSN: 0140-0118 Impact factor: 3.079
Fig. 1Neural-Symbolic Integration system: DT and 3D-CNN are integrated using HPLP
Clinical data for the main experiment
| Clinical attribute | Acronym |
|---|---|
| Age | - |
| Gender | - |
| Organization Cost Centre | CdcoUO |
| Intensification of care | - |
| Pneumology department | - |
| Anesthesia and resuscitation department | - |
| Clinical onset with fever | - |
| Hospitalization day | - |
| Discharge day | - |
| In-hospital days | - |
| Symptoms cardiopulmonary onset | - |
| Gastrointestinal onset symptoms | - |
| Systolic Blood Pressure at the entrance | SBP |
| Diastolic Blood Pressure at the entrance | DBP |
| Heart rate | - |
| Breath frequency | - |
| Body temperature | - |
| Modified Early Warning Score | MEWS |
| Partial pressure of oxygen in a gaseous environment | pO2 |
| PO2 / FiO2 ratio | PF |
| High Resolution TC | HRTC |
| High Resolution TC per ground glass | HRTCpergrpoundglass |
| White blood cells | WBC |
| Lymphocytes | - |
| C-reactive Protein | CRP |
| Procalcitonin | PCT |
| Creatinine | - |
| Glomerular Filtration Rate | GFR |
| Lactate Dehydrogenase | LDH |
| Brain Natriuretic Peptide | BNP |
| Fibrinogen | - |
| D-Dimero | - |
| Isoamylase | - |
| Alanine Aminotransferase | ALT |
| Creatine Phosphokinase | CPK |
| Ferritin | - |
| Troponin | - |
| Smoking habit | - |
| Hypertension | - |
| Ischemic heart disease | - |
| Heart failure | - |
| IRCIIIIVV | - |
| ICTUSoTIA | - |
| Chronic Peripheral Obliterative Arteriopathy | AOCP |
| Chronic Obstructive Pulmonary Disease | COPD |
| Mild liver disease | - |
| Moderate liver disease | - |
| Peptic ulcer | - |
| AIDS | - |
| Hemiplegia | - |
| Localized or hematological neoplasm | - |
| Metastasis | - |
| Dementia | - |
| Charlson index | - |
| Microcythemia | - |
| Inflammatory Bowel Disease | IBD |
| Diabetes | - |
| Diabetes without organ damage | - |
| Diabetes with organ damage | - |
Fig. 2Example of images of a slide of a DICOM voxel for the three classes. From left to right images belonging to class CT-0, CT-1 and CT-234 respectively
Fig. 3Segmentation of CT scans. The odd images represent an original slice of the DICOM voxel that depict the lungs of the patient. The even images represent the binary masks obtained after the pre-processing
Fig. 4Generic Hierarchical Probabilistic Logic Program
Clinical data for the additional experiment
| Clinical attribute | Clinical attribute |
|---|---|
| Age | Alkaline phosphatase |
| Sex | Alanine aminotransferase |
| Temperature | Aspartate aminotransferase |
| malattie pregresse | Urea nitrogen |
| covid | Calcium |
| CT | Chlorine |
| Morbidity | Total carbon dioxide |
| Mortality | Creatinine |
| Erythrocyte sedimentation rate | Latitude-glutamyltransferase |
| C-reactive protein | Globulin |
| Procalcitonin | Potassium |
| Mean corpuscular hemoglobin concentration | Magnesium |
| Mean corpuscular hemoglobin | Sodium |
| Mean corpuscular volume | Phosphorus |
| Hematocrit | Total bilirubin |
| Hemoglobin | Serum total protein |
| Red blood cell | Uric acid |
| Platelet distribution width | Total cholesterol |
| Plateletcrit | Creatine kinase |
| Mean platelet volume | High density lipoprotein cholesterol |
| Platelet count | Lactate dehydrogenase |
| Basophil count | Triglyceride |
| Eosinophil count | Anion gap |
| Monocyte count | Direct bilirubin |
| Lymphocyte count | Glucose |
| Neutrophil count | Low density lipoprotein cholesterol |
| Basophil percent | Osmotic pressure |
| Eosinophil percent | Prealbumin |
| Monocyte percent | Total bile acids |
| Lymphocyte percent | Pseudo-hydroxybutyrate dehydrogenase |
| Neutrophil percent | Cystatin C |
| White blood cell | Leucine aminopeptidase |
| Platelet larger cell ratio | 5’nucleotidase |
| Standard deviation of red cell volume distribution width | Homocysteine |
| Coefficient variation of red cell volume distribution width | Serum amyloid protein A |
| D-Dimer | Small density low density lipoprotein |
| Thrombin time | CD3+ T cell |
| Fibrinogen | CD4+ T cell |
| Activated partial thromboplastin time | CD8+ T cell |
| International normalization ratio | B lymphocyte |
| Prothrombin time | Natural killer cell |
| Albumin/Globulin ratio | CD4/CD8 ratio |
| Albumin | Interleukin-2 |
| Interleukin-4 | White blood cell count |
| Interleukin-6 | Squamous epithelial cell |
| Interleukin-10 | Viscose rayon |
| TNF-pseudo | Unclassified crystal |
| IFN-latitude | Specific gravity |
| Fibrin/fibrinogen degradation products | Complement C1q |
| Antithrombin III | Hyaline cast |
| B-type brain natriuretic peptide precursor | Pathological cast |
| Indirect bilirubin | pH |
| Fungi (1-3)-tail-D-glucan | Complement C3 |
| Urea | Immunoglobulin M |
| High-sensitivity C-reactive protein | Immunoglobulin A |
| Red blood cell count | Immunoglobulin G |
| Non-squamous epithelial cell | Yeast |
| Choline esterase | Complement C4 |
| Sialic acid | Lipase |
| Pseudo-L-Fucosidase | Anti-streptolysin O |
| Lipoprotein A | Rheumatoid factor |
| Apolipoprotein A1 | Bacterial count |
| Apolipoprotein B | Lactic acid |
| Leukocyte mass |
Areas under the curves and loss for SLEAHP_DEEP
| AUCROC | AUCPR | Loss | |
|---|---|---|---|
| Fold 1 | 0.67347 | 0.80148 | -10.80451 |
| Fold 2 | 0.83333 | 0.90110 | -8.99724 |
| Fold 3 | 1.00000 | 1.00000 | -5.57603 |
| Average |
Areas under the curves and loss for SLEAHP_EM
| AUCROC | AUCPR | Loss | |
|---|---|---|---|
| Fold 1 | 0.95918 | 0.95876 | -4.64181 |
| Fold 2 | 0.93750 | 0.94097 | -5.45861 |
| Fold 3 | 1.00000 | 1.00000 | -4.17694 |
| Average |
Fig. 5SLEAHP_DEEP loss function: training using the first two folds
Fig. 6SLEAHP_EM loss function: training using the first two folds