| Literature DB >> 35271646 |
Judith Somekh1, Nir Lotan1, Ehud Sussman1, Gur Arye Yehuda1.
Abstract
BACKGROUND: Mechanical ventilation (MV) is a lifesaving therapy used for patients with respiratory failure. Nevertheless, MV is associated with numerous complications and increased mortality. The aim of this study is to define the effects of MV on gene expression of direct and peripheral human tissues.Entities:
Mesh:
Year: 2022 PMID: 35271646 PMCID: PMC8912236 DOI: 10.1371/journal.pone.0264919
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Comparison between the machine learning models used in the study.
| Model | Advantages | Disadvantages |
|---|---|---|
| XGBoost | • Effective for a relatively small number of samples with a large number of features | • May exhibit overfitting if hyperparameters are not adjusted correctly |
| • Encapsulated explainability capabilities that can help validate the correctness of the model, e.g., by checking the relevance of the most significant gene levels to the tested condition | • Applicable for numeric features only | |
| • Includes improvements to the original gradient boosting model that increase the performance and the accuracy of the results | ||
| RF | • Easy to implement both for classification and regression tasks | • Lower performance than more modern methods |
| • Provides some level of explainability | • Nonoptimal performance when classes are unbalanced | |
| • Avoids overfitting | ||
| ANN | • Excels at cognitive tasks (image/video/text/voice data) | • Requires a large number of samples |
| • Hard to explain and detect the feature importance | ||
| • May be less effective with tabular data |
Binary classification model evaluations.
| Evaluation | Average Accuracy | Average F1_score | Average Recall | Average Precision |
|---|---|---|---|---|
| Model | ||||
| Neural Net | 0.934 | 0.916 |
| 0.903 |
| Random Forest | 0.912 | 0.880 | 0.862 | 0.905 |
| XGBoost |
|
| 0.924 |
|
Fig 1AUC comparison of the 18 classifiers.
It can be seen that the XGBoost model outperforms the RF and ANN models for the six tested tissues.
Binary classification models’ AUCs for the different organs.
| AUC per organ | Average AUC | Adipose-Subcutaneous | Liver | Lung | Muscle-Skeletal | Nerve-Tibial | Skin-Sun Exposed (Lower leg) |
|---|---|---|---|---|---|---|---|
| Model | |||||||
| Neural Net | 0.934 | 0.939 | 0.931 | 0.941 | 0.940 | 0.937 | 0.913 |
| Random Forest | 0.912 | 0.879 | 0.953 | 0.933 | 0.931 | 0.913 | 0.864 |
| XGBoost |
|
|
|
|
|
|
|
Fig 2Top 20 genes and average SHAP impact (absolute values) on the magnitude of model classification output.
(A) Values for the lung tissue. (B) Values for muscle-skeletal tissue. It can be seen that GKN1 gene expression values have the highest impact on the MV classification.
Top 20 genes with highest importance SHAP scores in each tissue.
| Adi-pose | Score | Liver | Score | Lung | Score | Muscle | Score | Nerve Tibial | Score | Skin | Score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| 0.49 | MGMT | 0.63 | PHF13 | 0.30 |
| 0.52 | FDCSP | 0.54 | AVP | 0.53 |
|
| VENTX | 0.42 | DNAJB4 | 0.55 | MT4 | 0.29 |
| 0.34 | HSD11B2 | 0.50 |
| 0.41 |
|
|
| 0.31 | C5orf24 | 0.26 | HRG | 0.23 | TET1 | 0.26 | CRABP1 | 0.44 | ODF3L1 | 0.24 |
|
| PELO | 0.27 | TOR1A | 0.24 | TBC1D22B | 0.23 | FGF6 | 0.21 |
| 0.31 |
| 0.23 |
|
|
| 0.25 | GATAD1 | 0.22 | LCE5A | 0.20 | SMCO1 | 0.19 | SLN | 0.22 | 0.23 | |
|
| 0.22 | C12orf60 | 0.14 | ALPK3 | 0.18 | SPSB4 | 0.18 | SLITRK6 | 0.18 |
| 0.20 | |
|
|
| 0.20 | GPRIN1 | 0.13 |
| 0.17 | ZCCHC24 | 0.17 | 0.18 | NEURL2 | 0.19 | |
|
| MUC21 | 0.20 | KCNJ8 | 0.13 | LCE2C | 0.16 | 0.17 | XIRP1 | 0.15 | 0.19 | ||
|
| C22orf31 | 0.19 | DRD4 | 0.12 | CYP1A2 | 0.14 | CCND1 | 0.16 | HBQ1 | 0.14 | 0.17 | |
|
| CX3CL1 | 0.17 | ENTPD7 | 0.12 | ADRA1B | 0.14 |
| 0.15 | BHLHE40 | 0.13 | NIPSNAP3A | 0.15 |
|
| C10orf99 | 0.15 | SLC25A21-AS1 | 0.11 | TERF2IP | 0.12 |
| 0.14 | SFRP2 | 0.13 | FLG | 0.13 |
|
| DEFA6 | 0.15 | OPN1SW | 0.10 | LCE2A | 0.12 | FITM1 | 0.13 | GUCA2A | 0.12 | RD3 | 0.11 |
|
| PRND | 0.14 | PAQR8 | 0.09 | SST | 0.11 | MRPL16 | 0.13 | CD248 | 0.12 |
| 0.10 |
|
| SMCP | 0.13 | STX11 | 0.09 | B3GNT2 | 0.10 | TLL2 | 0.12 | 0.12 | ATP4B | 0.10 | |
|
| TNFRSF21 | 0.09 | LRRC40 | 0.08 | DYRK2 | 0.10 | SPRR2A | 0.12 | DKK4.00 | 0.09 | WFDC12 | 0.10 |
|
|
| 0.09 | CECR6 | 0.08 | DPRX | 0.10 | CEACAM6 | 0.12 | 0.09 | OCM2 | 0.10 | |
|
| HUS1B | 0.09 | OMG | 0.08 | CHP2 | 0.09 | SLN | 0.11 | DUPD1 | 0.09 | DUSP1 | 0.10 |
|
| TIFAB | 0.09 | IFITM10 | 0.08 | AGTR2 | 0.09 | SPATA25 | 0.10 | TAS2R46 | 0.08 | SCNN1G | 0.09 |
|
| P2RY13 | 0.08 | TMEM200C | 0.07 | SPINT3 | 0.09 | SOCS4 | 0.09 | DBX2 | 0.08 | LGALS7B | 0.09 |
|
| TCF21 | 0.08 | PSPN | 0.07 | LSM11 | 0.09 | TBC1D12 | 0.09 | PSD2 | 0.08 | IER2 | 0.09 |
Genes that are common to more than two tissues are highlighted in bold.
Fig 3SHAP variable importance plots.
(A) SHAP values for muscle-skeletal tissue. (B) SHAP values for the lung tissue. The plot includes all samples in the training data and the values represent the impact of the gene on model prediction output. SHAP values explain to what extent the feature (gene) contributes to the prediction of the model.
Pathway enrichment analysis of the ventilation predictive marker genes across the six tested tissues.
| Tissue | Top pathways (adjusted p-value < 0.05) | |
|---|---|---|
|
| Adipose Sub. | Viral protein interaction with cytokine and cytokine receptor |
| Cytokine-cytokine receptor interaction | ||
| Amoebiasis | ||
| Chemokine signalling pathway | ||
|
| Liver | Cytokine-cytokine receptor interaction |
| Viral protein interaction with cytokine and cytokine receptor | ||
| Chemokine signalling pathway | ||
| Neuroactive ligand-receptor interaction | ||
|
| Lung | Cytokine-cytokine receptor interaction |
| Viral protein interaction with cytokine and cytokine receptor | ||
| Neuroactive ligand-receptor interaction | ||
| Chemokine signalling pathway | ||
| Taste transduction | ||
|
| Muscle | Cytokine-cytokine receptor interaction |
| Viral protein interaction with cytokine and cytokine receptor | ||
| Chemokine signalling pathway | ||
|
| Nerve Tibial | Viral protein interaction with cytokine and cytokine receptor |
| Cytokine-cytokine receptor interaction | ||
| Neuroactive ligand-receptor interaction | ||
|
| Skin | Cytokine-cytokine receptor interaction |
| Viral protein interaction with cytokine and cytokine receptor | ||
| Chemokine signalling pathway | ||
| Neuroactive ligand-receptor interaction |