| Literature DB >> 35619722 |
Myrthe van Baardwijk1,2,3,4, Iacopo Cristoferi1,2,3, Jie Ju1, Hilal Varol1,3, Robert C Minnee2,3, Marlies E J Reinders3,5, Yunlei Li1, Andrew P Stubbs1, Marian C Clahsen-van Groningen1,3,6.
Abstract
Introduction: A decentralized and multi-platform-compatible molecular diagnostic tool for kidney transplant biopsies could improve the dissemination and exploitation of this technology, increasing its clinical impact. As a first step towards this molecular diagnostic tool, we developed and validated a classifier using the genes of the Banff-Human Organ Transplant (B-HOT) panel extracted from a historical Molecular Microscope® Diagnostic system microarray dataset. Furthermore, we evaluated the discriminative power of the B-HOT panel in a clinical scenario. Materials andEntities:
Keywords: bioinformatics; diagnosis; gene expression; graft rejection; kidney transplantation; machine learning; pathology; transcriptomics
Mesh:
Substances:
Year: 2022 PMID: 35619722 PMCID: PMC9128066 DOI: 10.3389/fimmu.2022.841519
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 8.786
Figure 1Overview of Data Collection and Preprocessing. Data has been retrieved from the GEO dataset repository. Probes have been matched using raw annotation files and then aggregated based on the median values and using robustscale per feature. Finally, the genes from the Banff Human Organ Transplant B-HOT panel have been filtered. B-HOT, Banff-Human Organ Transplant; GEO, Gene Express Omnibus; KNN, K-nearest neighbors.
Overview of datasets composition.
| Dataset | NR | ABMR | TCMR |
|---|---|---|---|
|
| 774 | 326 | 81 |
|
| 60 | 15 | 2 |
ABMR, Antibody-Mediated Rejection; NR, Non-Rejection; TCMR, T-Cell-Mediated Rejection.
Demographics and clinical characteristics GSE data sets.
| Patient Demographics | GSE98320 ( | GSE129166 ( | |
|---|---|---|---|
| Mean Recipient Age (range) | 52 (18-86) | 50.2 (2.7-78.5) | |
| Recipient sex (male/female) | 559/486 (53%/47%) | 224/141 (61.4%/38.6%) | |
| Ethnicity | European | 522 (50%) | 318 (87.8%) |
| African | 83 (8%) | 6 (1.7%) | |
| Other/Not available | 440 (42%) | 38 (10.5%) | |
| Mean donor age (range) | 43 (0.03-85) | 50.6 (5.0-91.0) | |
| Donor sex | 512/533 (49%/51%) | 177/180 (50.4%/49.6%) | |
| Donor type (deceased/living) | 692/353 (66%/44%) | 278/83 (77.0%-23.0%) | |
|
|
|
| |
| Median time of biopsy after transplant in days (range) | 591 (1-11,453) | 908 (6-12,564) | |
| Early biopsies (<1 year) | 507 (42%) | 207 (53.5%) | |
| Late biopsies (>= 1 year) | 701 (58%) | 180 (46.5%) | |
Adapted from (8, 9).
Figure 2Overview of model development workflow. B-HOT, Banff-Human Organ Transplant; CV, Cross-Validation; KNN, K-nearest neighbors.
Figure 3Principal component analysis and Batch Correction. (A, B). Principal component analysis biplots of samples labeled based on their origin dataset before (A) and after (B) batch effect removal using ComBat. (C, D). Principal component analysis biplots of samples labeled based on their diagnosis before (C) and after (D) batch effect removal using ComBat. ABMR, Antibody-Mediated Rejection; NR, Non-Rejection; PC, Principal Component; TCMR, T-Cell-Mediated Rejection.
Overview of nested cross-validation performances of all models.
| Model | Performance | NR | ABMR | TCMR | |
|---|---|---|---|---|---|
|
| AUC | 0.980 | 0.976 | 0.995 | |
| Precision | 0.924 | 0.893 | 0.900 | ||
| Recall | 0.960 | 0.835 | 0.779 | ||
| F1-score | 0.941 | 0.861 | 0.827 | ||
| Average Accuracy | 0.913 | ||||
|
| AUC | 0.979 | 0.971 | 0.987 | |
| Precision | 0.926 | 0.891 | 0.813 | ||
| Recall | 0.963 | 0.825 | 0.728 | ||
| F1-score | 0.944 | 0.855 | 0.759 | ||
| Average Accuracy | 0.909 | ||||
|
| AUC | 0.980 | 0.976 | 0.994 | |
| Precision | 0.936 | 0.886 | 0.921 | ||
| Recall | 0.963 | 0.862 | 0.767 | ||
| F1-score | 0.949 | 0.874 | 0.828 | ||
| Average Accuracy | 0.921 | ||||
ABMR, Antibody-Mediated Rejection; AUC, Area under the ROC curve; B-HOT, Banff-Human Organ Transplant; NR, Non-Rejection; TCMR, T-Cell-Mediated Rejection.
Figure 4Overview of the Most-Predictive Features of the three Random Forest Models. (A) The twenty most predictive features for classification within GSE98320 of the B-HOT Model. (B) The twenty most predictive features for classification within GSE98320 of the Forward Sequential Feature Selected Model. (C) The twenty most predictive features for classification within GSE98320 of B-HOT+ Model. B-HOT, Banff-Human Organ Transplant.
Figure 5B-HOT+ Model Performances. (A) ROC curve of cross-validation within GSE98320 of the B-HOT+ model. (B) Precision/Recall curve of cross-validation within GSE98320 of the B-HOT+ model. ABMR, Antibody-Mediated Rejection; AP, Average Precision; AUC, Area Under the ROC Curve; B-HOT, Banff-Human Organ Transplant; NR, Non-Rejection; ROC, Receiver Operating Characteristic; TCMR, T-Cell-Mediated Rejection.
Figure 6Validation Set Performances. (A) ROC curve of independent validation within GSE129166 of the B-HOT+ model. (B) Precision/Recall curve of independent validation within GSE129166 of the B-HOT+ model. ABMR, Antibody-Mediated Rejection; AP, Average Precision; AUC, Area Under the ROC Curve; B-HOT, Banff-Human Organ Transplant; NR, Non-Rejection; ROC, Receiver operating characteristic.
Overview of B-HOT+ Model Validation Performances.
| Model | Performance | NR | ABMR | TCMR |
|---|---|---|---|---|
|
| AUC | 0.965 | 0.982 | X* |
| Precision | 0.65 | 1.000 | X* | |
| Recall | 0.867 | 0.862 | X* | |
| F1-score | 0.742 | 0.938 | X* |
*Insufficient sample size.
ABMR, Antibody-Mediated Rejection; AUC, Area under the ROC curve; B-HOT, Banff-Human Organ Transplant; NR, Non-Rejection; TCMR, T-Cell-Mediated Rejection.