| Literature DB >> 23216969 |
Oliver P Günther1, Virginia Chen, Gabriela Cohen Freue, Robert F Balshaw, Scott J Tebbutt, Zsuzsanna Hollander, Mandeep Takhar, W Robert McMaster, Bruce M McManus, Paul A Keown, Raymond T Ng.
Abstract
BACKGROUND: Biomarker panels derived separately from genomic and proteomic data and with a variety of computational methods have demonstrated promising classification performance in various diseases. An open question is how to create effective proteo-genomic panels. The framework of ensemble classifiers has been applied successfully in various analytical domains to combine classifiers so that the performance of the ensemble exceeds the performance of individual classifiers. Using blood-based diagnosis of acute renal allograft rejection as a case study, we address the following question in this paper: Can acute rejection classification performance be improved by combining individual genomic and proteomic classifiers in an ensemble?Entities:
Mesh:
Substances:
Year: 2012 PMID: 23216969 PMCID: PMC3575305 DOI: 10.1186/1471-2105-13-326
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Schematic representation of the biomarker development pipeline for genomic microarray data. The analysis starts with a pre-filtering step applied to the full pre-processed data set (54613 probe sets from the Affymetrix Human Genome U133 Plus 2 GeneChip) on top of the funnel, followed by uni- and multivariate ranking and filtering steps before arriving at a biomarker panel. The numbers on the right indicate the number of features (probe sets) at each step. The biomarker development pipeline for proteomic data looks similar except that data sets are typically smaller and proteomic-specific pre-processing steps need to be applied.
Figure 2Schematic overview of ensemble classifiers. Ensemble classifiers represent a combination of genomic and proteomic classifiers. Individual classifier output is aggregated by either average probability or vote threshold (a modified version of majority vote).
Overview of individual classifier performance and definition of ensembles
| Genomics 1 | LDA | 24 | 0.73 | 0.90 | 0.73 | | ||||
| Genomics 2 | SVM | 50 | 0.82 | 0.95 | 0.96 | | | | ||
| Genomics 3 | RF | 50 | 0.64 | 0.95 | 0.92 | | | |||
| Genomics 4 | EN | 43 | 0.73 | 1.00 | 0.93 | | | | ||
| Genomics 5 | EN | 174 | 0.73 | 1.00 | 0.95 | | | | ||
| Proteomics 1 | SVM | 12 | 0.64 | 0.95 | 0.94 | | ||||
| Proteomics 2 | EN | 10 | 0.64 | 0.81 | 0.90 | | | | ||
| Proteomics 3 | SVM | 33 | 0.55 | 0.81 | 0.83 | | | | ||
| Proteomics 4 | EN | 13 | 0.55 | 0.86 | 0.85 | | | | ||
| Proteomics 5 | SVM | 13 | 0.64 | 0.95 | 0.94 |
Shown is a list of 5 genomic and 5 proteomic classifiers, their individual classification performance and their inclusion into 5 ensembles that are explored in this paper. LDA stands for linear discriminant analysis; EN for Elastic Net (Generalized Linear Model); SVM for Support Vector Machine, and RF for Random Forest. Sensitivity, specificity and area under the ROC [receiver operator characteristics] Curve (AUC) for the individual classifiers were estimated using cross-validation.
Summary of classification performance for the Average Probability aggregation method
| 0.73 | 0.64 | 0.73 | 0.68 | 0.90 | 0.90 | 0.95 | 0.93 | 0.95 | 0.73 | 0.94 | 0.84 | |
| 0.82 | 0.55 | 0.82 | 0.69 | 0.95 | 0.86 | 1.00 | 0.93 | 0.98 | 0.73 | 0.96 | 0.88 | |
| 0.73 | 0.64 | 0.73 | 0.65 | 0.95 | 0.81 | 0.95 | 0.91 | 0.97 | 0.73 | 0.94 | 0.88 | |
| 0.82 | 0.55 | 0.73 | 0.64 | 0.90 | 0.81 | 1.00 | 0.92 | 0.97 | 0.83 | 0.95 | 0.90 | |
| 0.82 | 0.55 | 0.82 | 0.66 | 0.95 | 0.81 | 1.00 | 0.92 | 0.98 | 0.73 | 0.96 | 0.89 | |
Shown is classification performance as measured by sensitivity, specificity and AUC – for the 5 ensembles defined in Table 1 when using the average probability aggregation method. The minimum, maximum and average performances of individual classifiers in the respective ensemble are included in the table for comparison.
Summary of classification performance for the Vote Threshold aggregation method
| 0.82 | 0.64 | 0.73 | 0.68 | 0.86 | 0.90 | 0.95 | 0.93 | 0.89 | 0.73 | 0.94 | 0.84 | |
| 0.91 | 0.55 | 0.82 | 0.69 | 0.76 | 0.86 | 1.00 | 0.93 | 0.89 | 0.73 | 0.96 | 0.88 | |
| 1.00 | 0.64 | 0.73 | 0.65 | 0.76 | 0.81 | 0.95 | 0.91 | 0.90 | 0.73 | 0.94 | 0.88 | |
| 0.91 | 0.55 | 0.73 | 0.64 | 0.81 | 0.81 | 1.00 | 0.92 | 0.95 | 0.83 | 0.95 | 0.90 | |
| 1.00 | 0.55 | 0.82 | 0.66 | 0.62 | 0.81 | 1.00 | 0.92 | 0.90 | 0.73 | 0.96 | 0.89 | |
Shown is classification performance for the 5 ensembles defined in Table 1 when using the vote threshold aggregation method. Similarly to Table 2, individual classifier performances are included for comparison.
Figure 3Comparison of predicted probabilities of acute rejection. Estimated probability of acute rejection (AR) for each of the AR and NR samples as returned by the Genomics 1 and Proteomics 1 classifiers in Ensemble 1, and the Ensemble 1 classifier which represents a combination of Genomics 1 and Proteomics 1. Samples are grouped along the x-axis into 11 AR (left group) and 21 NR (right group). Each point represents a probability of acute rejection for a specific sample. Three color-coded probabilities are shown per sample. Red and black points represent probabilities from Ensemble 1, orange and grey points from Genomics 1 and yellow and brown points from Proteomics 1.
Figure 4Classifier comparison within Ensemble 4. Scatter plot of the predicted posterior probabilities of AR from the Genomics 5 and Proteomics 3 classifier in Ensemble 4. Red points represent 11 AR samples, while black points represent 21 NR samples. Points that fall into yellow areas were classified identically to the genomic and the proteomic classifiers while points in the grey area were classified differently. AR samples are classified correctly when the probability for the corresponding red point is at or above 0.5. NR samples are predicted correctly when the probability is below 0.5.
Figure 5Comparison of all classifier pairs in Ensemble 2. Shown is a matrix of scatter plots of the predicted probabilities of AR for all 10 pairs of classifiers in Ensemble 2 as defined in Table 1. Red and black points indicate AR and NR samples respectively, the interpretation of yellow and grey areas is the same as Figure 4.
Overview of individual classifier performance and definition of ensembles
| mRNA-Classifier1 | EN | 182 | 0.9298 | 0.9737 | 0.8421 | 0.9737 | X | | | X | X | |
| mRNA-Classifier2 | EN | 73 | 0.9123 | 1.0000 | 0.7368 | 0.9709 | | | | | X | |
| mRNA-Classifier3 | EN | 36 | 0.8947 | 0.9737 | 0.7368 | 0.9501 | | | X | | X | |
| mRNA-Classifier4 | LDA | 2 | 0.9298 | 0.9211 | 0.9474 | 0.9640 | | | | | X | |
| mRNA-Classifier5 | RF | 500 | 0.8947 | 0.9737 | 0.7368 | 0.9418 | | | | | X | |
| mRNA-Classifier6 | SVM | 500 | 0.9298 | 0.9474 | 0.8947 | 0.9640 | | | | | X | |
| mRNA-Classifier7 | EN | 43 | 0.9123 | 0.9474 | 0.8421 | 0.9598 | | | | | | X |
| mRNA-Classifier8 | EN | 25 | 0.9298 | 0.9737 | 0.8421 | 0.9612 | | | | | | X |
| mRNA-Classifier9 | EN | 17 | 0.9298 | 0.9737 | 0.8421 | 0.9695 | | | | | | X |
| mRNA-Classifier10 | LDA | 2 | 0.9298 | 0.9211 | 0.9474 | 0.9640 | | | | X | | X |
| mRNA-Classifier11 | RF | 50 | 0.9298 | 0.9474 | 0.8947 | 0.9584 | | | X | | | X |
| mRNA-Classifier12 | SVM | 50 | 0.8947 | 0.9211 | 0.8421 | 0.9557 | | X | | X | | X |
| miRNA-Classifier1 | EN | 66 | 0.8947 | 0.9211 | 0.8421 | 0.9626 | X | | | | X | |
| miRNA-Classifier2 | EN | 21 | 0.9474 | 0.9737 | 0.8947 | 0.9709 | | | X | | X | |
| miRNA-Classifier3 | EN | 8 | 0.9649 | 0.9737 | 0.9474 | 0.9723 | | | | X | X | |
| miRNA-Classifier4 | LDA | 4 | 0.9298 | 0.9211 | 0.9474 | 0.9626 | | | | X | X | |
| miRNA-Classifier5 | RF | 152 | 0.8947 | 0.8947 | 0.8947 | 0.9765 | | | | | X | |
| miRNA-Classifier6 | SVM | 152 | 0.9123 | 0.9474 | 0.8421 | 0.9626 | | | | | X | |
| miRNA-Classifier7 | EN | 36 | 0.9298 | 0.9474 | 0.8947 | 0.9709 | | | | | | X |
| miRNA-Classifier8 | EN | 16 | 0.9298 | 0.9474 | 0.8947 | 0.9848 | | | | | | X |
| miRNA-Classifier9 | EN | 12 | 0.9474 | 0.9737 | 0.8947 | 0.9806 | | | | | | X |
| miRNA-Classifier10 | LDA | 4 | 0.9298 | 0.9211 | 0.9474 | 0.9626 | | | | | | X |
| miRNA-Classifier11 | RF | 50 | 0.9123 | 0.9211 | 0.8947 | 0.9778 | | | | X | | X |
| miRNA-Classifier12 | SVM | 50 | 0.8947 | 0.9211 | 0.8421 | 0.9612 | X | X | X |
Shown is a list of 12 mRNA- and 12 miRNA classifiers, their individual classification performance and their inclusion into 6 ensembles that are explored for classification of tumour vs normal samples. Abbreviations are the same as in Table 1.
Summary of classification performance for the Average Probability aggregation method
| 1.0000 | 0.9211 | 0.9737 | 0.9474 | 0.8421 | 0.8421 | 0.8421 | 0.8421 | 0.9972 | 0.9626 | 0.9737 | 0.9681 | |
| 0.9737 | 0.9211 | 0.9211 | 0.9211 | 0.8421 | 0.8421 | 0.8421 | 0.8421 | 0.9931 | 0.9557 | 0.9612 | 0.9584 | |
| 1.0000 | 0.9211 | 0.9737 | 0.9539 | 0.8421 | 0.7368 | 0.8947 | 0.8421 | 0.9917 | 0.9501 | 0.9709 | 0.9602 | |
| 1.0000 | 0.9211 | 0.9737 | 0.9386 | 0.9474 | 0.8421 | 0.9474 | 0.9035 | 0.9986 | 0.9557 | 0.9778 | 0.9677 | |
| 1.0000 | 0.8947 | 1.0000 | 0.9518 | 0.8947 | 0.7368 | 0.9474 | 0.8553 | 0.9972 | 0.9418 | 0.9765 | 0.9643 | |
| 1.0000 | 0.9211 | 0.9737 | 0.9430 | 0.9474 | 0.8421 | 0.9474 | 0.8816 | 0.9986 | 0.9557 | 0.9848 | 0.9672 | |
Shown is performance for tumour vs normal classification for the 6 ensembles defined in Table 4 using the average probability aggregation method. The minimum, maximum and average performances of individual classifiers in the respective ensemble are included in the table for comparison.
Summary of classification performance for the Vote Threshold aggregation method
| 1.0000 | 0.9211 | 0.9737 | 0.9474 | 0.7368 | 0.8421 | 0.8421 | 0.8421 | 0.9875 | 0.9626 | 0.9737 | 0.9681 | |
| 1.0000 | 0.9211 | 0.9211 | 0.9211 | 0.6842 | 0.8421 | 0.8421 | 0.8421 | 0.9917 | 0.9557 | 0.9612 | 0.9584 | |
| 1.0000 | 0.9211 | 0.9737 | 0.9539 | 0.6842 | 0.7368 | 0.8947 | 0.8421 | 0.9861 | 0.9501 | 0.9709 | 0.9602 | |
| 1.0000 | 0.9211 | 0.9737 | 0.9386 | 0.7368 | 0.8421 | 0.9474 | 0.9035 | 0.9875 | 0.9557 | 0.9778 | 0.9677 | |
| 1.0000 | 0.8947 | 1.0000 | 0.9518 | 0.6316 | 0.7368 | 0.9474 | 0.8553 | 0.9903 | 0.9418 | 0.9765 | 0.9643 | |
| 1.0000 | 0.9211 | 0.9737 | 0.9430 | 0.6842 | 0.8421 | 0.9474 | 0.8816 | 0.9931 | 0.9557 | 0.9848 | 0.9672 | |
Shown is performance for tumour vs normal classification for the 6 ensembles defined in Table 4 using the vote threshold aggregation method. Similarly to Table 5, individual classifier performances are included for comparison.
Figure 6Comparison of predicted probabilities of tumour. Estimated probability of tumour for each of the tumour- and normal samples as returned by all six classifiers in Ensemble D, and the Ensemble D classifier itself. Samples are grouped along the x-axis into 38 tumour (left) and 19 normal (right). Seven color-coded probabilities are shown per sample. Red and black points represent probabilities from Ensemble D, orange and grey crosses from the three mRNA classifiers, and pink and blue stars from the three miRNA classifiers.