| Literature DB >> 28594877 |
Samuel A Danziger1,2, Leslie R Miller1, Karanbir Singh1, G Adam Whitney3, Elaine R Peskind4,5, Ge Li5,6, Robert J Lipshutz1,3, John D Aitchison1,2, Jennifer J Smith1,2,3.
Abstract
We have established proof of principle for the Indicator Cell Assay Platform™ (iCAP™), a broadly applicable tool for blood-based diagnostics that uses specifically-selected, standardized cells as biosensors, relying on their innate ability to integrate and respond to diverse signals present in patients' blood. To develop an assay, indicator cells are exposed in vitro to serum from case or control subjects and their global differential response patterns are used to train reliable, disease classifiers based on a small number of features. In a feasibility study, the iCAP detected pre-symptomatic disease in a murine model of amyotrophic lateral sclerosis (ALS) with 94% accuracy (p-Value = 3.81E-6) and correctly identified samples from a murine Huntington's disease model as non-carriers of ALS. Beyond the mouse model, in a preliminary human disease study, the iCAP detected early stage Alzheimer's disease with 72% cross-validated accuracy (p-Value = 3.10E-3). For both assays, iCAP features were enriched for disease-related genes, supporting the assay's relevance for disease research.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28594877 PMCID: PMC5464608 DOI: 10.1371/journal.pone.0178608
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Development of an indicator cell assay.
For each disease indication, the global differential gene expression pattern of the indicator cells is measured in response to serum from normal and diseased subjects, and is used to identify a reliable disease classifier using a small number of features. To deploy the assay, the cell-based components can be automated and miniaturized, and the expression of classifier genes can be measured using a targeted, cost-effective and high-throughput readout to classify new subjects.
Fig 2Classifier pipelines.
I. 47 training samples or 12 test samples passed through the pre-processing pipeline, which resulted in normalized gene intensity scores. II. The classifiers were trained. 12 non-carrier (non-car.) samples in the 47 training examples were used to construct a linear model of gene expression based on control genes, which was used to convert all gene intensities into log2 gene expression ratios. These ratios were used to identify differentially expressed gene sets and ultimately select 64 gene sets or 134 genes that could differentiate between non-carrier and pre-symptomatic carrier mice using SVMs. III. The classifiers were tested. Gene intensities from 12 pre-processed but de-identified examples were converted to expression ratios using the linear model trained in II, and data sets composed of the 64 gene sets or 134 genes identified in II were extracted. The de-identified examples described by these data sets were classified as carrier or non-carrier using the SVMs trained in II. The pipeline for Classifier 3 was identical to that of Classifier 1 except that a different GSA score threshold was used (|GSA| ≥ 1 instead of GSA ≤ -1), resulting in selection of 106 gene sets instead of 64. For classifier testing with Huntington’s samples, classifiers were trained with all 59 normal and disease samples (not shown) and tested against 6 Huntington’s samples. For each classifier configuration, the fraction of Huntington’s samples that were correctly predicted as non-carriers of the ALS mutation is shown as ‘Hunt’ tally.
Blind accuracies of ALS classifiers.
| actual class of | predicted class of blind sample | |||
|---|---|---|---|---|
| blind sample | Classifier 1 | Classifier 2 | Classifier 3 | |
| 1 | Non-carrier | Non-carrier | Non-carrier | |
| 2 | Non-carrier | Non-carrier | Non-carrier | Non-carrier |
| 3 | Non-carrier | Non-carrier | Non-carrier | Non-carrier |
| 4 | Carrier | Carrier | Carrier | |
| 5 | Carrier | Carrier | Carrier | Carrier |
| 6 | Carrier | Carrier | Carrier | Carrier |
| 7 | Non-carrier | Non-carrier | ||
| 8 | Non-carrier | Non-carrier | Non-carrier | |
| 9 | Non-carrier | Non-carrier | Non-carrier | Non-carrier |
| 10 | Carrier | Carrier | Carrier | Carrier |
| 11 | Carrier | Carrier | Carrier | Carrier |
| 12 | Carrier | Carrier | Carrier | Carrier |
| performance metrics | Classifier 1 | Classifier 2 | Classifier 3 | |
| Accuracy | 0.83 | 0.92 | 0.83 | |
| p-Value | 3.17E-03 | 2.44E-04 | 3.17E-03 | |
| q-value | 5.28E-03 | 1.22E-03 | 5.28E-03 | |
| Sensitivity | 1 | 1 | 0.83 | |
| Specificity | 0.66 | 0.83 | 0.83 | |
| MCC | 0.71 | 0.85 | 0.67 | |
| MCC p-Value | 1.01E-02 | 5.37E-04 | 1.79E-02 | |
| AUC p-Value | 1.14E-02 | 2.71E-03 | 1.12E-02 | |
aClassifier based on average expression of 64 down-regulated gene sets.
bClassifier based on expression of 134 genes selected from down-regulated gene sets.
cClassifier based on average expression of 106 up- and down-regulated gene sets.
dClassifier 1 was a true double-blind prediction, whereas Classifiers 2 and 3 were based on post-hoc analysis.
eq-value (Benjamin Hochberg False discovery rate).
fMatthews Correlation Coefficient.
gArea under a receiver operator characteristic (ROC) curve (varying number of votes needed for disease classification).
ALS classifier accuracies with 6 Huntington's disease samples included in test set.
| Metric | Classifier 1 | Classifier 2 | Classifier 3 |
|---|---|---|---|
| Accuracy | 0.89 | 0.94 | 0.83 |
| p-Value | 7.25E-05 | 3.81E-06 | 6.56E-04 |
| Sensitivity | 1 | 1 | 0.83 |
| Specificity | 0.83 | 0.92 | 0.83 |
| FPR | 0.17 | 0.08 | 0.17 |
| FDR | 0.25 | 0.14 | 0.29 |
| F1 | 0.86 | 0.92 | 0.77 |
| MCC | 0.79 | 0.89 | 0.64 |
| MCC p-Value | 9.42E-05 | 9.71E-07 | 3.87E-03 |
| AUC | 0.94 | 0.95 | 0.92 |
| AUC p-Value | 4.22E-04 | 1.11E-04 | 2.13E-03 |
aClassifier based on average expression of 64 down-regulated gene sets.
bClassifier based on expression of 134 genes selected from down-regulated gene sets.
cClassifier based on average expression of 106 up- and down-regulated gene sets.
dClassifier 1 was a true double-blind prediction, whereas Classifiers 2 and 3 were based on post-hoc analysis.
efalse positive rate
ffalse discovery rate
gF1
hMatthew's correlation coefficient
iarea under the curve are calculated from the receiver operator characteristic.
Fig 3The 134 genes from the ALS gene expression classifier.
Genes that are either transcriptionally responsive to ER stress [32] or directly targeted by TFs ATF4 (Atf4) and CHOP (Ddit3) during ER stress [32] are shown as nodes (depicted using Cytoscape [33]). Node color indicates mean log2 differential expression level in the indicator cell assay (disease versus normal) ranging from –0.44 (green) to 0.1 (pink). Node shape indicates genes that are transcriptionally responsive to ER stress (oval) or are not (hexagon) [32]. 133 of the 134 genes are known to have human orthologs and underlined genes co-occur with “amyotrophic lateral sclerosis” in titles or abstracts of articles in PubMed (on 01/15/15).
Description of patient classes for AD iCAP.
| class | N | description | mean Abeta-42 (pg/mL) | Abeta-42 range (pg/mL) | average age (y) | male: female | average CDR |
|---|---|---|---|---|---|---|---|
| preclinical AD | 20 | cognitively normal; decreased CSF Abeta-42 | 129 | 67–186 | 73 | 11:9 | 0 |
| early symptomatic | 18 | clinical MCI or early AD; decreased CSF Abeta-42 | 139 | 45–189 | 72 | 9:9 | 0.53 |
| normal | 20 | cognitively normal; no decrease in CSF Abeta-42 | 598 | 478–785 | 72 | 11:9 | 0 |
a Two disease classes were merged into a single early AD class.
b,c Patient samples have CSF Abeta-42 ≤ 192 pg/mL and > 192 pg/mL, respectively. b and c are ranked in the lowest and highest percentiles for CSF Abeta-42 levels in their respective classes in the biorepository. CSF amyloid-b (1–42) fragment is abbreviated as Abeta-42.
dNumber of samples used to develop the classifier.
eAverage clinical dementia rating.
Leave-one-out cross-validated Alzheimer's disease classifier.
| Classifier Dataset | Accuracy | p-Value | Sensitivity | Specificity | MCC | MCC p-Values | AUC | AUC p-Values |
|---|---|---|---|---|---|---|---|---|
| iCAP | 0.724 | 1.53E-04 | 0.838 | 0.524 | 0.382 | 3.10E-03 | 0.640 | 2.90E-02 |
| APOE | 0.672 | 2.68E-03 | 0.973 | 0.143 | 0.220 | 9.75E-02 | 0.564 | 4.12E-01 |
| iCAP.APOE | 0.690 | 1.12E-03 | 0.757 | 0.571 | 0.328 | 1.19E-02 | 0.689 | 4.68E-03 |
N = 58 samples, 38 MCI and pre-MCI Disease samples and 20 Normal Samples. No multiple testing correction was performed. MCC, Matthews correlation coefficient; AUC, area under the ROC curve.