| Literature DB >> 31348505 |
Gil Shamai1, Yoav Binenbaum2,3, Ron Slossberg4, Irit Duek5, Ziv Gil3,5, Ron Kimmel4.
Abstract
Importance: Immunohistochemistry (IHC) is the most widely used assay for identification of molecular biomarkers. However, IHC is time consuming and costly, depends on tissue-handling protocols, and relies on pathologists' subjective interpretation. Image analysis by machine learning is gaining ground for various applications in pathology but has not been proposed to replace chemical-based assays for molecular detection. Objective: To assess the prediction feasibility of molecular expression of biomarkers in cancer tissues, relying only on tissue architecture as seen in digitized hematoxylin-eosin (H&E)-stained specimens. Design, Setting, and Participants: This single-institution retrospective diagnostic study assessed the breast cancer tissue microarrays library of patients from Vancouver General Hospital, British Columbia, Canada. The study and analysis were conducted from July 1, 2015, through July 1, 2018. A machine learning method, termed morphological-based molecular profiling (MBMP), was developed. Logistic regression was used to explore correlations between histomorphology and biomarker expression, and a deep convolutional neural network was used to predict the biomarker expression in examined tissues. Main Outcomes and Measures: Positive predictive value (PPV), negative predictive value (NPV), and area under the receiver operating characteristics curve measures of MBMP for assessment of molecular biomarkers.Entities:
Year: 2019 PMID: 31348505 PMCID: PMC6661721 DOI: 10.1001/jamanetworkopen.2019.7700
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Figure 1. Prediction of Estrogen Receptor Positivity Using Deep Convolutional Neural Network
The receiver operating characteristic curves for cohort 1 and cohort 2 were obtained by fitting the computed r score per patient to the estrogen receptor status (a single tissue microarray image or 3 tissue microarray images in cohort 2 and 14 images in cohort 1). The area under the receive operating characteristic (AUC) is indicated for each case.
Performance of MBMP and Comparison With Other Methods
| Source | Data Set | Assay Methods Compared (Antibody) | PPV, % | NPV, % | Sensitivity, % | Specificity, % | Accuracy, % |
|---|---|---|---|---|---|---|---|
| Proposed method | Cohort 1 (01-011) | MBMP and IHC (SP1) | 98 | 68 | 93 | 90 | 92 |
| Proposed method | Cohort 2 (02-008) | MBMP and IHC (SP1) | 97 | 76 | 93 | 87 | 91 |
| Cheang et al,[ | Cohort 2 (02-008) | IHC (SP1) and DCC | 98 | 62 | 86 | 92 | 87 |
| Cheang et al,[ | Cohort 2 (02-008) | IHC (1D5) and DCC | 97 | 51 | 78 | 92 | 81 |
| Cheang et al,[ | Cohort 2 (02-008) | IHC (1D5) and IHC (SP1) | 97 | 78 | 88 | 94 | 90 |
| Barnes et al,[ | Their own data set | LBA and IHC (1D5) | NA | NA | NA | NA | 81 |
| Regan et al,[ | IBCSG | LBA and IHC (1D5) | NA | NA | NA | NA | 88 |
| Harvey et al,[ | San Antonio tumor bank | LBA and IHC (1D5) | NA | NA | NA | NA | 86 |
| Hammond et al,[ | IBCSG premenopausal | Primary institution by LBA/ELISA and central testing by IHC (1D5) | 91 | 63 | NA | NA | 82 |
| Hammond et al,[ | IBCSG postmenopausal | Primary institution by LBA/ELISA and central testing by IHC (1D5) | 93 | 73 | NA | NA | 88 |
Abbreviations: DCC, dextran-coated charcoal; ELISA, enzyme-linked immunosorbent assay; IBCSG, International Breast Cancer Study Group; IHC, immunohistochemistry; LBA, ligand binding assay; MBMP, morphological-based molecular profiling; NA, not applicable; NPV, negative predictive value; PPV, positive predictive value.
Concordance rates between MBMP low and high thresholds (low, 0.25; high, 0.75) and different criterion standard assays for estrogen receptor detection were obtained from Hammond et al[12] and Chean et al.[14] The statistical measures were computed considering the second method as the ground truth.
Figure 2. Amount of Data vs System Performance
For cohort 2 (A-D), the resulting area under the receiver operating characteristics (ROC) curve (AUC) for prediction of Ki-67, estrogen receptor (ER), progesterone receptor (PR), and ERBB2 status used the proposed logistic regression classifier. The AUC is plotted with respect to the biopsy cut size, the number of patients in the cohort, the image resolution, and the number of tissue microarray (TMA) slides per patient. For both cohorts (E), the resulting AUC for prediction of ER status used the proposed deep convolutional neural network. The AUC is plotted with respect to the number of TMA images per patient for cohorts 1 and 2. In cohort 2, 3 TMA images were available for each patient, whereas in cohort 1, 14 TMA images were available per patient.
Figure 3. The Resulting r Scores for Prediction of Estrogen Receptor (ER) Positivity in All Patients
The r scores were obtained using the proposed deep convolutional neural network. The horizontal axis represents the entire cohort population, normalized between 0 and 1, and sorted by the r score. The r scores are stratified by the ER status (A and B), by the percentage of cells expressing ER (only for patients with ER-positive tumor) (C), and by the tumor grade. Cases of high-grade malignant neoplasms for which the system could identify ER-associated morphological signal are boxed (D). PR indicates progesterone receptor.
Figure 4. Hematoxylin-Eosin (H&E)–Stained Images With Corresponding Response Maps
Patients with estrogen receptor (ER)–negative tumors are presented in the 2 left columns, and those with ER-positive tumors in the 2 right columns. Red regions correspond to morphological patterns that contribute to ER-positive prediction. Green regions correspond to morphological patterns that contribute to ER-negative prediction. Higher color intensity corresponds to a stronger contribution. The resulting r score is indicated for each case. The immunohistochemistry (IHC) images were never shown to the system.