| Literature DB >> 34815630 |
Cheng Lu1, Rakesh Shiradkar1, Zaiyi Liu2.
Abstract
In the last decade, the focus of computational pathology research community has shifted from replicating the pathological examination for diagnosis done by pathologists to unlocking and discovering "sub-visual" prognostic image cues from the histopathological image. While we are getting more knowledge and experience in digital pathology, the emerging goal is to integrate other-omics or modalities that will contribute for building a better prognostic assay. In this paper, we provide a brief review of representative works that focus on integrating pathomics with radiomics and genomics for cancer prognosis. It includes: correlation of pathomics and genomics; fusion of pathomics and genomics; fusion of pathomics and radiomics. We also present challenges, potential opportunities, and avenues for future work.Entities:
Keywords: Radiomics; digital pathology; genomics; pathomics; prognosis
Year: 2021 PMID: 34815630 PMCID: PMC8580801 DOI: 10.21147/j.issn.1000-9604.2021.05.03
Source DB: PubMed Journal: Chin J Cancer Res ISSN: 1000-9604 Impact factor: 4.026
Overview of research works on correlating pathomics with genomics
| References | Approach | Data used | Results | |
| CAE, convolutional autoencoder; BCa, breast cancer; H&E, hematoxylin and eosin; IHC, immunohistochemistry; WSI, whole-section image; NSCLC, non-small cell lung cancer; TCGA, The Cancer Genomic Atlas; CNN, convolutional neural networks; TMA, tissue microarrays; CCA, canonical correlation analysis. | ||||
| Ash | 1) CAE was first applied to histology image to extracted features; 2) sparse canonical correlation analysis (CCA) was then applied to the image features and gene expression to find subsets of gene expression values that correlate to subsets of image features. | Three cohorts (BCa, lower grade glioma, and Genotype-Tissue Expression project) with histological images and bulk RNA-sequencing data from paired tissue samples. | 1) Gene sets associated with the structure of the extracellular matrix and cell wall infrastructure, implicating uncharacterized genes in extracellular processes; 2) found sets of genes associated with specific cell types; 3-image features that capture population variation in thyroid and in colon tissues associated with genetic variants. | |
| AbdulJabbar | 1) Train deep learning model to identify cancer cells, lymphocytes, stromal cells and an “other” cell class in H&E-stained images (validated by sequencing data, IHC, and pathologists); 2) define immune hot and cold regions based on lymphocytes percentage (validated by the RNA-seq classification). | WSI, RNA-seq from multiregion TRAcking Cancer Evolution through Therapy (Rx) (TRACERx, n=100); The Leicester Archival Thoracic Tumor Investigatory Cohort (LATTICe-A, n=970). | High geospatial immune variability between tumor regions; Tumors with more than one immune cold region had a higher risk of relapse in lung adenocarcinomas. | |
| Lu | Image features that captured cellular diversity in local region were correlated with bulk RNA expression data. | N=405 NSCLC histology image with bulk RNA expression data from TCGA | CellDiv features were found to be strongly associated with apoptotic signalling and cell differentiation pathways. | |
| Subramanian | Use CCA and sparse CCA to correlate gene expression and histological features describing nucleus shape, texture and intensity. | N=615 BCa samples from TCGA with histology images and gene expression data. | CCA found significant correlation of image features with expression of | |
| Martins | Stroma were segmented from H&E-stained images and quantified by a fraction score. The stroma score and gene expression were correlated using Pearson correlation. | Two independent cohorts of TMAs of ovarian cancer (n=521). | Stroma strongly biases estimate of PTEN expression | |
| Wang | Image features captured tumor morphology were correlated with gene expression data. The strong correlated image features and gene lists/clusters were test for prognostic ability in independent test cohorts. | TCGA Triple-Negative BCa (n=44) with image and gene data. Evaluating the image features in a local TMA cohort (n=143). | Forty-eight pairs of significantly correlated image features and gene clusters were identified; four image features were prognostic in a validation cohort; gene clusters correlated with these four image features were prognostic in public gene datasets. | |
Overview of research works on fusion of pathomics with genomics
| References | Aims | Approach | Data used | Results | |
| CNN, convolutional neural networks; GCN, graph convolutional networks; H&E, hematoxylin and eosin; IHC, immunohistochemistry; CNV, copy number variant; CCRCC, clear cell renal cell carcinoma; CV, cross-validation; TCGA, The Cancer Genomic Atlas; WSI, whole-section image; HR, hazard ratio; 95% CI, 95% confidence interval; ROI, region of interest; HPF, high power field; DNN, deep neural network; SVM, support vector machine; BCa, breast cancer; ER, estrogen receptor. | |||||
| Chen
| Constructing a prognostic models for glioma and CCRCC | Histologic image-based features extracted by CNN, and graph-based image features extracted by GCN, and genomic features learned by Feed Forward Network. All above mentioned data were integrated by a multimodal learning paradigm, which modeled on pairwise feature interactions across modalities by taking the Kronecker product of unimodal feature representations and gating attention mechanism, for prognostication. | Glioma: 1,505 H&E-stained images from 769 patient with 320 genomic features from CNV, mutation status and bulk RNA-Seq expression; 1,251 H&E-stained CCRCC images from 417 patients with 357 genomic features from CNV and RNA-Seq. | C-index=0.826 for Glioma; C-index=0.720 for CCRCC. Both models’ performance are higher than the corresponding unimodal models.
| |
| Shao
| Proposing a framework combining pathological images and multi-modal genomic data for the prognosis of early-stage cancer patients. | 1) A generalized sparse canonical correlation analysis, named ordinal multi-modality feature selection (OMMFS) that captures the intrinsic relationship among multiple views, to identify important features from WSI and multi-modal data; 2) cox proportional hazard model was applied for prognosticating patients. | Kidney renal clear cell carcinoma, kidney renal papillary cell carcinoma, and lung squamous cell
| The identified image and multi-modal features were strongly correlated with patients survival outcome, thus enable effective stratification of patients. | |
| Cheerla
| Constructing a deep learning based pancancer model for predicting survival of patients. | Auto encoder to extract four data modalities (gene expression, miRNA data, clinical data, and WSI) into a single feature vector for each patient, handling missing data through a resilient, multimodal dropout method. | Gene expression (n=10,198), miRNA data (n=10,125), clinical data (n=7,512), and WSI (n=10,914) from TGCA (20 different cancer types). | The pan-cancer prognostic model yielded a C-index of 0.78 overall. | |
| Cheng
| Constructing a prognostic model for clear cell renal cell carcinoma | 1) Nuclear features (nucleus size, shape, texture, and distance to neighbors) were aggregated statistically into patient-level features; 2) gene co-expression network analysis (GCNA) to cluster genes into co-expressed modules (clusters of highly interconnected/correlated genes); 3) lasso-regularized Cox proportional hazards model was used to calculate the risk scores based on the feature from 1 and 2. | WSI, transcriptome, and somatic mutation. N=410 from TCGA. | 1) Patients with high percentage of stromal tissue are related to poor prognosis; 2) risk index is independent of known prognostic factors with HR (95% CI)=3.06 (2.10−4.45) P<0.005.
| |
| Mobadersany
| Predicting the overall survival of patients diagnosed with glioma | Hybrid architecture combing abstracted histologic image features from convolutional layers and genomic variables (IDH mutation status and 1p/19q codeletion) to fully connected layers. When predicting of a newly diagnosed patient, 9 HPFs were sampled from each ROI, and the median risk score was selected to represent that ROI. Second highest risk score among all ROIs of a WSI was used as the final risk score. | N=1,061 WSIs from 769 patients from TCGA. Genomic variables (IDH mutation status and 1p/19q codeletion). | Model achieved prognostic power with c index of 0.754 and correlate with molecular subtypes and histologic grade; the c-index boosted to 0.801 while integrating with genomic variables.
| |
| Ren
| Constructing a survival model for predicting the recurrent of prostate cancer patients with Gleason score 7 | 1) Pathway activities were quantified by pathway scores using RNA sequences; 2) image patches from WSI and pathway scores were integrated into DNN to extract “deep features”; 3) “deep features” and clinical prognostic factors were fed into a Cox model. | N=339 WSIs and RNA (Illumina HiSeq) sequencing data from TCGA. | Integrated model yielded C-index=0.74, and C-index=0.71 for histology image only. | |
| Yuan
| Correlation between histology image features and genomic data; Prognosticating early-stage ER-BCa patients | Cancer cells, stroma cells and lymphocytes were detected from the histology image and the proportions of these cells are used as image features to correlate and combine with genomic data. | N=564 early-stage BCa patients with H&E-stained WSIs and genomic data. | A SVM predictor integrating gene expression and image features achieved 86%±3.0% cross-validation accuracy and improved stratification of the patient cohort. | |
Overview of research works on fusion of pathomics with radiomics
| References | Aims | Approach | Data used | Results | |
| MRI, magnetic resonance imaging; H&E, hematoxylin and eosin; AUC, area under the curve; MRF, magnetic resonance fingerprinting; ADC, apparent diffusion coefficient; ROI, region of interest; CT, computed tomography; NPC, nasopharyngeal cancer; NSCLC, non-small cell lung cancer; nCRT, neoadjuvant chemoradiotherapy; SVM, support vector machine; GBM, glioblastoma; TCIA, The Cancer Imaging Archive; TCGA, The Cancer Genomic Atlas. | |||||
| Penzias
| Identify morphologic basis of radiomic features for prostate cancer risk stratification | Radiomic features from T2W MRI that were associated with low- and high-risk prostate cancer were identified, pathomic features that were best correlated with these features were explored | A single institution cohort of 36 patient studies was used with T2W MRI, post-surgical H&E slides | Gabor features on T2W MRI performance (AUC=0.69) and gland lumen shape features (AUC=0.75) resulted in best classification performance | |
| Shiradkar
| Establish the morphologic basis of MR fingerprinting values on the prostate. | Co-registration of whole mount pathology with MRI, MRF followed by correlation of tissue compartments with MR measurements within prostate cancer, prostatitis and normal prostate ROI | A set of 14 patient studies who underwent MRI, MRF scans followed by radical prostatectomy | Tissue compartments of epithelium, lumen and stroma were significantly correlated with T1, T2 MRF, ADC values (P<0.05) | |
| Alvarez-Jimenez
| Association between radiomic and pathomic features that distinguish adenocarcinoma and squamous cell carcinoma | Pathomic features from digitized H&E slides of lung cancer; radiomic features from lung cancer CT scans; Cross scale associations were computed between radiomic and pathomic features to compare with individual feature classes | N=171 pathology studies, n=101 lung CT studies acquired from publicly available databases. | Cross-scale associated features resulted in better discrimination (AUC=0.78) of NSCLC subtypes compared to using individual feature classes | |
| Zhang
| A prognostic nomogram integrating radiomics and pathology signature to prognosticate NPC | Radiomics from MRI images are combined with a pathomic signature obtained from a deep learning model along with clinical factors to build a multi-scale prognostic nomogram for nasopharyngeal cancer | N=220 NPC patients were divided into n=132 for training, n=88 for internal and external validation. | Multi-scale nomogram resulted in an improved predictor of survival (C-index 0.82 | |
| Vaidya
| Integrating radiomic and pathomic signatures of NSCLC to predict cancer recurrence | Radiomic features from ROIs on lung CT were combined with pathomic eatures from H&E slides of resected tissue to build an integrated supervised machine learning classifier. | 50 NSCLC patients were used for training and 43 patients for external validation | The combined classifier resulted in higher AUC=0.78 compared to radiomic (AUC=0.74) and pathomic classifier (AUC=0.67) alone | |
| Braman
| Deep learning prognostic model for gliomas integrating radiology, pathology, genomics and clinical data | Deep learning model where each modality embeddings are combined via attention gated tensor fusion. A multimodal orthogonalization loss is presented to maximize information from each modality so they are complementary. | 176 patients witn T1w and T2w-FLAIR sequences annotated by 7 radiologists, H& slides and DNA sequencing info | Presented model results in C-index of 0.788±0.067, significantly outperforming (P=0.023) the best performing unimodal (C-index of 0.718±0.064) | |
| Shao
| Integrating radiological and pathological information on pre-treatment info to predict pathological response in rectal cancer | Computational features were derived from rectal pre-treatment MRI and digitized H&E slides, combined to create a radiopathomic signature (RPS) to predict treatment response | N=981 patients who received nCRT along with pretreatment MRI and biopsy whole slide images. | RPS resulted in AUC of 0.84−0.98 at each grade of pathological response with significantly higher performance compared to without integration. | |
| Rathore
| Integrating radiomic and pathomic features for prognosis of GBM | Radoimic features from T1, T1-Gd, T2, T2-FLAIR, were combined with pathomic s of H&E slides to build a SVM classifier for differentiating long and short term survivors | N=107 GBM patients with MRI and pathology images obtained from TCIA and TCGA | AUC=0.74, 0.76 and 0.8 for radiomics, pathomics and combined model in predicting survival outcome | |