| Literature DB >> 35140267 |
Tian Xia1, Ashnil Kumar2, Michael Fulham3, Dagan Feng4, Yue Wang5, Eun Young Kim6, Younhyun Jung7, Jinman Kim4.
Abstract
Radiogenomics relationships (RRs) aims to identify statistically significant correlations between medical image features and molecular characteristics from analysing tissue samples. Previous radiogenomics studies mainly relied on a single category of image feature extraction techniques (ETs); these are (i) handcrafted ETs that encompass visual imaging characteristics, curated from knowledge of human experts and, (ii) deep ETs that quantify abstract-level imaging characteristics from large data. Prior studies therefore failed to leverage the complementary information that are accessible from fusing the ETs. In this study, we propose a fused feature signature (FFSig): a selection of image features from handcrafted and deep ETs (e.g., transfer learning and fine-tuning of deep learning models). We evaluated the FFSig's ability to better represent RRs compared to individual ET approaches with two public datasets: the first dataset was used to build the FFSig using 89 patients with non-small cell lung cancer (NSCLC) comprising of gene expression data and CT images of the thorax and the upper abdomen for each patient; the second NSCLC dataset comprising of 117 patients with CT images and RNA-Seq data and was used as the validation set. Our results show that our FFSig encoded complementary imaging characteristics of tumours and identified more RRs with a broader range of genes that are related to important biological functions such as tumourigenesis. We suggest that the FFSig has the potential to identify important RRs that may assist cancer diagnosis and treatment in the future.Entities:
Mesh:
Year: 2022 PMID: 35140267 PMCID: PMC8828715 DOI: 10.1038/s41598-022-06085-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The workflow for generating the FFSig and the identification of RRs with genes and GO terms. The workflow was implemented using the NRG-H dataset and validated on the NRG-S dataset.
Figure 2Heatmap of the FFSig across patient clusters with corresponding T stage from the NRG-H dataset. The heatmap was generated using MATLAB, version 2019b, URL: https://www.mathworks.com/products/matlab.html.
Figure 3The distribution of RRs between feature signatures and: (a) gene expression value of the processed genes (n = 11,318) from the NRG-H dataset. (b) Gene expression value of the processed genes (n = 2993) from the NRG-S dataset.
Figure 4Venn diagram shows the distribution of unique genes that were associated with FFSig, HCSig, TLSig, and FTSig: (a) generated using the NRG-H dataset. (b) generated using the NRG-S dataset.
Two-sample t tests that assess the strengths of all RRs constructed using the FFSig with HCSig, TLSig and FTSig, in both statistical directions on the NRG-H dataset.
| Feature signature | |||
|---|---|---|---|
| FFSig | p > 0.2 | p > 0.7 | p > 0.3 |
| FFSig | p < 1 × 10–3 | p < 1 × 10–2 | p > 0.6 |
Figure 5The distribution of RRs between the FFSig with the key genetic biomarker of EGFR from the NRG-H dataset, in comparison to HCSig, TLSig and FTSig.
Two-sample t tests that assess the strengths of all RRs constructed using the FFSig with HCSig and FTSig, in both statistical directions on the NRG-S dataset.
| Feature signature | HCSig | FTSig |
|---|---|---|
| FFSig | p > 0.8 | p > 0.2 |
| FFSig | p > 0.3 | p > 0.08 |
Figure 6Venn diagram shows the distribution of GO terms that were associated with image feature signatures of FFSig, TLSig, FTSig and HCSig: (a) generated using the NRG-H dataset. (b) Generated using the NRG-S dataset.
The GO terms that have RRs with FFSig, HCSig, TLSig and FTSig with positive and negative associations from the NRG-H dataset.
| NES | NES | ||
|---|---|---|---|
| Organelle lumen | 2.43 | Extracellular region | 1.74 |
| Nuclear lumen | 2.22 | Regulation of transferase activity | 0.60 |
| Membrane enclosed lumen | 2.19 | Transferase activity transferring phosphorus containing groups | 0.58 |
| Glycoprotein biosynthetic process | 1.98 | Protein kinase activity | 0.58 |
| Macromolecule biosynthetic process | 1.94 | Stress activated protein kinase signalling pathway | 0.58 |
| Response to virus | − 1.98 | Carbohydrate metabolic process | − 0.99 |
| Cell cell signaling | − 1.98 | Phosphoric monoester hydrolase activity | − 0.99 |
| Response to other organism | − 2.00 | Phosphoric ester hydrolase activity | − 1.01 |
| Anatomical structure morphogenesis | − 2.01 | Alcohol metabolic process | − 1.02 |
| Response to biotic stimulus | − 2.01 | Hydrolase activity acting on ester bonds | − 1.02 |
| Cell fraction | 2.17 | Anatomical structure morphogenesis | 1.85 |
| Membrane fraction | 2.03 | Enzyme regulator activity | 1.80 |
| Phosphoric ester hydrolase activity | 2.02 | Enzyme activator activity | 1.79 |
| Soluble fraction | 1.96 | Enzyme linked receptor protein signalling pathway | 1.77 |
| Insoluble fraction | 1.96 | Membrane fraction | 1.73 |
| Homophilic cell adhesion | − 1.66 | Extracellular region part | − 1.23 |
| Sulfuric ester hydrolase activity | − 1.67 | Extracellular space | − 1.23 |
| Nervous system development | − 1.68 | Phosphorylation | − 1.24 |
| Regulation of anatomical structure morphogenesis | − 1.68 | Lipase activity | − 1.25 |
| Cell surface | − 1.99 | Female pregnancy | − 1.27 |
The GO terms that have the highest NES and exclusively RRs with FFSig (left) and the GO terms that are restricted to have RRs with FFSig (right), experimented on the NRG-H dataset.
| FFSig exclusive | NES | FFSig restricted | NES |
|---|---|---|---|
| Transmembrane receptor protein kinase activity | 1.61 | Soluble fraction | 1.96 |
| Protein tyrosine kinase activity | 1.60 | Insoluble fraction | 1.96 |
| Transmembrane receptor protein tyrosine kinase activity | 1.53 | Enzyme regulator activity | 1.80 |
| Generation of precursor metabolic and energy | 1.47 | Enzyme activator activity | 1.79 |
| Phospholipid metabolic process | 1.42 | Molecular adaptor activity | 1.73 |
| RNA processing | − 1.85 | Generation of neurons | − 1.66 |
| Organ morphogenesis | − 1.95 | Homophilic cell adhesion | − 1.67 |
| Response to virus | − 1.98 | Sulfuric ester hydrolase activity | − 1.67 |
| Response to other organism | − 2.00 | Regulation of anatomical structure morphogenesis | − 1.68 |
| Response to biotic stimulus | − 2.01 | Cell surface | − 1.99 |
The GO terms that have RRs with FFSig, HCSig and FTSig with positive and negative associations from the NRG-S dataset.
| FFSig | NES | HCSig | NES | FTSig | NES |
|---|---|---|---|---|---|
| Perinuclear region of cytoplasm | 2.62 | Sensory perception | 1.80 | Intracellular protein transport | 2.62 |
| Nervous system development | 2.58 | Monooxygenase activity | 1.78 | Establishment of protein localisation | 2.61 |
| Membrane organisation and biogenesis | 2.45 | Oxygen binding | 1.78 | Macromolecule localisation | 2.61 |
| Intercellular junction | 2.05 | Electron transport (GO 0006118) | 1.75 | Protein localisation | 2.54 |
| Tight junction | 1.96 | Neurological system process | 1.70 | Protein transport | 2.52 |
| Kinase activity | − 2.00 | Second messenger mediated signalling | − 0.77 | Soluble fraction | − 1.61 |
| Endoplasmic reticulum | − 2.11 | Establishment and or maintenance of cell polarity | − 0.77 | Organelle lumen | − 1.62 |
| Nuclear lumen | − 2.19 | Regulation of catalytic activity | − 0.77 | Nucleolus | − 1.65 |
| Organelle lumen | − 2.84 | cAMP mediated signalling | − 0.77 | Nuclear lumen | − 1.67 |
| Membrane enclosed lumen | − 3.06 | G protein signalling adenylate cyclase activating pathway | − 0.77 | Membrane enclosed lumen | − 1.71 |
The GO terms that have the highest NES and exclusively RRs with FFSig (left) and the GO terms that are restricted to have RRs with FFSig (right), experimented on the NRG-S dataset.
| FFSig exclusive | NES | FFSig restricted | NES |
|---|---|---|---|
| Perinuclear region of cytoplasm | 2.62 | Positive regulation of metabolic process | 1.93 |
| Membrane organisation and biogenesis | 2.45 | Positive regulation of cellular metabolic process | 1.90 |
| Intercellular junction | 2.05 | Neurite development | 1.90 |
| Tight junction | 1.96 | Steroid hormone receptor signalling pathway | 1.89 |
| Apical junction complex | 1.94 | Cellular lipid catabolic process | 1.88 |
| Serine type peptidase activity | − 1.57 | cAMP mediated signalling | − 0.77 |
| Serine hydrolase activity | − 1.58 | G Protein signalling adenylate cyclase activating pathway | − 0.77 |
| Serine type endopeptidase activity | − 1.60 | Intrinsic to Golgi membrane | − 0.88 |
| Peptidase activity | − 1.75 | Intrinsic to organelle membrane | − 0.93 |
| Endopeptidase activity | − 1.76 | Integral to organelle membrane | − 0.93 |