| Literature DB >> 28727543 |
Mu Zhou1, Ann Leung1, Sebastian Echegaray1, Andrew Gentles1, Joseph B Shrager1, Kristin C Jensen1, Gerald J Berry1, Sylvia K Plevritis1, Daniel L Rubin1, Sandy Napel1, Olivier Gevaert1.
Abstract
Purpose To create a radiogenomic map linking computed tomographic (CT) image features and gene expression profiles generated by RNA sequencing for patients with non-small cell lung cancer (NSCLC). Materials and Methods A cohort of 113 patients with NSCLC diagnosed between April 2008 and September 2014 who had preoperative CT data and tumor tissue available was studied. For each tumor, a thoracic radiologist recorded 87 semantic image features, selected to reflect radiologic characteristics of nodule shape, margin, texture, tumor environment, and overall lung characteristics. Next, total RNA was extracted from the tissue and analyzed with RNA sequencing technology. Ten highly coexpressed gene clusters, termed metagenes, were identified, validated in publicly available gene-expression cohorts, and correlated with prognosis. Next, a radiogenomics map was built that linked semantic image features to metagenes by using the t statistic and the Spearman correlation metric with multiple testing correction. Results RNA sequencing analysis resulted in 10 metagenes that capture a variety of molecular pathways, including the epidermal growth factor (EGF) pathway. A radiogenomic map was created with 32 statistically significant correlations between semantic image features and metagenes. For example, nodule attenuation and margins are associated with the late cell-cycle genes, and a metagene that represents the EGF pathway was significantly correlated with the presence of ground-glass opacity and irregular nodules or nodules with poorly defined margins. Conclusion Radiogenomic analysis of NSCLC showed multiple associations between semantic image features and metagenes that represented canonical molecular pathways, and it can result in noninvasive identification of molecular properties of NSCLC. Online supplemental material is available for this article.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28727543 PMCID: PMC5749594 DOI: 10.1148/radiol.2017161845
Source DB: PubMed Journal: Radiology ISSN: 0033-8419 Impact factor: 11.105
Figure 1:Overview of radiogenomic analysis to identify associations between, A, semantic features at CT and, B, RNA sequencing data.
Clinical Characteristics of the Cohort
Note.—Unless otherwise indicated, data are number of patients and data in parentheses are percentages. EGFR = epidermal growth factor receptor, KRAS = V-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog, Adeno = adenocarcinoma, SCC = squamous cell carcinoma.
*Data in parentheses are range.
Metagene Characteristics and Homogeneity Scores
Note.—In part A, the homogeneous scores of the cohort and five public datasets were also recorded; metagenes were ranked in a decreasing order in terms of their homogeneity scores. In part B, metagenes were ranked in a decreasing order in terms of their homogeneity scores. The last column in part B shows the average records of the five validation cohorts. TCGA = the Cancer Genome Atlas.
Prognostic Performance of Metagenes Assessed By Meta Z Scores Summarizing the Individual z Scores from PRECOG Analysis and Corresponding Meta P Value
Note.—Negative z scores indicate good prognosis and positive z scores indicate poor prognosis. See (online) for detailed z scores for each individual adeno and squamous cell carcinoma data set. FDR = false discovery rate.
Figure 2:Radiogenomic map revealing 32 statistically significant associations between 35 CT semantic features and top 10 metagenes in NSCLC. Only image features above the variance cutoff of 10% are shown. The order of image features and metagenes is determined by hierarchical clustering of the P values.
non–small cell lung cancer
Associations between Image Features and Metagenes
Note.—The associations were ordered first by metagene and then by P value. P values and false discovery rate values presented statistical significance of the feature correlation. The t statistic is given for binary image features and the Spearmen correlation coefficient for ordinal image features. FDR = false discovery rate.
Figure 3a:Association of the activity of metagene 19 with two distinct image phenotypes. (a) Low activity of metagene 19 is associated with low activity of LRIG1 (green arrow), and high activity of the EGF–EGF receptor (EGFR; red arrow) pathway results in cell proliferation through activating KRAS and PIK3CA. (b) High activity of metagene 19 results in high activity of LRIG1 (red arrow) with inhibition of the EGF–EGF receptor pathway (green arrow), but results in higher occurrence of EGF receptor mutations, which severs the link between LRIG1 and EGF receptor.
Figure 3b:Association of the activity of metagene 19 with two distinct image phenotypes. (a) Low activity of metagene 19 is associated with low activity of LRIG1 (green arrow), and high activity of the EGF–EGF receptor (EGFR; red arrow) pathway results in cell proliferation through activating KRAS and PIK3CA. (b) High activity of metagene 19 results in high activity of LRIG1 (red arrow) with inhibition of the EGF–EGF receptor pathway (green arrow), but results in higher occurrence of EGF receptor mutations, which severs the link between LRIG1 and EGF receptor.
Figure 4:Box-and-whisker plot shows the distribution of the normalized expression of metagene 19 regarding different types of nodule margins at CT. The normalized expression of metagene 19 reflects the activity of the genes in this metagene.