| Literature DB >> 34093800 |
Fayang Ma1,2, Kyle Laster2, Wenna Nie2, Fangfang Liu1,2, Dong Joon Kim1,2, Mee-Hyun Lee1,2,3, Ruihua Bai4, Rendong Yang5, Kangdong Liu1,2,4, Zigang Dong1,2,4.
Abstract
Esophageal Squamous Cell Carcinoma (ESCC) is the predominant type of Esophageal Cancer (EC), accounting for nearly 88% of EC incidents worldwide. Importantly, it is also a life-threatening cancer for patients diagnosed in advanced stages, with only a 20% 5-year survival rate due to a limited number of actionable targets and therapeutic options. Increasing evidence has shown that inter-tumor and intra-tumor heterogeneity are widely distributed across ESCC tumor tissues. In our work, multi-omics data from ESCC cell lines, tumor tissue, normal tissue and Patient-Derived Xenograft (PDX) tissues were analyzed to investigate the heterogeneity among ESCC samples at the DNA, RNA, and protein level. We identified enrichment of ECM-receptor interaction and Focal adhesion pathways from the subset of protein-coding genes with non-silent mutations in ESCC patients. We also found that TP53, TTN, KMT2D, CSMD3, DNAH5, MUC16 and DST are the most frequently mutated genes in ESCC patient samples. Out of the identified genes, TP53 is the most frequently mutated, with 84 distinct non-silent mutation variants. We observed that p.R248Q, p.R175G/H, and p.R273C/H are the most common TP53 mutation variants. The diversity of TP53 mutations reveal its importance in ESCC progression and may also provide promising targets for precision therapeutics. Additionally, we identified the Olfactory transduction as the top signaling pathway, enriched from genes uniquely expressed in The Cancer Genome Atlas (TCGA)-ESCC patient tumor tissues, which may provide implications for the exact roles of the corresponding genes in ESCC. Cyclic nucleotide-gated channel subunit beta 1(CNGB1), a gene belonging to the Olfactory transduction pathway, was found exclusively overexpressed in ESCC. Expression of CNGB1 could serve as a marker, indicating potential diagnostic or therapeutic value. Finally, we investigated heterogeneity in the context of the ESCC PDX model, which is an emerging tool used to predict drug response and recapitulate tumor behavior in vivo. We observed trans-species heterogeneity in as high as 75% of the identified proteins, indicating that the ambiguity of proteins should be addressed by specific strategies to avoid drawing false conclusions. The identification and characterization of gene mutation and expression heterogeneity across different ESCC datasets, including various novel TP53 mutations, ECM-receptor interaction, Focal adhesion, and Olfactory transduction pathways (CNGB1), provide researchers with evidence and implications for accurate research and precision therapeutic development. © The author(s).Entities:
Keywords: Bioinformatics.; Esophageal Squamous Cell Carcinoma; Genetic Heterogeneity; Heterograft; Proteomics; Transcriptome
Year: 2021 PMID: 34093800 PMCID: PMC8176252 DOI: 10.7150/jca.52286
Source DB: PubMed Journal: J Cancer ISSN: 1837-9664 Impact factor: 4.207
Figure 1Investigation of mutation and expression heterogeneity in CCLE-ESCC cell lines and TCGA-ESCC patient tissue. (A and B) Heatmaps illustrating gene mutation heterogeneity as a function of matching dissimilarity in CCLE-ESCC cell lines (N=10) and TCGA-ESCC patient tissues (N=80), respectively (CCLE-ESCC : 4725 genes, TCGA-ESCC: 8510 genes). Darker hues indicate increased dissimilarity among ESCC samples. (C and D) Heatmaps illustrating expression heterogeneity as a function of transcriptome correlation distance in CCLE-ESCC cell lines (N=10) and TCGA-ESCC patient tissues (N=80), respectively (CCLE-ESCC: 15258 genes, TCGA-ESCC: 18335 genes).
Figure 2The landscape of non-silent mutation heterogeneity in ESCC. (A and B) The grey color indicates the number of genes with silent mutations, and the red color indicates genes with non-silent mutations across the ESCC cell lines (A) and patient tissues (B). (C and D) The top ten significantly enriched KEGG signaling pathways from genes harboring non-silent mutations in ESCC cell lines and patient tissues, respectively. (E and F) A positive correlation was observed between the number of genes with silent mutations and genes with non-silent mutations in ESCC cell lines and patient tissues, respectively. (G) The number of total mutated genes and unique mutated genes across the 12 ESCC cancer cell lines. (H) The frequencies of non-silent mutated genes across ESCC patient tumor tissues.
Figure 3The distribution of (A) TP53 is the only mutated gene shared by all 12 ESCC cell lines. (B) A total of 16 TP53 non-silent mutations were identified across the 12 ESCC cell lines. On average, more than one TP53 mutation per cell line was observed. (C) A total of 90 TP53 non-silent mutations are distributed across 90% of ESCC patient tumor tissues (a complete list in the supplemental data). (D) Overlap diagram illustrating commonly shared TP53 non-silent mutations between ESCC patients and ESCC cell lines.
Figure 4Heterogeneity illustrated by integrative analysis of gene mutation and protein-coding transcriptome. (A) 501 mutant proteins specific to kyse30 were revealed after overlapping total proteins with the total non-silent mutated genes in kyse30. (B) All mutated protein-coding genes were overlapped with all proteins expressed in the 10 ESCC cell lines. A subset of 3786 mutant proteins, a subset of 11472 proteins without mutations, and a subset of mutated but untranscribed genes were identified. (C) A subset of mutated genes commonly expressed in all the ten ESCC cell lines were observed. (D) Only one mutant protein was identified in the intersection of kyse30-specific proteins and kyse30-specific mutated genes. (E) All the unique proteins for each individual ESCC cell line were overlapped with all mutated genes, producing a subset of 135 mutant proteins unique across the 10 ESCC cell lines.
Figure 5A subset of proteins, termed Possibly of Murine Origin (PMO) proteins, was identified in the PDX model. (A) The overlapping diagram demonstrated a subset of 3963 proteins was commonly shared by both origins, and the subsets of identified proteins with clear human_origin or mouse_origin in PDX tissue. (B) The top ten signaling pathways with the most significance were enriched from the subset of PMO proteins.
Figure 6A subset of 2574 uniquely expressed genes specific to TCGA-ESCC patient tumor tissues was identified. (A) Diagram illustrating the unique and shared subsets of gene expression after overlapping the four datasets from TCGA-ESCC Tissues, CCLE-ESCC Cell Lines, Protein Atlas-Normal Esophagus Tissue and PDX-ESCC Tissues. (B) The top ten significant signaling pathways enriched from the 2574 TCGA-ESCC Unique genes. (C) The corresponding PPI and networks of signaling pathways derived from TCGA-ESCC Unique genes. (D) 293 genes residing within the Olfactory transduction pathway, out of which 290 genes belong to the Olfactory receptor family. (E) The expression landscape of CNGB1 across different Tumor-Normal tissue pairs. (F) The expression of CNGB1 in ESCA and HNSC. (G) The comparison of CNGB1 expression between 80 ESCC samples and 11 normal esophageal tissues. Significance was assessed using the Mann-Whitney test. * indicates p<.05, *** indicates p<.001.