| Literature DB >> 36105102 |
Juxuan Zhang1, Jiaxing Deng1, Xiao Feng2, Yilong Tan1, Xin Li1, Yixin Liu3, Mengyue Li1, Haitao Qi1, Lefan Tang1, Qingwei Meng2, Haidan Yan4, Lishuang Qi1.
Abstract
Background: Lung cancer is a complex disease composed of neuroendocrine (NE) and non-NE tumors. Accurate diagnosis of lung cancer is essential in guiding therapeutic management. Several transcriptional signatures have been reported to distinguish between adenocarcinoma (ADC) and squamous cell carcinoma (SCC) belonging to non-NE tumors. This study aims to identify a transcriptional panel that could distinguish the histological subtypes of NE tumors to complement the morphology-based classification of an individual.Entities:
Keywords: histological classification; individualization; lung neuroendocrine tumors; relative gene expression orderings; transcriptional signatures
Year: 2022 PMID: 36105102 PMCID: PMC9465419 DOI: 10.3389/fgene.2022.944167
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Datasets and molecular landscape of lung cancer. (A) 25 lung cancer datasets were used in this study. (B) heatmaps of the molecular landscape of lung cancer subtype in the training (GSE30219) dataset. The clinical heatmap panels show the distributions of clinical parameters, including histological subtype, tumor stage, age, and sex. The score heatmap panels show the proliferation scores, stemness scores, hypoxia scores, and immune scores calculated by mRNA expression profiles, based on the published articles (Supplementary Material). The boxplots of four scores across the lung cancer subtypes are displayed in Supplementary Figure S1. The immune cell heatmap panels show the relative infiltration abundances of 28 immune cell types quantified by ssGSEA. The immune checkpoint heatmap panels show the mRNA expression levels of three immune checkpoint genes, which are targets of immunotherapy. The levels of immune cell infiltration and immune checkpoint gene expression were scaled across all samples using the Z-score method. The subtype-specific marker heatmap panels depict the mRNA expression levels of seven subtype-specific marker genes, including three neuroendocrine marker genes (CD56, SYP, and CHGA), two SCC marker genes (KRT5 and TP63), and one ADC marker gene (NAPSA). Analysis of variance was used to test the differences across five subtype groups. The log 10-transformed p values are displayed on the left of the heatmap panels. (C) Kaplan–Meier curves of overall survival for the lung cancer subtypes in the training (GSE30219) dataset. The patients had undergone only curative surgical resection. ssGSEA, single-sample gene set enrichment analysis; SCC, squamous cell carcinoma; and ADC, adenocarcinoma.
FIGURE 2Clustering heatmap of lung cancer subtypes in the GSE30219 dataset. (A) consensus clustering of all the lung cancer samples based on the top 1,000 most variable genes in the dataset. The left panel represents the matrix heatmap when k = 2, and the right panel represents the consistent cumulative distribution function graph. (B) hierarchical clustering of all the samples based on the top 1,000 most variable genes.
FIGURE 3Flowchart of this study. (A) identification of the NEsubtype-panel. First, in the training dataset (GSE30219), a consensus clustering was performed based on mRNA expression to remove the discordant samples, and then, a panel of transcriptional signatures for determining NE subtype (NEsubtype-panel) in the clustering-adjusted samples was hierarchically developed, based on the “within-sample” relative expression orderings (REOs) of gene pairs to determine the lung NE subtypes. Second, the NEsubtype-panel was tested in multiple datasets with fresh-frozen, clinically challenging (FFPE and small biopsy specimens), and single-cell samples. At last, survival and differential expression analyses were conducted to indirectly support the reclassification indicated by these signatures. (B) the NEsubtype-panel classification diagram. For a given sample, the NEsubtype-panel was used to classify the histological subtype step by step based on the “within-sample” REOs of gene pairs, and to ultimately determine the patient subtype. NE, neuroendocrine; non-NE, non-neuroendocrine; and FFPE, formalin-fixed paraffin-embedded.
Gene pair composition of the NEsubtype-panel
| No. | NE-signature | CARCI-signature | SCLC-signature |
|---|---|---|---|
| Gene | Gene | Gene | |
| 1 |
|
|
|
| 2 |
|
|
|
| 3 |
|
|
|
| 4 |
|
|
|
| 5 |
|
|
|
| 6 |
|
|
|
| 7 |
|
|
|
| 8 |
|
|
|
| 9 |
|
|
|
| 10 |
|
|
|
| 11 |
|
|
|
| 12 |
|
|
|
| 13 |
|
|
|
| 14 |
|
|
|
| 15 |
|
|
|
| 16 |
|
|
|
| 17 |
|
|
|
| 18 |
|
|
|
| 19 |
|
|
|
| 20 |
|
|
|
| 21 |
|
|
|
| 22 |
|
|
|
| 23 |
|
| |
| 24 |
|
| |
| 25 |
|
| |
| 26 |
|
| |
| 27 |
|
| |
| 28 |
|
| |
| 29 |
|
| |
| 30 |
|
| |
| 31 |
| ||
| 32 |
| ||
| 33 |
| ||
| 34 |
| ||
| 35 |
| ||
| 36 |
| ||
| 37 |
| ||
| 38 |
| ||
| 39 |
| ||
| 40 |
|
Gene Symbol and Entrez gene IDs (within brackets) are provided in Table 1. For each gene pair (Gene a and Gene b) in the NEsubtype-panel, if the expression of Gene a is greater than Gene b in a sample, then it was supported to classify the sample as NE, CARCI, or SCLC, respectively. NE, neuroendocrine; CARCI, carcinoids; and SCLC, small-cell lung cancer.
FIGURE 4Hierarchical validation of the NEsubtype-panel. (A) protein–protein interaction network of genes in the NEsubtype-panel constructed using Cytoscape. The NE-signature, CARCI-signature, and SCLC-signature genes are marked in light green, pink, and blue, respectively. Line thickness indicates the strength of data support (interaction score by STRING). The apparent sensitivity, apparent specificity, and apparent accuracy of the (B) NE-signature, (C) CARCI-signature, and (D) SCLC-signature in multiple datasets. The left panel of each signature represents the classification accuracy of different sample types, and the right panel displays the number of reclassified samples. NE, neuroendocrine; CARCI, carcinoids; and SCLC, small-cell lung cancer.
FIGURE 5Biological analyses of the reclassification of the NEsubtype-panel. The boxplots of mRNA expression of the subtype-specific marker genes in (A) GSE60052, (B) TCGA-LUAD, and (C) TCGA-LUSC datasets with the most reclassified samples (6, 10, and 19 samples, respectively). The subtype-specific marker genes include three neuroendocrine marker genes (CD56, SYP, and CHGA), two SCC marker genes (KRT5 and TP63), and one ADC marker gene (NAPSA). The RankProd algorithm was used to test the difference in the subtype-specific marker genes between the reclassified samples and the signature-confirmed samples. (D) Kaplan–Meier curves of overall survival for the non-NE samples that were reclassified as CARCI (blue), signature-confirmed non-NE (yellow), and non-NE samples reclassified as LCNEC (red). (E) multivariate Cox regression analysis for histological classification by signatures after adjusting for data center and clinical parameters in the integrated dataset. NE, neuroendocrine; CARCI, carcinoids; ADC, adenocarcinoma; SCC, squamous carcinoma; and LCNEC, large-cell neuroendocrine carcinoma.