| Literature DB >> 30914778 |
Anna Karlsson1, Helena Cirenajwis2, Kajsa Ericson-Lindquist3,4, Hans Brunnström3,4, Christel Reuterswärd2, Mats Jönsson2, Cristian Ortiz-Villalón5, Aziz Hussein6, Bengt Bergman7, Anders Vikström8, Nastaran Monsef9, Eva Branden10,11, Hirsh Koyi10,11, Luigi de Petris12, Patrick Micke13, Annika Patthey14, Annelie F Behndig15, Mikael Johansson16, Maria Planck2,17, Johan Staaf18.
Abstract
Accurate histological classification and identification of fusion genes represent two cornerstones of clinical diagnostics in non-small cell lung cancer (NSCLC). Here, we present a NanoString gene expression platform and a novel platform-independent, single sample predictor (SSP) of NSCLC histology for combined, simultaneous, histological classification and fusion gene detection in minimal formalin fixed paraffin embedded (FFPE) tissue. The SSP was developed in 68 NSCLC tumors of adenocarcinoma (AC), squamous cell carcinoma (SqCC) and large-cell neuroendocrine carcinoma (LCNEC) histology, based on NanoString expression of 11 (CHGA, SYP, CD56, SFTPG, NAPSA, TTF-1, TP73L, KRT6A, KRT5, KRT40, KRT16) relevant genes for IHC-based NSCLC histology classification. The SSP was combined with a gene fusion detection module (analyzing ALK, RET, ROS1, MET, NRG1, and NTRK1) into a multicomponent NanoString assay. The histological SSP was validated in six cohorts varying in size (n = 11-199), tissue origin (early or advanced disease), histological composition (including undifferentiated cancer), and gene expression platform. Fusion gene detection revealed five EML4-ALK fusions, four KIF5B-RET fusions, two CD74-NRG1 fusion and three MET exon 14 skipping events among 131 tested cases. The histological SSP was successfully trained and tested in the development cohort (mean AUC = 0.96 in iterated test sets). The SSP proved successful in predicting histology of NSCLC tumors of well-defined subgroups and difficult undifferentiated morphology irrespective of gene expression data platform. Discrepancies between gene expression prediction and histologic diagnosis included cases with mixed histologies, true large cell carcinomas, or poorly differentiated adenocarcinomas with mucin expression. In summary, we present a proof-of-concept multicomponent assay for parallel histological classification and multiplexed fusion gene detection in archival tissue, including a novel platform-independent histological SSP classifier. The assay and SSP could serve as a promising complement in the routine evaluation of diagnostic lung cancer biopsies.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30914778 PMCID: PMC6435686 DOI: 10.1038/s41598-019-41585-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Patient cohorts and clinicopathological features.
| Histological assessment by pathologist | Cohort | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Fusion gene detection** | SSP development cohort | External validation cohort I*** | External validation cohort II◆ | External validation cohort III◆ | External validation cohort IV◆◆ | External validation cohort V◆◆ | External validation cohort VI◆◆ | External validation cohort VII◆◆ | |
| AC | 78 | 27 | 39 | 4 | 8 | 115 | 83 | 106 | 127 |
| SqCC | 35 | 30 | — | 4 | 1 | 68 | 26 | 66 | 43 |
| LCNEC | 12 | 11 | — | — | 1 | 5 | — | — | — |
| NSCLC-NOS | 1 | — | — | — | 1 | — | — | — | — |
| LCC | 3 | — | — | 3 | — | 6 | — | — | — |
| Other* | 2 | — | — | — | — | 5 | — | — | — |
| Total | 131 | 68 | 39 | 11 | 11 | 199 | 109 | 172 | 170 |
| Intended use of data | NanoString fusion gene detection ability | Fusion gene detection | NanoString fusion gene detection ability | Fusion gene detection | Fusion gene detection | SSP validation | SSP validation | SSP validation | SSP validation |
| Data generation platform | NanoString | NanoString | NanoString | NanoString | NanoString | RNAseq | Illumina microarrays | Affymetrix microarrays | Affymetrix microarrays |
| Novelty data generation | Yes | Yes | Yes | Yes | Yes | No | No | No | No |
| Tissue origin | FFPE | FFPE | FFPE | FFPE | FFPE | Fresh frozen | Fresh frozen | Fresh frozen | Fresh frozen |
*This subgroup includes sarcomatoid carcinomas, carcinoid tumors, and adenosquamous carcinomas.
**Fusion gene analysis was performed in all samples from the SSP development cohort, and validation cohort I, II and III plus two tumors of other histological subtypes.
***Patients with a never-smoking history and surgically resected tumors.
◆Advanced disease. Biopsies and cytologies.
◆◆Surgically resected tumor material.
Figure 1Cohorts and SSP development. 31 tumors from never-smokers were profiled by NanoString analysis. 29 of these patients and 39 tumors of SqCC and LCNEC histology from in-house biobanks were merged to a final SSP development cohort (n = 68). A feasibility test of the SSP was performed prior to deriving a final prediction model. In the feasibility test, samples were partitioned based on histology into a train and test set respectively and iterated 10 times. The SSP developed during the feasibility test in the train set was used to classify tumors of the test set and re-classify tumors of the train set. Accuracy, balanced accuracy and AUC values were calculated as mean values over iterations. Based on the high performance and low variability due to different sample selections of the iterated SSP models in the feasibility test, a final prediction model was trained in the entire SSP development cohort (n = 68) and used for re-classification of tumors in the SSP development cohort for confirmatory purpose. To test the independent performance of the final SSP, the model was applied to six external validation cohorts that differed in size, tumor stage, histology composition and gene expression data platform. Fusion gene detection using the NanoString assays was performed on a cohort of never-smoking patients (n = 31), the SSP development cohort, and three validation cohorts.
Figure 2Detection of ALK fusion gene and MET exon 14 skipping events using the NanoString technology. (A) Detection of gene fusions, e.g. involving ALK, by the NanoString assay is based on expression (counts) of the 3′, 5′ part of the gene, and fusion specific probes as described elsewhere (see e.g.[11]). The actual count values (left panel) for ALK related probes in sample S_0003 reveals the, likely, exact ALK fusion (EML4-ALK_E13:A20) and demonstrates the differential expression of the 3′ and 5′ probes of the ALK gene when a fusion occurs. Combining a 3′/5′ probe ratio with fusion specific probe expression identifies five ALK fusion positive samples (red samples) in the upper right quadrant of a scatter plot of the 3′/5′ expression ratio versus the expression of ALK fusion specific probes (left panel) (see[11] for further details) for cases subjected to fusion gene analysis (see Table 1). (B) Identification of three patients harboring MET exon 14 skipping events (red samples). Detection is based on high expression of a specific junction probe spanning exon 13–15 (excluding exon 14) (y-axis), versus a ratio of the mean expression for probes representing exons 3–4 and 20–21 divided by exon 14 specific expression (x-axis). Samples harboring MET exon 14 skipping events are visualized in the upper right quadrant as these report high junction probe counts and differential expression of exon 14 and exons 3–4 and 20–21 probe counts. (C) Raw NanoString count data for the S_0297_1 sample, which was tested clinically ALK positive by IHC, but was called FISH negative. NanoString analysis identifies a likely EML4-ALK_E13:A20 fusion. One 5′ probe demonstrates a high background count compared to remaining 5′ probes.
Concordance of ALK gene fusion detection using the NanoString technology to the clinical diagnostic routine analysis (external validation cohort I).
| 36 | 1** | |
| 0 | 2 |
*Methods used include IHC, FISH and/or qPCR.
**FISH negative, IHC positive.
Cohorts and SSP concordance.
| Cohort | Fusion gene detection (n) | Fusion positive cases (n) (incl. | SSP development (n) | AIMS prediction (n) | AC concordance (%) | SqCC concordance (%) | LCNEC concordance (%) | Discordance |
|---|---|---|---|---|---|---|---|---|
| Never-smokers | 31 | 7 (22.6% frequency) | 29 | 29 | 93% (n = 25/27) | 90% (n = 27/30) | 100% (n = 11/11) | — |
| In-house biobanks | 39 | 0 | 39 | 39 | ||||
| Validation cohort I | 39 | 7 | — | — | — | — | — | See Table |
| Validation cohort II | 11 | 0 | — | 11 | 100% (n = 4) | 75% (n = 3/4) | — | SqCC: Low p40 staining, basaloid |
| Validation cohort III | 11 | 0 | — | 11 | 38% (n = 3/8) | 100% (n = 1) | 100% (n = 1) | AC: Low RNA quality (n = 2), poor differentiation/lack of mucin markers (n = 2), SqCC metaplasia (n = 1) |
| Validation cohort IV | — | — | — | 199 | 95% (n = 109/115) | 97% (n = 66/68) | 60% (3/5) | AC: Ends up in AC-enriched clusters |
| Validation cohort V | — | — | — | 109 | 95% (n = 79/83) | 96% (n = 25/26) | — | |
| Validation cohort VI | — | — | — | 172 | 92% (n = 97/106) | 92% (n = 61/66) | — | |
| Validation cohort VII | — | — | — | 170 | 83% (n = 106/127) | 84% (n = 36/43) | — |
Figure 3Discordance in SSP classification. (A) In-depth histopathological evaluation of a tumor initially classified as NSCLC-NOS that was re-classified as AC by a pathologist but SqCC by the SSP (top right panel) in validation cohort III. Hematoxylin staining of the tumor clearly demonstrates the tumor cell rich area (lower left IHC panel), while KRT5 IHC staining reveals positivity in bronchial epithelium with squamous metaplasia surrounding the tumor tissue (lower right IHC panel). The latter is most probably the cause of elevated KRT5 (and other squamous markers) expression in the RNA extracted from the tissue (top left expression panel). (B) Two misclassified LCNEC tumors in validation cohort IV. Due to low mRNA expression of LCNEC associated genes and high expression of AC genes, these tumors are classified as AC by the SSP (top expression panels), discordant to original histopathological assessment. In-depth review of these discordant cases revealed mixed AC and LCNEC histology, evident from IHC stains using AC and LCNEC immunomarkers respectively (lower panels). Further review of these cases revealed that RNA had been extracted and analyzed by RNAseq only from the AC component from these two tumors.
Figure 4Examples of SSP classification. (A) SSP classification of three individual tumors corresponding to three major subtypes of NSCLC is based on expression of genes associated with AC histology (SFTPG, TTF-1, NAPSA), SqCC histology (TP73L, KRT6A, KRT5, KRT40, and KRT16) or LCNEC histology (CHGA, SYP, and CD56). (B) The SSP classifies these three individual tumors with high probability rates in concordance with the histopathological assessment of histology.