| Literature DB >> 31015447 |
Jennifer E Beane1, Sarah A Mazzilli2, Joshua D Campbell2, Grant Duclos2, Kostyantyn Krysan3, Christopher Moy4, Catalina Perdomo5, Michael Schaffer4, Gang Liu2, Sherry Zhang2, Hanqiao Liu2, Jessica Vick2, Samjot S Dhillon6, Suso J Platero7, Steven M Dubinett3, Christopher Stevenson8, Mary E Reid9, Marc E Lenburg2, Avrum E Spira2,4.
Abstract
Bronchial premalignant lesions (PMLs) are precursors of lung squamous cell carcinoma, but have variable outcome, and we lack tools to identify and treat PMLs at risk for progression to cancer. Here we report the identification of four molecular subtypes of PMLs with distinct differences in epithelial and immune processes based on RNA-Seq profiling of endobronchial biopsies from high-risk smokers. The Proliferative subtype is enriched with bronchial dysplasia and exhibits up-regulation of metabolic and cell cycle pathways. A Proliferative subtype-associated gene signature identifies subjects with Proliferative PMLs from normal-appearing uninvolved large airway brushings with high specificity. In progressive/persistent Proliferative lesions expression of interferon signaling and antigen processing/presentation pathways decrease and immunofluorescence indicates a depletion of innate and adaptive immune cells compared with regressive lesions. Molecular biomarkers measured in PMLs or the uninvolved airway can enhance histopathological grading and suggest immunoprevention strategies for intercepting the progression of PMLs to lung cancer.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31015447 PMCID: PMC6478943 DOI: 10.1038/s41467-019-09834-2
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Subject demographic and clinical annotation in the discovery and validation cohorts
| Variable | Discovery cohort ( | Validation cohort ( | |
|---|---|---|---|
| Average # biopsies/subject | 6.6 (5.7) | 5.25 (2.9) | 0.3 |
| Average # bronchoscopies/subject | 3.1 (1.6) | 2.5 (0.7) | 0.08 |
| Average time between bronchoscopies (days) | 348.6 (197.5) | 366.8 (208.3) | 0.69 |
| Male | 15/30 (50) | 12/20 (60) | 0.81 |
| White | 27/30 (90) | 17/20 (85) | 1 |
| Age (at baseline clinical visit) | 58.8 (7.6) | 58.7 (8.3) | 0.97 |
| Ever smoker (at baseline clinical visit) | 29/30 (96.7) | 19/20 (95) | 1 |
| Pack-years (at baseline clinical visit) | 49.8 (22.1) | 41.3 (20.7) | 0.17 |
| Prior history of lung cancer | 21/30 (70) | 12/20 (60) | 0.82 |
| Prior history of LUSC | 5/30 (16.7) | 5/20 (25) | 0.73 |
| COPD (FEV1/FVC ≤ 0.7, at baseline clinical visit) | 17/27 (63.0) | 8/18 (44.4) | 0.61 |
| GOLD 1 (FEV1%> 80) | 2/27 (7.4) | 2/18 (11.1) | 1 |
| GOLD 2 (FEV1% < 80 and > 50) | 12/27 (44.4) | 5/18 (27.8) | 0.56 |
| GOLD 3 (FEV1% < 50 and > 30) | 3/27 (11.1) | 1/18 (5.6) | 1 |
| Occupational asbestos | 13/30 (43.3) | 9/20 (45) | 1 |
| Occupational high-risk job | 14/30 (46.7) | 12/20 (60) | 0.62 |
Statistical tests between the discovery and validation cohorts were performed using two-sided Fisher’s exact tests for categorical variables and two-sided Student’s t-tests for continuous variables. Percentages are reported for categorical variables and mean and standard deviations are reported for continuous variables. Source data are provided as a Source Data file
Sample clinical annotation in the discovery and validation cohorts
| Variable | Discovery cohort | Validation cohort | ||||
|---|---|---|---|---|---|---|
| Sample type | Biopsies | Brushes | Biopsies | Brushes | Biopsies | Brushes |
| Histology | 0.05 | 0.42 | ||||
| Normal | 38/190 (20) | 6/89 (6.7) | 23/105 (21.9) | 0/48 (0) | ||
| Hyperplasia | 30/190 (15.8) | 11/89 (12.4) | 31/105 (29.5) | 9/48 (18.8) | ||
| Metaplasia | 46/190 (24.2) | 15/89 (16.9) | 14/105 (13.3) | 9/48 (18.8) | ||
| Mild dysplasia | 21/190 (11.1) | 9/89 (10.1) | 13/105 (12.4) | 6/48 (12.5) | ||
| Moderate dysplasia | 38/190 (20) | 30/89 (33.7) | 20/105 (19.0) | 18/48 (37.5) | ||
| Severe dysplasia | 12/190(6.3) | 17/89 (19.1) | 4/105 (3.8) | 6/48 (12.5) | ||
| Carcinoma in situ | 1/190 (0.5) | 0/89 (0) | 0/105 (0) | 0/48 (0) | ||
| Tumor | 0/190 (0) | 1/89 (1.1) | 0/105 (0) | 0/48 (0) | ||
| Unknown histology | 4/190 (2.1) | 0/89 (0) | 0/105 (0) | 0/48 (0) | ||
| Current smoker (genomic prediction) | 119/190 (62.6) | 44/89 (49.4) | 38/105 (36.2) | 20/48 (41.7) | 1.80E-05 | 0.47 |
| Progression status | 0.39 | |||||
| Normal/stable | 47/190 (24.7) | 35/105 (33.3) | ||||
| Progressive/persistent | 44/190(23.2) | 20/105 (19.0) | ||||
| Regressive | 30/190 (15.8) | 18/105 (17.1) | ||||
| Unknown | 69/190 (36.3) | 32/105 (30.5) | ||||
Statistical tests between the discovery and validation cohorts within either the biopsies or brushes were performed using two-sided Fisher’s exact tests and percentages are reported. Source data are provided as a Source Data file
Fig. 1Endobronchial biopsies divide into four distinct molecular subtypes that correlate with clinical and molecular phenotypes. a Genes (n = 3936) organized into nine gene co-expression modules were used to discover four molecular subtypes (Proliferative, Inflammatory, Secretory, and Normal-like) across the 190 DC biopsies using consensus clustering. The heatmap shows semi-supervised hierarchal clustering of z-score-normalized gene expression across the 3936 genes and 190 DC biopsies. The top color bar represents the four molecular subtypes: Proliferative (n = 52 samples), Inflammatory (n = 37 samples), Secretory (n = 61 samples), and Normal-like (n = 40 samples). To the left of the heatmap, barplots for each module show the mean module GSVA score for each subtype. To the right of the heatmap, a summary of enriched biological pathways is listed for each module. b Bubbleplots showing significant associations (p < 0.01, two-sided Fisher’s exact test) between the molecular subtypes and genomic smoking status, biopsy histological grade, and the predicted LUSC tumor molecular subtypes. The columns represent the four molecular subtypes (Proliferative, Inflammatory, Secretory, and Normal-like) and the diameter of the circle is proportional to the number of samples within each subtype that have the row phenotype. c Boxplot of MKI67 expression values in biopsies with normal or hyperplasia histology (n = 8, 16, 26, 18 in Proliferative, Inflammatory, Secretory, and Normal-like subtypes, respectively). The MKI67 expression levels of the Proliferative subtype are significantly greater than non-Proliferative subtype samples (FDR = 3.4e-10, linear model). d Boxplot of expression values of MKI67 in biopsies with dysplastic histology (n = 33, 11, 19, 9 in Proliferative, Inflammatory, Secretory, and Normal-like subtypes, respectively). The MKI67 expression levels of the Proliferative subtype are significantly greater than non-Proliferative subtype samples (FDR = 3.1e-8). e Immunofluorescent staining demonstrating the increased MKI67 and KRT5 staining and reduced TUB1A1 staining in the Proliferative subtype. The representative samples shown for the Proliferative and Inflammatory subtypes have dysplasia histology, whereas the samples shown for the Secretory and Normal-like subtypes have normal histology (Magnification × 200). In the boxplots, the upper and lower hinges correspond to the first and third quartile, center line represents the median, and whiskers extend from the hinge to the largest or smallest value at most 1.5 times the distance between the quartiles. Source data are provided as a Source Data file
Molecular subtype characteristics in the discovery cohort
| Proliferative | |
| Up-regulated modules | 4, 5, 7 |
| Down-regulated modules | 6 |
| Clinical characteristics | Current smoking (86%), dysplastic biopsies (63%) |
| Biological characteristics | LUSC subytpes—classical and basal; |
| Pathways | Cell cycle: |
| DNA repair: | |
| Oxidative phosphorylation and electron transport chain: ATP synthases, NADH-ubiquinone oxidoreductases, cytochrome C oxidases | |
| Transcription factors |
|
| Inflammatory | |
| Up-regulated modules | 1, 2, 7, 8 |
| Down-regulated modules | 4, 5, 6 |
| Clinical characteristics | Former smoking (59%), non-dysplastic biopsies (68%) |
| Biological characteristics | LUSC subytpes—secretory; |
| Pathways | Extracellular matrix, focal adhesion, and integrin pathways: collagen, integrin, and laminin genes |
| Cytokine/chemokine: | |
| Downregulation of oxidative phosphorylation, respiratory elecron transport, cell cycle | |
| Transcription factors |
|
| Secretory | |
| Up-regulated modules | 6, 8 |
| Down-regulated modules | 1, 2, 5, 7 |
| Clinical characteristics | Current smoking (63%), non-dysplastic biopsies (66%) |
| Biological characteristics | LUSC subytpes—secretory; |
| Pathways | Down regulation of extracellular matrix, focal adhesion, integrin pathways |
| Transcription factors | Down regulation of |
| Normal-like | |
| Up-regulated modules | 1, 6 |
| Down-regulated modules | 8, 9 |
| Clinical characteristics | Former smoking (65%), non-dysplastic biopsies (75%) |
| Biological characteristics | |
| Pathways | Core extracellular matrix genes: collagen and laminin genes, |
| Down regulation of innate and adaptive immunity: HLA genes, | |
| Transcription factors | Down regulation of |
For each molecular subtype, significant associations are reported between the molecular subtype and gene module GSVA scores, clinical characteristics, canonical cell type epithelial and white blood cell gene markers, biological pathways, and transcription factors. The modules are designated as up regulated or down regulated in each molecular subtype based on the direction of gene expression of the majority of genes in each module. Source data are provided as a Source Data file
Fig. 2Phenotypic associations with the molecular subtypes are confirmed in an independent sample set. a The 190 DC biopsies and the 3936 genes were used to build a 22-gene nearest centroid molecular subtype classifier. The heatmap shows semi-supervised hierarchal clustering of z-score normalized gene expression across the 22 classifier genes and 190 DC biopsies training samples. b The 22-gene nearest centroid molecular subtype classifier was used to predict the molecular subtypes of the 105 VC biopsies. The heatmap shows semi-supervised hierarchal clustering of z-score normalized gene expression across 22 genes and 105 VC. The rows of the heatmap give the gene name and module membership, and the column color bar shows molecular subtype membership. c Bubbleplots showing significant associations (p < 0.01 by two-sided Fisher’s exact test) between the VC molecular subtypes and smoking status, biopsy histological grade, and the predicted LUSC tumor molecular subtypes. The columns represent the four molecular subtypes (Proliferative, Inflammatory, Secretory, and Normal-like) and the radius of the circle is proportional to the number of samples within each subtype that have the row phenotype. Source data are provided as a Source Data file
Fig. 3Normal-appearing bronchial brushes predict the presence of proliferative lesions. a The DC (left) and VC (right) cohorts, showing the number of brushes (y axis) classified as proliferative orange) that have at least one biopsy (y axis) classified as proliferative at the time the brush was sampled. Brushes/biopsies classified as not proliferative are turquoise. b Boxplots of GSVA scores for modules 4, 5, 6, and 7 (y axis) across all brushes (n = 86 in DC and n = 48 in VC) and biopsies (n = 190 in DC and n = 105 in VC) from each cohort classified as Proliferative or not Proliferative (x axis). The red asterisk indicates significant differences between the Proliferative subtype versus all other samples (FDR < 0.05, linear model). In the boxplots, the upper and lower hinges correspond to the first and third quartile, center line represents the median, and whiskers extend from the hinge to the largest or smallest value at most 1.5 times the distance between the quartiles. Source data are provided as a Source Data file
Fig. 4Immune alterations are associated with lesion outcome in the Proliferative subtype. Boxplots of Module 9 GSVA scores across DC a and VC biopsies b within the Proliferative subtype. There is a significant difference between the progressive/persistent versus regressive biopsies (p = 0.002 (DC) and p = 0.03 (VC), linear models). c Top: heatmap of z-score-normalized gene expression across the 112 genes in Module 9 in the DC biopsies (left) and the VC biopsies (right). Each heatmap is supervised by Module 9 GSVA scores. Top color bars indicate the histological grade of the biopsies and their progression status. Bottom: heatmap of xCell results indicating the relative abundance of immune cell types across the DC biopsies (left) and the VC biopsies (right). Immune cell types displayed are significantly associated with lesion progression/persistence (FDR < 0.05 in both the DC and VC, linear model). d Representative histology where the dashed yellow line denotes the separation of epithelium and stromal compartments. Top panels: a progressive severe dysplasia has reduced presence of immune cells demonstrated by the marked reduction in expression of M2 macrophages (CD68/163 staining, double-positive cells indicated by the yellow arrows) and CD8 T cells (sample corresponds to *P in c). Bottom panels: a regressive moderate dysplasia has increased presence of immune cells including M2 macrophages (CD68/163 staining double-positive cells indicated by the yellow arrows) and CD8 T cells (samples correspond to *R in c). e Boxplots of the percentages of CD68 and CD163, CD68, CD163, CD4, and CD8 positively stained cells between progressive/persistent and regressive biopsies (p < 0.001, linear model, for all comparisons). The x axis labels indicate the number of regions (R) enumerated across (P) subjects for each stain and outcome group depicted in the boxplot. Biopsies were included in the analysis if their clinical outcome was concordant with the Module 9 score. In the boxplots, the upper and lower hinges correspond to the first and third quartile, center line represents the median, and whiskers extend from the hinge to the largest or smallest value at most 1.5 times the distance between the quartiles. Source data are provided as a Source Data file