Literature DB >> 35255902

Distinct bronchial microbiome precedes clinical diagnosis of lung cancer.

Janice M Leung1, Wan L Lam2,3, Erin A Marshall4,5, Fernando S L Filho6, Don D Sin6, Stephen Lam4,5.   

Abstract

Resident microbial populations have been detected across solid tumors of diverse origins. Sequencing of the airway microbiota represents an opportunity for establishing a novel omics approach to early detection of lung cancer, as well as risk prediction of cancer development. We hypothesize that bacterial shifts in the pre-malignant lung may be detected in non-cancerous airway liquid biopsies collected during bronchoscopy. We analyzed the airway microbiome profile of near 400 patients: epithelial brushing samples from those with lung cancer, those who developed an incident cancer, and those who do not develop cancer after 10-year follow-up. Using linear discriminate analysis, we define and validate a microbial-based classifier that is able to predict incident cancer in patients before diagnosis with no clinical signs of cancer. Our results demonstrate the potential of using lung microbiome profiling as a method for early detection of lung cancer.
© 2022. The Author(s).

Entities:  

Keywords:  Early detection; Liquid biopsy; Lung cancer; Microbiome; Risk prediction

Mesh:

Year:  2022        PMID: 35255902      PMCID: PMC8900294          DOI: 10.1186/s12943-022-01544-6

Source DB:  PubMed          Journal:  Mol Cancer        ISSN: 1476-4598            Impact factor:   27.401


Background

Forecasting the 10-15% of smokers who would develop lung cancer would improve survival rate [1, 2]. Smoking weakens the integrity of the bronchial epithelium, rendering the lungs more susceptible to resident microbial changes [3, 4]. Lung tumor and surround non-malignant tissues show differences in microbiome composition, and are both distinguishable from health lungs [4-8]. While lung microbiome profiles are explored as a marker for tumor presence, we see far greater potential in the clinical utility of detecting lung microbiome changes, as an indicator of imminent cancer development, in individuals at risk for cancer before any cancer diagnosis can be made, i.e. prior to clinical detection of tumors [9]. Here, we examined airway microbiomes of 400 individuals prior to cancer diagnosis, and compared those who do and do not subsequently develop cancer within a ~10-year follow-up period.

Results

Patient cohorts and microbiome profiling

Bronchscopy-obtained tissue and liquid biopsies (bronchial brushing and washing) are clinically used to monitor high-risk patients. Bronchial brushing samples from bronchoscopy of 352 smokers were retrospectively obtained as part of the Lung Health Study at BC Cancer (BCC) with Research Ethics Board approval [10]. Seven patients had a previous diagnosis of lung cancer that was treated with surgery. Study participants were stratified based on their diagnosis at the time of sampling by bronchoscopy into No Cancer, Incident Cancer and Prevalent Cancer categories for correlative analyses with microbiome attributes (Fig. 1A). Incident Cancer refers to patients who at the time of bronchoscopy did not have cancer, but during follow-up developed incident lung cancer. The 345 samples were randomized and arbitrarily divided into two cohorts such that each category was matched for age, pack-year smoking history, and follow-up time between cohorts: two-thirds used for discovery, and one-third as validation (Cohort 1 n = 230, Cohort 2 n = 115; Table 1). These cohorts show no statistically-significant difference in age, pack-year smoking history, lung function, or follow-up time (all p>0.05; mean follow-up of 10.2 years).
Fig. 1

Microbiome-based LMPC classifier identifies cancer onset in independent cohort of patients at risk of lung cancer. A Timeline of bronchial sampling in Cohorts 1-3. B Relative abundance of taxa in the microbial communities Cohorts 1 is shown at genus level of classification. C Linear discriminant analysis (LDA) scores of taxonomic features included in the LMPC scoring model. A score magnitude of 4 was used as a cut-off for significant features, with a Kruskal-Wallis p-value of <0.05. Grey bars represent taxa that are higher in relative abundance in No Cancer participants, while blue bars represent taxa with higher relative abundance in Incident Cancer participants. D Formula used to construct LMPC score, where a represents the LDA score value and x represents the relative abundance of each taxa (described in Fig. 1C). E Receiver Operating Characteristic AUC differentiates Incident Cancer participants from No Cancer participants in Cohort 1 (p<0.0001, AUC: 0.7057, 95% CI: 0.6118 to 0.7997). F When the Incident Cancer (n = 18) and No Cancer participants (n = 91) in Cohort 2 were combined and stratified by risk score, those with high scores (red) had significantly earlier cancer diagnosis than those with low score (blue). G Within the Incident Cancer patient group, those with high scores demonstrated shorter time to cancer diagnosis than those with low scores. In both analysis, samples were separated into two groups based on the median score value. H Using a Receiver Operating Characteristic AUC as defined by the score, we are able to differentiate Incident Cancer participants from No Cancer participants in Cohort 2 (p=0.0498, AUC: 0.6503, 95% CI: 0.5167-0.7839). I In an independent cohort (Cohort 3), incident cancer participants comprised 4 of the top 5 largest score values. Incident cancer participants are shown in blue, while participants who did not have a diagnosed cancer with follow-up are indicated in black. Cancer onset was assessed using receiver operating curves and log-rank tests, where p<0.05 was considered significant

Table 1

Clinical summary of Cohort 1 (n=230), 2 (n=115), and 3 (n=48). Cohorts 1 and 2 were sourced from BCC, and clinical characteristics are summarized for incident- (n=36, n=18), prevalent- (n=12, n=6), and no-cancer (n=182, n=91) participants. Cohort 3 was sourced from SPH/BCC and clinical characteristics are summarized for incident- (n=5), prevalent- (n=3), no-cancer (n=40) participants. Patients were randomly assisnged to Cohort 1 or 2 Values are displayed as mean values, unless otherwise specified

Cohort 1Cohort 2Cohort 3
IncidentPrevalentNo cancerIncidentPrevalentNo cancerIncidentPrevalentNo cancer
N3612182186915340
Age (mean)64.462.162.164.665.561.864.664.362.6
Sex (% female)44.45046.25083.345.14066.647.5
Smoking status (mean %)
Current502545.655.633.347.32033.347.5
Ex507554.444.466.752.7600.052.5
Never0000002033.30
Pack-years smoked (mean)51.1484755.34744.850.737.848
Lung function (mean)
FEV1/FVC67.36271.36665.272.173.67967
FEV12.52.12.62.31.82.62.32.12.4
FEV1 % predicted78.571.484.385.268.685.28883.377.6
COPD status (mean %)
No COPD36.133.357.138.933.363.78010045
Mild19.48.318.116.7017.6005
Moderate38.92520.933.333.316.520035
Severe2.816.72.711.116.71.10015
Very Severe000000000
Follow-up time (mean, yr)
Total9.79.51010.18.410.17.679.8
To cancer4.2N/AN/A4.3N/AN/A1.9N/AN/A
Cancer type
LUAD42.488.938.966.7100100
LUSC12.111.116.733.300
NSCLC9.1011.1000
SCLC12.105.6000
Other24.2027.80NANA
Microbiome-based LMPC classifier identifies cancer onset in independent cohort of patients at risk of lung cancer. A Timeline of bronchial sampling in Cohorts 1-3. B Relative abundance of taxa in the microbial communities Cohorts 1 is shown at genus level of classification. C Linear discriminant analysis (LDA) scores of taxonomic features included in the LMPC scoring model. A score magnitude of 4 was used as a cut-off for significant features, with a Kruskal-Wallis p-value of <0.05. Grey bars represent taxa that are higher in relative abundance in No Cancer participants, while blue bars represent taxa with higher relative abundance in Incident Cancer participants. D Formula used to construct LMPC score, where a represents the LDA score value and x represents the relative abundance of each taxa (described in Fig. 1C). E Receiver Operating Characteristic AUC differentiates Incident Cancer participants from No Cancer participants in Cohort 1 (p<0.0001, AUC: 0.7057, 95% CI: 0.6118 to 0.7997). F When the Incident Cancer (n = 18) and No Cancer participants (n = 91) in Cohort 2 were combined and stratified by risk score, those with high scores (red) had significantly earlier cancer diagnosis than those with low score (blue). G Within the Incident Cancer patient group, those with high scores demonstrated shorter time to cancer diagnosis than those with low scores. In both analysis, samples were separated into two groups based on the median score value. H Using a Receiver Operating Characteristic AUC as defined by the score, we are able to differentiate Incident Cancer participants from No Cancer participants in Cohort 2 (p=0.0498, AUC: 0.6503, 95% CI: 0.5167-0.7839). I In an independent cohort (Cohort 3), incident cancer participants comprised 4 of the top 5 largest score values. Incident cancer participants are shown in blue, while participants who did not have a diagnosed cancer with follow-up are indicated in black. Cancer onset was assessed using receiver operating curves and log-rank tests, where p<0.05 was considered significant Clinical summary of Cohort 1 (n=230), 2 (n=115), and 3 (n=48). Cohorts 1 and 2 were sourced from BCC, and clinical characteristics are summarized for incident- (n=36, n=18), prevalent- (n=12, n=6), and no-cancer (n=182, n=91) participants. Cohort 3 was sourced from SPH/BCC and clinical characteristics are summarized for incident- (n=5), prevalent- (n=3), no-cancer (n=40) participants. Patients were randomly assisnged to Cohort 1 or 2 Values are displayed as mean values, unless otherwise specified Airway DNA samples were amplified and profiled at the hypervariable region V4 of the 16S rDNA [11] using a paired-end read chemistry (2x250bp) alongside negative control samples for amplification and sequencing. All samples yielded high-quality sequence data for delineating operational taxonomic units (OTUs, Naïve Bayes clustering) through clustering and alignment to the SILVA reference database (v132) [12, 13]. Reads were processed using the QIIME2® analysis platform (version 2019.1) [14], and forward reads were trimmed to 231 bases. Organisms identified to be dominant in control samples were excluded from analysis. Microbial community alpha diversity was evaluated (Shannon index, Pielou Evenness, number of OTUs, and Faith’s phylogenetic diversity), and no significant difference in diversity between Cohorts 1 and 2, or between groups within each cohort. Further, principal coordinate analysis of Bray Curtis Dissimilarity did not result in separation between participants from Cohorts 1 and 2 (PERMOVA, p>0.05; data not shown). Additional samples were obtained from a BC Cancer and St. Paul’s Hospital study to form a third cohort for the validation of the microbiome classifier for predicting incident cancers (Cohort 3, n = 48; described in Table 1).

Relating airway microbiome to cancer status

At the Genus level, relative abundance measures were dominated by Veillonella, Streptococcus, Prevotella, and Paenibacillus in all three categories (182 No Cancer, 36 Incident Cancer and 12 Prevalent Cancer) (Fig. 1B). Taxonomic pattern in Cohort 1 discriminated the Incident Cancer Group from the No Cancer Group (i.e. no lung cancer at any point during follow-up) suggesting that shifts in taxonomic composition related to lung cancer may occur in airways of patients who develop incident lung cancer many months from the date of bronchoscopy. To determine which taxonomic identifiers discriminated patients who developed incident cancers from those who did not develop lung cancer, we used a combination of linear discriminate analysis and effect size modeling (LEfSe; Fig. 1C) [15]. LEfSe was chosen to determine features most likely differentiate groups because in additional to differences in relative abundance, this statistical model captures potential relevance by considering magnitude effect size in the LDA score. Beyond the relative abundance of Bacilli, the features with the strongest association with incident cancer status were concentrated in the Bacilli class, and the relative abundance of Lactobacillales, the Streptococcus genus and associated family, as well as the Paenibacillus genus and associated family.

LMPC classifier predicting incident lung cancer in smokers

To determine whether the candidate features identified above could be used to build a classifier to predict incident cancer risk, we created a summative score based on the LDA scores and applied them to our cohorts (LMPC; Lung Microbiome Predictor of Cancer). A LDA-weighted combined score was created by multiplying the LDA score value (established in the discovery cohort above) by the relative frequency of the feature in each of the patient sample (Fig. 1D). In the discovery cohort, the Area Under the Curve of the Receiver Operating Characteristic (AUC) was able to differentiate the Incident Cancer participants from the No Cancer participants (p<0.0001, AUC: 0.7057, 95% CI: 0.6118-0.7997; Fig. 1E). Using Cohort 2 as a validation cohort (18 Incident Cancer and 91 No Cancer), this classifier was able to predict cancer incidence. When stratified by the median score, patients with high scores had a significantly higher rates of incident cancer than those with low scores (p=0.01) (Fig. 1F). Further, when clinical features are incorporated to adjust for participant age, smoking history (pack-years smoked), and lung function (FEV1% predicted), the LMPC score can identify patients who will develop lung cancer (Cox proportional hazards model; log-rank test, p=0.004). Again, the AUC was able to differentiate the Incident Cancer from No Cancer participants (p=0.0498, AUC: 0.6503, 95% CI: 0.5167-0.7839) (Fig. 1G). Further, within the Incident Cancer subset of Cohort 2, patients with higher scores had a trend towards faster cancer incidence than those with low scores (log-rank test, p=0.05; adjusted for clinical co-variants as above) (Fig. 1H).

Application of LMPC classifier to an independent cohort

A third cohort was processed and sequenced independently from Cohorts 1 and 2 (40 No Cancer, 5 Incident Cancer). After applying the scoring system to Cohort 3, 4/5 Incident Cancer participants had the higher risk scores, indicating that these individuals may be at the highest risk of developing cancer (Fig. 1I). Indeed, they developed lung cancer on average 16 months from the time of sampling, while the case with low score did not developed cancer until 53 months. The AUC readily differentiated Incident Cancer from No Cancer participants (p=0.0103, AUC: 0.8550, 95% CI: 0.6180-1.000).

Discussion

Distinguishing the 10-15% of smokers who would develop lung cancer from those who will not presents a clinical challenge. While the consensus of specific taxonomic classifiers associated with lung cancer remains contentious, it is clear that the microbiome is involved in human biological processes, and plays a functional role in cancer pathogenesis. However, the timing of these microbial community alterations with respect to cancer onset remains unknown and has tremendous clinical implications in advancing early detection of lung cancer. Here, we showed that there are significant detectable changes in the lung microbiome in participants who, at the time of bronchoscopy, did not have cancer but who subsequently developed lung cancer during follow-up (i.e. the Incident Cancer patients). These data thus propose a microbial-DNA-based classifier to predict incident cancer cases in our cohort of current and former smokers. We identified taxonomic features (changes in Cohort 1) that yielded the largest LDA score and developed a classifier, which was able to distinguish cancer status. The classifier model was validated in Cohort 2, demonstrating its ability to distinguish Incident Cancer from No Cancer and thus enabling prediction of incident cancer in smokers who are otherwise asymptomatic (Fig. 1D). When stratified by the LMPC classifier, participants with a high score have a significantly shorter time to cancer onset than those with a low score in a combined group of No-Cancer and Incident Cancer participants. Of note, the absolute risk in this cohort is consistent with cancer onset rates in “high-risk” individuals, suggesting that these findings may be applicable to a broader at-risk community. Within the Incident Cancer group (in Cohort 2) those samples with high scores (separated by median) had earlier cancer onset than those with low scores (Fig. 1F). When this same scoring was applied to a third cohort that was sequenced independently, 4/5 Incident Cancer participants were identified to have the highest LMPC scores and developed cancer in a mean time of 1.29 years from the time of sampling (Fig. 1H). Upon further examination, the fifth Incident Cancer sample (C3-15) with the lower score was found to have a much longer time-to-cancer diagnosis of 4.43 years following the airway sample collection. Since this proposed classifier is able to distinguish participants who will develop cancer over time from those who will not, it is reasonable to expect that this score is more predictive the closer the sample procurement is to cancer diagnosis. The behavior of this patient is consistent with this expectation, indicating that this risk score model may be most applicable closer to cancer onset. Our results demonstrate that specific changes in airway liquid biopsy (brushings) samples, obtained during clinical examination by bronchoscopy, in fact, occur before clinical cancer diagnosis. Lung microbiome shifts can serve as a marker in advance of the clinical phenotypes detectable by low-dose CT scans. We have defined a microbiome-based classifier that can predict incident cancer with follow-up from patients having no clinical signs of cancer. As lung-resident microbiomes are associated with smoking status, and this study describes cohorts of ever-smokers and former smokers, future work should examine if this signature can be applied to assess cancer onset in individuals without a history of smoking history. Our findings provide strong rationale for developing microbiome-based liquid biopsy technology to prioritize at-risk individuals for clinical attention.
  15 in total

1.  Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform.

Authors:  James J Kozich; Sarah L Westcott; Nielson T Baxter; Sarah K Highlander; Patrick D Schloss
Journal:  Appl Environ Microbiol       Date:  2013-06-21       Impact factor: 4.792

Review 2.  The biology and management of non-small cell lung cancer.

Authors:  Roy S Herbst; Daniel Morgensztern; Chris Boshoff
Journal:  Nature       Date:  2018-01-24       Impact factor: 49.962

3.  Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2.

Authors:  Evan Bolyen; Jai Ram Rideout; Matthew R Dillon; Nicholas A Bokulich; Christian C Abnet; Gabriel A Al-Ghalith; Harriet Alexander; Eric J Alm; Manimozhiyan Arumugam; Francesco Asnicar; Yang Bai; Jordan E Bisanz; Kyle Bittinger; Asker Brejnrod; Colin J Brislawn; C Titus Brown; Benjamin J Callahan; Andrés Mauricio Caraballo-Rodríguez; John Chase; Emily K Cope; Ricardo Da Silva; Christian Diener; Pieter C Dorrestein; Gavin M Douglas; Daniel M Durall; Claire Duvallet; Christian F Edwardson; Madeleine Ernst; Mehrbod Estaki; Jennifer Fouquier; Julia M Gauglitz; Sean M Gibbons; Deanna L Gibson; Antonio Gonzalez; Kestrel Gorlick; Jiarong Guo; Benjamin Hillmann; Susan Holmes; Hannes Holste; Curtis Huttenhower; Gavin A Huttley; Stefan Janssen; Alan K Jarmusch; Lingjing Jiang; Benjamin D Kaehler; Kyo Bin Kang; Christopher R Keefe; Paul Keim; Scott T Kelley; Dan Knights; Irina Koester; Tomasz Kosciolek; Jorden Kreps; Morgan G I Langille; Joslynn Lee; Ruth Ley; Yong-Xin Liu; Erikka Loftfield; Catherine Lozupone; Massoud Maher; Clarisse Marotz; Bryan D Martin; Daniel McDonald; Lauren J McIver; Alexey V Melnik; Jessica L Metcalf; Sydney C Morgan; Jamie T Morton; Ahmad Turan Naimey; Jose A Navas-Molina; Louis Felix Nothias; Stephanie B Orchanian; Talima Pearson; Samuel L Peoples; Daniel Petras; Mary Lai Preuss; Elmar Pruesse; Lasse Buur Rasmussen; Adam Rivers; Michael S Robeson; Patrick Rosenthal; Nicola Segata; Michael Shaffer; Arron Shiffer; Rashmi Sinha; Se Jin Song; John R Spear; Austin D Swafford; Luke R Thompson; Pedro J Torres; Pauline Trinh; Anupriya Tripathi; Peter J Turnbaugh; Sabah Ul-Hasan; Justin J J van der Hooft; Fernando Vargas; Yoshiki Vázquez-Baeza; Emily Vogtmann; Max von Hippel; William Walters; Yunhu Wan; Mingxun Wang; Jonathan Warren; Kyle C Weber; Charles H D Williamson; Amy D Willis; Zhenjiang Zech Xu; Jesse R Zaneveld; Yilong Zhang; Qiyun Zhu; Rob Knight; J Gregory Caporaso
Journal:  Nat Biotechnol       Date:  2019-08       Impact factor: 54.908

4.  Incremental value of pulmonary function and sputum DNA image cytometry in lung cancer risk prediction.

Authors:  Martin C Tammemagi; Stephen C Lam; Annette M McWilliams; Don D Sin
Journal:  Cancer Prev Res (Phila)       Date:  2011-03-16

5.  Tumor Microbiome Diversity and Composition Influence Pancreatic Cancer Outcomes.

Authors:  Erick Riquelme; Yu Zhang; Liangliang Zhang; Maria Montiel; Michelle Zoltan; Wenli Dong; Pompeyo Quesada; Ismet Sahin; Vidhi Chandra; Anthony San Lucas; Paul Scheet; Hanwen Xu; Samir M Hanash; Lei Feng; Jared K Burks; Kim-Anh Do; Christine B Peterson; Deborah Nejman; Ching-Wei D Tzeng; Michael P Kim; Cynthia L Sears; Nadim Ajami; Joseph Petrosino; Laura D Wood; Anirban Maitra; Ravid Straussman; Matthew Katz; James Robert White; Robert Jenq; Jennifer Wargo; Florencia McAllister
Journal:  Cell       Date:  2019-08-08       Impact factor: 41.582

6.  Comparative Analysis of Age- and Gender-Associated Microbiome in Lung Adenocarcinoma and Lung Squamous Cell Carcinoma.

Authors:  Lindsay M Wong; Neil Shende; Wei Tse Li; Grant Castaneda; Lauren Apostol; Eric Y Chang; Weg M Ongkeko
Journal:  Cancers (Basel)       Date:  2020-06-02       Impact factor: 6.639

7.  Estimating lifetime and 10-year risk of lung cancer.

Authors:  Christina Bruder; Jean-Luc Bulliard; Simon Germann; Isabelle Konzelmann; Murielle Bochud; Magali Leyvraz; Arnaud Chiolero
Journal:  Prev Med Rep       Date:  2018-06-18

Review 8.  Cigarette Smoke Induced Lung Barrier Dysfunction, EMT, and Tissue Remodeling: A Possible Link between COPD and Lung Cancer.

Authors:  Wei Hou; Siyi Hu; Chunyan Li; Hanbin Ma; Qi Wang; Guangping Meng; Tingting Guo; Jie Zhang
Journal:  Biomed Res Int       Date:  2019-06-24       Impact factor: 3.411

9.  Microbiome analyses of blood and tissues suggest cancer diagnostic approach.

Authors:  Gregory D Poore; Evguenia Kopylova; Qiyun Zhu; Carolina Carpenter; Serena Fraraccio; Stephen Wandro; Tomasz Kosciolek; Stefan Janssen; Jessica Metcalf; Se Jin Song; Jad Kanbar; Sandrine Miller-Montgomery; Robert Heaton; Rana Mckay; Sandip Pravin Patel; Austin D Swafford; Rob Knight
Journal:  Nature       Date:  2020-03-11       Impact factor: 49.962

10.  The SILVA ribosomal RNA gene database project: improved data processing and web-based tools.

Authors:  Christian Quast; Elmar Pruesse; Pelin Yilmaz; Jan Gerken; Timmy Schweer; Pablo Yarza; Jörg Peplies; Frank Oliver Glöckner
Journal:  Nucleic Acids Res       Date:  2012-11-28       Impact factor: 16.971

View more
  1 in total

Review 1.  Smoking-induced microbial dysbiosis in health and disease.

Authors:  Hagit Shapiro; Kim Goldenberg; Karina Ratiner; Eran Elinav
Journal:  Clin Sci (Lond)       Date:  2022-09-30       Impact factor: 6.876

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.