Blood-based biomarkers, such as carcinoembryonic antigen (CEA), and saliva-based biomarkers, such as mRNA, have emerged as potential liquid biopsies for non-invasive detection of many cancers. However, current tests typically use single type of biomarkers, and their sensitivity and specificity is often unsatisfactory.In this study, we developed a novel biomarker panel that measures both CEA level in blood and GREB1 and FRS2 levels in saliva to achieve high sensitivity and high specificity in detecting Non-Small Cell Lung Cancer (NSCLC).In the discovery phase, we achieved sensitivity of 96.67% and specificity of 93.33% for 30 NSCLC patients and 30 healthy controls. To further evaluate the prediction performance of our biomarker panel, we applied it to an independent set of 15 NSCLC cancer patients and 25 healthy controls. The sensitivity and specificity of our test reached 93.33% and 80.00% respectively.Our study discovered that the combined analysis of CEA and mRNA can be a novel liquid-biopsy technology for non-invasive detection of NSCLC.
Blood-based biomarkers, such as carcinoembryonic antigen (CEA), and saliva-based biomarkers, such as mRNA, have emerged as potential liquid biopsies for non-invasive detection of many cancers. However, current tests typically use single type of biomarkers, and their sensitivity and specificity is often unsatisfactory.In this study, we developed a novel biomarker panel that measures both CEA level in blood and GREB1 and FRS2 levels in saliva to achieve high sensitivity and high specificity in detecting Non-Small Cell Lung Cancer (NSCLC).In the discovery phase, we achieved sensitivity of 96.67% and specificity of 93.33% for 30 NSCLC patients and 30 healthy controls. To further evaluate the prediction performance of our biomarker panel, we applied it to an independent set of 15 NSCLC cancer patients and 25 healthy controls. The sensitivity and specificity of our test reached 93.33% and 80.00% respectively.Our study discovered that the combined analysis of CEA and mRNA can be a novel liquid-biopsy technology for non-invasive detection of NSCLC.
Lung cancer is the leading cause of cancer death in both sexes combined and in men in the world.[ In China, lung cancer is the leading cause of cancer death among both men and women.[ The primary reason for lung cancer is smoking, which accounts for 90% of all cases.[ In China, cigarette consumption has continued to increase since 1960s, and seen a marked increase since 2000.[ The main cause for the high death rate of lung cancer is that early detection can be very difficult. In the early stage, the 5-year survival rate of lung cancer is 56%, but dramatically drops to 5% in the late stage.[Carcinoembryonic antigen (CEA) is one of the most commonly used cancer biomarkers in blood test for detection of disease recurrence, and response or progression in metastatic disease.[ Since it is measured in whole peripheral blood, CEA can be used as a non-invasive prognostic biomarker to monitor cancer progression.[ However, CEA tend to have a high false positive rate, in which healthy individuals might receive positive results due to benign lung diseases (e.g., tuberculosis). Hence, CEA cannot serve as a sole predictor of cancer detection.[Transcriptomic salivary biomarkers have been recently studied as potential non-invasive biomarkers for detection of lung cancer. Five mRNA biomarkers, namely CCNI, FGF19, GREB1, FRS2, EGFR, have been suggested for discrimination of lung cancer patients from healthy controls.[ Using the aforementioned combination of mRNA biomarkers, Zhang et al achieved the sensitivity of 93.75% and specificity of 82.81% in their pre-validation sample set of 32 patients and 64 healthy controls. However, most clinical applications of salivary RNA biomarkers were currently conducted in populations of the United States.[ Whether or not these findings could be suitable for lung cancer detection in Chinese populations remain largely unknown.The goal of this study is to explore the possibility of combining a panel of salivary RNA biomarkers with CEA detection in blood to detect non-small cell lung cancer (NSCLC) in a Chinese population. To this end, a total of 30 NSCLC patients and 30 healthy controls were recruited in the initial discovery phase. The expression levels of CCNI, EGFR, FGF19, FRS2, GREB1 in saliva and the CEA level in blood were analyzed. The sensitivity and specificity of our combined analysis of RNAs and CEA reached 96.67% (29/30) and 93.33% (28/30), respectively. In a subsequent validation phase that included 15 NSCLC patients and 25 healthy controls, the sensitivity and specificity of our method remained as high as 93.33% (14/15) and 80.00% (20/25). Our study discovered that by combining the analysis of RNAs in saliva and CEA in blood, accurate and non-invasive detection of NSCLC in Chinese population could be indeed achieved. To our best knowledge, it is the first time that multiple biomarkers from saliva and blood were used for lung cancer detection in Chinese populations.
Materials and methods
Patient cohort
In the discovery phase, we recruited totally 30 NSCLC patients and 30 NSCLC-negative people (i.e., healthy controls) between June 3, 2017 and March 11, 2018. In the independent clinical validation phase, we recruited totally 15 NSCLC patients and 25 NSCLC-negative people (i.e., healthy controls) between April 6, 2018 and January 11, 2019. All the patients and healthy controls were recruited from Rizhao City Traditional Chinese Medicine Hospital (Rizhao, China). Written informed consents and questionnaire data sheets were obtained from all patients who agreed to serve as saliva and blood donors. The information on patient characteristics, such as age and sex, is presented in Table 1.
Table 1
Demographic information of all subjects used in this study.
Demographic information of all subjects used in this study.This study was approved by the institutional review board at the Rizhao City Traditional Chinese Medicine Hospital and conducted in adherence to the Declaration of Helsinki Principles. Inclusion criteria of NSCLC patients consisted of a confirmed diagnosis of NSCLC via histology or cytology. Exclusion criteria of NSCLC patients included chemotherapy/radiotherapy within 1 month, a diagnosis of other malignancies within 5 years prior to blood collection, clotted, or hemolysis patient samples.
Sample collection
Both whole peripheral blood samples and unstimulated whole saliva samples were blindly collected following a standard procedure between 08:00 and 09:00 in the morning from NSCLC patients and healthy controls. All blood samples (2 mL) were collected using tubes without anti-coagulant for CEA analysis. Serum was separated in 30 minutes after collection, divided into 4 aliquots, and stored at –25 °C until assays. All saliva samples (2 mL) were collect at least 1 hour after the subjects were asked to refrain from eating, drinking, smoking, or oral hygiene procedures. Saliva samples were kept on ice during collection and were then centrifuged at 13,000 × g for 10 minutes at 4 °C. For preservation of salivary RNA, the supernatant was removed from the pellet and treated with RNase inhibitor (Superase-In, Ambion Inc., Austin, TX) and stored at –80 °C prior to assay. The samples in the validation phase were stored and measured in a different batch from that of discovery phase.
Analysis of CEA level in blood samples
CEA concentrations were determined by microparticle enzyme immunoassays using ARCHITECT CEA kit (Abbott, Ireland) and ARCHITECT i System (Abbott, Ireland) following the manual. Briefly, serum samples and anti-CEA coated paramagnetic microparticles were combined. CEA present in the sample bound to the anti-CEA coated microparticles. After washing, anti-CEA acridinium-labeled conjugate was added in the next step. Pre-Trigger and Trigger Solutions were then added to the reaction mixture; the resulting chemiluminescent reaction was measured as relative light units (RLUs). A direct relationship existed between the amount of CEA in the sample and the RLUs detected by the ARCHITECT i optical system.
Analysis of RNA in saliva samples
We followed a protocol that was previously published for extracting salivary RNA.[ In brief, the supernatant of 330 μL saliva supernatant was used and processed. Extraction of saliva RNA was conducted by using RNeasy Protect Saliva Mini Kit (Qiagen, Germany) as instructed by the product manual. After extraction, RNA samples were stored at –80 °C and analyzed within 12 hours of sample preparation. Extracted RNA was linearly amplified with TURBO DNase treatment. Quantitative real-time PCR (qPCR) analysis was conducted to detect the expression levels of selected genes in saliva samples. Amplification was performed using Roche LightCycler 480 (Roche, Switzerland) in 20 μL reactions containing 2 μL cDNA, the primers (Table 2), the probe, and the reaction mix with qPCR. Glyceraldehyde 3-phosphate dehydrogenase (GAPDH) was used as an internal control. The primers used in qPCR analysis were listed in Table 2. Raw Ct data were normalized by subtracting GAPDH Ct values from the biomarker Ct values to generate ΔCt. We next followed a reported method[ to calculate the relative gene expression in various saliva samples: Relative Gene Expression = 2(–ΔCt) × K, where K was chosen as 1000 in our study.
Table 2
Primers used in this study.
Primers used in this study.
Statistical analysis
CEA level and expression levels of candidate genes were compared using the multiple t test. We considered P < .05 as statistically significant. The receiver operating characteristic (ROC) curve and the area under the curve (AUC) value were calculated, using MedCalc software (MedCalc Software, Ostend, Belgium) and MATLAB software (The MathWorks, Inc., Natick, MA).
Machine learning for sample classification
We selected decision tree algorithm as our machine learning classifier. We used the “fitctree” and “predict” functions in Statistics and Machine Learning Toolbox as implemented in MATLAB software to classify sample classes in the discovery phase. Leave-one-out cross-validation (LOOCV) was performed to evaluate model performance. The sensitivity, specificity, and accuracy of the discovery phase was calculated using one machine-learning model based on the whole training dataset. After training the classifier, we then applied it to the datasets in the independent clinical validation phase, and used RNA and CEA levels as the features for the classifier to predict the occurrence of NSCLC. The predicted classes of the 15 NSCLC patients and 25 healthy controls were listed in Table 3.
Table 3
Confusion matrix of combined biomarker analysis in an independent clinical validation phase.
Confusion matrix of combined biomarker analysis in an independent clinical validation phase.
Results
Study design
This study contained a discovery phase and a subsequent clinical validation phase (Fig. 1). In the discovery phase, we built a machine-learning model that uses the CEA level and gene expression levels as input to predict the occurrence of NSCLC. In the validation phase, we applied our trained model to a different set of subjects, and used their CEA and gene expression levels to predict if a sample was from a NSCLC patient. We then compared our predictions with pathological classification to evaluate the clinical performance of our method (sensitivity, specificity, and accuracy).
Figure 1
Comparison of CEA levels and RNA expression levels of 5 candiate genes (CCNI, EGFR, FGF19, FRS2, GREB1) between healthy control group (green) and NSCLC patient group (yellow) in the discovery phase (A, B) and validation phase (C, D). In the discovery phase, the healthy control group included 30 normal people; the NSCLC patient group included 30 patients. In the validation phase, the healthy control group included 25 normal people; the NSCLC patient group included 15 patients. ∗ indicates P < .05; ∗∗ indicates P < .01; ∗∗∗ indicates P < .001. CEA = carcinoembryonic antigen, NSCLC = non-small cell lung cancer.
Comparison of CEA levels and RNA expression levels of 5 candiate genes (CCNI, EGFR, FGF19, FRS2, GREB1) between healthy control group (green) and NSCLC patient group (yellow) in the discovery phase (A, B) and validation phase (C, D). In the discovery phase, the healthy control group included 30 normal people; the NSCLC patient group included 30 patients. In the validation phase, the healthy control group included 25 normal people; the NSCLC patient group included 15 patients. ∗ indicates P < .05; ∗∗ indicates P < .01; ∗∗∗ indicates P < .001. CEA = carcinoembryonic antigen, NSCLC = non-small cell lung cancer.
Analysis of CEA levels and gene expression levels
First, the CEA levels of 30 NSCLC patients and 30 healthy controls were measured in the blood samples in the discovery phase. The mean CEA level was 1.408 ng/mL for healthy controls and 9.326 ng/mL for NSCLC patients (Fig. 2A). The difference of the CEA level between healthy controls and NSCLC patients was not significantly different (P = .08), indicating that CEA alone cannot discriminate NSCLC patients from healthy controls.
Figure 2
Schematic diagram of the study design and experimental procedures used in this study.
Schematic diagram of the study design and experimental procedures used in this study.Next, we measured 5 mRNAs biomarkers CCNI, EGFR, FGF19, FRS2, and GREB1, in saliva samples (Fig. 2B). Four biomarkers, namely EGFR, FGF19, FRS2, and GREB1, were significantly up-regulated in NSCLC patients (P < .05). Specifically, FRS2 and GREB1 demonstrated 3.13-fold and 2.06-fold elevated expression in NSCLS patients (P < .001). No significant difference was observed for CCNI expression between NSCLC patients and healthy controls (P = .79).
Machine-learning model building and evaluation
We then used a machine-learning model to predict NSCLC occurrence from clinical characteristics. We plotted receiver operating characteristic (ROC) curve for each candidate biomarker and calculated the area under curve (AUC) value (Fig. 3). In general, an AUC value that is >0.70 suggests a decent performance for segregating clinical positives from negatives. GREB1 and FRS2 were ranked as the top 2 biomarkers, with AUC values reaching 0.84 and 0.79, respectively. The AUC value of CEA as well as the rest biomarkers in saliva was below 0.70, indicating unsatisfactory clinical performance when using these biomarkers alone for discriminating NSCLC patients and healthy controls.
Figure 3
ROC curves of single biomarkers (CCNI, EGFR, FGF19, FRS2, GREB1) in saliva and CEA level in blood in the discovery phase. ROC = receiver operating characteristic curve.
ROC curves of single biomarkers (CCNI, EGFR, FGF19, FRS2, GREB1) in saliva and CEA level in blood in the discovery phase. ROC = receiver operating characteristic curve.Next, we chose 2 biomarkers, CEA and GREB1 as our design parameters. We included CEA because it was commonly use in clinical applications, in spite of its low AUC value (0.62). We used the pairwise data of the design parameters and the corresponding NSCLC occurrence to train a decision-tree algorithm. The model was constructed by following a toolkit developed in MATLAB (i.e., “fitctree” and “predict” in Statistics and Machine Learning Toolbox), which automatically adjusted the nodes and connections of the decision tree to optimize the fitting.[ We found that our model could indeed discriminate NSCLC patients and healthy controls. The sensitivity and specificity for the combination of CEA and GREB1 was 96.67% and 83.33% for NSCLC prediction (Fig. 4).
Figure 4
Performance of different panels of biomarkers in the discovery phase and validation phase. Blue color indicates sensitivity (or Sn), red color indicates specificity (or Sp), and black color indicates accuracy (or Acc). The sensitivity, specificity and accuracy of the discovery phase was calculated using one machine-learning model based on the whole training dataset, and the model performance was evaluated by leave-one-out cross-validation (LOOCV).
Performance of different panels of biomarkers in the discovery phase and validation phase. Blue color indicates sensitivity (or Sn), red color indicates specificity (or Sp), and black color indicates accuracy (or Acc). The sensitivity, specificity and accuracy of the discovery phase was calculated using one machine-learning model based on the whole training dataset, and the model performance was evaluated by leave-one-out cross-validation (LOOCV).Finally, we added FRS2 in the machine-learning model to see if the prediction performance could be further improved (Fig. 4). We determined the mean accuracies by 10 iterations of LOOCV for each case. We found that by including FRS2, the specificity could be further improved to 93.33%. Considering the high sensitivity (96.67%) and high specificity (93.33%) of this 3-parameter model, we decided to use these design parameters, that is, CEA level in blood, and GREB1 and FRS2 expression levels in saliva, in the machine-learning model for the subsequent clinical validation phase.
Validation with independent datasets
To find out if our machine-learning algorithm could be generally applied for NSCLC detection, we performed a validation study in an independent dataset. In this independent validation phase, we performed a blinded study of 15 NSCLC patients and 25 healthy controls (Table 1). The blood and saliva samples were collected from a total of 40 subjects, blinded, and analyzed for CEA levels (Fig. 2C) and gene expression levels (Fig. 2D). For both patients and healthy controls, CEA levels showed no significant difference (P > .05) between the discovery phase and the validation phase. Similarly, the expression levels of CCNI, EGFR, FGF19, FRS2, and GREB1 in the validation phase were not significantly different (P > .05) from those in the discovery phase. In sum, the difference between discovery phase and validation phase was not significant in our study.We next used the machine-learning model that was trained in the discovery phase to predict if a sample from the validation phase was from a NSCLC patient. In total, we successfully predicted 20 out of 25 healthy controls, and 14 out of 15 NSCLC patients (Table 3). Our predictions highly matched with pathological classification, with sensitivity and specificity reaching 93.33% and 80.00% respectively (Table 3). Overall, our study discovered that a novel panel of CEA and mRNA could be applied in clinical diagnosis of NSCLC as a non-invasive method.
Discussion
We discovered that the combined analysis of gene expressions in saliva and CEA in blood could be complementary to the current use of single type of biomarkers to diagnose NSCLC. By combining RNA and CEA analysis, we can extract more unique bio-signature of cancers from higher dimensions and thus increase both sensitivity and specificity. Combined analysis of multiple analytes for non-invasive detection of cancers is gradually becoming a consensus among researchers. For example, Cohen et al[ recently developed CancerSEEK, a blood test that analyzes ctDNA mutations and various protein biomarkers to detect multiple cancers such as ovarian, liver, and lung cancer. Their method achieved over 95% sensitivity in detection of ovary and liver cancers. However, when using CancerSEEK for lung cancer detection, the sensitivity was <60%, while our combined analysis of CEA and RNAs led to a sensitivity of 93.33%. Future studies using larger and prospective cohorts will help define the role of this approach in clinical practice.The key of our method is using a machine-learning modeling approach. Machine learning is advantageous than linear model in solving complex problems in 2 aspects: requiring little a priori knowledge of the system, and capable of resolving complex systems with high non-linearity and multi-dimensionality. Actually, the predictive power of machine learning is now being recognized by biologists, as we have witnessed multiple applications of using machine-learning models to analyze biomarkers from ctDNA,[ ctRNA,[ proteomics,[ and metabolomics.[ Most importantly, in addition of the use of machine-learning, we implemented an independent validation study to evaluate the practical use of our model. Such design is crucial as machine-learning models could face over-fitting problem, that is, a model could over-fit training datasets but suffers low accuracy in other datasets. In this study, our model achieves equally good predictions between the training datasets and the validation datasets, suggesting no over-fitting during model development. It is also worth noticing that the feature selection process used in this study is not entirely data-driven. For example, we included CEA level in our model although its AUC value was low and might not be used as a feature based on machine learning analysis. The reason we included CEA level is that it is commonly used in blood test and has great value in real-world clinical applications. The integration of domain knowledge with machine learning model is important for converting analytical data into clinical insights, which represents an applicable and non-invasive method for lung cancer detection in this study.Several limitations of our study should be acknowledged. First, the sample size of this study (60 subjects in the discovery phase and 40 subjects in the validation phase) is relatively small compared with that required for implementation in the clinical. To plan for clinical trials, we would first need to estimate the correct sample size with sound scientific justification.[ Second, our database included NSCLC patients and healthy controls, but left out subjects with benign lung diseases, such as pneumonia or tuberculosis. To achieve generalization of our findings, we will need to improve our model by including more types of patients. Third, the patient cohort in our study was composed of subjects from China only. To discover the biomarkers that are universal for people of all ethnicities, we might need to collect samples from other races (e.g., Caucasian) or include data from previous work that are based on other races. Fourth, we derived our biomarker panel from the biomarkers suggested by Zhang et al[ (GREB1 and FRS2) to detect NSCLC in China. Both biomarkers were discovered from a database that is primarily Caucasian people, rather than Chinese people. As a result, some region-specific biomarkers that are particularly good for Chinese people might be missing. For example, KRAS, NOTCH1, STRN, and TP53 are potential candidates, because they are classical driver genes for lung cancer.[ We are currently conducting a large-scale clinical test to confirm their discriminatory power in the Chinese population. Finally, smoking, age, and sex were not considered in our model for NSCLC prediction. In particular, smoking is a critical risk factor for lung cancer, and is involved in almost all of the lung cancer cases.[ A more comprehensive clinical test is being conducted to evaluate the role of these factors to test whether or not these factors could further improve the prediction performance of our model.Our study proves that combining CEA and salivary transcriptomic biomarkers can achieve non-invasive detection of NSCLC in a Chinese population. In future, we plan to endeavor our efforts into detection of multiple cancers, and also evaluate the clinical utility of our study in a large population. We expect our method, together with other liquid biopsies, will eventually provide early, painless, and affordable test for all incident cancer types.
Acknowledgments
The authors thank Rizhao City Traditional Chinese Medicine Hospital for providing funding support.
Authors: Lei Zhang; Hua Xiao; Hui Zhou; Silverio Santiago; Jay M Lee; Edward B Garon; Jieping Yang; Ole Brinkmann; Xinmin Yan; David Akin; David Chia; David Elashoff; No-Hee Park; David T W Wong Journal: Cell Mol Life Sci Date: 2012-06-12 Impact factor: 9.261
Authors: Anya Litvak; Andrea Cercek; Neil Segal; Diane Reidy-Lagunes; Zsofia K Stadler; Rona D Yaeger; Nancy E Kemeny; Martin R Weiser; Melissa S Pessin; Leonard Saltz Journal: J Natl Compr Canc Netw Date: 2014-06 Impact factor: 11.908
Authors: Avrum Spira; Jennifer E Beane; Vishal Shah; Katrina Steiling; Gang Liu; Frank Schembri; Sean Gilman; Yves-Martine Dumas; Paul Calner; Paola Sebastiani; Sriram Sridhar; John Beamis; Carla Lamb; Timothy Anderson; Norman Gerry; Joseph Keane; Marc E Lenburg; Jerome S Brody Journal: Nat Med Date: 2007-03-04 Impact factor: 53.440
Authors: Frederico Omar Gleber-Netto; Maha Yakob; Feng Li; Ziding Feng; Jianliang Dai; Huang-Kai Kao; Yu-Liang Chang; Kai-Ping Chang; David T W Wong Journal: Clin Cancer Res Date: 2016-02-04 Impact factor: 12.531
Authors: Antonio Sanchez-Fernandez; Raúl Roncero-Martin; Jose M Moran; Jesus Lavado-García; Luis Manuel Puerto-Parejo; Fidel Lopez-Espuela; Ignacio Aliaga; María Pedrera-Canal Journal: Healthcare (Basel) Date: 2020-06-15