Literature DB >> 35706818

Study on plasma amino acids and piperonamide as potential diagnostic biomarkers of non-small cell lung cancer.

Caifa Zhang1,2, Yuanyuan Wang2,3, Yunfeng Cao3,4, Linyang Shi1,2, Ruonan Wang1,2, Ningning Sheng1,2, Qingjun Wang2,3, Zhitu Zhu2,3.   

Abstract

Background: The value of plasma threonine, cysteine, and piperonamide as diagnostic biomarkers for non-small cell lung cancer (NSCLC) has been rarely explored. The lack of a validation set containing confounders is common to most previous metabolomics studies. The purpose of this study was to explore and validate the value of plasma amino acids and piperonamide as diagnostic biomarkers for NSCLC using liquid chromatography-tandem mass spectrometry (LC-MS/MS).
Methods: A total of 250 participants were included in this study, including 167 patients with pathologically confirmed NSCLC and 83 healthy controls (HCs). These participants were divided into training set, validation set 1, and validation set 2 in chronological order and in a certain proportion. The plasma levels of 22 amino acids and 1 piperonamide in these pre-treatment NSCLC patients and HCs were measured by LC-MS/MS. Metabolic biomarkers were identified after multivariate analysis, univariate analysis, receiver operating characteristic (ROC) analysis. Furthermore, these biomarkers and transcriptomic data were subjected to joint pathway analysis.
Results: The area under the ROC curve (AUC) values for threonine, piperonamide, arginine, alanine, cysteine, methionine, and histidine in the integrated data set were 0.911, 0.848, 0.909, 0.869, 0.786, 0.597 and 0.637, respectively. This panel composed of these 7 metabolites showed good diagnostic capability for NSCLC (the AUC of this diagnostic panel in each data set was greater than 0.9). The specificity of this diagnostic panel in validation set 2, which included confounders, was 0.970, similar to that of the other datasets. The presence of confounding factors had little effect on the diagnostic accuracy of this panel. The ROC analysis of this diagnostic panel between all stage I NSCLC patients and HCs showed AUC, sensitivity, and specificity of 1.000, 1.000, and 0.988, respectively. Moreover, PSAT1, SHMT2, AOC3, and MAOB were found to be involved in the metabolism of threonine and cysteine. Conclusions: Plasma amino acids and piperonamide have potential as diagnostic biomarkers in NSCLC. This metabolic biomarker panel appears useful for the diagnosis and screening of NSCLC. In addition, metabolomic and transcriptomic integration pathway analysis may help elucidate the mechanism of NSCLC occurrence and development and even reveal new treatment vulnerabilities. 2022 Translational Cancer Research. All rights reserved.

Entities:  

Keywords:  Non-small cell lung cancer (NSCLC); amino acids; diagnosis; metabolomics; transcriptomics

Year:  2022        PMID: 35706818      PMCID: PMC9189173          DOI: 10.21037/tcr-22-865

Source DB:  PubMed          Journal:  Transl Cancer Res        ISSN: 2218-676X            Impact factor:   0.496


Introduction

Lung cancer is the leading cause of cancer death worldwide, with an estimated 1.79 million individuals dying of lung cancer each year. Among the histological subtypes of lung cancer, approximately 85% of patients are non-small cell lung cancer (NSCLC), which mainly include lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) (1-3). The high mortality associated with lung cancer is attributed in no small part to patients frequently having reached an advanced stage by diagnosis (4). Thus, accurate and early lung cancer diagnosis is critical if we wish to improve statistics on lung cancer survival. However, the commonly used serum biomarkers such as carcinoembryonic antigen (CEA) and cytokeratin 19 fragment (Cyfra21-1) are not ideal for early diagnosis or screening of cancer (5). In recent years, molecular biomarkers such as circulating tumor DNA (ctDNA) in the blood of patients with NSCLC have been extensively studied, but due to its low sensitivity and specificity, ctDNA is ineffective for cancer screening or early diagnosis (6,7). Therefore, the search for highly sensitive and specific NSCLC blood biomarkers with early diagnostic ability has become an urgent research requirement. Metabolomics is a powerful tool with the potential to identify cancer biomarkers. It can systematically measure small molecule metabolites in blood to provide crucial information about cancer status (8). Abnormal metabolite accumulation is closely related to tumorigenesis (9). For instance, the growth of hepatocellular carcinoma depends on the accumulation of branched chain amino acids (10,11). Moreover, the growth of xenograft tumors in mice is affected by serine and glycine (12). This suggests that understanding changes in metabolites such as amino acids in plasma may have implications for cancer therapy and even diagnosis. Numerous studies have explored and demonstrated that amino acids detected by mass spectrometry, a common platform for metabolomics studies, can be used as biomarkers for screening and diagnosis of different tumors. Certain amino acids have exhibited high sensitivity and specificity (13-18). However, the value of plasma threonine, cysteine, and piperonamide (also known as piperine) as diagnostic biomarkers for NSCLC has been rarely explored (19). Moreover, Elevated blood glucose and weight loss in cancer patients may affect amino acid metabolism. Most previous metabolomics studies of cancer did not use samples containing these confounding factors as a validation set, so it was not sufficient to verify the diagnostic value of amino acids as biomarkers. In addition to changes in metabolism or metabolites, another major requirement for cancer cell survival is changes in transcriptional programs. Transcription and metabolism are an inseparable “community”. Metabolic changes can affect gene expression in tumor cells, and the state of gene expression can also regulate metabolic remodeling (16,20). Thus, compared with the analysis based only on the level of metabolism, the combined analysis of metabolism and transcription may better explain the causes of metabolite changes. Several studies have demonstrated that the integration of metabolites and metabolic genes expands the findings of a single omics study (21-23). Unfortunately, the joint pathway analysis of transcriptome data and plasma amino acids has not been discussed in previous NSCLC studies. Here, we used liquid chromatography-tandem mass spectrometry (LC-MS/MS) to analyze the metabolic changes of 22 amino acids and 1 piperonamide in the plasma of patients with NSCLC. In order to validate the reliability of the results, an independent verification set 1, a verification set 2 including confounding factors, and an integrated data set were used for verification. Also, we performed weighted gene co-expression network analysis (WGCNA) on gene expression data downloaded from The Cancer Genome Atlas (TCGA) database. In addition to the above-mentioned independent analysis, a combined pathway analysis of significantly changed metabolites and differentially expressed genes (DEGs) was carried out to increase awareness of changes in these metabolites. The aims and implications of this study were to identify diagnostic biomarkers to improve diagnosis and screening of NSCLC, explore the relationship between candidate metabolites and transcripts to better understand biological processes, and provide new insights into potential molecular mechanisms to help discover unique therapeutic vulnerabilities. We present the following article in accordance with the STARD reporting checklist (available at https://tcr.amegroups.com/article/view/10.21037/tcr-22-865/rc).

Methods

Participants and study design

Plasma samples from 167 NSCLC patients and 83 gender-matched HCs (χ2 test, P=0.273, no gender difference between 167 NSCLCs and 83 HCs) were retrospectively collected at the First Affiliated Hospital of Jinzhou Medical University. Dates of sample collection ranged from 2015 to 2021. This study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). This study was approved by institutional ethics committee of The First Affiliated Hospital of Jinzhou Medical University (No. 202222) and informed consent was taken from all the patients. The inclusion criteria included the following: (I) pathological diagnosis of NSCLC; (II) patients aged 20 to 85 years. The exclusion criteria included the following: (I) patients who had undergone malignant tumor resection surgery, radiotherapy, chemotherapy, targeted therapy, and immunotherapy within 1 year before the study; (II) patients with other primary malignancies; (III) patients with recent severe vomiting or diarrhea; (IV) drug addicts, alcoholics, and pregnant women. This study included a training set (66 NSCLCs and 30 HCs), a validation set 1 (36 NSCLCs and 20 HCs), a validation set 2 (65 NSCLCs and 33 HCs), and an integrated data set (training set + validation set 1 + validation set 2). Patients in the training set, validation set 1, and validation set 2 were NSCLCs from 2015 to 2018, NSCLCs from 2019 to 2021, and NSCLCs with elevated fasting glucose and weight loss from 2015 to 2021, respectively. Healthy controls (HCs) were randomly selected in a ratio of approximately 1 to 2. According to the tumor-node-metastasis (TNM) staging standard in the 8th edition of the American Joint Committee on Cancer (AJCC), the NSCLC staging was determined. The study design of this research work is shown in . The demographic and clinicopathological characteristics of study participants are presented in .
Figure 1

The workflow in the study. NSCLCs, non-small cell lung cancers; HCs, healthy controls; TCGA, The Cancer Genome Atlas; LC-MS/MS, liquid chromatography tandem mass spectrometry; DEGs, differentially expressed genes; WGCNA, weighted gene co-expression network analysis.

Table 1

Demographic and clinicopathologic profiles of the study participants

ParametersTraining setValidation set 1Validation set 2
NSCLC (n=66)HC (n=30)NSCLC (n=36)HC (n=20)NSCLC (n=65)HC (n=33)
Gender
   Male422023154023
   Female24101352510
Age (years)
   Median6039.56040.56134
   Range37–8225–5841–7723–5633–7725–59
TNM stage
   I2088
   II634
   III221018
   IV181535
Histological type
   LUAD442343
   LUSC211321

Since there is 1 lung adenosquamous carcinoma patient and 1 patient with unclear subtype in NSCLC patients, samples from these 2 patients were excluded from the subtype analysis. TNM, tumor-node-metastasis; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; NSCLC, non-small cell lung carcinoma; HC, healthy control.

The workflow in the study. NSCLCs, non-small cell lung cancers; HCs, healthy controls; TCGA, The Cancer Genome Atlas; LC-MS/MS, liquid chromatography tandem mass spectrometry; DEGs, differentially expressed genes; WGCNA, weighted gene co-expression network analysis. Since there is 1 lung adenosquamous carcinoma patient and 1 patient with unclear subtype in NSCLC patients, samples from these 2 patients were excluded from the subtype analysis. TNM, tumor-node-metastasis; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; NSCLC, non-small cell lung carcinoma; HC, healthy control.

Chemicals

High performance liquid chromatographic (HPLC) grade acetonitrile, pure water, and methanol were purchased from Thermo Fisher Scientific (Waltham, MA, USA). Acetyl chloride and 1-Butanol were acquired from Sigma-Aldrich (St. Louis, MO, USA). Internal standard kits were obtained from Cambridge Isotope Laboratories (Tewksbury, MA, USA). They contained 12 amino acid isotope-labeled internal standards. These standards were each dissolved in 1 mL of pure methanol respectively. These standards and methanol were thoroughly mixed and stored at 4 ℃. Working solutions were obtained by 100-fold dilution. The quality control (QC) standards were purchased from Chromsystems (Grafelfing, Germany).

Sample preparation

Fasting peripheral blood samples were collected from all participants in the early morning and stored in vacuum tubes containing heparin. Then, these samples were stored at 4 ℃ and sent to the laboratory for further processing within half an hour. Each piece of dried blood spots (DBS) paper was made into a disc with a diameter of 3 mm using a punch. The disc was placed into a 96-well plate for amino acids and piperonamide extraction. Then, 100 µL of fresh working solution was added to each well of this plate. The plate was centrifuged at 1,500 r/min for 2 minutes following gentle shaking at room temperature for 20 minutes. The new filtrate was collected in a new 96-well plate. Subsequently, 4 empty wells on each plate were randomly selected and 2 low and high concentration QC solutions were separately added. The filtrate and QC solution were dried at 50 ℃ via pure nitrogen. After adding 60 µL of acetyl chloride and 1-butanol mixture (10:90) to each well, the dried sample was incubated at 65 ℃ for 20 minutes to derivatize the metabolites. A second drying was performed on each sample following derivation, as described previously. Finally, the dried samples were fully dissolved in 100 µL of mobile phase solution for LC-MS/MS analysis.

LC-MS/MS analysis

For performing LC-MS/MS analysis, an AB SCIEX 4000 QTrap system (AB SCIEX LLC; Framingham, MA, USA), which was equipped with an electrospray ionization source, and operated under positive scan mode. The detailed parameters were set as described in our previous work (24-26). For each run, 20 µL of the sample was injected and 80% HPLC grade acetonitrile aqueous solution was used for the elution of gradient. The flow rate was initially set to 0.2 mL/min, and then dropped to 0.01 mL/min within 0.08 minutes and remained unchanged for 1.5 minutes. After 1.5 minutes, the flow rate returned to 0.2 mL/min within 0.01 minutes, and remained constant for 0.5 minutes. The pressure of ion source gas 1 and gas 2 was set to 35 psi. The auxiliary gas temperature was maintained at 350 ℃, the ion spray voltage was 4.5 kV, and curtain gas pressure was 20 psi (pounds per square inch).

Transcriptomics study

Data preparation and DEGs screening

The gene expression data were downloaded from TCGA database (https://cancergenome.nih.gov/). A total of 1,037 NSCLC tissues and 108 normal tissues gene expression data were included in this study. The “limma” R package (The R Foundation for Statistical Computing, Vienna, Austria) was applied to analyze the DEGs between NSCLC tissues and normal tissues. The DEGs were screened based on false discovery rate (FDR) <0.05 and |log2fold change (FC)| >2.

WGCNA

To explore the interactions between genes and between genes and clinical traits, WGCNA was performed on 2,470 DEGs. A gene co-expression network was constructed using R package “WGCNA”. Firstly, we performed sample clustering to check for outliers. Secondly, Pearson’s correlation analysis was used to calculate the correlations between genes. After that, we used network topology analysis to determine the optimal soft threshold that can enhance the strong correlations between genes and punish the weak correlations between genes. The expression matrix was converted to obtain a topological overlap matrix (TOM). After the minimum module size was set to 50, gene hierarchical clustering was performed to generate co-expression modules (27). At the same time, module eigengenes (MEs) in each module were also calculated. Finally, we evaluated the associations between ME and clinical traits to determine NSCLC-related modules for subsequent joint pathway analysis (28).

Statistical analysis

Statistical analysis was executed with the software SPSS 25.0 (IBM Corp., Armonk, NY, USA). Mann-Whitney U test was applied to analyze differences in age and metabolite levels between NSCLC and HC groups. Pearson’s chi-squared (χ2) test was used to analyze gender differences between NSCLC and HC groups. A two-tailed P value of <0.05 was considered statistically significant between the two groups. Multivariate analysis was performed on metabolic data by using MetaboAnalyst 5.0 (https://www.metaboanalyst.ca/) (29,30). According to the default on MetaboAnalyst 5.0, missing values will be replaced by 1/5 of minimum positive values of their corresponding variables. Moreover, to assess the predictive power of biomarkers, receiver operating characteristic (ROC) analyses were performed through R v.4.0.2 and GraphPad Prism 9 (GraphPad Software, San Diego, CA, USA).

Results

Participant characteristics

The participant characteristics of the training set, validation set 1, and validation set 2 are displayed in . Among them, validation set 2 included participants with elevated fasting blood glucose (6.18≤ blood glucose ≤12.89) at the time of mass spectrometry analysis and participants with weight loss (0< weight loss ≤10 kg) during the 6 months prior to pathological diagnosis. There was no difference in gender between the NSCLC and HC groups (χ2 test, P=0.774, 0.394 and 0.426 in the training set, validation set 1, and validation set 2, respectively). The median age of the 167 NSCLC patients (61 years) was higher than that of the 83 HCs (38 years) (Mann-Whitney U test, P<0.05). To understand the effect of age on amino acid and piperonamide metabolism, NSCLC patients were grouped according to age (group 1 with median age 51, group 2 with median age 64), and the grouped data were analyzed by multivariate analysis using SIMCA 14.1 (Umetrics, Umeá, Sweden). No significant separation was detected between group 1 and group 2 (Figure S1). Therefore, in terms of our data, age had a low impact on metabolism of these metabolites.

Metabolomics data analysis

Discovery of differential metabolites in training set

The data in the training set were analyzed by MetaboAnalyst 5.0 for multivariate analysis. Sum normalization, cube root transformation, and auto scaling were applied for data normalization. To explore the disparities between the NSCLC and HC groups, principal component analysis (PCA) was first used to describe the clustering behavior of plasma amino acids and piperonamide between the two groups. As depicted in , the clear trend of separation manifested the existence of significant metabolic alterations. Then partial least squares-discriminant analysis (PLS-DA) further supported the different changes in plasma metabolites in the two groups (). Since PLS-DA is a supervised model, a 10-fold cross-validation (CV) was performed to assess whether the model overfitted. The Q2 value of 0.808 indicated that the PLS-DA model has good predictive performance on the training set (). The prediction reliability of the model was verified again by 1,000-times permutation test (P<0.001; Figure S2). After PLS-DA, variance in projection (VIP) analysis was applied to identify 15 important metabolites that contribute to classification (). Subsequently, a univariate analysis was used to determine whether these 15 metabolites changed significantly between the two groups (). Finally, 8 out of 15 metabolites were deemed differential metabolites based on the selection criteria of VIP >1, P<0.05, and FDR <0.05 (21). Compared with the HC group, the contents of threonine, piperonamide, arginine, alanine, and cysteine were significantly lowered in the NSCLC group. The other 3 metabolites including methionine, leucine, and histidine were significantly elevated in the NSCLC group.
Figure 2

Multivariate analysis of NSCLC and HC plasma samples in the training set. (A) PCA of plasma samples for NSCLC and HC; (B) PLS-DA; (C) 10-fold CV of the PLS-DA model. Red * corresponds to the highest Q2, Q2=0.808; (D) the 15 important metabolites that help distinguish NSCLC from HC revealed by VIP analysis. PC, principal component; HC, healthy control; NSCLC, non-small cell lung cancer; VIP, variance in projection; PCA, principal component analysis; PLS-DA, partial least squares-discriminant analysis; CV, cross-validation.

Table 2

Univariate analysis of 15 important metabolites using SPSS 25.0 and R v.4.0.2

Metabolite (μmol/L)NSCLC, mean ± SDHC, mean ± SDP valueFDR
Threonine (Thr)26.30±7.6449.10±10.427.98E-135.99E-12
Piperonamide (Pip)262.17±129.67452.46±71.831.38E-105.16E-10
Arginine (Arg)12.92±8.6030.05±9.534.10E-112.05E-10
Alanine (Ala)165.47±76.54264.58±52.911.13E-093.38E-09
Methionine (Met)19.59±6.8215.71±2.129.00E-051.93E-04
Cysteine (Cys)1.36±0.762.71±1.052.68E-086.71E-08
Leucine (Leu)127.02±41.60108.86±26.580.0240.03
Histidine (His)84.15±50.2753.72±6.940.0140.021
Proline (Pro)450.36±229.70328.46±80.030.0020.003
Tyrosine (Tyr)49.32±14.5646.06±8.320.3850.431
Homocysteine (Hcy)8.45±0.8411.43±0.951.46E-142.19E-13
Serine (Ser)59.42±26.9352.43±7.220.5320.532
Valine (Val)138.66±36.87141.07±28.100.4020.431
Citrulline (Cit)15.24±13.4420.56±3.770.0180.025
Aspartic acid (Asp)38.16±15.0751.16±19.794.23E-047.93E-04

NSCLC, non-small cell lung cancer; HC, healthy control; SD, standard deviation; FDR, false discovery rate.

Multivariate analysis of NSCLC and HC plasma samples in the training set. (A) PCA of plasma samples for NSCLC and HC; (B) PLS-DA; (C) 10-fold CV of the PLS-DA model. Red * corresponds to the highest Q2, Q2=0.808; (D) the 15 important metabolites that help distinguish NSCLC from HC revealed by VIP analysis. PC, principal component; HC, healthy control; NSCLC, non-small cell lung cancer; VIP, variance in projection; PCA, principal component analysis; PLS-DA, partial least squares-discriminant analysis; CV, cross-validation. NSCLC, non-small cell lung cancer; HC, healthy control; SD, standard deviation; FDR, false discovery rate.

Screening of potential diagnostic biomarkers for NSCLC

To screen for potential biomarkers, Boruta algorithm was used to select and shrink differential metabolites. The results showed that threonine, piperonamide, arginine, alanine, cysteine, methionine, and histidine were selected as potential diagnostic biomarkers for NSCLC in the training set (Figure S3). Then, ROC analyses were applied to evaluate the classification performance of these 7 metabolites for NSCLC patients and HCs. As shown in , both boxplots and ROC curves analyzed by GraphPad prism 9 supported that each metabolite had a good diagnostic ability in the training set. Among these potential biomarkers, threonine displayed the highest accuracy in diagnosing NSCLC [area under the ROC curve (AUC) =0.958]. The results suggested that these 7 plasma differential metabolites had the potential to diagnose NSCLC.
Figure 3

Box plot and ROC curve of each potential biomarker in the training set. The ROC curve was plotted using GraphPad Prism 9. HC, healthy control; NSCLC, non-small cell lung cancer; AUC, area under the ROC curve; ROC, receiver operating characteristic.

Box plot and ROC curve of each potential biomarker in the training set. The ROC curve was plotted using GraphPad Prism 9. HC, healthy control; NSCLC, non-small cell lung cancer; AUC, area under the ROC curve; ROC, receiver operating characteristic.

Identification of biomarkers for the diagnosis of NSCLC

To evaluate the accuracy of the above results, validation set 1, validation set 2, and integrated data set (training set + validation set 1 + validation set 2) were applied to verify the performance of these 7 potential biomarkers in the diagnosis of NSCLC. The results of Mann-Whitney U test and ROC analysis showed that the variation trend of threonine, piperonamide, arginine, alanine, and cysteine in the 3 validation sets was consistent with that in the training set, with statistical differences (). Although the AUC values of these 5 markers decreased compared with the training set, the overall diagnostic performance was still optimistic. Notably, methionine and histidine were not statistically significant in validation set 1 and validation set 2. The AUC values of these 2 potential biomarkers also declined in the 3 validation sets, but their P values and FDR values in the integrated data set were both lower than 0.05. We speculated that it might have been caused by the small sample size of validation set 1 and validation set 2. In a study by Klupczynska et al., the AUC values of methionine and histidine were 0.685 and 0.687, respectively, which were slightly higher than the calculation results in each of our validation sets (18). When these 2 biomarkers were incorporated into a diagnostic model containing 12 metabolites by Klupczynska et al., the AUC of this diagnostic model was 0.836, which was higher than the diagnostic performance of a single metabolite. Their research indicated that these 2 metabolites contributed to improving the model’s classification ability. Finally, these 2 metabolites were not excluded from the construction of our panel, and these 7 potential biomarkers were confirmed as NSCLC biomarkers.
Table 3

Results of Mann-Whitney U test and ROC analysis of 7 biomarkers in three different validation sets

Data set (metabolites)NSCLC, mean ± SDHC, mean ± SDP valueFDRAUC (95% CI)Sensitivity (95% CI)Specificity (95% CI)Cutoff
Validation set 1
   Threonine (Thr)33.49±16.4053.17±12.321.10E-052.57E-050.857 (0.757– 0.957)0.750 (0.589–0.863)1.000 (0.839–1.000)37.27
   Piperonamide (Pip)312.82±154.80498.04±96.762.80E-054.90E-050.840 (0.738–0.943)0.667 (0.503–0.798)1.000 (0.839–1.000)365.40
   Arginine (Arg)14.25±11.2930.53±11.403.00E-062.10E-050.879 (0.788–0.971)0.750 (0.589–0.863)1.000 (0.839–1.000)17.40
   Alanine (Ala)181.26±80.25285.03±61.709.00E-062.57E-050.860 (0.759–0.961)0.861 (0.713–0.939)0.850 (0.640–0.948)237.70
   Cysteine (Cys)1.46±0.882.34±1.450.0230.0320.684 (0.541–0.827)0.417 (0.271–0.578)0.950 (0.764–0.997)1.13
   Methionine (Met)15.08±3.9716.85±3.490.090.1050.638 (0.491–0.784)0.528 (0.370–0.680)0.750 (0.531–0.888)14.56
   Histidine (His)74.21±54.0856.56±9.380.1280.1280.624 (0.475– 0.772)0.556 (0.396–0.705)0.900 (0.699–0.982)66.43
Validation set 2
   Threonine (Thr)35.15±51.6447.65±9.421.40E-104.90E-100.898 (0.835–0.960)0.846 (0.739–0.914)0.909 (0.764–0.969)37.66
   Piperonamide (Pip)294.12±166.65406.32±66.421.59E-062.78E-060.798 (0.709–0.887)0.708 (0.588–0.804)1.000 (0.896–1.000)326.90
   Arginine (Arg)12.01±8.2727.70±6.232.17E-111.52E-100.915 (0.859–0.971)0.800 (0.687–0.879)0.970 (0.847–0.998)16.48
   Alanine (Ala)187.04±77.01289.27±53.249.83E-092.29E-080.857 (0.782–0.931)0.828 (0.718–0.901)0.849 (0.691–0.934)247.40
   Cysteine (Cys)1.64±2.202.55±1.234.06E-065.68E-060.786 (0.697–0.874)0.492 (0.375–0.611)1.000 (0.896–1.000)1.04
   Methionine (Met)19.05±8.2916.37±2.660.1550.1550.588 (0.476–0.700)0.569 (0.448–0.682)0.667 (0.496–0.803)16.97
   Histidine (His)87.73±55.5165.94±17.740.0560.0650.619 (0.509–0.728)0.646 (0.525–0.751)0.667 (0.496–0.803)66.14
Integrated data set
   Threonine (Thr)31.29±33.5449.50±10.633.26E-262.28E-250.911 (0.876–0.947)0.844 (0.782–0.892)0.916 (0.836–0.959)37.48
   Piperonamide (Pip)285.53±150.84445.10±83.813.53E-196.18E-190.848 (0.800–0.896)0.713 (0.640–0.776)0.988 (0.935–0.999)327.10
   Arginine (Arg)12.85±9.1129.23±8.896.58E-262.30E-250.909 (0.873–0.945)0.844 (0.782–0.892)0.904 (0.821–0.950)20.34
   Alanine (Ala)177.21±77.68279.32±55.732.48E-215.79E-210.869 (0.825–0.913)0.765 (0.695–0.823)0.904 (0.821–0.950)212.30
   Cysteine (Cys)1.49±1.512.56±1.221.86E-132.60E-130.786 (0.730–0.842)0.425 (0.353–0.501)0.988 (0.935–0.999)1.05
   Methionine (Met)18.41±7.1516.25±2.720.0130.0130.597 (0.527–0.666)0.455 (0.381–0.531)0.795 (0.696–0.868)18.05
   Histidine (His)83.40±53.1459.27±13.834.43E-045.17E-040.637 (0.569–0.704)0.539 (0.463–0.613)0.868 (0.778–0.924)70.15

AUC, sensitivity, and specificity were obtained by ROC analysis on GraphPad Prism 9. Specifically, the sensitivity, specificity and cutoff value are obtained according to the sensitivity, specificity and cutoff value corresponding to the maximum value of the sum of the sensitivity and specificity. ROC, receiver operating characteristic; NSCLC, non-small cell lung cancer; HC, healthy control; SD, standard deviation; FDR, false discovery rate; AUC, area under the ROC curve; CI, confidence interval.

AUC, sensitivity, and specificity were obtained by ROC analysis on GraphPad Prism 9. Specifically, the sensitivity, specificity and cutoff value are obtained according to the sensitivity, specificity and cutoff value corresponding to the maximum value of the sum of the sensitivity and specificity. ROC, receiver operating characteristic; NSCLC, non-small cell lung cancer; HC, healthy control; SD, standard deviation; FDR, false discovery rate; AUC, area under the ROC curve; CI, confidence interval.

Establishment and verification of NSCLC diagnostic panel

Due to the complexity of NSCLC metabolism, a diagnostic panel containing multiple biomarkers could more comprehensively reflect the pathological state of the disease. Therefore, a panel composed of threonine, piperonamide, arginine, alanine, cysteine, methionine, and histidine was established. Next, validations were performed using multiple validation sets including validation set 1, validation set 2, and integrated data set (training set + validation set 1 + validation set 2). Although the diagnostic ability of this panel in the training set was slightly higher than the other 3 validation sets, its overall diagnostic performance was worthy of recognition (). The AUCs of the training set, validation set 1, validation set 2, and integrated data set reached 0.999, 0.965, 0.963, and 0.967, respectively (Table S1). Especially for validation set 2, which contained interference factors, the presence of interference factors had little effect on the detection of these 7 biomarkers, and the specificity was 0.970, similar to that in other data sets. In addition, the linear support vector machine (SVM) algorithm and the random forest algorithm were used on MetaboAnalyst 5.0 to prove the reliability of the above results again (Table S2). Thus, this diagnostic panel was regarded as reliable and would be used for subsequent exploration.
Figure 4

Diagnostic capabilities of this panel in different data sets. The figure was drawn by the “pROC” R package. ROC, receiver operating characteristic.

Diagnostic capabilities of this panel in different data sets. The figure was drawn by the “pROC” R package. ROC, receiver operating characteristic.

The diagnostic ability of this panel for early NSCLC and NSCLC subtypes

We use PLS-DA and ROC analysis to determine whether this panel had diagnostic value for early NSCLC. In the integrated data set, although this panel could not distinguish each stage well, it could distinguish NSCLC at each stage well from HCs (). Then, we conducted a separate ROC analysis on stage I NSCLC patients and HCs, and the results showed that the AUC, sensitivity, and specificity were 1.000, 1.000, and 0.988, respectively (). Thus, this panel showed promise for identifying patients with early NSCLC.
Figure 5

Predictive power of this panel for early NSCLC. (A) PLS-DA analysis between NSCLC stages and HC; (B) ROC analysis between stage I NSCLC patients and HCs using the “pROC” R package. HC, healthy control; AUC, area under the ROC curve; ROC, receiver operating characteristic; NSCLC, non-small cell lung cancer; PLS-DA, partial least squares-discriminant analysis.

Predictive power of this panel for early NSCLC. (A) PLS-DA analysis between NSCLC stages and HC; (B) ROC analysis between stage I NSCLC patients and HCs using the “pROC” R package. HC, healthy control; AUC, area under the ROC curve; ROC, receiver operating characteristic; NSCLC, non-small cell lung cancer; PLS-DA, partial least squares-discriminant analysis. The panel’s diagnostic capabilities for NSCLC subtypes were also explored. As depicted in Figure S4, this panel had similar diagnostic performance for LUAD and LUSC with almost the same AUCs. Relative to the HC group, the AUCs of LUAD and LUSC groups were 0.972 and 0.971, respectively, within the integrated data set.

Correlation analysis of biomarkers in this panel with CEA and Cyfra21-1

The R package “correlation” was applied to explore the association of CEA and Cyfra21-1 with the 7 metabolic biomarkers we identified. As shown in Figure S5, the biomarkers in our panel were not significantly associated with clinically commonly used lung cancer markers CEA and Cyfra21-1.

Transcriptomics data analysis

DEGs screening

The gene expression profiles from TCGA data were imported into R v.4.0.2 to screen DEGs. A total of 17,938 genes were detected in the NSCLC group and the normal group. Then, a volcano map was drawn to show how each gene is distributed (). According to FDR <0.05 and |log2FC| >2, 2,470 of these genes were screened as DEGs, of which 1,810 genes were up-regulated and 660 were downregulated in NSCLC samples.
Figure 6

The identification of 2,470 DEGs and WGCNA of DEGs. (A) Volcano plot. The red dots on the right represent up-regulated genes, and the green dots on the left represent down-regulated genes; (B) specimens clustering and clinical features (tumor and normal correspondence to NSCLC tissues and normal tissues); (C) cluster dendrogram utilized to detect co-expression clusters with corresponding color assignments. Gene clusters in different colors represent different co-expression modules; (D) the relationships between modules and clinical traits. Each row stands for a ME, and each column represents a clinical feature. Each long square contains the P value and correlation. FDR, false discovery rate; ME, module eigengene; DEGs, differentially expressed genes; WGCNA, weighted gene co-expression network analysis; NSCLC, non-small cell lung cancer.

The identification of 2,470 DEGs and WGCNA of DEGs. (A) Volcano plot. The red dots on the right represent up-regulated genes, and the green dots on the left represent down-regulated genes; (B) specimens clustering and clinical features (tumor and normal correspondence to NSCLC tissues and normal tissues); (C) cluster dendrogram utilized to detect co-expression clusters with corresponding color assignments. Gene clusters in different colors represent different co-expression modules; (D) the relationships between modules and clinical traits. Each row stands for a ME, and each column represents a clinical feature. Each long square contains the P value and correlation. FDR, false discovery rate; ME, module eigengene; DEGs, differentially expressed genes; WGCNA, weighted gene co-expression network analysis; NSCLC, non-small cell lung cancer.

Module formation and its correlations with clinical traits

We performed WGCNA analysis on the selected 2,470 DEGs. As shown in , no outliers were found after clustering the samples. The soft threshold was chosen to be 6, which complied with the scale-free network rules (Figure S6A). After converting to TOM, we got 7 modules through gene clustering (). Next, to determine the genes related to the occurrence of NSCLC, we calculated the ME. It was found that the module most relevant to the occurrence of NSCLC was the turquoise module, which covered a total of 893 dysregulated genes (Table S3, ). Figure S6B shows the correlation between each gene in the module and clinical features (the correlation coefficient =0.82; P<1e-200). Therefore, the turquoise module was thought to be the NSCLC-related module and would be used in the next analysis.

Joint pathway enrichment analysis and critical pathway extraction

In order to explore how metabolic biomarkers and genes interact, a joint pathway enrichment analysis was performed through MetaboAnalyst 5.0. The 893 DEGs in the turquoise module and the 7 biomarkers in the diagnostic panel were input to this network analysis platform. As depicted in Figure S7, a total of 5 metabolism pathways related to metabolic genes were significantly altered in the NSCLC samples (P<0.05). These strikingly disturbed metabolic pathways included glycine, serine and threonine metabolism, aminoacyl-tRNA biosynthesis, nitrogen metabolism, cysteine and methionine metabolism, and thiamine metabolism. When we explored these disordered metabolic processes through Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis, several important metabolic pathways were found to be regulated by 4 DEGs (PSAT1, SHMT2, AOC3, and MAOB). Then, the pathways were extracted and displayed in . Figure S8 shows the significant differences in expression of these 4 genes between NSCLC and HCs (all P<0.0001).
Figure 7

Mapping of important pathways based on the results of metabolome and transcriptome integrated pathway analysis. The orange ellipse and the purple-red rounded rectangle contain the amino acid name and gene name, respectively.

Mapping of important pathways based on the results of metabolome and transcriptome integrated pathway analysis. The orange ellipse and the purple-red rounded rectangle contain the amino acid name and gene name, respectively.

Discussion

In this study, LC-MS/MS technology was used to determine the potential of amino acids and piperonamide as diagnostic biomarkers for NSCLC. The transcriptional data were mined to increase our knowledge of these metabolite changes. Our findings indicated that NSCLC patients exhibited different plasma levels of threonine, piperonamide, arginine, alanine, cysteine, methionine, and histidine as compared with HCs ( and ). As can be seen from , the panel composed of these 7 metabolites showed a promising diagnostic ability for NSCLC patients at various stages, including the early stage. The results of Tables S1,S2 showed that elevated fasting blood glucose (6.18≤ blood glucose ≤12.89) and weight loss (0< weight loss ≤10 kg) in NSCLC patients will not cause much interference in the detection of 7 biomarkers. We also revealed that these biomarkers do not differ much between subtypes, since the panel composed of these biomarkers had similar ability to diagnose NSCLC subtypes (Figure S4). The combined pathway enrichment analysis of 7 biomarkers and 893 DEGs revealed disruptions in glycine, serine, and threonine metabolism, aminoacyl-tRNA biosynthesis, nitrogen metabolism, cysteine and methionine metabolism, and thiamine metabolism (Figure S7). Finally, we found that 4 DEGs (PSAT1, SHMT2, AOC3, and MAOB) were involved in the metabolism of threonine and cysteine (). Threonine, as an essential amino acid, is an important biologically active molecule that has a significant impact on regulation of immune function (31-34). In vitro, lymphocytes can use threonine for their own proliferation and antibody secretion (32). In addition to the proliferation of lymphocytes and the secretion of antibodies, threonine is also related to the regulation of the expression of inflammatory cytokines. When fish are infected with bacteria, threonine deficiency will promote the expression of pro-inflammatory cytokine genes and inhibit the expression of anti-inflammatory cytokine genes to aggravate the inflammatory response (33,34). In NSCLC patients, we observed the decrease in plasma threonine concentration ( and ). A study of plasma amino acids in lung cancer patients also revealed significant changes in threonine (35). In this plasma metabolome study involving 142 Korean participants, threonine was decreased in cancer patients when compared to the control group, similar to our findings. Therefore, NSCLC cells may heavily consume threonine to establish an “immune comfort zone”. Since metabolism and transcription were inextricably linked (16,20), genes changed when threonine was lowered. As shown in and Figure S8, the accumulation of threonine may be caused by the up-regulation of SHMT2 and down-regulation of AOC3 and MAOB. However, the accumulation of threonine may be insignificant and does not significantly increase plasma threonine levels in NSCLC patients. Immune function will not be significantly improved. Cysteine, a sulfur-containing amino acid, is produced by another sulfur-containing amino acid, methionine. Cysteine plays an important role in the process of cancer metabolism remodeling. It serves as a substrate for the production of hydrogen sulfide (H2S), which provides energy for the mitochondrial electron transport chain to stimulate the bioenergetics of cancer cells (36). In our study, reduced level of cysteine was indicated as a plasma metabolic signature of NSCLC patients ( and ). This abnormality of cysteine was also found in the serum of non-smoking female NSCLC patients (37). High concentrations of cysteine in the blood have been associated with a reduced risk of gastrointestinal cancers, as reported by Miller et al. and Murphy et al. (38,39). Thus, it can be assumed that the drop in blood cysteine levels is a common feature of many types of cancer, not just NSCLC. The downregulation of cysteine in plasma may be related to the production of H2S. In order to continuously utilize H2S, PSAT1 may be upregulated by cancer cells to partially replenish cysteine depletion ( and Figure S8). Arginine is an amino acid with anti-tumor potential that deserves special attention; arginine was down-regulated in the plasma samples of our NSCLC patients. Inconsistent with our results, serum arginine was elevated in patients with early NSCLC in the study by Klupczynska et al. (40). Although we performed a univariate analysis of early-stage NSCLC patients, arginine levels remained inconsistent. This difference may be due to different research methods. However, Kim et al. observed consistent results with reduced plasma arginine levels in lung cancer patients (35). This phenomenon of blood arginine drop in patients with lung cancer may be mainly attributed to increased uptake and utilization of circulating arginine by cancer cells and enhanced decomposition of intracellular arginine by cancer cells. This seems contradictory, but it may be because cancer cells want to meet both energy requirements and escape immune attack. Autophagy helps feed the nutrients needed for the survival of cancer cells (41). Blood is the source of nutrients for tumor cells. Poillet-Perez et al. reported that autophagy maintains the growth of tumor through arginine in the blood (42). Increasing intracellular arginine levels not only improved T cells survival capacity but also enhanced their anti-tumor activity in vivo (43). However, tumor cells suppress anti-tumor immunity by catabolizing intracellular arginine (44). The metabolism of arginine in NSCLC is complicated, and it is still necessary to study its value as a biomarker and anti-tumor medicine. Another amino acid that was significantly reduced in the plasma of NSCLC patients was alanine, as previously reported by different authors (35,40,45). The decrease of serum alanine concentration in breast cancer patients was reported by Eniu et al. (46). A study of pancreatic duct adenocarcinoma (PDAC) found that PDAC cells use the SLC38A2 transporter to supplement the nutrient alanine (47). The low availability of alanine in different tumor types can be explained by the consumption of alanine for cancer cell proliferation. The other 2 controversial amino acids are histidine and methionine, both of which were increased in our NSCLC samples. However, these 2 amino acids were decreased in the cancer group in other studies (18,35,46,48). The reason for this difference may be the small sample size of the cancer group in our study or the different research methods. Piperonamide, also known as piperine, has been shown to inhibit the proliferation and migration of cancer cells (19,49). In our study, piperonamide level was significantly lower in patients with NSCLC compared to HCs. This may be due to excessive degradation or reduced synthesis of piperonamide. Therefore, abnormal piperonamide metabolism can be involved in the pathogenesis of NSCLC. After validation on multiple validation sets, we finally confirmed that these 7 metabolites have value as biomarkers for diagnosing NSCLC. The diagnostic panel composed of these 7 metabolic biomarkers had a good ability to classify NSCLC and HC, and also had the ability of screening or early diagnosis. Also, elevated fasting blood glucose and weight loss interfered little with its predictions. Finally, the integrated pathway analysis of metabolomics and transcriptomics provided more information for our understanding of changes in certain metabolic biomarkers. However, several limitations existed in the current research. First, the sample size in this study was small, especially for patients with stage I NSCLC. Second, there was no information on blood glucose and weight changes in HCs, but the design of comparisons between multiple groups of NSCLCs and HCs could compensate for the lack of information. There was also no information on collection time for HC samples. Third, the metabolism and transcription data were from different populations. Fourth, the sample types of metabolomics and transcriptomics analysis were different: the former was plasma and the latter was tissue. These may have biased the analysis, so solving the above limitations is the goal of our future research.

Conclusions

In summary, plasma threonine, piperonamide, arginine, alanine, cysteine, methionine, and histidine have the potential to serve as diagnostic biomarkers for NSCLC. The panel constructed from them performed satisfactorily in differentiating NSCLC patients from HCs. These findings may help improve NSCLC diagnosis and screening. Furthermore, the supplement of transcriptomic data improved our understanding of the changes in several of these metabolites. This has implications for studying carcinogenesis mechanisms and discovering possible therapeutic opportunities.
  49 in total

Review 1.  The biology and management of non-small cell lung cancer.

Authors:  Roy S Herbst; Daniel Morgensztern; Chris Boshoff
Journal:  Nature       Date:  2018-01-24       Impact factor: 49.962

2.  Evaluation of serum amino acid profiles' utility in non-small cell lung cancer detection in Polish population.

Authors:  Agnieszka Klupczynska; Paweł Dereziński; Wojciech Dyszkiewicz; Krystian Pawlak; Mariusz Kasprzyk; Zenon J Kokot
Journal:  Lung Cancer       Date:  2016-05-07       Impact factor: 5.705

3.  GC-MS-based metabolomics reveals new biomarkers to assist the differentiation of prostate cancer and benign prostatic hyperplasia.

Authors:  Wenyu Wang; Zhuoru He; Yu Kong; Zhongqiu Liu; Lingzhi Gong
Journal:  Clin Chim Acta       Date:  2021-04-05       Impact factor: 3.786

Review 4.  Piperine as a Potential Anti-cancer Agent: A Review on Preclinical Studies.

Authors:  Azadeh Manayi; Seyed Mohammad Nabavi; William N Setzer; Samineh Jafari
Journal:  Curr Med Chem       Date:  2018       Impact factor: 4.530

5.  Homocysteine, cysteine, and risk of incident colorectal cancer in the Women's Health Initiative observational cohort.

Authors:  Joshua W Miller; Shirley A A Beresford; Marian L Neuhouser; Ting-Yuan David Cheng; Xiaoling Song; Elissa C Brown; Yingye Zheng; Beatriz Rodriguez; Ralph Green; Cornelia M Ulrich
Journal:  Am J Clin Nutr       Date:  2013-02-20       Impact factor: 7.045

6.  High-resolution metabolomic biomarkers for lung cancer diagnosis and prognosis.

Authors:  Shi-Ang Qi; Qian Wu; Zhenpu Chen; Wei Zhang; Yongchun Zhou; Kaining Mao; Jia Li; Yuanyuan Li; Jie Chen; Youguang Huang; Yunchao Huang
Journal:  Sci Rep       Date:  2021-06-03       Impact factor: 4.379

7.  Study of early stage non-small-cell lung cancer using Orbitrap-based global serum metabolomics.

Authors:  Agnieszka Klupczynska; Paweł Dereziński; Timothy J Garrett; Vanessa Y Rubio; Wojciech Dyszkiewicz; Mariusz Kasprzyk; Zenon J Kokot
Journal:  J Cancer Res Clin Oncol       Date:  2017-02-06       Impact factor: 4.553

8.  Integrative Metabolomic and Transcriptomic Analysis for the Study of Bladder Cancer.

Authors:  Alba Loras; Cristian Suárez-Cabrera; M Carmen Martínez-Bisbal; Guillermo Quintás; Jesús M Paramio; Ramón Martínez-Máñez; Salvador Gil; José Luis Ruiz-Cerdá
Journal:  Cancers (Basel)       Date:  2019-05-16       Impact factor: 6.639

9.  Novel Biomarkers Associated With Progression and Prognosis of Bladder Cancer Identified by Co-expression Analysis.

Authors:  Yejinpeng Wang; Liang Chen; Lingao Ju; Kaiyu Qian; Xuefeng Liu; Xinghuan Wang; Yu Xiao
Journal:  Front Oncol       Date:  2019-10-11       Impact factor: 6.244

10.  An Integrative Transcriptomic and Metabolomic Study of Lung Function in Children With Asthma.

Authors:  Rachel S Kelly; Bo L Chawes; Kevin Blighe; Yamini V Virkud; Damien C Croteau-Chonka; Michael J McGeachie; Clary B Clish; Kevin Bullock; Juan C Celedón; Scott T Weiss; Jessica A Lasky-Su
Journal:  Chest       Date:  2018-06-13       Impact factor: 9.410

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.