Literature DB >> 28729763

Serum and Plasma Metabolomic Biomarkers for Lung Cancer.

Nishith Kumar1,2, Md Shahjaman1,3, Md Nurul Haque Mollah1, S M Shahinul Islam4, Md Aminul Hoque1.   

Abstract

In drug invention and early disease prediction of lung cancer, metabolomic biomarker detection is very important. Mortality rate can be decreased, if cancer is predicted at the earlier stage. Recent diagnostic techniques for lung cancer are not prognosis diagnostic techniques. However, if we know the name of the metabolites, whose intensity levels are considerably changing between cancer subject and control subject, then it will be easy to early diagnosis the disease as well as to discover the drug. Therefore, in this paper we have identified the influential plasma and serum blood sample metabolites for lung cancer and also identified the biomarkers that will be helpful for early disease prediction as well as for drug invention. To identify the influential metabolites, we considered a parametric and a nonparametric test namely student׳s t-test as parametric and Kruskal-Wallis test as non-parametric test. We also categorized the up-regulated and down-regulated metabolites by the heatmap plot and identified the biomarkers by support vector machine (SVM) classifier and pathway analysis. From our analysis, we got 27 influential (p-value<0.05) metabolites from plasma sample and 13 influential (p-value<0.05) metabolites from serum sample. According to the importance plot through SVM classifier, pathway analysis and correlation network analysis, we declared 4 metabolites (taurine, aspertic acid, glutamine and pyruvic acid) as plasma biomarker and 3 metabolites (aspartic acid, taurine and inosine) as serum biomarker.

Entities:  

Keywords:  Kruskal-Wallis test; Metabolomics; Student t-test; biomarker identification; pathway analysis; support vector machine (SVM)

Year:  2017        PMID: 28729763      PMCID: PMC5512859          DOI: 10.6026/97320630013202

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

Lung cancer is the leading cause of cancer mortality in United States as well as all over the world [1]. In 2012, 1.8 million people were affected by lung cancer and 1.6 million deaths [2] worldwide. This is the most common cause of cancer-related death in men and second most common in women after breast cancer [3]. Early diagnosis of lung cancer can increase the survival rate at 85% [4]. Till now, there are no FDA-approved diagnostic tests available for detecting the existence of lung cancer [5]. However, Due to the development of molecular biology early diagnosis of cancer is possible through metabolomics data analysis. Metabolomics is the powerful high throughput technology based on the entire set of metabolites that provide potential information because it measures and quantify the end product of cellular metabolism. Any disorder of cellular process is revealed with the changes of metabolites level. Therefore, differentially expressed (DE) metabolites identification between normal and cancer patient is very important for early diagnosis of disease as well as metabolomic biomarker discovery. There are several parametric and non-parametric approaches for DE metabolite identification. The well-known parametric tests are student׳s t-test, F-test, significant analysis of microarray (SAM), EBarrays, BRIDGE, LIMMA etc. and non-parametric tests are Wilcoxon signed-rank test, Mann-Whitney test, Kruskal-Wallis test etc. However, biologist often use fold change method for influential metabolite identification. We generally use parametric test if the data follow the normality property, on the contrary, if the dataset contain outliers and non-normality properties then non-parametric test is applicable. Student׳s t-test has been used for different cancer biomarker identification namely - prostate cancer [6], colon carcinoma [7], pancreatic cancer [8], kidney cancer [9], hepatocellular carcinoma [10] etc. Metabolomics dataset often contain outliers due to several steps involves in the data generating processes [11]. Therefore, to identify the DE metabolite for lung cancer, we used both t-test as parametric test and Kruskal-Wallis test as non-parametric test for identifying DE metabolite for lung cancer. We declare those metabolites as DE, which are significant in any of the methods. In this paper, we also classified the up regulated and downregulated metabolites by using cluster heatmap plot [12]. Upregulated and down-regulated metabolites for lung cancer are important to early diagnosis the disease, for drug discovery and biomarker discovery. On the basis of heatmap plot, importance plot (importance score is calculated using SVM classifier with radial basis kernel function), pathway analysis and correlation network plot finally we identified the biomarker metabolites. In this paper, we took plasma and serum blood samples for identifying the significant metabolites and biomarker discovery. Here, we got 27 significant (p-value<0.05) metabolites from plasma sample and 13 significant (p-value<0.05) metabolites from serum sample for lung cancer. According to the importance plot, pathway analysis and metabolomic correlation network analysis, we declared 4 metabolites (taurine, aspertic acid, glutamine and pyruvic acid) as plasma biomarker and 3 metabolites (aspartic acid, taurine and inosine) as serum biomarker for lung cancer. Among these metabolites taurine, aspartic acid and pyruvic acid are up regulated and glutamine and inosine are down regulated in cancer patient.

Methodology

In this paper, we have identified the significant metabolites from plasma and serum samples using student׳s t-test and Kruskal- Wallis test. This test has been implemented by R-software using function t.test and kruskal.test. Heatmap plot and importance plot using SVM have also been implemented by R-software in library gplots and caret respectively. We did pathway analysis by online software MetaboAnalyst 3.0 [13]. The detail description of the analyzed dataset and the significant metabolite identification methods namely student׳s t-test and Kruskal-Wallis test are below.

Student׳s t-test

Let us consider a random sample x11, x12, ... x1 n1 follows normal distribution with mean μ1 and variance σ12 (i.e., N(μ1, σ12)) and another independent random sample x21, x22, ... x2 n2 follows N(μ2, σ22). If we want to test the hypothesis H0: μ1= μ2 Vs H1: μ1 ≠ μ2 The test statistic (for σ12= σ22) is, Here, and The calculated t value is compared with the tabulated t-value with n1 + n2 - 2 degrees of freedom. If σ12 ≠ σ22; the test statistic is, In both cases, the Ho will be rejected at α % level of significance, if the calculated t value is greater than the tabulated t value with n1 + n2 - 2 degrees of freedom and α % level of significance. If X is a metabolomics data matrix that contains two types of samples (e.g., cancer vs. control), then for the ith metabolite xij; j=1,2, . . . n1 is a sample for type-1 (e.g., cancer) with sample size n1 and xik; k=1,2, . . ., n2 , is the sample for type-2(e.g., control) with sample size n2 ; we assume Ho, “the ith metabolite is not differentially expressed between cancer vs. control group”. Usually Ho is rejected if ׳p-value׳ <0.05.

Kruskal-Wallis

This is a non-parametric test was proposed by Kruskal and Wallis [14] and it is used when the data does not satisfy the normality property and contains outliers. The test statistic of Kruskal- Wallis for k groups each of size ni is defined by where, N is the total number and Ri is the sum of the ranks for the i-th sample and we assume Ho is that all k distribution functions are equal.

Dataset Description

The dataset used in this paper was produced by Gas chromatography time of flight mass spectrometry (GC-TOF-MS) using the blood sample of 82 subjects (20 males and 62 females). All samples were collected Among the 82 subjects, 41 blood samples came from the patients with lung cancer and another 41 samples were taken from the individuals without cancer. These blood samples were acquired from the bio-repositories of two institutes (Fred Hutchison Cancer Research Center (FHCRC) and University of California at Davis Medical Center (UCDMC)). All samples were collected with individuals consent and followed the IRB protocols, which was approved by each Institution׳s Institutional Review Board and its aim was to use only for research purposes. Blood samples were collected using EDTA tubes and prepared the samples (serum and plasma) using approved protocols and stored at -80 °C. Raw data of GC-TOFMS were processed using ChromaTOF software (v. 2.32) for peak finding and mass spectral deconvolution. Result files were exported and filtered for consistency using the UC Davis Metabolomics BinBase database. Finally, 158 metabolites were identified as known metabolites. Thus, the dataset contain 82 subjects (41 cancer and 41 control) and 158 metabolites. This dataset had been produced by Oliver Fiehn, whose study ID was ST000392 [5]. We used log2 transformation and auto-scaling to normalize the dataset.

Results and Discussion

To identify the significant metabolites from the plasma and serum blood samples for lung cancer, we used Student׳s t-test and Kruskal -Wallis test, where, p-values were adjusted using Benjamini-Hochberg (BH) procedure. The lists of significant metabolites of plasma and serum samples for lung cancer are given in Table 1. Table 1 shows that 27 significant metabolites were identified from the plasma samples for lung cancer, among those significant metabolites, t-test identified 24 and Kruskal- Wallis test identified 25 significant metabolites (BH adjusted pvalue< 0.05). On the contrary, 13 significant metabolites were identified from the serum samples for lung cancer, among these 13 significant metabolites, t-test identified 12 and Kruskal-Wallis test identified 11 significant metabolites (BH adjusted pvalue< 0.05). Using the identified significant metabolites, we also drew the heatmap plot to classify the up regulated and downregulated metabolites for plasma and serum sample, we also ranked those metabolites according to the importance (importance score is calculated using SVM classifier with radial basis kernel function), which were depicted in Figure 1 and Figure 2 respectively. Furthermore, we analyzed the pathway and drew the correlation network plot to identify the plasma and serum biomarker for lung cancer. Figure 3 and Figure 4 contain the pathway analysis plot and correlation network plot to identify the plasma and serum biomarker for lung cancer. From Figure 3 (a), we got three important metabolomic pathway for plasma biomarker namely (i) alanine, aspartate and glutamate metabolism pathway, (ii) taurine and hypotaurine metabolism pathway and (iii) pyruvate metabolism pathway. Also from Figure 4 (a), we got two important metabolomic pathway for serum biomarker: (i) alanine, aspartate and glutamate metabolism pathway and (ii) taurine and hypotaurine metabolism pathway. According to the importance plot (Figure 1 and Figure 2), pathway analysis and metabolomic correlation network analysis (Figure 3 and Figure 4), we declared 4 metabolites (taurine, aspertic acid, glutamine and pyruvic acid) as plasma biomarker and 3 metabolites (aspartic acid, taurine and inosine) as serum biomarker for lung cancer. Among these metabolites taurine, aspartic acid and pyruvic acid are up regulated and glutamine and inosine are down regulated in cancer patient. This is the dry laboratory based untargeted metabolomics results. To get the final and more accurate results, further analysis could be the wet laboratory experiment for targeted metabolomics analysis.
Table 1

List of significant metabolites of plasma and serum samples for lung cancer.

Significant Metabolite NameKEGG IDRaw p-Value of t-testBH Adjusted p-Value of t-testRaw p-Value of Kruskal- WallisBH Adjusted p-Value of Kruskal- Wallis
Plasma Sample
3-phosphoglycerateC005973.34E-060.000114.31E-060.00013
5-hydroxynorvaline NIST-0.000170.002720.000220.00346
5-methoxytryptamineC056591.28E-076.76E-063.77E-060.00013
adenosine-5-monophosphate-6.93E-121.10E-091.17E-091.85E-07
alpha-ketoglutarate-0.004120.029560.00630.04151
asparagineC001520.000930.009840.002210.01837
aspartic acidC000495.18E-060.000149.29E-060.00021
benzoic acidC001800.001450.013520.004160.02987
citrullineC003270.00060.006980.000360.00433
hypoxanthineC002620.003520.026470.001860.01633
lactic acidC001869.39E-060.000212.30E-050.00045
malic acidC001490.001170.011550.00180.01633
maltoseC002080.000110.001870.000170.00302
maltotrioseC018350.005460.037470.009520.05573
methionine sulfoxide-0.000330.004290.000550.00621
nornicotineC065240.000220.003210.000260.00372
phenolC001468.03E-050.001585.51E-060.00014
phosphoethanolamineC003460.002950.023280.001510.01592
pyrophosphateC000131.20E-089.48E-071.04E-078.25E-06
pyruvic acidC000220.000620.006980.000280.00372
quinic acidC067460.001750.015380.002780.02197
taurineC002451.54E-066.07E-056.99E-073.68E-05
tryptophanC000780.006380.042010.011490.06265
uric acidC003660.002640.021990.003860.02909
glutamineC000640.025370.120590.001640.0162
inosineC002940.008350.052030.005610.03853
lactamide-0.01090.05940.007560.04779
Serum Sample
5-hydroxynorvaline NIST-0.001070.042390.001920.04333
aspartic acidC000494.11E-076.50E-052.10E-060.00016
cholesterolC001870.002850.042860.002860.04509
glutamic acidC000250.003660.048140.005960.07248
hypoxanthineC002620.000150.011971.63E-060.00016
inosineC002940.000540.028620.000390.01562
lactic acidC001860.002630.042860.00180.04333
N-methylalanine-0.001950.042860.003140.04509
nornicotineC065240.002370.042860.004160.05476
phenolC001460.002980.042860.000280.01488
quinic acidC067460.001450.042860.002380.04454
taurineC002450.002160.042860.000530.01682
deoxypentitol-0.016190.127890.002540.04454
Figure 1

Heatmap plot (a), importance plot (b), using the significant metabolites of plasma sample for lung cancer.

Figure 2

Heatmap plot (a), importance plot (b), using the significant metabolites of serum sample for lung cancer.

Figure 3

Plasma biomarker identification for lung cancer using pathway analysis plot (a), alanine, aspartate and glutamate metabolism pathway (b), taurine and hypotaurine metabolism pathway (c), pyruvate metabolism pathway (d) and correlation network plot (e).

Figure 4

Serum biomarker identification for lung cancer using pathway analysis plot(a), alanine, aspartate and glutamate metabolism pathway(b), taurine and hypotaurine metabolism pathway (c), and correlation network plot (d).

Conclusion

We analysed GC-TOF-MS based untargeted metabolomics data of plasma and serum blood samples. Blood samples were collected from 41 lung cancer cases and 41 control subjects to identify the significant metabolites as well as to discover the plasma and serum biomarker for lung cancer. In our analysis, we got 27 significant metabolites (BH adjusted p-value<0.05) from plasma samples and 13 significant metabolites (BH adjusted pvalue< 0.05) for serum samples for lung cancer. We also got 3 important pathway: (i) alanine, aspartate and glutamate metabolism pathway, (ii) taurine and hypotaurine metabolism pathway and (iii) pyruvate metabolism pathway from plasma samples and 2 important pathway: (i) alanine, aspartate and glutamate metabolism pathway and (ii) taurine and hypotaurine metabolism pathway from serum samples for lung cancer. On the basis of the importance plot, pathway analysis and metabolomic correlation network analysis, we declared 4 metabolites (taurine, aspertic acid, glutamine and pyruvic acid) as plasma biomarker and 3 metabolites (aspartic acid, taurine and inosine) as serum biomarker for lung cancer. Among these metabolites taurine, aspartic acid and pyruvic acid are up regulated and glutamine and inosine are down regulated in cancer patient. We think, this analysis could be helpful for targeted metabolomics researcher, who may validate the result by wet laboratory experiment.
  9 in total

1.  Data Fusion in Metabolomics and Proteomics for Biomarker Discovery.

Authors:  Lionel Blanchet; Agnieszka Smolinska
Journal:  Methods Mol Biol       Date:  2016

2.  Cancer statistics, 2014.

Authors:  Rebecca Siegel; Jiemin Ma; Zhaohui Zou; Ahmedin Jemal
Journal:  CA Cancer J Clin       Date:  2014-01-07       Impact factor: 508.702

3.  A comprehensive urinary metabolomic approach for identifying kidney cancerr.

Authors:  Tobias Kind; Vladimir Tolstikov; Oliver Fiehn; Robert H Weiss
Journal:  Anal Biochem       Date:  2007-01-26       Impact factor: 3.365

4.  Metabolomic profiles delineate potential role for sarcosine in prostate cancer progression.

Authors:  Arun Sreekumar; Laila M Poisson; Thekkelnaycke M Rajendiran; Amjad P Khan; Qi Cao; Jindan Yu; Bharathi Laxman; Rohit Mehra; Robert J Lonigro; Yong Li; Mukesh K Nyati; Aarif Ahsan; Shanker Kalyana-Sundaram; Bo Han; Xuhong Cao; Jaeman Byun; Gilbert S Omenn; Debashis Ghosh; Subramaniam Pennathur; Danny C Alexander; Alvin Berger; Jeffrey R Shuster; John T Wei; Sooryanarayana Varambally; Christopher Beecher; Arul M Chinnaiyan
Journal:  Nature       Date:  2009-02-12       Impact factor: 49.962

5.  Metabolomic profiling of human urine in hepatocellular carcinoma patients using gas chromatography/mass spectrometry.

Authors:  Hao Wu; Ruyi Xue; Ling Dong; Taotao Liu; Chunhui Deng; Huazong Zeng; Xizhong Shen
Journal:  Anal Chim Acta       Date:  2009-06-21       Impact factor: 6.558

6.  MetaboAnalyst 3.0--making metabolomics more meaningful.

Authors:  Jianguo Xia; Igor V Sinelnikov; Beomsoo Han; David S Wishart
Journal:  Nucleic Acids Res       Date:  2015-04-20       Impact factor: 16.971

7.  Systemic Metabolomic Changes in Blood Samples of Lung Cancer Patients Identified by Gas Chromatography Time-of-Flight Mass Spectrometry.

Authors:  Suzanne Miyamoto; Sandra L Taylor; Dinesh K Barupal; Ayumu Taguchi; Gert Wohlgemuth; William R Wikoff; Ken Y Yoneda; David R Gandara; Samir M Hanash; Kyoungmi Kim; Oliver Fiehn
Journal:  Metabolites       Date:  2015-04-09

8.  Unboxing cluster heatmaps.

Authors:  Sophie Engle; Sean Whalen; Alark Joshi; Katherine S Pollard
Journal:  BMC Bioinformatics       Date:  2017-02-15       Impact factor: 3.169

9.  Metabolite profiling of human colon carcinoma--deregulation of TCA cycle and amino acid turnover.

Authors:  Carsten Denkert; Jan Budczies; Wilko Weichert; Gert Wohlgemuth; Martin Scholz; Tobias Kind; Silvia Niesporek; Aurelia Noske; Anna Buckendahl; Manfred Dietel; Oliver Fiehn
Journal:  Mol Cancer       Date:  2008-09-18       Impact factor: 27.401

  9 in total
  16 in total

1.  A systematic review on metabolomics-based diagnostic biomarker discovery and validation in pancreatic cancer.

Authors:  Nguyen Phuoc Long; Sang Jun Yoon; Nguyen Hoang Anh; Tran Diem Nghi; Dong Kyu Lim; Yu Jin Hong; Soon-Sun Hong; Sung Won Kwon
Journal:  Metabolomics       Date:  2018-08-10       Impact factor: 4.290

2.  Functional Metabolomics and Chemoproteomics Approaches Reveal Novel Metabolic Targets for Anticancer Therapy.

Authors:  Chang Shao; Wenjie Lu; Haiping Hao; Hui Ye
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 2.622

3.  Flexible Sensing Systems for Cancer Diagnostics.

Authors:  Anne K Brooks; Sudesna Chakravarty; Vamsi K Yadavalli
Journal:  Adv Exp Med Biol       Date:  2022       Impact factor: 3.650

4.  Exploring a Novel Screening Method for Patients with Oral Squamous Cell Carcinoma: A plasma Metabolomics Analysis.

Authors:  Yui Enomoto; Akira Kimoto; Hiroaki Suzuki; Shin Nishiumi; Masaru Yoshida; Takahide Komori
Journal:  Kobe J Med Sci       Date:  2018-09-11

5.  Precise pathological classification of non-small cell lung adenocarcinoma and squamous carcinoma based on an integrated platform of targeted metabolome and lipidome.

Authors:  Peng Cao; Sanlan Wu; Wei Guo; Qilin Zhang; Weijing Gong; Qiang Li; Rui Zhang; Xiaorong Dong; Shuangbing Xu; Yani Liu; Shaojun Shi; Yifei Huang; Yu Zhang
Journal:  Metabolomics       Date:  2021-11-03       Impact factor: 4.290

Review 6.  Metabolomic Profiling for Diagnosis and Prognostication in Surgery: A Scoping Review.

Authors:  Tabassum A Khan; Tyler J Loftus; Amanda C Filiberto; Tezcan Ozrazgat-Baslanti; Matthew M Ruppert; Sabyasachi Bandyopadhyay; Evagelia C Laiakis; Dean J Arnaoutakis; Azra Bihorac
Journal:  Ann Surg       Date:  2021-02-01       Impact factor: 13.787

7.  Scale-free networks in metabolomics.

Authors:  Hema Sekhar Reddy Rajula; Matteo Mauri; Vassilios Fanos
Journal:  Bioinformation       Date:  2018-03-31

8.  Metabolomic profiling of human lung tumor tissues - nucleotide metabolism as a candidate for therapeutic interventions and biomarkers.

Authors:  Paula Moreno; Carla Jiménez-Jiménez; Martín Garrido-Rodríguez; Mónica Calderón-Santiago; Susana Molina; Maribel Lara-Chica; Feliciano Priego-Capote; Ángel Salvatierra; Eduardo Muñoz; Marco A Calzado
Journal:  Mol Oncol       Date:  2018-09-13       Impact factor: 6.603

9.  Metabolomic, transcriptomic and genetic integrative analysis reveals important roles of adenosine diphosphate in haemostasis and platelet activation in non-small-cell lung cancer.

Authors:  Long T Hoang; Clara Domingo-Sabugo; Elizabeth S Starren; Saffron A G Willis-Owen; Deborah J Morris-Rosendahl; Andrew G Nicholson; William O C M Cookson; Miriam F Moffatt
Journal:  Mol Oncol       Date:  2019-09-30       Impact factor: 6.603

10.  A High-Performing Plasma Metabolite Panel for Early-Stage Lung Cancer Detection.

Authors:  Lun Zhang; Jiamin Zheng; Rashid Ahmed; Guoyu Huang; Jennifer Reid; Rupasri Mandal; Andrew Maksymuik; Daniel S Sitar; Paramjit S Tappia; Bram Ramjiawan; Philippe Joubert; Alessandro Russo; Christian D Rolfo; David S Wishart
Journal:  Cancers (Basel)       Date:  2020-03-07       Impact factor: 6.639

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.