Literature DB >> 26674646

Serum Metabolite Biomarkers Discriminate Healthy Smokers from COPD Smokers.

Qiuying Chen1, Ruba S Deeb2, Yuliang Ma1, Michelle R Staudt2, Ronald G Crystal2, Steven S Gross1.   

Abstract

COPD (chronic obstructive pulmonary disease) is defined by a fixed expiratory airflow obstruction associated with disordered airways and alveolar destruction. COPD is caused by cigarette smoking and is the third greatest cause of mortality in the US. Forced expiratory volume in 1 second (FEV1) is the only validated clinical marker of COPD, but it correlates poorly with clinical features and is not sensitive enough to predict the early onset of disease. Using LC/MS global untargeted metabolite profiling of serum samples from a well-defined cohort of healthy smokers (n = 37), COPD smokers (n = 41) and non-smokers (n = 37), we sought to discover serum metabolic markers with known and/or unknown molecular identities that are associated with early-onset COPD. A total of 1,181 distinct molecular ions were detected in 95% of sera from all study subjects and 23 were found to be differentially-expressed in COPD-smokers vs. healthy-smokers. These 23 putative biomarkers were differentially-correlated with lung function parameters and used to generate a COPD prediction model possessing 87.8% sensitivity and 86.5% specificity. In an independent validation set, this model correctly predicted COPD in 8/10 individuals. These serum biomarkers included myoinositol, glycerophopshoinositol, fumarate, cysteinesulfonic acid, a modified version of fibrinogen peptide B (mFBP), and three doubly-charged peptides with undefined sequence that significantly and positively correlate with mFBP levels. Together, elevated levels of serum mFBP and additional disease-associated biomarkers point to a role for chronic inflammation, thrombosis, and oxidative stress in remodeling of the COPD airways. Serum metabolite biomarkers offer a promising and accessible window for recognition of early-stage COPD.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26674646      PMCID: PMC4682670          DOI: 10.1371/journal.pone.0143937

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Chronic obstructive pulmonary disease (COPD) is a debilitating cigarette smoking-associated disease and the third most common cause of death in the US. The natural history of COPD is marked by episodes of exacerbation, morbidity and ultimately, mortality [1]. Other than O2 inhalation, there are no therapies that diminish mortality of COPD [2]. To promote COPD education and set universal standards of treatment, the Global Initiative for Chronic Obstructive Lung Disease (GOLD) was initiated in 1998, and updated every year, incorporating the latest science for improving the lives of COPD patients around the world [3]. COPD is characterized by progressive restriction to expiratory airflow [4]. The forced expiratory volume in 1 second (FEV1) both defines and characterizes the severity of the disease, and is a well-known predictor of mortality [5]. However, COPD manifests itself with multiple phenotypes that cannot be identified through measurement of lung function alone. The importance of COPD risk assessment, phenotype identification, and early prognosis of exacerbation periods magnify the need for validated biomarkers in COPD [6]. A variety of promising biomarkers have been reported for COPD [7], including blood proteins [8-39], miRNAs [40, 41], and small molecule metabolites [42-50]. These proposed biomarkers are specifically associated with different aspects of COPD pathophysiology. Indeed, inflammatory markers in blood such as increased C-reactive protein [21, 33], fibrinogen [8, 9], IL-6 [34-36], IL-8 [37] and tumor necrosis factor alpha [38] have been linked to exacerbations and mortality, while reduced CC-16 [22] and elevated adiponectin [30, 31, 39] have been linked to lung function deficits and severity of emphysema. Notwithstanding rapidly expanding efforts to identify COPD biomarkers, especially blood proteins, no single marker has been adopted for routine clinical use to date. Radiological markers, such as quantitative CT imaging have been used to define the COPD-related phenotype in smokers [51]. Gene expression [52, 53] and analysis of molecules in exhaled breath [54-56] have also been applied for potential discovery of COPD biomarkers. Most recently, metabolomics-based metabolite profiling has begun to attract attention as a biomarker discovery approach [57, 58]. Indeed, metabolites are the ultimate readout of disease phenotype and accordingly, metabolomics has proven to be a powerful tool for probing complex biological mixtures to discover changes in metabolite levels that arise from drug treatments, gene mutations and disease states. Paige et.al [48] analyzed plasma metabolite profiles from healthy non-smokers, smokers with emphysema, and smokers without emphysema, using ultra high performance liquid chromatography/quadrupole–time-of-flight mass spectrometry (UPLC–QTOF). Based on observed differences in metabolite profiles, 12 unknown target metabolites were used to generate a predictive model that identified emphysema patients with 90% sensitivity [48]. A recent gas chromatography-mass spectrometry (GC-MS) based plasma metabolite profiling was applied to effectively predict the presence of cancer in 10 out of 10 lung cancer patients, 8 out of 8 non-smokers, 8 out of 9 COPD smokers and 5 out of 10 healthy smokers, based on 32 differentially-expressed metabolites across the 4 patient groups [47]. Of these 32 putative metabolite biomarkers, 21 were of unknown molecular identity, but were defined by their accurate mass and chromatographic retention times. In attempt to uncover biomarkers for early-stage COPD, we performed untargeted LC/MS profiling of a cohort of 118 human subjects comprised of healthy smokers, COPD smokers and healthy non-smokers, all with well-characterized clinical lung function parameters. The COPD smokers used in this study were classified as mild, belonging to GOLD I /II disease severity groups [3]. We surveyed relative levels of 1,181 molecular ions that could be measured in 95% of patient serum samples to generate a COPD prediction model with 87.8% sensitivity and 86.5% specificity. Efficacy of the model for predicting COPD was found to be 80% in a follow-on study of a small independent cohort. Differentially-upregulated molecules in COPD smokers vs. healthy smokers included a modified form of fibrinogen peptide B along with 3 additional peptides. To our knowledge, this is the first discovery of significantly elevated (greater than 2-fold) serum peptides in early-stage COPD patients with predictive potential. These results point to an intriguing linkage between thrombosis, cell signaling aberrations and early onset of COPD.

Materials and Methods

Human patient cohort

Serum samples from an initial cohort of 118 human subjects were categorized into 3 groups: healthy nonsmokers (n = 37), healthy smokers (n = 40) and COPD smokers (n = 41). These samples were used in attempt to identify small molecule biomarkers (50–1000 Da) that significantly distinguish COPD smokers from healthy smokers and nonsmokers. The validity of inferred serum COPD biomarkers was further assessed in an independent study cohort comprising healthy nonsmokers (n = 10), healthy smokers (n = 9), COPD smokers (n = 10) and COPD subjects who had quit smoking for > 3 months (n = 19). Serum samples were obtained in the morning following overnight fasting. All subjects were phenotyped with regard to medical history, physical exam, routine blood and urine tests, urine nicotine/cotinine levels, lung function (forced vital capacity (FVC), forced expiratory volume in 1 sec (FEV1), FEV1/FVC, total lung capacity and diffusing capacity and chest high-resolution tomography (HRCT). Serum samples from 3 smokers (2 males and 1 female) were not acquired for LC/MS negative ion mode due to a sample handling error (See LC/MS methods for description). All COPD subjects were categorized as GOLD I and II in terms of severity, according to post-bronchodilator lung function defined by the GOLD classification [3].

Reagents

LC-MS grade acetonitrile (ACN), isopropanol (IPA) and methanol (MeOH) were purchased from Fischer Scientific. High purity deionized water (ddH2O) was filtered from Millipore (18 OMG). OmniTrace glacial acetic acid and ammonium hydroxide were obtained from EMD Chemicals. Ammonium acetate and all other chemicals and standards were obtained from Sigma Aldrich in the best available grade.

Serum metabolite extraction and metabolomic data acquisition

Serum metabolites were extracted by addition to 1 part serum to 20 parts (vol:vol) 70% acetonitrile in ddH2O containing 0.2% ammonium hydroxide. The mixture was briefly vortexed and then centrifuged for 5 min at 16,000 × g to pellet precipitated proteins. An aliquot of the resulting extract (3 μl) was subjected to untargeted metabolite profiling using LC/MS and both positive and negative ion monitoring. The sequence of analyzed samples was randomized to eliminate sample order bias of results. Additionally, to minimize bias that may be introduced by drifting of chromatographic retention times and detected mass measurements with large sample runs, a pooled extract from all serum samples was prepared, stored in single-thaw aliquots, and used as a quality control to correct for potential day-to-day and batch-to-batch LC/MS data drift. One quality control sample was analyzed every 6 study samples and flanking quality control results were used to normalize the group of intervening samples.

LC/MS metabolomics platform for untargeted metabolite profiling

Serum extracts were analyzed by LC/MS as described previously [59, 60] using a platform comprised of an Agilent Model 1200 liquid chromatography system coupled to an Agilent 6230 time-of-flight MS analyzer. Chromatography of metabolites was performed using aqueous normal phase (ANP) gradient separation, on a Diamond Hydride column (Microsolv). Mobile phases consisted of: (A) 50% isopropanol, containing 0.025% acetic acid, and (B) 90% acetonitrile containing 5 mM ammonium acetate. To eliminate the interference of metal ions on the chromatographic peak integrity and electrospray ionization, EDTA was added to the mobile phase at a final concentration of 6 μM. The following gradient was applied: 0–1.0 min, 99%B; 1.0–15.0 min, to 20%B; 15.0 to 29.0, 0%B; 29.1 to 37min, 99%B. Raw data were analyzed using MassHunter Profinder 6.0 and MassProfiler Professional 13.0 software package (Agilent technologies). Unpaired t-tests (p<0.05) with Bonferroni correction for multiple group comparisons and ANOVA were used to determine significant differences between groups.

Differentially expressed metabolite identification

To ascertain the identities of differentially expressed metabolites (P<0.05) among COPD smokers, healthy smokers and healthy non-smokers, LC/MS data were searched against an in-house annotated METLIN Personal Metabolite Database (Agilent Technologies), based on accurate monoisotopic neutral masses (<5 ppm) and chromatographic retention times. A molecular formula generator (MFG) algorithm in MPP was used to generate and score empirical molecular formulae based on a weighted consideration of monoisotopic mass accuracy, isotope abundance ratios, and spacing between isotope peaks. A tentative compound ID was assigned when METLIN and MFG scores concurred for a given candidate molecule. Tentatively assigned molecules were verified based on a match of LC retention times and/or MS/MS fragmentation spectra to that of pure molecule standards in our progressively growing in-house database.

Results

Clinical characteristics of human subjects

Clinical characteristics of the initial study group are summarized in . A total of 118 serum samples were collected from 92 males and 26 females. The nonsmoker group comprised a balanced number of male and female patients, whereas the healthy smoker and COPD smoker groups were dominated by male subjects. Baseline parameters were similar in BMI and ethnicity for all groups. Both healthy smokers and COPD smokers had a comparable cigarette consumption rate and smoking history. COPD subjects on average were older than healthy smokers and non-smokers. 1All values were presented as mean ± standard deviation M = Male, F = Female; His: Hispanic ethnicity, Afr: African ethnicity; Eur: European ethnicity; FEV1: forced expiratory value in 1 second; FVC: functional vital capacity; TLC: total lung capacity; DLCO, diffusing capacity of the lung for carbon monoxide. COHb: carboxylhemoglobin. All lung function parameters are presented as % predicted except for FEV1/FVC which is expressed as % observed. N/A = not applicable. Urine cotinine, the nicotine metabolite, and blood COHb were at comparable levels for healthy smokers and COPD smokers and, as expected, low or undetected in non-smokers. Compared to healthy non-smokers and healthy smokers, COPD smokers had significantly decreased DLCO and FEV1/FVC ratio, the basis for clinical diagnosis of COPD. DLCO and FEV1/FVC were significantly correlated with each other (Pearson correlation coefficient R = 0.62, P < 6.3 x e-14) and readily discriminated healthy smokers from COPD smokers. Total lung capacity showed no difference between COPD smokers and nonsmokers. All COPD smokers were classified as belonging to the least severe, GOLD I and GOLD II disease groups Given that healthy smokers and COPD smokers had an indistinguishable cigarette exposure and smoking history, comparison of plasma metabolites between these groups should allow for discovery of biomarkers with potential to distinguish the groups and predict health outcomes.

Serum metabolite profile of non-smokers, healthy smokers and COPD smokers

COPD is routinely diagnosed based on clinical parameters related to lung airway function (e.g. FEV1/FVC%). However, such clinical parameters do not directly correlate with the overall health status of patients. Moreover, measurements of lung function provide little prognostic information regarding anticipated disease severity. In contrast, circulating serum metabolites offer a potential window into systemic physiology/pathophysiology and the possibility of informative and predictive COPD biomarkers. To compensate for the limited prognostic potential of clinical lung parameters, we sought to identify serum metabolites that may inform on COPD health status and outcomes. Applying an untargeted LC/MS-based metabolomics platform that uses aqueous normal phase chromatographic separation coupled to time-of-flight MS, we successfully quantified the relative levels of 586 positive ions and 635 negative ions in 95% of all serum samples studied (including quality control samples). Combining positive and negative ions, after excluding duplicate features measured in both MS detection modes, 1,181 total aligned molecular features in the range of 50–1,000 Da were quantified in all serum samples (. To account for day-to-day and batch-to-batch LC/MS instrument drift, metabolite stability and other experimental factors that can contribute to systematic error, we normalized individual serum metabolite measurements from all subjects based on levels quantified in flanking quality control samples. This normalization procedure enabled the comparative analysis of data from a relatively large number of clinical samples, quantified on different occasions.

Untargeted metabolite profiling identifies differentially-expressed serum metabolites among COPD smokers, healthy smokers, and non-smokers (P<0.05).

(A) A total of 1,181 aligned molecular features detected as positive and negative ions were quantified in all 115 serum samples by untargeted molecular feature extraction. (B) Heat map from hierarchical clustering analysis, showing 314 differentially-expressed metabolites among COPD smokers, healthy smokers and non-smokers. The relative level of expression of each metabolite is represented by a color coded heat map, where red represents an expression higher than the mean and blue represents an expression below the mean values. (C) Venn Diagram depicts shared metabolite changes among COPD smokers, healthy smokers and non-smokers (p < 0.05). Of these, 62 metabolites were significantly different (P<0.05) between healthy smokers and COPD smokers. And 17 of these 62 metabolites were differentially-expressed in COPD smokers compared to both healthy smokers and non-smokers, but showed no difference between healthy smokers and non-smokers. Unsupervised hierarchical cluster analysis (HCA) revealed a clear and reproducible pattern of within-group metabolite expression similarities and between-group differences ( A Venn diagram, depicted in , contrasts the number of differentially expressed metabolites and shared metabolite changes among healthy non-smokers, healthy smokers and COPD smokers. Of 1,181 features quantified in 95% of samples, 62 metabolites were found to be significantly different in abundance (P<0.05) between healthy smokers and COPD smokers. Of these, 17 metabolites were differentially-expressed (up or down) in COPD smokers compared to both healthy smokers and healthy non-smokers, but also showed no difference between healthy smokers and healthy non-smokers (i.e., metabolites that distinguish COPD smokers from both healthy smokers and healthy non-smokers). Regardless of whether smokers were healthy or had COPD, smoking itself was associated with 107 serum metabolite changes, relative to non-smokers. Among these smoking-associated serum metabolites, 20 were unknowns that were elevated by 1.5-fold or greater in both healthy and COPD smokers, compared to non-smokers (P<0.05, ). These 20 unknowns were defined by their accurate neutral mass and chromatographic retention time on ANP chromatographic matrix. As expected, the abundant nicotine metabolites, cotinine and hydroxy-cotinine, were dramatically increased in serum from both healthy and COPD smoker groups. Additionally, smoking was associated with altered levels of serum phospholipids and bile acids, but the extent of these changes were undistinguishable between healthy smokers and COPD smokers. 1RT = retention time P = p value; S = smokers, NS = nonsmoker, FC = fold-change

COPD-specific serum metabolite changes

The goal of untargeted metabolite profiling in this study was to discover serum metabolites of known and/or unknown molecular identities that distinguish the population of COPD smokers. Such biomarkers should meet the criteria of discriminating between COPD smokers from healthy smokers, but also exhibit no significant differences between healthy smokers and healthy nonsmokers. summarizes the list of metabolites meeting these criteria. Notably, five peptides were uniquely increased in serum from COPD smokers compared to the other groups. These peptides were all detected by positive ion MS as doubly-charged ions. Fragmentation of the unknown peptides by QTOF revealed that one of these was a modified version of the 14 amino acid fibrinogen peptide B (FPB; EGVNDNEEGFFSAR) (), wherein the N-terminal glutamate was observed as pyroglutamate and the C-terminal arginine was deleted. Serum products of fibrinogen turnover have been reported be involved in inflammation and wound healing [61], and elevated plasma fibrinogen has previously been associated with reduced pulmonary function and increased risk of COPD [8, 9]. To our knowledge, this is the first report of N-terminal modified and C-terminal truncated fibrinogen peptide B (mFBP) in early-stage COPD smokers. The amino acid sequences of the other four peptides were not solved, but their relative levels were significantly and positively correlated with each other, and with modified fibrinogen peptide B (Pearson coefficient R ≥ 0.68 and P ≤ 9.3 x e-18; ).

MS/MS fragmentation of a doubly-charged ion ([M+2H] = 697.97 Da) reveals the sequence of peptide as pyroEGVNDNEEGFFSA.

Fragments derived from ion fragmentation at 30 EV match the amino acid sequence of fibrinogen peptide B with truncated C-terminal arginine and N-terminal amino acid modified as pyroglutamate. 1Underlined metabolites were used for generating prediction model. *denotes metabolites showing significant correlation with mFBP (Pearson correlation R ≥ 0.34, P ≤ 2.0e-4) RT = retention; P = p value; FC = fold-change; S = smoker, NS = nonsmokers In addition to peptides, serum myoinositol, dimethyluric acid and N-methylnicotinate were significantly increased in COPD smokers, as compared with both healthy smokers and non-smokers. Serum cysteinsulfonic acid glycerophosphoinositol, phosphatidylinositol (40:7) and creatinine were also modestly elevated in COPD smokers. Two metabolites, 2-methylhippuric acid and 5-acetylamino-6-amino-3-methyluracil, were markedly increased in COPD smokers compared to healthy non-smokers, and also showed a trend of increase in comparison with healthy smokers, with P values approaching significance. In contrast, several hydrophobic unknowns (with chromatographic retention time consistent with fatty acids and lipids) were mildly decreased in COPD. Notably, the COPD cohort in this study had relatively mild GOLD I and II classified disease, and non-peptide serum metabolite changes were found to be modest compared to healthy smokers, but significant. Except for four metabolites that showed a significant correlation with mFBP (denoted by asterisks in ), other non-peptide metabolites were not significantly correlated with mFBP or other observed unsequenced peptides (not shown). Unknown1 (mass 863.40 Da at 8.8 min retention time), creatinine and 2-methylhippurate showed the most significant correlation with mFBP (r = 0.77, P <3.0 x e-23; r = 0.47, P<1.3 x e-7; and r = 0.47, P<2.5 x e-7, respectively). This raises the possibility that COPD-associated changes in non-peptide and peptide metabolites, may have distinct mechanistic implications for COPD pathophysiology.

Correlation of serum metabolites with clinical parameters

The spirometric parameter FEV1/FVC is the classic hallmark for defining and characterizing the severity of COPD, though it does not always correlate well with health status or disease symptoms. Urine continine and hydroxycotinine reflect the extent of cigarette smoking, but do not correlate with the severity of lung function deficits. In order to find a link between lung dysfunction and altered levels of serum metabolites elicited by smoking, we sought to correlate the relative levels of serum metabolites that distinguish COPD smokers from both healthy smokers and non-smokers with the clinical lung function parameters FEV1/FVC, DLCO, TLC and COHb. lists those metabolites showing a significant Pearson correlation coefficient (P<0.05) with at least one of these lung function parameters. Interestingly, five non-peptide metabolites (N-methylnicotinate, myoinositol, dimethylurate, methylhippurate and 5-acetylamino-6-amino-3-methyluracil) were observed to correlate with all four lung function parameters (P<0.05). In contrast, mFBP and all other non-sequenced peptides selectively and strongly correlate with FEV1/FVC and DLCO (P<0.0003), weakly correlated with COHb, and demonstrated no correlation with TLC. Several other non-peptide metabolites strongly correlated with FEV1/FVC, but exhibited no correlation with COHb and TLC. Serum creatinine and cysteinesulfonic acid showed no significant correlation with any of the lung parameters. Overall, serum metabolites demonstrated a heterogeneous correlation with lung function tests.

Serum metabolite prediction model for COPD

A serum metabolite-based prediction model for early early-stage COPD may suggest pathophysiological mechanisms and have predictive power beyond that offered by lung function parameters alone. Accordingly, we sought to identify a predictive serum metabolite signature for GOLD stage I/II COPD. Because non-smokers with no prior exposure to cigarette smoking and no detectable nicotine metabolites are unlikely to develop COPD, non-smokers were excluded from COPD model generation. Considering 23 metabolites that distinguished COPD smokers from healthy smokers (underlined metabolites in ), we conducted a partial least square discriminant analysis (PLS-DA) to classify the COPD smokers and healthy smokers, employing Agilent MassProfiler Professional 13.0 software. The PLS-DA score plot shows a marked separation of COPD smokers from healthy smokers (). The sensitivity of the constructed model was calculated from the ratio of predicted true COPD smokers vs. total COPD smokers, whereas specificity of the model was calculated as the ratio of predicted true healthy smokers to the total number of healthy smokers. Applying this quantitative approach, the classification sensitivity was found to be 36 out of 41 COPD smokers (87.8%), and the classification specificity was 32 out of 37 healthy smokers (86.5%). Based on these findings, the overall accuracy of classification was 87.2%. The receiver operator characteristics (ROC) were obtained by plotting the sensitivity against specificity at different discriminant thresholds. The area under the ROC, commonly used as a diagnostic statistic for PLS-DA models, was 0.87 (). ROC curves were generated using ROC Curve Explorer and Tester (ROCCET, http://www.roccet.ca/ROCCET/), using the same set of data for which Monte Carlo Cross Validation (MCCV) was applied via balanced subsampling. In each MCCV analysis, two thirds of the samples were used to evaluate the feature importance. The 23 differentially expressed COPD serum metabolites were then used to build classification models that were validated based on 1/3 of the samples left out. The procedure was repeated multiple times to calculate the performance and confidence interval for the model.

Characterization of a PLS-DA prediction model for identification COPD smokers, based on 23 serum metabolites.

(A) Three-dimensional score plot of PLS-DA classification model showing the partial separation of COPD smokers from healthy smokers. (B) Diagnostic statistics for the presented PLS-DA model, as area under the receiver operator characteristics curve (AUC). The AUC was 0.87 with a 95% confidence interval of 0.761–0.957 for the constructed model. (C) The location of original between-sum squares and the within-sum squares (B/W-ratio), relative to the distribution histogram of 2000 permuted B/W ratios. The B/W ratio of the original classification assignment (COPD smokers vs. healthy smoker.) was further to the right of the distribution, with a permutation P value less than 5.0 x e-4, affirming the statistical significance and validity of the PLS-DA model. PLS-DA is prone to data over-fitting, and therefore the generated models need to be validated by assessing whether the observed separation is indeed statistically significant, or due to random noise. To eliminate the random chance that the constructed model has discriminating power for COPD smokers vs. healthy smokers, a permutation test was performed using non-proprietary MetaboAnalyst 3.0 software, involving 2,000 permutated class labels–this tested the assumption that there is no difference between COPD smokers and healthy smoker groups. In each permutation, a PLS-DA model was built between the data (X) and the permuted class labels (Y), using the optimal number of components determined by cross-validation of the model based on the original class assignment. The ratio of between-sum squares and the within-sum squares (B/W-ratio) for the prediction of each model was calculated. shows the location of original B/W ratio, relative to the distribution histogram of 2,000 permuted B/W ratios. The B/W ratio of the original classification assignment (COPD smokers vs. healthy smokers) was further away from the right of the distribution with a permutation P value of less than 5.0 x e-4, indicating that the PLS-DA model of COPD- and healthy- smoker classification is statistically significant.

Validation of the COPD classification model

We sought to test the efficacy of the 23 metabolite signature for prediction of early-stage COPD in an independent group of subjects. As shown in S1 Table, applying the 23 serum metabolite model to an independent group of 10 early-stage COPD patients correctly classified these subjects as having COPD in 80% of cases. Further, in a cohort of 19 COPD patients who quit smoking for ≥ 3 months, this classification algorithm recognized the COPD metabotype in 14/19 subjects in accord with their active COPD.

Discussion

In attempt to uncover biomarkers that identify early-stage (GOLD I/II) COPD, we performed untargeted LC/MS profiling in a cohort of 118 healthy smokers, COPD smokers and non-smokers with well-characterized clinical lung function parameters. We report here the identity of circulating serum metabolites as potential COPD biomarkers. To our knowledge, this is the first comprehensive untargeted metabolite profiling for serum markers of COPD, comparing non-smokers, healthy smokers and early-stage COPD smokers. This analysis identified a group of 23 non-peptide and peptide metabolites, some of which were not directly correlated with any lung function parameters, potentially providing pathophysiological information over and above that obtained from clinical testing. Efficacy of this putative signature for early-stage COPD was provided by an independent validation cohort in which COPD was predicted with 80% success. One biomarker identified in serum distinguishing COPD smokers from smokers was a modified form of the fibrinogen-derived peptide, FPB. Fibrinogen is an acute phase plasma protein that has emerged as a promising biomarker in COPD [8, 9, 62, 63]. Elevated levels of plasma fibrinogen have been associated with pulmonary dysfunction, coronary heart disease [63] and hospitalization of COPD patients [9]. In blood, thrombin converts fibrinogen to fibrin in two sequential steps [64], ultimately resulting in equimolar quantities of FPB and fibrin II. Fibrin II is a mediator of thrombosis, polymerizing together with platelets it forms a hemostatic plug or clot over wound sites [65]. The 14-amino acid FPB is susceptible to cleavage by carboxypeptidase in blood, where removal of the C-terminal arginine yields des-arginine FPB. Therefore, measurement of des-arginine FPB indirectly signifies in vivo fibrin II production. Fibrin clot propensity was previously found to be altered in patients with COPD [66]. In accord with this prior finding [66]; our untargeted metabolite profiling revealed that N-terminally modified des-arginine FPB metabolite (pyroEGVNDNEEGFFSA) was elevated in the circulating blood of early-stage COPD subjects, as compared to healthy smokers with a similar smoking history and extent of cigarette exposure. Further, we demonstrated a significant positive correlation between the level of mFBP and the lung function parameters FEV1/FVC% and DLCO in all study subjects. This elevation in mFBP levels suggests that coagulation and thrombogenesis may play a role in COPD smokers. Indeed, quantitative exposure to passive smoke has been shown to positively correlate with blood coagulation activity [67]. The association between acute inflammation, endothelial activation and clotting initiation has been demonstrated in COPD exacerbations [68]. In addition to mFBP, three other unknown peptides were increased in COPD smokers only. Based on the significant correlation with each other and with mFBP, we postulate that these peptides may also belong to the fibrinogen family and /or the thrombotic cascade. Despite a significantly increased level of circulating mFBP and other peptides in COPD patient serum, the presence of peptides alone did not possess predictive power to accurately identify early stage COPD smokers (AUC ≈ 0.75, not shown). Interestingly, the 4 observed peptide biomarkers shared a negative correlation with the lung function parameters FEV1/FVC% and DLCO, but not with the lung parameters TLC and COHb. In contrast, non-peptide small molecule serum metabolites, including myoinositol, dimethylurate, N-methylnicotinate, and 5-acetylamino-6-amino-3-methyluracil were all significantly correlated with each of the 4 lung function parameters obtained in our patient cohort. Among the non-peptide metabolites, myoinositol evidenced the most significant correlation with clinical parameters. Myoinositol is a cyclic sugar alcohol that functions in glycerophospholipid metabolism and inositol phosphate metabolism, also serving as an osmolyte. Moreover, inositol phosphoglycans comprise a major family of intracellular signaling molecules, including the mediation of insulin action [69]. Furthermore, myoinositol has been used as chemoprevention drug in clinical trials, acting to inhibit Akt and extracellular signal-regulated kinase (ERK) in bronchial lesions from heavy smokers [70]. In vitro studies have shown that myoinositol decreases endogenous and tobacco carcinogen-induced activation of Akt and ERK in immortalized human bronchial epithelial cells [70]. In COPD patient’s sera, we observed that both myoinositol and the inositol-derived plasma membrane phospholipid, glycerophosphoinositol [PI (40:7)] was increased compared to healthy smokers and healthy non-smokers, and elevated levels of glycerophosphoinositol lipids were found in healthy smokers compared to healthy non-smokers. These results suggest an altered pattern of phosphatidylinositol metabolism in smokers. Consistent with this observation, smoking-derived toxins have been related to the activation of PI3K signaling pathway [71-73], altering cell growth, proliferation, survival, and motility in COPD smokers. Thus, increased myoinositol in COPD smokers may be the consequence of feedback regulation of activated PI3K. Notably, high serum myoinositol in COPD can suppress the synthesis of phosphatidylglycerol an important surfactant in the alveolar wall which has been show to decrease in COPD and patients with cystic fibrosis [74]. Elevated levels of serum metabolite N-methylnicotinate, 1,7dimethylurate and 5-acetylamino-6-amino-3-methyluracil (AAMU) in COPD subjects were unexpected, but intriguing. These three metabolites are all related to caffeine metabolism. N-methylnicotinate (aka trigonelline) is a pyridine alkaloid and metabolite of caffeine, as well as a phytoestrogen, in coffee beans [75, 76]. 1,7 dimetylureate is a metabolic product of caffeine, whose synthesis is primarily catalyzed by CYP2A6. AAMU is also a major caffeine metabolite, synthesized by N-acetyltransferase 2. Notably, these urinary caffeine metabolites have been used to assess the enzymatic activities of xanthine oxidase (XO) [77-79] and N-acetyltransferase 2 (NAT2) and cigarette smoking has been shown to induce cytochrome P450-mediated alkaloid metabolism [80]. Taken together, these findings suggest an acceleration of caffeine metabolism in COPD smokers, especially for the downstream caffeine metabolites resulting from activated CYP2A6. This is the first untargeted serum profiling finding that points to a potential upregulation of caffeine metabolism in COPD smokers. In summary, our untargeted serum metabolite profiling revealed metabolite biomarkers of early-stage COPD that are indicative of thrombosis, altered cell signaling and metabolic changes. Based on 23 serum metabolites, the prediction of COPD amongst a mixed group of both healthy and COPD smokers had an overall 87% accuracy and was found to be 80% correct when predicting COPD in an independent population of COPD smokers. Combined with clinical parameters, the identified serum markers offer the potential to improve sensitivity and specificity for predicting and monitoring COPD prognosis in cigarette smokers.

Prediction of COPD in subjects with COPD who are either active smokers (COPDact) and who have quit smoking for ≥ 3 months (COPDquit).

(DOCX) Click here for additional data file.
Table 1

Human subjects and associated clinical parameters .

Non-smokersHealthy smokersCOPD smokers
Number of subjects 374041
Gender 19M, 18F35M, 5F38M, 3F
Ethnicity 14 His, 13 Afr, 10 Eur14 His, 11 Afr, 5 Eur9 His, 23 Afr, 8 Eur, 1 mixed
Packs per year n/a21.5 ± 11.330.6 ± 1.7
Packs per day n/a0.9 ± 0.60.8 ± 0.5
Age of initiation n/a15.2 ± 2.916.7 ± 3.3
Age 39.5 ± 13.041.8 ± 10.153.2 ± 7.7
BMI 26.6 ± 5.026.9 ± 4.925.6 ± 3.6
FEV1% 104.0 ± 11.5105.5 ± 11.077.7 ± 18.9
FVC% 108.0 ± 11.3110.5 ± 11.0106.0 ± 16.4
FEV1/FVC% 80.0 ± 5.777.9 ± 4.358.5 ± 7.6
TLC 97.7 ± 15.793.8 ± 9.2101.6 ±14.5
DLCO 90.4 ± 11.487.2 ± 8.568.5 ±14.3
COHb 1.6 ± 0.82.9 ± 1.23.3 ± 1.2
Urine cotinine n/a1687.0 ±1073.41619.5 ±885.1

1All values were presented as mean ± standard deviation

M = Male, F = Female; His: Hispanic ethnicity, Afr: African ethnicity; Eur: European ethnicity; FEV1: forced expiratory value in 1 second; FVC: functional vital capacity; TLC: total lung capacity; DLCO, diffusing capacity of the lung for carbon monoxide. COHb: carboxylhemoglobin. All lung function parameters are presented as % predicted except for FEV1/FVC which is expressed as % observed. N/A = not applicable.

Table 2

Differential metabolites between COPD smoker and healthy smoker compared to non-smokers.

METABOLITEMassRTPFCPFCPFC
S vs NSS vs NSCOPD vs NSCOPD vs NSCOPD vs SCOPD vs S
Cotinine 176.0933.671.54E-1423.275.84E-1824.097.81E-011.04
3-Hydroxycotinine 192.0923.451.99E-134.854.23E-135.512.97E-011.14
Unknown 1 742.230.711.36E-033.362.84E-022.53.54E-010.74
Quinic acid 192.0614.416.98E-032.605.33E-043.771.72E-011.45
lysoPI(20:0) 628.364.473.85E-022.195.14E-022.239.62E-011.02
PI(32:2) 806.5061.81.40E-022.118.52E-031.784.83E-010.84
Glycochenodeoxycholic acid 3-glucuronide 625.3456.531.81E-032.007.43E-022.237.49E-011.11
PI(32:1) 808.5111.846.57E-031.891.90E-021.472.27E-010.78
5-Acetylamino-6-formylamino-3-methyluracil 226.0720.693.85E-031.892.13E-021.421.34E-010.75
Unknown2 600.4511.021.39E-021.834.00E-031.889.00E-011.03
PI(32:0) 810.5161.842.73E-031.811.33E-021.472.30E-010.81
Glycochenodeoxycholate-3-sulfate 529.271.611.63E-021.657.96E-021.79.09E-011.03
PE(35:1) 731.5467.647.91E-031.657.27E-031.464.52E-010.88
Unknown3 200.0140.723.04E-021.614.91E-032.527.60E-021.57
1α,25-dihydroxy-23-thiavitamin D3 434.2961.771.53E-031.541.93E-031.529.34E-010.99
PI(40:4) 914.5831.845.95E-041.542.38E-041.57.92E-010.97
Unknown 602.4681.033.24E-021.521.75E-031.636.95E-011.07
6-methyltetrahydropterin 181.0981.192.58E-051.512.85E-071.83.19E-021.19
PE(33:1) 703.5167.712.78E-021.511.90E-021.437.24E-010.94
PI(34:1) 836.5391.852.67E-031.511.68E-021.32.42E-010.86

1RT = retention time

P = p value; S = smokers, NS = nonsmoker, FC = fold-change

Table 3

Differential metabolites between COPD smokers compared to healthy smokers .

METABOLITEMMassRTPFCPFCPFC
COPD vs SCOPD vs SCOPD vs NSCOPD vs NSS vs NSS vs NS
Fibrinogen peptide B 1395.5598.895.56E-032.417.58E-032.389.60E-010.99
Peptide 1 1464.65410.441.14E-023.281.02E-023.746.17E-011.14
Peptide 2 1308.5479.514.75E-032.448.47E-032.237.29E-010.91
Peptide 3 1049.4649.389.04E-042.511.03E-021.892.40E-010.75
Peptide 4 859.37178.681.12E-022.096.00E-021.652.47E-010.79
1,7 Dimethyluric acid 196.0582.173.72E-021.579.34E-031.84.96E-011.14
Myoinositol 180.0625.613.69E-021.332.93E-041.649.87E-021.23
PI(40:7) 908.5421.844.20E-031.241.32E-031.353.92E-011.09
L-Cysteinesulfonic acid 200.9761.872.33E-021.262.46E-021.238.04E-010.97
* Creatinine 113.0567.523.93E-021.157.08E-021.149.03E-010.99
N-Methylnicotinate 137.04910.895.70E-021.726.31E-032.511.72E-011.46
* Unknown 1 863.4028.76.54E-031.945.00E-021.582.62E-010.82
Unknown 2 627.551.851.72E-021.272.83E-041.556.68E-021.22
Unknown 3 234.0859.611.53E-030.96.77E-030.874.68E-010.97
Unknown 5 935.5137.41.14E-020.811.55E-020.845.68E-010.96
Unknown 6 592.471.618.60E-030.634.28E-020.724.65E-011.14
Unknown 7 922.4971.834.73E-021.161.09E-021.273.88E-011.09
Unknown 8 482.1050.751.90E-021.382.31E-021.399.48E-011.01
Unknown 9 834.5967.192.43E-041.438.84E-021.177.67E-020.82
Fumarate 116.0116.961.00E-030.723.50E-010.941.00E-021.3
3-hydroxyvalerate 118.0611.73.40E-030.54.40E-010.891.00E-021.86
Unknown 10 72.0561.714.32E-030.598.70E-010.989.47E-031.66
Unknown 11 496.3061.055.67E-030.611.27E-010.424.59E-010.70
Unknown 12 576.4761.521.07E-020.682.75E-010.889.43E-021.28
Unknown 13 736.4641.021.05E-030.431.58E-010.213.82E-010.48
Unknown 14 290.0049.285.11E-021.253.10E-011.134.31E-010.90
Unknown 15 225.0680.685.05E-020.662.24E-011.254.59E-031.91
Unknown 16 660.2620.781.00E-020.682.74E-010.851.57E-011.25
* 2-Methylhippuric acid 193.0654.361.02E-011.911.54E-023.81.14E-011.99
* 5-Acetylamino-6-amino-3-methyluracil 198.0723.576.19E-021.493.97E-031.922.45E-011.29

1Underlined metabolites were used for generating prediction model.

*denotes metabolites showing significant correlation with mFBP (Pearson correlation R ≥ 0.34, P ≤ 2.0e-4)

RT = retention; P = p value; FC = fold-change; S = smoker, NS = nonsmokers

Table 4

Correlation of serum metabolites with clinical lung parameters (Pearson correlation).

Metabolite%FEV1/FVCDLCOTLCCOHb
Pearson RP valuePearson RP valuePearson RP valuePearson RP value
N-Methylnicotinate -0.39341.05E-05-0.1850.0450.35717.20E-050.32930.0003
Myoinositol -0.43859.51E-07-0.24950.00710.35529.81E-050.3380.0002
Dimethylurate -0.31640.0006-0.26090.00490.20540.0280.24480.0087
2-Methylhippurate -0.35240.0001-0.37383.87E-050.24170.00930.26840.0039
5-Acetylamino-6-amino-3-methyluracil -0.36017.72E-05-0.32790.00030.25340.00630.31440.0007
863.40@8.7 -0.36367.00E-05-0.28910.00180.0004137NS0.1252NS
627.55@1.85 -0.24150.0084-0.22970.012-0.004NS0.2110.02
592.47@1.67 0.28210.0023-0.1535NS0.009827NS0.02193NS
284.061@8.88 0.2390.00910.24210.0083-0.05NS-0.11NS
234.085@9.61 0.24730.00690.1205NS-0.05NS-0.06NS
Fibrinogen peptide B -0.30160.0009-0.36594.61E-05-0.00968NS0.23930.0094
Peptide 1464.65@10.44 -0.3260.0003-0.51851.81E-090.1385NS0.24220.0085
Peptide 859.371@8.68 -0.37634.00E-05-0.30960.00030.003102NS0.1267NS
Peptide 1308.547@9.51 -0.37942.89E-05-0.4331.34E-060.07092NS0.17680.0558
Peptide 1049.464@9.38 -0.41157.23E-06-0.39421.86E-050.04195NS0.17250.075
  79 in total

1.  Airway count and emphysema assessed by chest CT imaging predicts clinical outcome in smokers.

Authors:  Alejandro A Diaz; Clarissa Valim; Tsuneo Yamashiro; Raúl San José Estépar; James C Ross; Shin Matsuoka; Brian Bartholmai; Hiroto Hatabu; Edwin K Silverman; George R Washko
Journal:  Chest       Date:  2010-06-17       Impact factor: 9.410

Review 2.  The role of smoking in coagulation and thromboembolism in chronic obstructive pulmonary disease.

Authors:  Victor F Tapson
Journal:  Proc Am Thorac Soc       Date:  2005

3.  Biomarkers for chronic obstructive pulmonary disease: surfactant protein D and C-reactive protein.

Authors:  Morten Dahl
Journal:  Am J Respir Crit Care Med       Date:  2008-06-01       Impact factor: 21.405

4.  Phospholipid composition and surface-active properties of tracheobronchial secretions from patients with cystic fibrosis and chronic obstructive pulmonary diseases.

Authors:  S Girod; C Galabert; A Lecuire; J M Zahm; E Puchelle
Journal:  Pediatr Pulmonol       Date:  1992-05

5.  Plasma Fibrinogen as a Biomarker for Mortality and Hospitalized Exacerbations in People with COPD.

Authors:  David M Mannino; Ruth Tal-Singer; David A Lomas; Jorgen Vestbo; R Graham Barr; Kay Tetzlaff; Michael Lowings; Stephen I Rennard; Jeffrey Snyder; Mitchell Goldman; Ubaldo J Martin; Deborah Merrill; Amber L Martin; Jason C Simeone; Kyle Fahrbach; Brian Murphy; Nancy Leidy; Bruce Miller
Journal:  Chronic Obstr Pulm Dis       Date:  2015

6.  Levels of leptin and IL-6 in lungs and blood are associated with the severity of chronic obstructive pulmonary disease in patients and rat models.

Authors:  Rui Liang; Wei Zhang; Ya-Mei Song
Journal:  Mol Med Rep       Date:  2013-03-15       Impact factor: 2.952

7.  Inflammatory biomarkers and exacerbations in chronic obstructive pulmonary disease.

Authors:  Mette Thomsen; Truls Sylvan Ingebrigtsen; Jacob Louis Marott; Morten Dahl; Peter Lange; Jørgen Vestbo; Børge G Nordestgaard
Journal:  JAMA       Date:  2013-06-12       Impact factor: 56.272

8.  Pentraxin 3 as a novel biomarker of inflammation in chronic obstructive pulmonary disease.

Authors:  Ozlem Kar Kurt; Mehmet Tosun; Emine Bahar Kurt; Fahrettin Talay
Journal:  Inflammation       Date:  2015-02       Impact factor: 4.092

9.  Untargeted plasma metabolite profiling reveals the broad systemic consequences of xanthine oxidoreductase inactivation in mice.

Authors:  Qiuying Chen; Hyeong-Cheon Park; Michael S Goligorsky; Praveen Chander; Steven M Fischer; Steven S Gross
Journal:  PLoS One       Date:  2012-06-18       Impact factor: 3.240

10.  Surfactant protein D, soluble intercellular adhesion molecule-1 and high-sensitivity C-reactive protein as biomarkers of chronic obstructive pulmonary disease.

Authors:  Sahar E El-Deek; Hoda A Makhlouf; Tahia H Saleem; Manal A Mandour; Nahed A Mohamed
Journal:  Med Princ Pract       Date:  2013-07-13       Impact factor: 1.927

View more
  20 in total

1.  Smokers with COPD Show a Shift in Energy and Nitrogen Metabolism at Rest and During Exercise.

Authors:  Olaf Holz; David S DeLuca; Stefan Roepcke; Thomas Illig; Klaus M Weinberger; Christian Schudt; Jens M Hohlfeld
Journal:  Int J Chron Obstruct Pulmon Dis       Date:  2020-01-06

Review 2.  A Review of Pulmonary Toxicity of Electronic Cigarettes in the Context of Smoking: A Focus on Inflammation.

Authors:  Peter G Shields; Micah Berman; Theodore M Brasky; Jo L Freudenheim; Ewy Mathe; Joseph P McElroy; Min-Ae Song; Mark D Wewers
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2017-06-22       Impact factor: 4.254

3.  New Strategies and Challenges in Lung Proteomics and Metabolomics. An Official American Thoracic Society Workshop Report.

Authors:  Russell P Bowler; Chris H Wendt; Michael B Fessler; Matthew W Foster; Rachel S Kelly; Jessica Lasky-Su; Angela J Rogers; Kathleen A Stringer; Brent W Winston
Journal:  Ann Am Thorac Soc       Date:  2017-12

4.  Plasma Metabolomic Signatures of Chronic Obstructive Pulmonary Disease and the Impact of Genetic Variants on Phenotype-Driven Modules.

Authors:  Lucas A Gillenwater; Katherine A Pratte; Brian D Hobbs; Michael H Cho; Yonghua Zhuang; Eitan Halper-Stromberg; Charmion Cruickshank-Quinn; Nichole Reisdorph; Irina Petrache; Wassim W Labaki; Wanda K O'Neal; Victor E Ortega; Dean P Jones; Karan Uppal; Sean Jacobson; Gregory Michelotti; Christine H Wendt; Katerina J Kechris; Russell P Bowler
Journal:  Netw Syst Med       Date:  2020-12-31

5.  Measurement of Melanin Metabolism in Live Cells by [U-13C]-L-Tyrosine Fate Tracing Using Liquid Chromatography-Mass Spectrometry.

Authors:  Qiuying Chen; Dalee Zhou; Zalfa Abdel-Malek; Fengli Zhang; Philip S Goff; Elena V Sviderskaya; Kazumasa Wakamatsu; Shosuke Ito; Steven S Gross; Jonathan H Zippin
Journal:  J Invest Dermatol       Date:  2021-02-05       Impact factor: 7.590

Review 6.  Using omics approaches to understand pulmonary diseases.

Authors:  Mengyuan Kan; Maya Shumyatcher; Blanca E Himes
Journal:  Respir Res       Date:  2017-08-03

7.  Investigations in the possibility of early detection of colorectal cancer by gas chromatography/triple-quadrupole mass spectrometry.

Authors:  Shin Nishiumi; Takashi Kobayashi; Shuichi Kawana; Yumi Unno; Takero Sakai; Koji Okamoto; Yasuhide Yamada; Kazuki Sudo; Taiki Yamaji; Yutaka Saito; Yukihide Kanemitsu; Natsuko Tsuda Okita; Hiroshi Saito; Shoichiro Tsugane; Takeshi Azuma; Noriyuki Ojima; Masaru Yoshida
Journal:  Oncotarget       Date:  2017-03-07

Review 8.  Smoking status and second-hand smoke biomarkers in COPD, asthma and healthy controls.

Authors:  Matteo Bradicich; Macé M Schuurmans
Journal:  ERJ Open Res       Date:  2020-07-20

9.  Alterations in Serum Polyunsaturated Fatty Acids and Eicosanoids in Patients with Mild to Moderate Chronic Obstructive Pulmonary Disease (COPD).

Authors:  Bjoern Titz; Karsta Luettich; Patrice Leroy; Stephanie Boue; Gregory Vuillaume; Terhi Vihervaara; Kim Ekroos; Florian Martin; Manuel C Peitsch; Julia Hoeng
Journal:  Int J Mol Sci       Date:  2016-09-20       Impact factor: 5.923

10.  Differential Expression Study of Lysine Crotonylation and Proteome for Chronic Obstructive Pulmonary Disease Combined with Type II Respiratory Failure.

Authors:  Qing Gan; Donge Tang; Qiang Yan; Jiejing Chen; Yong Xu; Wen Xue; Lu Xiao; Fengping Zheng; Huixuan Xu; Yingyun Fu; Yong Dai
Journal:  Can Respir J       Date:  2021-06-15       Impact factor: 2.409

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.