Literature DB >> 23843818

Advantage of Applying OSC to (1)H NMR-Based Metabonomic Data of Celiac Disease.

Mostafa Rezaei-Tavirani1, Fariba Fathi, Fatemeh Darvizeh, Mohamad Reza Zali, Mohamad Rostami Nejad, Kamran Rostami, Mohsen Tafazzoli, Afsaneh Arefi Oskouie, Seyed Abdolreza Mortazavi-Tabatabaei.   

Abstract

BACKGROUND: Celiac disease (CD) is a disorder associated with body reaction to gluten. After the gluten intake, an immune reaction against the protein occurs and damages villi of small intestine in celiac patients gradually.
OBJECTIVES: The OSC, a filtering method for minimization of inter- and intra-spectrometer variations that influence on data acquisition, was applied to biofluid NMR data of CD patients. PATIENTS AND METHODS: In this study, metabolites of total 56 serum samples from 12 CD patients, 15 CD patients taking gluten-free diet (GFD), and 29 healthy cases were analyzed using nuclear magnetic resonance (NMR) and associated theoretical analysis. Employing ProMetab (version ProMetab_v3_3) software, data obtained from NMR spectra were reduced and orthogonal signal correction (OSC) effect on celiac disease metabonomics before and after the separation by principle component analysis (PCA) was investigated.
RESULTS: The three groups were separated by OSC and findings were analyzed by partial least squares discriminant analysis (PLS-DA) method. Root mean square error of calibration (RMSEc) and correlation coefficient of calibration (Rc) for PLS-DA referred to an efficient group separation filtered by OSC.
CONCLUSIONS: The applied leave-one-out cross-validation to PLS-DA method performed along with OSC confirmed validation of data analysis. Finally four metabolites are introduced as CD biomarkers.

Entities:  

Keywords:  Celiac Disease; Discriminant Analysis; Magnetic Resonance Spectroscopy; Principle Component Analysis

Year:  2012        PMID: 23843818      PMCID: PMC3693626          DOI: 10.5812/ijem.3058

Source DB:  PubMed          Journal:  Int J Endocrinol Metab        ISSN: 1726-913X


1. Background

Celiac desease (CD) is a disorder caused essentially by body reaction to the gluten. Intake of foods containing gluten promotes an immune reaction against the protein that damages villi of small intestine in celiac patients gradually. Consequently, lack of vitamins, minerals, and other essential nutrients occurs. Therefore, celiac patients are at the risk of malnutrition, anemia and osteoporosis from which the anemia attributes to iron deficiency and results in declining red blood cell efficiency, and the osteoporosis may represent as fragile bones caused by lack of calcium (1). Full analysis of a living organism can be achieved by an integrated set of ‘omics’ approaches including metabonomics, genomics, transcriptomics, and proteomics in order to increase data complexity. Metabonomics generates sufficient quantitative or qualitative metabolic data for analytical studies of biological systems (2-7) and is originally defined as ‘the quantitative measurement of the dynamic multi-parametric metabolic response of living systems to pathophysiological stimuli or genetic modification’(5). Investigation of complex metabolic systems such as disease mechanisms, toxic reactions, and genetic manipulations requires full analytical data sets. Nuclear magnetic resonance (NMR) and mass spectrometry (MS) are the most conventional techniques for metabolic profiling (8-10). Multivariate statistical approaches are also suitable methods for extraction metabolic data associated with each level of dynamic processes. Since NMR method produces various and highly correlated data, employing multivariate methods such as PCA prior to any further data analysis is essential to choose efficient number of descriptors (11).

2. Objectives

In this study OSC, a filtering method for minimization of inter- and intra-spectrometer variations that influence on data acquisition, was applied to biofluid NMR data of CD patients.

3. Patients and Methods

3. 1. Blood Samples

Using syringes, two milliliters of blood samples were drawn from antecubital vein of each case through a single puncture followed by immediate serum separation via centrifugation, and storage at -20ºC. Experimental data was obtained by placing serum samples in 5mm NMR tubes. 600 µL serum samples each were diluted by 100 µL D2O to provide a field-frequency lock. The experimental data consisted of 56 serum samples including 12 samples from celiac patients on gluten-free diet, 15 samples from patients without a specific diet, and 29 samples from healthy cases.

3. 2.1H NMR Spectroscopy

A BrukerAvance DRX 600 spectrometer operating at 500 MHz at 300 K using the Carr–Purcell–Meiboom–Gill (CPMG) spin–echo sequence (12) with pre saturation was employed for recording 1H NMR spectra. QNP probe was used in this experiment. Spin–echo loop time (2nt) and relaxation delay for recorded spectra were 43.9 ms and 2s, respectively. A total of 128 transients were collected into 32k data points using a spectral width of 8389.26 Hz and an acquisition time of 1.95 s. Prior to Fourier transformation, an exponential line broadening function of 0.30 Hz was applied to the Free Induction Decay (FID).

3.3. Data Pre-Processing

Employing Prometab software (version Prometab_v3_3), correction of baseline was carried out according to 4, 4-Dimethyl-4-Silapentane-1-Sulfonic acid (DSS) as reference. The number of variables was reduced to 245 in each spectrum by integrating spectral intensity in regions of equal width [0.04 parts per million (ppm) over the range 0.2–10.0 ppm]. In addition, the region of (δ = 4.50–5.98) was excluded from the analysis to avoid unauthentic effects of variability in the suppression of water resonance. Therefore, the number of variables was reduced to 205. All spectra were normalized to constant total intensity.

3.4. Statistical Analysis

3.4.1. OSC for Classification Modeling

A response matrix (Y) describing variation between defined sample classes was constructed and the variation in X orthogonal to Y was removed by applying OSC. The filtered data matrix, osc, that included class-correlated variations was modeled by subsequent multivariate modeling methods such as PCA or PLS-DA (14). OSC Modeling eliminates variables that are not correlated with desirable features from 1H NMR biofluid data; as a result calculated values in multivariate models motivate only class separation. In this study, OSC was applied to biofluid NMR data prior to chemometric analysis to minimize influences of inter- and intra-spectrometer variations during data acquisition. Moreover, it was employed to eliminate physiological variation from data sets (13-17).

3.4.2. Principal Components Analysis (PCA)

PCA is a conventional technique for multivariate analysis and mainly employed for multivariate data representation in a low-dimensional space that means it describes maximum possible number of variables with minimum possible number of principal components (PCs). In PCA, principal components are named consecutively starting from PC1 until total variance is defined. PC1 or first principal component is a line that goes through the points in a variable space and best conserves relevant distances between objects and defined by a loading vector as follows. PC1 is a hidden variable and has maximum variance of the scores. Scores are predicted data values on the hidden variable (18). All calculations were performed in MATLAB 6.5 and PCA was implemented with the PLS-Toolbox Version 3.0. Both the graphical (‘pcagui’) and the command-line (‘pca’) versions of PCA have been employed (19).

3.4.3. Partial Least Squares Discriminant Analysis (PLS-DA)

PLS regression was directed by a response data set Y to derive components from descriptor data set X that best describe specified Y structure, as it maximizes the covariance that expresses common structure between X and Y (20-22). PLS is also divided into regression as well as discriminant analysis (PLS-DA). Classification by DA assigns the samples to proper separate classes which are represented by using so-called ‘dummy’ variables (23).

4. Results

As shown in Figure 1a, data analysis leads to introduce two groups of control and celiac cases. Since separation of GDF group from other two groups was an important index in this study, more analytical efforts were effected by using OSC in order to achieve better transparency between sample groups. The useful role of OSC in separation groups is reported in several studies (24). By using OSC, 95.26% total variability was distributed between three PCs as PC1 (91.11%), PC2 (2.88%), and PC3 (1.47%) (See Figure 1b). After successful division of samples into three groups, PLS-DA as regression extension of PCA was applied to maximize the separation. It is reported that PLS-DA is useful tool for maximizing covariance between measured data (x) and response variable (y) (25). PLS-DA findings introduced three most influential metabolites that play important role in separation of groups. Serum level alteration of these metabolites in CD group compared to control group is shown in Table 1.
Figure 1

Score Plot PCA a) Without OSC, and b) With OSC

Table 1

PCA-Detected 1H NMR Spectral Regions that Separate Significantly CD and Control Groups Based on Metabolites Levels

Chemical Shift (ppm)MetaboliteAlteration of Metabolite Level in CD Group Compared to Control Group
0.86lipid (mainly VLDL)Decrease
1.26 -1.3LipidDecrease
1.34LactateDecrease
3.22CholineIncrease
The percentage of captured variance by PCA model is a suitable index for data validation (26); accordingly, three times PCs and their associated variance amounts were calculated and shown in Table 2 with total variance equal to 92.22% out of which the percentage variance in step 1 was equal to 84.38%, and in second and third steps were 4.74 and 3.12, respectively. Since 92.24% is close to 1, we may conclude that PCA model could separate the three groups in a real and valid manner.
Table 2

Percent Variances Captured by PCA Model. First column corresponds to steps of model application and the second and third columns refer to variance of each step and total variance, respectively.

Number% Variance% Variance Total
184.3884.38
24.7489.12
33.1292.24
Score plots of PLS-DA without and with application of OSC for three groups are shown in Figure 2. The useful parameter to analyze PLS-DA findings is latent variable variance (LV) (16). LVs of represented data in Figure 2 for three steps of PLS-DA were calculated and shown in Table 3. There are several evidences implicate that total variance value above 80% is an acceptable index for data validation. Accumulated PLS-DA score plot by using OSC for three LV variances was 86.82% (see Table 3) that revealed a clear separation between the three groups; however this procedure without applOSC application (identified by an accumulated score plot of 75.55%) could not be considered as a suitable analytical method.
Figure 2

Score Plot PLS-DA a) Without OSC, and b) With OSC

Table 3

Summary of PLS-DA and OSC- PLS-DA Models of NMR Spectra. LVs refer to number of latent variances and Vx and Vy correspond to NMR data matrix and respond matrix, respectively. Vx (cuml, %) and Vy show total variances.

LVsVX (%)VX (cuml, %)VY (%)VY (cuml, %)
PLS-DA Model
163.9663.9644.0744.07
29.0573.0116.3460.41
32.5475.558.2466.73
OSC- PLS-DA Model
175.2575.25100.00100.00
26.1181.360.00100.00
35.4686.820.00100.00
RMSEc and Rc values for PLS-DA calibration without applying OSC were equal to 0.6682 and 0.6348, respectively. By using OSC, these values changed to 0.4226 and 0.8629, respectively. Amounts of Rc close to 1 correspond to acceptable data analysis (27) that means the calculated Rc value of 0.8629 refers to application of a proper analytical method for group separation. Besides this, OSC application reduced RMSEc value, that means an error reduction occurred. For better clarity, the applied OSC model was performed accompanied by leave-one-out cross-validation. The characterized findings were 0.5175 and 0.8510 for RMSEcv and Rcv, respectively. The low amount of error and Rcv close to 1, confirm sufficient validity of findings.

5. Discussion

According to orthogonal theory in mathematics, OSC can eliminate data in X matrix that are orthogonal with response Y matrix. The response Y matrix is class variable. The Y matrix variables determined as control, CD, and GFD groups were assigned to as 0 , 1, and 2, respectively. Orthogonal component with eigenvalues greater than 1 were eliminated. In order to perform metabonomics analysis of recorded 1H NMR spectra of healthy, celiac, and GFD groups, two different recognition pattern methods before and after OSC were applied. Score plots of PCA are drawn using the MATLAB software. Figure 1a and Figure 1b demonstrate PCA score plots of NMR spectra without and with OSC, respectively. On the basis of this study, four metabolites were introduced to differentiate between celiac patients on GFD, celiac patients without specific diet, and healthy people. We hope that further investigations lead to determine exact metabonomic pattern of CD.
  13 in total

Review 1.  'Metabonomics': understanding the metabolic responses of living systems to pathophysiological stimuli via multivariate statistical analysis of biological NMR spectroscopic data.

Authors:  J K Nicholson; J C Lindon; E Holmes
Journal:  Xenobiotica       Date:  1999-11       Impact factor: 1.908

Review 2.  Spectroscopic and statistical techniques for information recovery in metabonomics and metabolomics.

Authors:  John C Lindon; Jeremy K Nicholson
Journal:  Annu Rev Anal Chem (Palo Alto Calif)       Date:  2008       Impact factor: 10.745

Review 3.  Metabolomics: a global biochemical approach to drug response and disease.

Authors:  Rima Kaddurah-Daouk; Bruce S Kristal; Richard M Weinshilboum
Journal:  Annu Rev Pharmacol Toxicol       Date:  2008       Impact factor: 13.820

4.  1H nuclear magnetic resonance-based metabolomic characterization of wines by grape varieties and production areas.

Authors:  Hong-Seok Son; Ki Myong Kim; Frans van den Berg; Geum-Sook Hwang; Won-Mok Park; Cherl-Ho Lee; Young-Shick Hong
Journal:  J Agric Food Chem       Date:  2008-08-16       Impact factor: 5.279

Review 5.  Mass spectrometry for the identification of the discriminating signals from metabolomics: current status and future trends.

Authors:  Erwan Werner; Jean-François Heilier; Céline Ducruix; Eric Ezan; Christophe Junot; Jean-Claude Tabet
Journal:  J Chromatogr B Analyt Technol Biomed Life Sci       Date:  2008-07-12       Impact factor: 3.205

6.  Systems biology: Metabonomics.

Authors:  Jeremy K Nicholson; John C Lindon
Journal:  Nature       Date:  2008-10-23       Impact factor: 49.962

7.  Distinguishing transgenic from non-transgenic Arabidopsis plants by (1)H NMR-based metabolic fingerprinting.

Authors:  Yanfei Ren; Tao Wang; Yufa Peng; Bin Xia; Li-Jia Qu
Journal:  J Genet Genomics       Date:  2009-10       Impact factor: 4.275

8.  Statistically integrated metabonomic-proteomic studies on a human prostate cancer xenograft model in mice.

Authors:  Mattias Rantalainen; Olivier Cloarec; Olaf Beckonert; I D Wilson; David Jackson; Robert Tonge; Rachel Rowlinson; Steve Rayner; Janice Nickson; Robert W Wilkinson; Jonathan D Mills; Johan Trygg; Jeremy K Nicholson; Elaine Holmes
Journal:  J Proteome Res       Date:  2006-10       Impact factor: 4.466

Review 9.  Systematic functional analysis of the yeast genome.

Authors:  S G Oliver; M K Winson; D B Kell; F Baganz
Journal:  Trends Biotechnol       Date:  1998-09       Impact factor: 19.536

10.  Atypical presentation is dominant and typical for coeliac disease.

Authors:  Mohammad Rostami Nejad; Kamran Rostami; Mohamad Amin Pourhoseingholi; Ehsan Nazemalhosseini Mojarad; Manijeh Habibi; Hossein Dabiri; Mohammad Reza Zali
Journal:  J Gastrointestin Liver Dis       Date:  2009-09       Impact factor: 2.008

View more
  3 in total

Review 1.  How Metabolomics Provides Novel Insights on Celiac Disease and Gluten-Free Diet: A Narrative Review.

Authors:  Mirco Vacca; Annalisa Porrelli; Francesco Maria Calabrese; Tamara Lippolis; Ilaria Iacobellis; Giuseppe Celano; Daniela Pinto; Francesco Russo; Gianluigi Giannelli; Maria De Angelis
Journal:  Front Microbiol       Date:  2022-06-23       Impact factor: 6.064

Review 2.  Metabonomics exposes metabolic biomarkers of Crohn's disease by (1)HNMR.

Authors:  Fariba Fathi; Fatemeh Ektefa; Mehrdad Hagh-Azali; Hamid Asadzadeh Aghdaie
Journal:  Gastroenterol Hepatol Bed Bench       Date:  2013

3.  Serum Metabolomic Profiles Associated With Untreated Metabolic Syndrome Patients in the Chinese Population.

Authors:  Yuanyuan Li; Yongfa Wang; Yaodong Zhuang; Ping Zhang; Shujiao Chen; Tetsuya Asakawa; Bizhen Gao
Journal:  Clin Transl Sci       Date:  2020-06-16       Impact factor: 4.689

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.