Literature DB >> 28517934

Plasma and Serum Metabolite Association Networks: Comparability within and between Studies Using NMR and MS Profiling.

Maria Suarez-Diez¹, Jonathan Adam^2,3,4, Jerzy Adamski^4,5,6, Styliani A Chasapi⁷, Claudio Luchinat^8,9, Annette Peters^2,3,4,10, Cornelia Prehn⁵, Claudio Santucci⁸, Alexandros Spyridonidis¹¹, Georgios A Spyroulias⁷, Leonardo Tenori¹², Rui Wang-Sattler^2,3,4, Edoardo Saccenti¹.

Abstract

Blood is one of the most used biofluids in metabolomics studies, and the serum and plasma fractions are routinely used as a proxy for blood itself. Here we investigated the association networks of an array of 29 metabolites identified and quantified via NMR in the plasma and serum samples of two cohorts of ∼1000 healthy blood donors each. A second study of 377 individuals was used to extract plasma and serum samples from the same individual on which a set of 122 metabolites were detected and quantified using FIA-MS/MS. Four different inference algorithms (ARANCE, CLR, CORR, and PCLRC) were used to obtain consensus networks. The plasma and serum networks obtained from different studies showed different topological properties with the serum network being more connected than the plasma network. On a global level, metabolite association networks from plasma and serum fractions obtained from the same blood sample of healthy people show similar topologies, and at a local level, some differences arise like in the case of amino acids.

Entities: Chemical Disease Gene Species

Keywords: blood; correlations; differential network analysis; low molecular weight metabolites; mutual information; network inference; network topology; plasma; serum

Mesh：

Substances：

Year: 2017 PMID： 28517934 PMCID： PMC5645760 DOI： 10.1021/acs.jproteome.7b00106

Source DB: PubMed Journal: J Proteome Res ISSN： 1535-3893 Impact factor: 4.466

Introduction

A large variety of omics data can be collected from a sample, providing information about the biological system under investigation at different levels and from different angles; the analysis and integration of these aspects is one of the hallmarks of systems biology. Metabolomics profiling and analysis of blood samples has been successfully applied to investigate a variety of diseases, such as cancer,[1−3] kidney diseases,[4] cardiovascular diseases,[5,6] diabetes, and celiac disease.[7−9] Blood bathes every tissue and every organ in the body and collects and transports the molecules that are being assimilated, secreted, excreted, or discarded by different tissues.[10] In most metabolomics studies, either plasma or serum matrices are used as a proxy for blood. Blood is composed of cellular components (red and white cells and platelets) suspended in a straw-colored liquid carrier, the plasma, that accounts for ∼50–55% of blood volume.[10] Plasma is separated from the cellular component via centrifugation after the addition of anticoagulants. Serum is obtained by letting the blood coagulate and removing the supernatant. Both plasma and serum are aqueous solutions (∼95% water) and contain proteins and peptides, carbohydrates, lipids, amino acids, electrolytes, organic wastes, and a variety of other small organic molecules dissolved in them. Many target clinical parameters, such as metal ions, proteins, and enzymes, have been found to show different concentrations[11−13] in the two different media, but in terms of small molecules the compositions of plasma and serum are considered to be very similar.[10] Recent studies have reported higher metabolite concentrations in serum than in plasma[10,14,15] but suggested that either matrix should generate similar results in clinical and biological studies.[15] However, to the best of our knowledge, there are no available studies in which biological or clinical results obtained using plasma have been confirmed using serum or vice versa. We are interested in the associations between the concentration levels of metabolites in blood. Specifically, we wish to determine how similar the association networks inferred using metabolite concentration levels measured in plasma and serum are. As already anticipated, a great deal of biological information is encoded in the relationships among metabolite concentration levels rather than in their levels alone.[16] For these reasons, it is important to understand which biological information is contained in both plasma and serum and to what extent plasma and serum contain unique or shared information. In metabolomics, networks are usually reconstructed using Pearson’s, Spearman’s, or partial correlations (referred to as CORR).[17−20] Here we complement a correlation-based approach with methods adapted from gene network inference to investigate metabolite association networks in blood samples. Four alternative methods for network inference were applied to avoid bias in network estimations, standard Pearson’s correlation, two well-characterized algorithms to infer gene regulatory networks, ARACNE (Algorithm for the Reconstruction of Accurate Cellular Networks),[21] CLR (Context Likelihood of Relatedness),[22] and PCLRC (Probabilistic Context Likelihood of Relatedness on Correlations), which we recently developed to infer metabolite associations.[6] The aim of the present study is to investigate whether possible (dis)similarity exists between serum and plasma metabolite association networks obtained from the same blood samples or from different samples obtained from equivalent populations of healthy subjects. Two groups of ∼1000 healthy blood donors each were sampled for either their plasma or serum and analyzed using 1H nuclear magnetic resonance spectroscopy (NMR). An additional study, formed by 377 individuals, was considered for plasma and serum extracted from the same blood samples and analyzed using flow injection analysis-tandem mass spectrometry (FIA–MS/MS). For the purpose of investigating whether (dis)similarity of plasma and serum profiles obtained from the same sample was retained also in the presence of pathophysiological conditions, a small study on subjects suffering from various hematological malignancies was also considered.

Materials and Methods

Studies Description

Studies Ia and Ib

The participating subjects were recruited in collaboration with the Tuscan section of the Italian Association of Blood Donors (AVIS) in the Transfusion Service of Pistoia Hospital (Ospedale del Ceppo, AUSL 3 - Pistoia, Italy) (Study Ia) and in the Service of Immunohematology and Blood Transfusion of the Azienda Ospedaliero-Universitaria Careggi (Florence, Italy) (Study Ib). In brief, a total of 864 adult healthy volunteers (678 males, 186 females, mean age 41 ± 11 years) were enrolled in Pistoia.[6,23] 994 adult healthy volunteers (723 males, 271 females, mean age 41 ± 12 years) were enrolled in Florence.[5] Plasma samples (study Ia) were obtained after overnight fasting, and EDTA was used as an anticoagulant; samples were stored at −80 °C immediately after collection. For more details, see Bernini et al.[23] Samples were collected, preprocessed, and stored according to the standard operating procedures previously described.[24] All subjects in the study provided informed consent. Both studies Ia and Ib consisted of blood donors living in the same geographical area (within a ∼50 km radius from Florence, Tuscany in Italy) who must comply with the requirements for blood donation according to the Italian legislation. Among others: age 18–60 years, body weight >50 kg, systolic blood pressure 110–148 mmHg, diastolic blood pressure 60–100 mmHg, absence of (manifested) infectious diseases, absence of chronic diseases (such as diabetes, tumors, autoimmune diseases), no current menstruation, no consumption of medicines within 1 week before donation (bd), no common diseases (such as flu, cold, bronchitis) within 2 weeks bd, no surgery within 3 months bd, no endoscopic exams within 4 months bd, no pregnancy within 12 months bd, no abortion within 4 months bd, no travels to tropical countries within 6 months bd, and no (heavy) sport activity within 24 h bd. All samples were collected under a fasting condition. The two large studies could then be considered to be rather homogeneous. Plasma and serum samples were prepared for NMR analysis in the same laboratory using the same standard protocols for blood derivatives NMR analysis. EDTA was used on the plasma samples, as the effect of EDTA on the quality of the samples for NMR analysis has been found to be negligible.[25] Samples were analyzed on the same instrument operating with the same operative setting. NMR spectra were postprocessed (phasing and baseline correction) with automated routines. Under these conditions, NMR experiments have been found to be extremely reproducible in inter/intra laboratory comparative investigations.[26−29] Metabolites were then quantified using the same automated routine.

Study II

The subjects were recruited in the KORA (Cooperative Health Research in the Region of Augsburg) cohort, a population-based research platform with subsequent follow-up studies in the fields of epidemiology, health economics and health care research consisting of interviews in combination with medical and laboratory examinations, as well as the collection of biological samples.[30] Plasma and serum samples from 377 individuals (180 females, 197 males, age range from 51 to 84 years) from the population-based cohort KORA F3[31] were used. For each participant, fasting blood was simultaneously drawn into serum and EDTA plasma gel tubes between 8 and 10 a.m. Plasma tubes were shaken gently and thoroughly for 15 min, followed by centrifugation at 2750g for 15 min at 15 °C. In the meantime, serum tubes were gently inverted twice, followed by 45 min of resting at room temperature to obtain complete coagulation before performing the same centrifugation process as for plasma. All samples were stored at −80 °C until the metabolomics analysis.[15] All KORA participants gave written informed consent. The KORA study was approved by the ethics committee of the Bavarian Medical Association, Germany.

Study III

We were also interested in investigating the equivalence between plasma and serum profiles in the case of pathophysiological alterations. For this we considered a third study where serum and plasma were extracted from the same blood of subjects affected by different various hematological malignancies, and the samples were analyzed using NMR. This data set comprises 30 patients (18 males, 12 females, median age 45 years, range 18 to 68) suffering from various hematological malignancies (9 acute myeloid leukemia, 7 acute lymphoblastic leukemia, 9 lymphoma, 3 myelodysplastic syndrome, 2 multiple myeloma) who underwent allogeneic (n = 24) or autologous (n = 6) hematopoietic stem cell transplantation at the Bone Marrow Transplantation Unit of the University Hospital of Patras, Greece. In addition, plasma and serum samples were collected from 13 healthy individuals. In total the study data set consisted of 43 serum and plasma samples obtained from the same blood specimen. Samples were collected, preprocessed, and stored according to the standard operating procedures previously described.[24] All subjects in the study provided informed consent.

Metabolite Quantification and Analysis

Serum and plasma metabolite concentrations in the samples from studies I and III were analyzed using 1H NMR using a Bruker 600 MHz spectrometer (Bruker BioSpin) operating at 600.13 MHz using standard CPMG experiments and standard protocols for sample preparation as previously described.[23] All resonances of interest were manually checked, and signals were assigned on template 1D NMR profiles by using matching routines of AMIX 7.3.2 (Bruker BioSpin) in combination with the BBIOREFCODE (Version 2-0-0; Bruker BioSpin) reference database and published literature when available. The relative concentrations of each metabolite were calculated by integrating the signals in the spectra. The 29 quantified metabolites are given in Table . We refer the reader to the original publications for more details of the experimental procedures.[6,23]

Table 1

List of Metabolites Measureda

no. p	studies I and III - NMR¹	study II FIA-MS/MS²
1	3-hydroxybutyrate	arginine	33	SM OH C14:1b	65	PC aa C38:4b	97	PC ae C40:0b
2	acetate	glutamine	34	SM OH C16:1b	66	PC aa C38:5b	98	PC ae C40:1b
3	acetoacetate	glycine	35	SM OH C22:1b	67	PC aa C38:6b	99	PC ae C40:2b
4	alanine	histidine	36	SM OH C22:2b	68	PC aa C40:1b	100	PC ae C40:3b
5	arginine	methionine	37	SM OH C24:1b	69	PC aa C40:4b	101	PC ae C40:4b
6	citrate	ornithine	38	SM C16:0b	70	PC aa C40:5b	102	PC ae C40:5b
7	creatine	phenylalanine	39	SM C16:1	71	PC aa C40:6b	103	PC ae C40:6b
8	creatinine	proline	40	SM C18:0b	72	PC aa C42:0b	104	PC ae C42:1b
9	dimethylglycine	serine	41	SM C18:1b	73	PC aa C42:1b	105	PC ae C42:2b
10	formate	threonine	42	SM C24:0b	74	PC aa C42:2b	106	PC ae C42:3b
11	glucose	tryptophan	43	SM C24:1b	75	PC aa C42:5b	107	PC ae C42:4b
12	glutamine	tyrosine	44	PC aa C24:0	76	PC aa C42:6b	108	PC ae C42:5
13	HDL	valineb	45	PC aa C28:1	77	PC ae C30:0b	109	PC ae C44:3b
14	histidine	xLeucine	46	PC aa C30:0b	78	PC ae C30:2	110	PC ae C44:4b
15	isoleucine	C0	47	PC aa C32:0b	79	PC ae C32:1b	111	PC ae C44:5b
16	LDL	C10b	48	PC aa C32:1b	80	PC ae C32:2	112	PC ae C44:6b
17	leucine	C10:1b	49	PC aa C32:2b	81	PC ae C34:0b	113	lysoPC a C14:0
18	lysine	C12b	50	PC aa C32:3	82	PC ae C34:1b	114	lysoPC a C16:0b
19	methionine	C12:1b	51	PC aa C34:1b	83	PC ae C34:2b	115	lysoPC a C16:1
20	N-acetylglucosamine	C14:1b	52	PC aa C34:2b	84	PC ae C34:3	116	lysoPC a C17:0
21	oxoglutarate	C14:2	53	PC aa C34:3b	85	PC ae C36:1b	117	lysoPC a C18:0b
22	phenylalanine	C16b	54	PC aa C34:4	86	PC ae C36:2b	118	lysoPC a C18:1b
23	proline	C18b	55	PC aa C36:0b	87	PC ae C36:3b	119	lysoPC a C18:2
24	pyruvate	C18:1b	56	PC aa C36:1b	88	PC ae C36:4b	120	lysoPC a C20:3b
25	serine	C18:2	57	PC aa C36:2b	89	PC ae C36:5	121	lysoPC a C20:4
26	threonine	C2	58	PC aa C36:3b	90	PC ae C38:0b	122	lysoPC a C28:0b
27	tyrosine	C3b	59	PC aa C36:4b	91	PC ae C38:1b
28	valine	C4b	60	PC aa C36:5b	92	PC ae C38:2b
29	VLDL	C5b	61	PC aa C36:6b	93	PC ae C38:3b
30		C8b	62	PC aa C38:0b	94	PC ae C38:4b
31		C8:1	63	PC aa C38 1b	95	PC ae C38:5b
32		H1	64	PC aa C38 3b	96	PC ae C38:6b

Concentrations are isotope-corrected.[33]

(1) Study I and III: Metabolites (p = 29) measured (NMR) in serum and plasma obtained from different blood specimens. (2) Study II: Metabolites (p = 122) measured (FIA-MS/MS) in serum and plasma obtained from the same blood specimens. The metabolites common to both data sets are in italics. xLeucine refers to the sum of leucine and isoleucine. Concentrations are isotope-corrected.[33] Plasma and serum metabolite concentrations in Study II were quantified using a commercially available metabolomics kit (AbsoluteIDQ p150 Kit, Biocrates Life Sciences AG, Innsbruck, Austria), which is based on flow injection analysis-triple quadrupole mass spectrometry (FIA-MS/MS). Out of the 10 μL sample, 163 metabolites were quantified simultaneously. Of these, n = 122 (25 quantified and 97 semiquantified metabolites passed both criteria) passing the data quality control (see material and methods in ref (15) for further details) were retained for further analysis. The assay procedures and the full biochemical names have been described in more detail in previous publications.[15,32] Metabolite concentrations in the original samples were updated due to new insights according to the isotope correction, as previously described.[33]

Network Reconstruction

In the graphical representation of a biological network, the molecular components (here metabolite concentrations) are represented as nodes and the edges (or links) represent their interactions, either direct or indirect. Here interactions represented coordinated changes in metabolite concentration levels. In brief, three different methods for network inference were used to infer metabolite network using default parameters as previously detailed,[34] together with the standard correlation approach. We present here a brief description of the methods based on ref (34). We refer to the original publications for more details. All methods were used with default parameters.

Method Based on Correlations

The association between any pair of metabolites was measured through the absolute value of Pearson’s correlation (the method is referred to as CORR in this study).

CLR Algorithm

The CLR (Context Likelihood of Relatedness) algorithm[22] uses mutual information as a measure of the similarity between the profiles of the two chosen variables data. Indicating with X and Y the concentration of two metabolites, the mutual information MI between X and Y is defined aswhere p(x,y) is the joint probability distribution function of X and Y and p(x) (respectively, p(y)) indicates the probability that X = x (respectively, Y = y). It should be noted that to compute MI continuous data are discretized. The relationships between pairs of metabolites expressed by MI are then compared against the local context for each possible interaction so that possible spurious (indirect) associations are removed.

ARACNE Algorithm

As CLR, ARACNE (Algorithm for the Reconstruction of Accurate Cellular Networks)[21] uses MI as a measure of the similarity between two chosen variables. The properties of MI are used to prune the network of spurious interactions: The weakest edge of each triplet is interpreted as an indirect interaction and is removed if the difference between the two lowest weights is above a threshold γ. We used the ARANCE implementation presented in the R package “minet”[35] with default parameters (γ = 0).

PCLRC Algorithm

PCRLC (Probabilistic Context Likelihood of Relatedness on Correlations) was based on a modification of the CLR algorithm (using correlation instead of MI to measure similarity between profiles) and on iteratively sampling the data set, resulting in a weighted adjacency matrix containing an estimate of the likeliness of the association between any two metabolites expressed as probability in the range 0 to 1. We deemed significant those associations for which the probability was >0.95. An ‘R’ implementation of this algorithm is available at semantics.systemsbiology.nl. More details are provided in ref (6).

Construction of Serum and Plasma Metabolite Networks

The serum and plasma metabolite–metabolite association networks were constructed taking a so-called wisdom of crowds approach as detailed in ref (36). The following methodological description is based on ref (36), to which we refer for more details. For each set of samples obtained from Studies Ia, Ib, and II four adjacency matrices {a} (with m = 1 to 4) were obtained using the above-described methods. The entries of such matrices are real numbers in the range [−1, 1] for correlation matrices, in the [0, +∞) range for mutual information matrices, or [0, 1] for probabilistic networks, indicating the strength or the likelihood of the metabolite–metabolite associations. It should be noted that each of the considered algorithms uses different approaches to estimate the weight of the associations. As a result, the weights produced by different methods cannot be directly compared. These matrices are binarized to 0 and 1, imposing a threshold, τ, on the {a} values The values of τ depend on the method considered: 0.95 for PCLRC and 0.6 on the absolute value of the correlation for the CORR method. ARACNE and CLR follow different approaches to remove spurious correlations, and the weight of all associations deemed spurious is already set to zero. As a result, no further threshold is needed for these methods, and a τ value of zero was selected, as further detailed in ref (34). The choice of 0.6 for the correlation is based on the threshold, as discussed by Camacho et al.[37] The four networks were then superimposed The final adjacency matrix, representing the metabolite network, was defined by retaining only those links inferred by two or more methods. We set Q = 2, but other options were also explored as detailed in the Results and Discussion. In total, four networks were defined, two for studies Ia and Ib (plasma and serum obtained from different subjects, respectively) and two for study II (serum and plasma obtained from the same subjects). All networks were constructed using a large sample size (>900 samples for studies Ia and Ib and >350 for study II), ensuring the reliability of the inferred networks, as previously described.[34] The node degree δ for the ith metabolite is the number of links connecting those particular metabolites and was obtained as

Indices for Assessing Network Differences

To compare analytically different networks we used the Frobenius norm of a matrix X defined as[38]It holds that ∥X∥ = 0 if and only if X = 0. It follows that ∥X – Y∥ = 0 if and only if the two matrices are equal. In this case, X and Y are two binary adjacency matrices representing metabolite–metabolite association networks. The Frobenius norm was used solely to conveniently quantify and summarize the difference between two adjacency matrices. Network differences were derived on the basis of different node (metabolite)-degree observed in serum and plasma networks.

Pathway Enrichment Analysis

The MetaboAnalyst server 3.0 (www.metaboanalyst.ca)[39] was used to perform pathway enrichment analysis. For the over-representation analysis, the hypergeometric test was chosen and the pathway topology analysis was based on the relative-betweenness centrality.

Results and Discussion

Plasma is obtained from a blood sample, after the addition of an anticoagulant (usually citrate, heparin, or, in the present case, EDTA), by centrifuging the sample and removing or decanting the most buoyant (noncellular) portion. Serum is obtained by letting the blood clot and then collecting the supernatant. Recent studies have addressed the problem of the stability, in plasma and serum, of low-molecular-weight metabolites for metabolomics studies with respect to sample handling and storing. Serum and plasma extraction procedures have been thoroughly investigated in the past, and variations in several analyte concentrations were observed depending on extraction and storage protocols.[11−14] During the coagulation process, blood cells are metabolically active, and this might lead to some changes in metabolite concentrations. Thus it can be speculated that metabolite association networks inferred in serum may also be influenced by the undergoing coagulation process. For these reasons it is of interest to construct and compare metabolite–metabolite association networks considering serum and plasma fractions extracted from different as well as from the same blood specimen.

Comparison of Reconstructed Networks from Plasma and Serum from Different Blood Specimens

Using the samples collected within studies Ia and Ib (864 and 994 samples, respectively) we built metabolite association networks using the four described methods. The networks are shown in Figure (panels A and B for plasma and serum, respectively). It can be observed that for both plasma and serum the networks obtained using different methods show different topology. Figure A shows the relationship between the node degree (i.e., the number of connecting metabolites) for metabolites in serum and plasma as inferred from different methods.

Figure 1

Figure 2

Scatter plot of metabolite degree (connectivity) observed in the plasma and serum networks reconstructed with different methods. (A) Serum and plasma networks reconstructed from different blood samples (studies Ia and Ib, NMR data, 29 metabolites). (B) Serum and plasma networks reconstructed from the same blood samples (Study II, MS data, 122 metabolites). (CLR denotes Context Likelihood of Relatedness, ARACNE is Algorithm for the Reconstruction of Accurate Cellular Networks, PCLRC is Probabilistic Context Likelihood of Relatedness on Correlations, and CORR is Pearson's correlation.)

(A) Plasma metabolites association networks obtained using the four different methods. (B) Serum metabolites association networks obtained using the four different methods. (C) Consensus association network for serum and plasma. Data from studies Ia (plasma) and Ib (serum) (NMR, 29 metabolites from different blood specimens). (CLR denotes Context Likelihood of Relatedness, ARACNE is Algorithm for the Reconstruction of Accurate Cellular Networks, PCLRC is Probabilistic Context Likelihood of Relatedness on Correlations, and CORR is Pearson's correlation.) Scatter plot of metabolite degree (connectivity) observed in the plasma and serum networks reconstructed with different methods. (A) Serum and plasma networks reconstructed from different blood samples (studies Ia and Ib, NMR data, 29 metabolites). (B) Serum and plasma networks reconstructed from the same blood samples (Study II, MS data, 122 metabolites). (CLR denotes Context Likelihood of Relatedness, ARACNE is Algorithm for the Reconstruction of Accurate Cellular Networks, PCLRC is Probabilistic Context Likelihood of Relatedness on Correlations, and CORR is Pearson's correlation.) Within each data set, the association networks among metabolites inferred using different methods have inherently different topology: This means that edges between the same nodes can be present in a network and absent in another and vice versa. It can be argued that different connectivity patterns arise because of the arbitrary choice imposed on the weighted adjacency matrices, but it can be shown[34] that by varying the thresholds it is not possible to transform the connectivity matrix obtained with one method into another. A pragmatic approach to arrive at serum and plasma specific association networks is to take a so-called wisdom of crowds approach, that is, aggregating the results of the four methods, as suggested in the comparative study from the DREAM challenge.[40]Figure A–D shows scatter plots of the serum metabolite degree versus the plasma metabolite degree as a function of the number of methods considered to obtain a consensus: The best agreement is obtained, in this case of NMR data (studies Ia and Ib), when the consensus network (see eq ) is obtained by considering the results of at least three methods; in this case, the correlation between node degree in the two matrices is r = 0.6 (Pearson’s correlation, P value <10–6). Consensus networks are presented in Figure C. However, some differences in the topology of the networks still remain that are worthy of comment: Metabolite degrees are given in Supporting Table S1. Consistent changes in metabolite connectivity can be observed: Alanine, citrate, and proline are disconnected in the serum network but not in the plasma network; conversely, VLDL is disconnected in plasma but not in serum. Other metabolites, like NAG1, creatine, valine, and formate, are densely connected in serum but not in plasma, indicating a substantially differential behavior.

Figure 3

Metabolite degree (connectivity) observed in the consensus plasma and serum networks reconstructed by combining the results of different network inference methods. (A–D) Networks obtained from different blood specimens (Studies Ia and Ib, NMR data, 29 metabolites) using the consensus of 1, 2, 3, and 4 methods, respectively. The best agreement between metabolites in serum and plasma is obtained when the consensus of three methods is taken (r = 0.6, panel C). (E–H) Networks obtained from different blood specimens (Study II, FIA-MS/MS data, 122 metabolites) using the consensus of 1, 2, 3, and 4 methods, respectively. The best agreement between metabolites in serum and plasma is obtained when the consensus of two methods is taken (r = 0.96, panel F). It can again be argued that different connectivity patterns arise because of the arbitrary choice imposed on the weighted adjacency matrices: As shown in Figure , it is not possible to transform the serum network into the plasma one (or vice versa) by varying the thresholds τ (see eq ) imposed on the weighted adjacency matrix.

Figure 4

Transformation of the serum network into the plasma network (or vice versa) by varying the thresholds imposed on the weighted adjacency matrices. The Frobenius norm is used to assess differences between the connectivity matrices. The Frobenius norm is always larger than 0; therefore, it can be concluded that it is not possible to transform one network into the other: The metabolites association networks obtained from samples in studies Ia and Ib (NMR, 29 metabolites, plasma and serum from different blood samples) are inherently different in plasma and serum. (CLR denotes Context Likelihood of Relatedness, ARACNE is Algorithm for the Reconstruction of Accurate Cellular Networks, PCLRC is Probabilistic Context Likelihood of Relatedness on Correlations, and CORR is Pearson's correlation.)

Exploring Differences between the Plasma and Serum Data Sets

The differences observed in the plasma and serum networks obtained using NMR on studies Ia and Ib may be attributed to the two fractions being extracted from different biological specimens (see description in the Material and Methods). By performing principal component analysis (PCA) on the two large combined data sets we found separation between the plasma and serum samples (Figure A), as already noted in other studies, even when plasma and serum were extracted from the same samples.[14,41] Independent centering of the two data sets removed the separation, indicating differences in the average concentrations (Figure B): We found that 23 out of the 29 measured metabolites had lower levels in plasma with respect to serum (two-tailed t test with adjustment for unequal variances, with P value <0.0001 after Bonferroni correction[42]), whereas for the others (formate, pyruvate, alanine, threonine, and oxoglutarate) the concentrations were significantly higher in plasma than in serum (results are summarized in Supporting Table S2). Only 3-hydroxybutyrate showed no significant difference between the two data sets. However, given the large sample size, the analysis may be overpowered and the result, although statistically significant, may not be also biologically relevant because these may be trivial effects.[43−45] Moreover, the difference observed in the networks originates from a multivariate data analysis approach, which does not necessarily reflect differences observed in the univariate analysis.[16] However, an increased level of metabolites in serum with respect to plasma has also been observed in previous studies.[15,46]

Figure 5

(A) Score plot for a PCA model on the data set obtained from union of the plasma and serum metabolite data sets from studies Ia and Ib (NMR, 29 metabolites, plasma and serum obtained from different blood specimens). The separation between the two blood fractions is evident as a result of different concentration levels of the same metabolites. (B) Score plot for a PCA model on the union of the plasma and serum metabolites (studies Ia and Ib), which have been independently centered to remove the concentration offset: The separation is removed and the two groups overlap. However, differences in the networks remain; See the text for more details. Although the two study populations are highly homogeneous being blood donor volunteers (see Study Ia and Ib description in the Materials and Methods section), we may be observing a possible batch effect, probably due to the procedures for blood withdrawal and processing performed in the two distinct clinical units. Differences in laboratory conditions, reagent and consumables lots, and personnel habits could affect the final measurements with behaviors that are unrelated to the biological or scientific variables in the study.[47] To compensate for these, we applied a correction method for the removal of batch effects based on a mixed model with simultaneous estimation of the correlation matrix[48] before re-estimating the metabolite–metabolite association networks. We observed a slight reduction in the dissimilarity between the serum and plasma networks (not shown): However, the overall topology of the two networks remained different.

Comparison of Reconstructed Networks from Plasma and Serum from the Same Blood Specimen

We sought experimental validation of the existence of a difference between the two networks using the samples obtained from study II, where serum and plasma were obtained from the same blood specimens. The resulting networks are shown in Figure (panels A and B for plasma and serum, respectively).

Figure 6

(A) Plasma metabolites association networks obtained using the four different methods. (B) Serum metabolites association networks obtained using the four different methods. (C) Consensus association network for serum and plasma. (CLR denotes Context Likelihood of Relatedness, ARACNE is Algorithm for the Reconstruction of Accurate Cellular Networks, PCLRC is Probabilistic Context Likelihood of Relatedness on Correlations, and CORR is Pearson's correlation.)

(A) Plasma metabolites association networks obtained using the four different methods. (B) Serum metabolites association networks obtained using the four different methods. (C) Consensus association network for serum and plasma. (CLR denotes Context Likelihood of Relatedness, ARACNE is Algorithm for the Reconstruction of Accurate Cellular Networks, PCLRC is Probabilistic Context Likelihood of Relatedness on Correlations, and CORR is Pearson's correlation.) When a consensus is sought using the same strategy, the overall topologies of the plasma and serum networks are very similar: The best agreement in terms of metabolite degree is obtained, in this case, when two methods are considered, as derived from the scatter plots in Figure E–H. The correlation between the metabolite degree is rather high (r = 0.96; Pearson’s correlation, P value < 10–10), indicating an almost perfect equivalence between serum and plasma metabolite association networks. This suggests that from a network analysis point of view the two fractions carry essentially the same information and can be used interchangeably. It is interesting to note that the best consensus here is obtained with two methods, while in the case of the NMR data (studies Ia and Ib, plasma and serum from different blood specimens) the best consensus was obtained with three methods. This difference may arise from the use of different analytical platforms: Data obtained using NMR and MS have different characteristics[49,50] especially concerning the error structure and its correlation, and this may hamper network inference and reconstruction,[18] leading to both false-negatives and false-positives; the latter could indeed be avoided by deploying different methods that exploit different data characteristics. We also remark here that no preprocessing, such as normalization, was applied to the data because normalization affects the correlation data structure and ultimately network inference,[51] and we wanted to avoid artifacts in the inferred networks. However, some local topological differences remain that are worthy of further investigation, as in the case of amino acids.

Comparison of Plasma and Serum Amino Acids Association Networks

The array of metabolites measured on the same study spans a different class of compounds than those measured using NMR, but 11 amino acids have been measured in both studies I and II (see Table ), and the association networks can be directly compared across the two situations, where serum and plasma have been obtained from different and from the same blood specimens. The networks are shown in Figure (panels A and B different specimens; panels C and D same specimens).

Figure 7

Association network for serum and plasma amino acids obtained from the different blood specimens (networks A and B, Studies Ia and Ib, NMR data) and from the same blood specimens (networks C and D, Study II, FIA-MS/MS data, 122 metabolites). Note that here Leu (= xLeu) refers to the sum of leucine and isoleucine).(E) Consensus networks between plasma and serum extracted from different specimens (Studies Ia and Ib). (F) Consensus networks between plasma and serum extracted from the same specimens (Study II). (G) Plasma consensus networks among different analytical platforms. (H) Consensus networks between plasma and serum extracted from different specimens (Study I). In general, if amino acids are or share common substrates, then they can be expected to be correlated, like, for instance, proline/arginine, whose association we observed only in the plasma network derived from study Ia (NMR data) and not in the other networks. In contrast, an association between leucine and valine (that share enzymes for catabolizing the first two steps in their metabolism and that can be metabolically interconverted) is observed in all the networks. Arginine was found to be connected only in the networks obtained from study I, with several associations found in both serum and plasma networks (see consensus network shown Figure E). It is also important to note that correlations between amino acids might be a result not only of their common biosynthetic pathways and their tight regulation but also of their coordinated participation in protein synthesis or conversely a result of amino acids release after protein degradation.[52] The consensus network for plasma and serum derived from the same blood specimen given in Figure F shows a core of conserved metabolite–metabolite associations, such as the strong connectivity among proline, valine, serine, methionine, leucine (here signifying both leucine and isoleucine, which were not distinguishable), and glutamine and threonine with serine, methionine, and glutamine. Differences mostly arise in the connectivity patterns of histidine, where no association was found to be in common between the two networks: those metabolites (histidine, tyrosine, threonine, serine, phenylalanine, methionine, and glutamine). Overall, network differences related to these amino acids can be associated with differences in aminoacyl-tRNA biosynthesis pathways (P value = 2.1 × 10–11, Holm-corrected P value = 1.7 × 10–9). These results point to the hypothesis that the representation of this pathway may be not the same in the two blood fractions. It is interesting to note that aminoacyl-tRNA synthetases have been recently associated with inflammation,[53,54] which are also triggered by the blood coagulation process: In this respect the differences observed between serum and plasma could be explained. Differences in amino acid correlation patterns have been observed in different tissues, a behavior that we observed in serum and plasma.[55] The reliability of metabolite measurements was higher in serum compared with plasma samples and was good for most saturated short-and medium-chain acylcarnitines, amino acids, biogenic amines, glycerophospholipids, sphingolipids, and hexose; however, serum amino acids may become unstable,[46] and this may explain some of the differences observed in networks.

Comparison of Plasma and Serum Amino Acids Profiles in the Presence of Pathophysiological Alterations

Owing to the limited number of samples and their heterogeneity, we did not attempt to infer networks or covariance/correlation matrices for these data. Instead, samples from study III were subject to PCA to investigate possible differences in metabolites concentration patterns. The two biofluids bear significant metabolic profile similarities at the local level, as shown in Figure , where the considerable overlap of the NMR spectra of serum and plasma sample obtained from the same blood specimen is displayed. This is also confirmed by the PCA performed on the data set of the metabolites quantified in the serum and plasma of the same subjects, as shown in Figure A.

Figure 8

Superimposition of the NMR spectra (aliphatic region) of a plasma (top) and a serum (bottom) sample obtained by the same blood specimen.

Figure 9

(A) Score plot for the PCA model of the data set containing the metabolite concentrations measured in serum and plasma extracted from the same blood sample: The profiles of serum and plasma samples are very similar, as indicated by the closeness of the point representing serum and plasma profiles from the same sample. (B) Violin plot of the distribution of the distances between profiles of different subjects (plasma and serum, denoted as Interplasma and Interserum, respectively) and the distribution of the distance between the plasma and serum profile of each subject (Intra). The average Intra distance is one order of magnitude smaller than the interdistance. Data are scaled by a factor of 10–3 for better visualization.

Superimposition of the NMR spectra (aliphatic region) of a plasma (top) and a serum (bottom) sample obtained by the same blood specimen. (A) Score plot for the PCA model of the data set containing the metabolite concentrations measured in serum and plasma extracted from the same blood sample: The profiles of serum and plasma samples are very similar, as indicated by the closeness of the point representing serum and plasma profiles from the same sample. (B) Violin plot of the distribution of the distances between profiles of different subjects (plasma and serum, denoted as Interplasma and Interserum, respectively) and the distribution of the distance between the plasma and serum profile of each subject (Intra). The average Intra distance is one order of magnitude smaller than the interdistance. Data are scaled by a factor of 10–3 for better visualization. Pearson’s correlation of the first principal component for the serum and plasma is r = 0.995, P value <10–12, indicating the high similarity of plasma and serum profiles. (Similar results hold for higher order components.) Working in the PCA subspace defined by the first three components (which explain >99% of the variation in the data), we observe that the average Euclidean distance between the serum and plasma sample of each subject (∼2.5 au) is one order of magnitude smaller than the average distance between the serum (or plasma) sample of two different subjects (∼26 and ∼29 au, respectively). The distribution of the distance values is shown in the form of violin plots in Figure B. These differences are statistically significant (P value <1.3 × 10–8 for both comparisons using a Wilcoxon test). However, it is possible that this equivalence may not be observed in other types of pathophysiological alterations given the large spectrum of disease manifestation, which often results in very nonhomogeneous clinical samples. In fact, that serum and plasma may not be biologically equivalent under pathological conditions is not a new concept. For instance, they are not equivalent for what concerns inflammation markers. The clotting of blood stimulates blood cell eicosanoid biosynthesis,[56] and thus serum levels of these metabolites do not reflect physiological concentrations.[10] Moreover, results can be affected by the choice of the algorithm used. Although, here we presented an approach that should reduce the bias toward a given network inference method.

Conclusions

Networks and network analysis are being extensively used in systems biology, and they have proven to be valuable tools to investigate and understand many aspects of the complex biological machinery underlying the function of living organisms. Through a comparative approach and using well-assessed and recently developed methods for network inference, we have shown that plasma and serum metabolite networks possess the same topological characteristics. To the best of our knowledge this is the first study to address plasma and serum differences from a network analysis perspective. Our findings suggest that plasma and serum may be biologically equivalent at a global network level. Nevertheless, some local differences arise, as in the case of amino acids, which should be taken into account when analyzing, comparing, and interpreting blood metabolite association networks. However, when the plasma and serum fractions are extracted from different samples (even when samples are collected, processed, and analyzed under controlled conditions), the topological characteristics of the two networks are different. Further validation of these results should also be sought considering samples from more heterogeneous studies involving, for instance, broader age span of the participants and pathophysiological conditions. Through a standard multivariate analysis, we also observed that the difference between serum and plasma profiles obtained from the same blood specimen is on average an order of magnitude smaller than the average difference between serum/plasma samples from different blood specimen. However, this result was observed on a rather small data set and will require further validation in a larger study.

48 in total

Review 1. Ten ironic rules for non-statistical reviewers.

Authors: Karl Friston
Journal: Neuroimage Date: 2012-04-13 Impact factor: 6.556

2. Differences in metabolite profile between blood plasma and serum.

Authors: Linsheng Liu; Jiye Aa; Guangji Wang; Bei Yan; Ying Zhang; Xinwen Wang; Chunyan Zhao; Bei Cao; Jian Shi; Mengjie Li; Tian Zheng; Yuanting Zheng; Gang Hao; Fang Zhou; Jianguo Sun; Zimei Wu
Journal: Anal Biochem Date: 2010-07-23 Impact factor: 3.365

3. Lessons from the DREAM2 Challenges.

Authors: Gustavo Stolovitzky; Robert J Prill; Andrea Califano
Journal: Ann N Y Acad Sci Date: 2009-03 Impact factor: 5.691

4. Isotope correction of mass spectrometry profiles.

Authors: Günther Eibl; Katussevani Bernardo; Therese Koal; Steven L Ramsay; Klaus M Weinberger; Armin Graber
Journal: Rapid Commun Mass Spectrom Date: 2008-07 Impact factor: 2.419

5. Metabolic signatures of lung cancer in biofluids: NMR-based metabonomics of urine.

Authors: Joana Carrola; Cláudia M Rocha; António S Barros; Ana M Gil; Brian J Goodfellow; Isabel M Carreira; João Bernardo; Ana Gomes; Vitor Sousa; Lina Carvalho; Iola F Duarte
Journal: J Proteome Res Date: 2010-11-23 Impact factor: 4.466

6. Weighted correlation network analysis (WGCNA) applied to the tomato fruit metabolome.

Authors: Matthew V DiLeo; Gary D Strahan; Meghan den Bakker; Owen A Hoekenga
Journal: PLoS One Date: 2011-10-21 Impact factor: 3.240

7. Differences between human plasma and serum metabolite profiles.

Authors: Zhonghao Yu; Gabi Kastenmüller; Ying He; Petra Belcredi; Gabriele Möller; Cornelia Prehn; Joaquim Mendes; Simone Wahl; Werner Roemisch-Margl; Uta Ceglarek; Alexey Polonikov; Norbert Dahmen; Holger Prokisch; Lu Xie; Yixue Li; H-Erich Wichmann; Annette Peters; Florian Kronenberg; Karsten Suhre; Jerzy Adamski; Thomas Illig; Rui Wang-Sattler
Journal: PLoS One Date: 2011-07-08 Impact factor: 3.240

8. MetaboAnalyst 3.0--making metabolomics more meaningful.

Authors: Jianguo Xia; Igor V Sinelnikov; Beomsoo Han; David S Wishart
Journal: Nucleic Acids Res Date: 2015-04-20 Impact factor: 16.971

9. Targeted metabolomics identifies reliable and stable metabolites in human serum and plasma samples.

Authors: Michaela Breier; Simone Wahl; Cornelia Prehn; Marina Fugmann; Uta Ferrari; Michaela Weise; Friederike Banning; Jochen Seissler; Harald Grallert; Jerzy Adamski; Andreas Lechner
Journal: PLoS One Date: 2014-02-24 Impact factor: 3.240

Review 10. A metabolomic perspective on coeliac disease.

Authors: Antonio Calabrò; Ewa Gralka; Claudio Luchinat; Edoardo Saccenti; Leonardo Tenori
Journal: Autoimmune Dis Date: 2014-02-09

17 in total

1. An Integrated Gaussian Graphical Model to evaluate the impact of exposures on metabolic networks.

Authors: Jai Woo Lee; Erika L Moen; Tracy Punshon; Anne G Hoen; Delisha Stewart; Hongzhe Li; Margaret R Karagas; Jiang Gui
Journal: Comput Biol Med Date: 2019-08-31 Impact factor: 4.589

Review 2. From correlation to causation: analysis of metabolomics data using systems biology approaches.

Authors: Antonio Rosato; Leonardo Tenori; Marta Cascante; Pedro Ramon De Atauri Carulla; Vitor A P Martins Dos Santos; Edoardo Saccenti
Journal: Metabolomics Date: 2018-02-27 Impact factor: 4.290

3. Vibrational Spectroscopic Investigation of Blood Plasma and Serum by Drop Coating Deposition for Clinical Application.

Authors: Jing Huang; Nairveen Ali; Elsie Quansah; Shuxia Guo; Michel Noutsias; Tobias Meyer-Zedler; Thomas Bocklitz; Jürgen Popp; Ute Neugebauer; Anuradha Ramoji
Journal: Int J Mol Sci Date: 2021-02-22 Impact factor: 5.923

4. Multi-omic signatures of atherogenic dyslipidaemia: pre-clinical target identification and validation in humans.

Authors: Mariola Olkowicz; Izabela Czyzynska-Cichon; Natalia Szupryczynska; Renata B Kostogrys; Zdzislaw Kochan; Janusz Debski; Michal Dadlez; Stefan Chlopicki; Ryszard T Smolenski
Journal: J Transl Med Date: 2021-01-06 Impact factor: 5.531

5. Exploration of Blood Lipoprotein and Lipid Fraction Profiles in Healthy Subjects through Integrated Univariate, Multivariate, and Network Analysis Reveals Association of Lipase Activity and Cholesterol Esterification with Sex and Age.

Authors: Yasmijn Balder; Alessia Vignoli; Leonardo Tenori; Claudio Luchinat; Edoardo Saccenti
Journal: Metabolites Date: 2021-05-18

6. Plasma methionine metabolic profile is associated with longevity in mammals.

Authors: N Mota-Martorell; M Jové; R Berdún; R Pamplona
Journal: Commun Biol Date: 2021-06-11

7. ¹H Nuclear Magnetic Resonance of Pig Seminal Plasma Reveals Intra-Ejaculate Variation in Metabolites.

Authors: Yentel Mateo-Otero; Pol Fernández-López; Sergi Gil-Caballero; Beatriz Fernandez-Fuertes; Sergi Bonet; Isabel Barranco; Marc Yeste
Journal: Biomolecules Date: 2020-06-15

Review 8. High-Throughput Metabolomics by 1D NMR.

Authors: Alessia Vignoli; Veronica Ghini; Gaia Meoni; Cristina Licari; Panteleimon G Takis; Leonardo Tenori; Paola Turano; Claudio Luchinat
Journal: Angew Chem Int Ed Engl Date: 2018-11-11 Impact factor: 15.336

9. Differential Network Analysis Reveals Metabolic Determinants Associated with Mortality in Acute Myocardial Infarction Patients and Suggests Potential Mechanisms Underlying Different Clinical Scores Used To Predict Death.

Authors: Alessia Vignoli; Leonardo Tenori; Betti Giusti; Serafina Valente; Nazario Carrabba; Daniela Balzi; Alessandro Barchielli; Niccolò Marchionni; Gian Franco Gensini; Rossella Marcucci; Anna Maria Gori; Claudio Luchinat; Edoardo Saccenti
Journal: J Proteome Res Date: 2020-01-17 Impact factor: 4.466

10. A Comparison of Serum and Plasma Blood Collection Tubes for the Integration of Epidemiological and Metabolomics Data.

Authors: Jennie Sotelo-Orozco; Shin-Yu Chen; Irva Hertz-Picciotto; Carolyn M Slupsky
Journal: Front Mol Biosci Date: 2021-07-08