Literature DB >> 25160569

A normative study of the synovial fluid proteome from healthy porcine knee joints.

Tue Bennike1, Ugur Ayturk, Carla M Haslauer, John W Froehlich, Benedikt L Proffen, Omar Barnaby, Svend Birkelund, Martha M Murray, Matthew L Warman, Allan Stensballe, Hanno Steen.   

Abstract

Synovial fluid in an articulating joint contains proteins derived from the blood plasma and proteins that are produced by cells within the joint tissues, such as synovium, cartilage, ligament, and meniscus. The proteome composition of healthy synovial fluid and the cellular origins of many synovial fluid components are not fully understood. Here, we present a normative proteomics study using porcine synovial fluid. Using our optimized method, we identified 267 proteins with high confidence in healthy synovial fluid. We also evaluated mRNA expression data from tissues that can contribute to the synovial fluid proteome, including synovium, cartilage, blood, and liver, to better estimate the relative contributions from these sources to specific synovial fluid components. We identified 113 proteins in healthy synovial fluid that appear to be primarily derived from plasma transudates, 37 proteins primarily derived from synovium, and 11 proteins primarily derived from cartilage. Finally, we compared the identified synovial fluid proteome to the proteome of human plasma, and we found that the two body fluids share many similarities, underlining the detected plasma derived nature of many synovial fluid components. Knowing the synovial fluid proteome of a healthy joint will help to identify mechanisms that cause joint disease and pathways involved in disease progression.

Entities:  

Keywords:  PTM; Synovial fluid; human; method optimization; origin; plasma; porcine; proteomics; synovium; transcriptomics

Mesh:

Substances:

Year:  2014        PMID: 25160569      PMCID: PMC4184458          DOI: 10.1021/pr500587x

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


Introduction

Synovial fluid is present in all joint cavities, where it protects the articular cartilage surfaces, in part by reducing friction. Synovial fluid, furthermore, facilitates the transport of nutrients and waste products including proteins and metabolites between the vascularized synovium and the avascular cartilage.[1−3] Many components of synovial fluid are derived from blood plasma, and these two body fluids share many similarities in terms of protein composition.[4,5] However, synovial fluid also contains proteins secreted from the surrounding tissue, including the articular cartilage and synovium.[6] The protein concentration in synovial fluid from healthy knee joints is approximately 25 mg/mL, i.e., ∼1/3 of the concentration found in blood plasma, and albumin constitutes approximately 12 mg/mL.[1,3,7−10] Joint diseases, in particular osteoarthritis (OA) and rheumatoid arthritis (RA), are the leading cause of disability in people over 55 years.[6] It has been estimated that, as of 2005, 27 million adults in the United States have clinical OA, and in 2009, OA was the fourth most common cause of hospitalization.[11,12] Furthermore, joint injuries, such as anterior cruciate tear, that predispose to precocious joint failure have become epidemic in young athletes.[13,14] Although changes in the composition of synovial fluid have been described in patients with joint disease, there do not exist reliable biomarkers for early disease diagnosis or biomarkers that accurately depict response to therapy.[1,15−17] As a consequence, OA is often not diagnosed before irreversible damage has occurred.[15,18,19] Since synovial fluid is in direct contact with the joint tissues, it provides an attractive source of biomarkers candidates for monitoring joint health and for furthering the understanding of the disease mechanisms.[3,18] Previous studies of synovial fluid have mainly been focused on various diseased states. However, the protein concentration, content, and volume of synovial fluid is known to change dramatically during active joint diseases, and few studies have been focused on synovial fluid in healthy state or the likely origins of the synovial fluid proteins.[3,17] Balakrishnan et al.[20] compared the human synovial fluid proteome from OA and RA patients. Synovial fluid was immune-depleted of the most abundant proteins, followed by liquid chromatography–tandem mass spectrometry (LC–MS/MS) analysis, which led to the identification of 575 proteins, of which 135 demonstrated a greater than 3-fold abundance change between the two groups. In a similar study, Mateos et al.[1] identified a total of 136 different proteins using two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) protein separation, followed by in-gel trypsin digestion and protein identification by LC–matrix-assisted laser desorption/ionization time-of-flight (MALDI TOF/TOF) MS. When assessing the total number of different proteins to be expected in synovial fluid, analyzing 2D-PAGE images seems to be a valid approach. Chen et al.[10] analyzed synovial fluid from patients with joint swelling using several precipitation strategies and detected 456 protein spots on 2D-PAGE images. Smith et al.[21] detected 1000 protein spots on 2D-PAGE images of synovial fluid from RA patients, which was estimated to represent 300 individual proteins. However, neither of the two latter studies identified the proteins within the protein spots. 2D-PAGE in-gel digestion strategies, while highly informative, are not suitable for high-throughput proteome analyses.[19] Therefore, we investigated the use of in-solution proteomics strategies with the aim to increase high-throughput protein identification and quantitation. In this normative proteomics study of synovial fluid from healthy porcine knee joints, we report results from applying our optimized method, using porcine synovial fluid. Domesticated or minipigs are a highly relevant anatomically large animal model organism to study several human diseases, including acute joint injury, OA, and other inflammatory disease.[22−26] The homology between the porcine genome and the human genome is conserved to a much greater extent than that between human and mouse or other rodents.[27] Furthermore, the porcine model has previously been used to study anterior cruciate ligament injury and surgical and tissue engineering approaches for healing.[28−30] We also report mRNA sequence data from healthy porcine synovium that we used to estimate the contribution to the synovial fluid proteome from this tissue, and we compare this to the relative contribution from plasma (e.g., proteins secreted into plasma by the liver or by blood cells) and articular cartilage.

Experimental Procedures

Collection of Synovium and Synovial Fluid Samples

Six adolescent Yucatan minipigs (Coyote CCI, Douglas, MA), aged 12–15 months, were obtained for use in this study. All minipigs were housed and monitored by the Animal Resources at Boston Children’s Hospital (ARCH) and handled according to approved Institutional Animal Care and Use Committee (IACUC) protocols. Minipigs were acclimated to the ARCH environment for a minimum of 3 days prior to experimental handling. Nonbloody synovial fluid was obtained from the joints by aspiration using a 21 gauge needle. Samples were centrifuged at 3000g at room temperature for 10 min to pellet and remove cells and cellular debris. In some cases, 3 mL of sterile saline was injected into the knee joint to facilitate fluid extraction; after saline injection the knee was bent 10 times to ensure homogeneous fluid distribution and mixing. The saline/synovial fluid mix was then processed as above. Following centrifugation, the supernatants were stored at −80 °C. Furthermore, a human synovial fluid sample was obtained from a RA patient according to an approved IRB protocol (IRB-P00006443) to evaluate the integrity of the UniProt Sus scrofa protein database. Euthanasia of the animals was induced by intramuscular injection of atropine (0.04 mg/kg), Telazol (4.4 mg/kg), and xylazine (2.2 mg/kg) and finalized by intravenous injection of Fatal Plus (86 mg/kg). At the time of euthanasia, synovia from the knee joints of the hind limbs were harvested. Care was taken to sample only the synovial membrane without any subintimal structures, such as fat or blood vessels. Each tissue specimen was snap frozen in liquid nitrogen and stored at −80 °C.

Protein Concentration

Total protein concentration for each sample (diluted 1:30 in water) was determined for normalization of sample material using a colorimetric (Bradford) protein assay kit (Bio-Rad, Hercules, CA) according to the manufacturer’s instructions, with bovine serum albumin used as the standard.

SDS-PAGE

Thirty micrograms of total synovial fluid protein was prepared for sodium dodecyl sulfate (SDS)-PAGE in Laemmli sample buffer (Bio-Rad, Hercules, CA) according to the manufacturer’s instructions. SeeBlue Plus2 pre-stained standard (Invitrogen, Carlsbad, CA) was used as the protein molecular weight standard. The sample was fractionated using NuPAGE 4–12% Bis-Tris minigels (Invitrogen) at 150 V for 65 min in MOPS SDS-running buffer (Invitrogen). The gel was stained using Coomassie blue, SimplyBlue SafeStain (Invitrogen), according to manufacturer’s instructions.

Synovial Fluid Protein Digestion

Three trypsin digest protocols were evaluated:

Filter-Aided Sample Preparation (FASP) Digestion

Performed using the FASP protein digestion kit (Protein Discovery, San Diego, CA) according to manufacturer’s instructions using 30 kDa cutoff spin filters. Ninety micrograms of total synovial fluid protein was digested overnight at 37 °C with 2 μg of sequencing grade modified trypsin (Promega, Fitchburg, MA). To assess the need of glycan removal when working with synovial fluid, 500 U peptide-N4-(N-acetyl-beta-glucosaminyl)-asparagine amidase (PNGase F) (New England BioLabs, Ipswich, MA) was added to these samples prior to the trypsin digestion step, and samples were incubated overnight at 37 °C, after which the normal FASP protocol was continued. After trypsin digestion, the samples were desalted with TARGA C18 columns (Nest Group, Southborough, MA) and resuspended in 5% acetonitrile (ACN) and 5% formic acid (FA) prior to analysis.

Urea In-Solution Digestion

Performed according to Gallien et al.[31] Ninety micrograms of total synovial fluid protein was diluted with 8 M urea in 100 mM ammonium bicarbonate to a final volume of 25 μL. The sample was reduced with dithiothreitol at a final concentration of 12 mM for 30 min at 37 °C and alkylated with iodoacetamide at a final concentration of 40 mM for 1 h at room temperature in the dark. The samples were diluted with 100 mM ammonium bicarbonate to a total volume of 100 μL, 2 μg of trypsin was added, and the sample was digested overnight at 37 °C. The samples were desalted with TARGA C18 columns (Nest Group) and resuspended in 5% ACN, 5% FA prior to analysis.

In-Gel Digestion

Three gel-lanes, each loaded with 150 μg total synovial fluid protein, were divided into 10 sections each and subjected to standard in-gel tryptic digestion as previously described,[32−34] followed by analysis.

Human Plasma Protein Digestion

Human plasma was acquired as part of an ongoing method optimization study using a deidentified, discarded cord plasma sample and thus are not considered research of human subjects. One hundred micrograms of plasma protein was digested using the FASP protein digestion kit (Protein Discovery, San Diego, CA) with a modification to the recommended protocol. This modification involved the use of a 10 kDa MWCO filter instead of the stock 30 MWCO filters. A trypsin/LysC mix (Promega, Madison, WI) was added to the FASP filter at 1:25 ratio for digestion. The samples were then incubated at 37 °C overnight, and the resulting peptides were recovered as recommended by the manufacturer’s protocol. The peptides were desalted using Oasis HLB columns (Waters, Milford, MA) and resuspended in 2% ACN in 0.1% FA prior to analysis.

LC–MS/MS Measurement and Proteomics Data Analysis

Two different high-resolution/high-accuracy mass spectrometer systems were used for the shotgun proteomic analysis: (1) For post-translational modification (PTM) analysis and method optimization, synovial fluid samples were analyzed on a TripleTOF 5600 (AB Sciex, Framingham, MA) connected online with a nanoflow UPLC and a NanoFlex system (Eksigent/AB Sciex). The samples were loaded onto a 15 cm reversed-phase C18 200 μm chip with 2 μL/min in 100% solvent A (0.1% FA). The samples were then separated using a 15 cm reversed-phase C18 75 μm chip and eluted with a linear gradient of 2% solvent B (0.1% FA in ACN), which was raised to 35% solvent B over 120 min (60 min for in-gel digested samples) at a constant flow rate of 500 nL/min. (2) The six FASP digested synovial fluid samples used to determine the synovial fluid protein list and abundances as well as the trypsin-digested human plasma samples were analyzed on a Q Exactive (Thermo Scientific, Waltham, MA) connected online to an EASY-nUPLC 1000 (Thermo Scientific). The samples were loaded onto a 10.5 cm reversed-phase C18 PicoChip with a flow rate of approximately 1 μL/min in 98% solvent A and 2% solvent B and were eluted with eluent B using a linear gradient that was raised to 35% over 120 min at a constant flow rate of 300 nL/min. The AB Sciex.wiff data files were analyzed using ProteinPilot 4.5 (rev. 1656, Paragon Algorithm 4.5.0.0). To identify the most commonly single observed PTMs, data files were searched in thorough mode with a focus on biological modifications in ProteinPilot to include more than 300 different PTMs. The .raw data files from synovial fluid and human plasma analyzed on the Q Exactive were searched using MaxQuant 1.4.1.2.[35] All standard settings were employed with carbamidomethyl (C) as a static modification and deamidation (NQR), oxidation (M), and protein N-terminal acetylation included as variable modifications. Label-free quantitation of all proteins was performed in MaxQuant based on integrated precursor intensities. Protein abundances are represented as protein intensity-based absolute quantitation values (iBAQ) and are reported for all proteins having at least two quantifiable unique peptides in at least three LC–MS runs.[36] The human plasma sample was searched against the UniProt Homo sapiens reference proteome database with isoforms (downloaded 7/18/2014, containing 89 032 entries). The porcine synovial fluid data was searched against the UniProt S. scrofa reference proteome database (downloaded 11/09/2013, containing 26 070 entries). The human RA synovial fluid, used to evaluate the UniProt S. scrofa database, was searched against all reviewed H. sapiens UniProt proteins (downloaded 08/10/2013, containing 20 277 entries). All proteins and peptides are reported below a 1% false discovery rate (FDR) cutoff, and protein posterior error probability (PEP, equivalent to expectancy) was investigated to ensure only confident protein identifications.[35] For the PTM analysis, the search results were analyzed using ProteinPilot Descriptive Statistics Template, version 3.001, and for the protein abundance analysis, the iBAQ values were analyzed using Perseus, version 1.4.1.3, and IBM SPSS Statistics (version 21). Venn diagrams were created with BioVenn[37] and Venny.[38]

Assignment of Formerly Glycosylated Asparagine Residues

Four criteria were required to assign N-glycosylation sites: (I) a 1% FDR cutoff to all peptide spectral matches (PSMs); (II) all site assignments required the presence of a consensus site (CS) for N-glycosylation, i.e., NX(S/T), where X may be any amino acid except proline; (III) once CS status was established for all peptide assignments, an asparagine deamidation at the asparagine within the CS was required; and (IV), finally, all true site assignments were required to come from sample preparations that were treated with PNGase F. The FDR of site assignment was estimated by evaluation of the random rate of site assignment among control samples that were not treated with PNGase F. In this way, the rate of PSMs leading to the identification of a deglycosylated peptide may be compared.

RNA Extraction

Total RNA was extracted from frozen synovium tissue using the PureLink RNA mini kit (Ambion, Austin, TX), treated with PureLink DNase I (Life Technologies, Grand Island, NY) according to the manufacturer’s protocol, and quantified. Briefly, frozen tissue samples were placed in tubes containing 5 metal lysing beads (Metal Bead Lysing Matrix, MP Biomedicals, Solon, OH) and 1 mL of TRIzol (Life Technologies). Tissue was homogenized using a FastPrep24 (MP Biomedicals, Solon, OH) at 6 m/s for 40 s. This was repeated two additional times, with samples briefly placed in liquid nitrogen between homogenization runs. Following total RNA extraction and quantitation via spectrophotometry using a Nanodrop 1000 (Thermo Scientific, Wilmington, DE), samples were submitted for quality analysis on a Bioanalyzer model 2100 (Agilent Technologies, Santa Clara, CA). Each RNA sample had an RIN > 7.20, indicating that they were of sufficient quality for sequencing.[39]

Library Preparation

A mRNA library for each tissue sample was prepared as previously described using the TruSeq RNA sample preparation kit, v2 (Illumina, San Diego, CA).[40] Briefly, mRNA was enriched from total RNA, chemically fragmented, reverse-transcribed using random hexamers, and ligated to barcoded adapters per the manufacturer’s instructions. The cDNA fragments were amplified via PCR, and the libraries were washed with AMPure XP beads (Beckman Coulter Inc., Danvers, MA) to remove primer dimers. One microliter aliquots from each library were run on a 4–20% TBE gel (Life Technologies, Grand Island, NY) for verification. Equal amounts of DNA from separately barcoded cDNA libraries were pooled (n = 12 per lane) and sent for 50 base pair paired-end sequencing on an HiSeq 2000 (Illumina).

RNA-Seq Data Analysis

Reads were mapped to the pig genome (Susscr3) using RUM.[41] Known genes were annotated with an R script based on data available from ENSEMBL. Expression level of each gene was quantified by the reads per kilobase of exon per million mapped reads (RPKM). The repeatability of the RNA-seq data was measured by comparing libraries generated from the right and left knees of the same animal and calculating the Pearson correlation coefficient (R2) with respect to the genome-wide RPKM values. The calculated RPKM values were averaged across 12 samples for each gene, and these mean values were used in ranking the genes with respect to their expression levels. In order to estimate the likely source of proteins detected in the synovial fluid, we additionally reviewed mouse liver,[42] mouse blood,[40] and human articular cartilage (authors’ unpublished data) RNA-seq data. On the basis of homology information retrieved from the ENSEMBL database, we matched average RPKM values for each transcript with the proteins detected in the synovial fluid, and we identified the proteins for which all four mRNA expression values were available. We then determined the subset of proteins with a signal peptide. Among these proteins, we identified those with at least 2-fold higher mRNA levels in the synovium compared to all of the other individual tissues, and we then performed the same calculations for liver, cartilage, and blood.

Results and Discussion

Optimizing Preparation and Digestion Methods for Synovial Fluid

Initially, we determined whether precipitating the synovial fluid proteins increased the diversity of the detectable protein bands by Coomassie-stained SDS-PAGE. However, similar to Chen et al.,[10] we found that precipitation appeared to reduce polypeptide diversity rather than increase it (data not shown); thus, we chose not to use precipitated samples for our subsequent analyses. Likewise, because albumin can function as a protein carrier for a wide range of proteins, we chose not to deplete albumin or other abundant proteins from our samples to avoid changing protein abundance profiles and removing important proteins that might bind the depletion targets.[1,10,43−45] Zhou et al.[46] reported that removing albumin led to the depletion of many proteins in human serum, including clinically useful biomarkers. We compared urea in-solution digestion to FASP to find the most efficient digestion method without prior prefractionation for synovial fluid and used the robust and proven in-gel digestion as a reference. Urea in-solution digestion yielded the lowest number of 127 identified proteins. In contrast, FASP and in-gel-based digestion each yielded 227 protein identifications, indicating similar performance of the two digestion techniques. However, 170 of the 227 identified proteins were found in at least two of the three synovial fluid samples when using the FASP protocol in contrast to only 145 of the 227 proteins in the case of the in-gel digestion protocol (Figure 1). Furthermore, despite substantial overlap among the data sets, roughly 26% of proteins found in at least two of the three synovial fluid samples with FASP were missed by in-gel digestion, and 12% of the in-gel digestion proteins were missed with FASP. Additionally, 17% of the proteins unique to the in-gel digestion were keratins, supporting the notion that in-gel digestion protocols can be prone to keratin contamination. Besides this difference, no other bias was found with regard to which groups of proteins were identified uniquely with the different digestion methods. This indicates that the FASP protocol gives more consistent data than the in-gel digestion protocol. It is possible that depletion strategies can increase the number of identified proteins further; however, this was not investigated in this study.
Figure 1

Digestion method evaluation. Number of proteins uniquely identified in at least two of the three synovial fluid samples, using FASP digestion, urea in-solution digestion, and in-gel digestion.

Digestion method evaluation. Number of proteins uniquely identified in at least two of the three synovial fluid samples, using FASP digestion, urea in-solution digestion, and in-gel digestion. The UniProt S. scrofa reference proteome is not as well annotated as the human proteome. Therefore, in order to determine whether our proteomic method is as sensitive for detecting pig synovial fluid proteins as it would be for human synovial fluid proteins, we performed the same FASP digestion protocol on a synovial fluid sample from a RA patient. In the four technical replicates, we identified, on average, 173 different proteins in the human sample, which seems to be comparable to the 179 different proteins found, on average, in the individual porcine synovial fluid FASP LC–MS runs. The protein composition of synovial fluid is known to be altered during active joint diseases, so the protein overlap was not investigated.[3,17] Nonetheless, the similar number of identified proteins with the two databases indicates that the UniProt S. scrofa reference proteome is adequate for the analysis of the porcine synovial fluid.

Importance of Accounting for Post-Translational Modification in Protein Identification

Many PTMs occur in vivo or are introduced during sample preparation prior to MS analysis in vitro.[47−50] Therefore, to identify which PTMs should be considered for the database search of MS data from FASP-digested synovial fluid samples, we analyzed the identified peptides from this method in ProteinPilot Descriptive Statistics Template (Table 1). Several PTMs were identified, and all were likely artifacts from the sample preparation. Chloroacetamide can be used as an alternative alkylating agent to iodoacetamide to reduce the number of alkylation related artifacts.[47] However, due to the low number of detected peptides with these artificial modifications, this was not investigated further.
Table 1

Most Commonly Observed PTMs of the FASP-Digested Synovial Fluid Samples Based on the Most Frequent Single Features

rankmodification and position (N-terminal; C-terminal; acceptor amino acid)modification mass (Da)no. MS/MS eventsmodified sites of possible (%)
1Carbamidomethyl (C)57.021513 201100.0
2Deamidated (N)0.984010248.7
3Acetyl at N-term42.01065632.9
4Carbamidomethyl at N-term57.02155352.7
5Glu → pyro-Glu at N-term–18.010652224.1
6Deamidated (Q)0.98404402.9
7Carbamyl (K)43.00584342.0
8Carbamidomethyl (K)57.02154201.9
9Carbamyl at N-term43.00584162.1
Synovial fluid is known to be rich in glycoproteins, which prompted us to evaluate the effect of removing N-linked glycans on the number of identifiable peptides and proteins.[2] We compared two synovial fluid samples using the FASP protocol, with and without deglycosylation by PNGase F (±). As expected, PSMs resulting in identification of deglycosylated peptides considered to be previously N-glycosylated was increased ∼40-fold in the PNGase F-treated samples in comparison to that of the untreated samples (Table 2). Twenty-eight and 34 unique deglycosylated peptides in the biological repeats were identified using the four criteria described in the Experimental Procedures section, compared to only two peptides in each of the untreated samples (Supporting Information Table 1). However, all deglycosylated peptides belonged to proteins that had already been identified from other peptides and thus resulted in no new protein identifications. Twenty-eight and 34 unique deglycosylated peptides are relatively low numbers for an analysis of a biological fluid known to be rich in glycoproteins, which may be due to a number of factors: First, the proteome of synovial fluid may not be particularly complex. This hypothesis is consistent with the proteomics data we observed in this study and with previously reported 2D-PAGE analyses.[1,10,17] Second, N-glycosylation, which is the only glycosylation amenable to PNGase F treatment protocols, may not be highly prevalent in synovial fluid. Synovial fluid contains significant amounts of glycosaminoglycans, possibly lessening the biologic necessity for N-glycosylation.[51,52] Third, the finite dynamic range of the MS analysis might cause abundant peptides to hinder the detection of less abundant ones. A study that exclusively focuses on O-glycosylation in synovial fluid and combines data from depleted and nondepleted samples will likely identify more extensive glycosylation than that reported here.
Table 2

Summary of Overall Numbers of PSMs in the Deglycosylation Experiments

sampletotal no. of PSMsPSM identifying deglycosylated peptidesdeglycosylated peptide PSMs/all PSMs (%)
+ PNGase F 14827911.9
+ PNGase F 24878861.8
– PNGase F 1406830.07
– PNGase F 2437220.05

Identifying the Transcriptome of Healthy Pig Synovium

In order to identify synovial fluid proteins that are secreted by the synovium cells, i.e., transcribed and translated, we generated sequencing data of mRNA as active transcripts for pig synovium. We prepared and sequenced knee synovium from the right and left legs of 6 pigs. On average, we obtained 12.4 million sequencing reads per library, of which 90% mapped to the pig genome and 81% aligned uniquely. We then calculated RPKM values for individual genes and compared their expression level within each individual animal’s left and right knee (Figure 2a). Intra-animal Pearson correlation coefficients exceeded 0.85, indicative of reproducible data (Figure 2d). The top 20 abundant protein-coding mRNAs that did not originate from the ribosome or mitochondria exhibited minimal variation in abundance, as they ranked in the top 1% of all sample-specific RNA-seq data sets that we analyzed (Figure 2c).
Figure 2

RNA-seq of pig synovium is repeatable and indicates high expression of several transcripts encoding secreted proteins. (a) Table indicating the total number of reads and percentages of successfully mapped reads. More than 90% of reads on average are mapped to the pig genome (Susscr3), and more than 80% of reads are mapped uniquely. (b) Top 20 protein coding genes (based on expression level measured with RPKM) that do not originate from the mitochondria or the ribosome. (c) Box plot indicating the variation in the expression-based rank order of genes listed in (b) in all 12 libraries analyzed. The highest variation is in the rank of PRG4; however, it remains within the top 1% of genes in all libraries. (d) Scatterplots indicating high similarity (R2 > 0.85) between the synovial transcriptomes of paired left and right legs of all pigs. Data indicate RPKM, and each dot represents a single gene. Transcripts for the secreted proteins decorin (DCN) and lubricin (PRG4), red blood cell-derived beta hemoglobin (HBB), an adipocyte-derived transcription factor (PPARG), and secreted protein leptin (LEPT) are indicated with red colored symbols and closely follow the y = x line (solid red line) in each plot. Importantly, transcripts for matrix-degrading enzymes and inflammatory cytokines are not abundant, clustering in the bottom left corner of the panels.

RNA-seq of pig synovium is repeatable and indicates high expression of several transcripts encoding secreted proteins. (a) Table indicating the total number of reads and percentages of successfully mapped reads. More than 90% of reads on average are mapped to the pig genome (Susscr3), and more than 80% of reads are mapped uniquely. (b) Top 20 protein coding genes (based on expression level measured with RPKM) that do not originate from the mitochondria or the ribosome. (c) Box plot indicating the variation in the expression-based rank order of genes listed in (b) in all 12 libraries analyzed. The highest variation is in the rank of PRG4; however, it remains within the top 1% of genes in all libraries. (d) Scatterplots indicating high similarity (R2 > 0.85) between the synovial transcriptomes of paired left and right legs of all pigs. Data indicate RPKM, and each dot represents a single gene. Transcripts for the secreted proteins decorin (DCN) and lubricin (PRG4), red blood cell-derived beta hemoglobin (HBB), an adipocyte-derived transcription factor (PPARG), and secreted protein leptin (LEPT) are indicated with red colored symbols and closely follow the y = x line (solid red line) in each plot. Importantly, transcripts for matrix-degrading enzymes and inflammatory cytokines are not abundant, clustering in the bottom left corner of the panels. Focusing on transcripts encoding proteins containing signal peptides, which are most likely secreted or targeted to cell surfaces, we observed high expression of known connective tissue proteins such as decorin (DCN), fibronectin (FN1), collagen type III (COL3A1), and clusterin (CLU) (Figure 2b). Importantly, the principal lubricating protein in synovial fluid, lubricin (PRG4) made by type B synoviocytes, was also highly expressed. From these data, we produced a pig synovium transcriptome database (Supporting Information Table 2), which is composed of transcripts encoding known and predicted proteins.

Identifying the Proteome of Healthy Pig Synovial Fluid

We opted for the FASP digestion strategy for the detailed triplicate proteomic analysis of synovial fluid from 6 healthy pigs. This choice was based on the number of identified proteins, sample amount requirement, processing time, and instrument time. We identified 374 different proteins in total using the UniProt S. scrofa database and the synovium transcriptome database, and 42 proteins were identified solely in the transcriptome database. This seemed to be a large fraction assuming an adequate performance of the UniProt pig database, so all identified proteins were sorted by PEP (equivalent to expectancy[35]), with a high PEP value indicating a lower confidence in the identification (Figure 3a). Proteins unique to the transcriptome database displayed high PEP values, indicating that the number of identified proteins is too low for the global FDR calculation. To address this issue, all proteins with PEP > 1 × 10–3 were removed from further analysis, which came to 92 identified proteins, of which 36 were unique to the transcriptome database. Furthermore, we manually assessed the fragment spectra for proteins identified based on a single peptide and removed poor spectra with many unassigned high-intensity signals, which resulted in one additional protein being removed. After these two filtering steps, 267 proteins remained, of which 6 were unique to the transcriptome database (Figure 3b). Five of the 6 unique proteins were immunoglobulin lambda like proteins (UPID: IGLV-7 to IGLV-11), and the identified peptides originate from a variable region of the lambda chain, which is not annotated in the UniProt database. The remaining protein, ribonuclease 4 (UPID: I3LDZ2_PIG), was present in the UniProt database, but it had not been identified. We included several keratins (10, 14, 3, and 75) in the synovial fluid proteome because keratin-coding mRNA was detected in the articular cartilage. However, we cannot preclude some of the keratins being contaminants from sample preparation.
Figure 3

Analysis of identified synovial fluid proteins. (a) Identified proteins at 1% FDR sorted by PEP. Proteins unique to the transcriptome database (red squares) were, for the most part, identified with low confidence; hence, all proteins with PEP > 1 × 10–3 (red line) were removed from further analysis. (b) Synovial fluid proteins identified using the UniProt database and the transcriptome database fulfilling the filtering criteria. Of the 267 synovial fluid proteins, 140 had RPKM > 5 in the synovial mRNA libraries, indicating they are expressed by synovium.

Analysis of identified synovial fluid proteins. (a) Identified proteins at 1% FDR sorted by PEP. Proteins unique to the transcriptome database (red squares) were, for the most part, identified with low confidence; hence, all proteins with PEP > 1 × 10–3 (red line) were removed from further analysis. (b) Synovial fluid proteins identified using the UniProt database and the transcriptome database fulfilling the filtering criteria. Of the 267 synovial fluid proteins, 140 had RPKM > 5 in the synovial mRNA libraries, indicating they are expressed by synovium. Of the identified 267 different proteins in synovial fluid (Supporting Information Table 2), 194 proteins (73%) were identified in all pigs, indicating similar protein expression patterns between the pigs and a high method robustness. The difference in identified proteins between the pigs likely originates from biological diversity, which is to be expected, and the finite dynamic range of the MS analysis.[53] To investigate the relative protein abundances in synovial fluid, we investigated individual synovial fluid protein abundance by calculating iBAQ values.[36] The protein abundance in synovial fluid, as estimated by the 203 iBAQ quantifiable proteins, spans 5 orders of magnitude (Figure 4). As expected, albumin was the most abundant protein, and no apparent bias could be found regarding the abundances of serum-derived proteins (RPKM < 5) compared to proteins expressed by the synovium in the joint (RPKM > 5) (see below).
Figure 4

Protein abundance distribution in synovial fluid. Frequency histogram of the calculated iBAQ values for all proteomics quantifiable proteins. The relative proportion of proteins whose mRNA transcript abundances were less than or greater than 5 RPKM in the synovial membrane transcriptome are colored blue and yellow, respectively.

Protein abundance distribution in synovial fluid. Frequency histogram of the calculated iBAQ values for all proteomics quantifiable proteins. The relative proportion of proteins whose mRNA transcript abundances were less than or greater than 5 RPKM in the synovial membrane transcriptome are colored blue and yellow, respectively.

Estimating the Relative Contributions of Plasma, Synovium, and Cartilage to the Synovial Fluid Proteome

Synovial fluid is a transudate of blood that has additional components added by the surrounding joint tissue including synovium, cartilage, and, in the knee, ligaments and menisci. To determine whether a likely tissue source for a synovial fluid protein can be predicted, we compared protein expression with the pig synovium transcriptome, a human articular cartilage transcriptome we previously generated, and published mouse liver[42] and blood[40] transcriptomes (Supporting Information Table 2). As expected, albumin in synovial fluid solely derives from blood, as its transcript is highly expressed only in liver (Figure 5).
Figure 5

mRNA expression levels associated with the proteins detected with MS. (a) Tissue-specific histograms indicating the number of detected proteins corresponding to each transcript. Genes are ranked based on mRNA expression along the x axis, and the secondary y axis (in red) indicates the expression levels in RPKM (in log scale). (b) Table indicating the top 20 mRNAs in synovium with detectable protein products in synovial fluid. Gene expression data derived from published and unpublished studies on tissues with potential influence on protein expression repertoire of serum and synovial fluid are also presented in the three columns on the right. Blood and liver mRNA data originate from mice, and articular cartilage mRNA data originate from humans. (c) Table indicating the top 20 proteins with no detectable mRNA expression in the synovium: note that the majority of mRNA species corresponding to these proteins are abundantly expressed by liver cells.

mRNA expression levels associated with the proteins detected with MS. (a) Tissue-specific histograms indicating the number of detected proteins corresponding to each transcript. Genes are ranked based on mRNA expression along the x axis, and the secondary y axis (in red) indicates the expression levels in RPKM (in log scale). (b) Table indicating the top 20 mRNAs in synovium with detectable protein products in synovial fluid. Gene expression data derived from published and unpublished studies on tissues with potential influence on protein expression repertoire of serum and synovial fluid are also presented in the three columns on the right. Blood and liver mRNA data originate from mice, and articular cartilage mRNA data originate from humans. (c) Table indicating the top 20 proteins with no detectable mRNA expression in the synovium: note that the majority of mRNA species corresponding to these proteins are abundantly expressed by liver cells. When we compared the proteins detected in the synovial fluid with the mRNA expression levels in four possible sources, we found that the majority of proteins corresponded to highly expressed transcripts regardless of the source and that liver and synovium likely made the greatest contribution (Figure 5a). Further analysis of our data revealed that 171 proteins detected in the synovial fluid had a signal peptide and that liver, blood, and articular cartilage mRNA expression data were available for 149 proteins (Supporting Information Table 2). Of these 149 common proteins in the synovial fluid proteome, 113 most likely derive from plasma transudate since their transcript abundance is 2-fold or higher in liver or blood than that in synovium or cartilage. In contrast, 37 synovial fluid proteins have transcripts whose expression in synovium is at least 2-fold higher than that in other tissues, while 11 proteins have at least 2-fold or higher transcript expression in articular cartilage. Several proteins serve as positive controls in validation of our estimations. For example, aggrecan, type II collagen, cartilage oligomeric matrix protein, and cartilage intermediate layer protein 2 are detected in the synovial fluid, and the corresponding transcripts are expressed at greater levels in articular cartilage than in other tissues. Similarly, proteins involved in post-translational modification of type I collagen (such as procollagen C-endopeptidase enhancer) and connective tissue markers (such as fibronectin 1 and clusterin) were expressed at higher levels in synovium than in other tissues. Thus, the putative origins of a substantial number of synovial proteome constituents can be predicted from the RNA sequencing data. For those 17 proteins whose mRNAs are expressed by multiple tissues, tissue-specific changes in mRNA abundance will need to be correlated with changes in synovial protein abundance to reliably determine their most likely tissue of origin. We have additionally detected several proteins without a signal peptide, which are likely residual fragments that remained in the synovial fluid following apoptosis. Some examples of these are hemoglobin (HBA and HBB), which are expressed at extremely higher levels in blood than the remaining tissues, and beta-actin, a ubiquitously expressed protein whose mRNA was detected at high levels (RPKM > 180) in all four tissues. Also of interest are proteins not found in healthy pig synovial fluid and expressed at low levels in healthy synovium. Increased amounts of matrix metalloproteinases, including MMP13 and ADAMTS4, have been reported in synovial fluid from humans with OA, while proteins involved in the inflammatory/immune response, including complement activation, were found in patients with RA.[1] These proteins were neither detected in healthy pig synovial fluid nor were their transcripts abundant in healthy synovium. Therefore, in addition to monitoring changes in protein abundance for the healthy synovial fluid proteome, it will be of interest to compare proteomes and transcriptomes of healthy and diseased joints to identify pathways that may be integral to disease processes or mechanisms. Previous comparisons of synovial fluid obtained from patients with RA and OA identified 135 proteins at least 3-fold differentially abundant between the two groups.[20]

Comparison of Synovial Fluid and Plasma

Finally, to identify similarities between synovial fluid and blood plasma, from which many of the synovial fluid components were found to originate, we characterized the human plasma proteome (Supporting Information Table 3) and identified and quantified a total of 168 human plasma proteins. The detected iBAQ values of the human plasma proteins span 5 orders of magnitude, similar to what was found for the synovial fluid, demonstrating the large degree of protein abundance variation encountered in the two body fluids. However, the actual range of protein abundances in plasma and synovial fluid is expected to be several orders of magnitude larger, due to the limited dynamic range of the MS analysis.[54] Probing the top 10 most abundant proteins in both fluids, the most abundant protein is serum albumin, as expected (Table 3). Other shared high-abundance proteins include serotransferrin, apolipoprotein A, and several immunoglobulins. Serotransferrin and apoliopoprotein A are binding and transporting proteins of iron and lipids, respectively.[55,56] The high abundance of the transporter proteins and immunoglobulins points to the shared functions of the two body fluids as a transport media of cellular products. The main difference between the high-abundance proteins in the plasma and synovial fluid proteomes is the high abundance of hemoglobin in the synovial fluid, which is detected with a lower abundance in plasma (rank 25). In both fluids, the protein is a likely contaminant from red blood cells prior to fluid centrifugation. Fibrinogens, involved in blood clotting, were identified with high abundance in plasma in contrast to that in synovial fluid, as expected.
Table 3

Top Ten Most Abundant Proteins in Human Plasma and Porcine Synovial Fluid Ranked from the Most Abundanta

rankhuman plasmahuman UPIDporcine synovial fluidhuman UPID
1Serum albuminP02768Serum albuminP02768
2Ig gamma-1 chain C regionP01857Hemoglobin subunit alphaP69905
3Ig kappa chain C regionP01834Ig lambda chain C regionP0CG04
4Apolipoprotein A-IP02647Hemoglobin subunit betaP68871
5Ig lambda-2 chain C regionsP0CG05SerotransferrinP02787
6Ig gamma-2 chain C regionP01859Immunoglobulin lambda variable 8-61Q5NV62
7SerotransferrinP02787Immunoglobulin kappa variable 6–21P01834
8Alpha-1-antitrypsinP01009Apolipoprotein A-IP02647
9Apolipoprotein A-IIP02652HemopexinP02790
10Alpha-2-macroglobulinP01023Alpha-2-HS-glycoproteinP02765

As determined by iBAQ values.

As determined by iBAQ values. Our findings demonstrate the high degree of similarity shared by the synovial fluid proteome and the plasma proteome as well as the serum-derived nature of many synovial fluid components. The expected high degree of similarity found between human plasma and porcine synovial fluid, furthermore, indicates that porcine model systems of synovial joints are suitable for studies focusing on human joint diseases.[57]

Conclusions

We evaluated different proteomic strategies for analyzing healthy synovial fluid. We utilized porcine synovial fluid because this animal is used to model human joint disease. In this study, we developed a FASP-based analysis pipeline that consumes only 0.1% of the starting sample material and 10% of the MS instrument time used by, e.g., Balakrishnan;[20] thus, our method lends itself to clinical proteomics studies on larger cohorts. Applying this fast and efficient workflow, we completed a normative proteomics study of synovial fluid from healthy knee joints and identified more than 250 proteins using very stringent identification criteria. Preprocessing with PNGase F to remove N-glycosylation was not found to be essential for the proteomic analysis of synovial fluid. Through an analysis of high-occurrence peptide PTMs, we found that proteomics projects using the FASP protocol for synovial fluid analysis, and possibly for other biological samples as well, should include peptide N-terminal acetylation, deamidation of asparginine and glutamine, and peptide N-terminal pyro-glutamic acid in the data analysis for comprehensive peptide identification. Going beyond the conventional LC–MS-based proteome mapping, we also classified and predicted the likely tissue origins for the majority of the detected proteins in healthy synovial fluid by cross-referencing the proteome with the RNA transcriptomes from synovium, cartilage, blood, and liver. Although many proteins derive from plasma transudate, as expected, an important subfraction appears to be solely expressed by synovium or cartilage. Changes in the abundance of these latter proteins in synovial fluid or in blood may be useful biomarkers of disease onset or progression. For synovial fluid proteins whose transcripts are expressed in multiple tissues, it will be necessary to correlate tissue-specific changes in mRNA expression with changing synovial fluid protein abundance in order to determine these proteins’ principal tissues of origin. Finally, as many proteins were found to likely originate from plasma, we compared the identified porcine synovial fluid proteome to the proteome of human plasma. We found that the two body fluids share many similarities in terms of protein functions and localizations, underlining the detected plasma derived nature of many synovial fluid components. Our methods and the resulting proteome and transcriptome data sets will be useful when comparing porcine synovial fluid in healthy and diseased states. Importantly, the high homology between pigs and humans should make these methods and data sets valuable for human studies.
  58 in total

1.  Detection of tyrosine phosphorylated peptides by precursor ion scanning quadrupole TOF mass spectrometry in positive ion mode.

Authors:  H Steen; B Küster; M Fernandez; A Pandey; M Mann
Journal:  Anal Chem       Date:  2001-04-01       Impact factor: 6.986

2.  Incidence and risk factors for injuries to the anterior cruciate ligament in National Collegiate Athletic Association football: data from the 2004-2005 through 2008-2009 National Collegiate Athletic Association Injury Surveillance System.

Authors:  Jason L Dragoo; Hillary J Braun; Jennah L Durham; Michael R Chen; Alex H S Harris
Journal:  Am J Sports Med       Date:  2012-04-05       Impact factor: 6.202

3.  Proteomic analysis of human plasma proteins by two-dimensional gel electrophoresis and by antibody arrays following depletion of high-abundance proteins.

Authors:  Richard R Desrosiers; Edith Beaulieu; Marguerite Buchanan; Richard Béliveau
Journal:  Cell Biochem Biophys       Date:  2007-10-09       Impact factor: 2.194

4.  The dynamic range problem in the analysis of the plasma proteome.

Authors:  Glen L Hortin; Denis Sviridov
Journal:  J Proteomics       Date:  2009-07-17       Impact factor: 4.044

5.  A strategy for identifying gel-separated proteins in sequence databases by MS alone.

Authors:  A Shevchenko; M Wilm; O Vorm; O N Jensen; A V Podtelejnikov; G Neubauer; A Shevchenko; P Mortensen; M Mann
Journal:  Biochem Soc Trans       Date:  1996-08       Impact factor: 5.407

Review 6.  A systems biology approach to synovial joint lubrication in health, injury, and disease.

Authors:  Alexander Y Hui; William J McCarty; Koichi Masuda; Gary S Firestein; Robert L Sah
Journal:  Wiley Interdiscip Rev Syst Biol Med       Date:  2011-08-08

Review 7.  Pig proteomics: a review of a species in the crossroad between biomedical and food sciences.

Authors:  André Martinho de Almeida; Emøke Bendixen
Journal:  J Proteomics       Date:  2012-04-20       Impact factor: 4.044

8.  The lubricating activity of synovial fluid glycoproteins.

Authors:  D A Swann; R B Hendren; E L Radin; S L Sotman; E A Duda
Journal:  Arthritis Rheum       Date:  1981-01

9.  BioVenn - a web application for the comparison and visualization of biological lists using area-proportional Venn diagrams.

Authors:  Tim Hulsen; Jacob de Vlieg; Wynand Alkema
Journal:  BMC Genomics       Date:  2008-10-16       Impact factor: 3.969

10.  Differential proteomic analysis of synovial fluid from rheumatoid arthritis and osteoarthritis patients.

Authors:  Lavanya Balakrishnan; Mitali Bhattacharjee; Sartaj Ahmad; Raja Sekhar Nirujogi; Santosh Renuse; Yashwanth Subbannayya; Arivusudar Marimuthu; Srinivas M Srikanth; Rajesh Raju; Mukesh Dhillon; Navjyot Kaur; Ramesh Jois; Vivek Vasudev; Yl Ramachandra; Nandini A Sahasrabuddhe; Ts Keshava Prasad; Sujatha Mohan; Harsha Gowda; Subramanian Shankar; Akhilesh Pandey
Journal:  Clin Proteomics       Date:  2014-01-06       Impact factor: 3.988

View more
  27 in total

Review 1.  Why location matters - site-specific factors in rheumatic diseases.

Authors:  Caroline Ospelt; Mojca Frank-Bertoncelj
Journal:  Nat Rev Rheumatol       Date:  2017-06-15       Impact factor: 20.543

2.  The Pig PeptideAtlas: A resource for systems biology in animal production and biomedicine.

Authors:  Marianne O Hesselager; Marius C Codrea; Zhi Sun; Eric W Deutsch; Tue B Bennike; Allan Stensballe; Louise Bundgaard; Robert L Moritz; Emøke Bendixen
Journal:  Proteomics       Date:  2016-02       Impact factor: 3.984

3.  Urine Proteomics Reveals Sex-Specific Response to Total Pancreatectomy With Islet Autotransplantation.

Authors:  Tue Bjerg Bennike; Kate Templeton; Kimino Fujimura; Melena D Bellin; Saima Ahmed; Christoph N Schlaffner; Rohit Arora; Zobeida Cruz-Monserrate; Ramy Arnaout; Gregory J Beilman; Amit S Grover; Darwin L Conwell; Hanno Steen
Journal:  Pancreas       Date:  2022-07-27       Impact factor: 3.243

4.  Characterization of the chondrocyte secretome in photoclickable poly(ethylene glycol) hydrogels.

Authors:  Margaret C Schneider; Christopher A Barnes; Stephanie J Bryant
Journal:  Biotechnol Bioeng       Date:  2017-05-12       Impact factor: 4.530

5.  Toward understanding polymer micelle stability: Density ultracentrifugation offers insight into polymer micelle stability in human fluids.

Authors:  Timothy D Langridge; Richard A Gemeinhart
Journal:  J Control Release       Date:  2019-12-24       Impact factor: 9.776

6.  A Cost-Effective High-Throughput Plasma and Serum Proteomics Workflow Enables Mapping of the Molecular Impact of Total Pancreatectomy with Islet Autotransplantation.

Authors:  Tue Bjerg Bennike; Melena D Bellin; Yue Xuan; Allan Stensballe; Frederik Trier Møller; Gregory J Beilman; Ofer Levy; Zobeida Cruz-Monserrate; Vibeke Andersen; Judith Steen; Darwin L Conwell; Hanno Steen
Journal:  J Proteome Res       Date:  2018-04-19       Impact factor: 4.466

7.  Staphylococcus aureus Floating Biofilm Formation and Phenotype in Synovial Fluid Depends on Albumin, Fibrinogen, and Hyaluronic Acid.

Authors:  Samantha Knott; Dylan Curry; Neil Zhao; Pallavi Metgud; Sana S Dastgheyb; Caroline Purtill; Marc Harwood; Antonia F Chen; Thomas P Schaer; Michael Otto; Noreen J Hickok
Journal:  Front Microbiol       Date:  2021-04-29       Impact factor: 6.064

8.  Comprehensive protein profiling of synovial fluid in osteoarthritis following protein equalization.

Authors:  M J Peffers; B McDermott; P D Clegg; C M Riggs
Journal:  Osteoarthritis Cartilage       Date:  2015-03-26       Impact factor: 6.576

9.  Neutrophil Extracellular Traps in Ulcerative Colitis: A Proteome Analysis of Intestinal Biopsies.

Authors:  Tue Bjerg Bennike; Thomas Gelsing Carlsen; Torkell Ellingsen; Ole Kristian Bonderup; Henning Glerup; Martin Bøgsted; Gunna Christiansen; Svend Birkelund; Allan Stensballe; Vibeke Andersen
Journal:  Inflamm Bowel Dis       Date:  2015-09       Impact factor: 5.325

10.  Characterization of the porcine synovial fluid proteome and a comparison to the plasma proteome.

Authors:  Tue Bjerg Bennike; Omar Barnaby; Hanno Steen; Allan Stensballe
Journal:  Data Brief       Date:  2015-09-04
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.