Literature DB >> 24204839

Analysis of the sperm head protein profiles in fertile men: consistency across time in the levels of expression of heat shock proteins and peroxiredoxins.

Elsa Kichine1, Marcos Di Falco, Barbara F Hales, Bernard Robaire, Peter Chan.   

Abstract

We investigated the identity and quantitative variations of proteins extracted from human sperm heads using a label-free Gel-MS approach. Sperm samples were obtained from three men with high sperm counts at three different time points. This design allowed us to analyse intra-individual and inter-individual variations of the human sperm head proteome. Each time point was analyzed in triplicate to minimize any background artifactual effects of the methodology on the variation analyses. Intra-individual analysis using the spectral counting method revealed that the expression levels of 90% of the common proteins identified in three samples collected at various time-points, separated by several months, had a coefficient of variation of less than 0.5 for each man. Across individuals, the expression level of more than 80% of the proteins had a CV under 0.7. Interestingly, 83 common proteins were found within the core proteome as defined by the intra- and inter-variation analyses set criteria (CV<0.7). Some of these uniformly expressed proteins were chaperones, peroxiredoxins, isomerases, and cytoskeletal proteins. Although there is a significant level of inter-individual variation in the protein profiles of human sperm heads even in a well-defined group of men with high sperm counts, the consistent expression levels of a wide range of proteins points to their essential role during spermatogenesis.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24204839      PMCID: PMC3813703          DOI: 10.1371/journal.pone.0077471

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The delivery of a genetically intact sperm nucleus during fertilization is required for normal embryo development. Subtle alterations are sufficient to disrupt the contribution of sperm DNA to the embryo [1]. The intricacy of sperm DNA packaging in mature spermatozoa results in chromatin that is distinct from that of somatic cells with a higher order of DNA compaction. The resulting condensed and tightly packaged nature of sperm chromatin protects the genetic integrity of the paternal genome during its transport through the male and female reproductive tracts [2], [3]. The condensed sperm DNA, organized in “DNA loop domains” is closely associated with the proteinaceous sperm nuclear matrix (NM) at specific sites through matrix attachment regions [4]. Although there is still little known about the functions of the NM or its protein components, there is a growing interest in studying the composition of the sperm NM as some nuclear proteins have been shown to have a role in normal sperm function [1], [5]–[9]. NM proteins are involved in paternal DNA replication in the one cell embryo [6], [7]. Ocampo et al. have shown that actin, myosin and cytokeratin are components of the NM and may ensure nuclear stability in pig spermatozoa [10]. Recently, the nuclear isoform of GPX4, nGPX4, has been implicated in matrix instability and in paternal DNA decondensation [11]. Taken together, these data strongly advocate for the importance of identifying the structural components of the NM and defining the functional roles of sperm nuclear proteins. The use of label-free LC-MS/MS approaches is widely accepted for performing large scale quantitative analysis of proteins as a consequence of improvements in instrumentation and the development of bioinformatics tools. Two such approaches, spectral counting and ion profiling, are available. Relative quantitation by spectral counting makes use of the strong correlation between protein abundance and the number of MS/MS spectra [12]. Relative quantitation by ion profiling relies on comparison of the recorded full MS scan intensity of an eluting peptide ion at a particular chromatographic time point; it has a linear response over at least four orders of magnitude [13]. For these studies, we used both quantitation methods as a means of cross validating the results; although there are caveats associated with each of these two methodologies [14], our goal was to determine whether these methods yield similar overall results in terms of quantifiable proteins and their relative calculated ratios. The objectives of this study were to identify the proteins in the head of human spermatozoa using a Gel-MS label-free quantitative proteomics approach, to estimate variation in protein abundance across different collection time points in three healthy normospermic men with high sperm counts, and to identify the variation and consistency of common proteins across time and among individuals.

Materials and Methods

Subjects and Sample Collection

Three adult men were recruited through advertisement to participate in the study. The study was approved by the Internal Review Board of the McGill University and written informed consent was obtained from all subjects. All subjects were interviewed and examined every two to six months for a total of three visits when they provided a fresh semen sample after an abstinence period of 3–4 days. The semen samples were analyzed according to WHO protocol [15] and stored at −80 C. Three other subjects were added for the western blot analysis. Semen parameters are illustrated in Table S1 in File S1.

Sperm Head Enrichment and Protein Extraction

Thawed frozen semen samples were washed and dissolved in 0.05 mM PBS pH 7.4 followed by three periods of sonication (40 MHz for 30 sec) in order to break off sperm tails into small pieces. Less than 15% of the sperm heads had the mid-piece attached. After centrifugation at 9,300 g for 5 min and further washing with PBS, the pellet was resuspended, first in 150 µl hypotonic buffer (Nuclear Extract Kit, Active Motif, Cedarlane Labs Ltd., Burlington, ON) and subsequently in 90 µl lysis buffer (Nuclear Extract Kit) and 10 µl of 10 mM DTT. Detergent (25 µl, Nuclear Extract Kit) was added after 15 min of incubation on ice. Proteins from the tail pieces and from potential contaminant cells were solubilised by these two incubation steps and discarded in the supernatant. After washing in PBS, the sperm head suspension was treated with Proteinase Inhibitor Cocktail (Nuclear Extract Kit) and then with 6 M guanidine, 575 mM DTT, 2 M Urea/7 M thiourea. The solubilised proteins were precipitated overnight in cold acetone and resuspended in non-complete Laemmli buffer [16] (without Bromophenol blue and 2-mercaptoethanol) and protein quantification was done in duplicate with the BioRad DC-quantification kit (BioRad, Hercules, CA). Bromophenol blue at 0.01% and ß-mercaptoethanol at 5% (final concentrations) were added and the samples were stored at −80°.

1D SDS-PAGE and Automated Band Excision and Digestion

Forty five micrograms of protein were separated on 2.4 cm 7 to 15% acrylamide 1D SDS-PAGE gels. The full lane was excised with a Protein Picking Workstation ProXCISION (Perkin Elmer, Waltham, MA). Thirteen gel sections were excised. Gel pieces were robotically destained, reduced, cysteine-alkylated and in-gel digested with sequencing grade modified trypsin (Promega, Madison, WI), as previously described [17]. Peptide extraction from gel section was done robotically to generate a volume of 60 µl of peptide extract per gel section.

LC-MS/MS Analysis and Bioinformatics Data Processing

Aliquots of peptide extracts from the 13 gel sections were pooled to generate 3 pooled-sections samples. Each pooled sample was analyzed by on-line nano-HPLC-MSMS using a Velos LTQ-Orbitrap (Thermo Fisher Scientific). Peak-lists were generated using Mascot Distiller version 2.3.2.0 then searched against a database of Homo sapiens sequences extracted from the Universal Protein Resource (UniProt) database (September 17, 2010) containing 88352 entries, using Mascot version 2.3.01 followed by X! Tandem version 2007.01.01.1 on the subset of identified proteins. Mascot and X! Tandem searches were done using a fragment ion mass tolerance of 0.80 Da and a parent ion tolerance of 10.0 ppm. Scaffold (version 3_00_07, Proteome Software Inc., Portland, OR) was used to validate MS/MS based peptide and protein identifications. Peptide identifications were accepted if they could be established at >95.0% probability as specified by the Peptide Prophet algorithm [18]. Protein identifications were accepted if they could be established at >95.0% probability and contained at least 2 identified peptides. Protein probabilities were assigned by the Protein Prophet algorithm [19]. Proteins that contained similar peptides and could not be differentiated based on MS/MS analysis alone were grouped to satisfy the principles of parsimony. Quantitation analysis based on MS precursor ion signal was done using the precursor ion detection workflow from Proteome Discoverer Quant 1.2 (Thermo Fisher).

Western Blotting

Total protein was extracted from the heads of spermatozoa as described above. Protein concentrations were determined using the BioRad protein assay kit according to the manufacturer’s protocol (BioRad, Mississauga, ON, Canada). Samples (5 µg/lane) were resolved by SDS polyacrylamide gradient (8%, w/v) gel electrophoresis at 100 V for 1.5 h using SDS running buffer and then transferred onto PVDF membranes. Membranes were blocked with 5% nonfat milk in TBS containing 0.1% Tween-20. Proteins were detected using antibodies specific for PPIB, PRDX5 (Santa Cruz 20361, 33573) and SPESP1 (Abcam, 72672) all diluted in 3% nonfat milk in TBS-0.1% Tween-20 followed by HRP-linked secondary antibodies; Clusterin a protein consistently expressed across all of our samples with a coefficient of variation of 0.1 was used to correct for loading.

Statistical Analyses

Analyses of the protein profile at each time point for each subject were done in triplicate and proteins detected at a level below 3.5 assigned spectra were arbitrarily defined as artifactual signals from the technical background variation and were eliminated from further analysis. Likewise, protein signals that were not consistently present in the triplicate analysis were eliminated. Normalization of quantitative values was done by dividing the values by the mean of all quantitative values in the list of proteins in each sample. Variation in the level of expression of each protein was determined by the coefficient of variation (CV) and fold change. The intra-individual CV was calculated from the mean of triplicate analyses of the protein expression levels for each time point for each subject. The inter-individual CV of each protein was calculated from its mean level of expression at all three time points for each subject. All the statistical tests and graphs were done using GraphPad Prism version 5.00 for Windows (GraphPad Software, San Diego, CA, www.graphpad.com).

Results

Protein Identification and Triplicate Analyses

Analysis by LC-MS/MS of the protein extraction from the sperm head enriched preparations results in the identification of a total of 686 proteins (table S2 in ). The gene ontology analysis tool from the Scaffold software is illustrated in Fig. S1. The distribution of the CV associated with the corresponding average of the triplicates of the nine samples was analyzed. As an example, results from one of the subjects (subject 2) at the first time point are represented in Fig. 1A. The highest CVs were associated with proteins of the lowest quantitative values. Thus the variation level may be due to their higher margin of error in detection. We included in our variation analysis all proteins detected at a minimum level of 3.5 assigned spectra. As shown in Fig. 1B, the application of this cut-off significantly reduced the median CV of the total proteins analyzed from 0.3 to 0.1. This strategy ensured that the artefactual variation in proteins with low levels of detection among triplicates was minimized in the analysis.
Figure 1

Application of the cut-off reduces the median CV of technical triplicates from 0.3 to 0.1.

A. CV values repartition depending on the average of the triplicate quantitative value of the proteins detected in the first sample of subject 2 as an example. The highest CVs are associated with the lowest quantitative values. The cut-off has been defined at 3.5 assigned spectra. B. Box plot and minimum to maximum whiskers representing the decrease of the CV values after application of the cut-off.

Application of the cut-off reduces the median CV of technical triplicates from 0.3 to 0.1.

A. CV values repartition depending on the average of the triplicate quantitative value of the proteins detected in the first sample of subject 2 as an example. The highest CVs are associated with the lowest quantitative values. The cut-off has been defined at 3.5 assigned spectra. B. Box plot and minimum to maximum whiskers representing the decrease of the CV values after application of the cut-off.

Intra-individual Variation in the Normal Sperm Head Proteome

The majority of proteins (55–63%) were shared across the three time points for each subject. Proteins that were found at only one time point for each subject (23%–25%) were those with the lowest total spectra abundance (Fig. 2A). Examples of these proteins were proteasome subunit (PSD7) and ribosomal proteins (RS10).
Figure 2

Intra-individual variation analysis.

A. Venn diagrams of the three subjects showing the distribution of the number of identified proteins at the three time points before application of the cut-off. B. Box plot and 5–95 percentile whiskers illustrating the distribution of the CV values of all proteins included in the analysis of each subject. The majority of proteins had a time course CV<0.5.

Intra-individual variation analysis.

A. Venn diagrams of the three subjects showing the distribution of the number of identified proteins at the three time points before application of the cut-off. B. Box plot and 5–95 percentile whiskers illustrating the distribution of the CV values of all proteins included in the analysis of each subject. The majority of proteins had a time course CV<0.5. The variation analysis of proteins shared across all three time points for each subject was based on the CV values obtained from the three averages of triplicates. A boxplot with 5 to 95 percentile whiskers (Fig. 2B) was used to analyze the distribution of the CV of each protein shared across time points for each subject. An average of 90% of proteins had a lower than 0.5 dispersion level across time-points, with a median CV of 0.3 in the three subjects. Among these proteins, 33% had a low CV value (<0.2) for the time course analysis. Examples of such proteins included prolactin-inducible protein (PIP) and semenogelin 1 and 2 (SEMG1 and SEMG2) (Table S3 in ). A greater than 0.7 CV was seen in only 3% of all proteins in the analysis. These included various histones (H2AJ, H2AV, H4) and zona pellucida-binding protein 1 (ZPBP1). Interestingly, none of the various histone proteins detected were consistently associated with low or high CV values across time in the three men. Specifically, H2B1A was associated with a CV of 0.7 in subject 1, while in subject 2, the CV was only 0.3. For H2AJ the CV was 0.3 for one subject versus 0.7 in another (Table S3 in ). These intra-individual variation analyses revealed that the majority of proteins identified were shared across all time points and that this group of proteins had a dispersion level under 0.5 across time in all subjects.

Inter-individual Variation of the Normal Sperm Head Proteome

Our inter-individual variation analysis revealed that 211 proteins were shared in all three subjects. Subject 1 had 143 unique proteins whereas subjects 2 and 3 had only 21 and 7, respectively (Fig. 3A). After applying our cut-off criteria to eliminate proteins detected at a level below 3.5 assigned spectra, a total of 117 proteins were included for further analysis. The median CV of the level of expression of these proteins was 0.5 (Fig. 3B). Among the proteins shared in all subjects, 80% had an expression level dispersion under 0.7. As expected, the expression level of the proteins shared among subjects had a higher level of dispersion than the one determined in the intra-variation analysis. Proteins such as clusterin (CLUS), SEMG1 and SEMG2, and PIP were particularly consistent in their levels of expression among men, with a CV of less than 0.2. More than 9% of the proteins were associated with a CV of more than 0.9. Some of the more variable proteins included histones H2A.J (CV = 0.8) and H2AV (CV = 0.8) and sperm equatorial segment protein 1(SPESP) (CV = 0.9) (Table S3 in
Figure 3

Inter-individual variation analysis.

A. Venn diagrams of the three subjects showing the distribution of the number of identified proteins among the three men before application of the cut-off. B. Box plot and 5–95 percentile whiskers representing the repartition of the CV of the 117 proteins included in the analysis. The majority of these proteins had a CV<0.7.

Inter-individual variation analysis.

A. Venn diagrams of the three subjects showing the distribution of the number of identified proteins among the three men before application of the cut-off. B. Box plot and 5–95 percentile whiskers representing the repartition of the CV of the 117 proteins included in the analysis. The majority of these proteins had a CV<0.7. Therefore, inter-individual variation exceeded intra-individual variation for the proteins detected in our subjects, with 80% of proteins having dispersion in their expression level of <0.7.

Ion profiling Quantification was Equivalent to the Spectral Count Analysis

Among the 117 proteins included in the analysis, a total of 83 proteins associated with CV value of <0.7 were shared across time and among men (Table 1). These can be considered the core sperm head proteome. This consistency suggests a highly concerted regulation of their expression levels. Results of the ion profiling method of quantification were analyzed and compared to those obtained by spectral counting to further validate our observations. The CV was calculated with the same strategy described above for the inter-individual variation analyses. Of the 83 proteins listed by the spectral count quantification, 33 were also identified by ion profiling quantification and were included in the analysis. When comparing results of inter-individual analyses using spectral counting versus ion profiling quantification, we found no significant differences in the CV distribution (t-test, p = 0.78) (Fig. 4). These results indicate that, in our study design, the variations determined based on spectral count quantification were comparable to the values obtained by ion profiling quantification.
Table 1

List of 83 common proteins found within the core proteome.

GeneProteinAccessINTER
NumberCV
CLU CLUS ClusterinP109090,1
SEMG1 SEMG1 Semenogelin-1P042790,1
SEMG2 SEMG2 Semenogelin-2Q023830,1
AKR1B1 ALDR Aldose reductaseP151210,1
SIL1 SIL1 Nucleotide exchange factor SIL1Q9H1730,1
PPIB PPIB Peptidyl-prolyl cis-trans isomerase BP232840,2
SERPINA5 IPSP Plasma serine protease inhibitorP051540,2
TUBA1A TBA1A Tubulin alpha-1A chainQ71U360,2
AKAP4 AKAP4 kinase anchor protein 4Q5JQC90,2
PRDX2 PRDX2 Peroxiredoxin-2P321190,2
PRDX4 PRDX4 Peroxiredoxin-4Q131620,2
ACTB ACTB Actin, cytoplasmic 1P607090,2
CAMP CAMP Cathelicidin antimicrobial peptideP499130,2
ANPEP AMPN Aminopeptidase NP151440,2
TUBA3C TBA3C Tubulin alpha-3C/D chainQ137480,2
PIP PIP Prolactin-inducible proteinP122730,2
SORD DHSO Sorbitol dehydrogenaseQ007960,2
ZG16B ZG16B Zymogen granule protein 16 homolog BQ96DA00,2
NEU1 NEUR1 Sialidase-1Q995190,2
PGK1 PGK1 Phosphoglycerate kinase 1P005580,2
HSP90B1 ENPL EndoplasminP146250,3
SERPINB6 SPB6 Serpin B6P352370,3
PPIC PPIC Peptidyl-prolyl cis-trans isomerase CP458770,3
HSPA13 HSP13 Heat shock 70 kDa protein 13P487230,3
YWHAQ 1433T 14-3-3 protein thetaP273480,3
MATN2 MATN2 Matrilin-2O003390,3
PGC PEPC GastricsinP201420,3
HYOU1 HYOU1 Hypoxia up-regulated protein 1Q9Y4L10,3
HSPA5 GRP78 78 kDa glucose-regulated proteinP110210,3
LAMC1 LAMC1 Laminin subunit gamma-1P110470,3
PFN1 PROF1 Profilin-1P077370,3
SFN 1433S 14-3-3 protein sigmaP319470,3
YWHAB 1433B 14-3-3 protein beta/alphaP319460,3
Elongation factor 1-alpha (Fragment)Q53HR50,3
CPZ CBPZ Carboxypeptidase ZQ66K790,3
KLK3 KLK3 Prostate-specific antigenP072880,3
MUC6 MUC6 Mucin-6Q6W4X90,4
TUBB2C TBB2C Tubulin beta-2C chainP683710,4
UBC UBC Polyubiquitin-CP0CG480,4
ALB ALBU Serum albuminP027680,4
KRT2 K22E Keratin, type II cytoskeletal 2 epidermalP359080,4
GAPDH G3P Glyceraldehyde-3-phosphate dehydrogenaseP044060,4
TPI1 D3DUS9 Triosephosphate isomeraseD3DUS90,4
RUVBL1 RUVB1 RuvB-like 1Q9Y2650,4
HSP90AB1 HS90B Heat shock protein HSP 90-betaP082380,4
EEF1G EF1G Elongation factor 1-gammaP266410,4
SPA17 SP17 Sperm surface protein Sp17Q155060,4
LDHC LDHC lactate dehydrogenase C chainP078640,4
PDIA3 PDIA3 Protein disulfide-isomerase A3P301010,4
HSP90AA1 HS90A Heat shock protein HSP 90-alphaP079000,4
SPANXB1 SPNXB Sperm protein associated with the nucleus on the X chromosome B/FQ9NS250,4
YWHAE 1433E 14-3-3 protein epsilonP622580,4
LPL LIPL Lipoprotein lipaseP068580,4
KRT9 K1C9 Keratin, type I cytoskeletal 9P355270,4
ROPN1 ROP1A Ropporin-1AQ9HAT00,4
DLAT ODP2 Dihydrolipoyllysine-residue acetyltransferase of pyruvate dehydrogenase complexP105150,4
GAPDHS G3PT Glyceraldehyde-3-phosphate dehydrogenase, testis-specificO145560,4
KRT1 K2C1 Keratin, type II cytoskeletal 1P042640,4
FN1 FINC FibronectinP027510,5
PEBP1 PEBP1 Phosphatidylethanolamine-binding protein 1P300860,5
YWHAZ 1433Z 14-3-3 protein zeta/deltaP631040,5
ATP5A1 ATPA ATP synthase subunit alpha, mitochondrialP257050,5
CCT6A TCPZ T-complex protein 1 subunit zetaP402270,5
KRT10 K1C10 Keratin, type I cytoskeletal 10P136450,5
PRDX1 PRDX1 Peroxiredoxin-1Q068300,5
CRISP1 CRIS1 Cysteine-rich secretory protein 1P541070,5
HSPB1 HSPB1 Heat shock protein beta-1P047920,5
HSPA2 HSP72 Heat shock-related 70 kDa protein 2P546520,5
NUCB2 D3DQX5 Nucleobindin 2, isoform CRA_aD3DQX50,5
GSTM3 GSTM3 Glutathione S-transferase Mu 3P212660,5
HSPA8 HSP7C Heat shock cognate 71 kDa proteinP111420,5
ALDOA ALDOA Fructose-bisphosphate aldolase AP040750,6
PRDX5 PRDX5 Peroxiredoxin-5, mitochondrialP300440,6
OS9 OS9 Protein OS-9Q134380,6
LAMB2 LAMB2 Laminin subunit beta-2P552680,6
ATP5B ATPB ATP synthase subunit beta, mitochondrialP065760,7
HSPA1A HSP71 Heat shock 70 kDa protein 1A/1BP081070,7
HSPA1L HS71L LHeat shock 70 kDa protein 1-likeP349310,7
HSPD1 CH60 60 kDa heat shock protein, mitochondrialP108090,7

Proteins have been listed using the intra and inter-variation analyses-set criteria (CV<0.5 and CV<0.7, respectively).

Figure 4

Quantification analysis using the spectral count and ion profiling methods.

Box plot and minimum to maximum whiskers show repartition of the CV obtained by spectral count and ion profiling quantification methods on proteins that were determined to be commonly found in the core proteome by both intra and inter-variation analysis (CV<0.7, n = 33).

Quantification analysis using the spectral count and ion profiling methods.

Box plot and minimum to maximum whiskers show repartition of the CV obtained by spectral count and ion profiling quantification methods on proteins that were determined to be commonly found in the core proteome by both intra and inter-variation analysis (CV<0.7, n = 33). Proteins have been listed using the intra and inter-variation analyses-set criteria (CV<0.5 and CV<0.7, respectively).

Expression Levels of HSPs and PRXs were Consistent Over Time and among Men

Among the 83 consistently expressed proteins (Table 1), we identified chaperones, cytoskeleton proteins, peroxiredoxins, isomerases, and other enzymes. The chaperones identified were: HS90A and HS90B, HSP13, HSP72, HSP7C, ENPL, GRP78, HYOU1, and CLUS. HS90A and HS90B both promote the maturation and structural maintenance of specific target proteins [20], [21]. HSP13 has a peptide-independent ATPase activity [22]. HSP72 is implicated in stabilization of pre-existing proteins [23]. HSP7C acts as a repressor of transcriptional activation [24] and ENPL functions in the processing and transport of secreted proteins [25]. GRP78 is a component of the eIF2 complex, implicated in translational initiation [26]. Finally, HYOU1, a part of a large multi-complex aggregate implicated in cytoprotective mechanisms [27], and CLUS prevent aggregation of non-native proteins [28]. Surprisingly, we found that five of the 14-3-3 family proteins were among the highly-consistent proteins. These are adapter proteins implicated in the regulation of protein function in a large group of signalling pathways [29]. To our knowledge, this is the first report of a number of 14-3-3 family proteins in human spermatozoa. Four isomerases were found to be highly consistent: D3DUS9 is implicated in glycolysis and energy production [30]; PDIA3 catalyses the formation of disulfide bonds in targeted proteins [31]. PPIB and PPIC are both cis-trans isomerases, helping the folding of targeted proteins [32], [33]. Three of the four peroxiredoxins found are associated with the 2-Cys class of peroxiredoxins: PRDX1, PRDX4, and PRDX2. PRDX5 is associated with the atypical 2-Cys class of peroxiredoxins. All of these enzymes are associated with the regulation of oxidation and reduction [34]. Interestingly, we also found several consistently expressed proteins implicated in the regulation of gene expression, such as elongation factor 1-gamma (EF1G) and RUVB1, a component of the NuA4 histone acetyltransferase complex [35].

Validation of the Variation Analyses by Western Blotting

In order to confirm the dispersion level found by the inter-individual analyses of the proteomic data, the coefficients of variation of four proteins with various dispersion levels were analysed by western blotting (Fig. 5 and Fig. S2). The intra-individual variation was evaluated from the data of the three subjects used for the proteomic analysis and the three subject added for the Western blot experiment (n = 6).The average of the intra-individual CV and the inter-individual CV are shown in table 2. The average of the intra-individual CV based on the western blot data showed consistency with the CV calculated using the proteomic quantitative data. The inter-individual CV, based on western blot data, show that the variation is less than that described with the proteomic quantitative data. Nevertheless, a protein described as stable (PPIB) or variable (SPESP1) by proteomic analysis appears to be similarly changed in western blot analysis. These results show that after the application of all criteria, calculation of the coefficient of variation from proteomic data is valid.
Figure 5

Validation of the intra and inter-individual analysis by western blotting.

The coefficient of variation of four proteins with various dispersion levels was calculated by western blotting. The intra and inter-individual variation analysis based on proteomic quantitative data (n = 3) were compared with that obtained by western blot analysis of the three subjects used for the proteomic analysis and three additional subjects. The table 2 regroups the average of the intra CV and the inter CV calculated from the proteomic results (n = 3) and the CV calculated from the western blot data (n = 6).

Table 2

Average of the intra and inter coefficient of variation (CV).

Average Intra CVInter CV
Proteomic n = 3Western Blot n = 6Proteomic n = 3Western Blot n = 6
SPESP1 0.30.30.80.6
PRDXV 0.30.30.60.4
PPIB 0.10.10.30.2
CLUS 0.10.10.10.1

The intra and inter CV were calculated from the proteomic results (n = 3) and the CV calculated from the western blot data (n = 6).

Validation of the intra and inter-individual analysis by western blotting.

The coefficient of variation of four proteins with various dispersion levels was calculated by western blotting. The intra and inter-individual variation analysis based on proteomic quantitative data (n = 3) were compared with that obtained by western blot analysis of the three subjects used for the proteomic analysis and three additional subjects. The table 2 regroups the average of the intra CV and the inter CV calculated from the proteomic results (n = 3) and the CV calculated from the western blot data (n = 6). The intra and inter CV were calculated from the proteomic results (n = 3) and the CV calculated from the western blot data (n = 6).

Discussion

The methodology used in proteomics is particularly suitable to the study of spermatozoa because these cells are transcriptionally silent, i.e., the nature of the protein profile reflects the functional status of sperm. While proteomic analysis allows us to generate inventories of thousands of proteins expressed in sperm [36], comparative studies are useful in finding a link between the listed proteins and their biological role. Differences in the identity of proteins and their quantitative levels of expression may be a consequence of basic variations in the biological process of spermatogenesis and/or maturation in the epididymis or due to genetic polymorphisms. To our knowledge, this is the first study describing quantitative expression variations in the sperm head enriched proteome in fertile men. Through the quantitative measurement of sperm head proteins, we were able to compare their expression levels across different times of collection for individual subjects and across different subjects. An important strength of our study was the use of technical triplicate analyses for each individual at each time point, allowing us to evaluate and exclude contextual background protein identifications associated with technical variations inherent to the methodology and analyses. Application of identification and quantification cut-off criteria decreased the maximum variance of the mean of the three estimated quantitative values by nearly 3-fold. This observation shows the degree of arbitrary conclusions that may be derived if no replicate is used and demonstrates the importance of defining a threshold of inclusion of protein signals before conducting comparative analyses. Another important feature of our study design was the variation analyses of samples collected at different time points using semen samples from each subject. This approach allowed us to identify proteins that are consistently found both over time within an individual and among a cohort of men with a defined reproductive phenotype (normozoospermia). The identification of these proteins, for which there is a high level of confidence, serves as the basis for further analyses in defining the sperm head protein profile of a subject. Judging from their consistent presence among all subjects evaluated, they are likely to also play essential roles in normal sperm function. Our approach also allowed us to identify a list of proteins that were highly variable across time and not shared among subjects. Our findings highlight the importance of using multiple subjects and time points in the comparative proteomic analysis of sperm. Albumin (ALBU) is known to be the major protein detected from seminal plasma. In our study, ALBU was detected at a very low level, with an average of 0.05% of total spectra, indicating that seminal protein contamination is very low. Proteins that were expressed only at a low level were excluded by application of the cut-off before the quantitative analysis. Although the spectral counting quantitation approach is simpler to do, its reliability is restricted to a limited dynamic range of concentrations, particularly when dealing with proteins for which only a handful of spectra are identified. The ion profiling quantitative approach is used both as a means to ameliorate the relative quantitative determination of proteins at both extremes of the spectral abundance for which subtle differences in expression may have gone undetected by the spectral counting approach and as a way to validate those determinations for which both methods are in agreement. In our study, we used determinations from these two separate quantification methods to compare the dispersion of the expression levels of proteins found to constitute the core proteome across time and among subjects. Comparison of the CVs revealed that no significant differences were found in the distribution of the CVs obtained by both methods. This indicates that for relatively small fold differences both methods are fairly reliable; furthermore, the range of expression levels does significantly affect the reliability of the determination of ratios. The magnitude of the dynamic range of proteins in a complete spermatozoon is evaluated at 105–6 [37]. Our preparations of sperm were specifically enriched in sperm head proteins to reduce the levels of complexity (i.e., eliminating most tail sperm proteins from the biological samples studied), allowing us to identify a total of 686 proteins. Among these, 22% (n = 149) were identified as nuclear proteins based on information derived from somatic cells. When comparing total spectra abundance in one sample, the relative ratios of known spermatozoon nuclear proteins, such as H4 and H2A.J, were found to be 1.0 and 2.5 fold higher than the level of AKAP4, the most abundant protein known to be expressed in the entire spermatozoon [37]. This reflects the success of our methodology in enriching sperm heads. The true proportion of nuclear proteins in the spermatozoon, however, cannot be accurately defined since not all nuclear proteins in spermatozoa have been identified and localized. A total of 83 proteins, representing 75% of the total proteins included in the inter-individual analysis of variation (Table 1), were commonly found in the core proteome across time and among subjects. Several HSPs were expressed in sperm and some of these were expressed at high levels [37]; HSPs have been associated with capacitation and indirectly with sperm-egg binding [37]. Their specific functions seem to depend on their tyrosine phosphorylation state and whether they migrate to the surface of the spermatozoon [37]. In our study, we also found that the expression of nine chaperones was highly consistent. A recent study showed that three of these chaperones, namely, HSP7C, GRP78 and HYOU1, are expressed in the plasma membranes of human spermatozoa and are accessible to surface labelling [38]. Another protein that was found to be highly consistently expressed was the T-complex protein 1 subunit zeta (TCPZ). Interestingly, TCPZ has been described as a component of chaperonin-containing TCP-1 complex [39]. This multimeric protein complex is involved in sperm-zona pellucida interaction and TCPZ protein co-localizes with the ZPBP2 protein [39]. Capacitation is required to ensure the fertilization ability of spermatozoa. This complex process is highly regulated by phosphorylation of proteins and controlled by the amount of reactive oxygen species (ROS) [40]. Spermatozoa themselves are capable of producing ROS that, at low levels, are required for capacitation and hyperactivation [41]. However, high levels of ROS could result in oxidative stress and DNA damage. In fact, high ROS levels are detected in 25% of infertile men [42], [43]. Peroxiredoxins are important in redox regulation in somatic cells [34]. In our study, we found four peroxiredoxins, namely, PRDX1, PRDX2, PRDX4 and PRDX5, to be expressed consistently across time and among men. PRDX proteins are found in multiple compartments in the human spermatozoon; PRDX4 is localized in the acrosome, while PRDX1 is in the equatorial segment and PRDX5 is in the post-acrosomal region [44]. Gong et al., 2012 [45] noted the relationship of PRDX1 with fertility status. Semenogelins prevent capacitation by reducing ROS production and hyperactivated motility; they are the main component of the human semen coagulum [46], [47]. Recently, ROS in spermatozoa have been shown to have an impact on semenogelin metabolism [48], indicating a co-regulation of ROS and semenogelins, both involved in delaying the initiation of capacitation. This study represents the first description of the analyses of intra- and inter-individual variation of the sperm head proteome based on quantitative observations. We have clearly demonstrated the necessity of using more than one sample per subject in comparative studies; triplicate analysis of each sample helped to minimize haphazard quantification of the proteins with a very low level of expression. This study serves to increase our knowledge about proteins expressed in sperm. Furthermore, it lays the foundation for future comparative studies aimed at correlating protein profile expression with various pathological conditions, such as male infertility. Gene ontology annotation of the 686 proteins identified. Pie chart based on gene ontology tool from Scaffold software illustrating the distribution of proteins in the molecular function and cellular component categories.The majority of identified proteins are associated with molecular functions (n = 542), such as isomerases, and binding (n = 455) in the case of chaperones. Some proteins are associated with transport activity (n = 33), such as importins, and others with antioxidant activity (n = 12), such as peroxiredoxins. Interestingly, some of the proteins are associated with transcription regulation activity (n = 15), e.g., ribosomal proteins. Based on cellular components, the majority of proteins were associated with cytoplasm (n = 404), intracellular organelles, (n = 394), organelle parts (n = 261), membranes (n = 186), extracellular regions (n = 142), or the nucleus (n = 149). (TIF) Click here for additional data file. Western blot data of the proteins from the sperm sample of the three added control men. Proteins extracted from the sperm head-enriched samples from the three subjects of the study were resolved by SDS page and immunoblotted with antibodies. The coefficients of variation of those proteins were compared with the ones based on proteomic quantitative data. (TIF) Click here for additional data file. Contains: Table S1. Semen parameters data from the six subjects. The average age of the three subjects included in the proteomic analysis was 35.7 yrs (30, 38, and 39 years). Two of the subjects had a previous history of natural fecundity. The third one had not previously attempted to achieve pregnancy with partners. One subject had a history of hypertension that was well-controlled by medication with no additional significant co-morbidities. Another subject had a history of tobacco consumption that had been terminated for more than three years. Neither of the other subjects reported a history of smoking or illicit drug use. The average age of the three subjects added for the western blot analysis, were 24 (22–26) yrs. None of these three subjects have any significant co-morbidity. There was no history of smoking or use of any drugs and medications. All six subjects had values that placed them in the fertile range, as per WHO standards. Table S2. List of the 686 identified proteins by LC-MS/MS analysis. Each identified protein is listed from the most abundant to the less abundant. Table S3. List of protein analysed in the intra-individual and inter-individual analyses. Each protein included in the variation analyses for each subject is listed depending its variation level. The lowest is the CV, the lowest is the variation level over-time and/or among subject. (XLSX) Click here for additional data file.
  47 in total

1.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

2.  Levels of semenogelin in human spermatozoa decrease during capacitation: involvement of reactive oxygen species and zinc.

Authors:  E de Lamirande; G Lamothe
Journal:  Hum Reprod       Date:  2010-05-24       Impact factor: 6.918

3.  Evaluation of oxidative DNA damage in human sperm and its association with male infertility.

Authors:  H M Shen; S E Chia; C N Ong
Journal:  J Androl       Date:  1999 Nov-Dec

4.  Increased oxidative deoxyribonucleic acid damage in the spermatozoa of infertile male patients.

Authors:  H Kodama; R Yamaguchi; J Fukuda; H Kasai; T Tanaka
Journal:  Fertil Steril       Date:  1997-09       Impact factor: 7.329

5.  Topoisomerase IIB and an extracellular nuclease interact to digest sperm DNA in an apoptotic-like manner.

Authors:  Jeffrey A Shaman; Renata Prisztoka; W Steven Ward
Journal:  Biol Reprod       Date:  2006-08-16       Impact factor: 4.285

6.  Cleavage of structural proteins during the assembly of the head of bacteriophage T4.

Authors:  U K Laemmli
Journal:  Nature       Date:  1970-08-15       Impact factor: 49.962

Review 7.  Peroxiredoxins: a historical overview and speculative preview of novel mechanisms and emerging concepts in cell signaling.

Authors:  Sue Goo Rhee; Ho Zoon Chae; Kanghwa Kim
Journal:  Free Radic Biol Med       Date:  2005-03-24       Impact factor: 7.376

8.  Ability of hamster spermatozoa to digest their own DNA.

Authors:  Barbara Sotolongo; Elisabete Lino; W Steven Ward
Journal:  Biol Reprod       Date:  2003-08-20       Impact factor: 4.285

Review 9.  Sperm activation: role of reactive oxygen species and kinases.

Authors:  Eve de Lamirande; Cristián O'Flaherty
Journal:  Biochim Biophys Acta       Date:  2007-09-05

10.  Stch encodes the 'ATPase core' of a microsomal stress 70 protein.

Authors:  G A Otterson; G C Flynn; R A Kratzke; A Coxon; P G Johnston; F J Kaye
Journal:  EMBO J       Date:  1994-03-01       Impact factor: 11.598

View more
  3 in total

1.  A Human Lectin Microarray for Sperm Surface Glycosylation Analysis.

Authors:  Yangyang Sun; Li Cheng; Yihua Gu; Aijie Xin; Bin Wu; Shumin Zhou; Shujuan Guo; Yin Liu; Hua Diao; Huijuan Shi; Guangyu Wang; Sheng-Ce Tao
Journal:  Mol Cell Proteomics       Date:  2016-06-30       Impact factor: 5.911

2.  Utf1 contributes to intergenerational epigenetic inheritance of pluripotency.

Authors:  Qiuye Bao; Amir Morshedi; Fulu Wang; Sharma Bhargy; Konstantin Pervushin; Wei-Ping Yu; Peter Dröge
Journal:  Sci Rep       Date:  2017-11-06       Impact factor: 4.379

Review 3.  The Challenge of Human Spermatozoa Proteome: A Systematic Review.

Authors:  Kambiz Gilany; Arash Minai-Tehrani; Mehdi Amini; Niloofar Agharezaee; Babak Arjmand
Journal:  J Reprod Infertil       Date:  2017 Jul-Sep
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.