Literature DB >> 26793233

Milk Bottom-Up Proteomics: Method Optimization.

Delphine Vincent1, Vilnis Ezernieks1, Aaron Elkins1, Nga Nguyen1, Peter J Moate2, Benjamin G Cocks3, Simone Rochfort3.   

Abstract

Milk is a complex fluid whose proteome displays a diverse set of proteins of high abundance such as caseins and medium to low abundance whey proteins such as ß-lactoglobulin, lactoferrin, immunoglobulins, glycoproteins, peptide hormones, and enzymes. A sample preparation method that enables high reproducibility and throughput is key in reliably identifying proteins present or proteins responding to conditions such as a diet, health or genetics. Using skim milk samples from Jersey and Holstein-Friesian cows, we compared three extraction procedures which have not previously been applied to samples of cows' milk. Method A (urea) involved a simple dilution of the milk in a urea-based buffer, method B (TCA/acetone) involved a trichloroacetic acid (TCA)/acetone precipitation, and method C (methanol/chloroform) involved a tri-phasic partition method in chloroform/methanol solution. Protein assays, SDS-PAGE profiling, and trypsin digestion followed by nanoHPLC-electrospray ionization-tandem mass spectrometry (nLC-ESI-MS/MS) analyses were performed to assess their efficiency. Replicates were used at each analytical step (extraction, digestion, injection) to assess reproducibility. Mass spectrometry (MS) data are available via ProteomeXchange with identifier PXD002529. Overall 186 unique accessions, major and minor proteins, were identified with a combination of methods. Method C (methanol/chloroform) yielded the best resolved SDS-patterns and highest protein recovery rates, method A (urea) yielded the greatest number of accessions, and, of the three procedures, method B (TCA/acetone) was the least compatible of all with a wide range of downstream analytical procedures. Our results also highlighted breed differences between the proteins in milk of Jersey and Holstein-Friesian cows.

Entities:  

Keywords:  Jersey and Holstein-Friesian cow milk; proteome; replicates; shotgun nLC-ESI-MS; trypsin digestion

Year:  2016        PMID: 26793233      PMCID: PMC4707256          DOI: 10.3389/fgene.2015.00360

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


Introduction

Milk is a very complex body fluid whose primary biological function is to nurture newborns. Cow's milk, in its pure form or derivative dairy products such as cream, butter, cheese, and yogurt, is a major source of nutrition for humans. On average, cow's milk is composed of 88% of water, 4.8% carbohydrates, 3.9% lipids, 3.2% proteins, and 0.7% minerals (Jost, 2005). Bos taurus have been bred for millenia and selected to increase milk production in dairy animals. The recent sequencing of Bos taurus genome (Bovine Genome Sequencing and Analysis Consortium, 2009) paved the way for omics studies, particularly proteomics which heavily relies on gene model annotations for accurate protein identification. The cattle genome is predicted to contain at least 22,000 protein-coding genes. In cow's milk, the most abundant proteins are caseins (α-S1-, α-S2-, β-, and κ-forms) which represent about 78% of total protein concentration, followed by whey proteins which make up 17% (β-lactoglobulin, α-lactalbumin, lactoferrin, and lactoperoxidase) (reviewed in Bendixen et al., 2011; Roncada et al., 2012). Various protocols for milk protein extraction have been described in the literature including dilution of skim milk in a urea-based buffer compatible with isoelectric focusing (IEF; Boehmer et al., 2008; Jensen et al., 2012a), acetone precipitation of full cream milk (Danielsen et al., 2010), ultracentrifugation to pellet caseins (Hettinga et al., 2011; Kim et al., 2011; Reinhardt et al., 2013) followed by 10 kD molecular weight cut-off (MWCO) filtration of whey fraction (Le et al., 2011), ammonium sulfate precipitation of caseins to isolate serum (Hogarth et al., 2004), acetic acid removal of caseins to isolate whey proteins (Senda et al., 2011), or low speed centrifugation to remove the fat layer followed by a dilution of the skim milk in a protein buffer compatible with 2-DE (Yang et al., 2013). The diversity of methods led us to assume there was not one established method proven to be superior to the others for enabling a complete proteome analysis while ensuring high throughput. Recently, Nissen et al. (2012, 2013) applied a fractionation method to bovine colostrum or mature milk resulting in a cell-free and fat-free fraction, a cell pellet fraction, and a whey fraction which was further treated by acidification, ultrafiltration or centrifugation. In these studies, the proteins from the various fractions were trypsin-digested, analyzed using 2-D-LC-MS/MS, and compared to the corresponding non-fractionated milk proteome. With this strategy, the authors deepened milk proteome coverage by identifying 69 (17%) additional proteins in the fractionated samples compared to the non-fractionated ones where 334 proteins could be identified (Nissen et al., 2012). However this coverage was achieved at the expense of throughput. We are currently undertaking a vast systems biology project aiming at characterizing milk from two widely-studied bovine breeds: Holstein-Friesian and Jersey. The first step was to optimize the extraction method for the proteomics aspect of the project. Because our literature survey failed to find publications describing attempts to optimize protein extraction from cow milk by comparing several protocols, compounded by the fact that there was no consensus on which protein extraction method to use to analyse the cow milk proteome, we designed an experiment to compare different extraction procedures used to recover as many proteins as possible for their analysis by shotgun LC-MS/MS in a high throughput fashion. To this end, we used three very different methods that have not been used in a gel-free bottom-up approach before to extract proteins from cow's skim milk from two different breeds. Replicates were used during the extraction, digestion, as well as injection steps to assess the reproducibility of the methods. Our null hypothesis was that the three methods would be similar in their major attributes when used to analyse proteins in milk samples from Jersey and Friesian-Holstein cows. These attributes include method efficiency as measured by the concentration of extracted protein, the SDS-PAGE patterns, the number of protein accessions identified following trypsin digestion and nLC-ESI-MS/MS analyses, cost of the extraction procedure and labor requirements for the extraction procedure. Statistical analyses and gene ontology (GO) classification were employed to further highlight commonalities and differences between the three extraction methods. Protein identities were validated using known protein standards subject to the same shotgun nLC-MS/MS treatment. Breed differences are also discussed.

Materials and methods

Milk collection and skim milk recovery

Multiparous Holstein-Friesian cows (coded H) were monitored at Ellinbank Research Centre (Victoria, Australia). Jersey cows (coded J) were kept at Wallacevale (Victoria, Australia). The animals were cared for in accordance with the Australian Code of Practice for the Care and Use of Animals for Scientific Purposes (www.nhmrc.gov.au). DeLaval proportional samplers (DeLaval International, Tumba, Sweden) were used to collect a sample of milk from each cow at each milking. Cows were milked twice daily, at 6:00 and 15:00, and milk was bulked into containers. A 50 mL aliquot of bulk milk samples from Jersey cows and from Holstein-Friesian cows were separately collected on 6, November 2014 and stored on ice at the respective dairy farms and during transport. A total of 440 Holstein-Friesian cows contributed to the vat on that date and cows averaged 139 days in milk. A total of 215 Jersey cows contributed to the vat on that date and cows averaged 140 days in milk. Three 2.0 mL milk samples were aliqoted from each bulk sample and stored at −80°C until use. The experimental design is outlined in Figure 1.
Figure 1

Overview of the experimental workflow. Two full cream milk samples were collected from bulk tanks containing the milk of the whole herd of Holstein-Friesian or Jersey cows milked on that particular day. Following centrifugation of the milk to eliminate the cream, proteins were extracted from skim milk in triplicates (e1-e3) using methods A (urea), B (TCA/acetone), or C (methanol/chloroform). All 18 protein extracts were separated using SDS-PAGE, and their protein concentrations obtained in triplicates using the BCA assay. One hundred microgram proteins of each of the 18 extracts were trypsin-digested using five replicates (d1-d5). All 90 tryptic digests underwent Solid Phase Extraction (SPE) clean-up, ultrafiltration (UF) using a 30 kD MWCO; peptide concentrations were obtained using the BCA assay. One hundred nanogram peptides of each of the 90 digests were randomly injected for nLC-MS/MS analysis in triplicates (i1-i3) thus generating 270 MS result files.

Overview of the experimental workflow. Two full cream milk samples were collected from bulk tanks containing the milk of the whole herd of Holstein-Friesian or Jersey cows milked on that particular day. Following centrifugation of the milk to eliminate the cream, proteins were extracted from skim milk in triplicates (e1-e3) using methods A (urea), B (TCA/acetone), or C (methanol/chloroform). All 18 protein extracts were separated using SDS-PAGE, and their protein concentrations obtained in triplicates using the BCA assay. One hundred microgram proteins of each of the 18 extracts were trypsin-digested using five replicates (d1-d5). All 90 tryptic digests underwent Solid Phase Extraction (SPE) clean-up, ultrafiltration (UF) using a 30 kD MWCO; peptide concentrations were obtained using the BCA assay. One hundred nanogram peptides of each of the 90 digests were randomly injected for nLC-MS/MS analysis in triplicates (i1-i3) thus generating 270 MS result files. Milk samples were skimmed as follows. Frozen full cream milk samples (2.0 mL per tube) were left to thaw at 4°C. Tubes were centrifuged at 4500 rpm for 30 min at 4°C. The skim milk in between the fat layer and the pelleted cells was pipetted (ca. 1.7 mL) and transferred into a fresh 2 mL tube, and this sample immediately underwent extraction.

Protein extraction methods

Figure 1 outlines the experimental design. Three extraction methods were tested on skim milk samples in triplicates (coded e1 to e3), thus yielding 18 protein extracts.

Method A (urea)

The skim milk sample was split into 3 x 0.5 mL aliquots in 2.0 mL tubes. An equal volume (0.5 mL) of Solubilisation Buffer [SB: 6 M urea, 10 mM DTT, 10 mM Tris-HCl pH 8.0, 75 mM NaCl, 0.05% SDS (w:w:v:w:w) in H2O] was added and the mixture was vortexed for 1 min. The tubes were incubated at 30°C for 60 min. A 1 M iodoacetamide (IAA) solution was added to reach a final 20 mM concentration and tubes were left to incubate at room temperature in the dark for 60 min. The tubes were centrifuged at 13,000 rpm for 5 min at room temperature. Protein extracts (hereafter named A) were stored at −80°C until use.

Method B (TCA/acetone)

The skim milk sample was split into 3 × 0.5 mL aliquots in 2.0 mL tubes. A volume of 1.5 mL 10% TCA, 10 mM DTT in ice-cold acetone (w:w:v) was added which produced a precipitate. The tubes were then vortexed for 1 min and incubated overnight at −20°C for precipitation. The tubes were centrifuged for 10 min at 13,000 rpm and −6°C. The supernatants were discarded. A volume of 1.5 mL 10 mM DTT in ice-cold acetone (w:v) was added. Pellets were first broken down using a spatula and further pulverized by vortexing the tubes for 1 min. The tubes were incubated at −20°C for 60 min, and then centrifuged for 10 min at 13,000 rpm and −6°C. The supernatants were discarded. Pellet washing was repeated once more. The pelleted proteins were dried under vacuum in a Speedvac Concentrator (SPD2010 model, Savant) without heat for 60 min and fully resuspended in 0.5 mL SB by vortexing. A 1 M IAA solution was added to reach a final 20 mM concentration and tubes were left to incubate at room temperature in the dark for 60 min. Protein extracts (hereafter named B) were stored at −80°C until use.

Method C (methanol/chloroform)

The skim milk sample was split into 3 × 0.5 mL aliquots in 50 mL tubes. A phase separation extraction procedure adapted from Taylor and Savage (2006) was performed. Briefly, 7.5 mL of chloroform in methanol (1:2) (v:v) was added to the skim milk aliquot and the mixture was vortexed for 1 min. Chloroform (5.0 mL) was added and the mixture vortexed for 1 min. NaCl solution [2.0 mL, (1:10) (w:v)] was added and the mixture vortexed for 1 min. This produced a triphasic solution with a protein interphase. To maximize phase separation, the tube was centrifuged at 5100 rpm for 30 min at room temperature using a swing bucket rotor. Both upper and lower phases were carefully discarded and the remaining wet interphase was transferred into a fresh 1.5 mL tube. The interphase was dried under vacuum using a SpeedVac Concentrator for 60 min. The dry interphase was resuspended by adding 0.5 mL of a SB and letting the interphase slowly reabsorb SB during an overnight incubation at 4°C. Resupension of the interphase was finalized by vortexing for 30 min using a Multi Tube Vortex Mixer (MTV1 model, Ratek) at full speed at room temperature. A 1 M IAA solution was added to reach a final 20 mM concentration and tubes were left to incubate at room temperature in the dark for 60 min. Protein extracts (hereafter named C) were stored at −80°C until use.

Protein assay

The protein concentrations of the skim milk aliquots and milk extracts (1:10 dilution) were assessed in duplicate using the Microplate BCA protein assay kit (Pierce) following the manufacturer's instructions which are based on the method developed by Smith et al. (1985). Bovine Serum Albumin (BSA) was used a standard. For each extract, the recovery rate of protein extraction was computed as a percentage of skim milk protein concentration.

SDS-PAGE

The complexity of milk protein patterns were initially analyzed by SDS-PAGE using pre-cast NuPAGE® Novex gels (4–12% bis-tris acrylamide, 1 mm, 8 × 8 cm, 10 lanes, Life Technologies). A volume of skim milk or protein extract corresponding to 50 μg of proteins was loaded per lane. Samples were diluted with the loading buffer (0.5 M DTT added to 4X NuPAGE LDS Sample Buffer, Life Technologies) to reach a final 20 μL volume and heated at 70°C for 10 min. Samples were loaded on the gels and run using MOPS-SDS running buffer (50 mM MOPS, 50 mM Tris Base, 0.1% SDS 1 mM EDTA, pH 7.7 in H2O) for 35 min at 300V at 4°C until the blue front reached the bottom of the gel. Novex SeeBlueR Pre-Stained Standard (Life Technologies) was loaded in the first lane of each gel to estimate the molecular weight (MW) of the milk proteins and account for gel to gel variation. Gels were stained using a Colloidal Coomassie Blue (CCB) method as follows. Gels were incubated at room temperature for 48 h on an orbital shaker in 200 mL of CCB solution (2% phosphoric acid, 18% ethanol, 15% ammonium sulfate, 1% Brilliant Blue G250 (v:v:w:w) in H2O). Gels were rinsed twice for 30 min in H2O and scanned using a CanoScan 8800F scanner (Canon).

In-solution protein digestion using trypsin protease

Digestions were performed five times (coded d1 to d5) on each protein extract, thus yielding 90 peptide digests. An aliquot corresponding to 100 μg of milk proteins was used for protein digestion as follows. The DTT-reduced and IAA-alkylated proteins were diluted six times using 50 mM ammonium bicarbonate (ABC) to decrease the urea molarity below 1 M. Trypsin protease (Sequencing Grade Modified Trypsin, 20 μg aliquots, Promega) was carefully solubilised in 1 mL of the resuspension buffer supplied by the manufacturer (50 mM acetic acid) and incubated for 15 min at 30°C to maximize its activity. An aliquot of trypsin was added and gently mixed with the milk proteins so as to reach a 1:50 ratio of trypsin:milk proteins. The mixture was left to incubate overnight (19 h) at 37°C in the dark. The digestion reaction was stopped by lowering the pH of the mixture using a 10% formic acid (FA) in H2O (v:v) to a final concentration of 1% FA.

Tryptic digest cleaning, assay, and dilution

The 90 tryptic digests were desalted using solid phase extraction (SPE) cartridges (Sep-Pak C18 1cc Vac Cartridge, 50 mg sorbent, 55–105 μm particle size, 1 mL, Waters) by gravity as follows. The SPE cartridges were conditioned by running 1 mL of 80% acetonitrile (ACN):0.1% FA in H2O (v:v:v) and then washed using 1 mL of 0.1% FA in H2O (v:v). The tryptic digests were loaded onto the cartridges and washed using 1 mL of 0.1% FA in H2O (v:v). Peptides were eluted using 1 mL of 80% ACN:0.1% FA in H2O (v:v:v) into a fresh 1.5 mL tube. The eluent's volume (1.00 mL) was reduced to 0.18 mL using a Speedvac Concentrator without heat, thereby ensuring the complete evaporation of the ACN. Undigested milk proteins were filtered out using ultrafiltration (UF) devices (MWCO 30 kD, 0.5 mL, Amicon Ultra-0.5 centrifugal filter device, Millipore). The filtrates were collected and the peptide concentration was assessed using the Microplate BCA protein assay kit (Pierce), as per the manufacturer's instructions albeit excluding the compatibility reagent step. Bovine Serum Albumin (BSA) was used a standard. An aliquot corresponding to 10 μg of peptide digest was diluted with 0.1% FA in H2O (v:v) to reach a final volume of 100 μL (0.1 μg/μL). The diluted peptide mixture was transferred into a 100 μL glass insert placed into a glass vial. The vials were positioned into the autosampler at 4°C until MS analyses.

Nano-liquid chromatography (nLC)-electrospray ionization (ESI) tandem MS (MS/MS) analyses

The nLC-ESI-MS/MS analyses were performed in triplicates (coded i1 to i3) thus yielding 270 MS files. The coding of the samples at the last stage follows the pattern breed_method_extraction-replicate_digestion-replicate_injection-replicate (e.g., JAe1d1i1 stands for Jersey breed_method A/extraction-replicate 1_digestion-replicate 1_injection-replicate 1). The injection order was randomized to minimize systematic error including chromatographic drift or suppression effects. Chromatographic separation of the tryptic peptides was performed by reverse phase (RP) using an Ultimate 3000 RSLCnano System (Dionex). A 1 μL aliquot (0.1 μg peptide) was loaded using a full loop injection mode onto a trap column (Acclaim PepMap100, 75 μm × 2 cm, C18 3 μm 100 Å, Dionex) at a 3 μL/min flow rate and switched onto a separation column (Acclaim PepMap100, 75 μm × 15 cm, C18 2 μm 100 Å, Dionex) at a 0.4 μL/min flow rate after 3 min. The column oven was set at 30°C. Mobile phases for chromatographic elution were 0.1% FA in H2O (v:v) (phase A) and 0.1% FA in ACN (v:v) (phase B). Ultraviolet (UV) trace was recorded at 215 nm for the whole duration of the nLC run. A linear gradient from 3 to 40% of ACN in 35 min was applied. Then ACN content was brought to 90% in 2 min and held constant for 5 min to wash the separation column. Finally, the ACN concentration was lowered to 3% over 0.1 min and the column re-equilibrated for 5 min. On-line with the nLC system, peptides were analyzed using an Orbitrap Velos hybrid ion trap-Orbitrap mass spectrometer (Thermo Scientific). Ionization was carried out in the positive ion mode using a nanospray source. The electrospray voltage was set at 2.2 kV, and the heated capillary was set at 280°C. Full MS scans were acquired in the Orbitrap Fourier Transform (FT) mass analyser over a normal range of 300–2000 m/z with 60,000 resolution in profile mode. MS/MS spectra were acquired in data-dependent mode. The 20 most intense peaks with charge state ≥ 2 and a minimum signal threshold of 10,000 were fragmented in the linear ion trap using collision-induced dissociation (CID) with a normalized collision energy of 35%, 0.25 activation Q, and activation time of 10 ms. The precursor isolation width was 2 m/z. Dynamic exclusion was enabled, and peaks selected for fragmentation more than once within 10 s were excluded from selection for 30 s. Blanks (1 μL of mobile phase A) were injected in between each peptide digest and analyzed over a 20 min nLC run to further clean the C18 separation column, and minimize carry-over.

Database search for protein identification

Database searching of the 270 MS.RAW files was performed in Proteome Discoverer 1.4 with MASCOT 2.4.1 against both the non-redundant (nr) National Center for Biotechnology Information (NCBI) database with taxonomy as mammalia (2,578,153 entries, released on 7 November 2014, 68th release) and the International Protein Index (IPI) bovine database (23,841 entries, last modified on 4 April 2014, http://www.uniprot.org/proteomes). The database searching parameters specified trypsin as the digestion enzyme and allowed up to two missed cleavages. The precursor mass tolerance was set at 10 ppm, and fragment mass tolerance set at 0.5 Da. Carbamidomethylation (C) was set as a static modification. Oxidation (M), phosphorylation (STY), conversion from Gln to pyro-Glu (N-term Q) and Glu to pyro-Glu (N-term E), and deamination (NQ) were set as dynamic modifications. The target decoy peptide-spectrum match (PSM) validator was used to estimate false discovery rates (FDR). At the peptide level, peptide confidence value set at high was used to filter the peptide identification, and the corresponding FDR on peptide level was less than 1%. At the protein level, protein grouping was enabled. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium (Vizcaíno et al., 2014) via the PRIDE partner repository with the dataset identifier PXD002529. The amino acid sequences of the proteins annotated as “Uncharacterized” were searched using the Basic Local Alignment Search Tool (BLAST) tool of UniProt database (http://www.uniprot.org/blast/) with the default parameters except for the target database which was set at “Mammals.” The best hit is indicated in brackets in Table 1.
Table 1

List of protein accessions identified in the different milk extracts, along with their description, their coverage (percentage of the database protein sequence covered by matching peptides) across samples, the number of unique peptides (distinct peptide that match to a single protein entry within the search database), the occurrence (%) in the three methods and the two breeds.

NoDescriptiongi AccessionIPI accessionΣcoverage (%)Σ# Unique PeptidesOccurrencea in method A (%)Occurrencea in method B (%)Occurrencea in method C (%)Occurrencea in breed H (%)Occurrencea in breed J (%)
111 kDa proteinIPI00843089.352.322100100100100100
2Actin 14501885IPI00698900.116.55207267404773
3Alpha-1-acid glycoprotein94966811IPI00691212.133.71097100277775
4Alpha-1-antiproteinase27806941IPI00695489.124.5109794859887
5Alpha-1B-glycoprotein114053019IPI00692686.113.36939354113
6Alpha-2-HS-glycoprotein27806751IPI00707101.119.288674659062
7Alpha-lactabumin11939369930.83950555051
8Alpha-lactalbumin5057821429.9170050
9Alpha-lactalbumin1103698618.12100100100100100
10Alpha-lactalbumin27805979IPI00717424.131.05355001821
11Alpha-lactalbumin-like isoform X358515393812.33100003634
12Alpha-S1-casein15979322715.81100100100100100
13Alpha-S1-casein29967664.7101001
14Alpha-S1-casein16265014.012212172211
15Alpha-S1-casein isoform X252895323641.347100100100100100
16Alpha-S1-casein isoform X352895323842.247100100100100100
17Alpha-S1-casein isoform X652895324430.25100100100100100
18Alpha-S1-casein isoform X752895324649.247100100100100100
19Alpha-S2-casein27806963IPI00698843.159.529041054
20Alpha-S2-casein14774464624.11100100100100100
21Angiogenin-1118151356IPI00726982.131.857639746957
22Antibody Blv5b85131374259.55516354121
23Anti-hiv llama vhh antibody a123802588376.3100101
24Apolipoprotein A-I preproprotein75832056IPI00715548.138.156990996763
25Apolipoprotein A-II114052298IPI00688815.216.052500143
26Apolipoprotein A-IV82697389IPI00695965.118.71936041216
27Apolipoprotein E27806739IPI00712693.17.312750153032
28ATP-binding cassette sub-family G member 2112817615IPI00690408.57.9801000816259
29Beta-1,4-galactosyltransferase 1IPI00760476.265.12902905429
30Beta-1,4-galactosyltransferase 1116241263IPI00685910.210.02900530
31Beta-2-microglobulin41386683IPI00686769.123.72210082979690
32Beta-casein555980347IPI00697085.123.7800812723
33Beta-casein8340609323.745010003236
34Beta-casein variant I3823152735.9235001213
35Beta-lactoglobulin5403771237.02210001
36Beta-lactoglobulin56289003530.01100100100100100
37Beta-lactoglobulin87196497IPI00699698.141.0185019610
38Beta-lactoglobulin22316547.51910001006864
39Beta-lactoglobulin4925942347.515100003634
40Beta-lactoglobulin37894794025.8110010006870
41Beta-lactoglobulin A219408847.522100003634
42Beta-lactoglobulin-like5937309959.412200213
43Butyrophilin subfamily 1 member A13183510IPI00708535.128.1335100100100100100
44Cathelicidin-127807341IPI00718108.110.31060150
45Cathelicidin-427807337IPI00686754.111.115131121
46CD5L proteinIPI00867131.214.123250812245
47CD81 antigen78042548IPI00685617.16.8989803731
48Cell division control protein 42 homolog isoform 14757952IPI00704257.25.276190331915
49Chemokine (C-X-C motif) ligand 3114050915IPI00721750.214.4995003433
50Clusterin preproprotein27806907IPI00694304.118.0539795999598
51Complement C36026972027.986100100100100100
52Complement C3124056491IPI00713505.234.0152100003634
53Cysteine-rich secretory protein 3118601862IPI00715999.123.479978909187
54dnaJ homolog subfamily B member 9300795871IPI01000889.117.98902013813
55dnaJ homolog subfamily C member 327807457IPI00693007.13.41400110
56Dystroglycan27806449IPI00707359.11.82109113230
57Endopin 2B38683423IPI01017613.116.6792003529
58Epididymal secretory protein E127806881IPI00711862.132.98598086238
59Fab Pgt123 Hiv-1 Neutralizing Antibody4916687375.21561303413
60Factor XIIa inhibitor27807349IPI00710025.115.4710068718278
61Fatty acid synthase4262391651.21180015
62Fatty acid-binding protein50756310614.34099401009464
63Fatty acid-binding protein27805809IPI00691946.247.410980093428
64Fibrinogen alpha chain75812954IPI00691819.18.979528637748
65Fibrinogen beta chain218931172IPI00709763.511.7242753494937
66Fibroblast growth factor-binding protein 127805911IPI00704023.111.559440928763
67Fibronectin isoform X10528940100IPI01028178.10.648400302
68Folate receptor alpha330688394IPI01017673.19.14767405944
69Gelsolin isoform b296484315IPI01017675.11.8190011025
70Glutamyltranspeptidase 1 gamma329664306IPI00705565.26.74102027
71Glycosylation-dependent cell adhesion molecule 127807339IPI00716366.150.339100100100100100
72GTP binding protein Rab1a15281851IPI00883580.117.01180137168
73hCG1791766 isoform CRA_a11962601413.72018472220
74Heat shock cognate 71 kDa protein548494982IPI00708526.25.9656140046
75Heat shock-related 70 kDa protein 2148887197IPI00710052.11.71011985
76Hedgehog interacting protein-like 2-like76669880IPI00700055.211.510080966054
77Hemopexin77736171IPI00690198.43.5620020
78Heparin cofactor 2157280001IPI00688367.12.6605012
79Histatherin242347807IPI00944405.139.721312884428
80Hornerin285571504.1600101
81Ig anti-HIV-161240550412.6110591325
82Ig J chain32401410IPI00701295.129.920991009998100
83Ig lambda light chain1508867521.31010001006864
84Ig lambda light chain constant region 2 allotypic variant IGLC2b34319700430.219810006769
85Ig lambda light chain constant region 3 allotypic variant IGLC3c34319702652.83100100100100100
86Ig lambda-1 variable region4275988213.920019111
87Ig lambda-like polypeptide 1-likeIPI01002118.123.3810001006864
88Ig light chain31089343316.739162778074
89Ig light chain31089343561.41610001006864
90Ig light chain variable region580243611.51100100100100100
91Ig light chain variable region232340041.81204340
92Ig light chain variable region4826731699.4151660106
93Ig light chain, lambda gene cluster92096965IPI00867205.120.11010003236
94IgG1 heavy chain constant region754726616.499895999897
95IgG2a heavy chain constant region169916712.92485454223
96IGK protein115545495IPI00724838.210.01002993429
97IgM heavy chain constant region223229922.753100100100100100
98Inhibitor of carbonic anhydrase-like4706555765.2706031
99Isocitrate dehydrogenase cytoplasmic75056526IPI00702781.212.15001462
100Isocitrate dehydrogenase cytoplasmic75832090IPI01028293.125.914877002826
101Kappa-casein809932440.633161358
102Kappa-casein16280742.442100100100100100
103Kappa-casein170560824.0135100100100100100
104Kappa-casein28402712417.3133100541008881
105Kappa-casein31514301636.8675826243241
106Kappa-casein42875521513.62100100100100100
107Kappa-casein42875521933.3669910010099100
108Kappa-casein28462658.51100100100100100
109Kappa-casein249351420.510100100100100100
110Kappa-casein251198100.01100100100100100
111Kappa-casein15942499.0210010006870
112Kappa-casein A22941657.133100100100100100
113Kininogen-1125505IPI00701166.111.613990265036
114Kininogen-2490IPI00718535.110.440530297
115Kininogen-2 isoform X152893669414.71799003534
116Lactadherin2494285IPI00689035.133.3551006210010075
117Lactoperoxidase27806851IPI00716157.126.1144360362325
118Lactoperoxidase167406705.112100100100100100
119Lactotransferrin5899254304.024100100100100100
120Lactotransferrin30794292IPI00710664.153.113498003533
121Lethal(3)malignant brain tumor-like protein 45057924232.5110120
122Leucine proline-enriched proteoglycan (leprecan) 15079373061.9110111
123Lipopolysaccharide-binding protein84579853IPI00730056.12.3520011
124Lipoprotein lipase115497164IPI00692291.322.01521009910010099
125Lymphocyte antigen 96114051704IPI00689371.26.3930012
126Lymphocyte-specific protein 1 isoform X65870038675.8430012
127Mammaglobin-A155372309IPI00711254.114.11100070
128Mammary serum amyloid A3.2347300329IPI01017597.19.2111710013
129Monocyte differentiation antigen CD14 isoform X1528956860IPI00686931.222.725820684752
130Mucin-141386778IPI00706283.13.113100784337
131Mucin-1541386723IPI00716220.17.31099100949798
132Myristoylated alanine-rich C-kinase substrate148872484IPI00760436.16.0609060
133Neutrophil gelatinase-associated lipocalin isoform X3528912092IPI00685784.330.5112895746566
134Nucleobindin-1115497814IPI00722271.228.11171000384846
135Nucleobindin-2115496067IPI00696729.215.273010003236
136Olfactory receptor 2S2-like7248011962.2104121
137Osteopontin isoform X1528952550IPI00691887.231.7719910010099100
138Pancreatic elastase inhibitor86151466.71100016
139Pancreatic secretory granule membrane major glycoprotein GP234157259723.0841009610010098
140Peptidoglycan recognition protein 127808640IPI00701640.113.75015633713
141Peptidyl-prolyl cis-trans isomerase A47523764IPI00697285.316.510162846
142Peptidyl-prolyl cis-trans isomerase B555965067IPI00702098.46.0853100148
143Peptidyl-prolyl cis-trans isomerase FKBP1A78365305IPI00698916.225.09870022
144Perilipin-2 isoform X2528958213IPI01017444.138.096100100100100100
145Platelet glycoprotein 4521258696IPI00710204.315.5106100100100100100
146Polymeric immunoglobulin receptor3914346IPI00696714.130.02810001006864
147Polymeric immunoglobulin receptor29647936523.921100100100100100
148Polyubiquitin262712921.038007902429
149Prosaposin27806447IPI00718311.35.0542500116
150Prostaglandin-H2 D-isomerase27807521IPI00709683.17.3665941008587
151Protein CREG1115495283IPI00702458.220.411780133033
152Protein HP-25 homolog 1114050753IPI00724799.27.1310010
153Protein HP-25 homolog 2114052108IPI00700655.314.937240274
154Protein inturned6176022411.3110010
155Protein OS-977735409IPI00706896.14.5250122
156Prothrombin135806IPI00710799.12.2761023
157Ras-related protein Rab-11A84000297IPI00695221.311.18210010
158Ras-related protein Rab-18115495023IPI00691826.37.312330012
159Rhophilin-2-like protein RHPN2P11903602252.4201010
160Ribonuclease 432363484IPI00697112.19.2108103025
161Ribonuclease 495006989IPI00760446.128.6431031211
162Ribonuclease pancreatic13321010.51000190310
163Secretoglobin family 1D member118150406IPI00824879.121.64100100100100100
164Selenium-binding protein 1IPI00718529.11.7180334
165Selenoprotein14868008213.06945092314
166Serotransferrin2501351IPI00690534.145.5611009510010097
167Serpin A3-1160332365IPI00700622.412.9321000776751
168Serum albumin1351907IPI01028455.165.140100611008390
169Sodium-dependent phosphate transport protein 2B27807195IPI00703813.15.9792003529
170Sulfhydryl oxidase 1156120795IPI00867237.119.1119827977572
171Superoxide dismutase134289051IPI00847121.111.87301001
172Sushi repeat-containing protein SRPX5079698503.9302624
173Tetranectin114051137IPI00717369.16.9589013429
174Transforming protein RhoA10835049IPI00688998.36.24900110
175Transthyretin27806789IPI00689362.134.724091775950
176Uncharacterized proteinb (P02666 Beta-casein 71% ident.)IPI00712994.333.55180076
177Uncharacterized proteinb (F1MGU7 Fibrinogen gamma-B chain 100% ident.)IPI00843209.117.21699003534
178Uncharacterized proteinb (A5PK72 Uncharacterized protein 92% ident.)IPI01017618.120.61042001020
179Uncharacterized proteinb (A0A0A0MP92 Serpin A3-7 98% ident.)IPI00971595.121.1410650
180Uncharacterized proteinb (Q2HJI6 Granulin 99% ident.)IPI00904166.16.399980415737
181Uncharacterized proteinb (F1MLW7 Uncharacterized protein 97% ident.)IPI00838162.220.91010003236
182Uncharacterized protein LOC524810b (A5D7Q2 Uncharacterized protein 100% ident.)326937675IPI00852509.110.96100100100100100
183Vitamin D-binding protein78369364IPI00823795.18.0203800216
184Xanthine dehydrogenase/oxidase29648268615.616010003236
185Xanthine dehydrogenase/oxidase109940048IPI00695367.314.11610001006864
186Zinc-alpha-2-glycoprotein77735615IPI00698993.111.4211001006467

Proteins whose occurrence varies more than 10% across breeds are highlighted in bold.

Occurrence is defined as the ratio of the number of samples in which the protein was identified over the total number of samples in the method or breed expressed in percent.

Amino acid sequence of uncharacterized proteins were blasted against uniprot database. Best hit is indicated in brackets with the percentage of identity.

List of protein accessions identified in the different milk extracts, along with their description, their coverage (percentage of the database protein sequence covered by matching peptides) across samples, the number of unique peptides (distinct peptide that match to a single protein entry within the search database), the occurrence (%) in the three methods and the two breeds. Proteins whose occurrence varies more than 10% across breeds are highlighted in bold. Occurrence is defined as the ratio of the number of samples in which the protein was identified over the total number of samples in the method or breed expressed in percent. Amino acid sequence of uncharacterized proteins were blasted against uniprot database. Best hit is indicated in brackets with the percentage of identity.

Statistical analyses

The 270 MS.RAW files were post-processed using Genedata Expressionist Refiner 8.1 as follows. The chemical noise was substracted by smoothing chromatograms over 50 scans retention time (RT) window. The intensities were put onto a common m/z-RT adaptive grid over 10 scans. Chromatograms were then aligned using a Pairwise Alignment Based Tree scheme with a 50 scan interval. Chromatograms were averaged using a mean method. Chromatographic peaks were detected using a 1 min summation window and a curvature-based peak detection method. Chromatogram istopes were clustered using a Peptide Isotope Shaping method with 0.05 min RT tolerance and 0.01 Da m/z tolerance. A reference grid was then applied and the reference peaks extracted. A MS/MS consolidation node was performed by filtering MS/MS not in cluster on the highest Total Ion Chromatogram (TIC). Identification results from Proteome Discoverer were then imported and peaks annotated. The resulting peaks were exported to Genedata Expressionist Analyst 8.1 for further statistical analyses. In Analyst, peaks were normalized using an Intensity Drift Normalization method using the randomized injection order. Principal component analyses (PCA) were applied to the normalized peaks using a covariance matrix of row means with 50% valid values. Partial Least Squares analyses (PLS) were performed on row means using the cow breed as a response, three latent factors, and 50% valid values.

Gene ontology (GO) classification

The database search produced two types of accessions: Gene Index (gi) and IPI. International Protein Index accessions were converted to gi accession numbers using the gi2ipi.xrefs file available at the European Bioinformatics Institute website (ftp.ebi.ac.uk/pub/databases/IPI/last_release/current). Gene Ontology terms were retrieved on-line from all gi accessions using UniProtKB Retrieve ID/Mapping tool (http://www.uniprot.org/uploadlists/). Results were exported into Microsoft Excel 2010 and charts generated.

Validation of protein identifications using known standards

In order to confirm the identities of some of the proteins identified in this study either commonly across breeds and extraction methods, or displaying qualitative variation across breeds and/or methods, bovine protein standards were purchased from Sigma from bovine wherever possible otherwise from human. If the bovine derived protein was not available the human protein was obtained. The protein standards include: actin from bovine muscle (A3653-1MG, 80% pure), fibrinogen from bovine plasma (F8630-1G, type I-S, 65–85% pure), lactoferrin from bovine milk (L9507-10MG, 85% pure), kininogen low molecular weight from human plasma (K3628-1MG, 95% pure), α-casein from bovine milk (C6780-250MG, 70% pure), β-casein from bovine milk (C6905-250MG, 98% pure), κ-casein from bovine milk (C0406-250MG, 70% pure), α-lactalbumin from bovine milk (L5385-25MG, 85% pure), β-lactoglobulin from bovine milk (L3908-250MG, contains lactoglobulins A and B, 90% pure), albumin from bovine serum (BSA, A7906-10G, 98% pure). These lyophilised protein standards were fully solubilised at a 10 mg/mL concentration in SB which contained 10 mM DTT. After a 60 min incubation at room temperature, a 1 M IAA solution was added to reach a final 20 mM concentration and tubes were left to incubate at room temperature in the dark for 60 min. These individual standards were combined together in equamolarity to make a mix which was duplicated. This mix was used to spike two milk extracts obtained using method A (JAe1 and HAe3) and chosen because their protein concentrations were the closest to those of the standards. Standard mixtures and extracts JAe1 and HAe3, spiked or unspiked, underwent trypsin digestion as described in Section In-Solution Protein Digestion Using Trypsin Protease by pipetting a volume corresponding to 100 μg of proteins. For the milk extracts spiked with the mix, 100 μg of proteins from milk extracts were spiked with 100 μg of proteins from the mix. A 1:50 ratio of trypsin:standards was used. The subsequent clean-up, nLC-MS/MS, database search steps were rigorously performed as described above in Sections Tryptic Digest Cleaning, Assay and Dilution, Nano-Liquid Chromatography (nLC)-electrospray ionization (ESI) tandem MS (MS/MS) analyses, and Database Search for Protein Identification.

Results

SDS-PAGE patterns, protein concentrations, number of accessions, and nLC-MS runs

Figure 2 displays SDS-PAGE profiles, protein concentrations and number of protein accessions identified for each milk sample across all three sets of protein extracts.
Figure 2

Comparison of SDS-PAGE patterns (top panel), protein concentration (middle panel), and number of protein accessions identified per sample (bottom panel). Error bars are Standard Deviation (SD); the n number displayed at the top right corner of each box represents the number of replicates used for average and SD. Error bars for the protein assay are from the BCA technical triplicates. Error bars for the accession numbers are from 15 replicates (5 digestion replicates × 3 injection replicates). Recovery rates are indicated in percent in the protein assay and are computed relative to protein concentrations in skim milk (SM). SA, Serum Albumin; aCN, alpha-casein; bCN, beta-casein, bLG, beta-lactoglobulin; aLA, alpha-lactalbumin.

Comparison of SDS-PAGE patterns (top panel), protein concentration (middle panel), and number of protein accessions identified per sample (bottom panel). Error bars are Standard Deviation (SD); the n number displayed at the top right corner of each box represents the number of replicates used for average and SD. Error bars for the protein assay are from the BCA technical triplicates. Error bars for the accession numbers are from 15 replicates (5 digestion replicates × 3 injection replicates). Recovery rates are indicated in percent in the protein assay and are computed relative to protein concentrations in skim milk (SM). SA, Serum Albumin; aCN, alpha-casein; bCN, beta-casein, bLG, beta-lactoglobulin; aLA, alpha-lactalbumin. The same amount of proteins was loaded per extract to produce SDS-PAGE profiles, with skim milk as a reference. The electrophoretic patterns were similar from one extraction method to another, albeit extracts C (methanol/chloroform) displayed the best resolution with the sharpest bands. In particular extracts C (methanol/chloroform) were the only ones consistently resolving the very intense 24–26 kD band corresponding to α-caseins, and were therefore more comparable to skim milk profiles than extracts A (urea) and B (TCA/acetone). This band was either missing or very faint on SDS-PAGE profiles of extracts A (urea) and B (TCA/acetone), except for extract JBe2. Protein assays were performed in triplicate, using skim milk as a reference to compute recovery percentage (%) following protein extraction procedure. Protein concentrations were converted to mg per mL of milk. Jersey cow milk had a greater protein concentration (24.9 mg/mL) than Holstein-Friesian cow milk (20.7 mg/mL). This is consistent with the literature which also reports higher concentrations of milk fats in Jersey breed than Holstein-Friesian's (Arnould and Soyeurt, 2009; Capper and Cady, 2012; Jensen et al., 2012b). All methods considered, protein concentrations ranged from 10.1 (HBe3) to 22.7 (JC1) mg/mL. Standard deviation (SD) was less than 10% of the mean. On average, protein concentrations for Jersey breed were 18.5 (±1.3) mg/mL (73%), 16.7 (± 2.3) mg/mL (68%), and 21.6 (±1.1) mg/mL (87%), respectively for extracts A (urea), B (TCA/acetone) and, C (methanol/chloroform). On average, concentrations for Holstein-Friesian breed were 16.1 (±0.9) mg/mL (80%), 10.5 (± 0.5) mg/mL (51%), and 17.5 (±0.8) mg/mL (83%), respectively for extracts A (urea), B (TCA/acetone), and C (methanol/chloroform). Figure 2 shows that method C (methanol/chloroform) yielded the highest protein concentrations substantiated by the highest recovery rate, followed by method A (urea), while method B (TCA/acetone) resulted in the lowest concentrations particularly for Holstein-Friesian breed. The number of unique proteins accessions identified per extract is indicated in Figure 2. All methods considered, number of identifications ranged from 48 (JBe2d4i1) to 93 (HAe2d2i2). On average for Jersey breed, there were 82.3 (±4.4), 55.5 (±4.2), and 69.1 (±4.2) protein accessions identified, respectively for extracts A, B, and C. On average for Holstein-Friesian breed, there were 81.6 (±4.3), 66.6 (±4.2), and 72.8 (±3.3) protein accessions identified, respectively for extracts A (urea), B (TCA/acetone), and C (methanol/chloroform). Unexpectedly, while extracts C (methanol/chloroform) produced the highest recovery rate and the highest concentrations relative to extracts A (urea) and B (TCA/acetone), they generated less unique accessions than extracts A (urea), albeit more than B (TCA/acetone). Figure 3 shows TICs of three tryptic digests from Holstein-Friesian and Jersey breeds illustrating the effect of the extraction methods. Peptides eluted from around 10 to 42 min. The three methods generated distinct TICs, with method C (methanol/chloroform) displaying peaks with higher resolution than methods A (urea) and B (TCA/acetone). The peaks eluting from 19.5 to 20.5 min, and 22.5 to 24 min, and which were the most intense in samples processed using method C (methanol/chloroform), yielded several peptides from α-S1-caseins. TICs are much more comparable across breeds than across methods because the elution patterns look similar, yet subtle differences can be seen in Figure 3 between the left and right panels, particularly with respect to the relative abundance of the chromatographic peaks. This is an indication that protein complexity varies between Holstein-Friesian and Jersey breeds, not only in a quantitative manner, as demonstrated with the protein concentrations, but also qualitatively. Indeed, different proteins will produce different tryptic peptides. This carries through to PCA plots as they were derived from chromatographic peaks, as illustrated below.
Figure 3

Total Ion Chromatograms (TIC) of three tryptic digests illustrating the effect of extraction method for Holstein-Friesian (left panel) or Jersey (right panel) breed. A TIC represents the summed intensity across the entire range of masses being detected at every point in the analysis. The duration of each nLC run is 50 min (x-axis), with tryptic peptides eluting from 10 to 42 min. Relative abundance (percent relative abundance with respect to the ion of highest abundance along the y-axis) of the most intense chromatographic peaks are comparable across methods. Most abundant peaks elute toward the end of the nLC run (27–38 min) for methods A (urea) and B (TCA/acetone), while they are evenly distributed along the whole elution pattern (11–38 min) for method C (methanol/chloroform). Subtle differences in peptide elution are visible between Holstein-Friesian (left panel) and Jersey (right panel) breeds. The nomenclature of each TIC exemplified here is explained in the Materials and Method Section and in Figure 1.

Total Ion Chromatograms (TIC) of three tryptic digests illustrating the effect of extraction method for Holstein-Friesian (left panel) or Jersey (right panel) breed. A TIC represents the summed intensity across the entire range of masses being detected at every point in the analysis. The duration of each nLC run is 50 min (x-axis), with tryptic peptides eluting from 10 to 42 min. Relative abundance (percent relative abundance with respect to the ion of highest abundance along the y-axis) of the most intense chromatographic peaks are comparable across methods. Most abundant peaks elute toward the end of the nLC run (27–38 min) for methods A (urea) and B (TCA/acetone), while they are evenly distributed along the whole elution pattern (11–38 min) for method C (methanol/chloroform). Subtle differences in peptide elution are visible between Holstein-Friesian (left panel) and Jersey (right panel) breeds. The nomenclature of each TIC exemplified here is explained in the Materials and Method Section and in Figure 1.

Reproducibility

Figure 4 illustrates a complete set of 15 replicates resulting from one extract for each method (5 individual digestions and three randomized repeated injections). Apart from the first and last peaks, TICs are very reproducible within a method, particularly within a set of 3 repeated injections (i1, i2, and i3).
Figure 4

TIC of 45 Jersey tryptic digests illustrating the reproducibility at digestion (5 replicates) and injection (3 replicates) levels, for methods A (urea) (15 replicates), B (TCA/acetone) (15 replicates), and C (methanol/chloroform) (15 replicates). TICs of each set of three randomized repeated injections are alternatively black or gray. The x-axis represents the duration of the nLC run in min, while the y-axis represents the relative abundance of the chromatographic peaks which corresponds to the percent relative abundance with respect to the ion of highest abundance. With the exception of the inconsistent peptides eluting very early (10–12 min) or very late (39–42 min) during the 50 min nLC run, TICs are very reproducible across technical replicates, within a particular method. The nomenclature of each TIC exemplified here is explained in the Materials and Method Section and in Figure 1.

TIC of 45 Jersey tryptic digests illustrating the reproducibility at digestion (5 replicates) and injection (3 replicates) levels, for methods A (urea) (15 replicates), B (TCA/acetone) (15 replicates), and C (methanol/chloroform) (15 replicates). TICs of each set of three randomized repeated injections are alternatively black or gray. The x-axis represents the duration of the nLC run in min, while the y-axis represents the relative abundance of the chromatographic peaks which corresponds to the percent relative abundance with respect to the ion of highest abundance. With the exception of the inconsistent peptides eluting very early (10–12 min) or very late (39–42 min) during the 50 min nLC run, TICs are very reproducible across technical replicates, within a particular method. The nomenclature of each TIC exemplified here is explained in the Materials and Method Section and in Figure 1. Principal Component Analyses of the MS data highlighted the reproducibility of the individual extraction methods while showing there were clear differences between the different methods (Figure 5). Principal Component (PC) 1 explained 19.9% of variance and clearly separated method A (urea) from method B (TCA/acetone). Principal Component 2 explained 13.8% of variance and set method C (methanol/chloroform) well apart from the other two methods. Within each method, all replicates clustered together whether it be at the extraction, digestion or injection levels. Within methods, cow breeds did not cluster together; it was evident within method A (urea) where Holstein-Friesian and Jersey breeds bear two different shades of colors that seldom mix. Breed explained 2.1% of the variance along PC7. On the plot PC1 against PC7, methods and breed were clearly separated. The effect of both cow breed and extraction method on protein analyses was further explored by PLS using only peaks which successfully led to protein identifications during database search (Figure 6). Plots of Latent Variable (LV) 1 (22.7%) against LV2 (16.7%) discriminated between breeds and methods, displaying 6 tight clusters for JA, JB, JC, HA, HB, and HC.
Figure 5

Principal component analyses (PCA) plots along Principal Component (PC) 1 against PC2 (left panel), and PC1 against PC7 (right panel). Together PC1 (19.9%) and PC2 (13.8%) explain 33.7% of the total variance and clearly separate the three methods. Within each method, all replicates cluster together whether it be at the extraction, digestion or injection levels. Breed explain 2.1% of the variance along PC7. On the plot PC1 against PC7, methods and breed are well-separated.

Figure 6

Partial Least Square (PLS) analysis plots along Latent Variable (LV) 1 against LV2. Together LV1 (22.7%) and LV2 (16.7%) explain 39.4% of the total variance, with a clear separation of breeds and methods, and displaying six tight clusters for JA, JB, JC, HA, HB, and HC.

Principal component analyses (PCA) plots along Principal Component (PC) 1 against PC2 (left panel), and PC1 against PC7 (right panel). Together PC1 (19.9%) and PC2 (13.8%) explain 33.7% of the total variance and clearly separate the three methods. Within each method, all replicates cluster together whether it be at the extraction, digestion or injection levels. Breed explain 2.1% of the variance along PC7. On the plot PC1 against PC7, methods and breed are well-separated. Partial Least Square (PLS) analysis plots along Latent Variable (LV) 1 against LV2. Together LV1 (22.7%) and LV2 (16.7%) explain 39.4% of the total variance, with a clear separation of breeds and methods, and displaying six tight clusters for JA, JB, JC, HA, HB, and HC.

Protein identities

Table 1 lists all the unique protein accessions and reports in which method/breed they were identified. Accessions that were unique to a particular set of extracts or conversely shared among samples were summed and plotted as a Venn diagram (Figure 7). Numbers of unique accessions sorted as follows: 149, 110, and 125, respectively for extracts A (urea), B (TCA/acetone) and, C (methanol/chloroform). A total of 71 protein accessions were common to all methods. Methods A (urea) and C (methanol/chloroform) shared a large number of protein identities (76); 61 accessions were shared between extracts A (urea) and B (TCA/acetone); 37 accessions were shared between extracts B (TCA/acetone) and C (methanol/chloroform). Such representation highlighted the fact that as different as methods A (urea), B (TCA/acetone) and, C (methanol/chloroform). were from each other, they recovered the same types of proteins from skim milk samples. In total, 186 different protein accessions were identified across all methods. Identities common to all three sets of extracts include: caseins (α-S1, α-S2, β, and κ forms), lactoferrin, albumin, β-lactoglobulin, α-lactalbumin, complement C3, and butyrophilin. This was expected as these proteins are the most abundant in milk. Yet proteins present in low abundance in milk were also identified, such as enzymes and minor glycoproteins, as well as many immunoglobulins (Igs), antibodies, and antigens.
Figure 7

Venn diagram of the number of unique protein accessions and Gene Ontology (GO) classification of known proteins per extraction method. A, method A (urea); B, method B (TCA/acetone); C, method C (methanol/chloroform); AB, methods A and B combined; AC, methods A and C combined; BC, methods B and C combined; ABC, methods A, B, and C combined. On the histograms illustrating GO classifications, the x-axis represents the square root of the number of proteins belonging to each of the classes distributed along the y-axis. The insets illustrate the histograms of the sub-classes of the GO class containing the greatest number of proteins.

Venn diagram of the number of unique protein accessions and Gene Ontology (GO) classification of known proteins per extraction method. A, method A (urea); B, method B (TCA/acetone); C, method C (methanol/chloroform); AB, methods A and B combined; AC, methods A and C combined; BC, methods B and C combined; ABC, methods A, B, and C combined. On the histograms illustrating GO classifications, the x-axis represents the square root of the number of proteins belonging to each of the classes distributed along the y-axis. The insets illustrate the histograms of the sub-classes of the GO class containing the greatest number of proteins. Gene Ontology classifications of known proteins are presented in Figure 7. All considered, classifications were very similar across methods, with method B (TCA/acetone) generally displaying the smallest number of proteins per category. As expected the most prominent protein category in the “Cellular Component” classification was the “extracellular region” as most milk proteins are secreted. The inset further details such components without revealing much difference across methods. Method B (TCA/acetone) had a unique “nuclear lumen” component due to ribonucleases, however it lacked the “intracellular organelle lumen.” Most “Molecular Functions” of identified proteins fell into the binding category, further detailed in Figure 7 inset. Method B (TCA/acetone) was lacking the “selenium binding” activity of selenium-binding protein 1. Method A (urea) was lacking the “laminin binding” function as it was depleted of dystroglycan. The peptidoglycan receptor activity was only found in methods B (TCA/acetone) and C (methanol/chloroform) and was associated to peptidoglycan recognition protein 1. The most prevalent Biological Process was “single-organism process,” detailed in the inset of Figure 7.

Protein validation

Using known protein standards, an independent experiment was designed on one hand to validate our shotgun nLC-MS/MS bottom-up approach and on the other hand, to confirm some of the proteins identified in our milk samples. To this end, actin, fibrinogen, lactoferrin, kininogen, α-casein, β-casein, κ-casein, α-lactalbumin, β-lactoglobulin, and BSA were purchased, and reconstituted in SB at the same concentration (10 mg/mL). These proteins were chosen as they displayed differences across methods and/or breeds. These standards were trypsin-digested individually and in combination, prior to analysis by shotgun nLC-MS/MS. Table 2 summarizes the identification results in the mixture of protein standards combined prior to trypsin digestion. Because their level of purity varied from 65 to 98%, our shotgun nLC-MS/MS approach identified proteins other than the known standards. The expected proteins were correctly identified with high scores (from 66 to 3415) and a mimimum of two Peptide Spectrum Matches (PSM, from 2 to 207) thereby validating our bottom-up identification method. For each of the know protein standard, a peptide was chosen, its Extracted Ion Chromatogram (EIC) was produced and compared across the standard mixture, Jersey and Holstein tryptic digests (Figure 8). The peptides were successfully found in all digests from the standard mixture and the milk sample; and they eluted at comparable retention times. This validates the protein identities from cow's milk samples.
Table 2

List of protein accessions identified in the combined protein standards, along with their description, their score, coverage, the number of peptides identified per protein, the number of peptide spectrum matches (PSM), the size of the protein (AA and MW) and their theoretical isoelectric point (calc. pI).

No.AccessionDescriptionScoreCoverage (%)# Peptides# PSM# AAsMW [kDa]calc. pI
1IPI00843089.311 kDa protein124.229.02210711.17.99
2306440544Actin746.428.692137141.35.22
374267962Albumin ALB protein [Bos taurus]1905.651.7296060769.26.25
4IPI00691212.1Alpha-1-acid glycoprotein84.315.43320223.25.87
511036998Alpha-lactalbumin [Physeter catodon]94.012.0129210.44.59
627805979; IPI00717424.1Alpha-lactalbumin precursor [Bos taurus]858.831.0185114216.25.14
7528953238Alpha-S1-casein isoform X3 [Bos taurus]1009.140.883720623.65.19
8528953244Alpha-S1-casein isoform X6 [Bos taurus]639.030.252219922.64.79
9528953246Alpha-S1-casein isoform X7 [Bos taurus]1011.847.583717720.25.41
10159793227Alpha-S1-casein, partial [Bos taurus]154.415.8239510.94.56
1127806963; IPI00698843.1Alpha-S2-casein precursor [Bos taurus]650.638.7196422226.08.43
12513137422Antibody Blv1h12221.523.23521622.56.11
13114052298; IPI00688815.2Apolipoprotein A-II precursor [Bos taurus]73.416.02210011.28.10
1447564119; IPI00689034.1APOLIPOPROTEIN C-III precursor [Bos taurus]108.316.7249610.75.11
1541386683; IPI00686769.1Beta-2-microglobulin precursor [Bos taurus]148.918.66911813.78.00
16223165Beta-lactoglobulin736.448.292616218.34.92
1787196497; IPI00699698.1Beta-lactoglobulin precursor [Bos taurus]1112.438.8177917819.95.02
18593730995Beta-lactoglobulin-like [Physeter catodon]65.69.42218020.35.94
191351907; IPI01028455.1Bovine serum albumin3415.069.47820760769.26.18
20124056491; IPI00713505.2Complement C3179.64.510101661187.16.84
216980814Fibrinogen908.632.6112139042.77.97
2275812954; IPI00691819.1Fibrinogen alpha chain precursor [Bos taurus]1601.741.1408061567.07.17
231346006Fibrinogen beta chain741.237.6132046853.38.19
24488508027Fibrinogen beta chain isoform 2 [Dasypus novemcinctus]126.48.83343350.07.39
25229156Fibrinopeptide B152.471.414212.44.44
26528940100; IPI01028178.1Fibronectin isoform X10 [Bos taurus]292.95.014142268249.05.63
27113912055; IPI00695142.3Glycoprotein 2 (zymogen granule membrane) [Bos taurus]81.72.82253459.24.82
28343197008Immunoglobulin lambda light chain IGLC2c [Bos taurus]160.642.53410611.48.59
29310893435Immunoglobulin light chain [Bos taurus]118.930.72210110.46.48
301705608Kappa-casein472.722.442519221.55.81
31284626Kappa-casein–bovine171.624.5213536.05.11
32315143016Kappa-casein [Bos indicus]2183.536.859014415.96.77
33284027124Kappa-casein [Ovis vignei]464.015.443216218.06.15
34229416Kappa-casein para kappaA2020.946.747910512.38.78
35162807Kappa-casein precursor, partial [Bos taurus]589.342.44289910.65.24
36146386372Kininogen [Oryctolagus cuniculus]104.63.61330233.36.52
3727806851; IPI00716157.1Lactoperoxidase precursor [Bos taurus]58.81.52271280.68.54
38586476652Lactotransferrin [Chrysochloris asiatica]233.34.12670878.08.51
3930794292; IPI00710664.1Lactotransferrin precursor [Bos taurus]2949.157.28817970878.08.32
40528912092; IPI00685784.3Neutrophil gelatinase-associated lipocalin isoform X3 [Bos taurus]168.816.04620023.09.17
41528961411; IPI00866916.1Plasminogen isoform X3 [Bos taurus]103.24.63472481.57.96
423914346; IPI00696714.1Polymeric immunoglobulin receptor130.95.95575782.47.27
4395006989; IPI00760446.1Ribonuclease 4 precursor [Bos taurus]122.228.66614716.98.85
44118150406; IPI00824879.1Secretoglobin family 1D member precursor [Bos taurus]83.121.64410211.38.73
452501351; IPI00690534.1Serotransferrin117.97.76670477.77.08
46554537890Serotransferrin [Myotis brandtii]70.84.12271377.97.05
4724119203; IPI00714405.3Tropomyosin alpha-3 chain isoform 2 [Homo sapiens]51.44.02224829.04.78
48999627Trypsin77.522.023828.87.30
49IPI00843209.1Uncharacterized protein228.218.77744350.25.72
50IPI00712994.3Uncharacterized protein532.429.262316118.35.19
51556760750Uncharacterized protein LOC102338350 [Pantholops hodgsonii]60.04.11424528.19.70

As the purity level of each standard varied, expected known proteins are highlighted in bold.

Figure 8

Validation of protein identities using known protein standards. One peptide per standard was selected and Extracted Ion Chromatograms (EICs) were produced and compared across the standard mixture, Jersey bulk milk, and Holstein bulk mik tryptic digests. Retention times (RT) are comparable across samples. The MS/MS spectrum of the selected peptide is displayed below the EICs. Insets indicate the proteins to which this peptide belongs, the AA sequence of the selected peptide, its m/z, charge state and RT. (A), peptide from beta actin; (B), peptide from alpha S1 casein; (C), peptide from alpha S2 casein; (D), peptide from beta casein; (E), peptide from kappa casein; (F), peptide from alpha lactalbumin; (G), peptide from beta lactoglobulin; (H), peptide from bovine serum albumin; (I), peptide from fibrinogen; (J), peptide from kininogen; (K), peptide from lactotransferrin.

List of protein accessions identified in the combined protein standards, along with their description, their score, coverage, the number of peptides identified per protein, the number of peptide spectrum matches (PSM), the size of the protein (AA and MW) and their theoretical isoelectric point (calc. pI). As the purity level of each standard varied, expected known proteins are highlighted in bold. Validation of protein identities using known protein standards. One peptide per standard was selected and Extracted Ion Chromatograms (EICs) were produced and compared across the standard mixture, Jersey bulk milk, and Holstein bulk mik tryptic digests. Retention times (RT) are comparable across samples. The MS/MS spectrum of the selected peptide is displayed below the EICs. Insets indicate the proteins to which this peptide belongs, the AA sequence of the selected peptide, its m/z, charge state and RT. (A), peptide from beta actin; (B), peptide from alpha S1 casein; (C), peptide from alpha S2 casein; (D), peptide from beta casein; (E), peptide from kappa casein; (F), peptide from alpha lactalbumin; (G), peptide from beta lactoglobulin; (H), peptide from bovine serum albumin; (I), peptide from fibrinogen; (J), peptide from kininogen; (K), peptide from lactotransferrin.

Discussion

The intended aim of this study was to establish a procedure to extract proteins from cow milk with minimum steps prior to protein digestion and shotgun nLC-MS/MS analyses, which yielded high protein concentration and was reproducible. To this end, three extraction methods were performed on skim milk samples from Jersey and Holstein-Friesian cows, resulting in three protein extracts. These methods were chosen because they were based on very different chemistries yet were simple enough to be performed in a high-throughput fashion as discussed below. As far as we know, these methods have not previously been applied to bottom-up proteomics of samples of cow milk. Method A (urea) merely consisted of a 50% dilution of skim milk samples with an urea-based solubilisation buffer. Urea is a common chaotrope used in the solublization and denaturation of proteins; by unfolding proteins urea uncovers buried disulphide bonds accessible to reduction and modification. The solubilisation buffer contained DTT to reduce protein disulfide bridges over the 30 min incubation at 30°C, while incubation temperature was purposefully kept well below 35°C so as to inhibit the carbamylation of proteins which may occur in presence of urea. Reduced disulphide bridges were further stabilized using the alkylating reagent IAA. The solubilisation buffer also contained the anionic detergent SDS which disaggregates casein micelles as well as NaCl which influences their physico-chemical stability. The solubilisation buffer was buffered at pH 8.0 using Tris-HCl to improve the stability of denatured/reduced milk proteins. Similar procedures have been employed in which full cream milk samples were skimmed and diluted in a different urea-based buffers prior to 2-DE; such buffers contained 8 M urea, 40 mM Tris, 2 or 4% CHAPS, 50 or 65 mM DTT, and 0.2 or 2% ampholytes as they improve protein focusing during IEF (Boehmer et al., 2008; Jensen et al., 2012a; Yang et al., 2013). We cannot compare the efficiency of extraction of our method A to that of the reports afore-mentioned as the downstream analytical method employed here is different. Furthermore, these reports did not aim at improving protein extraction. Yet, most of the proteins identified by Boehmer et al. (2008) and Jensen et al. (2012a) were also identified in extracts A. In the present study, of the three methods, method A (urea) was by far the simplest and the quickest necessitating only one dilution step, therefore introducing the least variation due to sample handling. However, because method A (urea) does not include a centrifugation step and produced a fully soluble extract devoid of precipitate, it should not remove non-protein compounds thus potentially interfering with subsequent steps. Method B (TCA/acetone) resulted from a simple acetone precipitation procedure under cold, reducing and acidic pH conditions, commonly used in proteomics notably on plant and fungal tissues (Vincent et al., 2005, 2007, 2009, 2012a,b; Vincent and Solomon, 2011) and known as a TCA/acetone precipitation. Acetone reduces the dielectric constant of water and displaces the water molecules surrounding proteins during precipitation, thereby leading to strong hydrophobic interactions between proteins followed by aggregation. The addition of TCA lowers the pH and promotes hydrophobic aggregation by not only disrupting the solvation layers of the proteins but also furthering protein denaturation thereby exposing more hydrophobic surface to the solvent. Dithiothreitol reduces disulfide bonds. Solvent precipitation must be performed at subzero temperatures in order to minimize protein degradation. By removing solvent-soluble compounds such as polar metabolites, method B (TCA/acetone) should result in protein-enriched extracts. While we could not find a publication reporting the use of TCA/acetone to extract cow milk proteins, an acetone-precipitation method was applied to study the inflammation of bovine mammary glands in full cream milk samples as part of the iTRAQ extraction and labeling procedure, resulting in the quantitation and identification of up to 169 proteins (Danielsen et al., 2010). It is possible that more proteins could have been recovered by using a different extraction method, however, iTRAQ manufacturer imposes such acetone precipitation. In another instance, proteins were removed by acetone precipitation prior to Carbograph-4 cartridge elution in order to enrich aflatoxin M1 levels in milk samples (Cavaliere et al., 2006). Again more proteins could have been targeted by using a different removal method, yet, acetone is a solvent compatible with graphitized carbon black cartridge. Therefore, whether used as an enrichment method or a depletion method, acetone successfully precipitated proteins in both studies cited above. Most proteins identified by Danielsen et al. (2010) were also identified in extracts B. Whilst straightforward, method B (TCA/acetone) involved a precipitation step and two washing steps, interspersed with centrifugation steps which made this protocol more labor-intensive, time-consuming and subject to more variation than method A (urea). Method C (methanol/chloroform) (Taylor and Savage, 2006) arose from modifications brought to the Bligh-Dyer chloroform/methanol partition procedure (Bligh and Dyer, 1959). This protocol was initially designed to rapidly extract lipids from wet cod fish muscles, which contain 80% water and 1% lipids. It operates on the principle that the water contained in the sample becomes miscible with a chloroform/methanol solution (1:2 by volume). Further addition of one volume of chloroform and one volume of water creates a biphasic partition where the lipids solubilise in the chloroform layer whereas the non-lipid compounds go into the methanolic layer. The original Bligh-Dyer procedure was subsequently modified by substituting water with a 8% NaCl solution (Taylor and Savage, 2006), thus blocking the binding of some acidic lipids to denatured lipids. This method was successfully applied to recover fatty acids from mussel tissues (Taylor and Savage, 2006). As methanol is a solvent used in proteomics to precipitate proteins, notably following phenol extraction (Vincent et al., 2006, 2009), and because most proteins are insoluble in chloroform, partition protocols such as method C (methanol/chloroform) produce an interphase between the lower chloroform layer and the upper methanol layer that contains milk proteins and is free of most lipids and metabolites, therefore purifying proteins from non-protein compounds. Method C (methanol/chloroform) is routinely used in our lab to extract fatty acids in the chloroform phase from full cream milk samples prior to GC-MS analyses (Ezernieks et al., unpublished data) while polar metabolites are recovered from the methanol phase to undergo LC/MS analysis (Elkins et al., unpublished data). To our knowledge, method C (methanol/chloroform) has never been applied to recover proteins from milk samples. However, comparable methods have been employed as exemplified hereafter. Touati et al. (1992) demonstrated that, in chloroform/methanol solution (1:1 by volume), the solubility of caseins and β-lactoglobulins varied in a pH dependent fashion as it affected the neutralization of milk protein polar functions. More recently, following chloroform/methanol extraction, the milk fat globule membrane fraction displaying the highest anti-rotavirus activity was shown to be highly non-polar and devoid of proteins (Fuller et al., 2013). Method C (methanol/chloroform) was as time-consuming as method B (TCA/acetone), yet more intricate as it required the recovery of the protein interphase. In our hands, the use of a swing bucket rotor during the centrifugation step instead of a fixed-angle rotor (data not shown) increased interphase stability so much so that the paper-thin interphase could be gently pushed aside while the upper and lower liquid phases were tipped out. Method C (methanol/chloroform) involved various steps possibly impacting reproducibility. It also used greater extraction solution volumes than methods A (urea) and B (TCA/acetone), which necessitated larger tubes to the detriment of throughput during the centrifugation step. Placed into a systems biology context, method C (methanol/chloroform) is highly advantageous as it allows the recovery of polar, non-polar metabolites and proteins in one step. This would allow proteomics and metabolomics studies to be conducted on the same sample. If we were to compare the three methods based on their duration and cost, again method A (urea) would outperform the other two methods as it takes much less time, effort and money to complete the protein extraction from milk samples. The time required for method A (urea) is 2.5 h whereas the time required for method B (TCA/acetone) or C (methanol/chloroform) involves 5 h extraction and overnight incubation. Methods B (TCA/acetone) and C (methanol/chloroform) are as time-consuming. Method C (methanol/chloroform) is more labor-intensive and requires more skills, particularly when recovering the protein interphase. Furthermore, as opposed to method A, methods B and C include centrifugation steps which not only limit the throughput of the protocols but also add time. The cost, based on chemicals, associated with method A (urea) is minimum ($0.09 per sample) as opposed to method B (TCA/acetone) which costs fifty times more than method A ($4.45 per sample) and method C (methanol/chloroform) which costs 12 times more than method A ($1.05 per sample). Method B (TCA/acetone) is four times more expensive than method C (methanol/chloroform). In its principle, method A (urea) did not seek to enrich protein content like method B (TCA/acetone) or to purify proteins like method C (methanol/chloroform). Method C (methanol/chloroform) outperformed the other methods when SDS-PAGE patterns, protein concentration and protein recovery rates were considered regardless of the breed, thus confirming that cow milk proteins were more specifically extracted by a tri-phasic partition procedure. Following extraction, the same amount of proteins underwent trypsin digestion per extract, thereby eliminating concentration variations across methods and breed. Digestion and subsequent clean-up steps using SPE and UF of the tryptic peptides were performed uniformly in a rigorous manner for all samples. Differing greatly in their chemistry, each method produced distinct chromatograms during the nLC-MS/MS analyses. Method A (urea) yielded the greatest number of accessions relative to methods B (TCA/acetone) and C (methanol/chloroform), suggesting that extracts A (urea) were compatible with the various steps post-extraction. While method C (methanol/chloroform) was superior to the other methods in most respects as demonstrated by protein assays, SDS-PAGE patterns consistently exhibiting the most prominent proteins, α-caseins, and TICs, it did generate fewer accessions than Method A (urea). We could hypothesize that the preponderance of α-caseins masked the presence of other proteins, and were preferentially targeted during trypsin digestion to the detriment of minor proteins. Indeed the most prominent chromatographic peaks of samples processed using method C (methanol/chloroform) eluted tryptic peptides from α-S1-caseins, the most abundant of all milk proteins. Method B (TCA/acetone) consistently yielded the least optimum results showing little compatibility with downstream analyses; consequently we do not recommend its application for milk samples. All three methods were highly reproducible as demonstrated by the TICs traces and PCA plots, with overall samples originating from method A (urea) generated tighter clusters. This probably arose from the fact that method A (urea) had less steps than methods B (TCA/acetone) and C (methanol/chloroform), therefore less subject to experimental variation. Functional classification did not highlight categories unique to a method because as different as the three methods were, they recovered similar proteins, extracts B (TCA/acetone) generally having less of them. Based on these findings, we reject our hypothesis that all methods are similar in terms of their major attributes, and we recommend either method A (urea) or method C (methanol/chloroform) to extract proteins from cow milk samples in gel-free bottom-up approach. As expected in our study, the most abundant milk proteins were identified across all three methods: caseins (α-S1-, α-S2-, ß-, and κ-forms), ß-lactoglobulin, α-lactalbumin, lactoferrin, and lactoperoxidase. Apart from the major milk proteins, many immunoglobulins (Igs) were also identified. These immunoglobulins belonged to the main classes IgG1, IgG2, IgA, and IgM. Immunoglobulins protect both cow udders and offspring from microbial infections and their abundances fluctuate with cow species, breed, age, stage of lactation, and health status (reviewed in Marnila and Korhonen, 2011). Other proteins involved in the bovine immune defense system identified in the present study included ß2-microglobulin and osteopontin (Wynn et al., 2011 for review). A number of enzymes were found in our protein samples, including the well-studied lipoprotein lipase (LPL). Lipoprotein lipase is a glycoprotein involved in fatty acid synthesis and triggering rancidity in milk and its derivative products (Deeth, 2011). Another enzyme was sulfhydryl oxidase (SOx) which catalyses the disulphide bond formation essential to the three-dimensional structure of proteins (reviewed in Farkye and Bansal, 2011). Another enzyme was identified in all three methods and both cow breeds, xanthine dehydrogenase/oxidase (XOR) which commonly occurs in the milk fat globule membrane (MFGM). Xanthine dehydrogenase/oxidase enzymatic role makes it a source of reactive oxygen and nitrogen species; XOR also displays antimicrobial activities (reviewed in Harrison, 2011). Another enzyme identified in this work is β-1,4-galactosyltransferase 1 (Gal-T1) involved in the synthesis of complex carbohydrates decorating glycoproteins and glycolipids and whose affinity for its substrates is regulated by α-lactalbumin, which is also a glycoprotein (Brew, 2011). Beta-1,4-galactosyltransferase 1 was identified in all extracts and breeds along with various glycoproteins (butyrophilin subfamily 1 member A1, lactadherin, lactotransferrin, lactoperoxidase, mucins 1 and 15, Igs, α-1-acid glycoprotein, α-1B-glycoprotein, α-2-HS-glycoprotein, pancreatic secretory granule membrane major glycoprotein GP2, platelet glycoprotein 4, Zn-α-2-glycoprotein) as well as glycosylation-dependent cell adhesion molecule 1, dystroglycan, and peptidoglycan recognition protein 1. The prominence of glycoproteins in cow milk was reflected in our results, yet surprisingly little is known about their biological funtions; the carbohydrate moieties play an essential communication role in numerous cellular processes (O'Riordan et al., 2014). Several studies have compared top-down analyses of intact milk protein variants from Holstein-Friesian and Jersey breeds (Jensen et al., 2012a,b; Poulsen et al., 2013; Gustavsson et al., 2014). These studies focussed on the most abundant proteins such as caseins, α-lactalbumin and β-lactoglobulins. As far as we know, there are no publications using a bottom-up proteomics strategy to compare milk proteins from Holstein-Friesian and Jersey cows. In this study, bulk milk samples representing whole Holstein-Friesian and Jersey herds were analyzed using many replicates. Our results highlighted proteins that were more prominent in one breed compared to the other (Table 1). For instance, a fatty acid-binding protein was 30% more abundant in Holstein-Friesian milk than Jersey milk. This protein facilitates the transfer of fatty acids between extra- and intracellular membranes. This is may be relevant as Holstein-Friesian and Jersey milk fat content and composition differ, with Jersey milk fat containing higher concentrations of saturated fatty acids, especially of fatty acids with short and medium carbon chains (Arnould and Soyeurt, 2009). Alpha-A 2-HS-glycoprotein, also known as fetuin-A which forms soluble complexes with calcium and phosphate, was 28% more prevalent in Holstein-Friesian milk than in Jersey milk. This could be related to the fact that Holstein-Friesian milk contains less total calcium than Jersey milk (Jensen et al., 2012b). Conversely, actin 1, a globular multi-functional protein that forms microfilaments found in all eukaryotic cells, was occurring 26% more in Jersey milk than in Holstein-Friesian milk. The significance of this finding is unclear at this stage. Two proteins involved in angiogenesis and cellular protein synthesis, lactadherin and angiogenin-1, occurred more in Holstein-Friesian milk than Jersey. The prevalence of fibrinogen (alpha and beta subunits), a glycoprotein complex involved in blood clot formation, in Holstein-Friesian milk relative to Jersey remains to be further investigated. A vitamin D-binding protein and cathelicidin-4, whose levels accumulate with those of vitamin D during an infection (Liu et al., 2006), were more prominent in Holstein-Friesian milk than Jersey milk. Two serpins (serpin A3-1 and α-1-antiproteinase, also called α-1 anti-trypsin or AAT1) prevailed in Holstein-Friesian milk. These serine protease inhibitors activity protects tissues from damage caused by proteolytic enzymes; AAT1 is the most abundant serpin in human (Hunt and Tuder, 2012). The anti-microbial proteins peptidoglycan recognition protein 1 and histatherin, also known as histatin, occurred more in Holstein-Friesian than Jersey milk. Several proteins involved in the immune system underpinned breed difference: Antibodies prevailed in both milks with IgG2 isotype more prevalent in Holstein-Friesian milk. CD5L scavenger receptor protein prevailed in Jersey milk. Combined together, these findings suggest that milk varies in protein species composition and that dairy cattle breeds may have evolved different milk qualities. Many of the differences relate to immune proteins and responses. These results remain preliminary findings as the proteomic analysis was optimized using bulk milk samples which represent a whole herd. Further studies are underway to investigate the profile of these specific proteins in individual Holstein-Friesian and Jersey cows. They will shed light on genetic differences.

Conclusions

In this study, three protein extraction methods performed on bulk milk samples from Jersey and Holstein-Friesian cows were compared using protein assay, SDS-PAGE, and nLC-MS/MS analyses. All major milk proteins such as caseins were extracted along with less abundant proteins such as whey proteins (β-lactoglobulin, α-lactalbumin, lactotransferrin), as well as minor proteins such as glycoproteins, and enzymes. Method A (urea), a simple dilution of milk into an urea-based buffer, yielded the greatest number of unique protein accessions. Method B (TCA/acetone) was not as efficient as methods A (urea) and C (methanol/chloroform). Method C (methanol/chloroform) yielded the highest protein concentration, recovery rates, as well as best SDS-PAGE patterns. Such a tri-phasic partition procedure would be highly desirable for experiments assessing the inter-relationships between metabolites and protein regulation in milk such as in systems biology projects. However, for a proteomics-centric approach, method A (urea) offers advantages in low costs, simplicity, protein coverage and throughput and would be the preferred method for this type of study.

Funding

This work was funded by Department of Economic Development, Jobs, Transport, and Resources.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  35 in total

1.  Liquid chromatography/tandem mass spectrometric confirmatory method for determining aflatoxin M1 in cow milk: comparison between electrospray and atmospheric pressure photoionization sources.

Authors:  Chiara Cavaliere; Patrizia Foglia; Elisabetta Pastorini; Roberto Samperi; Aldo Laganà
Journal:  J Chromatogr A       Date:  2005-10-10       Impact factor: 4.759

2.  Toll-like receptor triggering of a vitamin D-mediated human antimicrobial response.

Authors:  Philip T Liu; Steffen Stenger; Huiying Li; Linda Wenzel; Belinda H Tan; Stephan R Krutzik; Maria Teresa Ochoa; Jürgen Schauber; Kent Wu; Christoph Meinken; Diane L Kamen; Manfred Wagner; Robert Bals; Andreas Steinmeyer; Ulrich Zügel; Richard L Gallo; David Eisenberg; Martin Hewison; Bruce W Hollis; John S Adams; Barry R Bloom; Robert L Modlin
Journal:  Science       Date:  2006-02-23       Impact factor: 47.728

3.  The occurrence of noncoagulating milk and the association of bovine milk coagulation properties with genetic variants of the caseins in 3 Scandinavian dairy breeds.

Authors:  N A Poulsen; H P Bertelsen; H B Jensen; F Gustavsson; M Glantz; H Lindmark Månsson; A Andrén; M Paulsson; C Bendixen; A J Buitenhuis; L B Larsen
Journal:  J Dairy Sci       Date:  2013-06-05       Impact factor: 4.034

4.  Effects of duodenal infusion of free α-linolenic acid on the plasma and milk proteome of lactating dairy cows.

Authors:  Y X Yang; J Q Wang; T J Yuan; D P Bu; J H Yang; P Sun; L Y Zhou
Journal:  Animal       Date:  2012-07-24       Impact factor: 3.240

5.  In-depth analysis of low abundant proteins in bovine colostrum using different fractionation techniques.

Authors:  Asger Nissen; Emøke Bendixen; Klaus Lønne Ingvartsen; Christine Maria Røntved
Journal:  Proteomics       Date:  2012-09       Impact factor: 3.984

6.  Distinct composition of bovine milk from Jersey and Holstein-Friesian cows with good, poor, or noncoagulation properties as reflected in protein genetic variants and isoforms.

Authors:  H B Jensen; N A Poulsen; K K Andersen; M Hammershøj; H D Poulsen; L B Larsen
Journal:  J Dairy Sci       Date:  2012-10-03       Impact factor: 4.034

7.  Proteomic analysis of differentially expressed proteins in bovine milk during experimentally induced Escherichia coli mastitis.

Authors:  J L Boehmer; D D Bannerman; K Shefcheck; J L Ward
Journal:  J Dairy Sci       Date:  2008-11       Impact factor: 4.034

8.  Effects of breed and casein genetic variants on protein profile in milk from Swedish Red, Danish Holstein, and Danish Jersey cows.

Authors:  F Gustavsson; A J Buitenhuis; M Johansson; H P Bertelsen; M Glantz; N A Poulsen; H Lindmark Månsson; H Stålhammar; L B Larsen; C Bendixen; M Paulsson; A Andrén
Journal:  J Dairy Sci       Date:  2014-04-03       Impact factor: 4.034

9.  Genetic variability of milk fatty acids.

Authors:  V M-R Arnould; H Soyeurt
Journal:  J Appl Genet       Date:  2009       Impact factor: 3.240

10.  Changes in Holstein cow milk and serum proteins during intramammary infection with three different strains of Staphylococcus aureus.

Authors:  Yunee Kim; Heba Atalla; Bonnie Mallard; Claude Robert; Niel Karrow
Journal:  BMC Vet Res       Date:  2011-09-01       Impact factor: 2.741

View more
  11 in total

1.  Quantitation and Identification of Intact Major Milk Proteins for High-Throughput LC-ESI-Q-TOF MS Analyses.

Authors:  Delphine Vincent; Aaron Elkins; Mark R Condina; Vilnis Ezernieks; Simone Rochfort
Journal:  PLoS One       Date:  2016-10-17       Impact factor: 3.240

2.  Milk proteome from in silico data aggregation allows the identification of putative biomarkers of negative energy balance in dairy cows.

Authors:  Mylène Delosière; José Pires; Laurence Bernard; Isabelle Cassar-Malek; Muriel Bonnet
Journal:  Sci Rep       Date:  2019-07-04       Impact factor: 4.379

3.  Top-Down Proteomics of Medicinal Cannabis.

Authors:  Delphine Vincent; Steve Binos; Simone Rochfort; German Spangenberg
Journal:  Proteomes       Date:  2019-09-24

4.  Generation of High Affinity Anti-Peptide Polyclonal Antibodies Recognizing Goat αs1-Casein.

Authors:  Aliah Zannierah Mohsin; Rashidah Sukor; Jinap Selamat; Anis Shobirin Meor Hussin; Intan Hakimah Ismail; Nuzul Noorahya Jambari; Farina Mustaffa-Kamal
Journal:  Molecules       Date:  2020-06-05       Impact factor: 4.411

5.  Optimization of Protein Extraction Method for 2DE Proteomics of Goat's Milk.

Authors:  Muzammeer Mansor; Jameel R Al-Obaidi; Nurain Nadiah Jaafar; Intan Hakimah Ismail; Atiqah Farah Zakaria; Mohd Azri Zainal Abidin; Jinap Selamat; Son Radu; Nuzul Noorahya Jambari
Journal:  Molecules       Date:  2020-06-05       Impact factor: 4.411

6.  Optimisation of Milk Protein Top-Down Sequencing Using In-Source Collision-Induced Dissociation in the Maxis Quadrupole Time-of-Flight Mass Spectrometer.

Authors:  Delphine Vincent; Dominik Mertens; Simone Rochfort
Journal:  Molecules       Date:  2018-10-26       Impact factor: 4.411

7.  A Multiple Protease Strategy to Optimise the Shotgun Proteomics of Mature Medicinal Cannabis Buds.

Authors:  Delphine Vincent; Vilnis Ezernieks; Simone Rochfort; German Spangenberg
Journal:  Int J Mol Sci       Date:  2019-11-11       Impact factor: 5.923

8.  Detection of milk powder in liquid whole milk using hydrolyzed peptide and intact protein mass spectral fingerprints coupled with data fusion technologies.

Authors:  Lijuan Du; Weiying Lu; Yaqiong Zhang; Boyan Gao; Liangli Yu
Journal:  Food Sci Nutr       Date:  2020-02-03       Impact factor: 2.863

9.  Mining the Wheat Grain Proteome.

Authors:  Delphine Vincent; AnhDuyen Bui; Doris Ram; Vilnis Ezernieks; Frank Bedon; Joe Panozzo; Pankaj Maharjan; Simone Rochfort; Hans Daetwyler; Matthew Hayden
Journal:  Int J Mol Sci       Date:  2022-01-10       Impact factor: 5.923

10.  Pudding Proteomics: Cyclomaltodextrin Glucanotransferase and Microbial Proteases Can Liquefy Extended Shelf Life Dairy Products.

Authors:  Kristina J H Kleinwort; Maria Weigand; Lydia Hoffmann; Roxane L Degroote; Richard Dietrich; Erwin Märtlbauer; Stefanie M Hauck; Cornelia A Deeg
Journal:  Metabolites       Date:  2022-03-17
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.