Literature DB >> 34957696

Quantification of FAM20A in human milk and identification of calcium metabolism proteins.

Vaksha Patel1, Enriko Klootwijk1, Gail Whiting2, Detlef Bockenhauer1, Keith Siew1, Stephen Walsh1, Markus Bleich3, Nina Himmerkus3, Graciana Jaureguiberry1, Naomi Issler1, Jasminka Godovac-Zimmermann1, Robert Kleta1, Jun Wheeler2.   

Abstract

BACKGROUND: FAM20A, a recently discovered protein, is thought to have a fundamental role in inhibiting ectopic calcification. Several studies have demonstrated that variants of FAM20A are causative for the rare autosomal recessive disorder, enamel-renal syndrome (ERS). ERS is characterized by defective mineralization of dental enamel and nephrocalcinosis suggesting that FAM20A is an extracellular matrix protein, dysfunction of which causes calcification of the secretory epithelial tissues. FAM20A is a low-abundant protein that is difficult to detect in biofluids such as blood, saliva, and urine. Thus, we speculated the abundance of FAM20A to be high in human milk, since the secretory epithelium of lactating mammary tissue is involved in the secretion of highly concentrated calcium. Therefore, the primary aim of this research is to describe the processes/methodology taken to quantify FAM20A in human milk and identify other proteins involved in calcium metabolism.
METHOD: This study used mass spectrometry-driven quantitative proteomics: (1) to quantify FAM20A in human milk of three women and (2) to identify proteins associated with calcium regulation by bioinformatic analyses on whole and milk fat globule membrane fractions.
RESULTS: Shotgun MS/MS driven proteomics identified FAM20A in whole milk, and subsequent analysis using targeted proteomics also successfully quantified FAM20A in all samples. Combination of sample preparation, fractionation, and LC-MS/MS proteomics analysis generated 136 proteins previously undiscovered in human milk; 21 of these appear to be associated with calcium metabolism.
CONCLUSION: Using mass spectrometry-driven proteomics, we successfully quantified FAM20A from transitional to mature milk and obtained a list of proteins involved in calcium metabolism. Furthermore, we show the value of using a combination of both shotgun and targeted driven proteomics for the identification of this low abundant protein in human milk.
© 2021 The Authors. Physiological Reports published by Wiley Periodicals LLC on behalf of The Physiological Society and the American Physiological Society.

Entities:  

Keywords:  FAM20A and human milk; calcium metabolism; enamel renal syndrome

Mesh:

Substances:

Year:  2021        PMID: 34957696      PMCID: PMC8711012          DOI: 10.14814/phy2.15150

Source DB:  PubMed          Journal:  Physiol Rep        ISSN: 2051-817X


INTRODUCTION

Human milk is produced by mammary epithelial cells (MEC). The milk produced is comprised of three fractions: the MEC, whey, and milk fat globule membrane (MFGM). All three fractions contain an array of proteins (including immunoglobulins), lipids, carbohydrates, minerals, and bioactive compounds to provide the necessary nutrients and immunity for the developing infant. Calcium is one of the major minerals that is transported through the MEC during milk production. The amount of calcium ion (Ca2+) passing through these MEC during lactation ranges from 200 to 400 mg per day (Kovacs, 2016; Prentice, 2003). During lactation, there is an increase in Ca2+ reabsorption via the renal tubular epithelium (Beggs et al., 2017; Ritchie et al., 1998) and in bone demineralization (Kent et al., 1993) with intestinal calcium absorption remaining normal (Kovacs et al., 2000). Furthermore, total serum calcium concentrations are maintained at ~2.4 mM (Kovacs, 2016) and do not differ between lactating, pregnant, or non‐pregnant persons. At physiological serum Ca2+ concentrations, nearly half is protein bound (predominantly to albumin) to prevent precipitation. However, the total calcium concentration in human milk is much higher (5–10 mM) (Mahdavi et al., 2015; Perrin et al., 2017; Suzuki et al., 1991) suggesting that calcium buffering would be essential in lactating MEC. To date, several proteins have been associated with handling Ca2+ throughput in the secretory epithelial tissue of the kidney (Hwang et al., 2014; Muto et al., 2010; Zhang & O'Neil, 1996) and breast tissue during lactation (Cross et al., 2013; Reinhardt et al., 2004; Ross et al., 2013; Shennan & Peaker, 2000; Vanhouten & Wysolmerski, 2013). Of particular interest is FAM20A, a member of the Golgi‐Associated Kinase family (Nalbant et al., 2005; Zhang et al., 2018); other members include FAM20B (Koike et al., 2009) and FAM20C (Vogel et al., 2012). FAM20A enhances phosphorylation of extracellular matrix proteins through dimerization with FAM20C (Cui et al., 2015; Tagliabracci, 2015; Tagliabracci et al., 2012). Notably, mutations in FAM20A results in enamel‐renal syndrome (ERS) (Jaureguiberry et al., 2012; Kantaputra et al., 2014; Kantaputra et al., 2014; Wang et al., 2013, 2014), a condition characterized by defective mineralization of dental enamel (amelogenesis imperfecta) and calcification within renal tissues (nephrocalcinosis) (Jaureguiberry et al., 2012; Kantaputra, Bongkochwilawan, et al., 2014; Wang et al., 2013, 2014). Histopathological evaluation of FAM20A knockout mice reveal calcification in the renal arteries, the choroid plexus of the brain, the junction between the retina and the ciliary body of the eyes, as well as tissues of the lungs (Vogel et al., 2012). Taken together, these data suggest a significant role for FAM20A in regulating calcium handling within epithelia, and we can assume that FAM20A has a preventive role in ectopic calcification based on associations rather than a direct causative evidence. Furthermore, a recent publication has identified FAM20A protein to be present in lactating but absent in non‐lactating mammary glands (Cui et al., 2015), this suggested a potentially important role for FAM20A in lactation. This is further supported by data from the gene annotation portal, BioGPS (http://biogps.org) which shows mRNA expression of FAM20A to be the highest in the lactating mammary gland compared to any other tissues in mouse. Therefore, to investigate this further we aimed to quantify FAM20A in human milk and identify other proteins involved in calcium metabolism. There have been several shotgun proteomics studies performed mainly on either the whey or MFGM fraction of human milk (Gao et al., 2012; Hettinga et al., 2011; Liao et al., 2011a, 2011b; Palmer et al., 2006; Picariello et al., 2012). Advances in relative quantitative multiplex proteomics techniques of two‐dimensional difference gel electrophoresis (2D‐DIGE) (Molinari et al., 2012; Unlü et al., 1997), tandem mass tags (TMT) (Thompson et al., 2003; Zhang et al., 2013), isobaric tags for relative and absolute quantification (iTRAQ), (Mange et al., 2013; Molinari et al., 2012; Ross et al., 2004; Yang et al., 2015) and dimethyl labeling (Zhang et al., 2016) have improved the ability to obtain a quantitative inventory of differentially expressed proteins across the various stages of milk postpartum. The concentrations of some components change significantly over the three distinct stages of milk postpartum; colostrum (first few days), transitional (up to 3 weeks), and mature (up to 52 weeks plus) milk (Lonnerdal et al., 1987). Previous studies have identified FAM20A (Gao et al., 2012; Mange et al., 2013; Zhang et al., 2013) in human milk but to date, there has been no quantification data on this protein. In biological fluids such as blood, human milk, and CSF where the dynamic range of protein concentration is over 10 orders of magnitude, low abundant proteins are often missed when using mass spectrometry, due to competition with high abundant proteins. Furthermore, where low abundant proteins have been identified through a traditional shotgun proteomic experiment, identification of these low abundant proteins can easily be missed when performing a shotgun quantitative experiment. This is due to the inherent limitation of mass spectrometry. Therefore, this manuscript describes a comprehensive procedure for the identification and quantification of FAM20A from transitional to mature milk and to obtain a list of proteins potentially involved in calcium metabolism.

METHODS AND MATERIALS

Sample Collection

Human milk samples were donated by three healthy lactating volunteers. The first individual provided milk at 2 weeks, the second at 24 weeks, and the third at 44 weeks postpartum. To assess whether an absence of protease inhibitors (PI) in the samples influenced the identification and quantification of FAM20A, each of the three samples was divided into two aliquots. The first had PI (cOmpleteTM Mini EDTA‐free Protease Inhibitor Cocktail, Roche) added to it (one tablet of PI was dissolved in 10 ml of human milk) the second did not. Samples collected at 2 and 44 weeks were processed immediately, and the sample collected at 24 weeks was immediately stored in a domestic freezer prior to it being processed. A summary of the experimental design including sample processing is illustrated in Figure 1
FIGURE 1

Summary of experimental design and sample processing. Whole milk and MFGM from three human milk samples (sample 1 [S1], sample 2 [S2], and sample 3 [S3]) with and without protease inhibitor were digested with trypsin. Whole milk samples were labeled with TMT sixplex and the MFGM fractions were not labeled. Either the labeled and unlabeled samples were pooled and fractionated by high pH reverse phase chromatography. Proteomic analysis on whole milk was performed using CID MS/MS and HCD MS/MS. Proteomic analysis was MFGM fraction was performed using CID MS/MS

Summary of experimental design and sample processing. Whole milk and MFGM from three human milk samples (sample 1 [S1], sample 2 [S2], and sample 3 [S3]) with and without protease inhibitor were digested with trypsin. Whole milk samples were labeled with TMT sixplex and the MFGM fractions were not labeled. Either the labeled and unlabeled samples were pooled and fractionated by high pH reverse phase chromatography. Proteomic analysis on whole milk was performed using CID MS/MS and HCD MS/MS. Proteomic analysis was MFGM fraction was performed using CID MS/MS

Protein extraction

For whole milk, protein extraction was achieved by thoroughly mixing each of the six samples (with and without PI) with equal volumes of 1% (w/v) sodium dodecyl sulfate (SDS). The samples were centrifuged for 10 min at 10,000 g at 14°C. Protein estimation was performed on the supernatants using the Bradford protein assay. All samples were diluted to a final total protein concentration of 1 µg/µl in 200 mM triethylammonium bicarbonate. The final percentage of SDS in all samples were below 0.25% (w/v). Extraction of proteins from the MFGM was performed as described by Fortunato et al. (2003) with slight modification. In short, human milk was centrifuged for 10 min at 10,000 g; the fat layers were carefully removed and washed three times with phosphate buffer saline (by centrifugation at 20,000 g for 5 min at 14°C) to remove any water‐soluble proteins. The MFGM was then mixed with equal amounts of 1% SDS (w/v) and incubated at room temperature overnight on a rotator. Samples were then centrifuged for 30 min at 20,000 g at 14°C. The proteins in each of these solutions were precipitated using acetone, where the final percentage of acetone within each of the sample was 80% (v/v). The samples were left to incubate at −20°C for one hour prior to centrifugation for 10 min at 20,000 g. Acetone was removed. Once all the pellets were air‐dried, samples were suspended to the original same volume with water. Protein estimation using the Bradford method was performed on all samples.

Tryptic digestion and TMT labeling

Equal amounts of proteins extracted from human milk of the three control samples (2, 24, and 44 weeks postpartum) with and without PI were taken. The six samples were individually reduced with tris (2‐carboxyethyl) phosphine, alkylated with iodoacetic acid, and precipitated with ice‐cold acetone. The resulting protein pellets were digested with trypsin (Promega). Two and a half micrograms for trypsin was used to digest 100 µg of total protein at 37°C overnight before labeling with either the TMT of 126, 127, 128, 129, 130, or 131 (Thermo Fisher Scientific) according to the manufacturer's instruction and then mixing into a single “sixplex” sample. Due to low and variable protein concentration in the MFGM samples (0.3–5.4 μg/μl) across sets of samples, tryptic digest was performed on the pooled samples without TMT labeling. The rationale for pooling was to see whether FAM20A could be identified. The amount of protein extracted from the MFGM fraction in one sample was below the optimal conditions (<25 µg) required for TMT labeling. The varying protein content extracted from MFGM samples may be due to the small volume of sample processed and protein extraction efficiency being below optimal conditions.

First dimensional high pH reversed‐phase high‐performance liquid chromatography (RP‐HPLC) separation for peptide mixture

For both the whole milk (TMT labeled) and MFGM (non‐labeled), an aliquot of the samples was subjected to high pH RP‐HPLC separation on a XBridge C18 column (5 µm, 4.6 mm id, and 25 cm long, from Waters) using a linear gradient consisting of mobile phase A (10 mM ammonium formate, pH 10.0) and up to 70% B (90% acetonitrile in A) for 2 h at a flow rate of 0.5 ml/min on a Jasco HPLC system (Jasco). 18 fractions with the retention time of 10–110 min were collected and concatenated.

Nano‐LC and tandem mass spectrometry for protein identification and quantification

The dried high pH RP‐HPLC fractions were resuspended in 0.1% (v/v) formic acid. Online nano‐LC and tandem mass spectrometry (MS/MS) were performed on a LTQ‐Orbitrap mass spectrometer (Thermo Fisher Scientific) coupled with U3000 direct nanosystem (Thermo Fisher Scientific), using a PepMap C18 RP nano column (3 µm, 100 Å, 50 cm long) under a column flow rate of 0.3 µl/min and a gradient of 5%–25% for 180 min, 25%–32% for 20 min, and 32%–90% for 10 min of a solution of 95% (v/v) acetonitrile and 0.1% (v/v) formic acid. In the case of protein identification only, five events of collision‐induced dissociation fragmentation (CID MS/MS) were used for generating spectra for peptide sequencing with data‐dependent acquisition and dynamic exclusion mode enabled. However, in the case of protein identification and quantification, three events of high‐energy collisional dissociation (HCD) were followed by three events of CID to generate spectra both for peptide sequencing and relative quantification via report ions, that is, the TMT tags (HCD MS/MS). Targeted MS/MS was based on data‐dependent acquisition using the top five most intense peaks and dynamic exclusion enabled. The mass of the peptide of interest was used in the inclusion list so that the mass detector was scanning and looking for the parent ion followed by CID MS/MS.

MS/MS Data analysis

Thermo Proteome Discovery version 1.2 data analysis software was used for mass spectra processing, database searching, and quantification by in silico matching of the MS/MS spectra against the UniProtKB human Swiss‐Prot database (https://www.uniprot.org/uniprot/?query=HUMAN&sort=score). Spectra from the 18 fractions were added together as one sample during searching. Initial mass tolerances by MS were set to 10 ppm and up to two missed tryptic cleavages were included. Methionine oxidation was set as dynamic modification whereas carboxymethylation on cysteine and TMTsixplex labels on N‐terminus and lysine side chain were set as static modifications. Protein with two minimal peptides at rank 1 with high confidence was unambiguously identified. Only data generated from the TMTsixplex reporter ion spectra from the peptides that could be uniquely assigned to a given protein were used for relative quantification. TMT reporter ions were detected under HCD mode, and their peak areas were calculated to indicate their intensities. The percentage abundance was calculated by normalizing the peak area of the individual protein to the sum of peak areas of all proteins identified for each MS/MS experiments.

MS/MS data comparison

The proteins identified from whole milk and MFGM fractions were combined and this list was compared to the entire human milk proteome as mapped out recently by several studies prior to August 2020 (Herwijnen et al., 2016; Lu et al., 2018; Ma et al., 2019; Yang et al., 2017; Zhu et al., 2019). One of the earlier studies by van Herwijnen et al. (2016) aggregated 38 additional studies. The rationale for conducting such a comparison was to investigate whether our alternative approach of a combination of sample preparation and mass spectrometry strategy identified proteins previously undiscovered proteins in human milk and to see whether any of these may be involved in the regulation of calcium. To determine how the proteins are involved in calcium regulation, a functional classification analysis was performed. This involved extracting GO terms related to relevant keywords (i.e., calcium) with the intent to identify proteins that may be involved in calcium regulation. Furthermore, since gene names (GN) were used as protein identifiers in the van Herwijnen et al. (2016) publication, all proteins identified in our investigations were therefore converted to their GN for ease of comparison. Any anomalies which included proteins having different accession numbers but sharing the same associated GN, and proteins that do not have a GN assigned, were removed and not included in the final list of comparison.

Bioinformatic analyses

We have chosen two web‐based platforms: STRING 10.5 (Search Tool for the Retrieval of Interacting Genes, http://string‐db.org/cgi/input.pl) and QuickGO (https://www.ebi.ac.uk/QuickGO/) to demonstrate protein‐to‐protein interaction and functional classification, respectively. Both these platforms were selected for their ease of use and having been peer reviewed.

Interaction of FAM20A with other proteins

Network analysis was performed using STRING to identify FAM20A interaction with other proteins. The criteria used were of high confidence interaction score of 0.7, together with interaction evidence based on experiments, co‐occurrence, databases, neighborhood, text mining, fusion, and co‐expression all selected. In addition, not more than 10 interactors were shown for the first shell (direct interaction of proteins to FAM20A).

Functional classification

Functional classification analysis was performed using QuickGO. This software package was used to extract all available GO terms that are relevant to the keyword “calcium.” The software was also used to identify which of the proteins identified in this investigation were associated with the GO term calcium. QuickGO is a browser‐based application for GO terms and annotations, which contains information for 45919 GO terms, as defined in the version of the GO OWL definition file, accessed on April 20, 2016. (Binns et al., 2009) Only protein with UniProt IDs could be used as input within the QuickGO database. Proteins within our dataset that are associated with GO calcium term were then extracted using the following settings: molecular function, cellular components, and biological processes enabled, Taxon was set at Homo sapiens (9606) enabled, all reference selected, and all databases publicly available at the time of analysis. The output GNs were then matched with the original input list to highlight any anomalies.

RESULTS

Identification and quantification of FAM20A in whole milk

In this investigation, three fragmentation experiments were employed. FAM20A was identified in whole milk from fractions 16 and 18 from the shotgun CID MS/MS fragmentation experiment. Identity was based on two peptide fragments of FAM20A; one peptide (HNAEIAAFHLDR) is also found in FAM20B and FAM20C and the other (EILEVTK) was unique to FAM20A (Figure 2). The mass of 645 shown in Figure 2 is the doubly charged species of 1289.086. Ions were selected based on their m/z value, where m = mass and z = charge, and therefore the detected peak is 645 (1289.086/2).
FIGURE 2

MS/MS spectra of the identified peptides for FAM20A. The (a) EILEVTK (c) HNAEIAAFHLDR peptide spectra were obtained from fractions 18–16 of human milk, respectively. The (b) and (d) depicts the raw masses (red are b ions and blue are y ions) detected and identified EILEVTK and HNAEIAAFHLDR peptide, respectively

MS/MS spectra of the identified peptides for FAM20A. The (a) EILEVTK (c) HNAEIAAFHLDR peptide spectra were obtained from fractions 18–16 of human milk, respectively. The (b) and (d) depicts the raw masses (red are b ions and blue are y ions) detected and identified EILEVTK and HNAEIAAFHLDR peptide, respectively

Targeted proteomics: Relative quantification of FAM20A in human milk between the three samples with and without PI

Relative quantification using shotgun HCD MS/MS fragmentation experiment did not identify FAM20A. Therefore, a targeted HCD MS/MS fragmentation experiment was employed. This was achieved by selecting the doubly charged m/z ion (1289.80681) from the full MS scan for fragmentation. This doubly charged m/z ion was selected on basis of the EILEVTK ion being identified and sequenced using a shotgun CID MS/MS fragmentation experiment. In the targeted approach the peptide was sequenced 43 times and quantified four times in each sample. The TMTsixplex reporter ion spectra generated from this doubly charged ion was used to report the relative abundance of the unique FAM20A peptide (EILEVTK) between the three samples with and without PI (Figure 3).
FIGURE 3

Relative quantification of the unique FAM20A peptide (EILEVTK). The unique peptide, EILEVTK for FAM20A was sequenced 43 times and quantified four times using a targeted method based on a doubly charged m/z ion, 1289.80681. The reporter ions of the TMTsixplex derived from HCD MS/MS of this peptide were used to obtain relative quantification of EILEVTK four times (n = 4; technical replicates) in each of the six samples multiplexed. A Student's t‐test was performed on the intensity of the EILEVTK peptide identified in milk with and without PI for each individual. Error bars are mean ± SEM. The p‐value is based on a Student's t‐test. **p < 0.001

Relative quantification of the unique FAM20A peptide (EILEVTK). The unique peptide, EILEVTK for FAM20A was sequenced 43 times and quantified four times using a targeted method based on a doubly charged m/z ion, 1289.80681. The reporter ions of the TMTsixplex derived from HCD MS/MS of this peptide were used to obtain relative quantification of EILEVTK four times (n = 4; technical replicates) in each of the six samples multiplexed. A Student's t‐test was performed on the intensity of the EILEVTK peptide identified in milk with and without PI for each individual. Error bars are mean ± SEM. The p‐value is based on a Student's t‐test. **p < 0.001 A comparison was performed for each individual with and without PI using a Student's t‐test. Figure 3 shows the mean ± standard error of mean (SEM) of the relative quantification of the reporter ion loss (n = 4; technical replicates per measured sample) from EILEVTK. Little difference in the relative quantification of FAM20A peptide (EILEVTK) was observed when comparing samples with and without PI from each individual sample; further, signal detected from this peptide was similar between the first and third individuals. There was a marked decrease in the observed EILEVTK signal in the second sample when compared to the first and third samples. This second sample had been stored in a domestic freezer prior to processing. Whereas samples from individuals 1 and 3 were processed within 30 min of collection. PI was added to one aliquot prior to sample 2 being thawed. There was a significant (Student's t‐test p < 0.001) increase in the level of the EILEVTK peptide in sample 2 with PI when compared to the same sample without PI. The total protein content was similar in sample 2 with (5.2 µg/µl) and without (4.5 µg/µl) PI.

FAM20A abundance in human milk

To determine the relative abundance of FAM20A in the pooled sample of whole milk, a plot based on the relative abundance and concentration of the four major milk proteins, serum albumin, lysozyme, lactotransferrin, and lactoalbumin as determined by Fast Protein Liquid Chromatography (Hambraeus et al., 1978) was generated. The relative abundance of these proteins was calculated and expressed as a percentage of the total protein content of milk, assuming the total protein content of milk was 1.1 g/100 ml (Lynch et al., 2004). Since there would always be a linear relationship between the relative abundance (expressed as a percentage) of a protein (be it highly abundant or low abundant) and the total protein concentration, we compared our dataset with Hambraeus et al. (1978). The percentage abundance of the stated four major proteins obtained from our CID MS/MS and HCD MS/MS fragmentation experiment was calculated by normalizing the peak area of the individual protein to the sum of peak areas of all proteins identified for each MS/MS fragmentation experiments. However, one limitation of this approach is the under‐representation of the whole milk proteome, therefore the percentage abundance for any given protein identified maybe slightly over‐represented. Nevertheless, the results show that the concentrations of these four major proteins from both our MS/MS datasets were similar to that determined by Hambraeus et al. (1978). Given mass spectrometry is not inherently quantitative because of differences in the ionization efficiency, we were able to perform a comparative analysis on our shotgun CID MS/MS fragmentation experiment to extrapolate and speculate the calculated concentration of 1.6 µM using it as a relative quantification. (Figure 4).
FIGURE 4

Relative abundance of FAM20A in human milk. Concentration of FAM20A was extrapolated by comparing the relative abundance of major proteins: serum albumin, lysozyme, lactotransferrin, and alpha‐lactalbumin in our human milk sample from both the CID MS/MS and HCD MS/MS dataset with that of the known concentration of the same proteins as determined by Hambraeus et al. (1978). The data shown represents the combined peak area for each protein obtained from the three individual human milk samples with and without PI

Relative abundance of FAM20A in human milk. Concentration of FAM20A was extrapolated by comparing the relative abundance of major proteins: serum albumin, lysozyme, lactotransferrin, and alpha‐lactalbumin in our human milk sample from both the CID MS/MS and HCD MS/MS dataset with that of the known concentration of the same proteins as determined by Hambraeus et al. (1978). The data shown represents the combined peak area for each protein obtained from the three individual human milk samples with and without PI

Proteomics profiling and identification of calcium metabolism proteins in human milk

Figure 5 illustrates the workflow on the data generated for proteomic profiling and the processes involved in the identification of proteins involved in calcium metabolism in human milk.
FIGURE 5

Flow chart on the data generated and how the data were processed. The output data from the fragmentation experiments (CID MS/MS and HCD MS/MS) for whole milk and MFGM fractions were combined and a total of 1391 proteins having accession numbers were identified. The gene names for these 1391 proteins were extracted. Thirty‐one proteins were removed for having either duplicate gene names or no gene names. Comparison on the remaining 1360 proteins with the 2529 previous milk proteome identified 136 previously undiscovered proteins in human milk. The 1391 proteins with accession numbers were analyzed in the QuickGO application to determine which proteins are associated with calcium GO term, the resulting output of 462 proteins was given as gene names. Comparison of these 462 gene names with the gene names extracted from the original input list (1391) gave only 201 matches. Final comparison between these 201 proteins and the list of 136 previously undiscovered human milk proteins identified 21 calcium metabolism proteins that have never been identified in human milk

Flow chart on the data generated and how the data were processed. The output data from the fragmentation experiments (CID MS/MS and HCD MS/MS) for whole milk and MFGM fractions were combined and a total of 1391 proteins having accession numbers were identified. The gene names for these 1391 proteins were extracted. Thirty‐one proteins were removed for having either duplicate gene names or no gene names. Comparison on the remaining 1360 proteins with the 2529 previous milk proteome identified 136 previously undiscovered proteins in human milk. The 1391 proteins with accession numbers were analyzed in the QuickGO application to determine which proteins are associated with calcium GO term, the resulting output of 462 proteins was given as gene names. Comparison of these 462 gene names with the gene names extracted from the original input list (1391) gave only 201 matches. Final comparison between these 201 proteins and the list of 136 previously undiscovered human milk proteins identified 21 calcium metabolism proteins that have never been identified in human milk Shotgun proteomics identified a total of 1391 proteins from both whole milk and MFGM datasets using CID MS/MS and HCD MS/MS fragmentation strategy Data File S1). In the final list of comparisons, 31 proteins were excluded on the basis of having the same GN but different accession numbers, or if they were without an assigned GN. Subsequently, the remaining 1360 GN were then compared to a previous publication (Herwijnen et al., 2016; Lu et al., 2018; Ma et al., 2019; Yang et al., 2017; Zhu et al., 2019) that lists the entire human milk proteome to date. In fact, this study, which uses an alternative approach of 1% SDS for protein extraction coupled with high pH RP‐HPLC fractionation and nano‐LC‐MS/MS identified 23 and 113 newly discovered proteins in whole milk and MFGM, respectively (Table 1).
TABLE 1

Newly discovered proteins in human milk

Milk fractionMS/MSGene nameAccession #DescriptionEntry name
MFGMCIDABCA13Q86UQ4ATP‐binding cassette sub‐family A member 13[ABCAD_HUMAN]
MFGMCIDABCD1P33897ATP‐binding cassette sub‐family D member 1[ABCD1_HUMAN]
MFGMCIDACBD5Q5T8D3Acyl‐CoA‐binding domain‐containing protein 5[ACBD5_HUMAN]
MFGM CID ADGRE2 Q9UHX3 Adhesion G protein‐coupled receptor E2 [AGRE2_HUMAN]
MFGMCIDADGRF1Q5T601Adhesion G‐protein coupled receptor F1[AGRF1_HUMAN]
MFGMCIDADGRG2Q8IZP9Adhesion G‐protein coupled receptor G2[AGRG2_HUMAN]
MFGMCIDAGFG1P52594Arf‐GAP domain and FG repeat‐containing protein 1[AGFG1_HUMAN]
MFGMCIDARFGAP1Q8N6T3ADP‐ribosylation factor GTPase‐activating protein 1[ARFG1_HUMAN]
MFGMCIDARFGAP2Q8N6H7ADP‐ribosylation factor GTPase‐activating protein 2[ARFG2_HUMAN]
MFGMCIDARFGAP3Q9NP61ADP‐ribosylation factor GTPase‐activating protein 3[ARFG3_HUMAN]
MFGMCIDATP5IP56385ATP synthase subunit e, mitochondrial[ATP5I_HUMAN]
MFGMCIDATP5JP18859ATP synthase‐coupling factor 6, mitochondrial[ATP5J_HUMAN]
MFGMCIDATXN2LQ8WWM7Ataxin‐2‐like protein[ATX2L_HUMAN]
MFGMCIDBCAT2O15382Branched‐chain‐amino‐acid aminotransferase, mitochondrial[BCAT2_HUMAN]
MFGM CID BLOC1S1 P78537 Biogenesis of lysosome‐related organelles complex 1 subunit 1 [BL1S1_HUMAN]
MFGM CID BLOC1S2 Q6QNY1 Biogenesis of lysosome‐related organelles complex 1 subunit 2 [BL1S2_HUMAN]
MFGM CID BLOC1S3 Q6QNY0 Biogenesis of lysosome‐related organelles complex 1 subunit 3 [BL1S3_HUMAN]
MFGMCIDBLOC1S5Q8TDH9Biogenesis of lysosome‐related organelles complex 1 subunit 5[BL1S5_HUMAN]
MFGMCIDC19orf25Q9UFG5UPF0449 protein C19orf25[CS025_HUMAN]
MFGMCIDC1QBPQ07021Complement component 1 Q subcomponent‐binding protein, mitochondrial[C1QBP_HUMAN]
MFGMCIDC2orf88Q9BSF0Small membrane A‐kinase anchor protein[SMAKA_HUMAN]
MFGMCIDC4orf32Q8N8J7Uncharacterized protein C4orf32[CD032_HUMAN]
MFGMCIDCALCBP10092Calcitonin gene‐related peptide 2[CALCB_HUMAN]
MFGMCIDCASP1P29466Caspase‐1[CASP1_HUMAN]
MFGMCIDCCDC124Q96CT7Coiled‐coil domain‐containing protein 124[CC124_HUMAN]
MFGMCIDCCDC134Q9H6E4Coiled‐coil domain‐containing protein 134[CC134_HUMAN]
MFGMCIDCD320Q9NPF0CD320 antigen[CD320_HUMAN]
MFGMCIDCDK17Q00537Cyclin‐dependent kinase 17[CDK17_HUMAN]
MFGMCIDCDNFQ49AH0Cerebral dopamine neurotrophic factor[CDNF_HUMAN]
MFGMCIDCHCHD3Q9NX63MICOS complex subunit MIC19[MIC19_HUMAN]
MFGMCIDCNPY3Q9BT09Protein canopy homolog 3[CNPY3_HUMAN]
MFGMCIDCNPY4Q8N129Protein canopy homolog 4[CNPY4_HUMAN]
MFGMCIDCOX5AP20674Cytochrome c oxidase subunit 5A, mitochondrial[COX5A_HUMAN]
MFGMCIDCOX5BP10606Cytochrome c oxidase subunit 5B, mitochondrial[COX5B_HUMAN]
MFGMCIDCOX6B1P14854Cytochrome c oxidase subunit 6B1[CX6B1_HUMAN]
MFGMCIDCRADDP78560Death domain‐containing protein CRADD[CRADD_HUMAN]
MFGMCIDCTAGE5O15320cTAGE family member 5[CTGE5_HUMAN]
MFGMCIDDDX19AQ9NUU7ATP‐dependent RNA helicase DDX19A[DD19A_HUMAN]
MFGMCIDDIABLOQ9NR28Diablo homolog, mitochondrial[DBLOH_HUMAN]
MFGMCIDDNAJC1Q96KC8DnaJ homolog subfamily C member 1[DNJC1_HUMAN]
MFGM CID DTNBP1 Q96EV8 Dysbindin [DTBP1_HUMAN]
MFGMCIDDYNC1LI1Q9Y6G9Cytoplasmic dynein 1 light intermediate chain 1[DC1L1_HUMAN]
MFGMCIDDYNLRB1Q9NP97Dynein light chain roadblock‐type 1[DLRB1_HUMAN]
MFGMCIDEIF2AK2P19525Interferon‐induced, double‐stranded RNA‐activated protein kinase[E2AK2_HUMAN]
MFGMCIDEIF5BO60841Eukaryotic translation initiation factor 5B[IF2P_HUMAN]
MFGMCIDEMC1Q8N766ER membrane protein complex subunit 1[EMC1_HUMAN]
MFGM CID EPS15 P42566 Epidermal growth factor receptor substrate 15 [EPS15_HUMAN]
MFGM CID EPS15L1 Q9UBC2 Epidermal growth factor receptor substrate 15‐like 1 [EP15R_HUMAN]
MFGMCIDERICH5Q6P6B1Glutamate‐rich protein 5[ERIC5_HUMAN]
MFGM CID ERO1A Q96HE7 ERO1‐like protein alpha [ERO1A_HUMAN]
MFGMCIDF5P12259Coagulation factor V[FA5_HUMAN]
MFGM CID F7 P08709 Coagulation factor VII [FA7_HUMAN]
MFGM CID F9 P00740 Coagulation factor IX [FA9_HUMAN]
MFGMCIDFAM177A1Q8N128Protein FAM177A1[F177A_HUMAN]
MFGMCIDFAM213AQ9BRX8Redox‐regulatory protein FAM213A[F213A_HUMAN]
MFGM CID FKBP7 Q9Y680 Peptidyl‐prolyl cis‐trans isomerase FKBP7 [FKBP7_HUMAN]
MFGMCIDGNG10P50151Guanine nucleotide‐binding protein G(I)/G(S)/G(O) subunit gamma‐10[GBG10_HUMAN]
MFGMCIDGPAT4Q86UL3Glycerol‐3‐phosphate acyltransferase 4[GPAT4_HUMAN]
MFGMCIDHNRNPDLO14979Heterogeneous nuclear ribonucleoprotein D‐like[HNRDL_HUMAN]
MFGMCIDHOOK3Q86VS8Protein Hook homolog 3[HOOK3_HUMAN]
MFGMCIDHSBP1O75506Heat shock factor‐binding protein 1[HSBP1_HUMAN]
MFGM CID IMMT Q16891 MICOS complex subunit MIC60 [MIC60_HUMAN]
MFGMCIDINPPL1O15357Phosphatidylinositol 3,4,5‐trisphosphate 5‐phosphatase 2[SHIP2_HUMAN]
MFGMCIDKLC3Q6P597Kinesin light chain 3[KLC3_HUMAN]
MFGMCIDLAMTOR5O43504Regulator complex protein LAMTOR5[LTOR5_HUMAN]
MFGMCIDLNPQ9C0E8Protein lunapark[LNP_HUMAN]
MFGMCIDLRFN1Q9P244Leucine‐rich repeat and fibronectin type III domain‐containing protein 1[LRFN1_HUMAN]
MFGMCIDMRPL12P5281539S ribosomal protein L12, mitochondrial[RM12_HUMAN]
MFGMCIDMSLNQ13421Mesothelin[MSLN_HUMAN]
MFGMCIDMYO18AQ92614Unconventional myosin‐XVIIIa[MY18A_HUMAN]
MFGMCIDNDUFA5Q16718NADH dehydrogenase [ubiquinone] 1 alpha subcomplex subunit 5[NDUA5_HUMAN]
MFGMCIDOXR1Q8N573Oxidation resistance protein 1[OXR1_HUMAN]
MFGMCIDPBXIP1Q96AQ6Pre‐B‐cell leukaemia transcription factor‐interacting protein 1[PBIP1_HUMAN]
MFGMCIDPCSK1NQ9UHG2ProSAAS[PCSK1_HUMAN]
MFGMCIDPDRG1Q9NUG6p53 and DNA damage‐regulated protein 1[PDRG1_HUMAN]
MFGMCIDPEX14O75381Peroxisomal membrane protein PEX14[PEX14_HUMAN]
MFGMCIDPEX19P40855Peroxisomal biogenesis factor 19[PEX19_HUMAN]
MFGMCIDPLD2O14939Phospholipase D2[PLD2_HUMAN]
MFGM CID PRKCA P17252 Protein kinase C alpha type [KPCA_HUMAN]
MFGM CID PROZ P22891 Vitamin K‐dependent protein Z [PROZ_HUMAN]
MFGMCIDPXNP49023Paxillin[PAXI_HUMAN]
MFGMCIDPYM1Q9BRP8Partner of Y14 and mago[PYM1_HUMAN]
MFGMCIDRAB11FIP1Q6WKZ4Rab11 family‐interacting protein 1[RFIP1_HUMAN]
MFGM CID RAB3GAP1 Q15042 Rab3 GTPase‐activating protein catalytic subunit [RB3GP_HUMAN]
MFGMCIDRABL6Q3YEC7Rab‐like protein 6[RABL6_HUMAN]
MFGM CID RCN2 Q14257 Reticulocalbin‐2 [RCN2_HUMAN]
MFGMCIDSARGQ9BW04Specifically, androgen‐regulated gene protein[SARG_HUMAN]
MFGMCIDSCYL1Q96KG9N‐terminal kinase‐like protein[NTKL_HUMAN]
MFGMCIDSEC63Q9UGP8Translocation protein SEC63 homolog[SEC63_HUMAN]
MFGMCIDSH3GLB2Q9NR46Endophilin‐B2[SHLB2_HUMAN]
MFGMCIDSH3YL1Q96HL8SH3 domain‐containing YSC84‐like protein 1[SH3Y1_HUMAN]
MFGMCIDSHTN1A0MZ66Shootin‐1[SHOT1_HUMAN]
MFGMCIDSPAG9O60271C‐Jun‐amino‐terminal kinase‐interacting protein 4[JIP4_HUMAN]
MFGMCIDSRA1Q9HD15Steroid receptor RNA activator 1[SRA1_HUMAN]
MFGMCIDSRPRP08240Signal recognition particle receptor subunit alpha[SRPR_HUMAN]
MFGMCIDSSBP1Q04837Single‐stranded DNA‐binding protein, mitochondrial[SSBP_HUMAN]
MFGM CID STOML2 Q9UJZ1 Stomatin‐like protein 2, mitochondrial [STML2_HUMAN]
MFGM CID SYTL2 Q9HCH5 Synaptotagmin‐like protein 2 [SYTL2_HUMAN]
MFGMCIDTACC2O95359Transforming acidic coiled‐coil‐containing protein 2[TACC2_HUMAN]
MFGMCIDTAPBPO15533Tapasin[TPSN_HUMAN]
MFGMCIDTFPIP10646Tissue factor pathway inhibitor[TFPI1_HUMAN]
MFGMCIDTMED7Q9Y3B3Transmembrane emp24 domain‐containing protein 7[TMED7_HUMAN]
MFGMCIDTMPOP42167Lamina‐associated polypeptide 2, isoforms beta/gamma[LAP2B_HUMAN]
MFGMCIDTOR1AIP2Q8NFQ8Torsin‐1A‐interacting protein 2[TOIP2_HUMAN]
MFGMCIDTSC22D4Q9Y3Q8TSC22 domain family protein 4[T22D4_HUMAN]
MFGMCIDTTC1Q99614Tetratricopeptide repeat protein 1[TTC1_HUMAN]
MFGMCIDUBAP2LQ14157Ubiquitin‐associated protein 2‐like[UBP2L_HUMAN]
MFGMCIDUBTD2Q8WUN7Ubiquitin domain‐containing protein 2[UBTD2_HUMAN]
MFGMCIDUQCRBP14927Cytochrome b‐c1 complex subunit 7[QCR7_HUMAN]
MFGMCIDUQCRFS1P47985Cytochrome b‐c1 complex subunit Rieske, mitochondrial[UCRI_HUMAN]
MFGM CID VSNL1 P62760 Visinin‐like protein 1 [VISL1_HUMAN]
MFGMCIDZC3H15Q8WU90Zinc finger CCCH domain‐containing protein 15[ZC3HF_HUMAN]
MFGMCIDZNF185O15231Zinc finger protein 185[ZN185_HUMAN]
WMCIDDDX39BQ13838Spliceosome RNA helicase DDX39B[DX39B_HUMAN]
WMCIDEEF1A1P5Q5VTE0Putative elongation factor 1‐alpha‐like 3[EF1A3_HUMAN]
WMCIDEIF2S3LQ2VIR3Putative eukaryotic translation initiation factor 2 subunit 3‐like protein[IF2GL_HUMAN]
WMCIDH2AFXP16104Histone H2AX[H2AX_HUMAN]
WMCIDH2BFSP57053Histone H2B type F‐S[H2BFS_HUMAN]
WMCIDH3F3AP84243Histone H3.3[H33_HUMAN]
WMCIDHIST1H2AAQ96QV6Histone H2A type 1‐A[H2A1A_HUMAN]
WMCIDHIST1H2BCP62807Histone H2B type 1‐C/E/F/G/I[H2B1C_HUMAN]
WMCIDHIST1H2BHQ93079Histone H2B type 1‐H[H2B1H_HUMAN]
WMCIDHIST1H2BNQ99877Histone H2B type 1‐N[H2B1N_HUMAN]
WMCIDHIST2H3AQ71DI3Histone H3.2[H32_HUMAN]
WMCIDNUMA1Q14980Nuclear mitotic apparatus protein 1[NUMA1_HUMAN]
WMCIDRPL26L1Q9UNX360S ribosomal protein L26‐like 1[RL26L_HUMAN]
WMCIDRPL36AP8388160S ribosomal protein L36a[RL36A_HUMAN]
WMCIDRPL36ALQ969Q060S ribosomal protein L36a‐like[RL36L_HUMAN]
WMCIDTKFCQ3LXA3Triokinase/FMN cyclase[TKFC_HUMAN]
WMCIDUBA52P62987Ubiquitin‐60S ribosomal protein L40[RL40_HUMAN]
WMCIDUBE2NLQ5JXB2Putative ubiquitin‐conjugating enzyme E2 N‐like[UE2NL_HUMAN]
WM HCD FYN P06241 Tyrosine‐protein kinase Fyn [FYN_HUMAN]
WMHCDHIST1H3AP68431Histone H3.1[H31_HUMAN]
WMHCDHIST2H2ABQ8IUE6Histone H2A type 2‐B[H2A2B_HUMAN]
WMHCDHSD17B10Q997143‐hydroxyacyl‐CoA dehydrogenase type‐2[HCD2_HUMAN]
WMHCDSKOR2Q2VWA4SKI family transcriptional corepressor 2[SKOR2_HUMAN]

The text in bold indicates proteins associated with calcium metabolism.

Newly discovered proteins in human milk The text in bold indicates proteins associated with calcium metabolism.

Functional Classification

Functional classification using QuickGO extracted 318 GO terms associated with the keyword calcium. The next stage was to determine which of the 1391 proteins identified in this investigation are associated with any of these 318 “calcium” GO terms. We found that 462 of our identified proteins to be associated with a total of 178 of the 318 calcium GO’s (shown in Data File S2). From these 462 proteins, 201 (43%) could be matched to GNs, which is the output format of QuickGO. Notably, FAM20A is linked to GO terms: calcium ion homeostasis (GO: 0055074) and enamel mineralization (GO: 0070166). From this list of 201, 21 proteins (1 from whole milk and 20 from MFGM fraction) were from our 136 proteins previously undiscovered in whole milk, Table 2.
TABLE 2

Previously undiscovered proteins in human milk associated with calcium metabolism

Milk fractionAccession #Gene nameDescriptionGO IDGO nameAspect
1MFGMQ9UHX3ADGRE2Adhesion G protein‐coupled receptorGO:0005509Calcium ion bindingFunction
2MFGMP78537BLOC1S1Biogenesis of lysosome‐related organelles complex 1 subunitGO:0060155Platelet dense granule organizationProcess
3MFGMQ6QNY1BLOC1S2Biogenesis of lysosome‐related organelles complex 1 subunit 2GO:0060155Platelet dense granule organizationProcess
4MFGMQ6QNY0BLOC1S3Biogenesis of lysosome‐related organelles complex 1 subunit 3GO:0060155Platelet dense granule organizationProcess
5MFGMP10092CALCBCalcitonin gene‐related peptideGO:0006874Cellular calcium ion homeostasisProcess
6MFGMQ96EV8DTNBP1DysbindinGO:0060155Platelet dense granule organizationProcess
7MFGMP42566EPS15Epidermal growth factor receptor substrate 15GO:0005509Calcium ion bindingFunction
8MFGMQ9UBC2EPS15L1Epidermal growth factor receptor substrate 15‐like 1GO:0005509Calcium ion bindingFunction
9MFGMQ96HE7ERO1AERO1‐like protein alphaGO:0051209Release of sequestered calcium ion into cytosolProcess
10MFGMP08709F7Coagulation factor VIIGO:0005509Calcium ion bindingFunction
11MFGMP00740F9Coagulation factor IXGO:0005509Calcium ion bindingFunction
12MFGMQ9Y680FKBP7Peptidyl‐prolyl cis‐trans isomerase FKBP7GO:0005509Calcium ion bindingFunction
13Whole milkP06241FYNTyrosine‐protein kinase FynGO:0006816Calcium ion transportProcess
14MFGMQ16891IMMTMICOS complex subunit MIC60GO:0051560Mitochondrial calcium ion homeostasisProcess
15MFGMP17252PRKCAProtein kinase C alpha typeGO:0004698Calcium‐dependent protein kinase C activityFunction
16MFGMP22891PROZVitamin K‐dependent protein ZGO:0005509Calcium ion bindingFunction
17MFGMQ15042RAB3GAP1Rab3 GTPase‐activating protein catalytic subunitGO:1903233Regulation of calcium ion‐dependent exocytosis of neurotransmitterProcess
18MFGMQ14257RCN2Reticulocalbin−2GO:0005509Calcium ion bindingFunction
19MFGMQ9UJZ1STOML2Stomatin‐like protein 2, mitochondrialGO:0006851 GO:0006874Mitochondrial calcium ion transport Cellular calcium ion homeostasisProcess
20MFGMQ9HCH5SYTL2Synaptotagmin‐like protein 2GO:0005509 GO:0005544Calcium ion binding calcium‐dependent phospholipid bindingFunction
21MFGMP62760VSNL1Visinin‐like protein 1GO:0005509Calcium ion bindingFunction
Previously undiscovered proteins in human milk associated with calcium metabolism

Network Analysis

Network analysis was performed using STRING to identify proteins that have a known interaction with FAM20A in humans (H. sapiens) or mice (Mus musculus) and consequently to see whether these proteins are present within our dataset and in the previously cataloged human milk proteome (Figure 6). This analysis revealed FAM20A interactions with nine and ten other proteins in H. sapiens and M. musculus, respectively. The GN, description and functions of these proteins are detailed in Table 3, we were only able to identify FAM20C, ENAM, XYLT1, and Wdr72 from our dataset and these proteins have been previously listed as part of the milk proteome (Lu et al., 2018; Ma et al., 2019; Yang et al., 2017; Zhu et al., 2019). We were unable to find any of these proteins in our list of newly discovered proteins in human milk.
FIGURE 6

Interaction of known protein with FAM20A in (a) Homo sapiens and Mus musculus (b). STRING analysis reveals the interaction of FAM20A with nine and 10 other proteins in human and mouse, respectively. The relevant proteins are given as gene names and are shown as nodes. These proteins are produced by a single protein‐coding gene locus. The filled nodes symbolize 3D structure is either known or it is predicted. The edges (lines) represent either experimentally determined interaction (pink), or through text mining (green), or its co‐expression (black) and protein homology (purple). †Proteins that were also identified in our dataset

TABLE 3

STRING analysis of proteins known to interact with FAM20A

Gene nameDescriptionFunction
Homo sapiens Mus Musculus
ENAM No interaction EnamelinInvolved in mineralization of enamel
AMTN No interaction AmelotinPromotes calcium phosphate mineralization
FKBP9 No interaction Peptidyl‐prolyl cis‐trans isomerase FKBP9Involved in protein folding
AMBN Ambn AmeloblastinInvolved in mineralization of enamel
FAM20C Fam20c Extracellular serine/threonine protein kinasePhosphorylates proteins with a Ser‐x‐Glu/pSer motifs in the secretory pathway
TMEM38B Tmem38b Trimeric intracellular cation channel type BChannel allowing rapid release of intracellular calcium
FKB10 Fkb10 Peptidyl‐prolyl cis‐trans isomerase FKBP1BInvolved in protein folding

LEPRE1 (P3H1)

Lepre (P3h1) Prolyl 3‐hydroxylase 1Involved in post translational modification of collagen
IFITM5 Ifitm5 Interferon‐induced transmembrane protein 5Plays a role in bone mineralization
No interaction Fjx1 Four‐jointed box protein 1Descendants from the four‐jointed (FJ) family of protein kinases
No interaction

Xylt1

Xylosyltransferase 1Involved in the synthesis of glycosaminoglycan
No interaction Wdr72 WD repeat‐containing protein 72Involved in mineralization of enamel
No interaction Fam83h Protein FAM83HInvolved in mineralization of enamel

KEY: Bold = HUGO Gene Names and non‐bold = alternative Gene Names.

Interaction of known protein with FAM20A in (a) Homo sapiens and Mus musculus (b). STRING analysis reveals the interaction of FAM20A with nine and 10 other proteins in human and mouse, respectively. The relevant proteins are given as gene names and are shown as nodes. These proteins are produced by a single protein‐coding gene locus. The filled nodes symbolize 3D structure is either known or it is predicted. The edges (lines) represent either experimentally determined interaction (pink), or through text mining (green), or its co‐expression (black) and protein homology (purple). †Proteins that were also identified in our dataset STRING analysis of proteins known to interact with FAM20A LEPRE1 (P3H1) Xylt1 KEY: Bold = HUGO Gene Names and non‐bold = alternative Gene Names.

DISCUSSION

Calcium is essential for many cellular functions such as nerve impulse transmission, muscular contraction, hormone secretion, and intracellular adhesion and for bone mineralization. Intracellular calcium levels need to be tightly maintained; dysregulation can lead to severe cell dysfunction and death. Ectopic calcification may also occur due to dysregulation of calcium homeostasis. Pathological calcification is associated with a number of medical conditions such as abdominal aortic aneurysm (Buijs et al., 2013) and chronic kidney disease. Tumor calcification is also common in breast cancer (Mischak et al., 2007). Surprisingly, as milk is rich in minerals such as calcium and phosphates there seems not to be any evidence of calcification in the tissue during lactation. However, there are a few case reports of post‐lactation micro‐calcifications (Giron et al., 2004; Stucker et al., 2000). In these individual cases, there is a strong family history of breast cancer where calcifications have been noted in the breast tissue. The processes that normally maintain calcium balance can be delineated by studying epithelial tissues where calcification is occurring. Since biopsies can be difficult to obtain from relevant cohorts this study has used human milk to identify and quantify FAM20A, a protein that is known to be involved in calcium regulation. The availability of samples in this investigation was limited and collected from specific lactating donors on an ad hoc basis. This resulted in three samples being collected for the detection and quantification of FAM20A. Several studies have identified FAM20A in human milk through the use of mass spectrometry (Gao et al., 2012; Mange et al., 2013; Zhang et al., 2013). Here we performed the first comparative quantitative study using a targeted HCD MS/MS fragmentation experiment. This involved combining shotgun CID MS/MS and targeted HCD MS/MS fragmentation experiments to provide relative quantification across the three different milk samples. Furthermore, we performed proteomic analyses to establish potential functional network of proteins involved in calcium metabolism in milk and confirm whether any of these or existing proteins have any interaction with FAM20A as shown through the STRING analysis in Figure 6. Despite low concentrations of FAM20A mRNA previously reported within the MFGM fraction of human milk (Lemay et al., 2013), we were unable to identify the protein within our pooled MFGM fraction. Instead, using a shotgun CID MS/MS fragmentation experiment, the FAM20A protein, based on the combined peak area of the two peptides, was identified at low concentrations (1.6 µM) in human milk. This calculated amount is speculative, and a larger sample cohort would be needed to determine the concentration range in the human population. Regardless of the samples being handled differently, the difficulties of human milk storage conditions in neonatal hospitals are widely recognized. Additionally, an investigation on the impact of storage conditions on the breast milk peptidome for MS experiments demonstrated that the majority of peptides kept quite stable when stored at −80°C, −20°C, or even at 4°C (Howland et al., 2020). Previous work by Nalbant et al. (2005), has shown FAM20A to be a secreted glycoprotein. Therefore, it is likely the FAM20A detected in our whole milk is expressed within the lactocytes and secreted into the whey fraction of human milk. This concurs with previous findings (Beck et al., 2015; Gao et al., 2012; Mange et al., 2013; Zhang et al., 2016), identifying FAM20A in the whey fraction of human milk. The FAM20A mRNA detected by Lemay et al. (2013) may be encapsulated within the MFGM during its development. The development starts with the formation of lipids within the endoplasmic reticulum (ER). Secretion of this lipid occurs by it being enveloped by the plasma membrane at the apical side of the lactocytes, taking with it the mRNA as well as protein remnants from these cells. Subsequent analysis involved HCD MS/MS fragmentation experiment to quantify all proteins including FAM20A in whole milk from the three individuals with and without PI. This procedure quantified 247 proteins and FAM20A protein was not identified in this list. This suggests that without a hypothesis‐driven question these quantitative platforms can miss out proteins of clinical relevance. It was only through a targeted HCD MS/MS experiment that we were then able to successfully quantify FAM20A up to 44 weeks milk postpartum. Little is known regarding the role of FAM20A, though there is an association that FAM20A prevents calcification. Furthermore, there is evidence suggesting that FAM20A, a pseudokinase, forms a functional unit with FAM20C and this complex enhances the ability to phosphorylate several proteins with serine‐x‐glutamic acid/phosphoserine motifs. One of these proteins is the secreted phosphoprotein 1 (SPP1) (Cui et al., 2015; Ohyama et al., 2016) also known as Osteopontin (OPN). This protein was identified in both the whole milk and MFGM fraction. However, network analysis with STRING (as accessed in May 2017) did not confirm an association of SPP1 with FAM20A; we know STRING is an evolving application that employs algorithms based on known knowledge and prediction of protein to protein interaction and therefore not surprising at the time of analysis SPP1 was not known in the database. Furthermore, despite STRING being a currently accepted application for protein networks, any interactions identified would need further experimental validation to confirm any putative biological interactions. STRING did identify direct interaction (as indicated by selecting the first shelf in the settings) with 13 other proteins with FAM20A (Table 3). From these FAM20C, ENAM, XYLT1, and WDR72 have previously been identified in human milk. As already mentioned, FAM20C forms a functional unit with FAM20A and enhances phosphorylation of secreted proteins such as SPP1, RCN1, and NUCB1 (Cui et al., 2015; Ohyama et al., 2016). Furthermore mutations in FAM20C have been associated with lethal bone dysplasia (Raine syndrome); a condition characterized by over‐calcification of bone and other non‐mineralized tissues (Ababneh et al., 2013; Acevedo et al., 2015; Fradin et al., 2011; Seidahmed et al., 2015; Simpson et al., 2007, 2009) Interestingly, mutations in WDR72 (El‐Sayed et al., 2009) and ENAM (Rajpar et al., 2001) are also causative for amelogenesis imperfecta, emphasizing the essential role of these proteins in normal physiological calcification processes, as disruptions in these genes lead to loss of, thinning or softening of enamel as aberration (Slootweg, 2007). For example, loss of WDR72 results in mislocalization of the sodium calcium potassium exchanger 4 (SLC24A4) away from the distal membrane and into the cytosol of enamel‐depositing ameloblast cells (Wang et al., 2015). WDR72 has also been identified as causative for distal renal tubular acidosis, which in turn is associated with nephrocalcinosis and thus shares important parallels with FAM20A (Zhang et al., 2019). These ameloblast cells, which are only present during tooth development, also secrete the enamel matrix protein, ENAM which is required for proper formation of enamel (Rajpar et al., 2001). Whereas point mutations in the XYLT1 and XYLT2, both of which code for a protein required for initiation of biosynthesis of proteoglycan in articulate cartilage, have been associated with decreased enzyme activity and early onset osteoarthritis (Schön et al., 2006). Interestingly, calcification has not been reported in lactating breast tissue of ERS patients with recessive mutations in FAM20A. This could be a result of the extreme rarity of this disorder or be compatible with the presence of other proteins related to calcium metabolism. QuickGo and STRING applications are constantly undergoing changes due to ever‐increasing knowledge of protein multi‐location and functional moonlighting. Functional classification analysis only revealed a 43% (201 proteins) match to our original input; a disparity likely the result of variations in the output generated by two unsynchronized applications (QuickGo and Proteome Discoverer). Furthermore, 21/201 proteins are from our list of previously undiscovered proteins in human milk. Interestingly one of these is the calcitonin gene‐related peptide‐2 CALCB (GO: 0006874, cellular calcium ion homeostasis) which was identified in the MFGM fraction. Its family member, the hormone calcitonin is involved in regulating calcium levels (Steenbergh et al., 1985) and it is likely CACLB could also be involved in maintaining Ca2+ levels. So far studies have shown that CALCB is expressed in the enteric and stomach neuronal system and plays an important role (1) as a vasodilator in stimulating milk production (Smillie & Brain, 2011; Thulesen et al., 1994) and (2) in maintaining the mucosal integrity and gastric acid regulation (Beglinger et al., 1988; Feng et al., 2011; Lenz et al., 1985; Ohno et al., 2008; Pappas et al., 1986; Tache, 1992). In addition supplementation of CALCB has been shown to relieve gastric hyperacidity (Beglinger et al., 1988). Human milk has recently been shown to activate the coagulations system by triggering the coagulation tissue factor (TF) located in extracellular vesicle (Hu et al., 2020). Additionally, from our unique list of calcium metabolism proteins, F7 (Broze, 2001), F9 (Millar et al., 2000) and PROZ (Davie & Fujikawa, 1975) are involved in blood coagulation. F7 is a high‐affinity receptor and a co‐factor for F7. Upon vascular injury, F7 contacts TF, and is cleaved to its active form; this reaction is the primary event in blood coagulation. The complex of TF and activated factor VII serves to activate other coagulation factors including F9 (Millar et al., 2000). Furthermore, the BLOC1 complex is also known to regulate coagulation (Merideth et al., 2020), and its members BLOC1S1, BLOC1S2, BLOC1S3, (Starcevic & Dell'Angelica, 2004) and DTNBP3 (Nazarian et al., 2006) are part of our newly discovered list of proteins in human milk that are associated with calcium metabolism.

Limitations

The principal limitation of our study is the samples were collected at different time points. Furthermore, due to the inherent limitation of labeling kits for quantitative proteomics the maximum number of samples that can be analyzed is 16. Hence there are no quantitative proteomics studies using large numbers of human milk samples, probably reflecting the difficulties in obtaining such a collection. We therefore can only comment on the presence of FAM20A in human milk but not on how the concentration may vary (1) between individuals and (2) the number of weeks post‐partum. Nevertheless, the identification of FAM20A in all samples supports a role for this protein in preventing the calcification of epithelial tissues. Further studies using the sample from defined lactation time points would be needed to better understand this.

CONCLUSION

In this current study, we were able to quantify FAM20A in transitional and mature human milk via the MS/MS driven proteomics targeted approach. The different fragmentation protocols used in this investigation show variable results in terms of type and number of proteins identified. Thus, the study highlights the importance of combining both traditional shotgun CID MS/MS fragmentation experiments with targeted HCD MS/MS fragmentation experiment for the analysis of low abundance proteins in hypothesis‐driven research. Additionally, the 136 newly discovered proteins are now cataloged to the evolving human milk proteome. Interestingly, 21 of the newly discovered proteins associated with calcium regulation, 11 are associated with calcium‐binding ion (GO: 0005509), and 2 were associated with cellular calcium homeostasis (GO: 0006874). We speculate that these proteins could potentially be key players in buffering the large amount of calcium passing through MEC, hence preventing calcification of the secretory epithelial tissue of breast during lactation.

CONFLICT OF INTEREST

There is no conflict of interest.

AUTHOR CONTRIBUTIONS

VP wrote the manuscript, prepared the samples, and analyzed the data. EK was involved in analysis of the data. GW contributed to sample preparation and optimizing mass spectrometry protocols. DB obtained funding and revised the manuscript. KS, SBW, MB, and NH contributed in writing of the manuscript. GJ and NI contributed to the design of the study. JGZ revised the manuscript and contributed to the data analysis. RK obtained funding and contributed to the conception of this study and was involved in the overall design of the study, JW contributed to the overall design of the study and performed all mass spectrometry experiments.

ETHICAL APPROVAL

This work has been performed in accordance with the Declaration of Helsinki under a protocol approved by the NHS research ethics committee (05/Q0508/6). Data File S1 Click here for additional data file. Data File S2 Click here for additional data file. Data File S1‐S2 Click here for additional data file.
  87 in total

1.  Human milk triggers coagulation via tissue factor-exposing extracellular vesicles.

Authors:  Yong Hu; Lena Hell; Ruth Anna Kendlbacher; Najat Hajji; Chi Hau; Annemieke van Dam; René J Berckmans; Lukas Wisgrill; Cihan Ay; Ingrid Pabinger; Alain Brisson; Andreas Repa; Rienk Nieuwland; Johannes Thaler
Journal:  Blood Adv       Date:  2020-12-22

2.  Calcitonin gene-related peptide-induced selective inhibition of gastric acid secretion in dogs.

Authors:  T Pappas; H T Debas; J H Walsh; J Rivier; Y Taché
Journal:  Am J Physiol       Date:  1986-01

3.  A Single Kinase Generates the Majority of the Secreted Phosphoproteome.

Authors:  Vincent S Tagliabracci; Sandra E Wiley; Xiao Guo; Lisa N Kinch; Eric Durrant; Jianzhong Wen; Junyu Xiao; Jixin Cui; Kim B Nguyen; James L Engel; Joshua J Coon; Nick Grishin; Lorenzo A Pinna; David J Pagliarini; Jack E Dixon
Journal:  Cell       Date:  2015-06-18       Impact factor: 41.582

4.  A longitudinal study of calcium homeostasis during human pregnancy and lactation and after resumption of menses.

Authors:  L D Ritchie; E B Fung; B P Halloran; J R Turnlund; M D Van Loan; C E Cann; J C King
Journal:  Am J Clin Nutr       Date:  1998-04       Impact factor: 7.045

Review 5.  Calcitonin gene-related peptide (CGRP) and its role in hypertension.

Authors:  Sarah-Jane Smillie; Susan D Brain
Journal:  Neuropeptides       Date:  2011-01-26       Impact factor: 3.286

6.  Molecular analysis of the genotype-phenotype relationship in factor VII deficiency.

Authors:  D S Millar; G Kemball-Cook; J H McVey; E G Tuddenham; A D Mumford; G B Attock; J C Reverter; N Lanir; L A Parapia; J Reynaud; E Meili; A von Felton; U Martinowitz; D R Prangnell; M Krawczak; D N Cooper
Journal:  Hum Genet       Date:  2000-10       Impact factor: 4.132

7.  Proteomic characterization and comparison of mammalian milk fat globule proteomes by iTRAQ analysis.

Authors:  Yongxin Yang; Nan Zheng; Xiaowei Zhao; Yangdong Zhang; Rongwei Han; Lu Ma; Shengguo Zhao; Songli Li; Tongjun Guo; Jiaqi Wang
Journal:  J Proteomics       Date:  2015-01-08       Impact factor: 4.044

8.  Nephrocalcinosis (enamel renal syndrome) caused by autosomal recessive FAM20A mutations.

Authors:  Graciana Jaureguiberry; Muriel De la Dure-Molla; David Parry; Mickael Quentric; Nina Himmerkus; Toshiyasu Koike; James Poulter; Enriko Klootwijk; Steven L Robinette; Alexander J Howie; Vaksha Patel; Marie-Lucile Figueres; Horia C Stanescu; Naomi Issler; Jeremy K Nicholson; Detlef Bockenhauer; Christopher Laing; Stephen B Walsh; David A McCredie; Sue Povey; Audrey Asselin; Arnaud Picard; Aurore Coulomb; Alan J Medlar; Isabelle Bailleul-Forestier; Alain Verloes; Cedric Le Caignec; Gwenaelle Roussey; Julien Guiol; Bertrand Isidor; Clare Logan; Roger Shore; Colin Johnson; Christopher Inglehearn; Suhaila Al-Bahlani; Matthieu Schmittbuhl; François Clauss; Mathilde Huckert; Virginie Laugel; Emmanuelle Ginglinger; Sandra Pajarola; Giuseppina Spartà; Deborah Bartholdi; Anita Rauch; Marie-Claude Addor; Paulo M Yamaguti; Heloisa P Safatle; Ana Carolina Acevedo; Hercílio Martelli-Júnior; Pedro E dos Santos Netos; Ricardo D Coletta; Sandra Gruessel; Carolin Sandmann; Denise Ruehmann; Craig B Langman; Steven J Scheinman; Didem Ozdemir-Ozenen; Thomas C Hart; P Suzanne Hart; Ute Neugebauer; Eberhard Schlatter; Pascal Houillier; William A Gahl; Miikka Vikkula; Agnès Bloch-Zupan; Markus Bleich; Hiroshi Kitagawa; Robert J Unwin; Alan Mighell; Ariane Berdal; Robert Kleta
Journal:  Nephron Physiol       Date:  2013-02-23

9.  Assessment of ORAI1-mediated basal calcium influx in mammary epithelial cells.

Authors:  Diana G F Ross; Chanel E Smart; Iman Azimi; Sarah J Roberts-Thomson; Gregory R Monteith
Journal:  BMC Cell Biol       Date:  2013-12-20       Impact factor: 4.241

10.  Discovery and Quantification of Nonhuman Proteins in Human Milk.

Authors:  Jing Zhu; Luc Garrigues; Henk Van den Toorn; Bernd Stahl; Albert J R Heck
Journal:  J Proteome Res       Date:  2018-12-12       Impact factor: 4.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.