Literature DB >> 28596590

A comprehensive analysis and annotation of human normal urinary proteome.

Mindi Zhao1,2, Menglin Li1,3, Yehong Yang4, Zhengguang Guo4, Ying Sun5, Chen Shao1, Mingxi Li5, Wei Sun6, Youhe Gao7.   

Abstract

Biomarkers are measurable changes associated with the disease. Urine can reflect the changes of the body while blood is under control of the homeostatic mechanisms; thus, urine is considered an important source for early and sensitive disease biomarker discovery. A comprehensive profile of the urinary proteome will provide a basic understanding of urinary proteins. In this paper, we present an in-depth analysis of the urinary proteome based on different separation strategies, including direct one dimensional liquid chromatography-tandem mass spectrometry (LC/MS/MS), two dimensional LC/MS/MS, and gel-eluted liquid fraction entrapment electrophoresis/liquid-phase isoelectric focusing followed by two dimensional LC/MS/MS. A total of 6085 proteins were identified in healthy urine, of which 2001 were not reported in previous studies and the concentrations of 2571 proteins were estimated (spanning a magnitude of 106) with an intensity-based absolute quantification algorithm. The urinary proteins were annotated by their tissue distribution. Detailed information can be accessed at the "Human Urine Proteome Database" (www.urimarker.com/urine).

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28596590      PMCID: PMC5465101          DOI: 10.1038/s41598-017-03226-6

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Urine is associated with glomerular filtration, tubular reabsorption and secretion[1]. Biomarkers are measurable changes associated with the disease[2]. Because urine can accumulate changes from the body[2, 3], it is considered to be one of the most attractive sources for early and sensitive biomarker discovery. Urinary proteomic studies have identified many candidate biomarkers for various urogenital diseases, such as acute kidney injury, bladder cancer and diabetic nephropathy[4-6]. As urinary proteins are composed largely of filtered plasma proteins, the urinary proteome is also considered to be valuable for detecting a broad range of complex disorders, such as encephalopathy, heart failure and intestinal ischemia[7-9]. In the biomarker discovery process, it is essential to comprehensively profile the normal urinary proteome as a baseline reference. With the rapid development of mass spectrometry (MS), larger numbers of urinary proteins were identified by various strategies. In 2001, Patterson et al. first identified 124 urinary proteins[10, 11]. In 2005, Sun et al. identified 226 proteins in normal urine with quality control LC/MS/MS data[3]. In 2006, Adachi et al. reported the first urinary proteome result (1543 proteins) from high resolution mass spectrometry[12]. In 2011, 1823 urinary proteins were identified by high resolution MS and MS/MS[13]. Many efforts have been made to identify more urinary proteins in recent years[14-16]. Currently, the human urine PeptideAtlas database contains a total of 23,739 peptides corresponding to 2487 proteins[17]. In 2014, two large-scale MS-based drafts of the human proteome identified 17,294 and 18,097 human gene products from 30 and 44 tissues and body fluids, respectively[18, 19]. In each study, the number of identified proteins was quite large and even approached the number of protein-coding genes in the complete human genome analysis[20]. Compared with the depth of the human tissue proteome, the urinary proteome has been relatively less studied. We are curious about how many proteins could be identified in human urinary proteome. Therefore, we performed an in-depth urinary proteomic analysis using normal human urine samples. And to achieve maximal urinary proteome coverage one-, two- and three-dimensional separation strategies (Fig. 1) were employed in this study. By in-depth analysis, a readily obtainable source for the human urinary proteome, “Human Urinary Proteome Database” could be provided. The comparison of three separation strategies could provide detailed information about the potential application of different separation methods. The detailed workflow was as followings: In one-dimensional (1D) separation, digested urinary peptides were directly analyzed by 1D liquid chromatography-tandem mass spectrometry (LC/MS/MS). In two-dimensional (2D) separation, urinary peptides were fractionated by offline high-pH reverse-phase liquid chromatography (RPLC) prior to analysis by 1DLC/MS/MS. In three-dimensional (3D) separation, urinary proteins were first fractionated by gel-eluted liquid fraction entrapment electrophoresis (GELFrEE) or liquid-phase isoelectric focusing (LP-IEF) and urinary peptides digested from GELFrEE/LP-IEF fractions were fractionated by RPLC as performed for 2D separation and finally analyzed by 1DLC/MS/MS. In total, 383 LC/MS/MS runs were analyzed by hybrid quadrupole-time-of-flight mass spectrometry (TripleTOF 5600).
Figure 1

The workflow of urinary proteome analysis. Pooled urine from 24 humans was analyzed using three separation strategies. 1D: Urinary peptides were directly analyzed via 1DLC/MS/MS without fractionation. 2D: Urinary peptides were analyzed via offline RPLC and 1DLC/MS/MS. 3D: Urinary proteins were first fractionated by GELFrEE/LP-IEF prior to offline RPLC. A total of 383 fractions were analyzed by LC/MS/MS using high-resolution TripleTOF 5600 MS. A urine proteome database was then constructed based on bioinformatics analyses.

The workflow of urinary proteome analysis. Pooled urine from 24 humans was analyzed using three separation strategies. 1D: Urinary peptides were directly analyzed via 1DLC/MS/MS without fractionation. 2D: Urinary peptides were analyzed via offline RPLC and 1DLC/MS/MS. 3D: Urinary proteins were first fractionated by GELFrEE/LP-IEF prior to offline RPLC. A total of 383 fractions were analyzed by LC/MS/MS using high-resolution TripleTOF 5600 MS. A urine proteome database was then constructed based on bioinformatics analyses.

Results and Discussion

Comprehensive identification of urinary proteome

In this study, pooled urine samples were used to establish a large database of urinary proteins. The following filters were used to select the final protein identification list (1). The FDR at the protein level was set to <1%, and (2) each protein should include at least two unique peptides. When identified peptides were shared between two proteins, they were combined and reported as one protein group. The results from 1DLC, 2DLC and 3DLC yielded average FDRs of 0.10%, 0.26% and 1% at the spectrum, peptide and protein levels, respectively (Supplemental File 1). Then the datasets were combined together with Scaffold perSPECtives. In 1D analysis, 808 protein groups were identified in three technical replicates, and the protein-overlapping rate was 86.3%, indicating the superior reproducibility of LC/MS analysis. In 2D analysis, a total of 3162 protein groups were identified. In 3D analysis, urinary proteins were first separated by GELFrEE/LP-IEF (Fig. 2A,B). GELFrEE enables mass range proteome separations based on molecular weight (MW), and IEF fractionates proteins according to isoelectric point (pI)[21, 22]. The GELFrEE and LP-IEF fractions were then further separated by RPLC, and a total of 6085 protein groups were identified. The overlap among the proteins identified in the 1D, 2D and 3D analyses is displayed in Fig. 2C. Almost all proteins from the 1D and 2D analyses were included in the 3D results except for 9 and 15 proteins from the 1D and 2D results, respectively. The possible reasons why these proteins cannot be identified in the 3D methods were still unknown. Maybe these proteins were lost during 2D or 3D separate by high pH RPLC or IEF/GELFrEE. It may be also as a result of the random sampling of DDA detection modes. Therefore, we are not sure whether these proteins were false positive identification or not. Then these proteins (Supplemental File 2) were removed from the subsequent analysis to ensure data accuracy and reliability. Thus, the whole urine proteome dataset eventually contained 6085 protein groups (Supplemental Table 1).
Figure 2

The results from three separation strategies. (A) A Coomassie-stained Bis-Tris gel image of 12 GELFrEE fractions over a broad mass range. (B) Coomassie-stained Bis-Tris gel image of 10 LP-IEF fractions over a pI range from 3 to 10. (C) Venn diagram of proteins identified by three separation strategies. (D) Venn diagram of proteins identified from this study as well as previous urine and exosome proteome studies. (E) Comparative analysis of the urine, kidney and plasma proteome.

The results from three separation strategies. (A) A Coomassie-stained Bis-Tris gel image of 12 GELFrEE fractions over a broad mass range. (B) Coomassie-stained Bis-Tris gel image of 10 LP-IEF fractions over a pI range from 3 to 10. (C) Venn diagram of proteins identified by three separation strategies. (D) Venn diagram of proteins identified from this study as well as previous urine and exosome proteome studies. (E) Comparative analysis of the urine, kidney and plasma proteome. Several studies have been conducted to characterize the normal human urinary proteome. Table 1 summarizes the current largest-scale studies of human urine and urinary exosomes using high-resolution MS[12–16, 23–25]. The protein accessions in each dataset were mapped to the corresponding gene IDs[26, 27]. Total nine large-scale urinary and exosome proteomic analyses were performed in recent years. When all of the data from these nine studies were combined, a total of 8021 gene products were detected in the human urinary proteome (Supplemental Table 2). When comparing previous data with our results (Fig. 2D), total 2001 gene products were uniquely identified in this study. The possible reasons of differences in urine proteome between different studies may be genetic factors, individual variations, different separate methods and MS preference.
Table 1

Recent large-scale proteomic studies of healthy human urine.

SampleNumber of identificationsDatabaseMS InstrumentAnalysis MethodsSingle peptide includedReference
Urine1543IPILTQ-OrbitrapSDS-PAGEYesAdachi et al.[12]
Urine1310IPILTQ-OrbitrapSCX/SAXYesLi et al.[14]
Urine1823GILTQ-Orbitrap VelosSDS-PAGEYesMarimuthu et al.[13]
Urine1985IPILTQ-Orbitrap VelosSDS-PAGEYesZheng et al.[15]
Urine3429UniprotLTQ-Orbitrap Velos ProCombinatorial peptide ligand librariesYesSantucci et al.[16]
Exosome1132GILTQSDS-PAGEYesGonzales et al.[23]
Exosome3280UniprotLTQ-Orbitrap VelosSDS-PAGE followed by SCXNoWang et al.[24]
Exosome1830SwissprotLTQ-Orbitrap VelosSDS-PAGENoHogan et al.[25]
Urine6085SwissprotTripleTOF 5600GELFREE/IEF-RPLCNoZhao et al. 2017
Recent large-scale proteomic studies of healthy human urine. Urinary proteins, which are considered to represent the protein composition of the output of the kidneys[28], are primarily composed of proteins derived from plasma filtration and urinary tract system secretion. A comparative analysis of the urine, plasma and kidney proteome would provide a more concrete link to determine how many plasma- and kidney-related proteins could be detected in urine. The PeptideAtlas builds yielded 3553 and 4005 non-redundant proteins at 1% FDR for the plasma and kidney proteomes[29]. In contrast, a total of 2940 (47.7%) and 3238 (52.6%) of the gene products identified in this urinary proteome study were common to the gene products (Fig. 2E) that were reported in the plasma (81.1%) and kidney proteomes (75.6%), respectively. According to previous report, approximately 30% of urinary proteins originate from the plasma proteins, whereas 70% comes from the kidney and the urinary tract[30]. From our study, maybe the difference between plasma and urine is smaller than expected. And it might be possible that more common proteins will be identified with the development of MS in the future. By comparison with kidney proteome, we want to show the overlap between urine and kidney proteome. The large overlap may give evidences that urine can better reflect the functions of kidney.

Quantitative analysis of urinary proteins

Quantitation of urinary proteins will improve our understanding of the urinary proteome and will facilitate the development of urinary biomarkers. Accordingly, we aimed to quantify each protein using the iBAQ algorithm, which provides a rough indication of actual protein levels[31]. In 3D analysis, equal amounts of protein from each GELFrEE/LP-IEF fraction were used for LC/MS/MS analysis, which thus cannot provide an accurate quantitative analysis. Therefore, data from the 2D strategy were used, and a total of 2571 proteins were quantified with the iBAQ algorithm. The dynamic range of relative abundance spanned six orders of magnitude (Fig. 3A), which was consistent with previous observations[32]. Considering that more than 3000 proteins in the 3D analysis were not quantified, the dynamic range is expected to be even bigger. In the present analysis, serum albumin and uromodulin were the most abundant urinary proteins.
Figure 3

Quantitative analysis of urinary proteins by the iBAQ method. (A) The relative expression and concentrations of 2,571 proteins in the 2D analysis were estimated by iBAQ. The left y-axis represents relative abundance, and the right y axis represents estimated concentration (pg/mL). (1) ALB: albumin; UROM: uromodulin, the two most abundant proteins. (2) RARS: arginine-tRNA ligase, the least abundant protein in 2D analysis; (3) RNASE 6: ribonuclease K6, the least abundant protein in 1D analysis. (B) Correlation plot between estimated concentrations and immunoassays results.

Quantitative analysis of urinary proteins by the iBAQ method. (A) The relative expression and concentrations of 2,571 proteins in the 2D analysis were estimated by iBAQ. The left y-axis represents relative abundance, and the right y axis represents estimated concentration (pg/mL). (1) ALB: albumin; UROM: uromodulin, the two most abundant proteins. (2) RARS: arginine-tRNA ligase, the least abundant protein in 2D analysis; (3) RNASE 6: ribonuclease K6, the least abundant protein in 1D analysis. (B) Correlation plot between estimated concentrations and immunoassays results. The average concentration of urinary albumin, which was one of the most easily detected urinary proteins, was approximately 2.2–3.3 µg/mL[12, 33] in normal human urine. With the iBAQ value ratios, concentrations of the other 2570 proteins could subsequently be estimated (Supplemental Table 3). The estimated concentration of arginine-tRNA ligase (RARS), which had the lowest relative abundance in the 2D analysis, was 0.68~1.02 pg/mL. As the 2D results contained almost all of the proteins from the 1D separation, concentrations of the 753 proteins from the 1D analysis could be inferred. Among them, ribonuclease K6 (RNASE 6) was the least abundant protein in the 1D analysis with an estimated concentration of 5.58~8.37 pg/mL. To evaluate the accuracy of estimated concentrations with the iBAQ algorithm and the corresponding application to other samples in different labs, the estimated concentrations were compared with the results from immunoassay screening in a previous urinary candidate biomarker study[34]. A total of 89 proteins were commonly evaluated in both studies (Fig. 3B and Supplemental Table 4, R2 = 0.59).

Functional annotation of three separation strategies

Functional annotations of urinary proteins based on the degree of analysis depth may be helpful in providing insight into the analysis approach difference in protein composition, reflecting pathophysiological states and determining suitable separation methods for some diseases. To analyze the protein identification data from the three separation strategies, 6085 proteins were divided into three groups as follows: Group 1D, proteins identified in 1D analysis (799 proteins); Group 2D, proteins identified in 2D analysis, excluding those identified in the 1D analysis (2362 proteins); and Group 3D, proteins identified in 3D analysis, excluding those identified in both the 1D and 2D analyses (2924 proteins). IPA analysis was performed to provide insight into the functions of the three groups (Fig. 4 and Supplemental Table 5). Extracellular proteins and plasma membrane proteins were enriched in Group 1D (56%), as previously reported[12]. The most significant pathways in Group 1D were functionally similar to plasma components, such as inflammatory responses, coagulation and glucose metabolism. Acute phase response signaling, which is one of the top pathways for Group 1D, is a rapid inflammatory response that provides protection against some infections by nonspecific defenses. It consists of an increase in inflammatory factors (such as IL-1) and a change in the levels of several plasma proteins (such as ALB and APOA1/2). For example, alpha-1-acid glycoprotein 1 (ORM1), an extracellular protein, is involved in the acute phase response. Overexpression of ORM1 in urine was associated with acute pediatric appendicitis[35]. As Group 2D demonstrated considerable enrichment of intracellular proteins (58%), most of the pathways were involved in cellular signaling such as EIF2 Signaling and Regulation of eIF4 and p70S6K signaling. Proteins in Group 3D were also over-represented in the cytoplasm and nucleus (63%). Most of the canonical pathways in Group 3D were closely related to interleukin signaling.
Figure 4

Cellular component and canonical pathway analyses of three separation groups. (A) Cellular component analysis of the three groups. (B) The top 10 canonical pathways from the three groups. The y-axis denotes the negative log of the p value.

Cellular component and canonical pathway analyses of three separation groups. (A) Cellular component analysis of the three groups. (B) The top 10 canonical pathways from the three groups. The y-axis denotes the negative log of the p value. Considering the above results, we assumed that proteins in the three groups were functionally different. If the purpose of research was to study basic physiological activities, such as cell movement and proliferation, maybe the urinary proteome can be analyzed without further separation in most cases. If aim at intracellular activities and functions of organs, maybe the in-depth analysis is necessary.

Landscape of proteins detected in urine

Previous studies reported that urine might reflect kidney function and identified some potential biomarkers of kidney disease[36]. According to previous analyses, the Human Urinary Proteome Database contains proteins localized in the glomeruli of nephron segments (Table 2) and molecules to detect injures to specific tubules of nephron segments (Table 3). Extracellular macromolecular laminin, type IV collagen α3α4α5, heparan sulfate proteoglycan agrin, and nidogen were the main components of the glomerular basement membrane (GBM)[37] and could all be identified by the one-dimensional method without fractionation. Nephrin and podocin are both specifically expressed in the slit diaphragm, which is pivotal in maintaining the selective permeability of podocytes in the glomerular filtration barrier[38]. The cytoplasmic protein CD2-associated protein (CD2AP) localizes to the podocyte slit diaphragm where it has been shown to bind to nephrin and podocin[39]. The above three podocyte-related proteins could be identified in Group 2D. The fatty acid-binding proteins (FABPs) in Group 2D are a class of small intracellular proteins that bind long chain fatty acids. Liver-type FABP is mainly present in the cells of the proximal tubules, while heart-type FABP is predominantly localized in the distal tubules[40]. The above results showed that both glomerulus and tubules-related proteins could be found in the urine, which indicated that the urine proteome might reflect changes of kidney function.
Table 2

Urinary candidate biomarkers of glomerular injury.

Protein NameUniprot IDProtein in GroupNephron segment[25, 37, 38, 55, 56] LocationMolecular FunctionBiomarker ApplicationReference
PodocinQ9NP852DPodocyte & slit diaphragmPlasma MembraneotherIgA nephropathy, membranous nephropathy 57, 58
Alpha-actinin-4O437071DPodocyteCytoplasmotherDiabetic nephropathy, focal segmental glomerulosclerosis 59, 60
NeprilysinP084731DPodocytePlasma MembranepeptidaseGlomerulonephritis 61
Myosin-9P355791DPodocyte & mesangial cellsCytoplasmenzymeGlomerulopathy 62
AgrinO004681DGlomerular basement membranePlasma MembraneotherDiabetic nephropathy, transplant glomerulopathy 63, 64
Collagen alpha-3(VI) chainP121111DGlomerular basement membraneExtracellular SpaceotherAlport syndrome, diabetic nephropathy 65, 66
NidogenP14543, Q141121DGlomerular basement membraneExtracellular SpaceotherMembranous nephropathy 67
LamininMultiple Ma1DGlomerular basement membraneExtracellular SpaceotherDiabetic nephropathy 68
NephrinO605002DPodocytePlasma MembraneotherDiabetic nephropathy 69
CD2-associated proteinQ9Y5K62DPodocyteCytoplasmotherFocal segmental glomerulosclerosis 70
PodocalyxinO005921DPodocyte & parietal epithelial cellsPlasma MembranekinaseDiabetic nephropathy 71
Vascular endothelial growth factorP15692, P49767, P497653DPodocyteExtracellular Spacegrowth factorDiabetic nephropathy 72
Proliferating cell nuclear antigenP120043DParietal epithelial cells & podocyteNucleusenzymeSchistosomal nephropathy 73
Secretory phospholipase A2 receptorQ130182DGlomerulusPlasma Membranetransmembrane receptorIdiopathic membranous nephropathy 41
Complement C3P010241DGlomerular basement membrane, mesangium, capillary loopsExtracellular SpacepeptidaseLupus nephritis 74
Apolipoprotein EP026491DMesangial cellsExtracellular SpacetransporterDiabetic nephropathy 75
CD151 antigenP485092DPodocyte, glomerular basement membranePlasma MembraneotherType 1 diabetic nephropathy 76
Cofilin-1P235281DPodocyteNucleusotherHypertension-induced renal damage 77
FibronectinP027511DMesangial and subendothelial cellsExtracellular SpaceenzymeGlomerulopathy with fibronectin deposits 78
MyeloperoxidaseP051641DGlomerular capillaryCytoplasmenzymeAnti-neutrophil cytoplasmic antibody-associated glomerulonephritis 79
Table 3

Urinary candidate biomarkers of tubular injury.

Protein NameUniprot IDProtein GroupNephron segment[25, 40, 80] LocationMolecular FunctionBiomarker ApplicationRef.
Beta-2-microglobulinP617691DProximal tubulePlasma Membranetransmembrane receptorAcute renal allograft rejection, acute kidney injury, diabetic nephropathy 81, 82
GST-alphaP092101DProximal tubuleCytoplasmenzymeAcute kidney injury 83
GSTP1P092111DDistal tubuleCytoplasmenzymeAcute renal failure 81
ClusterinP109091DProximal tubule & distal tubuleCytoplasmotherRenal-cell carcinoma, acute kidney injury 84
CubilinO604941DProximal tubulePlasma Membranetransmembrane receptorType 1 diabetes 85
Liver-type fatty acid-binding protein acid-binding proteinP071482DProximal tubuleCytoplasmtransporterDiabetic nephropathy, contrast nephropathy, IgA nephropathy 40
Heart-type fatty acid-binding proteinP054132DDistal tubuleCytoplasmtransporterAcute kidney injury after cardiac surgery 86
Cystatin-CP010341DGlomerulus & proximal tubuleExtracellular SpaceotherAcute kidney injury, acute renal dysfunction 87, 88
CalbindinP059371DDistal tubule & collecting ductCytoplasmotherDistal nephron segment injuries 89
CYR61O006222DProximal tubuleExtracellular SpaceotherGlomerular disease 90
Alkaline phosphatase, tissue-nonspecific isozymeP099232DProximal tubulePlasma MembranephosphataseAcute renal failure 91
Intestinal-type alkaline phosphataseP051862DProximal tubulePlasma MembranephosphataseDiabetic nephropathy, acute renal failure 92
Alpha-N-acetylglucosaminidaseP548021DProximal tubuleCytoplasmenzymeAcute kidney injury 93
Netrin-1O956313DProximal tubuleExtracellular Spacegrowth factorAcute kidney injury, diabetic nephropathy 94
Neutrophil gelatinase-associated lipocalinP801881DProximal tubule & distal tubuleExtracellular SpacetransporterAcute kidney injury, chronic kidney disease 95
OsteopontinP104511DProximal tubule & loop of henle & distal tubuleExtracellular SpacecytokineProgressive renal injury 96
Interleukin-18Q141162DProximal tubuleExtracellular SpacecytokineAcute kidney injury 97
Retinol-binding proteinP02753, P82980, P50120, P094551DProximal tubuleExtracellular Space, CytoplasmtransporterAcute kidney injury, renal failure 98
Urinary candidate biomarkers of glomerular injury. Urinary candidate biomarkers of tubular injury. Some tissue or serum biomarkers of kidney diseases could also be detected in our urine proteome database. For example, the phospholipase A2 receptor (PLA2R), a plasma membrane glycoprotein located on normal podocytes, was a major target antigen in idiopathic membranous nephropathy[41]. PLA2R could be detected in Groups 2D and 3D. Urokinase plasminogen activator surface receptor (uPAR) is a glycosylphosphatidylinoisitol -anchored three-domain protein and is expressed in human glomerular cells. Serum concentrations of soluble uPAR are significantly elevated in most subjects with primary focal segmental glomerulosclerosis (FSGS)[42]. If these tissue or serum biomarkers could be confirmed as urinary biomarkers, the human urinary proteome database would provide a convenient way to discover noninvasive urinary candidate biomarkers. In addition to kidney diseases, previous studies also reported that some other diseases, such as acute pancreatitis[43], might possess urinary biomarkers. The human urinary proteome database provides brief information on known biomarkers for predicting various types of organ injury (Supplemental Table 6). Moreover, these proteins detected in urine were annotated by their tissue distribution based on an integrated omics approach that involves quantitative transcriptomics and tissue microarray–based immunohistochemistry in previous studies[44]. The detailed annotation data of each protein were shown in the following database. The tissue with maximum numbers of highly expressed proteins detected in urine both at protein and mRNA levels was brain (Fig. 5A, Supplemental Figure 1). Other tissues with more highly expressed proteins were mostly digestive organs such as colon and stomach. As expected, more tissue-related proteins could be detected in Group 2D and 3D than in Group 1D (Fig. 5B).
Figure 5

Tissue distribution of urinary proteins at protein level. (A) Urinary proteome distributions across 44 tissues. The numbers in the bracket denote the number of highly expressed proteins of the tissue detected in urine. (B) The distribution of tissue-related proteins and the corresponding separation strategy for top ten tissues.

Tissue distribution of urinary proteins at protein level. (A) Urinary proteome distributions across 44 tissues. The numbers in the bracket denote the number of highly expressed proteins of the tissue detected in urine. (B) The distribution of tissue-related proteins and the corresponding separation strategy for top ten tissues.

The Human Urinary Proteome Database

To provide a readily obtainable source for the human urinary proteome, the “Human Urinary Proteome Database” was constructed (Fig. 6) based on the above analyses. The database was constructed using open source technologies and is freely available at www.urimarker.com/urine. A total of 3048648 spectra, 68151 unique peptides and 6085 proteins are included, along with detailed information such as the protein name, accession number, peptide sequence, sequence coverage and unique peptide number.
Figure 6

An overview of the human urinary proteome database. (A) The protein level results include the unique peptide count, total peptide count and relative quantitation and estimated concentration. Proteins are linked to the UniProt website by clicking the accessions. (B) The peptide level results include peptide sequences and observed m/z values. (C) The database provides the experimental pI and MW distribution of all identified proteins. (D) The “MW-PI” section provides a succinct figure summarizing the theoretical MW and pI information for each protein. (E) Biomarker application of all identified proteins.

An overview of the human urinary proteome database. (A) The protein level results include the unique peptide count, total peptide count and relative quantitation and estimated concentration. Proteins are linked to the UniProt website by clicking the accessions. (B) The peptide level results include peptide sequences and observed m/z values. (C) The database provides the experimental pI and MW distribution of all identified proteins. (D) The “MW-PI” section provides a succinct figure summarizing the theoretical MW and pI information for each protein. (E) Biomarker application of all identified proteins. Each protein is featured with annotated data, including relative quantitative information, estimated concentrations, theoretical and experimental MW and pI. Remarkably, some high-abundance proteins were observed spanning multiple fractions in both the GELFrEE and LP-IEF separations. It is generally accepted that mass/pI deviation may occur due to the presence of fragments, protein polymers, isoforms, protein degradation, post-translational modifications and low focusing quality in the basic region of the immobilized pH-gradient strips, as well as due to the pI prediction algorithm used[45-48]. Moreover, a novel section labeled ‘MW-PI image’ provides a succinct figure indicating the significant MW and pI information for all of the identified urinary proteins, which might be helpful for generating a brief scan of proteins in a pI and MW range of interest. For biomarker studies, the “Biomarker” section also yields potential biomarkers for applications in diagnosis, disease progression and prognosis. The Human Urinary Proteome Database serves as a reference repository for urinary proteins, as it offers the largest number of such proteins to date. All of the data retrieved from three separations not only detail the normal human urinary proteome but also categorize all proteins by different separation methods. Moreover, the database can be used for targeted proteomics that rely on the proper selection of peptides and transitions to guide the selection of proteotypic peptides for candidate proteins[49].

Materials and Methods

Ethics statement

Prior to study enrollment, all of the healthy volunteers were given a verbal explanation of the study and each participant signed an informed consent document. The consent procedure and the research protocol were approved by the Medical Ethics Committee of Peking Union Medical College. All methods in this study were performed in accordance with the guidelines and regulations.

Experimental design and statistical rationale

Twenty-four healthy volunteers (38 ± 11 years old), including twelve males and twelve females, were enrolled. Exclusion criteria included the following conditions: menstrual bleeding, any prescription drug use and acute or chronic medical illness. The age, sex and smoking habits of the healthy subjects were recorded (Supplemental File 3). After random urine collection, all of the samples were immediately centrifuged for 30 min at 3,500 g. After precipitate removal, urinary proteins were extracted by acetone precipitation. Lysis buffer (7 M urea, 2 M thiourea, 25 mM dithiothreitol and 50 mM Tris) was used to re-dissolve urinary proteins. The twenty-four urinary protein samples were pooled with equal amounts of protein into one sample for 1D, 2D and 3D analyses (Supplemental File 4).

GELFrEE and LP-IEF fractionation

For GELFrEE separation, urine samples were prepared using a protocol by Tran et al.[45]. Briefly, the pooled sample was fractionated in parallel using an eight-channel multiplexed GELFrEE 8100 Fractionation system (Protein Discovery, Knoxville, TN, USA). Application of 50 V for approximately 75 min and then 100 V for 105 min resulted in twelve GELFrEE fractions. The volume of each fraction was concentrated to approximately 125 μL using a SpeedVac Concentrator (Thermo Fisher Scientific, Asheville, NC, USA). Next, the samples underwent SDS removal using Pierce Detergent Removal Spin Columns (Pierce, Rockford, IL, USA). For LP-IEF fractionation, urinary proteins were desalted and cleaned using Amicon Ultrafiltration devices with a 10-kDa molecular weight cutoff (Merck Millipore Inc., Billerica, MA, USA). Then, the desalted urinary proteins were focused (approximately 2.5 h at 1 W) using a ten-chamber Microrotofor LP-IEF system (Bio-Rad, Hemel Hempstead, UK). Ten IEF fractions were collected; few protein bands appeared in fractions 7–10. Then fractions 6–10 were pooled into one sample.

Protein digestion

Urinary proteins were digested with trypsin (Trypsin Gold, mass spec grade, Promega, WI, USA) using filter-aided sample preparation methods[50]. Proteins were loaded onto 10-kDa filter devices (Pall, Port Washington, NY, USA), and 8 M urea in 0.1 M Tris-HCl (pH 8.5) was added to wash the samples. The proteins were denatured by incubation with 50 mM dithiothreitol at 56 °C for 1 h and then alkylated in the dark for 45 min in 55 mM iodoacetamide. Trypsin was added (enzyme to protein ratio of 1:50), and the samples were incubated at 37 °C overnight. After digestion, the peptide mixtures were desalted on Oasis HLB cartridges (Waters, Milford, USA) and lyophilized for high-performance liquid chromatography separation.

Offline high-pH RPLC separation

In total, nineteen samples, including eighteen fractions that were separated by GELFrEE and LP-IEF and a pooled urine sample, were fractionated by offline high-pH RPLC columns (4.6 mm × 250 mm, C18, 3 μm; Waters Corp, Milford, USA). The samples were loaded onto the column in buffer A1 (10 mM NH4FA in H2O, pH = 10). The elution gradient was 5–30% buffer B1 (10 mM NH4FA in 90% acetonitrile, pH = 10; flow rate = 1 mL/min) for 60 min. The eluted peptides were collected at one fraction per minute. After lyophilization, the 60 fractions were re-suspended in 0.1% formic acid and concatenated into 20 fractions by combining fractions 1, 21, 41 and so on[51].

Online LC-MS/MS analysis

Each sample was analyzed on a reverse-phase C18 self-packed capillary LC column (75 μm × 100 mm, 3 μm). The elution gradient was 5–30% buffer B2 (0.1% formic acid, 99.9% acetonitrile; flow rate = 0.3 μL/min) for 100 min. A TripleTOF 5600 coupled with UPLC system was used to analyze the sample, and the MS data were acquired in a high-sensitivity mode using the following parameters: 30 data-dependent MS/MS scans per full scan; full scans were acquired at a resolution of 40,000 and MS/MS scans were acquired at 20,000; rolling collision energy; charge state screening (including precursors with +2 to +4 charge state); dynamic exclusion (exclusion duration 15 s); MS/MS scan range of 250–1800 m/z; and scan time of 50 ms. For 1D separation, the pooled urine sample was analyzed with three technical replicates.

Data processing

The MS/MS data were processed using Mascot software (version 2.3.02, Matrix Science, London, UK) and searched against the SwissProt database (Homo sapiens, 20,267 sequences, 2013_07 version). The search allowed two missed cleavage sites in the trypsin digestion, cysteine carbamidomethylation was set as a fixed modification and both parent and fragment ion mass tolerances were set to 0.05 Da. Mascot search results were filtered using the decoy database method in Scaffold (version 4.3.2, Proteome Software Inc., Portland, OR). Peptide identifications were accepted if they could be shown to achieve a false discovery rate (FDR) of less than 1.0% by the Scaffold Local FDR algorithm. Protein identifications were accepted if they could be shown to achieve a FDR of less than 1.0% and contained at least 2 unique identified peptides. Protein probabilities were assigned by the Protein Prophet algorithm[52]. Proteins that contained similar peptides and could not be differentiated based on MS/MS analysis alone were grouped to satisfy the principles of parsimony. Proteins sharing significant peptide evidence were grouped into clusters. Total 20 results from 1DLC, 2DLC and 3DLC (12 GELFrEE fractions and 6 LP-IEF fractions) were filtered by Scaffold with the above parameters and yielded average FDRs of 0.10%, 0.26% and 1% at the spectrum, peptide and protein levels, respectively. Then, the 20 datasets were combined together with Scaffold perSPECtives (version 2.0.4, Proteome Software Inc., Portland, OR). To rank the relative abundance of different proteins, an intensity-based absolute quantification (iBAQ) algorithm was used[53]. The protein intensities were first computed by Progenesis LC–MS (version 2.6, Nonlinear Dynamics, UK)[54] as the sum of all identified peptide intensities (maximum peak intensities of the peptide elution profile, including all peaks in the isotope cluster). The iBAQ result was obtained as the peptide intensities divided by the number of theoretically observable peptides of the protein (calculated by in silico protein digestion; all fully tryptic peptides between 6 and 30 amino acids were counted). For functional analysis, ingenuity pathway analysis (IPA) software (Ingenuity Systems, www.ingenuity.com) was used to analyze cellular components, canonical gene pathways, functions and candidate biomarkers. Supplemental files Supplemental Table 1 Supplemental Table 2 Supplemental Table 3 Supplemental Table 4 Supplemental Table 5 Supplemental Table 6
  97 in total

1.  Towards defining the urinary proteome using liquid chromatography-tandem mass spectrometry. I. Profiling an unfractionated tryptic digest.

Authors:  C S Spahr; M T Davis; M D McGinley; J H Robinson; E J Bures; J Beierle; J Mort; P L Courchesne; K Chen; R C Wahl; W Yu; R Luethy; S D Patterson
Journal:  Proteomics       Date:  2001-01       Impact factor: 3.984

2.  Towards defining the urinary proteome using liquid chromatography-tandem mass spectrometry. II. Limitations of complex mixture analyses.

Authors:  M T Davis; C S Spahr; M D McGinley; J H Robinson; E J Bures; J Beierle; J Mort; W Yu; R Luethy; S D Patterson
Journal:  Proteomics       Date:  2001-01       Impact factor: 3.984

3.  Neutrophil gelatinase-associated lipocalin (NGAL) as a biomarker for acute renal injury after cardiac surgery.

Authors:  Jaya Mishra; Catherine Dent; Ridwan Tarabishi; Mark M Mitsnefes; Qing Ma; Caitlin Kelly; Stacey M Ruff; Kamyar Zahedi; Mingyuan Shao; Judy Bean; Kiyoshi Mori; Jonathan Barasch; Prasad Devarajan
Journal:  Lancet       Date:  2005 Apr 2-8       Impact factor: 79.321

Review 4.  Mechanistic biomarkers for cytotoxic acute kidney injury.

Authors:  Vishal S Vaidya; Joseph V Bonventre
Journal:  Expert Opin Drug Metab Toxicol       Date:  2006-10       Impact factor: 4.481

5.  Podocin, a raft-associated component of the glomerular slit diaphragm, interacts with CD2AP and nephrin.

Authors:  K Schwarz; M Simons; J Reiser; M A Saleem; C Faul; W Kriz; A S Shaw; L B Holzman; P Mundel
Journal:  J Clin Invest       Date:  2001-12       Impact factor: 14.808

6.  Urinary podocalyxin positive-element occurs in the early stage of diabetic nephropathy and is correlated with a clinical diagnosis of diabetic nephropathy.

Authors:  Hui Ye; Xueli Bai; Hua Gao; Li Li; Chunxiao Wu; Xinping Sun; Chunzhi Zhang; Yajuan Shen; Jian Zhang; Zhiming Lu
Journal:  J Diabetes Complications       Date:  2013-09-24       Impact factor: 2.852

7.  Evaluation of urinary S100B protein level and lactate/creatinine ratio for early diagnosis and prognostic prediction of neonatal hypoxic-ischemic encephalopathy.

Authors:  Li Liu; Chong-Xun Zheng; Shu-Feng Peng; Hong-Yan Zhou; Zu-You Su; Li He; Ting Ai
Journal:  Neonatology       Date:  2009-07-07       Impact factor: 4.035

8.  Decreased urinary calbindin 1 levels in proteinuric rats and humans with distal nephron segment injuries.

Authors:  Tomoko Iida; Hidehiko Fujinaka; Bo Xu; Ying Zhang; Sameh Magdeldin; Masaaki Nameta; Zan Liu; Yutaka Yoshida; Eishin Yaoita; Shuichi Tomizawa; Akihiko Saito; Tadashi Yamamoto
Journal:  Clin Exp Nephrol       Date:  2013-07-18       Impact factor: 2.801

9.  Podocin is translocated to cytoplasm in puromycin aminonucleoside nephrosis rats and in poor-prognosis patients with IgA nephropathy.

Authors:  Hiromitsu Fukuda; Teruo Hidaka; Miyuki Takagi-Akiba; Koichiro Ichimura; Juan Alejandro Oliva Trejo; Yu Sasaki; Juan Wang; Tatsuo Sakai; Katsuhiko Asanuma; Yasuhiko Tomino
Journal:  Cell Tissue Res       Date:  2015-02-13       Impact factor: 5.249

10.  An extensive targeted proteomic analysis of disease-related protein biomarkers in urine from healthy donors.

Authors:  Brian M Nolen; Lidiya S Orlichenko; Adele Marrangoni; Liudmila Velikokhatnaya; Denise Prosser; William E Grizzle; Kevin Ho; Frank J Jenkins; Dana H Bovbjerg; Anna E Lokshin
Journal:  PLoS One       Date:  2013-05-28       Impact factor: 3.240

View more
  40 in total

1.  Comprehensive Analysis of Individual Variation in the Urinary Proteome Revealed Significant Gender Differences.

Authors:  Chen Shao; Mindi Zhao; Xizhao Chen; Haidan Sun; Yehong Yang; Xiaoping Xiao; Zhengguang Guo; Xiaoyan Liu; Yang Lv; Xiangmei Chen; Wei Sun; Di Wu; Youhe Gao
Journal:  Mol Cell Proteomics       Date:  2019-03-20       Impact factor: 5.911

2.  Simple Tip-Based Sample Processing Method for Urinary Proteomic Analysis.

Authors:  David J Clark; Yingwei Hu; Michael Schnaubelt; Yi Fu; Sean Ponce; Shao-Yung Chen; Yangying Zhou; Punit Shah; Hui Zhang
Journal:  Anal Chem       Date:  2019-04-08       Impact factor: 6.986

Review 3.  The Urothelium: Life in a Liquid Environment.

Authors:  Marianela G Dalghi; Nicolas Montalbetti; Marcelo D Carattino; Gerard Apodaca
Journal:  Physiol Rev       Date:  2020-03-19       Impact factor: 37.312

4.  Spatiotemporal Proteomics Reveals the Molecular Consequences of Hormone Treatment in a Mouse Model of Lower Urinary Tract Dysfunction.

Authors:  Samuel Thomas; Ling Hao; Kellen DeLaney; Dalton McLean; Laura Steinke; Paul C Marker; Chad M Vezina; Lingjun Li; William A Ricke
Journal:  J Proteome Res       Date:  2020-03-16       Impact factor: 4.466

5.  Proteomics Profiling of the Urine of Patients with Hyperthyroidism after Anti-Thyroid Treatment.

Authors:  Hicham Benabdelkamel; Afshan Masood; Aishah A Ekhzaimy; Assim A Alfadda
Journal:  Molecules       Date:  2021-04-01       Impact factor: 4.411

6.  Proteomic Analysis of Urine from California Sea Lions ( Zalophus californianus): A Resource for Urinary Biomarker Discovery.

Authors:  Benjamin A Neely; Katherine C Prager; Alison M Bland; Christine Fontaine; Frances M Gulland; Michael G Janech
Journal:  J Proteome Res       Date:  2018-08-28       Impact factor: 4.466

7.  Quantitative proteomic analysis of a genetically induced prostate inflammation mouse model via custom 4-plex DiLeu isobaric labeling.

Authors:  Ling Hao; Samuel Thomas; Tyler Greer; Chad M Vezina; Sagar Bajpai; Arya Ashok; Angelo M De Marzo; Charles J Bieberich; Lingjun Li; William A Ricke
Journal:  Am J Physiol Renal Physiol       Date:  2019-04-17

8.  HBFP: a new repository for human body fluid proteome.

Authors:  Dan Shao; Lan Huang; Yan Wang; Xueteng Cui; Yufei Li; Yao Wang; Qin Ma; Wei Du; Juan Cui
Journal:  Database (Oxford)       Date:  2021-10-13       Impact factor: 3.451

9.  An in-depth Comparison of the Pediatric and Adult Urinary N-glycomes.

Authors:  Haiying Li; Viral Patel; Shannon E DiMartino; John W Froehlich; Richard S Lee
Journal:  Mol Cell Proteomics       Date:  2020-07-31       Impact factor: 5.911

10.  Lack of APOL1 in proximal tubules of normal human kidneys and proteinuric APOL1 transgenic mouse kidneys.

Authors:  Natalya A Blessing; Zhenzhen Wu; Sethu M Madhavan; Jonathan W Choy; Michelle Chen; Myung K Shin; Maarten Hoek; John R Sedor; John F O'Toole; Leslie A Bruggeman
Journal:  PLoS One       Date:  2021-06-17       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.