Literature DB >> 20728591

Yeast proteomics and protein microarrays.

Abstract

Our understanding of biological processes as well as pan class="Species">human diseases has improved greatly thanks to studies on model organisms such as n>n class="Species">yeast. The power of scientific approaches with yeast lies in its relatively simple genome, its facile classical and molecular genetics, as well as the evolutionary conservation of many basic biological mechanisms. However, even in this simple model organism, systems biology studies, especially proteomic studies had been an intimidating task. During the past decade, powerful high-throughput technologies in proteomic research have been developed for yeast including protein microarray technology. The protein microarray technology allows the interrogation of protein-protein, protein-DNA, protein-small molecule interaction networks as well as post-translational modification networks in a large-scale, high-throughput manner. With this technology, many groundbreaking findings have been established in studies with the budding yeast Saccharomyces cerevisiae, most of which could have been unachievable with traditional approaches. Discovery of these networks has profound impact on explicating biological processes with a proteomic point of view, which may lead to a better understanding of normal biological phenomena as well as various human diseases.

Entities: Chemical Disease Gene Species

Mesh：

Substances：
Fungal Proteins

Year: 2010 PMID： 20728591 PMCID： PMC2949546 DOI： 10.1016/j.jprot.2010.08.003

Source DB: PubMed Journal: J Proteomics ISSN： 1874-3919 Impact factor: 4.044

Introduction

Until relatively recently, investigation of the full proteome has been a daunting task. One of the main reasons was the incomplete definition of the proteome due to the lack of comprehensive genomic information, as well as technical limitations for large scale profiling of proteins. During the past few decades, improved and novel technologies have emerged as powerful tools for proteomic studies including shotgun proteomics by mass spectrometry technology [1] and protein microarray technology [2]. Mass spectrometry technology, combined with various sample preparation methods, has been used extensively in biological research such as proteome profiling [3], protein–protein interaction mapping [4], [5] and identification of post-translational modification [6], [7], [8]. The greatest advantage of this technology lies in its capability of high throughput protein identification and quantification. Nevertheless, mass spectrometry still has its limitations, such as the undersampling issue (insufficient coverage of the proteome) [9], [10] as well as bias against low abundant proteins [11]. For the purpose of this review mass spectrometry technology will not be covered here in detail. Protein microarray technology, on the other hand, does not have the above limitations for mass spectrometry owing to its particular platform. Protein microarray technology [12], [13] was developed upon the completion of the genome sequence of multiple organisms, including the budding n class="Species">yeast n>n class="Species">Saccharomyces cerevisiae [14] and humans (Homo sapiens) [15]. Determination of genomic sequences leads to the annotation of known and predicted open reading frames (ORFs) in these genomes, which makes it possible to express the full or partial proteome with large-scale cloning and gene expression methods [2]. Protein microarray typically contains hundreds to thousands of proteins that are arrayed in an addressable format. Protein microarrays are generally one of two types: analytical/diagnostic microarrays and functional microarrays [16], [17]. One form of analytical/diagnostic microarrays is the antibody microarray, in which specific antibodies against defined target proteins are arrayed on the surface of a support material (such as glass slides), and are used for the detection and quantification of their specific antigens [18], [19]. This type of microarray has great potential for clinical diagnosis and clinical research, and its advantages and disadvantages are reviewed in [17], [20], [21]. The other type of protein microarray is the functional microarray. These arrays usually contain a large number of individually expressed and purified functional, full-length proteins or peptides printed in a high-density format on support surfaces, and can represent the complete or partial proteome of a given organism [22]. This type of protein microarray has been used in studies of protein–protein, protein–DNA, and protein–small molecule interactions as well as protein modifications (Fig. 1 ) [2], [17], [23], [24], [25]. Many of these studies were done in the budding n class="Species">yeast n>n class="Species">S. cerevisiae, and will be discussed in detail below. Functional protein microarrays containing human proteins have also been widely applied in various clinical studies of cancer, autoimmune diseases and viral infections [26], [27], [28], [29], [30], [31], [32], which will not be covered here.

Fig. 1

Application of functional protein microarray and novel insights gained in protein interaction and modification. Protein–protein interactions may be detected with either fluorophore-labeled proteins or specific recognition antibodies to the probe protein. Protein–nucleic acid interactions can be visualized with fluorophore-labeled DNA/RNA. Protein–small molecule interactions may be identified with pan class="Chemical">biotinylated small molecules and fluorophore-labeled streptavidin. Posttranslational modifications of proteins are not drawn to scale. P, phosphorylation; U, pan class="Gene">ubiquitination; Ac, acetylation. In this article we will review functional protein microarray technologies in pan class="Species">yeast. We cover the various methods for manufacturing of protein microarray, and discuss their applications in studying protein–protein, protein–DNA, protein–small molecule interactions, and protein post-translational modifications (phosphorylation S-nitrosylation and n>n class="Gene">ubiquitination) (Fig. 1). We will also discuss the advantages and limitations of this technology.

Manufacture of high-density protein microarray

Since our group generated the first protein microarray covering 5800 ORFs of the budding pan class="Species">yeast n>n class="Species">S. cerevisiae [2], various methods have been developed in protein microarray production. A diagram of the manufacture of protein microarray is shown in Fig. 2 .

Fig. 2

Manufacture and application of functional protein microarrays. Functional protein microarrays can either be manufactured by printing a library of in vitro or in vivo expressed, affinity-purified proteins on to coated glass slides with a microarray printer (top), or printing the protein expression plasmids on to the slides followed by on-slide in vitro expression (bottom). The printed microarrays are then ready for various downstream applications, such as protein–protein/DNA/small molecule interaction and protein post-translational modification studies. There are different support surfaces for making protein microarray, including nanowells [33], solid surfaces (such as glass slides) [2], [34], and absorbent surfaces (such as pan class="Chemical">polyacrylamide gel pads) [35]. The pros and cons of these surfaces are reviewed by Zhu et al. in [17] and [36]. The n>n class="Chemical">polyacrylamide gel pad technology has not been applied extensively due to cumbersome slide preparation and inconvenient slide handling (e.g. hard to change buffers); similarly, the nanowell surface is less convenient to make and use [17], [36]. Currently, the most popular high-density protein microarrays are manufactured on chemically modified or coated glass microscope slides (e.g. nitrocellulose or amino-silane-coated slides) using a standard contact printer [22], [25]. This format is compatible with many commercial scanners. It is noteworthy that, even within this format, different surface chemistries have different attributes in terms of their protein binding ability, protein function/folding and background, and one has to carefully choose the optimal surface that best fits the specific need of the experiment [21]. For example, stronger protein attachment, such as gold-coated glass surface [37] or reactive surfaces with bifunctional cross-linkers [2] can retain the proteins firmly on the surface with covalent cross-linking, but may decrease their activity due to steric hindrance or disruption of proper folding; however, these surfaces will allow detection methods such as Surface Plasmon Resonance or mass spectrometry to study the dynamics of biochemical reactions on these proteins [21]. In addition, we found that amino-silane-coated glass slides provide the highest signal-to-noise ratio in kinase studies [25]. Proteins for functional microarrays are either produced individually prior to printing [2] or produced in situ by coupling in vitro transcription and translation of DNA that is printed directly on the surface of the array (e.g. the Nucleic Acid Programmable Protein Array, NAPPA) [38]. Our group has built two collections of full-length n class="Species">yeast genes cloned in expression plasmids that produce either N-terminal GST (glutathione-S-transferase) -tagged or C-terminal TAP (tandem affinity purification) -tagged fusion proteins [39], [40]. The recombinant n>n class="Species">yeast proteins are expressed in individual yeast clones in a 96-well format, and purified using corresponding affinity tags (GST or TAP). The purified proteins are then arrayed with a 48-pin ESI contact printer on nitrocellulose or amino-silane-coated glass slides in an addressable, high-density format. The detailed protein microarray manufacture protocols can be found in references [22], [25]. In addition to the yeast functional microarrays, we have also produced an Arabidopsis array [41], [42] and a coronavirus array [26]. Invitrogen has produced a human protein array which currently contains more than 9000 proteins (expressed individually using Baculovirus/sf9 expression systems). With a similar method, the group led by Heng Zhu at the Johns Hopkins University also fabricated an Escherichia coli proteome chip containing 4256 proteins encoded by the K12 strain, which is the first reported prokaryotic proteome microarray [43]. Working with Chuan He's group at the University of Chicago, the authors identified proteins with DNA damage-recognizing activity by probing this array with DNA probes containing a single base-mismatch or an abasic site, which included the known cold-shock DNA-binding protein CspE, and proteins with previous unknown functions, such as YbcN and YbaZ [43]. In this study, the authors further validated the function of YbcN and YbaZ with biochemical analyses, and identified the potential binding partner of YbaZ with the same arrays, demonstrating great potentials of functional protein microarrays. The approach of producing proteins and directly spotting them has the following advantages: first of all, in vivo expression of the proteins may help ensure their proper folding and post-translational modifications; second, the individual expression clones allow us to perform single protein quality control, if needed, over the overexpressed collections compared to the coupled transcription-translation method developed by LaBaer et al. [38]. In contrast, the NAPPA method by LaBaer et al. excels in the simplified steps with coupled in vitro, on-slide transcription and translation of the proteins, which saves considerable time and effort [38]. Various detection and data acquisition methods for protein microarray are used to meet different purposes of proteomic studies. For protein–protein, protein–DNA and protein–small molecule interaction studies, these arrays are usually probed with fluorescently labeled molecules and the signals are then acquired with a confocal laser scanner [2], [44], [45]. These interactions can also be detected with pan class="Chemical">oligonucleotide-conjugated probes followed by Rolling Circle Amplification [46]. For kinase assays, isotope-labeled n>n class="Chemical">ATP and autoradiography film exposure are often used for data acquisition [25], [33], [47].

Detection of protein–protein, protein–dna and protein–small molecule interactions using protein microarray

Because of its unique features, protein microarrays provide a powerful and convenient platform to study protein–protein, protein–nucleic acid and protein–small molecule interaction networks. Unlike DNA and RNA, whose functional information depends mainly on the sequence of individual molecules (with the exception of ribozymes), the function and proper regulation of proteins often requires their 3-dimensional conformation and post-translational modifications. Moreover, the function of proteins also depends heavily on their binding partners in the functional complexes. Therefore, to completely understand the function of a given protein, it is critical to know the information on its interaction to various molecular species in the living cells, whether they are other proteins, nucleic acids, and small molecules. Much of the potential protein–protein interaction information in pan class="Species">yeast was obtained by n>n class="Species">yeast two-hybrid approaches [48], [49], which is very labor-intensive. Currently the most comprehensive protein–protein binary interaction data in S. cerevisiae were obtained by Yu et al. [50], by performing high-throughput yeast two-hybrid screening with 3917 bait proteins and 5246 prey proteins, which yielded 1809 interactions among 1278 proteins. Ninety-four randomly chosen interactions were validated with a precision rate of 94–100% in this study. In addition, a high quality binary protein interaction map was generated in combination with other available data sets including the “Uetz-screen” by Uetz et al. [49], the “Ito-score” by Ito et al. [51], and high-throughput affinity purification/mass spectrometry data sets by Gavin et al. [4] and Krogan et al. [5]. Protein microarray has many advantages over the pan class="Species">yeast two-hybrid method in studying protein interactions. First, once made, its in vitro nature does not require n>n class="Species">yeast culture, transformation and mating, which greatly saves time and effort. Second, it uses fluorescently labeled probes instead of reporter genes, therefore the interaction signal can be readily quantified with a laser scanner in a matter of minutes, and the relative fluorescence intensity can also reflect the binding strength of the two interacting proteins. Moreover, protein microarray also allows many types of molecules to be used as probes (e.g. DNA and small molecules) besides proteins or peptides as compared with the yeast two-hybrid method which only allows detection of protein–protein interactions. Since its inception, our group has long been interested in pursuing protein–protein and protein–small molecule interaction networks [2]. We obtained the binding profile of pan class="Gene">calmodulin by probing the array with n>n class="Chemical">biotinylated calmodulin and Cy3-labeled streptavidin. Calmodulin is an important calcium binding protein which is highly conserved between species and participates in the regulation of many signaling pathways [52]. With this high throughput method, we identified 33 previously unknown potential binding partners of calmodulin, as well as a novel binding-motif, (I/L)QXXK(K/X)GB for this protein (Fig. 2 in ref. [2]). Meanwhile, to test the possibility of detecting protein–small molecule interactions, we also probed these arrays with phosphoinositide (PI), a well known secondary messenger lipid. We identified 150 novel lipid-binding proteins, which include membrane-associated proteins as well as proteins involved in glucose metabolism [2]. Our study was the first to show the capability of protein microarray technology to pursue protein–protein and protein–small molecule interactions. Since its first debut a decade ago, protein microarray technology has been used to discover protein interacting partners/networks in a wide range of applications. For detecting protein–protein interactions, Hesselberth et al. identified interacting proteins of the WW domain in pan class="Species">S. cerevisiae [53]. The WW domain is a highly conserved domain in many organisms including n>n class="Species">yeast and human. Proteins containing these domains are involved in many cellular processes including cell cycle regulation, targeted protein degradation and transcription activation [54], [55], [56]. There are 10 WW domain-containing proteins (with 13 WW domains) in S. cerevisiae, including Prp40 which is involved in mRNA splicing, the ubiquitin ligases Rsp5 and SSm4, the histone methyltransferase Set2, the peptidyl-prolyl isomerase Ess1, and proteins with still unknown functions such as YPR152C, Wwm1, Aus1, Vid30 and Alg9 [53]. Hesselberth et al. determined the binding partners of 12 of the 13 WW domains by probing them on the yeast protein arrays, and identified 587 high-confidence interactions with 207 proteins [53]. These WW-domain interaction proteins are enriched in cofactor metabolism, peroxisome localization, and ER localization with Gene Ontology annotation classifications, suggesting their potential functions. They compared their protein microarray findings with yeast two-hybrid and affinity-purification/mass spectrometry data available at that time and identified 19 unreported interacting partners. Interestingly they also found that some WW-domain containing proteins, such as Rsp5, Alg9, Wwm1 and Ssm4 also have conserved WW-domain binding motifs, which suggested possible multimerization between these proteins. Using a similar approach, Galindo et al. also identified 4 binding partners for the Aeromonas hydrophila cytotoxic enterotoxin Act in yeast, one of which was the vesicle tethering protein Vsp52, indicating that Act might mediate its function through interfering with vesicle transport of the host cells [57]. Protein microarray technology can also be used for specific protein quantification. Park et al. fabricated an antibody protein microarray (named by the authors as “reverse-phase protein microarray”), and quantified the protein N-myc Downstream Regulated Gene 2 (NDRG2) in hepatocellular carcinoma patients and normal controls and observed significant lower expression of this protein in cancerous tissues [58]. Protein microarrays have also been used to identify protein-nucleic acid interactions. Our group probed the pan class="Species">yeast protein microarray with n>n class="Chemical">Cy3-labeled single-stranded (ssDNA) or double-stranded (dsDNA) yeast genomic DNA and identified 84 ssDNA-binding, 58 dsDNA binding, and 131 ssDNA and dsDNA-binding proteins (schematic design shown in Fig. 1 as well as Fig. 1 in ref. [59]). We chose 8 proteins previously unreported for their DNA-binding activity for further validation, and 3 proteins were confirmed by Chromatin-Immunoprecipitation followed by probing genomic DNA microarrays (ChIP-Chip; [60]). Of the 3 validated proteins, Arg5,6 was found to bind both mitochondrial (15 S ribosomal DNA, COX1, COX3, and COB1) and nuclear genes (PUF4, PHO23 and THI13), and participate in both transcription elongation and RNA-processing. Using a similar approach, Hu et al. identified conserved DNA motif-binding proteins using a protein microarray containing 4191 human proteins [61]. Interestingly, besides known transcription factors, they also identified over 300 previously unknown or unexpected DNA-binding proteins, including ERK2, a multifaceted mitogen-activated protein kinase [62], which also acts as a transcription repressor in the interferon gamma signaling pathway. Another interesting study was reported by Zhu et al. on the identification and validation of binding proteins to the clamped adenine motif (CAM) of Brome mosaic virus (BMV), a model positive-strand RNA plant virus [63]. As BMV can also replicate in S. cerevisiae, the authors probed the yeast proteome array (containing ~ 5000 purified proteins) with both Cy3-labeled CAM-containing RNA and a Cy5-labeled control, and identified 12 hits that showed equal or stronger signals than BMV's own capsid protein (CP). The authors focused on 2 of the 3 strongest binders, Pus4 and App1, and found that they could also inhibit positive-strand RNA accumulation and systemic viral infection in vivo. Their further investigation demonstrated that Pus4 inhibited in vivo viral systemic spread by affecting virion formation through a direct competition with CP. This study shows a good example of the capability of functional protein microarray in the discovery of interactions of in vivo physiological importance, and in this case, components involved in antiviral response. Besides protein–protein and protein–DNA/RNA discoveries, protein microarrays were also used to identify protein–small molecule interactions. n class="Chemical">Rapamycin is a small molecule drug that can induce starvation response and inhibit cell growth through its target n>n class="Chemical">TOR (Target of Rapamycin), a highly conserved kinase regulating cell proliferation and metabolism, in yeast and humans [64]. Rapamycin analogs are promising chemotherapeutic drugs to treat cancer [65]. Jiang et al. identified small molecules that could enhance (small-molecule enhancers of rapamycin, or SMERs) or inhibit (small-molecule inhibitors of rapamycin, or SMIRs) rapamycin effects in yeast, and discovered intracellular protein targets of two SMIRs, SMIR3 and SMIR4, by probing the yeast protein microarray with biotinylated SMIRs [66]. Among the strongest protein targets identified in this study, two proteins, Tep1p and Nir1p, were shown to be components of the TOR signaling pathway, suggesting that the protein array method is capable of detecting specific protein–small molecule interactions. By probing the protein arrays with biotinylated polyanions (two proteins, two compounds, and DNA) Salamat-Miller et al. also identified a total of 893 polyanion-binding proteins in S. cerevisiae [67]. Interestingly, the polyanions and their binding proteins are found to form a network that is involved in maintaining the structure and activity of the yeast cells.

Detection of post-translational modifications using protein microarray

The proteome ultimately carries out most cellular functions, and is regulated by many different types of post-translational modifications. These modifications include phosphorylation, glycosylation, acetylation, pan class="Gene">ubiquitination, SUMOylation, and S-nitrilation [68], [69], [70]. Therefore, the status of post-translational modification of the proteome is a snapshot of the dynamic activities of the living cell. The specific platform of protein microarray makes it ideal to identify proteome-wide potential targets for protein post-translational modification enzymes. The high throughput format enables screening of potential substrates in the whole or partial proteome in a single assay, and its in vitro nature allows fine and accurate controls over the assay conditions. Of all post-translational modifications, phosphorylation is the most studied. The kinome, or collection of all kinases in the cell, is responsible for the dynamic phosphorylation of proteins and controls many different cellular processes. Levels of kinase activities are likely to be very tightly controlled; indeed Sopko et al. examined the influence of individual overexpression of more than 80% of all n class="Species">yeast genes and observed enrichment of kinases and phosphatases in the list of “toxic gene set”; a set of genes whose overexpression significantly slowed n>n class="Species">yeast growth [71]. Several groups including ours have carried out large-scale studies to identify the substrates of yeast kinases at the proteomic level. In 2000 our group screened 119 of the 122 yeast kinases with 17 different substrates (including the kinases themselves for monitoring autophosphorylation) on a prototype of protein microarray [33]. A schematic design was shown in Fig. 3 . The substrates were immobilized onto nanowell protein chips and phosphorylation events were identified by adding 33P-γ-ATP and a specific yeast kinase and exposing the chip to a phosphoimager. We discovered that more than 60% of the kinases autophosphorylated themselves, and 94% of the tested kinases had at least one substrate in vitro, with 32 of them specifically phosphorylating one or two substrates. Twenty-seven kinases were found to phosphorylate poly(Tyr-Glu), which quadrupled the number of identified tyrosine kinases (seven) reported at that time. Moreover, these tyrosine kinases preferentially contain 3 conserved lysines and one conserved methionine near the catalytic region, indicating their potential roles in substrate selection. The same method was later used to identify Hrr25p as a kinase for the zinc-finger transcription factor Crz1, which turned out to negatively regulate Crz1 activity and nuclear localization by phosphorylation in vivo [72]. Our group later expanded the study to search for the substrates of 87 different S. cerevisiae kinases in a large set of more than 4400 full-length, functional yeast proteins with a yeast protein microarray containing 4400 yeast proteins [47]. In this study we discovered about 4200 phosphorylation events affecting 1325 proteins and generated the first version of the phosphorylation network in yeast (schematic design shown in Fig. 3). There were many interesting findings in this work. We found that each kinase had a unique substrate profile and most phosphorylation substrates were recognized by 3 or fewer kinases. Even within closely related kinases such as Tpk1, Tpk2 and Tpk3, there seem to be a low overlap between their substrate profiles (only 12.3% of all their identified substrates could be recognized by more than one of the three kinases). Moreover, we also found that kinases might display distinct substrate preferences when they form complexes with different proteins. We determined the consensus motifs for 11 kinases in this study using a method developed by Jonassen et al. [73], and further investigated this in a larger scale in a separate study with a rapid peptide screening approach (schematic design shown in Fig. 3), in which we determined the consensus phosphorylation motifs for 61 kinases [74]. Lastly, by combining our profiling data with available transcription factor binding and protein interaction data sets [49], [51], [75], [76], [77], [78], [79], we discovered 8 regulatory modules in the phosphorylation network. All of the above indicated that kinases have evolved to have their unique, specific roles in carrying out molecular and cellular functions in the eukaryotic cells.

Fig. 3

Schematic diagram for protein kinase target identification, consensus phosphorylation motif identification and determination.

Schematic diagram for protein kinase target identification, consensus phosphorylation motif identification and determination. Besides protein microarrays, large-scale profiling of the protein phosphorylation networks was also approached with other methods. Instead of direct proteomic approaches, Fiedler et al. determined the epistasis of binary interactions between all pan class="Species">yeast kinases, phosphatases and key cellular regulators by building an epistatic miniarray profile (E-MAP) through the generation of double mutants [80]. In this study, by analyzing “Triplet Genetic Motifs”, the authors detected both known (such as Cak1 and its substrates Ctk1 and Smk1) and previously undiscovered (e.g. involvement of Cak1 and Fus3 in the Set2/Rpd3C(S) pathway) interactions with this method. Moreover, Korf et al. generated an antibody-based protein microarray, which can be used to monitor time-dependent quantitative changes in the phosphorylation states of individual kinases [81]. This method could be applied potentially to monitor many targets at the same time; however, like other antibody protein microarrays, this method significantly depends on the quality and specificity of the antibody pairs for each protein [21]. Alternatively, the phospho-proteome can also be profiled with quantitative mass spectrometry [82], [83]. Soufi et al. identified 5534 unique phospho-peptides associated with the osmotic response of S. cerevisiae with a quantitative high-resolution mass spectrometry of their phospho-proteome, and observed that greater than 15% of these changes occured within 5 min after the start of the treatment, suggesting that phosphorylation of a group of specific proteins is part of the early response to osmotic changes [82]. Other post-translational modifications have also been investigated with protein microarray technology. By probing the pan class="Species">yeast protein microarray with Concanavalin A or n>n class="Species">Wheat-Germ Agglutinin, both of which are lectins that bind glycans on proteins, Kung et al. identified 534 glycosylated yeast proteins including 30 mitochondrial proteins and a number of nuclear proteins; 406 of the glycosylated protein were unreported previously [84]. Of the novel identifications, they validated 45 by gel mobility shift assay. Interestingly, they also found that inhibition of glycosylation by N-linked glycosylation inhibitor tunicamycin affected the localization of some mitochondrial proteins, such as Ydr065wp and Lpe10p, which suggests that proper protein glycosylation is important for their normal localization and function. In addition to glycosylation, Tao et al. used an analogous approach to examine poly(ADP-ribosyl)ation in yeast with protein microarray [85]. Poly(ADP-ribosyl)ation of proteins is usually carried out by the enzyme poly(ADP-ribose) polymerase-1 (PARP-1), a pluripotent enzyme involved in both DNA damage sensing mechanism and regulation of gene expression [86], [87]. The authors identified 33 PARP-1 substrates in this study, including six proteins involved in ribosome biogenesis, indicating that poly(ADP-ribosyl)ation may be required for the normal biogenesis process of the ribosomes. Acetylation and deacetylation of the pan class="Chemical">lysine residues in proteins, especially histones are important events in regulating chromatin-related processes and gene expression, although not much was known for the non-chromatin-associated substrates of these histone acetyltransferases and deacetylases [88], [89]. Of the many different histone acetyltransferases, only the catalytic enzyme n>n class="Gene">Esa1p in the essential nucleosome acetyltransferase of H4 (NuA4) complex was found required for the viability of the cells, indicating this complex may have additional roles beyond gene expression regulation [90]. By incubating the NuA4 complex with yeast proteome microarrays in the presence of 14C labeled Acetyl-CoA, Lin et al. discovered a large number of novel non-histone substrates for NuA4 and confirmed 13 of them with standard in vitro activity assays [88]; and more importantly, the authors also validated these substrates with in vivo experiments [91]: the levels of acetylated substrates were found decreased dramatically in ESA1 Ts (esa1-531) mutant cells when they were grown at the nonpermissive temperature, which proved that these proteins are real in vivo targets of Esa1p. The authors then focused on Pck1p (phosphoenolpyruvate carboxykinase), a metabolic enzyme involved in gluconeogenesis, and found that the acetylation of Pck1p is essential for its enzymatic activity and the life span of yeast. This study is one of the first to examine an important acetyltransferase complex for non-histone-related regulatory functions in the cell and suggests that histone acetyltransferases and deacetylases may also have other important cellular roles besides chromatin-related processes. Moreover, in vivo validation of the acetylation targets of the NuA4 complex suggested that using enzyme complexes instead of individual proteins in protein microarray studies may help identify substrates that are more physiologically relevant. pan class="Gene">Ubiquitination is another critical post-translational modification involved in various cellular processes. It facilitates proteosome-mediated protein degradation/recycling [92] as well as activation of cell signaling molecules [93]. Gupta et al. identified the substrates of n>n class="Gene">Rsp5, a ubiquitin-protein ligase (E3) in yeast by both in vitro ubiquitination and protein interaction assays [94]. To identify substrates that could be ubiquitinated by Rsp5, the authors incubated the yeast protein microarrays with Rsp5 and FITC-labeled ubiquitin; alternatively, the arrays were probed with Alexa Fluor 647-labeled Rsp5 to identify interacting proteins. One hundred and fifty proteins were identified in the ubiquitination assay and 155 in the interaction assay, of which 64 proteins were identified in both sets of experiments. Of these identified substrates, the authors discovered more details of the known PY motif for Rsp5 binding. They found that Prolines, Serines and Alanines were enriched in the (L/P)PxY motif (at position x) which may also contribute to substrate specificity. Protein microarray technology was also used for the discovery of other protein post-translational modifications such as S-nitrosylation [95]. This type of post-translational modification plays an important role in pan class="Chemical">nitric oxide (NO) and endogenous n>n class="Chemical">S-nitrosothiol (SNO) signaling, abnormal regulation of which is associated with diseases such as neurological dysfunction and cancer [95], [96], [97]. Using a biotin switch technique that converts SNO to S-biotinylated Cys [98], which is then readily detectable with anti-biotin antibody, Foster et al. identified several hundred SNO-proteins after S-nitrosocysteine treatment of the yeast protein microarrays [95]. These proteins were enriched with proteins with active-site Cys thiols, however, neither secondary structures nor Cys thiol nucleophilicity could explain substrate specificity. The authors then found out that S-nitrosylation efficiency might depend on the stereochemistry and structure of the NO-donor as well as allosteric effectors.

Advantages and limitations of protein microarray technology

As reviewed above, protein microarray technology serves as a powerful and convenient tool in a large-scale, high-throughput manner for proteomic studies such as protein interactions and modifications. The advantages of the protein microarray technology are (also refer to review by Zhu et al. [17]): its convenient and straightforward probing using a wide variety of standard detection platforms, the ability to control and fine-tune experimental conditions, and its unbiased protein target selection (regardless of their endogenous cellular abundance) due to its particular fashion of manufacture. The latter enables the inclusion of normally underrepresented proteins such as low abundant proteins, membrane and secreted proteins, and conditionally expressed proteins. Protein microarray technology also have limitations. First, the scope of protein microarray relies heavily on the availability of genomic information of the specific organisms. Therefore, this technology is not capable of covering unknown ORFs/splicing variants, and is unavailable for organisms whose genome information is unknown. A related limitation is that for most genes usually only one splicing variant is used to represent the specific gene, therefore the splicing diversity of the represented proteome is limited [12]. Second, as proteins are purified from the cells, they may contain mixed post-translational modifications or even co-purified interacting proteins, which may interfere with the protein binding, kinase assays and post-translational modification detection. In vitro expression methods may avoid contamination from other tightly interacting proteins; however, proteins produced this way may not be properly folded or modified. Third, as protein microarray-based studies are in vitro studies, off-target interactions and modifications that do not normally happen in cell may also occur. For this reason protein microarray findings should be validated with more direct methods such as immunoprecipitation, immunoblotting and in vitro and in vivo activity assays. Furthermore, as the cost for manufacturing protein microarrays is still high, it can be expensive to study large proteomes such as the pan class="Species">human proteome. Therefore in studies on these proteomes they are often partially represented. Lastly, the specific platform of protein microarray usually allows only one or two probes/enzymes per each assay on a single proteome array, which, with the cost of manufacturing the protein microarrays, limits researchers to investigate only one or a few probes/enzymes at a time. This limitation will be significantly alleviated, however, with the drop of manufacturing cost as well as improvements in multiplexing technologies.

Closing remarks

Since its debut a decade ago, protein microarray technology has evolved into a powerful tool in proteomic studies. Application of this technology in the model organism pan class="Species">S. cerevisiae has led to numerous significant scientific findings at the proteomic level, including protein–protein, protein–DNA and protein–small molecule interactions as well as post-translational modifications. Since many of these mechanisms are conserved through evolution, some of these findings are expected to be relevant to those occurring in other organisms, such as plant and n>n class="Species">human [99]. In fact, multiple studies mentioned above have already confirmed the counterparts of their findings in humans [57], [85], [88]. The proteome is only a part of the complex network in a living organism. It coordinates with the other systems including the genome, the transcriptome, and the metabolome, to carry out complex cellular functions and intercellular communication [100]. With the emergence of new high throughput technologies such as next generation high-throughput sequencing [101] and improved mass spectrometry technologies [1], it becomes possible to simultaneously generate integrated networks at the systems level from the same sample sets in a time dependent manner, if desired. Time-resolved global networks generated from quantitative data sets of the proteome, the genome, the methylome, the transcriptome and the metabolome, with environmental facn class="Chemical">tors such as the microbiome (Fig. 4 ), will enable a comprehensive view of miscellaneous biological processes, as well as their roles in natural courses such as development and aging, and in pathogenesis of n>n class="Species">human diseases such as cancer.

Fig. 4

Comprehensive understanding of biological processes through integrated information of biological systems and environmental factors.

Comprehensive understanding of biological processes through integrated information of biological systems and environmental facpan class="Chemical">tors.

100 in total

1. Profiling of cancer cells using protein microarrays: discovery of novel radiation-regulated proteins.

Authors: A Sreekumar; M K Nyati; S Varambally; T R Barrette; D Ghosh; T S Lawrence; A M Chinnaiyan
Journal: Cancer Res Date: 2001-10-15 Impact factor: 12.701

2. Printing proteins as microarrays for high-throughput function determination.

Authors: G MacBeath; S L Schreiber
Journal: Science Date: 2000-09-08 Impact factor: 47.728

3. Immunoassays with rolling circle DNA amplification: a versatile platform for ultrasensitive antigen detection.

Authors: B Schweitzer; S Wiltshire; J Lambert; S O'Malley; K Kukanskis; Z Zhu; S F Kingsmore; P M Lizardi; D C Ward
Journal: Proc Natl Acad Sci U S A Date: 2000-08-29 Impact factor: 11.205

4. Regulation of gene expression by a metabolic enzyme.

Authors: David A Hall; Heng Zhu; Xiaowei Zhu; Thomas Royce; Mark Gerstein; Michael Snyder
Journal: Science Date: 2004-10-15 Impact factor: 47.728

5. Utlization of human expert techniques for detection of low-abundant peaks in high-resolution mass spectra.

Authors: Grzegorz M Boratyn; Michael L Merchant; Jon B Klein
Journal: Conf Proc IEEE Eng Med Biol Soc Date: 2006

6. MAPK target networks in Arabidopsis thaliana revealed using functional protein microarrays.

Authors: Sorina C Popescu; George V Popescu; Shawn Bachan; Zimei Zhang; Mark Gerstein; Michael Snyder; Savithramma P Dinesh-Kumar
Journal: Genes Dev Date: 2008-12-18 Impact factor: 11.361

7. ESA1 is a histone acetyltransferase that is essential for growth in yeast.

Authors: E R Smith; A Eisen; W Gu; M Sattah; A Pannuti; J Zhou; R G Cook; J C Lucchesi; C D Allis
Journal: Proc Natl Acad Sci U S A Date: 1998-03-31 Impact factor: 11.205

8. Negative regulation of calcineurin signaling by Hrr25p, a yeast homolog of casein kinase I.

Authors: Kimberly A Kafadar; Heng Zhu; Michael Snyder; Martha S Cyert
Journal: Genes Dev Date: 2003-11-01 Impact factor: 11.361

9. Global analysis of the yeast osmotic stress response by quantitative proteomics.

Authors: Boumediene Soufi; Christian D Kelstrup; Gabriele Stoehr; Florian Fröhlich; Tobias C Walther; Jesper V Olsen
Journal: Mol Biosyst Date: 2009-09-10

10. Next-generation high-density self-assembling functional protein arrays.

Authors: Niroshan Ramachandran; Jacob V Raphael; Eugenie Hainsworth; Gokhan Demirkan; Manuel G Fuentes; Andreas Rolfs; Yanhui Hu; Joshua LaBaer
Journal: Nat Methods Date: 2008-05-11 Impact factor: 28.547

9 in total

1. Utilizing Yeast Surface Human Proteome Display Libraries to Identify Small Molecule-Protein Interactions.

Authors: Scott Bidlingmaier; Bin Liu
Journal: Methods Mol Biol Date: 2015

Review 2. Identification of Posttranslational Modification-Dependent Protein Interactions Using Yeast Surface Displayed Human Proteome Libraries.

Authors: Scott Bidlingmaier; Bin Liu
Journal: Methods Mol Biol Date: 2015

Review 3. A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.

Authors: Alexey I Nesvizhskii
Journal: J Proteomics Date: 2010-09-08 Impact factor: 4.044

4. Proteome-wide Identification of Novel Ceramide-binding Proteins by Yeast Surface cDNA Display and Deep Sequencing.

Authors: Scott Bidlingmaier; Kevin Ha; Nam-Kyung Lee; Yang Su; Bin Liu
Journal: Mol Cell Proteomics Date: 2016-01-04 Impact factor: 5.911

5. New host factors important for respiratory syncytial virus (RSV) replication revealed by a novel microfluidics screen for interactors of matrix (M) protein.

Authors: Sarit Kipper; Samar Hamad; Leon Caly; Dorit Avrahami; Eran Bacharach; David A Jans; Doron Gerber; Monika Bajorek
Journal: Mol Cell Proteomics Date: 2015-01-02 Impact factor: 5.911

6. Direct iterative protein profiling (DIPP) - an innovative method for large-scale protein detection applied to budding yeast mitosis.

Authors: Régis Lavigne; Emmanuelle Becker; Yuchen Liu; Bertrand Evrard; Aurélie Lardenois; Michael Primig; Charles Pineau
Journal: Mol Cell Proteomics Date: 2011-10-13 Impact factor: 5.911