Literature DB >> 21190000

Genomics and proteomics in stem cell research: the road ahead.

Sung-Min Ahn1, Richard Simpson, Bonghee Lee.   

Abstract

Stem cell research has been widely studied over the last few years and has attracted increasing attention from researchers in all fields of medicine due to its potential to treat many previously incurable diseases by replacing damaged cells or tissues. As illustrated by hematopoietic stem research, understanding stem cell differentiation at molecular levels is essential for both basic research and for clinical applications of stem cells. Although multiple integrative analyses, such as genomics, epigenomics, transcriptomics and proteomics, are required to understand stem cell biology, proteomics has a unique position in stem cell research. For example, several major breakthroughs in HSC research were due to the identification of proteins such as colony-stimulating factors (CSFs) and cell-surface CD molecules. In 2007, the Human Proteome Organization (HUPO) and the International Society for Stem Cell Research (ISSCR) launched the joint Proteome Biology of Stem Cells Initiative. A systematic proteomics approach to understanding stem cell differentiation will shed new light on stem cell biology and accelerate clinical applications of stem cells.

Entities:  

Keywords:  Stem cells; epigenomics; genomics; proteomics; transcriptomics

Year:  2010        PMID: 21190000      PMCID: PMC2998774          DOI: 10.5115/acb.2010.43.1.1

Source DB:  PubMed          Journal:  Anat Cell Biol        ISSN: 2093-3665


Stem cells: definition and clinical relevance

Currently, stem cells are widely and intensively studied in all fields of medicine and science due to their potential to treat many incurable diseases. At the same time, there have been a wide range of social and ethical concerns on using human embryonic stem cells (ES cells) for research or therapy purposes because their derivation requires the destruction of embryos. Stem cells are functionally defined as undifferentiated, primitive cells that retain the capability of indefinitely reproducing themselves ("self-renewal") and also have the ability to generate multiple types of cells upon proper signals from internal and external cues ("pluripotency" or "multipotency"). Stem cells are largely categorized into two groups, ES cells and adult stem cells. Although these two types of stem cells have different sets of advantages and disadvantages, they both may be valuable sources for the future cell therapy. ES cells have many advantages as a cell source for regenerative therapy. Given the appropriate environmental conditions, they can be coaxed into forming most cell types in the body and can provide unlimited amounts of cells for cell therapy. However, their applications for human treatment require special safety precautions. One concern is the potential risk of genomic abnormality (Draper et al., 2004; Maitra et al., 2005) and tumor formation (Vogel 2005). Other concerns associated with ES cell-mediated therapy are immune rejection of transplanted tissue, difficulty in achieving homogeneous cell populations after differentiation, and ethical controversy resulting from the destruction of embryos. Many scientists believe that there still exists a great deal of social and scientific uncertainty surrounding ES cell-mediated cell therapy, which requires more extensive and thorough study. In contrast, adult stem cells have been used for cell therapy for quite a long time. For example, bone marrow transplant has been used to treat leukemia more than 10 years. Thus far, more than 70 different diseases and injuries have been successfully treated with adult stem cells. These include a number of blood-related abnormalities, immunological dysfunctions, cancers, stroke, brain trauma, juvenile diabetes, Parkinson's disease, blindness, spinal cord injuries and so on. More than 1200 clinical trials with adult stem cells have been approved by the U.S. Food and Drug Administration (FDA). There are several reasons that account for the wide use of adult stem cells in clinical applications. These include very little chance of tumor formation, no ethical problems, the possibility of autologous transplantation, which causes no immune rejection, and easy differentiation into a certain specific lineage of cells. However, the prevailing belief that adult stem cells do not form tumors needs thorough reevaluation. There has been a recent report that adipose tissue-derived human mesenchymal stromal cell (MSC) populations can immortalize and transform spontaneously. This MSC-TMC (Transformed Mesenchymal cells) transition happened after in vitro culture and expansion (Rubio et al., 2005). Furthermore, several other reports argue that some cancer cells are derived from adult stem cells in the body (Marx 2003; Pardal et al., 2003; Clarke & Fuller, 2006). Currently, safety issues and techniques for in vitro expansion of the cells may be the two most important topics to develop additional successful cell therapies using adult stem cells. Future studies should be focused on overcoming three major obstacles that hinder successful stem cell therapy. First, obtaining autologous cell sources is required to avoid immune rejection. This issue is especially serious for ES cell-mediated cell therapy. One way to get around the immune problem is to generate pluripotent, ESC-like cells from the patient's own cells either by cellular reprogramming or by Somatic Cell Nuclear Transfer (SCNT). Recently, there have been great achievements in generating induced pluripotent stem cells (iPS cells) (Takahashi et al., 2007; Yu et al., 2007), although more work is needed to bring iPS cells to patient treatment. At the same time, the recent success of SCNT in rhesus macaque primates (Byrne et al., 2007) raises a hope that it may soon also be possible in humans. Establishing a stem cell therapeutic bank (Taylor et al., 2005) and autologous cell therapy using adult stem cells would be other options to solve the immune rejection problem. The second issue is preventing tumor formation. This issue is especially serious for clinical use of ES cells. Since tumor formation is thought to be caused by residual undifferentiated ES cells, the development of methods that remove these remnant ES cells is of great importance. The third issue is establishing efficient protocols to differentiate ES cells into the cell types of interest. This will increase the efficacy of cell therapies, reduce side effects caused by the presence of unwanted cell types, and minimize tumor formation resulting from remnant undifferentiated ES cells.

Multiple aspects of understanding stem cell differentiation

Each cell type in the human body, including stem cells, has a unique information architecture maintained by cell-specific transcriptional regulatory states that lead to differential expression of genes (Davidson 2006). According to the International Human Genome Consortium, the number of genes in the human genome is estimated to be between 20,000 and 25,000. The 274 different kinds of cells in the human body are defined by combinatorial expression of this limited number of genes (Wenick & Hobert, 2004). The transcriptional regulatory circuitry in cells is maintained by the dynamic interplay of multiple regulatory mechanisms, including epigenetic, transcriptional, posttranscriptional, and posttranslational regulation of gene expression (Cheng et al., 2005). Therefore, understanding stem cell differentiation requires an overall understanding of the elaborate web of regulations at the levels of the genome, epigenome, transcriptome, microRNAome, and proteome. Transcription factors (TFs) are good examples of how multiple regulatory mechanisms act in concert. TFs are core elements of the transcriptional regulatory circuitry (Boyer et al., 2005). In theory, it is possible to turn a brain cell into a liver cell by transferring all TFs in a liver cell to a brain cell while inactivating all brain-specific TFs (Reik 2007). A recent study that shows induction of pluripotent stem cells from fibroblasts by defined TFs also supports this idea (Takahashi & Yamanaka, 2006). Given their importance in cell specification and function, it is not surprising that multiple regulatory mechanisms are involved in the regulation of TFs. For example, when embryonic stem cells are induced to differentiate, Oct-3/4, a main regulator of human embryonic stem cell pluripotency and self-renewal (Boyer et al., 2005), is epigenetically repressed through histone modifications, followed by DNA methylation at the promoter region (Feldman et al., 2006). Once transcribed, mRNAs of TFs may be posttranscriptionally regulated by microRNAs (Bartel 2004). According to a recent computational analysis, many TFs are target hub genes of microRNAs, and TF-microRNA pairs may coregulate large sets of common targets (Shalgi et al., 2007). Posttranslational modifications (PTMs) provide another layer of regulation. It is well known that TFs, such as STATs and Smad 2/3, require PTMs for their nuclear localization and regulatory activities (O'Shea et al., 2002; Elliott & Blobe, 2005). In this section, we aim to provide a concise review of multiple aspects of understanding and studying stem cell-differentiation: genomics, epigenomics, transcriptomics, microRNAomics, and proteomics. In the next section, we will specifically discuss the potential roles of proteomics in the field of stem cell biology.

Regulatory genome

The human genome encodes 20,000~25,000 protein-coding genes that account for only 1.5% of the nearly three billion base pairs of the genome (Stein 2004). In addition to protein-coding regions, a substantial portion of the non-protein-coding regions in the genome, which are poorly defined, is transcribed (The EPC 2004). Non-coding RNAs (i.e. transcripts of non-protein-coding genes) include microRNAs, small regulatory RNAs, and tens of thousands of longer transcripts with unknown functions (Mattick & Makunin, 2006). In addition to coding sequences, the genome contains regulatory architecture called cis-regulatory modules (CRMs) that consist of a set of TF-binding sites (TFBSs) located in the vicinity of the gene being regulated (Blanchette et al., 2006). As previously described, differentiation requires the institution of new transcriptional regulatory states. The CRMs, which are "information processing devices hardwired into the genome," read the given transcriptional regulatory states, process that information, and instruct the basal transcription apparatus to turn the gene on or off (Davidson 2006). Therefore, the human genome is a self-executing set of instructions that not only encodes proteins and non-coding RNAs, but also directs where, when, and how much each of these products will be expressed, ultimately producing the complex body plan of the adult organism (Howard & Davidson, 2004). One of the most widely and intensively studied examples of the regulatory genome in stem cells is the regulatory network of Oct4, Sox2, Nanog and Klf4 in embryonic stem cells (Nichols et al., 1998; Avilion et al., 2003; Chambers et al., 2007). Activation of these genes is key to maintaining pluripotency in embryonic stem cells. Another group of genes is also reported as related to neural stem cells and their progenitor cells (Wang et al., 2008). These examples of the regulatory genome have been a focus of stem cell research because of their importance in maintaining pluripotency. Scientists are still trying to identify novel regulatory genome components that are involved in maintaining self renewal of stem cells or lead to differentiation.

Epigenetic aspects of stem cell differentiation

Genetic information is stored in the DNA sequence using a four digit code (i.e., A, G, T, C), and the genome is the complete set of information in an organism's DNA (Alberts 2002). Epigenetic information is heritable information that is not encoded in the DNA sequence but still affects gene expression (Russo et al., 1996). In eukaryotes, the DNA molecule is wrapped around a core of eight histones (two H2A, H2B, H3, and H4 histones) to form nucleosomes, which, in turn, are packaged into chromatin (Luger et al., 1997). Epigenetic information is stored by covalent modifications of the DNA or its packaging histones. Therefore, an epigenome can be defined as the sum of heritable covalent modifications of the chromatin, 'the dynamic template of the genetic information'. The human genome is organized into accessible (euchromatic) or inaccessible (heterochromatic) subdomains by epigenetic changes (Arney & Fisher, 2004) (Fig. 1).
Fig. 1

One genome, many epigenomes. Every somatic cell in the human body has almost the same genetic information. Epigenetic changes define accessibility to the genetic information, thus creating cell-type-specific, dynamic genetic templates. In general, stems cells have more open, accessible chromatin structures than differentiated cells.

"Development is, by definition, epigenetic" (Reik 2007). Epigenetic mechanisms provide both stable and flexible gene regulation required for development. Histone modifications confer short-term, flexible regulation while DNA methylation confers long-term, stable silencing (Reik 2007). Since one genome can generate many epigenomes, the dynamic genetic templates created by the combination of a genome and a variety of epigenomes provide the basis of cell-type-specific, unique gene expression profiles. DNA methylation: DNA methylation in mammalian cells refers to covalent modification of DNA by methylation of the cytosine, which occurs predominantly in CpG dinucleotides. DNA methylation has been implicated in a wide range of physiological and pathological processes such as genomic imprinting, X chromosome inactivation, tissue-specific gene expression, cellular differentiation, carcinogenesis and aging (Jones & Takai, 2001; Bird 2002). Recent findings in animal cloning indicate that epigenetic reprogramming is important for successful cloning (Kang et al., 2001), which in turn support the idea that DNA methylation provides a specific cellular memory function in development (Bird 2002). DNA methylation can affect gene transcription through two main mechanisms. Firstly, site-specific DNA methylation may interfere with the binding to DNA of transcription factors (TFs) that activate gene transcription (Watt & Molloy, 1988). Secondly, DNA methylation attracts methyl-CpG-binding domain (MBD) proteins that in turn recruit corepressor complexes involved in transcriptional silencing. Comparative sequence analysis using the sequence of MeCP2, the prototypical MBD protein, led to the identification of additional family members (MBD1, MBD2, MBD3, and MBD4) (Wade 2001). Interestingly, each MBD protein seems to have exclusive binding sites, which suggests that MBD proteins are not functionally redundant (Klose et al., 2005). As previously mentioned, DNA methylation is involved in the long term silencing of gene expression, which plays an important role in the regulation of pluripotency-associated genes (Reik 2007). For example, transcriptional repression of Oct-3/4 is first followed by histone H3 methylation on Lys 9, which leads to local heterochromatization and DNA methylation at the promoter region (Feldman et al., 2006). The importance of this mechanism is manifested in cancers. According to one recent study, Oct-3/4 is expressed in adult human stem cells, tumor cells and cell lines, but not in differentiated cells (Tai et al., 2005). This evidence is consistent with the cancer stem cell hypothesis, which argues that tumors arise from small populations of cancer stem cells that originate from the transformation of normal stem cells (Pardal et al., 2003). According to the epigenetic progenitor origin of cancer theory, a modified version of the cancer stem cell hypothesis, the first step to carcinogenesis is epigenetic disruption of stem cells or progenitor cells, which is demonstrated as changes in DNA methylation (Feinberg et al., 2006). Thus, understanding stem cell differentiation from an epigenetic perspective may provide new insights on carcinogenesis. Recently, detailed analysis of DNA methylation in the maintenance of pluripotency has become a popular area of study to better understand the basic mechanisms underlying this process in ES cells as well as in iPSCs (Bhutani et al., 2009). Histone code: As previously described, histones are not merely DNA packaging molecules but also important carriers of epigenetic information. Epigenetic information is stored in histones by chemical modifications of the N-terminal tails of core histones, including acetylation, phosphorylation, methylation, and ubiquitination. Histone modifications can affect the chromatin template and therefore transcription through three main mechanisms: Cis-effects, trans-effects, and histone replacements. Firstly, histone modifications can alter intranucleosomal interactions by changing the physical properties of modified histone tails, such as electrostatic charges (cis-effects). These intranucleosomal structural changes can affect the accessibility of the DNA contained on that nucleosome to transcriptional factors. For example, phosphorylation of the linker histone H1, which reduces the electrostatic binding of H1 to DNA in chromatin, can repress transcription (Dou & Gorovsky, 2000). Secondly, histone modifications are recognized (or read) by modification-specific binding partners (trans-effects), which mediate down-stream chromatin-modulating events. For example, methylation of lysine 9 on histone H3 is read by the chromodomain of heterochromatin protein 1 (HP1), which is involved in both maintenance and somatic inheritance of transcriptionally silent heterochromatin (Bannister et al., 2001). Lastly, histone modifications can recruit ATP-dependent chromatin remodelers that shuffle histone variants into and out of chromatin. The replacement of histone variants plays important roles in transcriptional regulation. For example, the replacement of H3 with the histone variant H3.3 is associated with active chromatin and coupled to transcription (Ahmad & Henikoff, 2002; Schwartz & Ahmad, 2005). The "histone code" hypothesis has evolved from the recognition of the combinatorial nature of histone modifications that can generate synergistic or antagonistic interaction affinities for chromatin-associated proteins (Jenuwein & Allis, 2001). According to this hypothesis, histone modifications can act sequentially, in that a given modification of a specific histone residue is required for subsequent modifications of the same histone or another histone molecule (Margueron et al., 2005). For example, ubiquitination of histone H2B on lysine 123 is required for the methylation of histone H3 on lysine 4, which leads to transcriptional repression (Dover et al., 2002). The histone code hypothesis also implies that multiple signaling pathways converge on chromatin through histone modifications, the combination or summation of which are 'read' by chromatin-associated proteins (Nightingale et al., 2006). The presence of local binary switches (e.g., the 'methyl/phos switch' on H3 Lys 9/Ser 10) and modification cassettes (i.e. the combination of modifications in short clusters with distinct biological readouts) support this hypothesis (Fischle et al., 2003). One of the most studied modifications of the histones is acetylation. There are a few reports describing histone acetylation as important in human embryonic stem cell pluripotency (Lee et al., 2004, Bernstein et al., 2006). This epigenomic control has been a focus of stem cell research for many years, with DNA methylation and miRNA studies of transcriptional regulation in maintaining self-renewal and regulating differentiation.

Transcriptomic approach to understand stem cell differentiation

Microarray technology has revolutionized the field of transcriptomics, enabling biologists to perform global gene expression analyses (Chang et al., 2006). After the isolation of various stem cells, including ESCs, NSCs, and HSCs, one of the first questions addressed by microarray technology was identifying whether there are intrinsic molecular programs shared by all stem cells that define "stemness." The search for stemness is based on the hypothesis that stem cells have to respond in similar ways to regulate self-renewal and differentiation, irrespective of their lineal origin (Cai et al., 2004). The first studies comparing the transcriptomic profiles of stem cells raised both enthusiasm and confusion. Ramalho-Santos et al. (Ramalho-Santos et al., 2002) and Ivanova et al. (Ivanova et al., 2002) compared transcriptional profiles of ESCs, HSCs, and NSCs, each identifying a list of genes that are likely to reveal core stem cell properties. Interestingly, the number of overlapping genes between the two studies was minimal (only 15 genes), and most of the stemness genes from both groups were not exclusively expressed in NSCs. These findings raised two fundamental questions: 1) whether stemness can be defined at the genetic level (Fortunel et al., 2003), and 2) whether a specific combination of genes rather than individual genes endows stemness (Burns & Zon, 2002). These studies not only provided a first glimpse of stemness networks, but also revealed critical issues in the transcriptomic profiling approach to defining stemness.

microRNA

Basic concepts: Non-coding RNAs or nonprotein-coding RNAs control various levels of gene expression in physiology and development, including chromatin architecture/epigenetic memory, transcription, and RNA splicing, editing, translation and turnover (Mattick & Makunin, 2006). MicroRNAs (miRNAs) are a class of non-coding RNAs that are about 22 nucleotides in length and play important roles in the regulation of target genes (Bartel 2004). According to current estimates, miRNAs account for ~1% of predicted genes in higher eukaryotic genomes and regulate up to 10~30% of genes (Cui et al., 2006). miRNAs regulate gene expression by either of two post-transcriptional mechanisms, depending on sequence complementarity; miRNA induces cleavage of the mRNA (100% complementarity) or represses translation by binding to the 3' UTR (less complementary) (Pasquinelli et al., 2005). In plants, miRNAs base pair with target mRNAs by precise or nearly precise complementarity, leading to direct cleavage and destruction of the targets. In contrast, most animal miRNAs are less complementary to their target mRNAs and inhibit protein synthesis while preserving the stability of the target mRNAs. According to the combinatorial rheostat model of gene expression micromanagement by miRNAs, especially in animals, the combinatorial binding of miRNAs with different effective concentrations in different cell types leads to resistance changes (i.e., resistance to translation), thereby fine-tuning the level of protein that is produced (Bartel & Chen, 2004). The imprecise base pairing between the typical animal miRNA and a target mRNA suggests that animal miRNAs may regulate a wide repertoire of different mRNAs. This hypothesis was supported by a recent study using microarray analysis, in which transfection of tissue-specific miRNAs downregulated a large number of target mRNAs (e.g., miR-124 downregulated 174 genes) (Lim et al., 2005). Interestingly, there are some apparent trends in the types of genes regulated by miRNAs. For example, in Drosophila, the target genes of miR-277 are enriched for functionally related enzymes involved in the catabolism of leucine, isoleucine and valine, suggesting that miR-277 acts as a 'metabolic switch' of this biochemical pathway (Stark et al., 2003). Certain miRNAs (e.g., miR-34a-c) are important components of signaling pathways, such as the p53 network (He et al., 2007). miRNAs are also known to be involved in the regulation of various biological processes, including apoptosis (Xu et al., 2004), animal development (Wienholds & Plasterk, 2005), physiological function (e.g., insulin secretion) (Poy et al., 2004), and differentiation (Chen et al., 2004). miRNA and stem cell differentiation: From the perspective of stem cell biology, miRNAs are attractive regulatory molecules, as they provide an efficient means for coordinating the action of many functionally related target genes (Cheng et al., 2005). The highly tissue-specific expression of most miRNAs also suggests that miRNAs play important roles in differentiation and maintenance of cell identity (Wienholds et al., 2005). As previously described, TFs are core elements of the transcriptional regulatory circuitry that determines cell specifications (Davidson 2006). Therefore, the fact that many TFs are target hub genes of miRNAs underscores the roles of miRNAs in the transcriptional regulatory circuitry and cell specifications. miRNAs involved in differentiation not only target TFs such as Nanog, Smad1, and c-Myb (Luzi et al., 2007; Xiao et al., 2007), but also form elaborate regulatory loops with TFs, which provides a mechanism for determining stable end states. For example, Johnston et al. (Johnston et al., 2005) showed that two miRNAs and their TF targets act in a double-negative feedback loop to control neuronal cell fate decision, and miRNAs function as developmental switches in such regulatory loops. Yoo and Greenwald (Yoo & Greenwald, 2005) also showed a similar feedback loop in which a miRNA works as a developmental switch for mutually exclusive expression of two cell-fate-determining genes. In this model, the protein encoded by the first gene switches on a miRNA, which in turn switches off expression of the second gene (Plasterk 2006). Recently, several reports has been published regarding novel miRNAs and their function in both maintaining pluripotency and self renewal (Sanosaka et al., 2009; Liu & Zhao, 2009, Judson et al., 2009).

Stem cell proteomics: the road ahead

Why stem cell proteomics?

Major breakthroughs in HSC research were made by the identification of proteins such as colony-stimulating factors (CSFs) and cell-surface CD molecules. Proteins, key players in the cell, have diverse features that are not predictable from gene sequences or transcript levels. For example, posttranslational modifications (PTMs), protein-protein interactions, and subcellular locations affect the function and activity of proteins, but these are not predictable using genomics or transcriptomics technology. Both basic and clinically oriented stem cell research are confronted with many open questions that can be most efficiently answered by proteomics. For instance, the cell surface proteins and signaling cascades of stem cells and their differentiated progenies are largely unknown, as are the differentiation-specific proteins that can be used as biomarkers of the intermediate or terminal steps of cell differentiation, or discriminate tumorigenic cells from the pool (Krijgsveld et al., 2008). According to Metcalf, one of the pioneers of CSF research, the most formidable technical challenge in identifying CSFs was their purification (Metcalf 1991). Given the current status of proteomics technology, a project like that of CSF purification and identification can now be completed in six months rather than years or decades. The biggest challenge we now face is the systematic and optimized use of proteomics technology to decipher stem cell biology.

Stem cell proteomics: the road ahead

Finding CDs: the membrane proteome and beyond Membrane proteins perform most of the specific functions of the plasma membrane (Alberts 2002). According to bioinformatic analyses, 20~30% of all open reading frames in the genome are predicted to encode membrane proteins (Wallin & von Heijne, 1998). Membrane proteins represent more than two-thirds of drug targets (Hopkins & Groom, 2002), and they are essential to understand the underlying mechanisms of various biological functions. For example, our understanding of immune function is based on recognition of only 10~20% of leukocyte membrane proteins (Zola & Swart, 2003). In stem cell research, membrane proteins are important not only for understanding stem cell biology, but also for their clinical application. The first membrane proteomics initiative: the Human Leukocyte Differentiation Antigens Workshop (HLDA), arguably the first systematic membrane proteomics consortium, was initiated in 1982, before the word 'proteomics' was even coined in 1995. HLDA aimed to bring order to the chaos caused by the generation of large numbers of monoclonal antibodies reactive against leukocyte cell-surface molecules, each with unknown molecular targets and different associated nomenclatures (Zola et al., 2005). Through the organized efforts of HLDA, many cell-surface molecules with important functions in leukocyte biology were identified and characterized under the CD ("cluster of differentiation") system, many of which are currently used widely as diagnostic reagents (e.g., CD3, CD4), and as the basis for therapeutic agents (e.g., CD3, CD20) (Zola & Swart, 2003). In 2005, the HLDA council changed the name of the organization to Human Cell Differentiation Molecules (HCDM) to reflect their broader objectives: to extend the focus from leukocytes to other cell types and to broaden the organization's scope from cell-surface molecules to any molecule whose expression reflects differentiation (Zola et al., 2005). Therefore, CD molecules now include proteins involved in differentiation as well as membrane proteins. After the development of protein sequencing technology, such as Edman degradation (Edman 1960) and mass spectrometry (Hunt et al., 1986), monoclonal antibodies no longer represent the primary tool for discovering new proteins, but combining knowledge of the human genome sequence, powerful bioinformatics, and cloning technologies allows for new opportunities in antibody proteomics. In addition, CD antibody microarrays, used for high-throughput screening of CD antigens in cells and tissues, provides a good example of how the conventional CD approach can be used in conjunction with high-throughput proteomics technology (Woolfson et al., 2006). Antibody proteomics: Antibody proteomics, one of the major initiatives of the Human Proteome Organization (HUPO), is defined as the systematic generation and use of protein-specific antibodies to functionally explore the proteome (Uhlen & Ponten, 2005). Agaton et al. (Agaton et al., 2003) showed that Protein Epitope Signatures Tags (PrESTs), unique epitopes present in native proteins, can be used as antigens and affinity ligands for cost-effective generation of highly selective, mono-specific antibodies or affinity-purified polyclonal antibodies. Currently, this approach has been extensively used to construct a comprehensive, antibody-based protein atlas of expression and localization profiles in human normal and cancer tissues (Uhlen et al., 2005). As monoclonal antibodies have been systematically validated for leukocyte biology through HLDA (currently HCDM), the resources developed by the antibody proteomics initiative can be systematically used for stem cell research. For that purpose, well-defined panels of stem cells and their derivatives are required. Once prepared, these panels of cells can be efficiently screened using high-throughput cell microarray (tissue microarray) methods. This approach allows for detection of novel CD molecules involved in the differentiation of a variety of stem cells. MS-based membrane proteomics: As previously mentioned, the antibody-based approach has both success and promise in identifying novel proteins, especially membrane proteins. However, the antibody-based approach has intrinsic limitations because it is an indirect method for characterizing proteins, and there can be cross-reactivity in antibody-antigen interactions (Bentley et al., 1994). Also, the antibody-based approach cannot provide much information about posttranslational modifications that may be important for protein function. In contrast, a mass spectrometry (MS)-based approach allows direct sequencing of peptides and proteins, and it can be used either as a method complementing the antibody-based approach or for direct identification and characterization of CD molecules. Firstly, MS can be used to identify molecular targets of antibodies (e.g. identification of immunoprecipitated proteins). Furthermore, even epitope mapping of antibodies is possible using MS (Yu et al., 1998). Secondly, a MS-based approach, coupled with membrane protein preparation and separation technologies, can be used to directly identify and characterize membrane proteins from cells and tissues (Wu et al., 2003; Wu & Yates, 2003). In 2005, AOHUPO (Asia Oceania Human Proteome Organization) launched the Membrane Proteomics Initiative (MPI), which aims to develop methods for characterizing membrane proteomes, and to characterize the proteomics of specific membrane systems. Once phase I (technology development) and phase II (large-scale analysis of membrane proteomes) are finished, the methods and experiences from MPI can be readily utilized for more subtle areas, such as stem cell proteomics. Posttranslational modifications (PTMs): The high complexity of the human proteome, consisting of ~300,000~3,000,000 distinct protein forms, results from the diversification of genetic information at both the mRNA level and the protein level (Walsh 2006). Genetic information stored in ~30,000 genes is diversified through alternative splicing of primary mRNA transcripts at the mRNA level (Black 2003). Once translated, proteins are diversified through post-translational covalent changes termed PTMs (Krishna & Wold, 1993), and many important regulatory steps depend on proteins' PTMs rather than on their expression levels (Levchenko 2005). PTMs play important roles in stem cell biology. Elliott et al. (Elliott et al., 2004) showed that ~21% of the proteins identified in murine R1 ES cells had PTMs, and several of them, including Ras-GTPase activating protein binding protein 1 and phosphoglycerate kinase, had not been previously associated with PTMs. Unwin et al. (Unwin et al., 2006), revealed the importance of posttranslational control as a regulatory factor in primary hematopoietic stem cells. From the proteomics perspective, assaying changes in PTMs during stem cell differentiation in a high-throughput fashion is a key to understanding the underlying mechanisms of stemness and differentiation. The determination of PTMs is one of the main challenges in proteomics research. However, recent developments in affinity-based enrichment and extraction methods, multidimensional separation technologies, and mass spectrometry (MS) now allow systematic investigation of PTMs. For example, histone PTMs have been extensively characterized by MS-based proteomics approaches (Pesavento et al., 2004; Thomas et al., 2006). Furthermore, emerging top-down MS technology captures not only individual PTMs, but also their combinations present on the same protein molecule (Garcia et al., 2007; Taverna et al., 2007). Proteomics also allows for quantitative analysis of subproteomes with a specific PTM (e.g. the phosphoproteome (Reinders & Sickmann, 2005) and the methylproteome (Ong et al., 2004). For example, Kratchmarova et al. (Kratchmarova et al., 2005) used MS-based approaches to quantitatively monitor tyrosine-phosphorylated proteins upon the differentiation of mesenchymal stem cells into bone-forming cells in response to growth factors. Recently, Van Hoof et al. (2009) successfully reported phospho-protein profiling during early differentiation of human embryonic stem cells, which enables the identification of key protein activations in specific signaling pathways of the early differentiation process. This paper provides strong evidence of how PTM-based proteomics can contribute to understanding the mechanism of adaptor signaling at the stem cell level. This technique will likely be widely applied to mechanism-based research focusing on self-renewal and differentiation processes of stem cells. Protein-protein interactions (PPI): A single protein may have diverse functions that are determined in the context of the network or functional module it belongs to (Sharan et al., 2007). The importance of PPI is well illustrated in the famous Myc-Max-Mad network in which different combinations of TFs lead to different functional outcomes (Baudino & Cleveland, 2001). With the help of proteomics and bioinformatics, a comprehensive map of protein interactions in human was recently created, which describes the sum of all potential protein interactions in humans (Gandhi et al., 2006). Since each of the ~274 different cell types in the human body has different protein expression profiles and thus different protein networks, understanding the unique protein network architecture of each cell type is a key to understanding stem cell differentiation. As previously described, the transcriptional regulatory circuitry, in which TFs are core elements, determine cell specification and function. Therefore, it is important to understand protein interactions of transcription factors in order to understand the transcriptional regulatory circuitry. Kim et al. (Kim et al., 2006) showed that a small protein interaction network starting from Nanog, a central TF in ES cells, might serve as a functional module for maintaining ES cell pluripotency. The diverse roles of TFs in both stem cells and differentiated cells also indicate that their corresponding protein networks might differ. For example, in neural crest stem cells, Sox10, a high-mobility-group TF, maintains pluripotency and inhibits neuronal differentiation (Kim et al., 2003). On the other hand, Sox10 is involved in the terminal differentiation of myelin-forming oligodendrocytes (Stolt et al., 2002). The fact that Sox10 interacts with many TFs, forming diverse protein networks, may partly explain its contradictory roles in stem cells and differentiated cells (Wissmuller et al., 2006). Finding stem cell growth factors: secretome As mentioned earlier, understanding differentiation is important both for basic research and clinical applications. In HSC research, the first major technical advance towards understanding hemopoiesis was the introduction of solid-state cultures of bone marrow and spleen cells (Bradley & Metcalf, 1966; Ichikawa et al., 1966). Then, the discovery of hemopoietic growth factors or colony-stimulating factors (CSFs) led to molecular level understanding of hemopoietic differentiation, followed by clinical applications of CSFs (Metcalf 1991). The extensive purification required to achieve complete purification of each CSF illustrates the importance and usefulness of proteomics technology in discovering growth factors present in minute amounts. In the most difficult case, enrichment of one million-fold was required for purification and characterization of a CSF (Metcalf 1991). As illustrated in HSC research, finding growth factors in the secretome is a critical step towards understanding and controlling stem cell-differentiation at the molecular level. Recently, a proteomics approach revealed the endodermal secretome network guiding stem cell cardiopoiesis from ES cells (Arrell et al., 2008). Evidence from normal and tumor cells also illustrate the linkage between signal transduction pathways and the secretome (Jacobs et al., 2006; Khwaja et al., 2006). Most CSFs were purified from media conditioned by various human tumor cells, and it is noteworthy that tumors secreting CSFs include not only hemopoietic cancers such as a T-lymphoblast cell line (Gasson et al., 1984), but also a human squamous cell line from a lower oral cavity (Nomura et al., 1986) and a gastric cancer cell line (Baba et al., 1995), which are not related to hemopoiesis. Therefore, systematic screening of stem and cancer cell media, combined with extensive proteomic purification and characterization, may be required for high-throughput characterization of the growth factor repertoire of each stem cell line. Recently, Kang et al. (Kang et al., 2008) reported proteomic differences of various sized exosomes from human neural stem cells and its differentiated oligodendrocyte progeny. In the study, the authors separated exosomes with Field-Flow Fractionation by 10 nm size differences, and showed that this step was useful for reproducibly analyzing the proteomics changes. These promising results suggest that well-designed sampling strategy and analysis technologies should be further developed to achieve better yields in secretome-based proteomics.

Perspectives

In 2007, the HUPO and ISSCR joint initiative, Proteome Biology of Stem Cells, was established as a collaborative platform bringing together stem cell biologists and researchers in proteomics (Heck et al., 2007; Krijgsveld et al., 2008). The aim of the initiative is to effectuate the implementation of cutting edge proteomic technology in stem cell research to further our understanding of stem cell biology. This has been prompted primarily by major breakthroughs in stem cell biology, the potential of stem cells for biomedical application, and the awareness that proteomics may be able to accelerate this progress further and possibly open yet unexplored areas of research. Over the last 2 years, acting groups of the initiative have started workshops to optimize protocols for hESC sampling, MS analysis, and bioinformatics analyses. The initiative has successfully chosen hESC cells for initial study from the ES cell bank of the International Embryonic Stem Cell Consortium. Analysis of hESC membrane proteins will be started soon to standardize biomarker discovery, and this will help provide standard guidelines for stem cell proteomics. The voyage to stem cell proteomics has already begun.
  123 in total

1.  The role of DNA methylation in mammalian epigenetics.

Authors:  P A Jones; D Takai
Journal:  Science       Date:  2001-08-10       Impact factor: 47.728

2.  Phosphorylation of linker histone H1 regulates gene expression in vivo by creating a charge patch.

Authors:  Y Dou; M A Gorovsky
Journal:  Mol Cell       Date:  2000-08       Impact factor: 17.970

Review 3.  Portrait of a stem cell.

Authors:  Caroline Erter Burns; Leonard I Zon
Journal:  Dev Cell       Date:  2002-11       Impact factor: 12.270

Review 4.  MicroRNAs: genomics, biogenesis, mechanism, and function.

Authors:  David P Bartel
Journal:  Cell       Date:  2004-01-23       Impact factor: 41.582

Review 5.  Micromanagers of gene expression: the potentially widespread influence of metazoan microRNAs.

Authors:  David P Bartel; Chang-Zheng Chen
Journal:  Nat Rev Genet       Date:  2004-05       Impact factor: 53.242

6.  Selective recognition of methylated lysine 9 on histone H3 by the HP1 chromo domain.

Authors:  A J Bannister; P Zegerman; J F Partridge; E A Miska; J O Thomas; R C Allshire; T Kouzarides
Journal:  Nature       Date:  2001-03-01       Impact factor: 49.962

7.  Cytosine methylation prevents binding to DNA of a HeLa cell transcription factor required for optimal expression of the adenovirus major late promoter.

Authors:  F Watt; P L Molloy
Journal:  Genes Dev       Date:  1988-09       Impact factor: 11.361

8.  Microarray analysis shows that some microRNAs downregulate large numbers of target mRNAs.

Authors:  Lee P Lim; Nelson C Lau; Philip Garrett-Engele; Andrew Grimson; Janell M Schelter; John Castle; David P Bartel; Peter S Linsley; Jason M Johnson
Journal:  Nature       Date:  2005-01-30       Impact factor: 49.962

9.  Establishment and characteristics of a gastric cancer cell line (HuGC-OOHIRA) producing high levels of G-CSF, GM-CSF, and IL-6: the presence of autocrine growth control by G-CSF.

Authors:  M Baba; H Hasegawa; M Nakayabu; N Shimizu; S Suzuki; N Kamada; K Tani
Journal:  Am J Hematol       Date:  1995-07       Impact factor: 10.047

10.  Shotgun annotation of histone modifications: a new approach for streamlined characterization of proteins by top down mass spectrometry.

Authors:  James J Pesavento; Yong-Bin Kim; Gregory K Taylor; Neil L Kelleher
Journal:  J Am Chem Soc       Date:  2004-03-24       Impact factor: 15.419

View more
  2 in total

Review 1.  Transcriptomics and proteomics in stem cell research.

Authors:  Hai Wang; Qian Zhang; Xiangdong Fang
Journal:  Front Med       Date:  2014-06-27       Impact factor: 4.592

2.  Expression of surface markers and myogenic potential of rat bone marrow- and adipose-derived stem cells: a comparative study.

Authors:  Vahid Bayati; Mahmoud Hashemitabar; Roohollah Gazor; Reza Nejatbakhsh; Dariush Bijannejad
Journal:  Anat Cell Biol       Date:  2013-06-30
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.