Literature DB >> 25068131

A human pluripotent stem cell surface N-glycoproteome resource reveals markers, extracellular epitopes, and drug targets.

Kenneth R Boheler1, Subarna Bhattacharya2, Erin M Kropp2, Sandra Chuppa2, Daniel R Riordon3, Damaris Bausch-Fluck4, Paul W Burridge5, Joseph C Wu5, Robert P Wersto3, Godfrey Chi Fung Chan6, Sridhar Rao7, Bernd Wollscheid4, Rebekah L Gundry2.   

Abstract

Detailed knowledge of cell-surface proteins for isolating well-defined populations of human pluripotent stem cells (hPSCs) would significantly enhance their characterization and translational potential. Through a chemoproteomic approach, we developed a cell-surface proteome inventory containing 496 N-linked glycoproteins on human embryonic (hESCs) and induced PSCs (hiPSCs). Against a backdrop of human fibroblasts and 50 other cell types, >100 surface proteins of interest for hPSCs were revealed. The >30 positive and negative markers verified here by orthogonal approaches provide experimental justification for the rational selection of pluripotency and lineage markers, epitopes for cell isolation, and reagents for the characterization of putative hiPSC lines. Comparative differences between the chemoproteomic-defined surfaceome and the transcriptome-predicted surfaceome directly led to the discovery that STF-31, a reported GLUT-1 inhibitor, is toxic to hPSCs and efficient for selective elimination of hPSCs from mixed cultures.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25068131      PMCID: PMC4110789          DOI: 10.1016/j.stemcr.2014.05.002

Source DB:  PubMed          Journal:  Stem Cell Reports        ISSN: 2213-6711            Impact factor:   7.765


Introduction

Human pluripotent stem cells (PSCs) can differentiate into nearly all somatic cell types present in the human body and can generate clinically relevant numbers of cells for regenerative medicine. The advent of hiPSCs, derived from somatic cells by the exogenous expression of defined transcription factors, has overcome ethical issues associated with human embryonic stem cells (hESCs) and, when derived from the patient, may avoid immunological complications. Human iPSCs have also opened new avenues of research for the study of basic disease mechanisms and development of informative model systems for drug discovery. Although promising, significant limitations to the therapeutic use of hiPSCs remain unresolved. These include interline variations ranging from inconsistent transcription factor expression and differential DNA methylation to sporadic point mutations and chromosomal defects that affect in vitro differentiation, tumorigenicity, and potential clinical applications (Feng et al., 2010; Gore et al., 2011; Robinton and Daley, 2012). Moreover, current tests of hiPSC potency rely on extensive in vitro differentiation tests, in vivo teratoma assays in rodents (Maherali and Hochedlinger, 2008; Robinton and Daley, 2012) or bioinformatic and gene expression assays (Bock et al., 2011; Müller et al., 2011), which cannot be practically implemented into high-throughput hiPSC line generation designed to limit interline variability. The lack of suitable cell-surface marker panels and related affinity-based reagents for isolating high-quality hiPSCs and well-defined progeny significantly restricts our ability to minimize interline variability and employ hiPSCs for regenerative medicine. Although guidelines and animal-free methods have been proposed for the derivation and characterization of therapeutic and good manufacturing practice compliant hiPSCs (Buta et al., 2013; Funk et al., 2012; Maherali and Hochedlinger, 2008; Müller et al., 2010), no system is available to overcome safety and efficacy issues of hiPSCs analogous to immunophenotyping of blood lineages for identifying and isolating hematopoietic stem cells (HSCs). Although markers such as SSEA-3, SSEA-4, Tra-1-60, and Tra-1-81 aid in the identification of hPSCs, few known surface markers and application-specific antibodies are restricted to the pluripotent state (Damjanov et al., 1982; Kannagi et al., 1983; Lowry et al., 2008). Moreover, as cell-surface proteins play critical roles in inter- and intracellular communication, a better understanding of the cell surface should inform the dynamic interplay between cells and their microenvironment that ultimately regulates how hPSCs interact with and respond to external cues and differentiate in a directed manner (Lian et al., 2013; Murry and Keller, 2008; Yan et al., 2005). Coupling this functional relevance with the fact that more than 60% of US Food and Drug Administration-approved drug therapies target membrane proteins, and 38% of disease-related proteins are membrane associated (Cheng et al., 2012; Yildirim et al., 2007), we aimed to generate a new resource derived from a targeted analytical approach, Cell Surface Capture (CSC) technology (Gundry et al., 2009, 2012; Hofmann et al., 2010; Wollscheid et al., 2009) that experimentally verifies extracellular domains of surface proteins. This resource, through its direct protein measurements, will foster the identification of proteins and epitopes useful for immunophenotyping and facilitate the identification of drugs that target hPSCs. The value of this resource is exemplified by the identification of proteins capable of marking live hPSCs and the identification of a small molecule inhibitor, STF-31, that allows for selective depletion of hPSCs from hESC and hiPSC-derived progeny.

Results

Human Pluripotent Cell-Surface N-Glycoproteome

We employed the chemoproteomic CSC technology to directly measure and define the average N-glycoprotein surface proteome of hiPSCs, hESCs, and human fibroblasts (hFibs) (Figure 1A). Biotin labeling of glycoproteins was specific to the cell surface (Figure 1B), and all reported glycoproteins detected by this technology were identified by peptides containing a deamidation (resulting from PNGaseF cleavage after glycosylation-based enrichment) at the consensus amino acid sequence motif (NxS/T) for N-glycosylation (Figure 1C). Altogether, 496 N-glycoproteins were identified with a false discovery rate (FDR) ≤5% on the cell surface of hPSCs (Figure 2; Table S1 available online), composed of 92 cluster of differentiation (CD) and 411 non-CD molecules, including predicted transmembrane (92%), Glycosylphosphatidylinisotol (GPI)-linked (4%), and extracellular matrix proteins (4%). A complete catalog of extracellular glycopeptides and glycosites is provided in Tables S2, S3, and S4, and data were compiled into ProteinCenter (Figure 1D). These data provide direct experimental evidence regarding surface localization, which is critical for proteins whose subcellular localization is unknown or ambiguous, and transmembrane orientation for proteins whose structure has not yet been resolved. LRRN1, a protein with generic “membrane” annotation but not currently linked to gene ontology (GO) terms for the cell surface or plasma membrane (Figure 1E), and FAM216A, a protein with previously unannotated subcellular localization (UniProt), are examples of where CSC technology data provide new experimentally derived localization information, and immunostaining corroborates the surface localization of these proteins, thus confirming the validity of CSC-technology-based annotations. The discovery approach used in this study is limited to the identification of N-glycosylated peptides; therefore, it is not uncommon for proteins to be identified by a single N-glycopeptide (e.g., proteins with only a single N-glycosite in their amino acid sequence). Additional confidence in all protein identifications is provided by the use of high mass accuracy mass spectrometry during data acquisition and the implementation of three independent search algorithms for data processing. Examples of proteins that were identified by a single N-glycopeptide, but for which we provide positive antibody-based evidence include ADCYAP1R1, EFNA3, FAM216A, HTR2C, IL27RA, NPR1, PVRL2, and SLC7A3. Compared to published proteomic studies of hPSCs (i.e., compilation of 15 studies; herein referred to as the “PSC proteome” (Gundry et al., 2011), 158 proteins (31% of the current data) have not been reported among the >9,500 proteins previously described in hPSCs, and 106 proteins (21% of the current data) were identified in only a single other study within the PSC proteome (Gundry et al., 2011). Consistent with the PSC phenotype, EPCAM and ALPL were observed exclusively in hPSCs, whereas others such as Thy1 (CD90) were also observed in fibroblasts.
Figure 1

CSC Technology Workflow

(A) Simplified schematic workflow of the CSC technology.

(B) Immunofluorescence image of hESC colony after cell-surface biotin labeling via the CSC technology, illustrating selective extracellular biotinylation. Blue, DNA (Hoechst); green, Biotin (Streptavidin-FITC). Scale bar, 200 μm.

(C) Annotated MS/MS spectrum of one peptide from Interleukin-17 receptor D illustrating the deamidation (N115) within the N-glycosite sequence motif (highlighted), which collectively represents the data “tag” used for filtering out non-cell-surface contaminants from the final protein list.

(D) Bioinformatics workflow for merging multiple data types for identifying cell-surface proteins of interest for downstream characterization.

(E) Graphical representation of the coverage of N-glycopeptides and confirmation of extracellular domain for LRRN1, a previously “generically” annotated membrane protein without GO terms linking it to the cell surface. Image generated using Protter (http://wlab.ethz.ch/protter) (Omasits et al., 2013).

Figure 2

The hPSC Cell-Surface N-Glycoproteome

All 496 N-glycoproteins identified in hPSCs are listed in order of their increasing mRNA expression (top left to right bottom). For simplicity, the microarray values represented here are an average among all replicates of H9 hESC and KB3 hiPSC. The fill color of each box is representative of transcript abundance (unlogged microarray value), where white indicates values below 200. The border color indicates Gene Ontology (GO) protein annotations of cell-surface localization. Proteins represented by white boxes with red borders are examples of proteins identified via the CSC technology but would otherwise be unlikely to be categorized as potential surface markers due to their low transcriptional expression and lack of surface localization GO annotation.

Surface Proteome Comparisons Reveal Putative hPSC-Restricted Markers

Putative markers for positive selection were identified by comparing the surface proteome of hPSCs with those from hFibs, nondiseased somatic cell types (n = 12) and cancer cell types (n = 38) present in the Cell Surface Protein Atlas (see Table S5). As selection criteria, putatively restricted PSC markers were required to be absent from hFibs, present in no more than three somatic cell types, but were not limited by the number of cancer cell types in which they were observed, as similarities between cancer and pluripotent cells are known (Kim et al., 2010; Ratajczak et al., 2013; Riggs et al., 2013). Two hundred proteins met these selection criteria, and 90 of these were uniquely identified in this study, whereas 47 others were reported in only one other proteomic study (Table S6). Of these 200 PSC-restricted proteins, 48 were uniquely identified in hPSCs when compared to other cells within the Atlas (Table 1). We provide surface immunostaining for eight of the 90 hPSC-restricted surface proteins uniquely identified here (ADCYAP1R1, EFNA3, FAM216A, FGFR3, HTR2C, IL17RD, NPR1, and OPCML; Figures 3 and 4), as well as for ten hPSC-restricted proteins observed among the 15 previous proteomic reports (CXADR [seven other studies], EPHA1 [five]; IL27RA [one]; LINGO1 [one]; LRRN1 [three]; PTPRZ1 [four]; SEMA6A [six]; SPINT1 [three]; SLC7A3 [four]; and UNC5D [one]). Eight additional cell-surface proteins (ACVR2A, F3, ICAM3, ITGA6, NRCAM, PCDH1, PVRL2, and UNC5C) not hPSC-restricted are also illustrated in Figures 3 and 4 and S1. In all cases, antibody-based analyses were consistent with surface localization determined by the CSC technology. Moreover, costaining of fixed cells illustrates that surface expression colocalizes to OCT4-positive cells (Figure 3A), and altogether we demonstrate the ability of 28 antibodies to successfully stain live hPSCs either by immunocytochemistry (Figures 3B and S1) or flow cytometry and confirm the absence of hPSC-restricted markers on hFibs (Figure 4).
Table 1

Cell-Surface Proteins of Interest

Gene Symbol for ProteinMouse ESCMouse iPSChESChiPSCHuman FibroblastsNo. of Nondisease Human Cell Types in Cell Surface Protein Atlas (out of 14)No. of Human Cancer Cell Types in CSPA (out of 38)No. of Human Pluripotent Stem Cell Proteomic Studies (out of 15; Gundry et al., 2011)Promoter Occupancy
NanogOct4Sox2
Reference Markers

ALPL557
EPCAM (CD326)147
THY1 (CD90)6225

Pluripotency Restricted

ADAM22<1132
ADCYAP1R1000
AMIGO3001
APLP20223
ARHGAP28003
ASTN1000
ATP13A3002
ATP6V0A21165
B4GALT31111
C11ORF87000
CACNG8000
CALCR100
CCKBR101
CD830170
CDH3113
CELSR2161
CHST6160
CLECL1000
CLUL1000
CNTFR130
CNTN2100
CPSF3L101
DBC1000
DPEP1120
DPP6000
EFNA3<a000
ELFN1191
ENTPD2131
EPHA7100
FAM216A000
FGFR4 (CD334)151
FKRP0112
FLT1011
FTH1101
GABRA3000
GABRA5001
GABRB3013
GCNT2003
GDPD2100
GNPTAB1224
GPM6A000
GPR20000
GPR64120
GPR98000
GRID2000
HEPH100
HPN000
HTR2C000
IGSF1122
IGSF21001
IGSF9002
IL17RD150
IL27RA091
IL4R (CD124)172
INHBE120
KAL1000
KCNE1L100
KCNE3030
KDR (CD309)103
KIT (CD117)112
LAMA1196
LAMB2122
LGR4102
LINGO1101
LPAR3000
LPAR4101
LRIG1104
LRP4143
LRRN1143
MC5R000
MEGF10000
MFSD11000
MICB191
MMGT1015
MMP15<100
NCLN016
NLGN4X032
NPR1110
NPY1R111
NRXN1000
NTN1100
OLFM2000
OLFM4000
OLFML3041
PCDH11X001
PLA2G3011
PLXDC2001
PODXL2161
PTPRD033
PTPRU002
PTPRZ1054
SEMA4B1161
SEMA6A006
SLC15A1001
SLC15A2010
SLC24A3100
SLC2A12010
SLC30A4000
SLC38A2104
SLC38A5002
SLC38A9122
SLC43A1161
SLC6A5000
SLC7A3106
SLITRK4032
SPINT1103
ST14061
STX1B<102
TLR2 (CD282)<160
TMEM114100
TMEM132B111
TMEM200B131
TMEM63A133
TMEM63C130
TNFRSF10C (CD263)000
TNFRSF21041
TNFRSF8 (CD301102
TNFSF11 (CD254)000
UNC5D001
VSIG10152
VSIG10L000
WBSCR17<012

Pluripotency: Putative Negative Markers

ANPEP (CD13)7142
BTN3A39240
COLEC127111
DPP4 (CD26)9161
EMR2 (CD312)7210
FAP6121
FBN1682
GPR1266130
ICAM56230
NRP1 (CD304)8190
PLAUR6201
PRNP (CD230)10372
PTPRM7121

Pluripotency restricted were identified in one or fewer nondiseased cell types from the Cell Surface Protein Atlas. Negative markers were identified in human fibroblasts and six or more nondiseased cell types from the Atlas, but not in human pluripotent cells. Mouse ESC and iPSC data are from Gundry et al. (2012).

For the hESC and hiPS surface protein data from the current study, “<” indicates the protein was identified with a probability of <0.5 in that data set.

Figure 3

Immunocytochemistry of DF6-6-9T hiPS Cells Stained with Surface Markers

(A) Surface marker labeling on fixed hiPSCs with inset showing overlay with OCT4.

(B) Surface markers labeling on live hiPSCs with inset showing overlay with Hoechst. Scale bar, 50 μm.

Figure 4

Flow Cytometry Histograms of Live, Unfixed, H1 hESC, DF6-6-9T hiPSC, and hFib

(A) Reference markers.

(B) Positive hPSC selection markers.

(C) Negative markers.

See also Figure S1.

Surface Proteome Comparisons Reveal Negative hPSC Markers

Negative selection markers are not well established for hPSCs, even though positive selection carries the risk of functional changes with antibody binding (e.g., CD3 antibody binding promotes T cell activation; Berg et al., 1998). To identify putative markers for negative cell selection, we focused on proteins present in hFib, present in six or more nondiseased cell types in the CSPA, and which were not observed in hPSCs via the CSC technology. This led to the classification of 15 proteins as potential negative markers of selection of hPSCs from coculture with hFibs (Table 1). Flow cytometry confirmed the absence of three selected proteins (DPP4 [CD26], NRP1 [CD304], COLEC12) on hESCs and hiPSCs that were highly immunoreactive on hFibs (Figure 4C). Moreover, we classified aminopeptidase N (ANPEP, CD13) as a negative marker of pluripotency, consistent with a recent report showing successful fluorescence activated cell sorting (FACS) of hiPSCs after reprogramming (Kahler et al., 2013). These data also reveal discrepancies between mouse and human PSCs. We previously identified NRP1, COLEC12, and CD13 in mouse PSCs (Gundry et al., 2012), but, based on CSC technology and antibody-based analyses here and (Kahler et al., 2013), these surface proteins are absent from hPSCs.

Theoretical Surface Proteome versus CSC Technology

A comparison of the theoretical surface proteome based on transcriptomic data to that defined by the CSC technology underscores why proteomic strategies are essential for providing direct evidence for the location and quantity of cell-surface proteins (Figure 2). For example, proteins such as ADCYAP1R1, EFNA3, HTR2C, NPR1, and UNC5D would have been overlooked as potential surface markers if relying solely on transcriptomic approaches. By microarray, these transcripts were detected at only background/threshold levels (Table S1), even though robust protein levels were detected using Ab-based techniques. Moreover, proteins such as FAM216A may be overlooked as potential surface markers as their GO terms do not specifically indicate they are localized to the plasma membrane. Surface proteins are affected by nontranscriptional mechanisms of regulation (e.g., surface proteins shuttle among cellular compartments, protein half-life independent of mRNA expression timeline (Kristensen et al., 2013; Vogel and Marcotte, 2012), and these types of posttranscriptional regulatory mechanisms may be broadly important as other surface proteins identified here with similarly low mRNA levels have known critical functions in early embryonic development or pluripotency (e.g., LPAR2, 3, and 4 [Liu and Armant, 2004] and TLR2 [Taylor et al., 2010]). Additionally, the microarray data may highlight a number of other PSC-restricted proteins with functional relevance to pluripotency due to their relatively high mRNA levels (e.g., APLP2, FRAS1, FTH1, GPC4, GCNT2, NLGN4X, PTPRZ1, SEMA6A, SLC38A2, SLC7A3, and VCAN). Although several cell-surface-annotated proteins with microarray values above 5,000 were not detected via CSC technology, further inspection reveals that GO term assignments for these can be misleading (e.g., TUBB3, MYH9 have GO terms including plasma membrane, even though these are not cell-surface-accessible proteins), and others do not contain an N-glycosylation site (e.g., ENO1, KARS, and SETP2) explaining why they were not detected in the current study, though alternative targeted proteome strategies such as Cys-CSC and K-CSC (Hofmann et al., 2010) would likely capture them if they are truly located at the cell surface.

hPSC Cell-Surface Protein Resource Enables Rational Selection of Drug Targets

The facilitated glucose transporter family is one subset of proteins that came to light as a result of the perceived discrepancies observed between the transcriptome analyses and the CSC technology data. We observed four members of this family (SLC2A1 [GLUT1], SLC2A3 [GLUT3], SLC2A4 [GLUT4], and SLC2A12 [GLUT12]) in hESCs and hiPSCs via the CSC technology, of which three are summarized in Figure 5A. Surface expression of GLUT1 and GLUT4 were observed by flow cytometry or immunocytochemistry (Figures 5B and 5C). By microarray, GLUT3 mRNA is three to five times higher than that of GLUT1 (Figure 5A), an observation validated by quantitative real-time PCR (Figure S2); however, significantly more MS/MS spectra for GLUT1 (487 MS/MS spectra in hESC) were observed than for GLUT3 (41 spectra) and GLUT4 (32 spectra) (Figure 5A). Protein sequences for GLUT1, GLUT3, and GLUT4 each contain a single site of N-glycosylation (UniProt) and were identified by one unique peptide sequence, whereas GLUT12 was identified by a single peptide containing three of the four possible N-glycosylation sites. Of course, the comparison of the total number of MS/MS spectra identified is only a crude measure of relative protein abundance, and it is typically inappropriate to apply spectral counting methods for relative abundance among different proteins within a sample (rather than one protein among multiple samples) due to variation in number of tryptic peptides that could result from different protein sequences. Moreover, changes in glycan structure could affect our ability to efficiently capture N-glycopeptides. Despite these caveats and in contradiction to the transcriptomic data, the relative number of spectra identified for GLUT1, GLUT3, and GLUT4 may reflect an authentic difference in surface abundance among these proteins because only a single peptide sequence is expected and explains why this protein family came to our attention.
Figure 5

STF-31 Studies

(A) Table of the number of MS/MS spectra identified via the CSC technology and the microarray values for three GLUT transporters in H9 hESCs and KB3 hiPSCs.

(B) Immunofluorescence image of GLUT4 on DF6-9-9T hiPS showing surface localization, and inset shows overlay with OCT4. Scale bar, 50 μm.

(C) Flow cytometry histograms of live cells illustrating surface localization of GLUT1 on the surface of hESCs and hiPSCs.

(D) Phase contrast images of hESCs, hiPSCs, hMSCs, hFibs, and hES-CMs after treatment with 2.5 μM STF-31, showing significant hPSC death by 48 hr and ∼100% PSC death by 72 hr. Scale bar, 100 μm.

(E) Cell viability via neutral red assay of H1 hESCs and DF6-9-9T hiPSCs treated with 2.5 μM STF-31 compared to vehicle control. Data are an average of three biological replicates, and error bars represent SEM. For 48, 72, and 96 hr treatment, all p values are below 0.05, and the p values for the H1 data are shown.

(F) hESC/hFib coculture treated with STF-31 shows selective loss of hPSCs. Top: flow cytometry histograms showing OCT4 staining of untreated hESC/hFib coculture (blue) and absence of OCT4 staining in hESC/hFib coculture treated for 72 hr with STF-31 (red). Middle: bright-field and immunofluorescence images of hESC/hFib coculture untreated (left) or treated (right) with STF-31 for 72 hr, showing fibroblast marker DPP4 and pluripotency marker OCT4. Scale bar, 100 μm. Bottom: quantitative real-time PCR of OCT4, NANOG, DPP4, NRP1, and GAPDH in hESC/hFib coculture treated with STF-31 for 72 hr compared to untreated. Results from triplicate technical analyses of two biological replicates are shown, and error bars represent SEM.

(G) Left: flow cytometry histograms of day 11 hESC-CMs for TNNT2 and MLC2V, illustrating high-quality hESC-CMs obtained by in vitro differentiation of hESCs. Right: quantitative real-time PCR of cardiac genes NKX2.5 and TNNT2 in hESC-CMs treated for 72 hr (starting on day 10 of differentiation) with STF-31, indicating no adverse effects on the expression of these genes by the inhibitor as compared to untreated (Figure S2). Results from triplicate technical analyses of two biological replicates are shown and are normalized to untreated hESC. See also Figure S2.

Knowing that GLUT1 is critical in early mouse embryonic development (Chi et al., 2000; Mobasheri et al., 2005), these observations prompted us to address whether surface expression of GLUT1 is required for hPSC biology using the chemical STF-31, a reported GLUT1 inhibitor (Chan et al., 2011). The media from STF-31-treated hPSCs do not show visible changes in pH during the first 48 hr of treatment, relative to vehicle controls, confirming that overall cell metabolism is reduced (Figure S2). Although human mesenchymal stem cell (MSCs), hESC-cardiomyocytes (CMs), and hFibs exhibit no adverse effects to STF-31 treatment at any time point examined (Figure 5D), treatment with STF-31 results in 80% cell death in hPSC by 48 hr and ∼100% cell death by 72 hr (Figure 5E). Following 72 hr treatment, culture dishes of hiPSCs and hESCs were examined for up to 5 days post-treatment, and no viable cells were detected (data not shown). Successful elimination of hPSCs was observed regardless of cell density (30%–100% tested; data not shown). To further assess the selectivity of STF-31 for hPSCs, we established cocultured hESCs and hFibs (seeded 50:50; maintained on Matrigel for 4 days prior to treatment). Following 72 hr of STF-31 (2.5 μM) treatment, OCT4 expression was undetectable by flow cytometry and immunocytochemistry, whereas the fibroblast marker DPP4 was unaffected (Figure 5F, top and middle). Quantitative real-time PCR analyses of the treated coculture confirmed these findings, where the fibroblast markers DPP4 and NRP1 exhibit a relative increase (due to loss of hPSCs) and pluripotency markers (OCT4 and NANOG) are reduced to background levels (Figure 5F, bottom). STF-31 could also be added to mixed cultures without killing hPSC-derived differentiated progeny. To demonstrate this, we generated monolayers of hESC-derived cardiomyocytes, which in our hands are ∼98% TNNT2+ by day 10 of differentiation (Figure 5G). After 72 hr of STF-31 (2.5 μM) treatment, we were unable to detect any effects on spontaneous contraction (rate or frequency) and by quantitative real-time PCR, cardiac transcripts NKX2.5 decreased from day 10 to day 13, whereas TNNT2 positivity remained constant. These results were indistinguishable from vehicle controls (Figure S2), thus indicating that STF-31, although toxic to hPSCs, spares a wide variety of progenitor and differentiated cells. Therefore, this resource also provides a “look-up directory” for the potential repurposing of drugs based on the direct knowledge of cell-surface protein abundance.

Discussion

Until the recent development of proteomic technologies for the direct measurement of the pool of cell-surface-exposed proteins such as the CSC technology, researchers had to rely on antibody-based probes for the sequential verification of cell-surface proteins. Microarray data partially enabled the inference of cell-surface expressed proteins, but discrepancies with flow cytometric data showed that posttranscriptional regulatory mechanisms need to be considered in order to obtain insights into the quantities of proteins in certain locations at a particular point in time. Thus, researchers lack a comprehensive and reliable resource of cell-surface exposed proteins for most human stem cells and progeny. Here, we directly measured the N-glycoprotein cell-surface proteome of hESCs and hiPSCs and provide an experimentally verified resource for the selection of lineage markers, markers for cell isolation, and the selection of reagents for the characterization of new hiPSC lines. Together with our analysis of hFibs, we have identified 719 surface proteins including 496 on hPSCs. Many of the identified proteins are barely detectable at the mRNA level depending on thresholds set forth or lack commercially available and validated antibodies that preclude their identification in antibody-based screens. Moreover, our analysis of the transcriptionally defined cell-surface proteome as predicted based on GO term annotations illustrate the limitations in relying solely on sometimes predicted annotations and transcriptional signals. A genome-scale location analysis using published data (Gifford et al., 2013) was performed to identify surface proteins that have promoters occupied by POU5F1(OCT4)/SOX2/NANOG (OSN). In total, 30 (6%) of the 496 proteins identified on the surface of hPSCs have promoters occupied by OSN (Table S7). An additional 47 were occupied by two factors. Of the 206 hPSC-restricted proteins, 16 (7%) are occupied by OSN. Transcriptional dogma would suggest that promoters occupied by OSN are critical to pluripotency, and the surface data are consistent with this fact, because none of the putative negative markers were occupied by OSN and only five of the hFib restricted were occupied by OSN. Although a direct role in pluripotency has not yet been described for these 16 OSN occupied PSC-restricted proteins, several have been implicated in early developmental processes or mechanisms with relevance to pluripotency, including CXADR (Dorner et al., 2005), GPR98 (McMillan et al., 2002), IL17RD (Torii et al., 2004), LINGO1 (Mi et al., 2004), LRIG1 (Laederich et al., 2004), and LRRN1 (Hossain et al., 2008). In the HSC field, immunophenotyping with antibodies has been critical to the evaluation of cell-surface proteins as surrogate markers of potency, function, immunological compatibility, and for the isolation of cells for therapeutic purposes. As we (Gundry et al., 2011, 2012) and others (Kahler et al., 2013; Kang et al., 2011; Manos et al., 2011) have demonstrated, accessible surface proteins are useful as markers for sorting and characterizing pluripotent cells. Application of the positive selection hPSC markers described in this resource provides the field with new targets suitable for improving the efficiency of isolation and characterization of high quality hiPSC colonies and/or those with defined differentiation potential. In addition to the positive PSC markers, these data reveal fibroblast markers (DPP4, NRP1, and COLEC12) useful for negative selection of hiPSCs from reprogrammed fibroblasts. Additional negative markers are likely to emerge as the cell types within the Cell Surface Protein Atlas expands or if less stringent data filtering are used (i.e., only require presence in hFib and absence in hPSCs, regardless of expression among other cell types). Precise mapping (time and quantity) of transcriptional and protein levels of these putative negative markers during reprogramming will ultimately influence the utility of these accessible proteins for selection purposes. The major bottleneck in using surface proteins for isolation and cell characterization is the lack of suitable antibodies or other affinity reagents that recognize native extracellular epitopes, a requirement for live cell selection. The information generated in this study partially overcome this limitation as the CSC technology experimentally verifies N-glycosylated peptides present in extracellular exposed domains of transmembrane and GPI-anchored proteins, which directly facilitates antigen design of surface epitopes useful for antibody production. Therefore, CSC technology lends itself for the immunophenotyping of cells without antibodies. The hPSC surface proteome resource described here can be further exploited to rationally identify accessible and putative drug targets on the surface of hESCs and hiPSCs. This might be especially useful for repurposing known drugs. Although GLUT1 is present on many cell types, hPSCs, unlike many other cells that often contain multiple glucose transporters or rely on oxidative phosphorylation, are believed to be highly reliant on glycolysis, analogous to the Warburg effect in cancer cells (Folmes et al., 2011; Varum et al., 2011). Of the cells tested in this study, the reported GLUT1 inhibitor STF-31 proved to be selectively toxic to hPSCs. Recently, using a small molecule screening approach, Ben-David et al. (2013a) discovered the utility of a small molecule inhibitor of oleic acid biosynthesis, PluriSIn, with selective toxicity for hPSCs. Although it is known that metabolism influences a cell’s decision to proliferate, differentiate, or remain quiescent (Shyh-Chang et al., 2013), these two studies firmly establish that small molecule-based metabolic inhibitors represent viable alternative strategies to suicide genes (Cao et al., 2007; Rong et al., 2012), SSEA-5, lactate, and Claudin-6-based approaches (Ben-David et al., 2013b; Tang et al., 2011; Tohyama et al., 2013). Furthermore, because another reported small molecule inhibitor of GLUT1 that we tested, WZB117, also proves toxic to hPSCs (data not shown), the use of other strategies to specifically inhibit GLUT1 may be broadly applicable for the removal of tumorigenic hPSCs from cultures of differentiating cells. In contrast to the previous reports that required alternative energy sources (Tohyama et al., 2013; Tomizawa et al., 2013), our study achieves selective hPSC elimination without alternative fuel substrates or general glucose starvation, making our findings more broadly useful to the community than previous approaches. The comprehensive and dynamic single cell hPSC surface proteome is a work in progress. The analyses used here to identify hPSC-restricted proteins are based on information currently available in our Cell Surface Protein Atlas (D.B.-F., A. Hofmann, T. Bock, A. Frei, F. Cerciello, A. Jacobs, H. Moest, U. Omasits, R.L.G., C. Yoon, R. Schiess, A. Schmidt, P. Mirkowska, A. Härtlová, J. van Eyk, J. Bourquin, R. Aebersold, K.R.B., P. Zandstra, and B.W., unpublished data). Because this database expands and includes cell types from more developmental stages and lineages, classifications for some proteins may change, resulting in some of the hPSC-restricted proteins identified here being reclassified as nonrestricted, as well as new hPSC-restricted (positive and negative) markers coming to light. As we have learned from comparisons made possible by the Cell Surface Protein Atlas as well as the work from the hematopoietic cell field, we recognize that individual surface proteins are generally not restricted to a single cell type, and that is the reason we also considered proteins that were “relatively” restricted to hPSCs (in fewer than three other cell types) in addition to the truly restricted as a way to broaden the scope and bring more potential markers to light. Thus, this resource serves as a step toward defining surface protein and antibody panels more selective for authentic pluripotent cells. Second, as with any discovery-based mass spectrometry analysis of complex samples, the failure to identify a protein in a particular sample is not conclusive evidence that the protein is absent, due to the limitations in peptide sampling in the instrument. This is a well-recognized limitation in mass-spectrometry-based discovery approaches (Gingras et al., 2005) and can be exacerbated in the case of low abundance proteins (e.g., cell-surface proteins). Thus, the few putative differences between hESCs and hiPSCs based only on CSC technology data are not emphasized here but are included in Table S1 for completeness. Third, proteomic technologies are valuable for describing the average proteome present at a selected point in time or more simply a snapshot of proteins present on the cell surface at time of labeling. However, the presence of a protein or change in abundance of a protein with perturbation does not inherently reflect the biological relevance of the protein for a particular cell type. Nevertheless, it is predicted that proteins such as IL17RD, LINGO1, and LRRN1, which are among the most restricted to hPSCs, present in highest levels, and have promoters occupied by OSN, will be included as high-priority candidates going forward. Of note for functional relevance are LINGO1 and IL17RD as they are linked with inhibitory functions that may contribute to the maintenance of the undifferentiated state of PSCs. Moreover, although this data set is restricted to N-linked glycoproteins, it is predicted that ∼90% of all cell-surface proteins are predicted to be glycosylated (Apweiler et al., 1999), and thus these data should represent a majority of the surface proteins present. In conclusion, application of the CSC technology provides direct evidence of the pool of N-glycoproteins detectable at the hPSC surface. The bioinformatics strategy chosen based on the integration of disparate data sets provides a good filtering mechanism for the identification of sensible candidates for practical (i.e., immunophenotyping) and functional studies, not only for the hPSC populations studied here, but for a broad range of other stem cell types. Furthermore, we show that small molecules and/or inhibitors selected based on our resource could be further exploited for the depletion of cell populations in a heterogeneous cell context, such as shown here for the depletion of pluripotent cells, without the expense of a large-scale screen or gene targeting approaches. This protein-centric resource provides experimental evidence for location-specific proteins which, when further developed in the form of surface protein “barcodes” (Gundry et al., 2011), might help to delineate cellular lineages even further based on quantitative panels of cell-surface markers. A detailed understanding of the functional roles of the detected cell-surface markers in developmental and environmental interactions is certainly a prerequisite to understanding hiPSCs and ensuring their reproducible production for therapeutic purposes. Going forward, this technology and the resource described here will serve as a platform for future comparisons of stem cells of other origins. We acknowledge that extensive work will be necessary to refine the protein candidates described in this resource to arrive at the most effective and informative markers for isolating the highest quality pluripotent cells. This resource, which contains >100 proteins not previously described on the surface of hPSCs, should therefore provide a major step in a series of studies designed to define the optimum marker combinations for high throughput isolation and quality control studies of hPSCs.

Experimental Procedures

Cell Culture

Generation of hiPSCs, cultivation of hiPSCs (KB3, DF6-9-9T [WiCell], hiPSC2a [Si-Tayeb et al., 2010]), hESC lines (H1, H9 [WiCell]), hFibs, bone marrow-derived mesenchymal stem cells (hMSCs), and differentiation of cardiomyocytes are described in the Supplement Information. The CSC technology was applied to H9 hESCs and KB3 hiPSCs. The flow cytometry and immunofluorescence staining were performed on DF-6-9T hiPSCs and H1 hESCs.

Cell-Surface Capture: CSC Technology

Approximately 0.6-1E8 cells per biological replicate (n ≥ 3) of H9 hESCs, KB3 hiPSCs, and hFibs were taken through the CSC technology workflow as reported previously (Gundry et al., 2009, 2012; Hofmann et al., 2010; Wollscheid et al., 2009) with details provided in the Supplemental Information.

STF-31 Studies

Cells were treated with 0, 2.5, and 5 μM STF-31 (4-[[[[4-(1,1-Dimethylethyl)phenyl]sulfonyl]amino]methyl]-N-3-pyridinylbenzamide, Tocris Bioscience) for 96 hr to test cell viability with treatment. RNA was collected for quantitative real-time PCR, or cells were stained for DPP4 or OCT4. Day 10 hESC-CMs were treated for 48 hr with 0, 2.5, and 5 μM STF-31 treatment, and RNA was collected at 72 hr (day 13 of differentiation) for quantitative real-time PCR analysis of TNNT2 and NKX2.5. In vitro cell toxicity was determined using a neutral red uptake assay as previously described (Steer et al., 2006).

Transcriptome Data

Microarray data were obtained in two laboratories on H9, HiPSC2a, and KB3 cells, as described in the Supplemental Information. To obtain a single expression value for visualization in Figure 2, values for a single probe were averaged among all H9 and KB3 replicates analyzed using the Illumina system; if multiple probes per gene were measured, only the highest value was included.

Quantitative Real-Time PCR, Flow Cytometry, and Immunocytochemistry

Details are provided in the Supplemental Information and Table S8.
  58 in total

Review 1.  Advances in protein complex analysis using mass spectrometry.

Authors:  Anne-Claude Gingras; Ruedi Aebersold; Brian Raught
Journal:  J Physiol       Date:  2004-12-20       Impact factor: 5.182

2.  Drug-target network.

Authors:  Muhammed A Yildirim; Kwang-Il Goh; Michael E Cusick; Albert-László Barabási; Marc Vidal
Journal:  Nat Biotechnol       Date:  2007-10       Impact factor: 54.908

3.  Sustained TCR signaling is required for mitogen-activated protein kinase activation and degranulation by cytotoxic T lymphocytes.

Authors:  N N Berg; L G Puente; W Dawicki; H L Ostergaard
Journal:  J Immunol       Date:  1998-09-15       Impact factor: 5.422

4.  Molecular imaging of embryonic stem cell misbehavior and suicide gene ablation.

Authors:  Feng Cao; Micha Drukker; Shuan Lin; Ahmad Y Sheikh; Xiaoyan Xie; Zongjin Li; Andrew J Connolly; Irving L Weissman; Joseph C Wu
Journal:  Cloning Stem Cells       Date:  2007

5.  Expression of the GLUT1 and GLUT9 facilitative glucose transporters in embryonic chondroblasts and mature chondrocytes in ovine articular cartilage.

Authors:  A Mobasheri; H Dobson; S L Mason; F Cullingham; M Shakibaei; J F Moley; K H Moley
Journal:  Cell Biol Int       Date:  2005-03-17       Impact factor: 3.612

6.  Directed differentiation of dopaminergic neuronal subtypes from human embryonic stem cells.

Authors:  Yiping Yan; Dali Yang; Ewa D Zarnowska; Zhongwei Du; Brian Werbel; Chuck Valliere; Robert A Pearce; James A Thomson; Su-Chun Zhang
Journal:  Stem Cells       Date:  2005 Jun-Jul       Impact factor: 6.277

7.  Coxsackievirus-adenovirus receptor (CAR) is essential for early embryonic cardiac development.

Authors:  Armin A Dorner; Frank Wegmann; Stefan Butz; Karen Wolburg-Buchholz; Hartwig Wolburg; Andreas Mack; Ines Nasdala; Benjamin August; Jürgen Westermann; Fritz G Rathjen; Dietmar Vestweber
Journal:  J Cell Sci       Date:  2005-08-01       Impact factor: 5.285

8.  The leucine-rich repeat protein LRIG1 is a negative regulator of ErbB family receptor tyrosine kinases.

Authors:  Melanie B Laederich; Melanie Funes-Duran; Lily Yen; Ellen Ingalla; Xiuli Wu; Kermit L Carraway; Colleen Sweeney
Journal:  J Biol Chem       Date:  2004-09-01       Impact factor: 5.157

9.  Immunohistochemical localization of murine stage-specific embryonic antigens in human testicular germ cell tumors.

Authors:  I Damjanov; N Fox; B B Knowles; D Solter; P H Lange; E E Fraley
Journal:  Am J Pathol       Date:  1982-08       Impact factor: 4.307

10.  Interleukin-1 stimulates beta-cell necrosis and release of the immunological adjuvant HMGB1.

Authors:  Sarah A Steer; Anna L Scarim; Kari T Chambers; John A Corbett
Journal:  PLoS Med       Date:  2005-12-20       Impact factor: 11.069

View more
  45 in total

1.  Inhibition of an NAD⁺ salvage pathway provides efficient and selective toxicity to human pluripotent stem cells.

Authors:  Erin M Kropp; Bryndon J Oleson; Katarzyna A Broniowska; Subarna Bhattacharya; Alexandra C Chadwick; Anne R Diers; Qinghui Hu; Daisy Sahoo; Neil Hogg; Kenneth R Boheler; John A Corbett; Rebekah L Gundry
Journal:  Stem Cells Transl Med       Date:  2015-04-01       Impact factor: 6.940

2.  Stem Cell Surface Marker Expression Defines Late Stages of Reprogramming to Pluripotency in Human Fibroblasts.

Authors:  Jordan E Pomeroy; Shelley R Hough; Kathryn C Davidson; Alex M Quaas; Jordan A Rees; Martin F Pera
Journal:  Stem Cells Transl Med       Date:  2016-05-09       Impact factor: 6.940

3.  Creation of Cardiac Tissue Exhibiting Mechanical Integration of Spheroids Using 3D Bioprinting.

Authors:  Chin Siang Ong; Takuma Fukunishi; Andrew Nashed; Adriana Blazeski; Huaitao Zhang; Samantha Hardy; Deborah DiSilvestre; Luca Vricella; John Conte; Leslie Tung; Gordon Tomaselli; Narutoshi Hibino
Journal:  J Vis Exp       Date:  2017-07-02       Impact factor: 1.355

4.  A Combined Omics Approach to Generate the Surface Atlas of Human Naive CD4+ T Cells during Early T-Cell Receptor Activation.

Authors:  Anke Graessel; Stefanie M Hauck; Christine von Toerne; Edda Kloppmann; Tatyana Goldberg; Herwig Koppensteiner; Michael Schindler; Bettina Knapp; Linda Krause; Katharina Dietz; Carsten B Schmidt-Weber; Kathrin Suttner
Journal:  Mol Cell Proteomics       Date:  2015-05-19       Impact factor: 5.911

5.  Targeting glucose transport and the NAD pathway in tumor cells with STF-31: a re-evaluation.

Authors:  Dominik Kraus; Jan Reckenbeil; Nadine Veit; Stefan Kuerpig; Michael Meisenheimer; Imke Beier; Helmut Stark; Jochen Winter; Rainer Probstmeier
Journal:  Cell Oncol (Dordr)       Date:  2018-06-11       Impact factor: 6.730

6.  Doxorubicin Blocks Cardiomyocyte Autophagic Flux by Inhibiting Lysosome Acidification.

Authors:  Dan L Li; Zhao V Wang; Guanqiao Ding; Wei Tan; Xiang Luo; Alfredo Criollo; Min Xie; Nan Jiang; Herman May; Viktoriia Kyrychenko; Jay W Schneider; Thomas G Gillette; Joseph A Hill
Journal:  Circulation       Date:  2016-03-16       Impact factor: 29.690

7.  Cell Surface Proteomics of N-Linked Glycoproteins for Typing of Human Lymphocytes.

Authors:  Nicole A Haverland; Matthew Waas; Ioanna Ntai; Theodore Keppel; Rebekah L Gundry; Neil L Kelleher
Journal:  Proteomics       Date:  2017-10       Impact factor: 3.984

8.  SurfaceGenie: a web-based application for prioritizing cell-type-specific marker candidates.

Authors:  Matthew Waas; Shana T Snarrenberg; Jack Littrell; Rachel A Jones Lipinski; Polly A Hansen; John A Corbett; Rebekah L Gundry
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

9.  N-glycoprotein surfaceomes of four developmentally distinct mouse cell types.

Authors:  Erin M Kropp; Subarna Bhattacharya; Matthew Waas; Sandra L Chuppa; Anna-Katerina Hadjantonakis; Kenneth R Boheler; Rebekah L Gundry
Journal:  Proteomics Clin Appl       Date:  2014-08       Impact factor: 3.494

Review 10.  Global and site-specific analysis of protein glycosylation in complex biological systems with Mass Spectrometry.

Authors:  Haopeng Xiao; Fangxu Sun; Suttipong Suttapitugsakul; Ronghu Wu
Journal:  Mass Spectrom Rev       Date:  2019-01-03       Impact factor: 10.946

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.