Literature DB >> 19820112

Dynamic Proteomics: a database for dynamics and localizations of endogenous fluorescently-tagged proteins in living human cells.

Milana Frenkel-Morgenstern1, Ariel A Cohen, Naama Geva-Zatorsky, Eran Eden, Jaime Prilusky, Irina Issaeva, Alex Sigal, Cellina Cohen-Saidon, Yuvalal Liron, Lydia Cohen, Tamar Danon, Natalie Perzov, Uri Alon.   

Abstract

Recent advances allow tracking the levels and locations of a thousand proteins in individual living human cells over time using a library of annotated reporter cell clones (LARC). This library was created by Cohen et al. to study the proteome dynamics of a human lung carcinoma cell-line treated with an anti-cancer drug. Here, we report the Dynamic Proteomics database for the proteins studied by Cohen et al. Each cell-line clone in LARC has a protein tagged with yellow fluorescent protein, expressed from its endogenous chromosomal location, under its natural regulation. The Dynamic Proteomics interface facilitates searches for genes of interest, downloads of protein fluorescent movies and alignments of dynamics following drug addition. Each protein in the database is displayed with its annotation, cDNA sequence, fluorescent images and movies obtained by the time-lapse microscopy. The protein dynamics in the database represents a quantitative trace of the protein fluorescence levels in nucleus and cytoplasm produced by image analysis of movies over time. Furthermore, a sequence analysis provides a search and comparison of up to 50 input DNA sequences with all cDNAs in the library. The raw movies may be useful as a benchmark for developing image analysis tools for individual-cell dynamic-proteomics. The database is available at http://www.dynamicproteomics.net/.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19820112      PMCID: PMC2808965          DOI: 10.1093/nar/gkp808

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

To study the proteome of living human cells requires quantification of the levels and localization of thousands of proteins in space and time (1–4). Proteome dynamics are crucial for understanding, for instance, the effects of drugs on cells (5–7). A recent advance in molecular cell biology and image analysis, reported by Cohen et al. allows measuring the proteome dynamics in individual living human cancer cells (1). Cohen et al. studied the levels of over a thousand unique proteins in single cells of a human lung carcinoma line treated with the anti-cancer drug, camptothecin (CPT) (1). The study was focused on a library of annotated reporter cell clones (LARC), in which each clone has a fluorescently tagged protein expressed form its native chromosomal location. The LARC library was created using the ‘CD tagging’ approach (2–4,8–12), in which a retrovirus inserts a fluorescent label (for example, enhanced yellow fluorescent protein, eYFP or Venus) into the intron of a protein-encoding gene. Protein identity was established by sequencing the mRNA downstream of the eYFP insertion site (3′ RACE). The LARC library contains about 1200 unique tagged proteins. Cohen et al. used the time-lapse fluorescence microscopy to study protein fluorescence levels and changes in the subcellular localizations for 48 hours after drug treatment (1). We present here the Dynamics Proteomics database, which is a compendium of endogenously tagged human proteins studied by Cohen et al. (1), and their time-lapse microscopy movies that illustrate the protein dynamics in space and in time in individual living human cancer cells in response to an anti-cancer drug CPT (1–4). The database provides the annotation of the tagged proteins, alignment of protein dynamics for proteins of interest, sequence search and comparison of up to 50 input sequences to all the cDNAs in the library. The web interface enables a visual overview of the available proteins in the library, fluorescent and phase images of their intracellular localizations and time-lapse microscopy movies.

DATABASE CONTENT AND STATISTICS

The database version 1.0 incorporates the LARC library (1), which includes 2189 cell-line clones, 1144 unique characterized proteins and over 150 uncharacterized proteins. Each clone contains a different protein fused to a fluorescent protein (eYFP or Venus), expressed from its endogenous chromosomal location under its natural regulation (1). The Dynamic Proteomics annotation is produced by the similarity search of cDNA sequences against public databases of human DNA sequences using BLAST-Like Alignment Tool (BLAT) (13). For each database entry, all BLAT hits are recorded with database IDs, and information in text is used for the manual QA process. Additional automatic PERL programs produce an extended protein annotation from the Ensembl (14) and Entrez (NCBI) databases for entries selected according to QA standards. A database entry contains a protein description, cellular process and function, and the chromosomal position of the integrated fluorescent tag. Moreover, the entry page includes phase and fluorescence images of protein localization in the cell, a manual annotation of the observed localization patterns, time-lapse movies, and protein dynamics. The protein dynamics is a quantitative trace of the tagged-protein fluorescence levels in the nucleus and cytoplasm obtained by image analysis of the movies over time (1). Tagged proteins in the library have different cell localizations, including cytoplasm, cytoskeleton, nucleus, nucleolus, endoplasmic reticulum, Golgi, mitochondrion, plasma membrane and others (Figure 1A). These localizations are distributed among the clones similarly to the distribution of known human proteins annotated in the GO database (Figure 1B). We found that the uncharacterized proteins tagged in the library (proteins annotated as hypothetical, or proteins encoded from regions in the genome denoted as ESTs and mRNAs), have a distribution of localizations similar to that of the characterized proteins in the library (Figure 1C).
Figure 1.

Tagged proteins in the LARC library are found across all cellular localizations. The localization distribution is similar to that of all known proteins. Represented are distributions of cellular localizations for: (A) all proteins in the Dynamic Proteomics database with published localizations; (B) all proteins in the GO database; (C) uncharacterized proteins in the database based on manual inspection. These proteins have no available published localization.

Tagged proteins in the LARC library are found across all cellular localizations. The localization distribution is similar to that of all known proteins. Represented are distributions of cellular localizations for: (A) all proteins in the Dynamic Proteomics database with published localizations; (B) all proteins in the GO database; (C) uncharacterized proteins in the database based on manual inspection. These proteins have no available published localization.

DYNAMIC PROTEOMICS INTERFACE

The Dynamic Proteomics interface (based on MySQL) is designed to make it easy for users to find genes of interest. The database offers a search for gene names (e.g. LMNA), DNA sequences (e.g. ATGGGAAAGAAAACCAAGCGGAC), protein description (e.g. ‘synthetases’), image or published localization (e.g. ‘cytoplasm’), exon-tag insertion point (e.g. ‘intron 1’). In addition, it provides an alignment of protein dynamics for user-defined gene names (e.g. ACTN4 GARS TOP1, separated by ‘space’) from the ‘Search Dynamics’ page. When a gene of interest has been selected, or if a query that entered matches a certain gene, the user is directed to the database entry page (for example, LMNA, Figure 2). This page is the primary interface for viewing the annotation of tagged proteins in the library, its fluorescent and phase images, protein dynamics, exon-tag insertion point, a protein sequence and references to the time-lapse microscopy movies. In addition, protein movies and dynamics can be viewed from the ‘All Movies/Dynamics’ page linked from the menu. It provides a list of all available movies for library clones, a search and an alignment of dynamics for the proteins of interest (Figure 3). Usually, one to six movies are presented for each protein in the database. Movies were selected manually according to QA standards (e.g. sufficient number of cells per field of view). Quantitative protein dynamics are displayed for available movies.
Figure 2.

Example of the database entry page with a detailed protein annotation for LMNA (Lamin A/C isoform 2). The protein is localized in nucleus as it can be seen on the fluorescent image (white ‘beans’). This entry includes links to four microscopy movies. In addition, the clone ID, the published and image localizations, the protein description and annotation, the eYFP insertion point, the protein dynamics following drug addition, and links to other public databases are shown on the database entry page.

Figure 3.

The alignment of protein dynamics for ACTN4 (actinin, alpha 4), GARS (glycyl-tRNA synthetase) and TOP1 (DNA topoisomerase I). (A) The dynamics is presented for the individual proteins normalized to the total fluorescence at time t = 0. (B) The alignment of protein dynamics in cytoplasm, nucleus and total protein dynamics are presented for all three proteins together.

Example of the database entry page with a detailed protein annotation for LMNA (Lamin A/C isoform 2). The protein is localized in nucleus as it can be seen on the fluorescent image (white ‘beans’). This entry includes links to four microscopy movies. In addition, the clone ID, the published and image localizations, the protein description and annotation, the eYFP insertion point, the protein dynamics following drug addition, and links to other public databases are shown on the database entry page. The alignment of protein dynamics for ACTN4 (actinin, alpha 4), GARS (glycyl-tRNA synthetase) and TOP1 (DNA topoisomerase I). (A) The dynamics is presented for the individual proteins normalized to the total fluorescence at time t = 0. (B) The alignment of protein dynamics in cytoplasm, nucleus and total protein dynamics are presented for all three proteins together.

LINKS TO OTHER DATABASES

Each entry in the Dynamic Proteomics is associated with a cDNA sequence, a detailed protein annotation obtained from the Ensembl (14) and Entrez (NCBI) databases. In additional, each protein has links to GeneCards (15), InterPro (16) and UniProt (17) for the complete protein sequence information.

QUANTITATIVE PROTEIN DYNAMICS

The presentation and alignment of protein dynamics are produced by PERL script running online. The protein dynamics alignment is obtained by placing each profile onto a common time scale (hours, following drug addition) with zero corresponding to the drug-addition time point for the corresponding clone. All fluorescence levels are normalized to the total fluorescence at time point zero (t = 0) (Figure 3). Such normalization is helpful because fluorescence levels vary between different proteins. Details on the image analysis methods used for producing protein dynamics is presented by Cohen et al. and Sigal et al. (1–4). Figure 3 displays the protein-dynamics alignment results for proteins: ACTN4 (actinin, alpha 4), GARS (glycyl-tRNA synthetase) and TOP1 (DNA topoisomerase I).

SEQUENCE COMPARISON TOOL

Using the MySQL database and the FASTA sequence comparison tool (18), the ‘Sequence Analysis’ page provides a powerful search engine for user sequences. Users can enter up to 50 different DNA sequences and search over all cDNAs in the database. The query parameters are passed to the search engine using CGI. Query results are presented with a corresponding query or gene name. The results refer to the proteins in the Dynamic Proteomics database, links to their movies and dynamics. The alignment of proteins dynamics can be obtained from the result page.

RAW MOVIES DATA

The Dynamic Proteomics database provides the raw data of the time-lapse microscopy experiments for 50 proteins at the ‘Raw Data’ page. For all other proteins, the data are available from the authors. The data includes three types of movies: cell background (fluorescently mCherry-colored), the protein fluorescent (eYFP- or Venus-colored) and phase (1). The raw movies may be useful as a benchmark dataset for developing image analysis tools.

DATABASE ACCESS AND FEEDBACK

The Dynamic Proteomics databases can be accessed online (http://www.dynamicproteomics.net or http://www.weizmann.ac.il/mcb/UriAlon/DynamProt/). The database is regularly updated with new clones, images, movies and dynamics. The ftp access to a bulk download of all images and movies in the database is provided from the public directory: ftp://alon-serv.weizmann.ac.il/pub/dynamprot/. Statistics are available with each update at the ‘Statistics’ page. We consider user feedback as extremely valuable. Please contact us at dynamicproteomics@gmail.com.

OUTLOOK

The Dynamic Proteomics version 1.0 database contains more than 2180 fluorescently tagged proteins in the H1299 non-small lung cell carcinoma line. We expect this number to increase, and new cell lines to be tagged. We invite other authors to submit their measured or calculated protein dynamics using the raw data provided by the database.

ACKNOWLEDGEMENTS

The authors thank the Kahn Family Foundation and the Israel Science Foundation for the project support. M.F.M. and E.E. are supported by the Horowitz Center for Complexity Science. The authors also thank Pierre Choukroun and Michael Green for the UNIX system administration; Malka Cymbalista and Shlomit Afgin for a technical help in the website and database construction.

FUNDING

Funding for open access charge: The Kahn Family Foundation and the Israel Science Foundation. Conflict of interest statement. None declared.
  17 in total

1.  A protein trap strategy to detect GFP-tagged proteins expressed from their endogenous loci in Drosophila.

Authors:  X Morin; R Daneman; M Zavortink; W Chia
Journal:  Proc Natl Acad Sci U S A       Date:  2001-12-11       Impact factor: 11.205

2.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

3.  Dynamic proteomics in individual human cells uncovers widespread cell-cycle dependence of nuclear proteins.

Authors:  Alex Sigal; Ron Milo; Ariel Cohen; Naama Geva-Zatorsky; Yael Klein; Inbal Alaluf; Naamah Swerdlin; Natalie Perzov; Tamar Danon; Yuvalal Liron; Tal Raveh; Anne E Carpenter; Galit Lahav; Uri Alon
Journal:  Nat Methods       Date:  2006-07       Impact factor: 28.547

4.  Variability and memory of protein levels in human cells.

Authors:  Alex Sigal; Ron Milo; Ariel Cohen; Naama Geva-Zatorsky; Yael Klein; Yuvalal Liron; Nitzan Rosenfeld; Tamar Danon; Natalie Perzov; Uri Alon
Journal:  Nature       Date:  2006-11-19       Impact factor: 49.962

5.  Integrating high-content screening and ligand-target prediction to identify mechanism of action.

Authors:  Daniel W Young; Andreas Bender; Jonathan Hoyt; Elizabeth McWhinnie; Gung-Wei Chirn; Charles Y Tao; John A Tallarico; Mark Labow; Jeremy L Jenkins; Timothy J Mitchison; Yan Feng
Journal:  Nat Chem Biol       Date:  2007-12-09       Impact factor: 15.040

6.  Functional classification of drugs by properties of their pairwise interactions.

Authors:  Pamela Yeh; Ariane I Tschumi; Roy Kishony
Journal:  Nat Genet       Date:  2006-03-19       Impact factor: 38.330

7.  Dynamic proteomics of individual cancer cells in response to a drug.

Authors:  A A Cohen; N Geva-Zatorsky; E Eden; M Frenkel-Morgenstern; I Issaeva; A Sigal; R Milo; C Cohen-Saidon; Y Liron; Z Kam; L Cohen; T Danon; N Perzov; U Alon
Journal:  Science       Date:  2008-11-20       Impact factor: 47.728

8.  Generation of a fluorescently labeled endogenous protein library in living human cells.

Authors:  Alex Sigal; Tamar Danon; Ariel Cohen; Ron Milo; Naama Geva-Zatorsky; Gila Lustig; Yuvalal Liron; Uri Alon; Natalie Perzov
Journal:  Nat Protoc       Date:  2007       Impact factor: 13.491

9.  InterPro: the integrative protein signature database.

Authors:  Sarah Hunter; Rolf Apweiler; Teresa K Attwood; Amos Bairoch; Alex Bateman; David Binns; Peer Bork; Ujjwal Das; Louise Daugherty; Lauranne Duquenne; Robert D Finn; Julian Gough; Daniel Haft; Nicolas Hulo; Daniel Kahn; Elizabeth Kelly; Aurélie Laugraud; Ivica Letunic; David Lonsdale; Rodrigo Lopez; Martin Madera; John Maslen; Craig McAnulla; Jennifer McDowall; Jaina Mistry; Alex Mitchell; Nicola Mulder; Darren Natale; Christine Orengo; Antony F Quinn; Jeremy D Selengut; Christian J A Sigrist; Manjula Thimma; Paul D Thomas; Franck Valentin; Derek Wilson; Cathy H Wu; Corin Yeats
Journal:  Nucleic Acids Res       Date:  2008-10-21       Impact factor: 16.971

10.  The Universal Protein Resource (UniProt) 2009.

Authors: 
Journal:  Nucleic Acids Res       Date:  2008-10-04       Impact factor: 16.971

View more
  14 in total

1.  Importin 7 and exportin 1 link c-Myc and p53 to regulation of ribosomal biogenesis.

Authors:  Lior Golomb; Debora Rosa Bublik; Sylvia Wilder; Reinat Nevo; Vladimir Kiss; Kristina Grabusic; Sinisa Volarevic; Moshe Oren
Journal:  Mol Cell       Date:  2012-01-27       Impact factor: 17.970

Review 2.  Spatial proteomics: a powerful discovery tool for cell biology.

Authors:  Emma Lundberg; Georg H H Borner
Journal:  Nat Rev Mol Cell Biol       Date:  2019-05       Impact factor: 94.444

Review 3.  The emergence of proteome-wide technologies: systematic analysis of proteins comes of age.

Authors:  Michal Breker; Maya Schuldiner
Journal:  Nat Rev Mol Cell Biol       Date:  2014-06-18       Impact factor: 94.444

4.  Identifying biological network structure, predicting network behavior, and classifying network state with High Dimensional Model Representation (HDMR).

Authors:  Miles A Miller; Xiao-Jiang Feng; Genyuan Li; Herschel A Rabitz
Journal:  PLoS One       Date:  2012-06-18       Impact factor: 3.240

5.  A computational algorithm for functional clustering of proteome dynamics during development.

Authors:  Yaqun Wang; Ningtao Wang; Han Hao; Yunqian Guo; Yan Zhen; Jisen Shi; Rongling Wu
Journal:  Curr Genomics       Date:  2014-06       Impact factor: 2.236

6.  Dynamic proteomics reveals bimodal protein dynamics of cancer cells in response to HSP90 inhibitor.

Authors:  Anat Zimmer; Shlomit Amar-Farkash; Tamar Danon; Uri Alon
Journal:  BMC Syst Biol       Date:  2017-03-07

7.  A novel single-cell screening platform reveals proteome plasticity during yeast stress responses.

Authors:  Michal Breker; Melissa Gymrek; Maya Schuldiner
Journal:  J Cell Biol       Date:  2013-03-18       Impact factor: 10.539

8.  Chimeras taking shape: potential functions of proteins encoded by chimeric RNA transcripts.

Authors:  Milana Frenkel-Morgenstern; Vincent Lacroix; Iakes Ezkurdia; Yishai Levin; Alexandra Gabashvili; Jaime Prilusky; Angela Del Pozo; Michael Tress; Rory Johnson; Roderic Guigo; Alfonso Valencia
Journal:  Genome Res       Date:  2012-05-15       Impact factor: 9.043

9.  LoQAtE--Localization and Quantitation ATlas of the yeast proteomE. A new tool for multiparametric dissection of single-protein behavior in response to biological perturbations in yeast.

Authors:  Michal Breker; Melissa Gymrek; Ofer Moldavski; Maya Schuldiner
Journal:  Nucleic Acids Res       Date:  2013-10-22       Impact factor: 16.971

Review 10.  Assemblages: functional units formed by cellular phase separation.

Authors:  Jeffrey A Toretsky; Peter E Wright
Journal:  J Cell Biol       Date:  2014-09-01       Impact factor: 10.539

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.