Literature DB >> 20944583

ProHits: integrated software for mass spectrometry-based interaction proteomics.

Guomin Liu, Jianping Zhang, Brett Larsen, Chris Stark, Ashton Breitkreutz, Zhen-Yuan Lin, Bobby-Joe Breitkreutz, Yongmei Ding, Karen Colwill, Adrian Pasculescu, Tony Pawson, Jeffrey L Wrana, Alexey I Nesvizhskii, Brian Raught, Mike Tyers, Anne-Claude Gingras.   

Abstract

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20944583      PMCID: PMC2957308          DOI: 10.1038/nbt1010-1015

Source DB:  PubMed          Journal:  Nat Biotechnol        ISSN: 1087-0156            Impact factor:   54.908


× No keyword cloud information.
Affinity purification coupled with mass spectrometric identification (AP-MS) is now a method of choice for charting novel protein-protein interactions, and has been applied to a large number of both small scale and high-throughput studies1. However, general and intuitive computational tools for sample tracking, AP-MS data analysis, and annotation have not kept pace with rapid methodological and instrument improvements. To address this need, we developed the ProHits LIMS platform. ProHits is a complete open source software solution for MS-based interaction proteomics that manages the entire pipeline from raw MS data files to fully annotated protein-protein interaction datasets. ProHits was designed to provide an intuitive user interface from the biologist's perspective, and can accommodate multiple instruments within a facility, multiple user groups, multiple laboratory locations, and any number of parallel projects. ProHits can manage all project scales, and supports common experimental pipelines, including those utilizing gel-based separation, gel-free analysis, and multi-dimensional protein or peptide separation. ProHits is a client-based HTML program written in PHP that runs a MySQL database on a dedicated server. The complete ProHits software solution consists of two main components: a Data Management module, and an Analyst module (Fig. 1a; see Supplementary Fig. 1 for data structure tables). These modules are supported by an Admin Office module, in which projects, instruments, user permissions and protein databases are managed (Supplementary Fig. 2). A simplified version of the software suite (“ProHits Lite”), consisting only of the Analyst module and Admin Office, is also available for users with pre-existing data management solutions or who receive pre-computed search results from analyses performed in a core MS facility (Supplementary Fig. 3). A step-by-step installation package, installation guide and user manual (see Supplementary Information) are available on the ProHits website (www.prohitsMS.com).
Figure 1

Overview of ProHits. (a) Modular organisation of ProHits. The Data Management module backs up all raw mass spectrometry data from acquisition computers, and handles data conversion and database searches. The Analyst module organizes data by project, bait, experiment and sample (gel-free project shown; see Supplementary Fig. 8 for gel-based organization). Search results from the Data Management module are parsed to individual samples defined within the Analyst module. ProHits can handle large collaborative projects, and offers several security layers. In the Analyst module, several view, filter and export functions enable data analysis. Functions provided by external software are listed on the right. (b) ProHits Comparison page. Left: Filtered Comparison results for four human baits and one negative control (see Supplementary Fig. 17 online for unfiltered data). Display, sort, filter and literature overlap options are listed on the top; selected options in this example are shown in red. Filtered results are displayed at the bottom of the page. Columns represent individual baits. Comparison at the Experiment or Sample levels is also possible. Rows list the hits that pass selected filters. Color-coding and intensity in each cell is based in the property selected for visualization, shown for this example as total peptide numbers; mouse-overs of each cell will list all properties. A star or triangle inside the cell indicates an interaction identified in previous high-throughput (star) or low-throughput (triangle) studies in BioGRID. Each term in the hits column is hyperlinked to external databases (Entrez Gene, BioGRID or NCBI Protein) or to the list of identified peptides. Right, top: Visualization of data in Cytoscape with mass spectrometric information encoded as an edge attribute. Interactions detected for the example bait protein WASL that are not reported in BioGRID are shown as blue edges with color intensity mapped spectral counts and thickness mapped to number of unique peptides; overlap interactions detected in both the experiment and in BioGRID are shown in green; interactions detected only in BioGRID are shown in grey. Right, bottom: Example of the Peptide View for the protein WIPF3 in the WASL AP-MS experiment.

In the Data Management module, raw data from all mass spectrometers in a facility or user group are copied to a single secure storage location in a scheduled manner. Data are organized in an instrument-specific manner, with folder and file organization mirroring the organization on the acquisition computer. ProHits also assigns unique identifiers to each folder and file. Log files and visual indicators of current connection status assist in monitoring the entire system. The Data Management module monitors the use of each instrument for reporting purposes (Supplementary Fig. 4–5). Raw MS files can be automatically converted to appropriate file formats using the open source ProteoWizard converters (http://proteowizard.sourceforge.net/). Converted files may be subjected to manual or automated database searches, followed by statistical analysis of the search results, according to any user-defined schedule; search engine parameters are also recorded to facilitate reporting and compliance with MIAPE guidelines2. Mascot3, X!Tandem4 and the TransProteomics Pipeline (TPP5) are fully integrated with ProHits via linked search engine servers (Supplementary Fig. 6–7). The Analyst module organizes data by project, bait, experiment and/or sample, for gel-based or gel-free approaches (Fig. 1a; for description of a gel-based project, see Supplementary Fig. 8). To create and analyze a gel-free affinity purification sample, the user specifies the bait gene name and species. ProHits automatically retrieves the amino acid sequence and other annotation from its associated database. Bait annotation may then be modified as necessary, for example to specify the presence of an epitope tag or mutation (Supplementary Fig. 9). A comprehensive annotation page tracks experimental details (Supplementary Fig. 10), including descriptions of the Sample, Affinity Purification protocol, Peptide Preparation methodology, and LC-MS/MS procedures. Controlled vocabulary lists for experimental descriptions can be added via drop-down menus to facilitate compliance with annotation guidelines such as MIAPE6 and MIMIx7, and to facilitate the organization and retrieval of data files. Free text notes for cross-referencing laboratory notebook pages, adding experimental details not captured in other sections, describing deviations from reference protocols and links to gel images or other file types may be added in the Experimental Detail page. Once an experiment is created, multiple samples may be linked to it, for example technical replicates of the same sample, or chromatographic fractions derived from the same preparation. All baits, experiments, samples and protocols are assigned unique identifiers. Once a sample is created, it is linked to both the relevant raw files and database search results. For multiple samples in HTP projects, automatic sample annotation may be established by using a standardized file naming system (Supplementary Fig. 11), or files may be manually linked. Alternatively, search results obtained outside of ProHits (with the X!Tandem or Mascot search engines) can be manually imported into the Analyst module (Supplementary Fig. 12). The ProHits Lite version enables uploading of external search results for users with an established MS data management system. In the Analyst module, mass spectrometry data can be explored in an intuitive manner, and results from individual samples, experiments or baits can be viewed and filtered (Supplementary Fig. 13–14). A user interface enables alignment of data from multiple baits or MS analyses using the Comparison tool. Data from individual MS runs, or derived from any user-defined sample group, are selected for visualization in a tabular format, for side-by-side comparisons (Fig. 1b; Supplementary Fig. 15–17). In the Comparison view, control groups and individual baits, experiments or samples are displayed by column. Proteins identified in each MS run or group of runs are displayed by row, and each cell corresponds to a putative protein hit, according to user-specified database search score cutoff. Cells display spectral count number, unique peptides, scores from search engines, and/or protein coverage information; a mouse-over function reveals all associated data for each cell in the table. For each protein displayed in the Comparison view, an associated Peptide link (Fig. 1b) may also be selected to reveal information such as sequence, location, spectral counts, and score, for each associated peptide. Importantly, all search results can be filtered. For example, ProHits allows for the removal of non-specific background proteins from the hit list, as defined by negative controls, search engine score thresholds, or contaminant lists. Links to the external NCBI and BioGRID8 databases are provided for each hit to facilitate data interpretation. Overlap with published interaction data housed in the BioGRID database8 can be displayed to allow immediate identification of new interaction partners. A flexible export function enables visualization in a graphical format with Cytoscape9, in which spectral counts, unique peptides, and search engine scores can be visualized as interaction edge attributes. The Analyst module also includes advanced search functions, bulk export functions for filtered or unfiltered data, and management of experimental protocols and background lists (e.g. Supplementary Fig. 18–20). Deposition of all mass spectrometry-associated data in public repositories is likely to become mandatory for publication of proteomics experiments2, 7, 10. Open access to raw files is essential for data reanalysis and cross-platform comparison; however, data submission to public repositories can be laborious due to strict formatting requirements. ProHits facilitates extraction of the necessary details in compliance with current standards, and generates Proteomic Standard Initiative (PSI) v2.5 compliant reports11, either in the MITAB format for BioGRID8 or in XML format for submission to IMEx consortium databases12, including IntAct13 (Supplementary Fig. 21). MS raw files associated with a given project can also be easily retrieved and grouped for submission to data repositories such as Tranche14. ProHits has developed to manage many large-scale in-house projects, including a systematic analysis of kinase and phosphatase interactions in yeast, consisting of 986 affinity purifications15. Smaller-scale projects from individual laboratories are readily handled in a similar manner. Examples of AP-MS data from both yeast and mammalian projects are provided in a demonstration version of ProHits at www.prohitsMS.com, and in Supplementary documents. The modular architecture of ProHits will accommodate additional new features, as dictated by future experimental and analytical needs. Although ProHits has been designed to handle protein interaction data, simple modifications of the open source code will enable straightforward adaptation to other proteomics workflows.
  15 in total

1.  Probability-based protein identification by searching sequence databases using mass spectrometry data.

Authors:  D N Perkins; D J Pappin; D M Creasy; J S Cottrell
Journal:  Electrophoresis       Date:  1999-12       Impact factor: 3.535

2.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

3.  TANDEM: matching proteins with tandem mass spectra.

Authors:  Robertson Craig; Ronald C Beavis
Journal:  Bioinformatics       Date:  2004-02-19       Impact factor: 6.937

Review 4.  Analysis of protein complexes using mass spectrometry.

Authors:  Anne-Claude Gingras; Matthias Gstaiger; Brian Raught; Ruedi Aebersold
Journal:  Nat Rev Mol Cell Biol       Date:  2007-08       Impact factor: 94.444

Review 5.  The minimum information required for reporting a molecular interaction experiment (MIMIx).

Authors:  Sandra Orchard; Lukasz Salwinski; Samuel Kerrien; Luisa Montecchi-Palazzi; Matthias Oesterheld; Volker Stümpflen; Arnaud Ceol; Andrew Chatr-aryamontri; John Armstrong; Peter Woollard; John J Salama; Susan Moore; Jérôme Wojcik; Gary D Bader; Marc Vidal; Michael E Cusick; Mark Gerstein; Anne-Claude Gavin; Giulio Superti-Furga; Jack Greenblatt; Joel Bader; Peter Uetz; Mike Tyers; Pierre Legrain; Stan Fields; Nicola Mulder; Michael Gilson; Michael Niepmann; Lyle Burgoon; Javier De Las Rivas; Carlos Prieto; Victoria M Perreau; Chris Hogue; Hans-Werner Mewes; Rolf Apweiler; Ioannis Xenarios; David Eisenberg; Gianni Cesareni; Henning Hermjakob
Journal:  Nat Biotechnol       Date:  2007-08       Impact factor: 54.908

Review 6.  The minimum information about a proteomics experiment (MIAPE).

Authors:  Chris F Taylor; Norman W Paton; Kathryn S Lilley; Pierre-Alain Binz; Randall K Julian; Andrew R Jones; Weimin Zhu; Rolf Apweiler; Ruedi Aebersold; Eric W Deutsch; Michael J Dunn; Albert J R Heck; Alexander Leitner; Marcus Macht; Matthias Mann; Lennart Martens; Thomas A Neubert; Scott D Patterson; Peipei Ping; Sean L Seymour; Puneet Souda; Akira Tsugita; Joel Vandekerckhove; Thomas M Vondriska; Julian P Whitelegge; Marc R Wilkins; Ioannnis Xenarios; John R Yates; Henning Hermjakob
Journal:  Nat Biotechnol       Date:  2007-08       Impact factor: 54.908

7.  The HUPO PSI's molecular interaction format--a community standard for the representation of protein interaction data.

Authors:  Henning Hermjakob; Luisa Montecchi-Palazzi; Gary Bader; Jérôme Wojcik; Lukasz Salwinski; Arnaud Ceol; Susan Moore; Sandra Orchard; Ugis Sarkans; Christian von Mering; Bernd Roechert; Sylvain Poux; Eva Jung; Henning Mersch; Paul Kersey; Michael Lappe; Yixue Li; Rong Zeng; Debashis Rana; Macha Nikolski; Holger Husi; Christine Brun; K Shanker; Seth G N Grant; Chris Sander; Peer Bork; Weimin Zhu; Akhilesh Pandey; Alvis Brazma; Bernard Jacq; Marc Vidal; David Sherman; Pierre Legrain; Gianni Cesareni; Ioannis Xenarios; David Eisenberg; Boris Steipe; Chris Hogue; Rolf Apweiler
Journal:  Nat Biotechnol       Date:  2004-02       Impact factor: 54.908

8.  IntAct--open source resource for molecular interaction data.

Authors:  S Kerrien; Y Alam-Faruque; B Aranda; I Bancarz; A Bridge; C Derow; E Dimmer; M Feuermann; A Friedrichsen; R Huntley; C Kohler; J Khadake; C Leroy; A Liban; C Lieftink; L Montecchi-Palazzi; S Orchard; J Risse; K Robbe; B Roechert; D Thorneycroft; Y Zhang; R Apweiler; H Hermjakob
Journal:  Nucleic Acids Res       Date:  2006-12-01       Impact factor: 16.971

9.  Large-scale mapping of human protein-protein interactions by mass spectrometry.

Authors:  Rob M Ewing; Peter Chu; Fred Elisma; Hongyan Li; Paul Taylor; Shane Climie; Linda McBroom-Cerajewski; Mark D Robinson; Liam O'Connor; Michael Li; Rod Taylor; Moyez Dharsee; Yuen Ho; Adrian Heilbut; Lynda Moore; Shudong Zhang; Olga Ornatsky; Yury V Bukhman; Martin Ethier; Yinglun Sheng; Julian Vasilescu; Mohamed Abu-Farha; Jean-Philippe Lambert; Henry S Duewel; Ian I Stewart; Bonnie Kuehl; Kelly Hogue; Karen Colwill; Katharine Gladwish; Brenda Muskat; Robert Kinach; Sally-Lin Adams; Michael F Moran; Gregg B Morin; Thodoros Topaloglou; Daniel Figeys
Journal:  Mol Syst Biol       Date:  2007-03-13       Impact factor: 11.429

10.  A uniform proteomics MS/MS analysis platform utilizing open XML file formats.

Authors:  Andrew Keller; Jimmy Eng; Ning Zhang; Xiao-jun Li; Ruedi Aebersold
Journal:  Mol Syst Biol       Date:  2005-08-02       Impact factor: 11.429

View more
  113 in total

1.  Proteomic profiling of the human cytomegalovirus UL35 gene products reveals a role for UL35 in the DNA repair response.

Authors:  Jayme Salsman; Madhav Jagannathan; Patrick Paladino; Pak-Kei Chan; Graham Dellaire; Brian Raught; Lori Frappier
Journal:  J Virol       Date:  2011-11-09       Impact factor: 5.103

2.  Interaction proteomics identify NEURL4 and the HECT E3 ligase HERC2 as novel modulators of centrosome architecture.

Authors:  Abdallah K Al-Hakim; Mikhail Bashkurov; Anne-Claude Gingras; Daniel Durocher; Laurence Pelletier
Journal:  Mol Cell Proteomics       Date:  2012-01-19       Impact factor: 5.911

3.  Relevance Rank Platform (RRP) for Functional Filtering of High Content Protein-Protein Interaction Data.

Authors:  Yuba Raj Pokharel; Jani Saarela; Agnieszka Szwajda; Christian Rupp; Anne Rokka; Shibendra Lal Kumar Karna; Kaisa Teittinen; Garry Corthals; Olli Kallioniemi; Krister Wennerberg; Tero Aittokallio; Jukka Westermarck
Journal:  Mol Cell Proteomics       Date:  2015-10-23       Impact factor: 5.911

4.  CEP192 interacts physically and functionally with the K63-deubiquitinase CYLD to promote mitotic spindle assembly.

Authors:  Maria Ana Gomez-Ferreria; Mikhail Bashkurov; Michael Mullin; Anne-Claude Gingras; Laurence Pelletier
Journal:  Cell Cycle       Date:  2012-08-16       Impact factor: 4.534

5.  BioID-based Identification of Skp Cullin F-box (SCF)β-TrCP1/2 E3 Ligase Substrates.

Authors:  Etienne Coyaud; Monika Mis; Estelle M N Laurent; Wade H Dunham; Amber L Couzens; Melanie Robitaille; Anne-Claude Gingras; Stephane Angers; Brian Raught
Journal:  Mol Cell Proteomics       Date:  2015-04-21       Impact factor: 5.911

6.  Structure-function analysis of core STRIPAK Proteins: a signaling complex implicated in Golgi polarization.

Authors:  Michelle J Kean; Derek F Ceccarelli; Marilyn Goudreault; Mario Sanches; Stephen Tate; Brett Larsen; Lucien C D Gibson; W Brent Derry; Ian C Scott; Laurence Pelletier; George S Baillie; Frank Sicheri; Anne-Claude Gingras
Journal:  J Biol Chem       Date:  2011-05-11       Impact factor: 5.157

7.  Assessment of a method to characterize antibody selectivity and specificity for use in immunoprecipitation.

Authors:  Edyta Marcon; Harshika Jain; Anandi Bhattacharya; Hongbo Guo; Sadhna Phanse; Shuye Pu; Gregory Byram; Ben C Collins; Evan Dowdell; Maria Fenner; Xinghua Guo; Ashley Hutchinson; Jacob J Kennedy; Bryan Krastins; Brett Larsen; Zhen-Yuan Lin; Mary F Lopez; Peter Loppnau; Shane Miersch; Tin Nguyen; Jonathan B Olsen; Marcin Paduch; Mani Ravichandran; Alma Seitova; Gouri Vadali; Maryann S Vogelsang; Jeffrey R Whiteaker; Guoqing Zhong; Nan Zhong; Lei Zhao; Ruedi Aebersold; Cheryl H Arrowsmith; Andrew Emili; Lori Frappier; Anne-Claude Gingras; Matthias Gstaiger; Amanda G Paulovich; Shohei Koide; Anthony A Kossiakoff; Sachdev S Sidhu; Shoshana J Wodak; Susanne Gräslund; Jack F Greenblatt; Aled M Edwards
Journal:  Nat Methods       Date:  2015-06-29       Impact factor: 28.547

Review 8.  Multidimensional proteomics for cell biology.

Authors:  Mark Larance; Angus I Lamond
Journal:  Nat Rev Mol Cell Biol       Date:  2015-04-10       Impact factor: 94.444

9.  Src homology 2 domain containing protein 5 (SH2D5) binds the breakpoint cluster region protein, BCR, and regulates levels of Rac1-GTP.

Authors:  Elizabeth J Gray; Evangelia Petsalaki; D Andrew James; Richard D Bagshaw; Melissa M Stacey; Oliver Rocks; Anne-Claude Gingras; Tony Pawson
Journal:  J Biol Chem       Date:  2014-10-20       Impact factor: 5.157

10.  The Shb scaffold binds the Nck adaptor protein, p120 RasGAP, and Chimaerins and thereby facilitates heterotypic cell segregation by the receptor EphB2.

Authors:  Melany J Wagner; Marilyn S Hsiung; Gerald D Gish; Rick D Bagshaw; Sasha A Doodnauth; Mohamed A Soliman; Claus Jørgensen; Monika Tucholska; Robert Rottapel
Journal:  J Biol Chem       Date:  2020-02-14       Impact factor: 5.157

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.