Literature DB >> 30252043

SanXoT: a modular and versatile package for the quantitative analysis of high-throughput proteomics experiments.

Marco Trevisan-Herraz1,2, Navratan Bagwan1, Fernando García-Marqués1,2, Jose Manuel Rodriguez1, Inmaculada Jorge1,2, Iakes Ezkurdia1,2, Elena Bonzon-Kulichenko1,2, Jesús Vázquez1,2.   

Abstract

SUMMARY: Mass spectrometry-based proteomics has had a formidable development in recent years, increasing the amount of data handled and the complexity of the statistical resources needed. Here we present SanXoT, an open-source, standalone software package for the statistical analysis of high-throughput, quantitative proteomics experiments. SanXoT is based on our previously developed weighted spectrum, peptide and protein statistical model and has been specifically designed to be modular, scalable and user-configurable. SanXoT allows limitless workflows that adapt to most experimental setups, including quantitative protein analysis in multiple experiments, systems biology, quantification of post-translational modifications and comparison and merging of experimental data from technical or biological replicates.
AVAILABILITY AND IMPLEMENTATION: Download links for the SanXoT Software Package, source code and documentation are available at https://wikis.cnic.es/proteomica/index.php/SSP. CONTACT: jvazquez@cnic.es or ebonzon@cnic.es. SUPPLEMENTARY INFORMATION: Supplementary information is available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30252043      PMCID: PMC6499250          DOI: 10.1093/bioinformatics/bty815

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Current high-throughput quantitative proteomics presents many bioinformatic challenges, especially in the case of stable isotope-based techniques. Several of these problems have been highlighted in the literature, such as the problem of undersampling (Nilsson ), the need for a null hypothesis (Arntzen ; Karp ; Lin ), the proteome dynamic range (Zubarev, 2013), the non-normality of protein abundance change distributions (Karp ) and the need for quality control measures. Most of these issues were addressed by the weighted spectrum, peptide and protein (WSPP) statistical model (Bonzon-Kulichenko ; García-Marqués ; Jorge ; Navarro ). WSPP models the error structure of the data generated by the mass spectrometer (spectrum level) and integrates the quantitative results into peptide values using weighted averages according to error propagation theory (higher weights are assigned to measurements with lower error). The peptide values are then integrated into protein values and finally the protein values are integrated to determine protein abundance changes. Thus, the data are analysed independently and sequentially at the spectrum, peptide and protein levels and the specific error sources are considered separately, allowing efficient detection of artefacts (Bonzon-Kulichenko ; Bonzon-Kulichenko ; Jorge ). The general applicability of WSPP was demonstrated by validating their underlying null hypothesis in a large variety of labelling schemes and mass spectrometers. At each one of the integration levels WSPP uses standardized variables (z-values) that have been demonstrated to follow normal distributions in hundreds of experiments (see Refs in Supplementary Material). The characteristics of the WSPP model were firstly exploited in the QuiXoT software package (Trevisan-Herraz ). We later generalized the process of data integration according to the WSPP model by developing the Generic Integration Algorithm (GIA), being the Systems Biology Triangle (a method to analyse coordinated protein responses) one of their first applications (García-Marqués ). However, we later noticed a growing need for a faster, more flexible, automated and scalable software for protein quantitation, able to cope with large number of experiments and different experimental setups. Hence, we developed the SanXoT software package to fully exploit the robustness, versatility and general applicability of the GIA. SanXoT is very flexible and, thanks to its modularity, can be used in automated workflows. SanXoT has been developed in Python, and is publicly available under the Apache Licence v2.0. It has been extensively tested in Windows, and portable executables for this OS that do not require installation of any further libraries are also available.

2 Design

SanXoT package workflows follow a modular structure (Fig. 1 and Supplementary Fig. S1), allowing sequential application of the GIA by the SanXoT module. A GIA integration consists of integrating the quantitative data from a lower level (such as peptides) into a higher level (such as proteins), as described (García-Marqués ). The four main modules of the package are depicted in Figure 1, and details about them are provided in the Supplementary Material.
Fig. 1.

Examples of quantitative workflows constructed with modules from the SanXoT package: (A) quantitative analysis of a single experiment, integrating information at the spectrum, peptide and protein levels, including quantitative analysis of post-translational modifications and systems biology analysis using the Systems Biology Triangle; (B) quantitative analysis of an experiment performed with four technical or biological replicates. For simplicity, only the four main modules are represented here

Examples of quantitative workflows constructed with modules from the SanXoT package: (A) quantitative analysis of a single experiment, integrating information at the spectrum, peptide and protein levels, including quantitative analysis of post-translational modifications and systems biology analysis using the Systems Biology Triangle; (B) quantitative analysis of an experiment performed with four technical or biological replicates. For simplicity, only the four main modules are represented here

3 Applications

Workflows using prototype versions of SanXoT have been extensively used for systems biology analyses (Fig. 1A) (see Refs in Supplementary Material). They can also be prepared for automated quantification of post-translational modifications (Bagwan ), in the global context of protein or functional category changes (Fig. 1A). Workflows can also integrate quantitative data from technical or biological replicates (Fig. 1B). In the latter setup, averaged quantitative values for each protein are calculated taking into account the weight of protein measures in each replicate. The technical or biological variance is automatically calculated in the process. The speed and robustness of SanXoT allows unattended analysis of hundreds of experiments, such as those obtained from clinical cohorts, within one day (manuscript in preparation). A more detailed explanation on these functionalities is available at the Supplementary Material and the wiki at the link provided.

4 Conclusion

The successful, extensive use of the SanXoT software package—in preliminary versions—in different biological contexts has demonstrated its utility in exploiting the specific characteristics of the WSPP model for quantitative proteomics analysis (see Refs in Supplementary Material). Perhaps the most notable feature is its robustness, which is mainly a consequence of the use of weighted averages and the estimation of variances from the global distribution of data at each level (Navarro ). This was, in turn, possible thanks to the use standardized variables that accurately follow normal distributions in a way unaffected by the undersampling typical of data-independent mass spectrometry approaches. In addition—thanks to its modular design and the use of plain text files hierarchically structured at each level using relation tables—SanXoT can be easily integrated in other workflows that make use of network analysis or transcriptomics data, or are generated with label-free techniques. SanXoT is currently being adapted for use in parallel with integrated protein identification algorithms, allowing mutual feedback between peptide/protein identification and quantitative information.

Funding

This study was supported by competitive grants [BIO2012-37926, BIO2015-67580-P] from the Spanish Ministry of Economy, Industry and Competitiveness (MEIC), [grant IPT13/0001] (ProteoRed, PRB2, ISCIII-SGEFI/ERDF), [grant IPT17/0019] (ProteoRed, PRB3, ISCIII-SGEFI/ERDF), the Fundació La Marató de TV3, and the European Commission FP7 (FP7-PEOPLE-2013-ITN Next generation training in cardiovascular research and innovation-CardioNext). The Centro Nacional de Investigaciones Cardiovasculares Carlos is supported by the Spanish Ministry of Economy, Industry and Competitiveness (MEIC) and the Pro-CNIC Foundation, and is a Severo Ochoa Center of Excellence (MEIC award SEV-2015-0505). Conflict of Interest: none declared. Click here for additional data file.
  12 in total

1.  A robust method for quantitative high-throughput analysis of proteomes by 18O labeling.

Authors:  Elena Bonzon-Kulichenko; Daniel Pérez-Hernández; Estefanía Núñez; Pablo Martínez-Acedo; Pedro Navarro; Marco Trevisan-Herraz; María del Carmen Ramos; Saleta Sierra; Sara Martínez-Martínez; Marisol Ruiz-Meana; Elizabeth Miró-Casas; David García-Dorado; Juan Miguel Redondo; Javier S Burgos; Jesús Vázquez
Journal:  Mol Cell Proteomics       Date:  2010-08-31       Impact factor: 5.911

2.  Mass spectrometry in high-throughput proteomics: ready for the big time.

Authors:  Tommy Nilsson; Matthias Mann; Ruedi Aebersold; John R Yates; Amos Bairoch; John J M Bergeron
Journal:  Nat Methods       Date:  2010-09       Impact factor: 28.547

3.  Addressing accuracy and precision issues in iTRAQ quantitation.

Authors:  Natasha A Karp; Wolfgang Huber; Pawel G Sadowski; Philip D Charles; Svenja V Hester; Kathryn S Lilley
Journal:  Mol Cell Proteomics       Date:  2010-04-10       Impact factor: 5.911

4.  IsobariQ: software for isobaric quantitative proteomics using IPTL, iTRAQ, and TMT.

Authors:  Magnus Ø Arntzen; Christian J Koehler; Harald Barsnes; Frode S Berven; Achim Treumann; Bernd Thiede
Journal:  J Proteome Res       Date:  2010-11-29       Impact factor: 4.466

5.  Multi-Q: a fully automated tool for multiplexed protein quantitation.

Authors:  Wen-Ting Lin; Wei-Neng Hung; Yi-Hwa Yian; Kun-Pin Wu; Chia-Li Han; Yet-Ran Chen; Yu-Ju Chen; Ting-Yi Sung; Wen-Lian Hsu
Journal:  J Proteome Res       Date:  2006-09       Impact factor: 4.466

6.  Quantitative in-depth analysis of the dynamic secretome of activated Jurkat T-cells.

Authors:  Elena Bonzon-Kulichenko; Sara Martínez-Martínez; Marco Trevisan-Herraz; Pedro Navarro; Juan Miguel Redondo; Jesús Vázquez
Journal:  J Proteomics       Date:  2011-09-01       Impact factor: 4.044

Review 7.  The challenge of the proteome dynamic range and its implications for in-depth proteomics.

Authors:  Roman A Zubarev
Journal:  Proteomics       Date:  2013-03       Impact factor: 3.984

8.  General statistical framework for quantitative proteomics by stable isotope labeling.

Authors:  Pedro Navarro; Marco Trevisan-Herraz; Elena Bonzon-Kulichenko; Estefanía Núñez; Pablo Martínez-Acedo; Daniel Pérez-Hernández; Inmaculada Jorge; Raquel Mesa; Enrique Calvo; Montserrat Carrascal; María Luisa Hernáez; Fernando García; José Antonio Bárcena; Keith Ashman; Joaquín Abian; Concha Gil; Juan Miguel Redondo; Jesús Vázquez
Journal:  J Proteome Res       Date:  2014-02-10       Impact factor: 4.466

9.  Statistical model to analyze quantitative proteomics data obtained by 18O/16O labeling and linear ion trap mass spectrometry: application to the study of vascular endothelial growth factor-induced angiogenesis in endothelial cells.

Authors:  Inmaculada Jorge; Pedro Navarro; Pablo Martínez-Acedo; Estefanía Núñez; Horacio Serrano; Arántzazu Alfranca; Juan Miguel Redondo; Jesús Vázquez
Journal:  Mol Cell Proteomics       Date:  2009-01-29       Impact factor: 5.911

10.  The human HDL proteome displays high inter-individual variability and is altered dynamically in response to angioplasty-induced atheroma plaque rupture.

Authors:  Inmaculada Jorge; Elena Burillo; Raquel Mesa; Lucía Baila-Rueda; Margoth Moreno; Marco Trevisan-Herraz; Juan Carlos Silla-Castro; Emilio Camafeita; Mariano Ortega-Muñoz; Elena Bonzon-Kulichenko; Isabel Calvo; Ana Cenarro; Fernando Civeira; Jesús Vázquez
Journal:  J Proteomics       Date:  2014-04-18       Impact factor: 4.044

View more
  15 in total

1.  NOTCH Activation Promotes Valve Formation by Regulating the Endocardial Secretome.

Authors:  Rebeca Torregrosa-Carrión; Luis Luna-Zurita; Fernando García-Marqués; Gaetano D'Amato; Rebeca Piñeiro-Sabarís; Elena Bonzón-Kulichenko; Jesús Vázquez; José Luis de la Pompa
Journal:  Mol Cell Proteomics       Date:  2019-06-27       Impact factor: 5.911

2.  Aortic disease in Marfan syndrome is caused by overactivation of sGC-PRKG signaling by NO.

Authors:  Andrea de la Fuente-Alonso; Marta Toral; Alvaro Alfayate; María Jesús Ruiz-Rodríguez; Elena Bonzón-Kulichenko; Gisela Teixido-Tura; Sara Martínez-Martínez; María José Méndez-Olivares; Dolores López-Maderuelo; Ileana González-Valdés; Eusebio Garcia-Izquierdo; Susana Mingo; Carlos E Martín; Laura Muiño-Mosquera; Julie De Backer; J Francisco Nistal; Alberto Forteza; Arturo Evangelista; Jesús Vázquez; Miguel R Campanero; Juan Miguel Redondo
Journal:  Nat Commun       Date:  2021-05-11       Impact factor: 14.919

3.  Programmed 'disarming' of the neutrophil proteome reduces the magnitude of inflammation.

Authors:  Jose M Adrover; Alejandra Aroca-Crevillén; Georgiana Crainiciuc; Fernando Ostos; Yeny Rojas-Vega; Andrea Rubio-Ponce; Catia Cilloniz; Elena Bonzón-Kulichenko; Enrique Calvo; Daniel Rico; María A Moro; Christian Weber; Ignacio Lizasoaín; Antoni Torres; Jesús Ruiz-Cabello; Jesús Vázquez; Andrés Hidalgo
Journal:  Nat Immunol       Date:  2020-01-13       Impact factor: 25.606

4.  Unraveling the Molecular Signature of Extracellular Vesicles From Endometrial-Derived Mesenchymal Stem Cells: Potential Modulatory Effects and Therapeutic Applications.

Authors:  Federica Marinaro; María Gómez-Serrano; Inmaculada Jorge; Juan Carlos Silla-Castro; Jesús Vázquez; Francisco Miguel Sánchez-Margallo; Rebeca Blázquez; Esther López; Verónica Álvarez; Javier G Casado
Journal:  Front Bioeng Biotechnol       Date:  2019-12-20

5.  A Proteomics Signature of Mild Hypospadias: A Pilot Study.

Authors:  Coriness Piñeyro-Ruiz; Horacio Serrano; Inmaculada Jorge; Eric Miranda-Valentin; Marcos R Pérez-Brayfield; Emilio Camafeita; Raquel Mesa; Jesús Vázquez; Juan Carlos Jorge
Journal:  Front Pediatr       Date:  2020-12-23       Impact factor: 3.418

6.  Caveolin1 and YAP drive mechanically induced mesothelial to mesenchymal transition and fibrosis.

Authors:  Raffaele Strippoli; Pilar Sandoval; Roberto Moreno-Vicente; Manuel López-Cabrera; Miguel Ángel Del Pozo; Lucia Rossi; Cecilia Battistelli; Michela Terri; Lucía Pascual-Antón; Marta Loureiro; Francesca Matteini; Enrique Calvo; José Antonio Jiménez-Heffernan; Manuel José Gómez; Victor Jiménez-Jiménez; Fátima Sánchez-Cabo; Jesús Vázquez; Marco Tripodi
Journal:  Cell Death Dis       Date:  2020-08-03       Impact factor: 8.469

7.  CIBER-CLAP (CIBERCV Cardioprotection Large Animal Platform): A multicenter preclinical network for testing reproducibility in cardiovascular interventions.

Authors:  Xavier Rossello; Antonio Rodriguez-Sinovas; Gemma Vilahur; Verónica Crisóstomo; Inmaculada Jorge; Carlos Zaragoza; José L Zamorano; Javier Bermejo; Antonio Ordoñez; Lisardo Boscá; Jesús Vázquez; Lina Badimón; Francisco M Sánchez-Margallo; Francisco Fernández-Avilés; David Garcia-Dorado; Borja Ibanez
Journal:  Sci Rep       Date:  2019-12-30       Impact factor: 4.379

8.  Molecular Characterization of the Coproduced Extracellular Vesicles in HEK293 during Virus-Like Particle Production.

Authors:  Jesús Lavado-García; Irene González-Domínguez; Laura Cervera; Inmaculada Jorge; Jesús Vázquez; Francesc Gòdia
Journal:  J Proteome Res       Date:  2020-10-05       Impact factor: 4.466

9.  Proteomic analysis of plasma proteins of high-flux haemodialysis and on-line haemodiafiltration patients reveals differences in transthyretin levels related with anaemia.

Authors:  Emma Martínez-Alonso; Paula Alcázar; Emilio Camafeita; Milagros Fernández-Lucas; Gloria Ruíz-Roso; Alberto Alcázar
Journal:  Sci Rep       Date:  2020-09-29       Impact factor: 4.379

10.  Proteomic and Biological Analysis of an In Vitro Human Endothelial System in Response to Drug Anaphylaxis.

Authors:  Alma Yuste-Montalvo; Sergio Fernandez-Bravo; Tamara Oliva; Carlos Pastor-Vargas; Diana Betancor; María José Goikoetxea; José Julio Laguna; Juan Antonio López; Gloria Alvarez-Llamas; Javier Cuesta-Herranz; Marta Martin-Lorenzo; Vanesa Esteban
Journal:  Front Immunol       Date:  2021-06-25       Impact factor: 7.561

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.