Literature DB >> 32967423

Spritz: A Proteogenomic Database Engine.

Anthony J Cesnik1,2,3,4, Rachel M Miller1, Khairina Ibrahim1, Lei Lu1, Robert J Millikin1, Michael R Shortreed1, Brian L Frey1, Lloyd M Smith1.   

Abstract

Proteoforms are the workhorses of the cell, and subtle differences between their amino acid sequences or post-translational modifications (PTMs) can change their biological function. To most effectively identify and quantify proteoforms in genetically diverse samples by mass spectrometry (MS), it is advantageous to search the MS data against a sample-specific protein database that is tailored to the sample being analyzed, in that it contains the correct amino acid sequences and relevant PTMs for that sample. To this end, we have developed Spritz (https://smith-chem-wisc.github.io/Spritz/), an open-source software tool for generating protein databases annotated with sequence variations and PTMs. We provide a simple graphical user interface for Windows and scripts that can be run on any operating system. Spritz automatically sets up and executes approximately 20 tools, which enable the construction of a proteogenomic database from only raw RNA sequencing data. Sequence variations that are discovered in RNA sequencing data upon comparison to the Ensembl reference genome are annotated on proteins in these databases, and PTM annotations are transferred from UniProt. Modifications can also be discovered and added to the database using bottom-up mass spectrometry data and global PTM discovery in MetaMorpheus. We demonstrate that such sample-specific databases allow the identification of variant peptides, modified variant peptides, and variant proteoforms by searching bottom-up and top-down proteomic data from the Jurkat human T lymphocyte cell line and demonstrate the identification of phosphorylated variant sites with phosphoproteomic data from the U2OS human osteosarcoma cell line.

Entities:  

Keywords:  PTMs; RNA-Seq; modifications; proteoform; proteogenomics; sample-specific; sequence variations; top-down; transcriptomics

Mesh:

Year:  2020        PMID: 32967423      PMCID: PMC8024408          DOI: 10.1021/acs.jproteome.0c00407

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  36 in total

1.  A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3.

Authors:  Pablo Cingolani; Adrian Platts; Le Lily Wang; Melissa Coon; Tung Nguyen; Luan Wang; Susan J Land; Xiangyi Lu; Douglas M Ruden
Journal:  Fly (Austin)       Date:  2012 Apr-Jun       Impact factor: 2.160

2.  Discovery and mass spectrometric analysis of novel splice-junction peptides using RNA-Seq.

Authors:  Gloria M Sheynkman; Michael R Shortreed; Brian L Frey; Lloyd M Smith
Journal:  Mol Cell Proteomics       Date:  2013-04-29       Impact factor: 5.911

3.  Enhanced Global Post-translational Modification Discovery with MetaMorpheus.

Authors:  Stefan K Solntsev; Michael R Shortreed; Brian L Frey; Lloyd M Smith
Journal:  J Proteome Res       Date:  2018-04-02       Impact factor: 4.466

Review 4.  Proteogenomics: concepts, applications and computational strategies.

Authors:  Alexey I Nesvizhskii
Journal:  Nat Methods       Date:  2014-11       Impact factor: 28.547

5.  Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype.

Authors:  Daehwan Kim; Joseph M Paggi; Chanhee Park; Christopher Bennett; Steven L Salzberg
Journal:  Nat Biotechnol       Date:  2019-08-02       Impact factor: 54.908

6.  Skewer: a fast and accurate adapter trimmer for next-generation sequencing paired-end reads.

Authors:  Hongshan Jiang; Rong Lei; Shou-Wei Ding; Shuifang Zhu
Journal:  BMC Bioinformatics       Date:  2014-06-12       Impact factor: 3.169

7.  Using Galaxy-P to leverage RNA-Seq for the discovery of novel protein variations.

Authors:  Gloria M Sheynkman; James E Johnson; Pratik D Jagtap; Michael R Shortreed; Getiria Onsongo; Brian L Frey; Timothy J Griffin; Lloyd M Smith
Journal:  BMC Genomics       Date:  2014-08-22       Impact factor: 3.969

8.  MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics.

Authors:  Andy T Kong; Felipe V Leprevost; Dmitry M Avtonomov; Dattatreya Mellacheruvu; Alexey I Nesvizhskii
Journal:  Nat Methods       Date:  2017-04-10       Impact factor: 28.547

9.  ProteomeGenerator: A Framework for Comprehensive Proteomics Based on de Novo Transcriptome Assembly and High-Accuracy Peptide Mass Spectral Matching.

Authors:  Paolo Cifani; Avantika Dhabaria; Zining Chen; Akihide Yoshimi; Emily Kawaler; Omar Abdel-Wahab; John T Poirier; Alex Kentsis
Journal:  J Proteome Res       Date:  2018-10-19       Impact factor: 4.466

10.  Human Proteomic Variation Revealed by Combining RNA-Seq Proteogenomics and Global Post-Translational Modification (G-PTM) Search Strategy.

Authors:  Anthony J Cesnik; Michael R Shortreed; Gloria M Sheynkman; Brian L Frey; Lloyd M Smith
Journal:  J Proteome Res       Date:  2016-01-12       Impact factor: 4.466

View more
  8 in total

1.  ProteaseGuru: A Tool for Protease Selection in Bottom-Up Proteomics.

Authors:  Rachel M Miller; Khairina Ibrahim; Lloyd M Smith
Journal:  J Proteome Res       Date:  2021-03-04       Impact factor: 4.466

2.  Cloudy with a Chance of Peptides: Accessibility, Scalability, and Reproducibility with Cloud-Hosted Environments.

Authors:  Benjamin A Neely
Journal:  J Proteome Res       Date:  2021-01-29       Impact factor: 4.466

3.  Personalized Proteome: Comparing Proteogenomics and Open Variant Search Approaches for Single Amino Acid Variant Detection.

Authors:  Renee Salz; Robbin Bouwmeester; Ralf Gabriels; Sven Degroeve; Lennart Martens; Pieter-Jan Volders; Peter A C 't Hoen
Journal:  J Proteome Res       Date:  2021-05-17       Impact factor: 4.466

4.  Enhanced protein isoform characterization through long-read proteogenomics.

Authors:  Rachel M Miller; Ben T Jordan; Madison M Mehlferber; Erin D Jeffery; Christina Chatzipantsiou; Simi Kaur; Robert J Millikin; Yunxiang Dai; Simone Tiberi; Peter J Castaldi; Michael R Shortreed; Chance John Luckey; Ana Conesa; Lloyd M Smith; Anne Deslattes Mays; Gloria M Sheynkman
Journal:  Genome Biol       Date:  2022-03-03       Impact factor: 13.583

5.  Generation of ENSEMBL-based proteogenomics databases boosts the identification of non-canonical peptides.

Authors:  Husen M Umer; Enrique Audain; Yafeng Zhu; Julianus Pfeuffer; Timo Sachsenberg; Janne Lehtiö; Rui Branca; Yasset Perez-Riverol
Journal:  Bioinformatics       Date:  2021-12-14       Impact factor: 6.937

6.  Binary Classifier for Computing Posterior Error Probabilities in MetaMorpheus.

Authors:  Michael R Shortreed; Robert J Millikin; Lei Liu; Zach Rolfs; Rachel M Miller; Leah V Schaffer; Brian L Frey; Lloyd M Smith
Journal:  J Proteome Res       Date:  2021-03-08       Impact factor: 4.466

Review 7.  Prospects and challenges of cancer systems medicine: from genes to disease networks.

Authors:  Mohammad Reza Karimi; Amir Hossein Karimi; Shamsozoha Abolmaali; Mehdi Sadeghi; Ulf Schmitz
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

8.  Immunopeptidogenomics: Harnessing RNA-Seq to Illuminate the Dark Immunopeptidome.

Authors:  Katherine E Scull; Kirti Pandey; Sri H Ramarathinam; Anthony W Purcell
Journal:  Mol Cell Proteomics       Date:  2021-09-10       Impact factor: 5.911

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.