Literature DB >> 26147802

PPLine: An Automated Pipeline for SNP, SAP, and Splice Variant Detection in the Context of Proteogenomics.

George Sergeevich Krasnov1,2,3, Alexey Alexandrovich Dmitriev1, Anna Viktorovna Kudryavtseva1,4, Alexander Valerievich Shargunov2,3, Dmitry Sergeevich Karpov1,2, Leonid Andreevich Uroshlev1, Natalya Vladimirovna Melnikova1, Vladimir Mikhailovich Blinov2,3, Ekaterina Vladimirovna Poverennaya2, Alexander Ivanovich Archakov2, Andrey Valerievich Lisitsa2, Elena Alexandrovna Ponomarenko2.   

Abstract

The fundamental mission of the Chromosome-Centric Human Proteome Project (C-HPP) is the research of human proteome diversity, including rare variants. Liver tissues, HepG2 cells, and plasma were selected as one of the major objects for C-HPP studies. The proteogenomic approach, a recently introduced technique, is a powerful method for predicting and validating proteoforms coming from alternative splicing, mutations, and transcript editing. We developed PPLine, a Python-based proteogenomic pipeline providing automated single-amino-acid polymorphism (SAP), indel, and alternative-spliced-variants discovery based on raw transcriptome and exome sequence data, single-nucleotide polymorphism (SNP) annotation and filtration, and the prediction of proteotypic peptides (available at https://sourceforge.net/projects/ppline). In this work, we performed deep transcriptome sequencing of HepG2 cells and liver tissues using two platforms: Illumina HiSeq and Applied Biosystems SOLiD. Using PPLine, we revealed 7756 SAP and indels for HepG2 cells and liver (including 659 variants nonannotated in dbSNP). We found 17 indels in transcripts associated with the translation of alternate reading frames (ARF) longer than 300 bp. The ARF products of two genes, SLMO1 and TMEM8A, demonstrate signatures of caspase-binding domain and Gcn5-related N-acetyltransferase. Alternative splicing analysis predicted novel proteoforms encoded by 203 (liver) and 475 (HepG2) genes according to both Illumina and SOLiD data. The results of the present work represent a basis for subsequent proteomic studies by the C-HPP consortium.

Entities:  

Keywords:  C-HPP; RNA-seq; SAP; SNP; alternative reading frames; alternative splicing; indel; proteotypic peptides

Mesh:

Year:  2015        PMID: 26147802     DOI: 10.1021/acs.jproteome.5b00490

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  21 in total

1.  The influence of pro-longevity gene Gclc overexpression on the age-dependent changes in Drosophila transcriptome and biological functions.

Authors:  Alexey Moskalev; Mikhail Shaposhnikov; Ekaterina Proshkina; Alexey Belyi; Alexander Fedintsev; Svetlana Zhikrivetskaya; Zulfiya Guvatova; Asiya Sadritdinova; Anastasia Snezhkina; George Krasnov; Anna Kudryavtseva
Journal:  BMC Genomics       Date:  2016-12-28       Impact factor: 3.969

Review 2.  Methods, Tools and Current Perspectives in Proteogenomics.

Authors:  Kelly V Ruggles; Karsten Krug; Xiaojing Wang; Karl R Clauser; Jing Wang; Samuel H Payne; David Fenyö; Bing Zhang; D R Mani
Journal:  Mol Cell Proteomics       Date:  2017-04-29       Impact factor: 5.911

3.  Adaptation of gene loci to heterochromatin in the course of Drosophila evolution is associated with insulator proteins.

Authors:  Sergei Yu Funikov; Alexander P Rezvykh; Dina A Kulikova; Elena S Zelentsova; Lyudmila A Protsenko; Lyubov N Chuvakova; Venera I Tyukmaeva; Irina R Arkhipova; Michael B Evgen'ev
Journal:  Sci Rep       Date:  2020-07-17       Impact factor: 4.379

Review 4.  Advances in the Chromosome-Centric Human Proteome Project: looking to the future.

Authors:  Young-Ki Paik; Gilbert S Omenn; William S Hancock; Lydie Lane; Christopher M Overall
Journal:  Expert Rev Proteomics       Date:  2017-11-10       Impact factor: 3.940

5.  FUS(1-359) transgenic mice as a model of ALS: pathophysiological and molecular aspects of the proteinopathy.

Authors:  Sergei Y Funikov; Alexander P Rezvykh; Pavel V Mazin; Alexey V Morozov; Andrey V Maltsev; Maria M Chicheva; Ekaterina A Vikhareva; Mikhail B Evgen'ev; Aleksey A Ustyugov
Journal:  Neurogenetics       Date:  2018-07-07       Impact factor: 2.660

6.  Continent-wide evolutionary trends of emerging SARS-CoV-2 variants: dynamic profiles from Alpha to Omicron.

Authors:  Chiranjib Chakraborty; Manojit Bhattacharya; Ashish Ranjan Sharma; Kuldeep Dhama; Sang-Soo Lee
Journal:  Geroscience       Date:  2022-07-13       Impact factor: 7.581

7.  Low Level of Expression of C-Terminally Truncated Human FUS Causes Extensive Changes in the Spinal Cord Transcriptome of Asymptomatic Transgenic Mice.

Authors:  Ekaterina A Lysikova; Sergei Funikov; Alexander P Rezvykh; Kirill D Chaprov; Michail S Kukharsky; Aleksey Ustyugov; Alexey V Deykin; Ilya M Flyamer; Shelagh Boyle; Sergey O Bachurin; Natalia Ninkina; Vladimir L Buchman
Journal:  Neurochem Res       Date:  2020-03-11       Impact factor: 3.996

8.  High throughput discovery of protein variants using proteomics informed by transcriptomics.

Authors:  Shyamasree Saha; David A Matthews; Conrad Bessant
Journal:  Nucleic Acids Res       Date:  2018-06-01       Impact factor: 16.971

9.  Hsp70 affects memory formation and behaviorally relevant gene expression in Drosophila melanogaster.

Authors:  O G Zatsepina; E A Nikitina; V Y Shilova; L N Chuvakova; S Sorokina; J E Vorontsova; E V Tokmacheva; S Y Funikov; A P Rezvykh; M B Evgen'ev
Journal:  Cell Stress Chaperones       Date:  2021-04-07       Impact factor: 3.667

10.  Improved methods for RNAseq-based alternative splicing analysis.

Authors:  Patrick Pirrotte; Nicholas J Schork; Rebecca F Halperin; Apurva Hegde; Jessica D Lang; Elizabeth A Raupach; Christophe Legendre; Winnie S Liang; Patricia M LoRusso; Aleksandar Sekulic; Jeffrey A Sosman; Jeffrey M Trent; Sampathkumar Rangasamy
Journal:  Sci Rep       Date:  2021-05-24       Impact factor: 4.996

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.