Literature DB >> 32937025

Using Deep Learning to Extrapolate Protein Expression Measurements.

Mitra Parissa Barzine1, Karlis Freivalds2,3, James C Wright4, Mārtiņš Opmanis2, Darta Rituma2,3, Fatemeh Zamanzad Ghavidel5, Andrew F Jarnuczak1, Edgars Celms2,3, Kārlis Čerāns2,3, Inge Jonassen5, Lelde Lace2,3, Juan Antonio Vizcaíno1, Jyoti Sharma Choudhary4, Alvis Brazma1, Juris Viksna2,3.   

Abstract

Mass spectrometry (MS)-based quantitative proteomics experiments typically assay a subset of up to 60% of the ≈20 000 human protein coding genes. Computational methods for imputing the missing values using RNA expression data usually allow only for imputations of proteins measured in at least some of the samples. In silico methods for comprehensively estimating abundances across all proteins are still missing. Here, a novel method is proposed using deep learning to extrapolate the observed protein expression values in label-free MS experiments to all proteins, leveraging gene functional annotations and RNA measurements as key predictive attributes. This method is tested on four datasets, including human cell lines and human and mouse tissues. This method predicts the protein expression values with average R 2 scores between 0.46 and 0.54, which is significantly better than predictions based on correlations using the RNA expression data alone. Moreover, it is demonstrated that the derived models can be "transferred" across experiments and species. For instance, the model derived from human tissues gave a R 2 = 0.51 when applied to mouse tissue data. It is concluded that protein abundances generated in label-free MS experiments can be computationally predicted using functional annotated attributes and can be used to highlight aberrant protein abundance values.
© 2020 The Authors. Proteomics published by Wiley-VCH GmbH.

Entities:  

Keywords:  Gene Ontology; UniProt keywords; deep learning networks; mass spectrometry; protein abundance prediction

Year:  2020        PMID: 32937025      PMCID: PMC7757209          DOI: 10.1002/pmic.202000009

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


  30 in total

1.  Cellular source and mechanisms of high transcriptome complexity in the mammalian testis.

Authors:  Magali Soumillon; Anamaria Necsulea; Manuela Weier; David Brawand; Xiaolan Zhang; Hongcang Gu; Pauline Barthès; Maria Kokkinaki; Serge Nef; Andreas Gnirke; Martin Dym; Bernard de Massy; Tarjei S Mikkelsen; Henrik Kaessmann
Journal:  Cell Rep       Date:  2013-06-20       Impact factor: 9.423

2.  Accounting for the Multiple Natures of Missing Values in Label-Free Quantitative Proteomics Data Sets to Compare Imputation Strategies.

Authors:  Cosmin Lazar; Laurent Gatto; Myriam Ferro; Christophe Bruley; Thomas Burger
Journal:  J Proteome Res       Date:  2016-03-01       Impact factor: 4.466

3.  Global proteome analysis of the NCI-60 cell line panel.

Authors:  Amin Moghaddas Gholami; Hannes Hahne; Zhixiang Wu; Florian Johann Auer; Chen Meng; Mathias Wilhelm; Bernhard Kuster
Journal:  Cell Rep       Date:  2013-08-08       Impact factor: 9.423

4.  Immunogenetics. Dynamic profiling of the protein life cycle in response to pathogens.

Authors:  Marko Jovanovic; Michael S Rooney; Philipp Mertins; Dariusz Przybylski; Nicolas Chevrier; Rahul Satija; Edwin H Rodriguez; Alexander P Fields; Schraga Schwartz; Raktima Raychowdhury; Maxwell R Mumbach; Thomas Eisenhaure; Michal Rabani; Dave Gennert; Diana Lu; Toni Delorey; Jonathan S Weissman; Steven A Carr; Nir Hacohen; Aviv Regev
Journal:  Science       Date:  2015-02-12       Impact factor: 47.728

5.  The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity.

Authors:  Jordi Barretina; Giordano Caponigro; Nicolas Stransky; Kavitha Venkatesan; Adam A Margolin; Sungjoon Kim; Christopher J Wilson; Joseph Lehár; Gregory V Kryukov; Dmitriy Sonkin; Anupama Reddy; Manway Liu; Lauren Murray; Michael F Berger; John E Monahan; Paula Morais; Jodi Meltzer; Adam Korejwa; Judit Jané-Valbuena; Felipa A Mapa; Joseph Thibault; Eva Bric-Furlong; Pichai Raman; Aaron Shipway; Ingo H Engels; Jill Cheng; Guoying K Yu; Jianjun Yu; Peter Aspesi; Melanie de Silva; Kalpana Jagtap; Michael D Jones; Li Wang; Charles Hatton; Emanuele Palescandolo; Supriya Gupta; Scott Mahan; Carrie Sougnez; Robert C Onofrio; Ted Liefeld; Laura MacConaill; Wendy Winckler; Michael Reich; Nanxin Li; Jill P Mesirov; Stacey B Gabriel; Gad Getz; Kristin Ardlie; Vivien Chan; Vic E Myer; Barbara L Weber; Jeff Porter; Markus Warmuth; Peter Finan; Jennifer L Harris; Matthew Meyerson; Todd R Golub; Michael P Morrissey; William R Sellers; Robert Schlegel; Levi A Garraway
Journal:  Nature       Date:  2012-03-28       Impact factor: 49.962

6.  The PRIDE database and related tools and resources in 2019: improving support for quantification data.

Authors:  Yasset Perez-Riverol; Attila Csordas; Jingwen Bai; Manuel Bernal-Llinares; Suresh Hewapathirana; Deepti J Kundu; Avinash Inuganti; Johannes Griss; Gerhard Mayer; Martin Eisenacher; Enrique Pérez; Julian Uszkoreit; Julianus Pfeuffer; Timo Sachsenberg; Sule Yilmaz; Shivani Tiwary; Jürgen Cox; Enrique Audain; Mathias Walzer; Andrew F Jarnuczak; Tobias Ternent; Alvis Brazma; Juan Antonio Vizcaíno
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

7.  Quantification and discovery of sequence determinants of protein-per-mRNA amount in 29 human tissues.

Authors:  Basak Eraslan; Dongxue Wang; Mirjana Gusic; Holger Prokisch; Björn M Hallström; Mathias Uhlén; Anna Asplund; Frederik Pontén; Thomas Wieland; Thomas Hopf; Hannes Hahne; Bernhard Kuster; Julien Gagneur
Journal:  Mol Syst Biol       Date:  2019-02-18       Impact factor: 11.429

8.  A draft map of the human proteome.

Authors:  Min-Sik Kim; Sneha M Pinto; Derese Getnet; Raja Sekhar Nirujogi; Srikanth S Manda; Raghothama Chaerkady; Anil K Madugundu; Dhanashree S Kelkar; Ruth Isserlin; Shobhit Jain; Joji K Thomas; Babylakshmi Muthusamy; Pamela Leal-Rojas; Praveen Kumar; Nandini A Sahasrabuddhe; Lavanya Balakrishnan; Jayshree Advani; Bijesh George; Santosh Renuse; Lakshmi Dhevi N Selvan; Arun H Patil; Vishalakshi Nanjappa; Aneesha Radhakrishnan; Samarjeet Prasad; Tejaswini Subbannayya; Rajesh Raju; Manish Kumar; Sreelakshmi K Sreenivasamurthy; Arivusudar Marimuthu; Gajanan J Sathe; Sandip Chavan; Keshava K Datta; Yashwanth Subbannayya; Apeksha Sahu; Soujanya D Yelamanchi; Savita Jayaram; Pavithra Rajagopalan; Jyoti Sharma; Krishna R Murthy; Nazia Syed; Renu Goel; Aafaque A Khan; Sartaj Ahmad; Gourav Dey; Keshav Mudgal; Aditi Chatterjee; Tai-Chung Huang; Jun Zhong; Xinyan Wu; Patrick G Shaw; Donald Freed; Muhammad S Zahari; Kanchan K Mukherjee; Subramanian Shankar; Anita Mahadevan; Henry Lam; Christopher J Mitchell; Susarla Krishna Shankar; Parthasarathy Satishchandra; John T Schroeder; Ravi Sirdeshmukh; Anirban Maitra; Steven D Leach; Charles G Drake; Marc K Halushka; T S Keshava Prasad; Ralph H Hruban; Candace L Kerr; Gary D Bader; Christine A Iacobuzio-Donahue; Harsha Gowda; Akhilesh Pandey
Journal:  Nature       Date:  2014-05-29       Impact factor: 49.962

9.  Genomic Determinants of Protein Abundance Variation in Colorectal Cancer Cells.

Authors:  Theodoros I Roumeliotis; Steven P Williams; Emanuel Gonçalves; Clara Alsinet; Martin Del Castillo Velasco-Herrera; Nanne Aben; Fatemeh Zamanzad Ghavidel; Magali Michaut; Michael Schubert; Stacey Price; James C Wright; Lu Yu; Mi Yang; Rodrigo Dienstmann; Justin Guinney; Pedro Beltrao; Alvis Brazma; Mercedes Pardo; Oliver Stegle; David J Adams; Lodewyk Wessels; Julio Saez-Rodriguez; Ultan McDermott; Jyoti S Choudhary
Journal:  Cell Rep       Date:  2017-08-29       Impact factor: 9.423

10.  Sequential regulatory activity prediction across chromosomes with convolutional neural networks.

Authors:  David R Kelley; Yakir A Reshef; Maxwell Bileschi; David Belanger; Cory Y McLean; Jasper Snoek
Journal:  Genome Res       Date:  2018-03-27       Impact factor: 9.043

View more
  3 in total

1.  Predicting missing proteomics values using machine learning: Filling the gap using transcriptomics and other biological features.

Authors:  Juan Ochoteco Asensio; Marcha Verheijen; Florian Caiment
Journal:  Comput Struct Biotechnol J       Date:  2022-04-22       Impact factor: 6.155

2.  Experimental reproducibility limits the correlation between mRNA and protein abundances in tumor proteomic profiles.

Authors:  Swathi Ramachandra Upadhya; Colm J Ryan
Journal:  Cell Rep Methods       Date:  2022-09-08

3.  Using Deep Learning to Extrapolate Protein Expression Measurements.

Authors:  Mitra Parissa Barzine; Karlis Freivalds; James C Wright; Mārtiņš Opmanis; Darta Rituma; Fatemeh Zamanzad Ghavidel; Andrew F Jarnuczak; Edgars Celms; Kārlis Čerāns; Inge Jonassen; Lelde Lace; Juan Antonio Vizcaíno; Jyoti Sharma Choudhary; Alvis Brazma; Juris Viksna
Journal:  Proteomics       Date:  2020-10-16       Impact factor: 3.984

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.