Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Using Deep Learning to Extrapolate Protein Expression Measurements.

Literature DB >> 32937025

Using Deep Learning to Extrapolate Protein Expression Measurements.

Mitra Parissa Barzine¹, Karlis Freivalds^2,3, James C Wright⁴, Mārtiņš Opmanis², Darta Rituma^2,3, Fatemeh Zamanzad Ghavidel⁵, Andrew F Jarnuczak¹, Edgars Celms^2,3, Kārlis Čerāns^2,3, Inge Jonassen⁵, Lelde Lace^2,3, Juan Antonio Vizcaíno¹, Jyoti Sharma Choudhary⁴, Alvis Brazma¹, Juris Viksna^2,3.

Abstract

Mass spectrometry (MS)-based quantitative proteomics experiments typically assay a subset of up to 60% of the ≈20 000 human protein coding genes. Computational methods for imputing the missing values using RNA expression data usually allow only for imputations of proteins measured in at least some of the samples. In silico methods for comprehensively estimating abundances across all proteins are still missing. Here, a novel method is proposed using deep learning to extrapolate the observed protein expression values in label-free MS experiments to all proteins, leveraging gene functional annotations and RNA measurements as key predictive attributes. This method is tested on four datasets, including human cell lines and human and mouse tissues. This method predicts the protein expression values with average R 2 scores between 0.46 and 0.54, which is significantly better than predictions based on correlations using the RNA expression data alone. Moreover, it is demonstrated that the derived models can be "transferred" across experiments and species. For instance, the model derived from human tissues gave a R 2 = 0.51 when applied to mouse tissue data. It is concluded that protein abundances generated in label-free MS experiments can be computationally predicted using functional annotated attributes and can be used to highlight aberrant protein abundance values.

Entities: CellLine Disease Gene Species

Keywords: Gene Ontology; UniProt keywords; deep learning networks; mass spectrometry; protein abundance prediction

Year: 2020 PMID： 32937025 PMCID： PMC7757209 DOI： 10.1002/pmic.202000009

Source DB: PubMed Journal: Proteomics ISSN： 1615-9853 Impact factor: 3.984

30 in total

1. Cellular source and mechanisms of high transcriptome complexity in the mammalian testis.

Authors: Magali Soumillon; Anamaria Necsulea; Manuela Weier; David Brawand; Xiaolan Zhang; Hongcang Gu; Pauline Barthès; Maria Kokkinaki; Serge Nef; Andreas Gnirke; Martin Dym; Bernard de Massy; Tarjei S Mikkelsen; Henrik Kaessmann
Journal: Cell Rep Date: 2013-06-20 Impact factor: 9.423

2. Accounting for the Multiple Natures of Missing Values in Label-Free Quantitative Proteomics Data Sets to Compare Imputation Strategies.

Authors: Cosmin Lazar; Laurent Gatto; Myriam Ferro; Christophe Bruley; Thomas Burger
Journal: J Proteome Res Date: 2016-03-01 Impact factor: 4.466

3. Global proteome analysis of the NCI-60 cell line panel.

Authors: Amin Moghaddas Gholami; Hannes Hahne; Zhixiang Wu; Florian Johann Auer; Chen Meng; Mathias Wilhelm; Bernhard Kuster
Journal: Cell Rep Date: 2013-08-08 Impact factor: 9.423

4. Immunogenetics. Dynamic profiling of the protein life cycle in response to pathogens.

Authors: Marko Jovanovic; Michael S Rooney; Philipp Mertins; Dariusz Przybylski; Nicolas Chevrier; Rahul Satija; Edwin H Rodriguez; Alexander P Fields; Schraga Schwartz; Raktima Raychowdhury; Maxwell R Mumbach; Thomas Eisenhaure; Michal Rabani; Dave Gennert; Diana Lu; Toni Delorey; Jonathan S Weissman; Steven A Carr; Nir Hacohen; Aviv Regev
Journal: Science Date: 2015-02-12 Impact factor: 47.728

5. The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity.

Authors: Jordi Barretina; Giordano Caponigro; Nicolas Stransky; Kavitha Venkatesan; Adam A Margolin; Sungjoon Kim; Christopher J Wilson; Joseph Lehár; Gregory V Kryukov; Dmitriy Sonkin; Anupama Reddy; Manway Liu; Lauren Murray; Michael F Berger; John E Monahan; Paula Morais; Jodi Meltzer; Adam Korejwa; Judit Jané-Valbuena; Felipa A Mapa; Joseph Thibault; Eva Bric-Furlong; Pichai Raman; Aaron Shipway; Ingo H Engels; Jill Cheng; Guoying K Yu; Jianjun Yu; Peter Aspesi; Melanie de Silva; Kalpana Jagtap; Michael D Jones; Li Wang; Charles Hatton; Emanuele Palescandolo; Supriya Gupta; Scott Mahan; Carrie Sougnez; Robert C Onofrio; Ted Liefeld; Laura MacConaill; Wendy Winckler; Michael Reich; Nanxin Li; Jill P Mesirov; Stacey B Gabriel; Gad Getz; Kristin Ardlie; Vivien Chan; Vic E Myer; Barbara L Weber; Jeff Porter; Markus Warmuth; Peter Finan; Jennifer L Harris; Matthew Meyerson; Todd R Golub; Michael P Morrissey; William R Sellers; Robert Schlegel; Levi A Garraway
Journal: Nature Date: 2012-03-28 Impact factor: 49.962

6. The PRIDE database and related tools and resources in 2019: improving support for quantification data.

Authors: Yasset Perez-Riverol; Attila Csordas; Jingwen Bai; Manuel Bernal-Llinares; Suresh Hewapathirana; Deepti J Kundu; Avinash Inuganti; Johannes Griss; Gerhard Mayer; Martin Eisenacher; Enrique Pérez; Julian Uszkoreit; Julianus Pfeuffer; Timo Sachsenberg; Sule Yilmaz; Shivani Tiwary; Jürgen Cox; Enrique Audain; Mathias Walzer; Andrew F Jarnuczak; Tobias Ternent; Alvis Brazma; Juan Antonio Vizcaíno
Journal: Nucleic Acids Res Date: 2019-01-08 Impact factor: 16.971

7. Quantification and discovery of sequence determinants of protein-per-mRNA amount in 29 human tissues.

Authors: Basak Eraslan; Dongxue Wang; Mirjana Gusic; Holger Prokisch; Björn M Hallström; Mathias Uhlén; Anna Asplund; Frederik Pontén; Thomas Wieland; Thomas Hopf; Hannes Hahne; Bernhard Kuster; Julien Gagneur
Journal: Mol Syst Biol Date: 2019-02-18 Impact factor: 11.429

8. A draft map of the human proteome.

Authors: Min-Sik Kim; Sneha M Pinto; Derese Getnet; Raja Sekhar Nirujogi; Srikanth S Manda; Raghothama Chaerkady; Anil K Madugundu; Dhanashree S Kelkar; Ruth Isserlin; Shobhit Jain; Joji K Thomas; Babylakshmi Muthusamy; Pamela Leal-Rojas; Praveen Kumar; Nandini A Sahasrabuddhe; Lavanya Balakrishnan; Jayshree Advani; Bijesh George; Santosh Renuse; Lakshmi Dhevi N Selvan; Arun H Patil; Vishalakshi Nanjappa; Aneesha Radhakrishnan; Samarjeet Prasad; Tejaswini Subbannayya; Rajesh Raju; Manish Kumar; Sreelakshmi K Sreenivasamurthy; Arivusudar Marimuthu; Gajanan J Sathe; Sandip Chavan; Keshava K Datta; Yashwanth Subbannayya; Apeksha Sahu; Soujanya D Yelamanchi; Savita Jayaram; Pavithra Rajagopalan; Jyoti Sharma; Krishna R Murthy; Nazia Syed; Renu Goel; Aafaque A Khan; Sartaj Ahmad; Gourav Dey; Keshav Mudgal; Aditi Chatterjee; Tai-Chung Huang; Jun Zhong; Xinyan Wu; Patrick G Shaw; Donald Freed; Muhammad S Zahari; Kanchan K Mukherjee; Subramanian Shankar; Anita Mahadevan; Henry Lam; Christopher J Mitchell; Susarla Krishna Shankar; Parthasarathy Satishchandra; John T Schroeder; Ravi Sirdeshmukh; Anirban Maitra; Steven D Leach; Charles G Drake; Marc K Halushka; T S Keshava Prasad; Ralph H Hruban; Candace L Kerr; Gary D Bader; Christine A Iacobuzio-Donahue; Harsha Gowda; Akhilesh Pandey
Journal: Nature Date: 2014-05-29 Impact factor: 49.962

9. Genomic Determinants of Protein Abundance Variation in Colorectal Cancer Cells.

Authors: Theodoros I Roumeliotis; Steven P Williams; Emanuel Gonçalves; Clara Alsinet; Martin Del Castillo Velasco-Herrera; Nanne Aben; Fatemeh Zamanzad Ghavidel; Magali Michaut; Michael Schubert; Stacey Price; James C Wright; Lu Yu; Mi Yang; Rodrigo Dienstmann; Justin Guinney; Pedro Beltrao; Alvis Brazma; Mercedes Pardo; Oliver Stegle; David J Adams; Lodewyk Wessels; Julio Saez-Rodriguez; Ultan McDermott; Jyoti S Choudhary
Journal: Cell Rep Date: 2017-08-29 Impact factor: 9.423

10. Sequential regulatory activity prediction across chromosomes with convolutional neural networks.

Authors: David R Kelley; Yakir A Reshef; Maxwell Bileschi; David Belanger; Cory Y McLean; Jasper Snoek
Journal: Genome Res Date: 2018-03-27 Impact factor: 9.043

3 in total

1. Predicting missing proteomics values using machine learning: Filling the gap using transcriptomics and other biological features.

Authors: Juan Ochoteco Asensio; Marcha Verheijen; Florian Caiment
Journal: Comput Struct Biotechnol J Date: 2022-04-22 Impact factor: 6.155

2. Experimental reproducibility limits the correlation between mRNA and protein abundances in tumor proteomic profiles.

Authors: Swathi Ramachandra Upadhya; Colm J Ryan
Journal: Cell Rep Methods Date: 2022-09-08

3. Using Deep Learning to Extrapolate Protein Expression Measurements.

Authors: Mitra Parissa Barzine; Karlis Freivalds; James C Wright; Mārtiņš Opmanis; Darta Rituma; Fatemeh Zamanzad Ghavidel; Andrew F Jarnuczak; Edgars Celms; Kārlis Čerāns; Inge Jonassen; Lelde Lace; Juan Antonio Vizcaíno; Jyoti Sharma Choudhary; Alvis Brazma; Juris Viksna
Journal: Proteomics Date: 2020-10-16 Impact factor: 3.984

3 in total