Adel Dayarian1, Roberto Romero1, Zhiming Wang2, Michael Biehl1, Erhan Bilal1, Sahand Hormoz1, Pablo Meyer1, Raquel Norel1, Kahn Rhrissorrakrai1, Gyan Bhanot1, Feng Luo1, Adi L Tarca2. 1. Kavli Institute for Theoretical Physics, Kohn Hall, University of California, Santa Barbara, CA 93106, Perinatology Research Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD, and Detroit, MI 48201, USA, College of Plant Protection and College of Science, Hunan Agricultural University, Changsha, 410128, China, School of Computing, Clemson University, Clemson, SC 29634, USA, Johann Bernoulli Institute for Mathematics and Computer Science, University of Groningen, 9700 AK Groningen, The Netherlands, IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, Department of Molecular Biology and Biochemistry, Department of Physics and BioMaPS Institute, Rutgers University, Piscataway, NJ 08854 and Department of Computer Science, Wayne State University, Detroit, MI 48202, USA. 2. Kavli Institute for Theoretical Physics, Kohn Hall, University of California, Santa Barbara, CA 93106, Perinatology Research Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD, and Detroit, MI 48201, USA, College of Plant Protection and College of Science, Hunan Agricultural University, Changsha, 410128, China, School of Computing, Clemson University, Clemson, SC 29634, USA, Johann Bernoulli Institute for Mathematics and Computer Science, University of Groningen, 9700 AK Groningen, The Netherlands, IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, Department of Molecular Biology and Biochemistry, Department of Physics and BioMaPS Institute, Rutgers University, Piscataway, NJ 08854 and Department of Computer Science, Wayne State University, Detroit, MI 48202, USA Kavli Institute for Theoretical Physics, Kohn Hall, University of California, Santa Barbara, CA 93106, Perinatology Research Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD, and Detroit, MI 48201, USA, College of Plant Protection and College of Science, Hunan Agricultural University, Changsha, 410128, China, School of Computing, Clemson University, Clemson, SC 29634, USA, Johann Bernoulli Institute for Mathematics and Computer Science, University of Groningen, 9700 AK Groningen, The Netherlands, IBM T.J. Watson Research Center, Computational Biology Center, Yorktown Heights, NY 10003, Department of Molecular Biology and Biochemistry, Department of Physics and BioMaPS Institute, Rutgers University, Piscataway, NJ 08854 and Department of Computer Science, Wayne State University, Detroit, MI 48202, USA.
Abstract
MOTIVATION: Using gene expression to infer changes in protein phosphorylation levels induced in cells by various stimuli is an outstanding problem. The intra-species protein phosphorylation challenge organized by the IMPROVER consortium provided the framework to identify the best approaches to address this issue. RESULTS: Rat lung epithelial cells were treated with 52 stimuli, and gene expression and phosphorylation levels were measured. Competing teams used gene expression data from 26 stimuli to develop protein phosphorylation prediction models and were ranked based on prediction performance for the remaining 26 stimuli. Three teams were tied in first place in this challenge achieving a balanced accuracy of about 70%, indicating that gene expression is only moderately predictive of protein phosphorylation. In spite of the similar performance, the approaches used by these three teams, described in detail in this article, were different, with the average number of predictor genes per phosphoprotein used by the teams ranging from 3 to 124. However, a significant overlap of gene signatures between teams was observed for the majority of the proteins considered, while Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were enriched in the union of the predictor genes of the three teams for multiple proteins. AVAILABILITY AND IMPLEMENTATION: Gene expression and protein phosphorylation data are available from ArrayExpress (E-MTAB-2091). Software implementation of the approach of Teams 49 and 75 are available at http://bioinformaticsprb.med.wayne.edu and http://people.cs.clemson.edu/∼luofeng/sbv.rar, respectively. CONTACT: gyanbhanot@gmail.com or luofeng@clemson.edu or atarca@med.wayne.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2014. This work is written by US Government employees and is in the public domain in the US.
MOTIVATION: Using gene expression to infer changes in protein phosphorylation levels induced in cells by various stimuli is an outstanding problem. The intra-species protein phosphorylation challenge organized by the IMPROVER consortium provided the framework to identify the best approaches to address this issue. RESULTS:Rat lung epithelial cells were treated with 52 stimuli, and gene expression and phosphorylation levels were measured. Competing teams used gene expression data from 26 stimuli to develop protein phosphorylation prediction models and were ranked based on prediction performance for the remaining 26 stimuli. Three teams were tied in first place in this challenge achieving a balanced accuracy of about 70%, indicating that gene expression is only moderately predictive of protein phosphorylation. In spite of the similar performance, the approaches used by these three teams, described in detail in this article, were different, with the average number of predictor genes per phosphoprotein used by the teams ranging from 3 to 124. However, a significant overlap of gene signatures between teams was observed for the majority of the proteins considered, while Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were enriched in the union of the predictor genes of the three teams for multiple proteins. AVAILABILITY AND IMPLEMENTATION: Gene expression and protein phosphorylation data are available from ArrayExpress (E-MTAB-2091). Software implementation of the approach of Teams 49 and 75 are available at http://bioinformaticsprb.med.wayne.edu and http://people.cs.clemson.edu/∼luofeng/sbv.rar, respectively. CONTACT: gyanbhanot@gmail.com or luofeng@clemson.edu or atarca@med.wayne.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2014. This work is written by US Government employees and is in the public domain in the US.
Authors: Adi L Tarca; Mario Lauria; Michael Unger; Erhan Bilal; Stephanie Boue; Kushal Kumar Dey; Julia Hoeng; Heinz Koeppl; Florian Martin; Pablo Meyer; Preetam Nandy; Raquel Norel; Manuel Peitsch; Jeremy J Rice; Roberto Romero; Gustavo Stolovitzky; Marja Talikka; Yang Xiang; Christoph Zechner Journal: Bioinformatics Date: 2013-08-20 Impact factor: 6.937
Authors: Mohamed Bentires-Alj; J Guillermo Paez; Frank S David; Heike Keilhack; Balazs Halmos; Katsuhiko Naoki; John M Maris; Andrea Richardson; Alberto Bardelli; David J Sugarbaker; William G Richards; Jinyan Du; Luc Girard; John D Minna; Mignon L Loh; David E Fisher; Victor E Velculescu; Bert Vogelstein; Matthew Meyerson; William R Sellers; Benjamin G Neel Journal: Cancer Res Date: 2004-12-15 Impact factor: 12.701
Authors: Robert C Gentleman; Vincent J Carey; Douglas M Bates; Ben Bolstad; Marcel Dettling; Sandrine Dudoit; Byron Ellis; Laurent Gautier; Yongchao Ge; Jeff Gentry; Kurt Hornik; Torsten Hothorn; Wolfgang Huber; Stefano Iacus; Rafael Irizarry; Friedrich Leisch; Cheng Li; Martin Maechler; Anthony J Rossini; Gunther Sawitzki; Colin Smith; Gordon Smyth; Luke Tierney; Jean Y H Yang; Jianhua Zhang Journal: Genome Biol Date: 2004-09-15 Impact factor: 13.583
Authors: Junhee Seok; H Shaw Warren; Alex G Cuenca; Michael N Mindrinos; Henry V Baker; Weihong Xu; Daniel R Richards; Grace P McDonald-Smith; Hong Gao; Laura Hennessy; Celeste C Finnerty; Cecilia M López; Shari Honari; Ernest E Moore; Joseph P Minei; Joseph Cuschieri; Paul E Bankey; Jeffrey L Johnson; Jason Sperry; Avery B Nathens; Timothy R Billiar; Michael A West; Marc G Jeschke; Matthew B Klein; Richard L Gamelli; Nicole S Gibran; Bernard H Brownstein; Carol Miller-Graziano; Steve E Calvano; Philip H Mason; J Perren Cobb; Laurence G Rahme; Stephen F Lowry; Ronald V Maier; Lyle L Moldawer; David N Herndon; Ronald W Davis; Wenzhong Xiao; Ronald G Tompkins Journal: Proc Natl Acad Sci U S A Date: 2013-02-11 Impact factor: 11.205
Authors: Christoph Hafemeister; Roberto Romero; Erhan Bilal; Pablo Meyer; Raquel Norel; Kahn Rhrissorrakrai; Richard Bonneau; Adi L Tarca Journal: Bioinformatics Date: 2014-08-22 Impact factor: 6.937
Authors: George Simon; Courtney D DiNardo; Koichi Takahashi; Tina Cascone; Cynthia Powers; Rick Stevens; Joshua Allen; Mara B Antonoff; Daniel Gomez; Pat Keane; Fernando Suarez Saiz; Quynh Nguyen; Emily Roarty; Sherry Pierce; Jianjun Zhang; Emily Hardeman Barnhill; Kate Lakhani; Kenna Shaw; Brett Smith; Stephen Swisher; Rob High; P Andrew Futreal; John Heymach; Lynda Chin Journal: Oncologist Date: 2018-11-16
Authors: Ömer Sinan Saraç; Rahul Kumar; Sandeep Kumar Dhanda; Ali Tuğrul Balcı; İsmail Bilgen; Roberto Romero; Adi L Tarca Journal: Comput Toxicol Date: 2017-04-28