Literature DB >> 23966112

Strengths and limitations of microarray-based phenotype prediction: lessons learned from the IMPROVER Diagnostic Signature Challenge.

Adi L Tarca1, Mario Lauria, Michael Unger, Erhan Bilal, Stephanie Boue, Kushal Kumar Dey, Julia Hoeng, Heinz Koeppl, Florian Martin, Pablo Meyer, Preetam Nandy, Raquel Norel, Manuel Peitsch, Jeremy J Rice, Roberto Romero, Gustavo Stolovitzky, Marja Talikka, Yang Xiang, Christoph Zechner.   

Abstract

MOTIVATION: After more than a decade since microarrays were used to predict phenotype of biological samples, real-life applications for disease screening and identification of patients who would best benefit from treatment are still emerging. The interest of the scientific community in identifying best approaches to develop such prediction models was reaffirmed in a competition style international collaboration called IMPROVER Diagnostic Signature Challenge whose results we describe herein.
RESULTS: Fifty-four teams used public data to develop prediction models in four disease areas including multiple sclerosis, lung cancer, psoriasis and chronic obstructive pulmonary disease, and made predictions on blinded new data that we generated. Teams were scored using three metrics that captured various aspects of the quality of predictions, and best performers were awarded. This article presents the challenge results and introduces to the community the approaches of the best overall three performers, as well as an R package that implements the approach of the best overall team. The analyses of model performance data submitted in the challenge as well as additional simulations that we have performed revealed that (i) the quality of predictions depends more on the disease endpoint than on the particular approaches used in the challenge; (ii) the most important modeling factor (e.g. data preprocessing, feature selection and classifier type) is problem dependent; and (iii) for optimal results datasets and methods have to be carefully matched. Biomedical factors such as the disease severity and confidence in diagnostic were found to be associated with the misclassification rates across the different teams. AVAILABILITY: The lung cancer dataset is available from Gene Expression Omnibus (accession, GSE43580). The maPredictDSC R package implementing the approach of the best overall team is available at www.bioconductor.org or http://bioinformaticsprb.med.wayne.edu/.

Entities:  

Mesh:

Year:  2013        PMID: 23966112      PMCID: PMC3810846          DOI: 10.1093/bioinformatics/btt492

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  23 in total

Review 1.  Assessing the accuracy of prediction algorithms for classification: an overview.

Authors:  P Baldi; S Brunak; Y Chauvin; C A Andersen; H Nielsen
Journal:  Bioinformatics       Date:  2000-05       Impact factor: 6.937

2.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

3.  The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models.

Authors:  Leming Shi; Gregory Campbell; Wendell D Jones; Fabien Campagne; Zhining Wen; Stephen J Walker; Zhenqiang Su; Tzu-Ming Chu; Federico M Goodsaid; Lajos Pusztai; John D Shaughnessy; André Oberthuer; Russell S Thomas; Richard S Paules; Mark Fielden; Bart Barlogie; Weijie Chen; Pan Du; Matthias Fischer; Cesare Furlanello; Brandon D Gallas; Xijin Ge; Dalila B Megherbi; W Fraser Symmans; May D Wang; John Zhang; Hans Bitter; Benedikt Brors; Pierre R Bushel; Max Bylesjo; Minjun Chen; Jie Cheng; Jing Cheng; Jeff Chou; Timothy S Davison; Mauro Delorenzi; Youping Deng; Viswanath Devanarayan; David J Dix; Joaquin Dopazo; Kevin C Dorff; Fathi Elloumi; Jianqing Fan; Shicai Fan; Xiaohui Fan; Hong Fang; Nina Gonzaludo; Kenneth R Hess; Huixiao Hong; Jun Huan; Rafael A Irizarry; Richard Judson; Dilafruz Juraeva; Samir Lababidi; Christophe G Lambert; Li Li; Yanen Li; Zhen Li; Simon M Lin; Guozhen Liu; Edward K Lobenhofer; Jun Luo; Wen Luo; Matthew N McCall; Yuri Nikolsky; Gene A Pennello; Roger G Perkins; Reena Philip; Vlad Popovici; Nathan D Price; Feng Qian; Andreas Scherer; Tieliu Shi; Weiwei Shi; Jaeyun Sung; Danielle Thierry-Mieg; Jean Thierry-Mieg; Venkata Thodima; Johan Trygg; Lakshmi Vishnuvajjala; Sue Jane Wang; Jianping Wu; Yichao Wu; Qian Xie; Waleed A Yousef; Liang Zhang; Xuegong Zhang; Sheng Zhong; Yiming Zhou; Sheng Zhu; Dhivya Arasappan; Wenjun Bao; Anne Bergstrom Lucas; Frank Berthold; Richard J Brennan; Andreas Buness; Jennifer G Catalano; Chang Chang; Rong Chen; Yiyu Cheng; Jian Cui; Wendy Czika; Francesca Demichelis; Xutao Deng; Damir Dosymbekov; Roland Eils; Yang Feng; Jennifer Fostel; Stephanie Fulmer-Smentek; James C Fuscoe; Laurent Gatto; Weigong Ge; Darlene R Goldstein; Li Guo; Donald N Halbert; Jing Han; Stephen C Harris; Christos Hatzis; Damir Herman; Jianping Huang; Roderick V Jensen; Rui Jiang; Charles D Johnson; Giuseppe Jurman; Yvonne Kahlert; Sadik A Khuder; Matthias Kohl; Jianying Li; Li Li; Menglong Li; Quan-Zhen Li; Shao Li; Zhiguang Li; Jie Liu; Ying Liu; Zhichao Liu; Lu Meng; Manuel Madera; Francisco Martinez-Murillo; Ignacio Medina; Joseph Meehan; Kelci Miclaus; Richard A Moffitt; David Montaner; Piali Mukherjee; George J Mulligan; Padraic Neville; Tatiana Nikolskaya; Baitang Ning; Grier P Page; Joel Parker; R Mitchell Parry; Xuejun Peng; Ron L Peterson; John H Phan; Brian Quanz; Yi Ren; Samantha Riccadonna; Alan H Roter; Frank W Samuelson; Martin M Schumacher; Joseph D Shambaugh; Qiang Shi; Richard Shippy; Shengzhu Si; Aaron Smalter; Christos Sotiriou; Mat Soukup; Frank Staedtler; Guido Steiner; Todd H Stokes; Qinglan Sun; Pei-Yi Tan; Rong Tang; Zivana Tezak; Brett Thorn; Marina Tsyganova; Yaron Turpaz; Silvia C Vega; Roberto Visintainer; Juergen von Frese; Charles Wang; Eric Wang; Junwei Wang; Wei Wang; Frank Westermann; James C Willey; Matthew Woods; Shujian Wu; Nianqing Xiao; Joshua Xu; Lei Xu; Lun Yang; Xiao Zeng; Jialu Zhang; Li Zhang; Min Zhang; Chen Zhao; Raj K Puri; Uwe Scherf; Weida Tong; Russell D Wolfinger
Journal:  Nat Biotechnol       Date:  2010-07-30       Impact factor: 54.908

4.  Quantitative monitoring of gene expression patterns with a complementary DNA microarray.

Authors:  M Schena; D Shalon; R W Davis; P O Brown
Journal:  Science       Date:  1995-10-20       Impact factor: 47.728

5.  Molecular classification of cutaneous malignant melanoma by gene expression profiling.

Authors:  M Bittner; P Meltzer; Y Chen; Y Jiang; E Seftor; M Hendrix; M Radmacher; R Simon; Z Yakhini; A Ben-Dor; N Sampas; E Dougherty; E Wang; F Marincola; C Gooden; J Lueders; A Glatfelter; P Pollock; J Carpten; E Gillanders; D Leja; K Dietrich; C Beaudry; M Berens; D Alberts; V Sondak
Journal:  Nature       Date:  2000-08-03       Impact factor: 49.962

6.  Molecular portraits of human breast tumours.

Authors:  C M Perou; T Sørlie; M B Eisen; M van de Rijn; S S Jeffrey; C A Rees; J R Pollack; D T Ross; H Johnsen; L A Akslen; O Fluge; A Pergamenschikov; C Williams; S X Zhu; P E Lønning; A L Børresen-Dale; P O Brown; D Botstein
Journal:  Nature       Date:  2000-08-17       Impact factor: 49.962

7.  A comparison of batch effect removal methods for enhancement of prediction performance using MAQC-II microarray gene expression data.

Authors:  J Luo; M Schumacher; A Scherer; D Sanoudou; D Megherbi; T Davison; T Shi; W Tong; L Shi; H Hong; C Zhao; F Elloumi; W Shi; R Thomas; S Lin; G Tillinghast; G Liu; Y Zhou; D Herman; Y Li; Y Deng; H Fang; P Bushel; M Woods; J Zhang
Journal:  Pharmacogenomics J       Date:  2010-08       Impact factor: 3.550

8.  Comparison of molecular subtyping with BluePrint, MammaPrint, and TargetPrint to local clinical subtyping in breast cancer patients.

Authors:  Bichlien Nguyen; Pino G Cusumano; Kenneth Deck; Deborah Kerlin; Agustin A Garcia; Julie L Barone; Edgardo Rivera; Katharine Yao; Femke A de Snoo; Jeroen van den Akker; Lisette Stork-Sloots; Daniele Generali
Journal:  Ann Surg Oncol       Date:  2012-08-15       Impact factor: 5.344

Review 9.  Industrial methodology for process verification in research (IMPROVER): toward systems biology verification.

Authors:  Pablo Meyer; Julia Hoeng; J Jeremy Rice; Raquel Norel; Jörg Sprengel; Katrin Stolle; Thomas Bonk; Stephanie Corthesy; Ajay Royyuru; Manuel C Peitsch; Gustavo Stolovitzky
Journal:  Bioinformatics       Date:  2012-03-14       Impact factor: 6.937

Review 10.  Machine learning and its applications to biology.

Authors:  Adi L Tarca; Vincent J Carey; Xue-wen Chen; Roberto Romero; Sorin Drăghici
Journal:  PLoS Comput Biol       Date:  2007-06       Impact factor: 4.475

View more
  57 in total

1.  The maternal plasma proteome changes as a function of gestational age in normal pregnancy: a longitudinal study.

Authors:  Roberto Romero; Offer Erez; Eli Maymon; Piya Chaemsaithong; Zhonghui Xu; Percy Pacora; Tinnakorn Chaiworapongsa; Bogdan Done; Sonia S Hassan; Adi L Tarca
Journal:  Am J Obstet Gynecol       Date:  2017-03-03       Impact factor: 8.661

2.  Predicting protein phosphorylation from gene expression: top methods from the IMPROVER Species Translation Challenge.

Authors:  Adel Dayarian; Roberto Romero; Zhiming Wang; Michael Biehl; Erhan Bilal; Sahand Hormoz; Pablo Meyer; Raquel Norel; Kahn Rhrissorrakrai; Gyan Bhanot; Feng Luo; Adi L Tarca
Journal:  Bioinformatics       Date:  2014-07-23       Impact factor: 6.937

3.  Robust meta-analysis of gene expression using the elastic net.

Authors:  Jacob J Hughey; Atul J Butte
Journal:  Nucleic Acids Res       Date:  2015-03-31       Impact factor: 16.971

4.  Human blood gene signature as a marker for smoking exposure: computational approaches of the top ranked teams in the sbv IMPROVER Systems Toxicology challenge.

Authors:  Adi L Tarca; Xiaofeng Gong; Roberto Romero; Wenxin Yang; Zhongqu Duan; Hao Yang; Chengfang Zhang; Peixuan Wang
Journal:  Comput Toxicol       Date:  2017-07-18

5.  Species translatable blood gene signature as a marker of exposure to smoking: computational approaches of the top ranked teams in the sbv IMPROVER Systems Toxicology challenge.

Authors:  Ömer Sinan Saraç; Rahul Kumar; Sandeep Kumar Dhanda; Ali Tuğrul Balcı; İsmail Bilgen; Roberto Romero; Adi L Tarca
Journal:  Comput Toxicol       Date:  2017-04-28

6.  Enhancement of COPD biological networks using a web-based collaboration interface.

Authors:  Stephanie Boue; Brett Fields; Julia Hoeng; Jennifer Park; Manuel C Peitsch; Walter K Schlage; Marja Talikka; Ilona Binenbaum; Vladimir Bondarenko; Oleg V Bulgakov; Vera Cherkasova; Norberto Diaz-Diaz; Larisa Fedorova; Svetlana Guryanova; Julia Guzova; Galina Igorevna Koroleva; Elena Kozhemyakina; Rahul Kumar; Noa Lavid; Qingxian Lu; Swapna Menon; Yael Ouliel; Samantha C Peterson; Alexander Prokhorov; Edward Sanders; Sarah Schrier; Golan Schwaitzer Neta; Irina Shvydchenko; Aravind Tallam; Gema Villa-Fombuena; John Wu; Ilya Yudkevich; Mariya Zelikman
Journal:  F1000Res       Date:  2015-01-29

7.  Systems biology approaches to study the molecular effects of caloric restriction and polyphenols on aging processes.

Authors:  Sébastien Lacroix; Mario Lauria; Marie-Pier Scott-Boyer; Luca Marchetti; Corrado Priami; Laura Caberlotto
Journal:  Genes Nutr       Date:  2015-11-25       Impact factor: 5.523

8.  The sbv IMPROVER Systems Toxicology Computational Challenge: Identification of Human and Species-Independent Blood Response Markers as Predictors of Smoking Exposure and Cessation Status.

Authors:  Vincenzo Belcastro; Carine Poussin; Yang Xiang; Maurizio Giordano; Kumar Parijat Tripathi; Akash Boda; Stéphanie Boué; Mario Guarracino; Florian Martin; Manuel C Peitsch; Julia Hoeng; Roberto Romero; Adi L Tarca; Zhongqu Duan; Hao Yang; Xiaofeng Gong; Peixuan Wang; Chenfang Zhang; Wenxin Yang; Omer Sinan Sarac; Ismail Bilgen; Ali Tugrul Balci; Rahul Kumar; Sandeep Kumar Dhanda
Journal:  Comput Toxicol       Date:  2017-07-14

9.  Viral fibrotic scoring and drug screen based on MAPK activity uncovers EGFR as a key regulator of COVID-19 fibrosis.

Authors:  Elmira R Vagapova; Timofey D Lebedev; Vladimir S Prassolov
Journal:  Sci Rep       Date:  2021-05-27       Impact factor: 4.379

10.  Crowdsourcing assessment of maternal blood multi-omics for predicting gestational age and preterm birth.

Authors:  Adi L Tarca; Bálint Ármin Pataki; Roberto Romero; Marina Sirota; Yuanfang Guan; Rintu Kutum; Nardhy Gomez-Lopez; Bogdan Done; Gaurav Bhatti; Thomas Yu; Gaia Andreoletti; Tinnakorn Chaiworapongsa; Sonia S Hassan; Chaur-Dong Hsu; Nima Aghaeepour; Gustavo Stolovitzky; Istvan Csabai; James C Costello
Journal:  Cell Rep Med       Date:  2021-06-15
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.