Literature DB >> 23321025

A structured approach to predictive modeling of a two-class problem using multidimensional data sets.

Heidi Spratt1, Hyunsu Ju, Allan R Brasier.   

Abstract

Biological experiments in the post-genome era can generate a staggering amount of complex data that challenges experimentalists to extract meaningful information. Increasingly, the success of an appropriately controlled experiment relies on a robust data analysis pipeline. In this paper, we present a structured approach to the analysis of multidimensional data that relies on a close, two-way communication between the bioinformatician and experimentalist. A sequential approach employing data exploration (visualization, graphical and analytical study), pre-processing, feature reduction and supervised classification using machine learning is presented. This standardized approach is illustrated by an example from a proteomic data analysis that has been used to predict the risk of infectious disease outcome. Strategies for model selection and post hoc model diagnostics are presented and applied to the case illustration. We discuss some of the practical lessons we have learned applying supervised classification to multidimensional data sets, one of which is the importance of feature reduction in achieving optimal modeling performance.
Copyright © 2013 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23321025      PMCID: PMC3661737          DOI: 10.1016/j.ymeth.2013.01.002

Source DB:  PubMed          Journal:  Methods        ISSN: 1046-2023            Impact factor:   3.608


  12 in total

1.  Clustering gene expression patterns.

Authors:  A Ben-Dor; R Shamir; Z Yakhini
Journal:  J Comput Biol       Date:  1999 Fall-Winter       Impact factor: 1.479

2.  Significance analysis of microarrays applied to the ionizing radiation response.

Authors:  V G Tusher; R Tibshirani; G Chu
Journal:  Proc Natl Acad Sci U S A       Date:  2001-04-17       Impact factor: 11.205

3.  Missing value estimation methods for DNA microarrays.

Authors:  O Troyanskaya; M Cantor; G Sherlock; P Brown; T Hastie; R Tibshirani; D Botstein; R B Altman
Journal:  Bioinformatics       Date:  2001-06       Impact factor: 6.937

4.  Tree and spline based association analysis of gene-gene interaction models for ischemic stroke.

Authors:  Nancy R Cook; Robert Y L Zee; Paul M Ridker
Journal:  Stat Med       Date:  2004-05-15       Impact factor: 2.373

5.  A three-component biomarker panel for prediction of dengue hemorrhagic fever.

Authors:  Allan R Brasier; Hyunsu Ju; Josefina Garcia; Heidi M Spratt; Sundar S Victor; Brett M Forshey; Eric S Halsey; Guillermo Comach; Gloria Sierra; Patrick J Blair; Claudio Rocha; Amy C Morrison; Thomas W Scott; Isabel Bazan; Tadeusz J Kochel
Journal:  Am J Trop Med Hyg       Date:  2012-02       Impact factor: 2.345

6.  More is less: signal processing and the data deluge.

Authors:  Richard G Baraniuk
Journal:  Science       Date:  2011-02-11       Impact factor: 47.728

7.  What information should be required to support clinical "omics" publications?

Authors:  Keith A Baggerly; Kevin R Coombes
Journal:  Clin Chem       Date:  2011-05       Impact factor: 8.327

8.  Generalized additive models for medical research.

Authors:  T Hastie; R Tibshirani
Journal:  Stat Methods Med Res       Date:  1995-09       Impact factor: 3.021

9.  Predicting intermediate phenotypes in asthma using bronchoalveolar lavage-derived cytokines.

Authors:  Allan R Brasier; Sundar Victor; Hyunsu Ju; William W Busse; Douglas Curran-Everett; Eugene Bleecker; Mario Castro; Kian Fan Chung; Benjamin Gaston; Elliot Israel; Sally E Wenzel; Serpil C Erzurum; Nizar N Jarjour; William J Calhoun
Journal:  Clin Transl Sci       Date:  2010-08       Impact factor: 4.689

10.  Learning from our GWAS mistakes: from experimental design to scientific method.

Authors:  Christophe G Lambert; Laura J Black
Journal:  Biostatistics       Date:  2012-01-27       Impact factor: 5.899

View more
  8 in total

1.  Molecular classification of outcomes from dengue virus -3 infections.

Authors:  Allan R Brasier; Yingxin Zhao; John E Wiktorowicz; Heidi M Spratt; Eduardo J M Nascimento; Marli T Cordeiro; Kizhake V Soman; Hyunsu Ju; Adrian Recinos; Susan Stafford; Zheng Wu; Ernesto T A Marques; Nikos Vasilakis
Journal:  J Clin Virol       Date:  2015-01-17       Impact factor: 3.168

Review 2.  Identification of innate immune response endotypes in asthma: implications for personalized medicine.

Authors:  Allan R Brasier
Journal:  Curr Allergy Asthma Rep       Date:  2013-10       Impact factor: 4.806

Review 3.  Targeted proteomics for biomarker discovery and validation of hepatocellular carcinoma in hepatitis C infected patients.

Authors:  Gul M Mustafa; Denner Larry; John R Petersen; Cornelis J Elferink
Journal:  World J Hepatol       Date:  2015-06-08

4.  Development of a Multivariate Predictive Model to Estimate Ionized Calcium Concentration from Serum Biochemical Profile Results in Dogs.

Authors:  J Danner; M D Ridgway; S I Rubin; K Le Boedec
Journal:  J Vet Intern Med       Date:  2017-08-20       Impact factor: 3.333

5.  Establishment and evaluation of prediction model for multiple disease classification based on gut microbial data.

Authors:  Sohyun Bang; DongAhn Yoo; Soo-Jin Kim; Soyun Jhang; Seoae Cho; Heebal Kim
Journal:  Sci Rep       Date:  2019-07-15       Impact factor: 4.379

Review 6.  Cell-Based Chemical Safety Assessment and Therapeutic Discovery Using Array-Based Sensors.

Authors:  Mingdi Jiang; Aritra Nath Chattopadhyay; Vincent M Rotello
Journal:  Int J Mol Sci       Date:  2022-03-27       Impact factor: 5.923

7.  A machine learning-based approach to determine infection status in recipients of BBV152 (Covaxin) whole-virion inactivated SARS-CoV-2 vaccine for serological surveys.

Authors:  Prateek Singh; Rajat Ujjainiya; Satyartha Prakash; Salwa Naushin; Viren Sardana; Nitin Bhatheja; Ajay Pratap Singh; Joydeb Barman; Kartik Kumar; Saurabh Gayali; Raju Khan; Birendra Singh Rawat; Karthik Bharadwaj Tallapaka; Mahesh Anumalla; Amit Lahiri; Susanta Kar; Vivek Bhosale; Mrigank Srivastava; Madhav Nilakanth Mugale; C P Pandey; Shaziya Khan; Shivani Katiyar; Desh Raj; Sharmeen Ishteyaque; Sonu Khanka; Ankita Rani; Jyotsna Sharma; Anuradha Seth; Mukul Dutta; Nishant Saurabh; Murugan Veerapandian; Ganesh Venkatachalam; Deepak Bansal; Dinesh Gupta; Prakash M Halami; Muthukumar Serva Peddha; Ravindra P Veeranna; Anirban Pal; Ranvijay Kumar Singh; Suresh Kumar Anandasadagopan; Parimala Karuppanan; Syed Nasar Rahman; Gopika Selvakumar; Subramanian Venkatesan; Malay Kumar Karmakar; Harish Kumar Sardana; Anamika Kothari; Devendra Singh Parihar; Anupma Thakur; Anas Saifi; Naman Gupta; Yogita Singh; Ritu Reddu; Rizul Gautam; Anuj Mishra; Avinash Mishra; Iranna Gogeri; Geethavani Rayasam; Yogendra Padwad; Vikram Patial; Vipin Hallan; Damanpreet Singh; Narendra Tirpude; Partha Chakrabarti; Sujay Krishna Maity; Dipyaman Ganguly; Ramakrishna Sistla; Narender Kumar Balthu; Kiran Kumar A; Siva Ranjith; B Vijay Kumar; Piyush Singh Jamwal; Anshu Wali; Sajad Ahmed; Rekha Chouhan; Sumit G Gandhi; Nancy Sharma; Garima Rai; Faisal Irshad; Vijay Lakshmi Jamwal; Masroor Ahmad Paddar; Sameer Ullah Khan; Fayaz Malik; Debashish Ghosh; Ghanshyam Thakkar; S K Barik; Prabhanshu Tripathi; Yatendra Kumar Satija; Sneha Mohanty; Md Tauseef Khan; Umakanta Subudhi; Pradip Sen; Rashmi Kumar; Anshu Bhardwaj; Pawan Gupta; Deepak Sharma; Amit Tuli; Saumya Ray Chaudhuri; Srinivasan Krishnamurthi; L Prakash; Ch V Rao; B N Singh; Arvindkumar Chaurasiya; Meera Chaurasiyar; Mayuri Bhadange; Bhagyashree Likhitkar; Sharada Mohite; Yogita Patil; Mahesh Kulkarni; Rakesh Joshi; Vaibhav Pandya; Sachin Mahajan; Amita Patil; Rachel Samson; Tejas Vare; Mahesh Dharne; Ashok Giri; Sachin Mahajan; Shilpa Paranjape; G Narahari Sastry; Jatin Kalita; Tridip Phukan; Prasenjit Manna; Wahengbam Romi; Pankaj Bharali; Dibyajyoti Ozah; Ravi Kumar Sahu; Prachurjya Dutta; Moirangthem Goutam Singh; Gayatri Gogoi; Yasmin Begam Tapadar; Elapavalooru Vssk Babu; Rajeev K Sukumaran; Aishwarya R Nair; Anoop Puthiyamadam; Prajeesh Kooloth Valappil; Adrash Velayudhan Pillai Prasannakumari; Kalpana Chodankar; Samir Damare; Ved Varun Agrawal; Kumardeep Chaudhary; Anurag Agrawal; Shantanu Sengupta; Debasis Dash
Journal:  Comput Biol Med       Date:  2022-04-25       Impact factor: 6.698

8.  Improved Detection of Invasive Pulmonary Aspergillosis Arising during Leukemia Treatment Using a Panel of Host Response Proteins and Fungal Antigens.

Authors:  Allan R Brasier; Yingxin Zhao; Heidi M Spratt; John E Wiktorowicz; Hyunsu Ju; L Joseph Wheat; Lindsey Baden; Susan Stafford; Zheng Wu; Nicolas Issa; Angela M Caliendo; David W Denning; Kizhake Soman; Cornelius J Clancy; M Hong Nguyen; Michele W Sugrue; Barbara D Alexander; John R Wingard
Journal:  PLoS One       Date:  2015-11-18       Impact factor: 3.240

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.