Literature DB >> 15585126

Multiclass Decision Forest--a novel pattern recognition method for multiclass classification in microarray data analysis.

Huixiao Hong1, Weida Tong, Roger Perkins, Hong Fang, Qian Xie, Leming Shi.   

Abstract

The wealth of knowledge imbedded in gene expression data from DNA microarrays portends rapid advances in both research and clinic. Turning the prodigious and noisy data into knowledge is a challenge to the field of bioinformatics, and development of classifiers using supervised learning techniques is the primary methodological approach for clinical application using gene expression data. In this paper, we present a novel classification method, multiclass Decision Forest (DF), that is the direct extension of the two-class DF previously developed in our lab. Central to DF is the synergistic combining of multiple heterogenic but comparable decision trees to reach a more accurate and robust classification model. The computationally inexpensive multiclass DF algorithm integrates gene selection and model development, and thus eliminates the bias of gene preselection in crossvalidation. Importantly, the method provides several statistical means for assessment of prediction accuracy, prediction confidence, and diagnostic capability. We demonstrate the method by application to gene expression data for 83 small round blue-cell tumors (SRBCTs) samples belonging to one of four different classes. Based on 500 runs of 10-fold crossvalidation, tumor prediction accuracy was approximately 97%, sensitivity was approximately 95%, diagnostic sensitivity was approximately 91%, and diagnostic accuracy was approximately 99.5%. Among 25 genes selected to distinguish tumor class, 12 have functional information in the literature implicating their involvement in cancer. The four types of SRBCTs samples are also distinguishable in a clustering analysis based on the expression profiles of these 25 genes. The results demonstrated that the multiclass DF is an effective classification method for analysis of gene expression data for the purpose of molecular diagnostics.

Entities:  

Mesh:

Year:  2004        PMID: 15585126     DOI: 10.1089/dna.2004.23.685

Source DB:  PubMed          Journal:  DNA Cell Biol        ISSN: 1044-5498            Impact factor:   3.311


  15 in total

1.  CATMoS: Collaborative Acute Toxicity Modeling Suite.

Authors:  Kamel Mansouri; Agnes L Karmaus; Jeremy Fitzpatrick; Grace Patlewicz; Prachi Pradeep; Domenico Alberga; Nathalie Alepee; Timothy E H Allen; Dave Allen; Vinicius M Alves; Carolina H Andrade; Tyler R Auernhammer; Davide Ballabio; Shannon Bell; Emilio Benfenati; Sudin Bhattacharya; Joyce V Bastos; Stephen Boyd; J B Brown; Stephen J Capuzzi; Yaroslav Chushak; Heather Ciallella; Alex M Clark; Viviana Consonni; Pankaj R Daga; Sean Ekins; Sherif Farag; Maxim Fedorov; Denis Fourches; Domenico Gadaleta; Feng Gao; Jeffery M Gearhart; Garett Goh; Jonathan M Goodman; Francesca Grisoni; Christopher M Grulke; Thomas Hartung; Matthew Hirn; Pavel Karpov; Alexandru Korotcov; Giovanna J Lavado; Michael Lawless; Xinhao Li; Thomas Luechtefeld; Filippo Lunghini; Giuseppe F Mangiatordi; Gilles Marcou; Dan Marsh; Todd Martin; Andrea Mauri; Eugene N Muratov; Glenn J Myatt; Dac-Trung Nguyen; Orazio Nicolotti; Reine Note; Paritosh Pande; Amanda K Parks; Tyler Peryea; Ahsan H Polash; Robert Rallo; Alessandra Roncaglioni; Craig Rowlands; Patricia Ruiz; Daniel P Russo; Ahmed Sayed; Risa Sayre; Timothy Sheils; Charles Siegel; Arthur C Silva; Anton Simeonov; Sergey Sosnin; Noel Southall; Judy Strickland; Yun Tang; Brian Teppen; Igor V Tetko; Dennis Thomas; Valery Tkachenko; Roberto Todeschini; Cosimo Toma; Ignacio Tripodi; Daniela Trisciuzzi; Alexander Tropsha; Alexandre Varnek; Kristijan Vukovic; Zhongyu Wang; Liguo Wang; Katrina M Waters; Andrew J Wedlake; Sanjeeva J Wijeyesakere; Dan Wilson; Zijun Xiao; Hongbin Yang; Gergely Zahoranszky-Kohalmi; Alexey V Zakharov; Fagen F Zhang; Zhen Zhang; Tongan Zhao; Hao Zhu; Kimberley M Zorn; Warren Casey; Nicole C Kleinstreuer
Journal:  Environ Health Perspect       Date:  2021-04-30       Impact factor: 9.031

2.  Machine Learning Models for Predicting Liver Toxicity.

Authors:  Jie Liu; Wenjing Guo; Sugunadevi Sakkiah; Zuowei Ji; Gokhan Yavas; Wen Zou; Minjun Chen; Weida Tong; Tucker A Patterson; Huixiao Hong
Journal:  Methods Mol Biol       Date:  2022

3.  Decision forest analysis of 61 single nucleotide polymorphisms in a case-control study of esophageal cancer; a novel method.

Authors:  Qian Xie; Luke D Ratnasinghe; Huixiao Hong; Roger Perkins; Ze-Zhong Tang; Nan Hu; Philip R Taylor; Weida Tong
Journal:  BMC Bioinformatics       Date:  2005-07-15       Impact factor: 3.169

4.  sNebula, a network-based algorithm to predict binding between human leukocyte antigens and peptides.

Authors:  Heng Luo; Hao Ye; Hui Wen Ng; Sugunadevi Sakkiah; Donna L Mendrick; Huixiao Hong
Journal:  Sci Rep       Date:  2016-08-25       Impact factor: 4.379

5.  CERAPP: Collaborative Estrogen Receptor Activity Prediction Project.

Authors:  Kamel Mansouri; Ahmed Abdelaziz; Aleksandra Rybacka; Alessandra Roncaglioni; Alexander Tropsha; Alexandre Varnek; Alexey Zakharov; Andrew Worth; Ann M Richard; Christopher M Grulke; Daniela Trisciuzzi; Denis Fourches; Dragos Horvath; Emilio Benfenati; Eugene Muratov; Eva Bay Wedebye; Francesca Grisoni; Giuseppe F Mangiatordi; Giuseppina M Incisivo; Huixiao Hong; Hui W Ng; Igor V Tetko; Ilya Balabin; Jayaram Kancherla; Jie Shen; Julien Burton; Marc Nicklaus; Matteo Cassotti; Nikolai G Nikolov; Orazio Nicolotti; Patrik L Andersson; Qingda Zang; Regina Politi; Richard D Beger; Roberto Todeschini; Ruili Huang; Sherif Farag; Sine A Rosenberg; Svetoslav Slavov; Xin Hu; Richard S Judson
Journal:  Environ Health Perspect       Date:  2016-02-23       Impact factor: 9.031

6.  Development of Decision Forest Models for Prediction of Drug-Induced Liver Injury in Humans Using A Large Set of FDA-approved Drugs.

Authors:  Huixiao Hong; Shraddha Thakkar; Minjun Chen; Weida Tong
Journal:  Sci Rep       Date:  2017-12-11       Impact factor: 4.379

7.  Discovery of dominant and dormant genes from expression data using a novel generalization of SNR for multi-class problems.

Authors:  Yu-Shuen Tsai; Chin-Teng Lin; George C Tseng; I-Fang Chung; Nikhil Ranjan Pal
Journal:  BMC Bioinformatics       Date:  2008-10-09       Impact factor: 3.169

8.  A Rat α-Fetoprotein Binding Activity Prediction Model to Facilitate Assessment of the Endocrine Disruption Potential of Environmental Chemicals.

Authors:  Huixiao Hong; Jie Shen; Hui Wen Ng; Sugunadevi Sakkiah; Hao Ye; Weigong Ge; Ping Gong; Wenming Xiao; Weida Tong
Journal:  Int J Environ Res Public Health       Date:  2016-03-25       Impact factor: 3.390

9.  Experimental Data Extraction and in Silico Prediction of the Estrogenic Activity of Renewable Replacements for Bisphenol A.

Authors:  Huixiao Hong; Benjamin G Harvey; Giuseppe R Palmese; Joseph F Stanzione; Hui Wen Ng; Sugunadevi Sakkiah; Weida Tong; Joshua M Sadler
Journal:  Int J Environ Res Public Health       Date:  2016-07-12       Impact factor: 3.390

10.  Consensus Modeling for Prediction of Estrogenic Activity of Ingredients Commonly Used in Sunscreen Products.

Authors:  Huixiao Hong; Diego Rua; Sugunadevi Sakkiah; Chandrabose Selvaraj; Weigong Ge; Weida Tong
Journal:  Int J Environ Res Public Health       Date:  2016-09-29       Impact factor: 3.390

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.