Literature DB >> 20309756

Optimized ranking and selection methods for feature selection with application in microarray experiments.

Xinping Cui1, Haibing Zhao, Jason Wilson.   

Abstract

In microarray experiments, the goal is often to examine many genes, and select some of them for additional investigation. Traditionally, such a selection problem has been formulated as a multiple testing problem. When the genes of interest are genes with unequal distribution of gene expression under different conditions, multiple testing methods provide an appropriate framework for addressing the selection problems. However, when the genes of interest are a set of genes with the largest difference in gene expression under different conditions, multiple testing methods do not directly address the selection goal and sometimes lead to biased conclusions. For such cases, we propose two methods based on the statistical ranking and selection framework to directly address the selection goal. The proposed methods have an inherent optimization nature in that the selection is optimized according to either a prespecified minimum correct selection ratio (r* selection) or probability of making a correct selection (P* selection). These methods are compared with the multiple testing method that controls the tail probability of the proportion of false positives. Both simulation studies and real data applications provide insight into the fundamental difference between the multiple testing methods and the proposed methods in the way of addressing different selection goals. It has been shown that the proposed methods provide a clear advantage over the multiple testing methods when the goal is to select the most significant genes (not all the significant genes). When the goal is to select all the significant genes, the proposed methods perform equally well as the current multiple testing methods. Another advantage provided by the proposed methods is their ability to detect noisy data and therefore suggest no sensible selection can be made.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20309756      PMCID: PMC2909494          DOI: 10.1080/10543400903572720

Source DB:  PubMed          Journal:  J Biopharm Stat        ISSN: 1054-3406            Impact factor:   1.051


  24 in total

1.  Selecting differentially expressed genes from microarray experiments.

Authors:  Margaret Sullivan Pepe; Gary Longton; Garnet L Anderson; Michel Schummer
Journal:  Biometrics       Date:  2003-03       Impact factor: 2.571

2.  Degrees of differential gene expression: detecting biologically significant expression differences and estimating their magnitudes.

Authors:  David R Bickel
Journal:  Bioinformatics       Date:  2004-01-22       Impact factor: 6.937

Review 3.  Microarray data analysis: from disarray to consolidation and consensus.

Authors:  David B Allison; Xiangqin Cui; Grier P Page; Mahyar Sabripour
Journal:  Nat Rev Genet       Date:  2006-01       Impact factor: 53.242

4.  Multiple testing. Part II. Step-down procedures for control of the family-wise error rate.

Authors:  Mark J van der Laan; Sandrine Dudoit; Katherine S Pollard
Journal:  Stat Appl Genet Mol Biol       Date:  2004-06-14

5.  Multiple testing. Part I. Single-step procedures for control of general type I error rates.

Authors:  Sandrine Dudoit; Mark J van der Laan; Katherine S Pollard
Journal:  Stat Appl Genet Mol Biol       Date:  2004-06-09

6.  The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements.

Authors:  Leming Shi; Laura H Reid; Wendell D Jones; Richard Shippy; Janet A Warrington; Shawn C Baker; Patrick J Collins; Francoise de Longueville; Ernest S Kawasaki; Kathleen Y Lee; Yuling Luo; Yongming Andrew Sun; James C Willey; Robert A Setterquist; Gavin M Fischer; Weida Tong; Yvonne P Dragan; David J Dix; Felix W Frueh; Frederico M Goodsaid; Damir Herman; Roderick V Jensen; Charles D Johnson; Edward K Lobenhofer; Raj K Puri; Uwe Schrf; Jean Thierry-Mieg; Charles Wang; Mike Wilson; Paul K Wolber; Lu Zhang; Shashi Amur; Wenjun Bao; Catalin C Barbacioru; Anne Bergstrom Lucas; Vincent Bertholet; Cecilie Boysen; Bud Bromley; Donna Brown; Alan Brunner; Roger Canales; Xiaoxi Megan Cao; Thomas A Cebula; James J Chen; Jing Cheng; Tzu-Ming Chu; Eugene Chudin; John Corson; J Christopher Corton; Lisa J Croner; Christopher Davies; Timothy S Davison; Glenda Delenstarr; Xutao Deng; David Dorris; Aron C Eklund; Xiao-hui Fan; Hong Fang; Stephanie Fulmer-Smentek; James C Fuscoe; Kathryn Gallagher; Weigong Ge; Lei Guo; Xu Guo; Janet Hager; Paul K Haje; Jing Han; Tao Han; Heather C Harbottle; Stephen C Harris; Eli Hatchwell; Craig A Hauser; Susan Hester; Huixiao Hong; Patrick Hurban; Scott A Jackson; Hanlee Ji; Charles R Knight; Winston P Kuo; J Eugene LeClerc; Shawn Levy; Quan-Zhen Li; Chunmei Liu; Ying Liu; Michael J Lombardi; Yunqing Ma; Scott R Magnuson; Botoul Maqsodi; Tim McDaniel; Nan Mei; Ola Myklebost; Baitang Ning; Natalia Novoradovskaya; Michael S Orr; Terry W Osborn; Adam Papallo; Tucker A Patterson; Roger G Perkins; Elizabeth H Peters; Ron Peterson; Kenneth L Philips; P Scott Pine; Lajos Pusztai; Feng Qian; Hongzu Ren; Mitch Rosen; Barry A Rosenzweig; Raymond R Samaha; Mark Schena; Gary P Schroth; Svetlana Shchegrova; Dave D Smith; Frank Staedtler; Zhenqiang Su; Hongmei Sun; Zoltan Szallasi; Zivana Tezak; Danielle Thierry-Mieg; Karol L Thompson; Irina Tikhonova; Yaron Turpaz; Beena Vallanat; Christophe Van; Stephen J Walker; Sue Jane Wang; Yonghong Wang; Russ Wolfinger; Alex Wong; Jie Wu; Chunlin Xiao; Qian Xie; Jun Xu; Wen Yang; Liang Zhang; Sheng Zhong; Yaping Zong; William Slikker
Journal:  Nat Biotechnol       Date:  2006-09       Impact factor: 54.908

7.  Significance of gene ranking for classification of microarray samples.

Authors:  Chaolin Zhang; Xuesong Lu; Xuegong Zhang
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2006 Jul-Sep       Impact factor: 3.710

8.  Sample size calculations based on ranking and selection in microarray experiments.

Authors:  Shigeyuki Matsui; Shu Zeng; Takeharu Yamanaka; John Shaughnessy
Journal:  Biometrics       Date:  2007-08-03       Impact factor: 2.571

Review 9.  A review of feature selection techniques in bioinformatics.

Authors:  Yvan Saeys; Iñaki Inza; Pedro Larrañaga
Journal:  Bioinformatics       Date:  2007-08-24       Impact factor: 6.937

10.  Gene expression profiling predicts clinical outcome of breast cancer.

Authors:  Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal:  Nature       Date:  2002-01-31       Impact factor: 49.962

View more
  1 in total

1.  Reducing the complexity of complex gene coexpression networks by coupling multiweighted labeling with topological analysis.

Authors:  Alfredo Benso; Paolo Cornale; Stefano Di Carlo; Gianfranco Politano; Alessandro Savino
Journal:  Biomed Res Int       Date:  2013-10-07       Impact factor: 3.411

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.