Literature DB >> 12217910

Determination of minimum sample size and discriminatory expression patterns in microarray data.

Daehee Hwang1, William A Schmitt, George Stephanopoulos, Gregory Stephanopoulos.   

Abstract

MOTIVATION: Transcriptional profiling using microarrays can reveal important information about cellular and tissue expression phenotypes, but these measurements are costly and time consuming. Additionally, tissue sample availability poses further constraints on the number of arrays that can be analyzed in connection with a particular disease or state of interest. It is therefore important to provide a method for the determination of the minimum number of microarrays required to separate, with statistical reliability, distinct disease states or other physiological differences.
RESULTS: Power analysis was applied to estimate the minimum sample size required for two-class and multi-class discrimination. The power analysis algorithm calculates the appropriate sample size for discrimination of phenotypic subtypes in a reduced dimensional space obtained by Fisher discriminant analysis (FDA). This approach was tested by applying the algorithm to existing data sets for estimation of the minimum sample size required for drawing certain conclusions on multi-class distinction with statistical reliability. It was confirmed that when the minimum number of samples estimated from power analysis is used, group means in the FDA discrimination space are statistically different. CONTACT: gregstep@mit.edu

Entities:  

Mesh:

Year:  2002        PMID: 12217910     DOI: 10.1093/bioinformatics/18.9.1184

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  28 in total

1.  A dynamic, web-accessible resource to process raw microarray scan data into consolidated gene expression values: importance of replication.

Authors:  Nolwenn Le Meur; Guillaume Lamirault; Audrey Bihouée; Marja Steenman; Hélène Bédrine-Ferran; Raluca Teusan; Gérard Ramstein; Jean J Léger
Journal:  Nucleic Acids Res       Date:  2004-10-08       Impact factor: 16.971

2.  Using RNA sample titrations to assess microarray platform performance and normalization techniques.

Authors:  Richard Shippy; Stephanie Fulmer-Smentek; Roderick V Jensen; Wendell D Jones; Paul K Wolber; Charles D Johnson; P Scott Pine; Cecilie Boysen; Xu Guo; Eugene Chudin; Yongming Andrew Sun; James C Willey; Jean Thierry-Mieg; Danielle Thierry-Mieg; Robert A Setterquist; Mike Wilson; Anne Bergstrom Lucas; Natalia Novoradovskaya; Adam Papallo; Yaron Turpaz; Shawn C Baker; Janet A Warrington; Leming Shi; Damir Herman
Journal:  Nat Biotechnol       Date:  2006-09       Impact factor: 54.908

3.  Analysis of time-series gene expression data: methods, challenges, and opportunities.

Authors:  I P Androulakis; E Yang; R R Almon
Journal:  Annu Rev Biomed Eng       Date:  2007       Impact factor: 9.590

4.  Study design in high-dimensional classification analysis.

Authors:  Brisa N Sánchez; Meihua Wu; Peter X K Song; Wen Wang
Journal:  Biostatistics       Date:  2016-05-05       Impact factor: 5.899

5.  Machine learning-based receiver operating characteristic (ROC) curves for crisp and fuzzy classification of DNA microarrays in cancer research.

Authors:  Leif E Peterson; Matthew A Coleman
Journal:  Int J Approx Reason       Date:  2008-01       Impact factor: 3.816

6.  A functional and regulatory network associated with PIP expression in human breast cancer.

Authors:  Marie-Anne Debily; Sandrine El Marhomy; Virginie Boulanger; Eric Eveno; Régine Mariage-Samson; Alessandra Camarca; Charles Auffray; Dominique Piatier-Tonneau; Sandrine Imbeaud
Journal:  PLoS One       Date:  2009-03-05       Impact factor: 3.240

7.  Effective feature selection framework for cluster analysis of microarray data.

Authors:  Gouchol Pok; Jyh-Charn Steve Liu; Keun Ho Ryu
Journal:  Bioinformation       Date:  2010-02-28

8.  Sample size and statistical power considerations in high-dimensionality data settings: a comparative study of classification algorithms.

Authors:  Yu Guo; Armin Graber; Robert N McBurney; Raji Balasubramanian
Journal:  BMC Bioinformatics       Date:  2010-09-03       Impact factor: 3.169

9.  Mixture-model based estimation of gene expression variance from public database improves identification of differentially expressed genes in small sized microarray data.

Authors:  Mingoo Kim; Sung Bum Cho; Ju Han Kim
Journal:  Bioinformatics       Date:  2009-12-16       Impact factor: 6.937

10.  A simulation-approximation approach to sample size planning for high-dimensional classification studies.

Authors:  Perry de Valpine; Hans-Marcus Bitter; Michael P S Brown; Jonathan Heller
Journal:  Biostatistics       Date:  2009-02-21       Impact factor: 5.899

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.