Literature DB >> 16968808

Epistemological issues in omics and high-dimensional biology: give the people what they want.

Tapan S Mehta1, Stanislav O Zakharkin, Gary L Gadbury, David B Allison.   

Abstract

Gene expression microarrays have been the vanguard of new analytic approaches in high-dimensional biology. Draft sequences of several genomes coupled with new technologies allow study of the influences and responses of entire genomes rather than isolated genes. This has opened a new realm of highly dimensional biology where questions involve multiplicity at unprecedented scales: thousands of genetic polymorphisms, gene expression levels, protein measurements, genetic sequences, or any combination of these and their interactions. Such situations demand creative approaches to the processes of inference, estimation, prediction, classification, and study design. Although bench scientists intuitively grasp the need for flexibility in the inferential process, the elaboration of formal supporting statistical frameworks is just at the very start. Here, we will discuss some of the unique statistical challenges facing investigators studying high-dimensional biology, describe some approaches being developed by statistical scientists, and offer an epistemological framework for the validation of proffered statistical procedures. A key theme will be the challenge in providing methods that a statistician judges to be sound and a biologist finds informative. The shift from family-wise error rate control to false discovery rate estimation and to assessment of ranking and other forms of stability will be portrayed as illustrative of approaches to this challenge.

Entities:  

Mesh:

Year:  2006        PMID: 16968808     DOI: 10.1152/physiolgenomics.00095.2006

Source DB:  PubMed          Journal:  Physiol Genomics        ISSN: 1094-8341            Impact factor:   3.107


  11 in total

1.  Real data examples in statistical methods papers: Tremendously valuable, and also tremendously misvalued.

Authors:  K Y Williams; Yun Joo Yoo; Amit Patki; David B Allison
Journal:  Stat Interface       Date:  2011       Impact factor: 0.582

2.  The use of plasmodes as a supplement to simulations: A simple example evaluating individual admixture estimation methodologies.

Authors:  Laura K Vaughan; Jasmin Divers; Miguel Padilla; David T Redden; Hemant K Tiwari; Daniel Pomp; David B Allison
Journal:  Comput Stat Data Anal       Date:  2009-03-15       Impact factor: 1.681

3.  Interdependence of signal processing and analysis of urine 1H NMR spectra for metabolic profiling.

Authors:  Shucha Zhang; Cheng Zheng; Ian R Lanza; K Sreekumaran Nair; Daniel Raftery; Olga Vitek
Journal:  Anal Chem       Date:  2009-08-01       Impact factor: 6.986

4.  The molecular signature of impaired diabetic wound healing identifies serpinB3 as a healing biomarker.

Authors:  Gian Paolo Fadini; Mattia Albiero; Renato Millioni; Nicol Poncina; Mauro Rigato; Rachele Scotton; Federico Boscari; Enrico Brocco; Giorgio Arrigoni; Gianmarco Villano; Cristian Turato; Alessandra Biasiolo; Patrizia Pontisso; Angelo Avogaro
Journal:  Diabetologia       Date:  2014-06-25       Impact factor: 10.122

Review 5.  Common scientific and statistical errors in obesity research.

Authors:  Brandon J George; T Mark Beasley; Andrew W Brown; John Dawson; Rositsa Dimova; Jasmin Divers; TaShauna U Goldsby; Moonseong Heo; Kathryn A Kaiser; Scott W Keith; Mimi Y Kim; Peng Li; Tapan Mehta; J Michael Oakes; Asheley Skinner; Elizabeth Stuart; David B Allison
Journal:  Obesity (Silver Spring)       Date:  2016-04       Impact factor: 5.002

Review 6.  Challenges and approaches to statistical design and inference in high-dimensional investigations.

Authors:  Gary L Gadbury; Karen A Garrett; David B Allison
Journal:  Methods Mol Biol       Date:  2009

7.  Assessing Dissimilarity Measures for Sample-Based Hierarchical Clustering of RNA Sequencing Data Using Plasmode Datasets.

Authors:  Pablo D Reeb; Sergio J Bramardi; Juan P Steibel
Journal:  PLoS One       Date:  2015-07-10       Impact factor: 3.240

8.  A new multitest correction (SGoF) that increases its statistical power when increasing the number of tests.

Authors:  Antonio Carvajal-Rodríguez; Jacobo de Uña-Alvarez; Emilio Rolán-Alvarez
Journal:  BMC Bioinformatics       Date:  2009-07-08       Impact factor: 3.169

9.  Evaluating statistical methods using plasmode data sets in the age of massive public databases: an illustration using false discovery rates.

Authors:  Gary L Gadbury; Qinfang Xiang; Lin Yang; Stephen Barnes; Grier P Page; David B Allison
Journal:  PLoS Genet       Date:  2008-06-20       Impact factor: 5.917

10.  Evaluating statistical analysis models for RNA sequencing experiments.

Authors:  Pablo D Reeb; Juan P Steibel
Journal:  Front Genet       Date:  2013-09-17       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.