Rediscovery rate estimation for assessing the validation of significant findings in high-throughput studies.

Andrea Ganna, Donghwan Lee, Erik Ingelsson, Yudi Pawitan.   

Abstract

It is common and recommended practice in biomedical research to validate experimental or observational findings in a population different from the one in which the findings were initially assessed. This practice increases the generalizability of the results and decreases the likelihood of reporting false-positive findings. Validation becomes critical when dealing with high-throughput experiments, where the large number of tests increases the chance of observing false-positive results. In this article, we review common approaches to determine statistical thresholds for validation and describe the factors influencing the proportion of significant findings from a 'training' sample that are replicated in a 'validation' sample. We refer to this proportion as the rediscovery rate (RDR). In high-throughput studies, the RDR is a function of the false-positive rate and the power in both the training and validation samples. We illustrate the application of the RDR using simulated data and real data examples from metabolomics experiments. We further describe an online tool to calculate the RDR using t-statistics. We foresee two main applications. First, if the validation data have not yet been collected, the RDR can be used to decide the optimal combination of the proportion of findings taken to validation and the size of the validation study. Second, if a validation study has already been done, the RDR estimated using the training data can be compared with the observed RDR from the validation data; hence, the success of the validation study can be assessed.
© The Author 2014. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
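
The abstract states that the RDR depends on the false-positive rate and the power in both the training and validation samples. The sketch below illustrates one way such a dependence can arise; it is not taken from the article. It assumes a simple two-group mixture model in which a fraction pi0 of tested features are true nulls, the training and validation samples are independent, and each feature is tested at a fixed per-test significance level in each sample. The function name, the parameter names, and the numbers in the usage example are hypothetical.

```python
def rediscovery_rate(pi0, alpha_train, power_train, alpha_valid, power_valid):
    """Expected share of training-significant features that are also
    significant in validation, under an assumed two-group mixture model
    with independent training and validation samples (illustrative only).

    pi0          -- assumed proportion of true null features
    alpha_train  -- per-test false-positive rate in the training sample
    power_train  -- per-test power in the training sample
    alpha_valid  -- per-test false-positive rate in the validation sample
    power_valid  -- per-test power in the validation sample
    """
    for name, value in [("pi0", pi0), ("alpha_train", alpha_train),
                        ("power_train", power_train),
                        ("alpha_valid", alpha_valid),
                        ("power_valid", power_valid)]:
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must lie in [0, 1], got {value}")

    # Probability that a feature is significant in training, split by
    # whether it is a true null or a true signal.
    sig_null = pi0 * alpha_train
    sig_signal = (1.0 - pi0) * power_train
    sig_total = sig_null + sig_signal
    if sig_total == 0.0:
        raise ValueError("no feature is expected to be significant in training")

    # Among training-significant features, nulls are rediscovered with
    # probability alpha_valid and true signals with probability power_valid.
    return (sig_null * alpha_valid + sig_signal * power_valid) / sig_total


if __name__ == "__main__":
    # Hypothetical setting: 90% nulls, 5% level and 80% power in both samples.
    print(round(rediscovery_rate(0.9, 0.05, 0.8, 0.05, 0.8), 4))
```

Under these assumptions, the expected RDR is a weighted average of the validation false-positive rate and the validation power, weighted by the proportion of false and true discoveries in the training sample. Raising the training threshold or the validation sample size therefore shifts the RDR through different terms.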

Keywords:  false discovery rate; metabolomics; multiple testing; rediscovery rate; statistical validation

Year: 2014   PMID: 25256289   DOI: 10.1093/bib/bbu033

Source DB: PubMed   Journal: Brief Bioinform   ISSN: 1467-5463   Impact factor: 11.622

