Literature DB >> 29930799

Weighted mining of massive collections of [Formula: see text]-values by convex optimization.

Edgar Dobriban1.   

Abstract

Researchers in data-rich disciplines-think of computational genomics and observational cosmology-often wish to mine large bodies of [Formula: see text]-values looking for significant effects, while controlling the false discovery rate or family-wise error rate. Increasingly, researchers also wish to prioritize certain hypotheses, for example, those thought to have larger effect sizes, by upweighting, and to impose constraints on the underlying mining, such as monotonicity along a certain sequence. We introduce Princessp, a principled method for performing weighted multiple testing by constrained convex optimization. Our method elegantly allows one to prioritize certain hypotheses through upweighting and to discount others through downweighting, while constraining the underlying weights involved in the mining process. When the [Formula: see text]-values derive from monotone likelihood ratio families such as the Gaussian means model, the new method allows exact solution of an important optimal weighting problem previously thought to be non-convex and computationally infeasible. Our method scales to massive data set sizes. We illustrate the applications of Princessp on a series of standard genomics data sets and offer comparisons with several previous 'standard' methods. Princessp offers both ease of operation and the ability to scale to extremely large problem sizes. The method is available as open-source software from github.com/dobriban/pvalue_weighting_matlab (accessed 11 October 2017).

Entities:  

Keywords:  P-value weighting; genome-wide association studies; large-scale inference; mining of P-values; multiple testing

Year:  2017        PMID: 29930799      PMCID: PMC5998655          DOI: 10.1093/imaiai/iax013

Source DB:  PubMed          Journal:  Inf inference        ISSN: 2049-8764


  30 in total

1.  Weighting sequence variants based on their annotation increases power of whole-genome association studies.

Authors:  Gardar Sveinbjornsson; Anders Albrechtsen; Florian Zink; Sigurjón A Gudjonsson; Asmundur Oddson; Gísli Másson; Hilma Holm; Augustine Kong; Unnur Thorsteinsdottir; Patrick Sulem; Daniel F Gudbjartsson; Kari Stefansson
Journal:  Nat Genet       Date:  2016-02-08       Impact factor: 38.330

2.  Stratified false discovery control for large-scale hypothesis testing with application to genome-wide association studies.

Authors:  Lei Sun; Radu V Craiu; Andrew D Paterson; Shelley B Bull
Journal:  Genet Epidemiol       Date:  2006-09       Impact factor: 2.135

3.  Increasing power in association studies by using linkage disequilibrium structure and molecular function as prior information.

Authors:  Eleazar Eskin
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

4.  An estimate of the science-wise false discovery rate and application to the top medical literature.

Authors:  Leah R Jager; Jeffrey T Leek
Journal:  Biostatistics       Date:  2013-09-25       Impact factor: 5.899

5.  Systematic meta-analyses and field synopsis of genetic association studies in schizophrenia: the SzGene database.

Authors:  Nicole C Allen; Sachin Bagade; Matthew B McQueen; John P A Ioannidis; Fotini K Kavvoura; Muin J Khoury; Rudolph E Tanzi; Lars Bertram
Journal:  Nat Genet       Date:  2008-07       Impact factor: 38.330

6.  Using prior information to allocate significance levels for multiple endpoints.

Authors:  P H Westfall; A Krishen; S S Young
Journal:  Stat Med       Date:  1998-09-30       Impact factor: 2.373

7.  Genome-wide association study identifies five new schizophrenia loci.

Authors: 
Journal:  Nat Genet       Date:  2011-09-18       Impact factor: 38.330

8.  Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease.

Authors:  Heribert Schunkert; Inke R König; Sekar Kathiresan; Muredach P Reilly; Themistocles L Assimes; Hilma Holm; Michael Preuss; Alexandre F R Stewart; Maja Barbalic; Christian Gieger; Devin Absher; Zouhair Aherrahrou; Hooman Allayee; David Altshuler; Sonia S Anand; Karl Andersen; Jeffrey L Anderson; Diego Ardissino; Stephen G Ball; Anthony J Balmforth; Timothy A Barnes; Diane M Becker; Lewis C Becker; Klaus Berger; Joshua C Bis; S Matthijs Boekholdt; Eric Boerwinkle; Peter S Braund; Morris J Brown; Mary Susan Burnett; Ian Buysschaert; John F Carlquist; Li Chen; Sven Cichon; Veryan Codd; Robert W Davies; George Dedoussis; Abbas Dehghan; Serkalem Demissie; Joseph M Devaney; Patrick Diemert; Ron Do; Angela Doering; Sandra Eifert; Nour Eddine El Mokhtari; Stephen G Ellis; Roberto Elosua; James C Engert; Stephen E Epstein; Ulf de Faire; Marcus Fischer; Aaron R Folsom; Jennifer Freyer; Bruna Gigante; Domenico Girelli; Solveig Gretarsdottir; Vilmundur Gudnason; Jeffrey R Gulcher; Eran Halperin; Naomi Hammond; Stanley L Hazen; Albert Hofman; Benjamin D Horne; Thomas Illig; Carlos Iribarren; Gregory T Jones; J Wouter Jukema; Michael A Kaiser; Lee M Kaplan; John J P Kastelein; Kay-Tee Khaw; Joshua W Knowles; Genovefa Kolovou; Augustine Kong; Reijo Laaksonen; Diether Lambrechts; Karin Leander; Guillaume Lettre; Mingyao Li; Wolfgang Lieb; Christina Loley; Andrew J Lotery; Pier M Mannucci; Seraya Maouche; Nicola Martinelli; Pascal P McKeown; Christa Meisinger; Thomas Meitinger; Olle Melander; Pier Angelica Merlini; Vincent Mooser; Thomas Morgan; Thomas W Mühleisen; Joseph B Muhlestein; Thomas Münzel; Kiran Musunuru; Janja Nahrstaedt; Christopher P Nelson; Markus M Nöthen; Oliviero Olivieri; Riyaz S Patel; Chris C Patterson; Annette Peters; Flora Peyvandi; Liming Qu; Arshed A Quyyumi; Daniel J Rader; Loukianos S Rallidis; Catherine Rice; Frits R Rosendaal; Diana Rubin; Veikko Salomaa; M Lourdes Sampietro; Manj S Sandhu; Eric Schadt; Arne Schäfer; Arne Schillert; Stefan Schreiber; Jürgen Schrezenmeir; Stephen M Schwartz; David S Siscovick; Mohan Sivananthan; Suthesh Sivapalaratnam; Albert Smith; Tamara B Smith; Jaapjan D Snoep; Nicole Soranzo; John A Spertus; Klaus Stark; Kathy Stirrups; Monika Stoll; W H Wilson Tang; Stephanie Tennstedt; Gudmundur Thorgeirsson; Gudmar Thorleifsson; Maciej Tomaszewski; Andre G Uitterlinden; Andre M van Rij; Benjamin F Voight; Nick J Wareham; George A Wells; H-Erich Wichmann; Philipp S Wild; Christina Willenborg; Jaqueline C M Witteman; Benjamin J Wright; Shu Ye; Tanja Zeller; Andreas Ziegler; Francois Cambien; Alison H Goodall; L Adrienne Cupples; Thomas Quertermous; Winfried März; Christian Hengstenberg; Stefan Blankenberg; Willem H Ouwehand; Alistair S Hall; Panos Deloukas; John R Thompson; Kari Stefansson; Robert Roberts; Unnur Thorsteinsdottir; Christopher J O'Donnell; Ruth McPherson; Jeanette Erdmann; Nilesh J Samani
Journal:  Nat Genet       Date:  2011-03-06       Impact factor: 38.330

9.  Improved detection of common variants associated with schizophrenia by leveraging pleiotropy with cardiovascular-disease risk factors.

Authors:  Ole A Andreassen; Srdjan Djurovic; Wesley K Thompson; Andrew J Schork; Kenneth S Kendler; Michael C O'Donovan; Dan Rujescu; Thomas Werge; Martijn van de Bunt; Andrew P Morris; Mark I McCarthy; J Cooper Roddey; Linda K McEvoy; Rahul S Desikan; Anders M Dale
Journal:  Am J Hum Genet       Date:  2013-01-31       Impact factor: 11.025

10.  Data-driven hypothesis weighting increases detection power in genome-scale multiple testing.

Authors:  Nikolaos Ignatiadis; Bernd Klaus; Judith B Zaugg; Wolfgang Huber
Journal:  Nat Methods       Date:  2016-05-30       Impact factor: 28.547

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.