Literature DB >> 31761295

Making the Most of Clumping and Thresholding for Polygenic Scores.

Florian Privé1, Bjarni J Vilhjálmsson2, Hugues Aschard3, Michael G B Blum4.   

Abstract

Polygenic prediction has the potential to contribute to precision medicine. Clumping and thresholding (C+T) is a widely used method to derive polygenic scores. When using C+T, several p value thresholds are tested to maximize predictive ability of the derived polygenic scores. Along with this p value threshold, we propose to tune three other hyper-parameters for C+T. We implement an efficient way to derive thousands of different C+T scores corresponding to a grid over four hyper-parameters. For example, it takes a few hours to derive 123K different C+T scores for 300K individuals and 1M variants using 16 physical cores. We find that optimizing over these four hyper-parameters improves the predictive performance of C+T in both simulations and real data applications as compared to tuning only the p value threshold. A particularly large increase can be noted when predicting depression status, from an AUC of 0.557 (95% CI: [0.544-0.569]) when tuning only the p value threshold to an AUC of 0.592 (95% CI: [0.580-0.604]) when tuning all four hyper-parameters we propose for C+T. We further propose stacked clumping and thresholding (SCT), a polygenic score that results from stacking all derived C+T scores. Instead of choosing one set of hyper-parameters that maximizes prediction in some training set, SCT learns an optimal linear combination of all C+T scores by using an efficient penalized regression. We apply SCT to eight different case-control diseases in the UK biobank data and find that SCT substantially improves prediction accuracy with an average AUC increase of 0.035 over standard C+T.
Copyright © 2019 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  C+T; PRS; UK Biobank; clumping and thresholding; complex traits; polygenic risk scores; stacking

Mesh:

Year:  2019        PMID: 31761295      PMCID: PMC6904799          DOI: 10.1016/j.ajhg.2019.11.001

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  37 in total

Review 1.  Linkage disequilibrium in humans: models and data.

Authors:  J K Pritchard; M Przeworski
Journal:  Am J Hum Genet       Date:  2001-06-14       Impact factor: 11.025

2.  PRSice-2: Polygenic Risk Score software for biobank-scale data.

Authors:  Shing Wan Choi; Paul F O'Reilly
Journal:  Gigascience       Date:  2019-07-01       Impact factor: 6.524

Review 3.  Developing and evaluating polygenic risk prediction models for stratified disease prevention.

Authors:  Nilanjan Chatterjee; Jianxin Shi; Montserrat García-Closas
Journal:  Nat Rev Genet       Date:  2016-05-03       Impact factor: 53.242

4.  Second-generation PLINK: rising to the challenge of larger and richer datasets.

Authors:  Christopher C Chang; Carson C Chow; Laurent Cam Tellier; Shashaank Vattikuti; Shaun M Purcell; James J Lee
Journal:  Gigascience       Date:  2015-02-25       Impact factor: 6.524

5.  PRSice: Polygenic Risk Score software.

Authors:  Jack Euesden; Cathryn M Lewis; Paul F O'Reilly
Journal:  Bioinformatics       Date:  2014-12-29       Impact factor: 6.937

6.  Genetics of rheumatoid arthritis contributes to biology and drug discovery.

Authors:  Yukinori Okada; Di Wu; Gosia Trynka; Towfique Raj; Chikashi Terao; Katsunori Ikari; Yuta Kochi; Koichiro Ohmura; Akari Suzuki; Shinji Yoshida; Robert R Graham; Arun Manoharan; Ward Ortmann; Tushar Bhangale; Joshua C Denny; Robert J Carroll; Anne E Eyler; Jeffrey D Greenberg; Joel M Kremer; Dimitrios A Pappas; Lei Jiang; Jian Yin; Lingying Ye; Ding-Feng Su; Jian Yang; Gang Xie; Ed Keystone; Harm-Jan Westra; Tõnu Esko; Andres Metspalu; Xuezhong Zhou; Namrata Gupta; Daniel Mirel; Eli A Stahl; Dorothée Diogo; Jing Cui; Katherine Liao; Michael H Guo; Keiko Myouzen; Takahisa Kawaguchi; Marieke J H Coenen; Piet L C M van Riel; Mart A F J van de Laar; Henk-Jan Guchelaar; Tom W J Huizinga; Philippe Dieudé; Xavier Mariette; S Louis Bridges; Alexandra Zhernakova; Rene E M Toes; Paul P Tak; Corinne Miceli-Richard; So-Young Bang; Hye-Soon Lee; Javier Martin; Miguel A Gonzalez-Gay; Luis Rodriguez-Rodriguez; Solbritt Rantapää-Dahlqvist; Lisbeth Arlestig; Hyon K Choi; Yoichiro Kamatani; Pilar Galan; Mark Lathrop; Steve Eyre; John Bowes; Anne Barton; Niek de Vries; Larry W Moreland; Lindsey A Criswell; Elizabeth W Karlson; Atsuo Taniguchi; Ryo Yamada; Michiaki Kubo; Jun S Liu; Sang-Cheol Bae; Jane Worthington; Leonid Padyukov; Lars Klareskog; Peter K Gregersen; Soumya Raychaudhuri; Barbara E Stranger; Philip L De Jager; Lude Franke; Peter M Visscher; Matthew A Brown; Hisashi Yamanaka; Tsuneyo Mimori; Atsushi Takahashi; Huji Xu; Timothy W Behrens; Katherine A Siminovitch; Shigeki Momohara; Fumihiko Matsuda; Kazuhiko Yamamoto; Robert M Plenge
Journal:  Nature       Date:  2013-12-25       Impact factor: 49.962

7.  The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019.

Authors:  Annalisa Buniello; Jacqueline A L MacArthur; Maria Cerezo; Laura W Harris; James Hayhurst; Cinzia Malangone; Aoife McMahon; Joannella Morales; Edward Mountjoy; Elliot Sollis; Daniel Suveges; Olga Vrousgou; Patricia L Whetzel; Ridwan Amode; Jose A Guillen; Harpreet S Riat; Stephen J Trevanion; Peggy Hall; Heather Junkins; Paul Flicek; Tony Burdett; Lucia A Hindorff; Fiona Cunningham; Helen Parkinson
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

8.  Cox regression increases power to detect genotype-phenotype associations in genomic studies using the electronic health record.

Authors:  Jacob J Hughey; Seth D Rhoades; Darwin Y Fu; Lisa Bastarache; Joshua C Denny; Qingxia Chen
Journal:  BMC Genomics       Date:  2019-11-04       Impact factor: 3.969

9.  Improved polygenic prediction by Bayesian multiple regression on summary statistics.

Authors:  Luke R Lloyd-Jones; Jian Zeng; Julia Sidorenko; Loïc Yengo; Gerhard Moser; Kathryn E Kemper; Huanwei Wang; Zhili Zheng; Reedik Magi; Tõnu Esko; Andres Metspalu; Naomi R Wray; Michael E Goddard; Jian Yang; Peter M Visscher
Journal:  Nat Commun       Date:  2019-11-08       Impact factor: 14.919

10.  An Expanded Genome-Wide Association Study of Type 2 Diabetes in Europeans.

Authors:  Robert A Scott; Laura J Scott; Reedik Mägi; Letizia Marullo; Kyle J Gaulton; Marika Kaakinen; Natalia Pervjakova; Tune H Pers; Andrew D Johnson; John D Eicher; Anne U Jackson; Teresa Ferreira; Yeji Lee; Clement Ma; Valgerdur Steinthorsdottir; Gudmar Thorleifsson; Lu Qi; Natalie R Van Zuydam; Anubha Mahajan; Han Chen; Peter Almgren; Ben F Voight; Harald Grallert; Martina Müller-Nurasyid; Janina S Ried; Nigel W Rayner; Neil Robertson; Lennart C Karssen; Elisabeth M van Leeuwen; Sara M Willems; Christian Fuchsberger; Phoenix Kwan; Tanya M Teslovich; Pritam Chanda; Man Li; Yingchang Lu; Christian Dina; Dorothee Thuillier; Loic Yengo; Longda Jiang; Thomas Sparso; Hans A Kestler; Himanshu Chheda; Lewin Eisele; Stefan Gustafsson; Mattias Frånberg; Rona J Strawbridge; Rafn Benediktsson; Astradur B Hreidarsson; Augustine Kong; Gunnar Sigurðsson; Nicola D Kerrison; Jian'an Luan; Liming Liang; Thomas Meitinger; Michael Roden; Barbara Thorand; Tõnu Esko; Evelin Mihailov; Caroline Fox; Ching-Ti Liu; Denis Rybin; Bo Isomaa; Valeriya Lyssenko; Tiinamaija Tuomi; David J Couper; James S Pankow; Niels Grarup; Christian T Have; Marit E Jørgensen; Torben Jørgensen; Allan Linneberg; Marilyn C Cornelis; Rob M van Dam; David J Hunter; Peter Kraft; Qi Sun; Sarah Edkins; Katharine R Owen; John R B Perry; Andrew R Wood; Eleftheria Zeggini; Juan Tajes-Fernandes; Goncalo R Abecasis; Lori L Bonnycastle; Peter S Chines; Heather M Stringham; Heikki A Koistinen; Leena Kinnunen; Bengt Sennblad; Thomas W Mühleisen; Markus M Nöthen; Sonali Pechlivanis; Damiano Baldassarre; Karl Gertow; Steve E Humphries; Elena Tremoli; Norman Klopp; Julia Meyer; Gerald Steinbach; Roman Wennauer; Johan G Eriksson; Satu Mӓnnistö; Leena Peltonen; Emmi Tikkanen; Guillaume Charpentier; Elodie Eury; Stéphane Lobbens; Bruna Gigante; Karin Leander; Olga McLeod; Erwin P Bottinger; Omri Gottesman; Douglas Ruderfer; Matthias Blüher; Peter Kovacs; Anke Tonjes; Nisa M Maruthur; Chiara Scapoli; Raimund Erbel; Karl-Heinz Jöckel; Susanne Moebus; Ulf de Faire; Anders Hamsten; Michael Stumvoll; Panagiotis Deloukas; Peter J Donnelly; Timothy M Frayling; Andrew T Hattersley; Samuli Ripatti; Veikko Salomaa; Nancy L Pedersen; Bernhard O Boehm; Richard N Bergman; Francis S Collins; Karen L Mohlke; Jaakko Tuomilehto; Torben Hansen; Oluf Pedersen; Inês Barroso; Lars Lannfelt; Erik Ingelsson; Lars Lind; Cecilia M Lindgren; Stephane Cauchi; Philippe Froguel; Ruth J F Loos; Beverley Balkau; Heiner Boeing; Paul W Franks; Aurelio Barricarte Gurrea; Domenico Palli; Yvonne T van der Schouw; David Altshuler; Leif C Groop; Claudia Langenberg; Nicholas J Wareham; Eric Sijbrands; Cornelia M van Duijn; Jose C Florez; James B Meigs; Eric Boerwinkle; Christian Gieger; Konstantin Strauch; Andres Metspalu; Andrew D Morris; Colin N A Palmer; Frank B Hu; Unnur Thorsteinsdottir; Kari Stefansson; Josée Dupuis; Andrew P Morris; Michael Boehnke; Mark I McCarthy; Inga Prokopenko
Journal:  Diabetes       Date:  2017-05-31       Impact factor: 9.337

View more
  30 in total

1.  Screening Human Embryos for Polygenic Traits Has Limited Utility.

Authors:  Ehud Karavani; Or Zuk; Danny Zeevi; Nir Barzilai; Nikos C Stefanis; Alex Hatzimanolis; Nikolaos Smyrnis; Dimitrios Avramopoulos; Leonid Kruglyak; Gil Atzmon; Max Lam; Todd Lencz; Shai Carmi
Journal:  Cell       Date:  2019-11-21       Impact factor: 41.582

Review 2.  Polygenic Scores to Assess Atherosclerotic Cardiovascular Disease Risk: Clinical Perspectives and Basic Implications.

Authors:  Krishna G Aragam; Pradeep Natarajan
Journal:  Circ Res       Date:  2020-04-23       Impact factor: 17.367

3.  An Improved Genome-Wide Polygenic Score Model for Predicting the Risk of Type 2 Diabetes.

Authors:  Wei Liu; Zhenhuang Zhuang; Wenxiu Wang; Tao Huang; Zhonghua Liu
Journal:  Front Genet       Date:  2021-02-11       Impact factor: 4.599

4.  Combined Utility of 25 Disease and Risk Factor Polygenic Risk Scores for Stratifying Risk of All-Cause Mortality.

Authors:  Allison Meisner; Prosenjit Kundu; Yan Dora Zhang; Lauren V Lan; Sungwon Kim; Disha Ghandwani; Parichoy Pal Choudhury; Sonja I Berndt; Neal D Freedman; Montserrat Garcia-Closas; Nilanjan Chatterjee
Journal:  Am J Hum Genet       Date:  2020-08-05       Impact factor: 11.025

Review 5.  Genetic prediction of complex traits with polygenic scores: a statistical review.

Authors:  Ying Ma; Xiang Zhou
Journal:  Trends Genet       Date:  2021-07-06       Impact factor: 11.639

6.  A polygenic-score-based approach for identification of gene-drug interactions stratifying breast cancer risk.

Authors:  Andrew R Marderstein; Scott Kulm; Cheng Peng; Rulla Tamimi; Andrew G Clark; Olivier Elemento
Journal:  Am J Hum Genet       Date:  2021-08-06       Impact factor: 11.025

7.  Evaluation of polygenic prediction methodology within a reference-standardized framework.

Authors:  Oliver Pain; Kylie P Glanville; Saskia P Hagenaars; Saskia Selzam; Anna E Fürtjes; Héléna A Gaspar; Jonathan R I Coleman; Kaili Rimfeld; Gerome Breen; Robert Plomin; Lasse Folkersen; Cathryn M Lewis
Journal:  PLoS Genet       Date:  2021-05-04       Impact factor: 5.917

8.  Leveraging both individual-level genetic data and GWAS summary statistics increases polygenic prediction.

Authors:  Clara Albiñana; Jakob Grove; John J McGrath; Esben Agerbo; Naomi R Wray; Cynthia M Bulik; Merete Nordentoft; David M Hougaard; Thomas Werge; Anders D Børglum; Preben Bo Mortensen; Florian Privé; Bjarni J Vilhjálmsson
Journal:  Am J Hum Genet       Date:  2021-05-07       Impact factor: 11.043

9.  Polygenic prediction of atopic dermatitis improves with atopic training and filaggrin factors.

Authors:  Christopher H Arehart; Michelle Daya; Monica Campbell; Meher Preethi Boorgula; Nicholas Rafaels; Sameer Chavan; Gloria David; Jon Hanifin; Mark K Slifka; Richard L Gallo; Tissa Hata; Lynda C Schneider; Amy S Paller; Peck Y Ong; Jonathan M Spergel; Emma Guttman-Yassky; Donald Y M Leung; Lisa A Beck; Christopher R Gignoux; Rasika A Mathias; Kathleen C Barnes
Journal:  J Allergy Clin Immunol       Date:  2021-06-07       Impact factor: 10.793

Review 10.  How Do We Incorporate Polygenic Risk Scores in Cardiovascular Disease Risk Assessment and Management?

Authors:  Trevor D Hadley; Ali M Agha; Christie M Ballantyne
Journal:  Curr Atheroscler Rep       Date:  2021-04-01       Impact factor: 5.113

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.