Literature DB >> 32330416

Accurate and Scalable Construction of Polygenic Scores in Large Biobank Data Sets.

Sheng Yang1, Xiang Zhou2.   

Abstract

Accurate construction of polygenic scores (PGS) can enable early diagnosis of diseases and facilitate the development of personalized medicine. Accurate PGS construction requires prediction models that are both adaptive to different genetic architectures and scalable to biobank scale datasets with millions of individuals and tens of millions of genetic variants. Here, we develop such a method called Deterministic Bayesian Sparse Linear Mixed Model (DBSLMM). DBSLMM relies on a flexible modeling assumption on the effect size distribution to achieve robust and accurate prediction performance across a range of genetic architectures. DBSLMM also relies on a simple deterministic search algorithm to yield an approximate analytic estimation solution using summary statistics only. The deterministic search algorithm, when paired with further algebraic innovations, results in substantial computational savings. With simulations, we show that DBSLMM achieves scalable and accurate prediction performance across a range of realistic genetic architectures. We then apply DBSLMM to analyze 25 traits in UK Biobank. For these traits, compared to existing approaches, DBSLMM achieves an average of 2.03%-101.09% accuracy gain in internal cross-validations. In external validations on two separate datasets, including one from BioBank Japan, DBSLMM achieves an average of 14.74%-522.74% accuracy gain. In these real data applications, DBSLMM is 1.03-28.11 times faster and uses only 7.4%-24.8% of physical memory as compared to other multiple regression-based PGS methods. Overall, DBSLMM represents an accurate and scalable method for constructing PGS in biobank scale datasets.
Copyright © 2020 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  UK Biobank; complex traits; deterministic Bayesian sparse linear mixed model; polygenic risk score; polygenic score

Mesh:

Year:  2020        PMID: 32330416      PMCID: PMC7212266          DOI: 10.1016/j.ajhg.2020.03.013

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  69 in total

1.  Prediction of total genetic value using genome-wide dense marker maps.

Authors:  T H Meuwissen; B J Hayes; M E Goddard
Journal:  Genetics       Date:  2001-04       Impact factor: 4.562

Review 2.  Predicting genetic predisposition in humans: the promise of whole-genome markers.

Authors:  Gustavo de los Campos; Daniel Gianola; David B Allison
Journal:  Nat Rev Genet       Date:  2010-11-03       Impact factor: 53.242

Review 3.  An Expanded View of Complex Traits: From Polygenic to Omnigenic.

Authors:  Evan A Boyle; Yang I Li; Jonathan K Pritchard
Journal:  Cell       Date:  2017-06-15       Impact factor: 41.582

4.  Genome-wide association study identifies 112 new loci for body mass index in the Japanese population.

Authors:  Masato Akiyama; Yukinori Okada; Masahiro Kanai; Atsushi Takahashi; Yukihide Momozawa; Masashi Ikeda; Nakao Iwata; Shiro Ikegawa; Makoto Hirata; Koichi Matsuda; Motoki Iwasaki; Taiki Yamaji; Norie Sawada; Tsuyoshi Hachiya; Kozo Tanno; Atsushi Shimizu; Atsushi Hozawa; Naoko Minegishi; Shoichiro Tsugane; Masayuki Yamamoto; Michiaki Kubo; Yoichiro Kamatani
Journal:  Nat Genet       Date:  2017-09-11       Impact factor: 38.330

5.  A global overview of pleiotropy and genetic architecture in complex traits.

Authors:  Kyoko Watanabe; Sven Stringer; Oleksandr Frei; Maša Umićević Mirkov; Christiaan de Leeuw; Tinca J C Polderman; Sophie van der Sluis; Ole A Andreassen; Benjamin M Neale; Danielle Posthuma
Journal:  Nat Genet       Date:  2019-08-19       Impact factor: 38.330

Review 6.  Overview of the BioBank Japan Project: Study design and profile.

Authors:  Akiko Nagai; Makoto Hirata; Yoichiro Kamatani; Kaori Muto; Koichi Matsuda; Yutaka Kiyohara; Toshiharu Ninomiya; Akiko Tamakoshi; Zentaro Yamagata; Taisei Mushiroda; Yoshinori Murakami; Koichiro Yuji; Yoichi Furukawa; Hitoshi Zembutsu; Toshihiro Tanaka; Yozo Ohnishi; Yusuke Nakamura; Michiaki Kubo
Journal:  J Epidemiol       Date:  2017-02-08       Impact factor: 3.211

7.  Comparing Within- and Between-Family Polygenic Score Prediction.

Authors:  Saskia Selzam; Stuart J Ritchie; Jean-Baptiste Pingault; Chandra A Reynolds; Paul F O'Reilly; Robert Plomin
Journal:  Am J Hum Genet       Date:  2019-07-11       Impact factor: 11.025

8.  Improved polygenic prediction by Bayesian multiple regression on summary statistics.

Authors:  Luke R Lloyd-Jones; Jian Zeng; Julia Sidorenko; Loïc Yengo; Gerhard Moser; Kathryn E Kemper; Huanwei Wang; Zhili Zheng; Reedik Magi; Tõnu Esko; Andres Metspalu; Naomi R Wray; Michael E Goddard; Jian Yang; Peter M Visscher
Journal:  Nat Commun       Date:  2019-11-08       Impact factor: 14.919

9.  Common polygenic variation contributes to risk of schizophrenia and bipolar disorder.

Authors:  Shaun M Purcell; Naomi R Wray; Jennifer L Stone; Peter M Visscher; Michael C O'Donovan; Patrick F Sullivan; Pamela Sklar
Journal:  Nature       Date:  2009-07-01       Impact factor: 49.962

10.  Accuracy of predicting the genetic risk of disease using a genome-wide approach.

Authors:  Hans D Daetwyler; Beatriz Villanueva; John A Woolliams
Journal:  PLoS One       Date:  2008-10-14       Impact factor: 3.240

View more
  21 in total

1.  Multi-scale inference of genetic trait architecture using biologically annotated neural networks.

Authors:  Pinar Demetci; Wei Cheng; Gregory Darnell; Xiang Zhou; Sohini Ramachandran; Lorin Crawford
Journal:  PLoS Genet       Date:  2021-08-19       Impact factor: 5.917

Review 2.  Appraisal of Gene-Environment Interactions in GWAS for Evidence-Based Precision Nutrition Implementation.

Authors:  Rodrigo San-Cristobal; Juan de Toro-Martín; Marie-Claude Vohl
Journal:  Curr Nutr Rep       Date:  2022-08-11

3.  Spatially informed cell-type deconvolution for spatial transcriptomics.

Authors:  Ying Ma; Xiang Zhou
Journal:  Nat Biotechnol       Date:  2022-05-02       Impact factor: 68.164

4.  Leveraging the local genetic structure for trans-ancestry association mapping.

Authors:  Jiashun Xiao; Mingxuan Cai; Xinyi Yu; Xianghong Hu; Gang Chen; Xiang Wan; Can Yang
Journal:  Am J Hum Genet       Date:  2022-06-16       Impact factor: 11.043

Review 5.  Genetic prediction of complex traits with polygenic scores: a statistical review.

Authors:  Ying Ma; Xiang Zhou
Journal:  Trends Genet       Date:  2021-07-06       Impact factor: 11.639

6.  A unified framework for cross-population trait prediction by leveraging the genetic correlation of polygenic traits.

Authors:  Mingxuan Cai; Jiashun Xiao; Shunkang Zhang; Xiang Wan; Hongyu Zhao; Gang Chen; Can Yang
Journal:  Am J Hum Genet       Date:  2021-03-25       Impact factor: 11.025

7.  Evaluation of polygenic prediction methodology within a reference-standardized framework.

Authors:  Oliver Pain; Kylie P Glanville; Saskia P Hagenaars; Saskia Selzam; Anna E Fürtjes; Héléna A Gaspar; Jonathan R I Coleman; Kaili Rimfeld; Gerome Breen; Robert Plomin; Lasse Folkersen; Cathryn M Lewis
Journal:  PLoS Genet       Date:  2021-05-04       Impact factor: 5.917

8.  Leveraging both individual-level genetic data and GWAS summary statistics increases polygenic prediction.

Authors:  Clara Albiñana; Jakob Grove; John J McGrath; Esben Agerbo; Naomi R Wray; Cynthia M Bulik; Merete Nordentoft; David M Hougaard; Thomas Werge; Anders D Børglum; Preben Bo Mortensen; Florian Privé; Bjarni J Vilhjálmsson
Journal:  Am J Hum Genet       Date:  2021-05-07       Impact factor: 11.043

9.  Cancer PRSweb: An Online Repository with Polygenic Risk Scores for Major Cancer Traits and Their Evaluation in Two Independent Biobanks.

Authors:  Lars G Fritsche; Snehal Patil; Lauren J Beesley; Peter VandeHaar; Maxwell Salvatore; Ying Ma; Robert B Peng; Daniel Taliun; Xiang Zhou; Bhramar Mukherjee
Journal:  Am J Hum Genet       Date:  2020-09-28       Impact factor: 11.025

10.  SPARK-X: non-parametric modeling enables scalable and robust detection of spatial expression patterns for large spatial transcriptomic studies.

Authors:  Jiaqiang Zhu; Shiquan Sun; Xiang Zhou
Journal:  Genome Biol       Date:  2021-06-21       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.