Literature DB >> 33836139

Efficient mixed model approach for large-scale genome-wide association studies of ordinal categorical phenotypes.

Wenjian Bi1, Wei Zhou2, Rounak Dey3, Bhramar Mukherjee4, Joshua N Sampson5, Seunggeun Lee6.   

Abstract

In genome-wide association studies, ordinal categorical phenotypes are widely used to measure human behaviors, satisfaction, and preferences. However, because of the lack of analysis tools, methods designed for binary or quantitative traits are commonly used inappropriately to analyze categorical phenotypes. To accurately model the dependence of an ordinal categorical phenotype on covariates, we propose an efficient mixed model association test, proportional odds logistic mixed model (POLMM). POLMM is computationally efficient to analyze large datasets with hundreds of thousands of samples, can control type I error rates at a stringent significance level regardless of the phenotypic distribution, and is more powerful than alternative methods. In contrast, the standard linear mixed model approaches cannot control type I error rates for rare variants when the phenotypic distribution is unbalanced, although they performed well when testing common variants. We applied POLMM to 258 ordinal categorical phenotypes on array genotypes and imputed samples from 408,961 individuals in UK Biobank. In total, we identified 5,885 genome-wide significant variants, of which, 424 variants (7.2%) are rare variants with MAF < 0.01.
Copyright © 2021 American Society of Human Genetics. All rights reserved.

Entities:  

Keywords:  GRM; GWAS; POLMM; PheWAS; UK Biobank; food and other preferences; genetic relationship matrix; genome-wide association studies; mixed model approach; ordinal categorical data; phenome-wide association studies; proportional odds logistic mixed model; saddlepoint approximation; unbalanced phenotypic distribution

Mesh:

Year:  2021        PMID: 33836139      PMCID: PMC8206161          DOI: 10.1016/j.ajhg.2021.03.019

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.043


  27 in total

1.  Merlin--rapid analysis of dense genetic maps using sparse gene flow trees.

Authors:  Gonçalo R Abecasis; Stacey S Cherny; William O Cookson; Lon R Cardon
Journal:  Nat Genet       Date:  2001-12-03       Impact factor: 38.330

2.  Control for Population Structure and Relatedness for Binary Traits in Genetic Association Studies via Logistic Mixed Models.

Authors:  Han Chen; Chaolong Wang; Matthew P Conomos; Adrienne M Stilp; Zilin Li; Tamar Sofer; Adam A Szpiro; Wei Chen; John M Brehm; Juan C Celedón; Susan Redline; George J Papanicolaou; Timothy A Thornton; Cathy C Laurie; Kenneth Rice; Xihong Lin
Journal:  Am J Hum Genet       Date:  2016-03-24       Impact factor: 11.025

3.  Scalable generalized linear mixed model for region-based association tests in large biobanks and cohorts.

Authors:  Wei Zhou; Zhangchen Zhao; Jonas B Nielsen; Lars G Fritsche; Jonathon LeFaive; Sarah A Gagliano Taliun; Wenjian Bi; Maiken E Gabrielsen; Mark J Daly; Benjamin M Neale; Kristian Hveem; Goncalo R Abecasis; Cristen J Willer; Seunggeun Lee
Journal:  Nat Genet       Date:  2020-05-18       Impact factor: 38.330

4.  The emerging landscape of health research based on biobanks linked to electronic health records: Existing resources, statistical challenges, and potential opportunities.

Authors:  Lauren J Beesley; Maxwell Salvatore; Lars G Fritsche; Anita Pandit; Arvind Rao; Chad Brummett; Cristen J Willer; Lynda D Lisabeth; Bhramar Mukherjee
Journal:  Stat Med       Date:  2019-12-20       Impact factor: 2.373

5.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data.

Authors:  Kai Wang; Mingyao Li; Hakon Hakonarson
Journal:  Nucleic Acids Res       Date:  2010-07-03       Impact factor: 16.971

6.  Second-generation PLINK: rising to the challenge of larger and richer datasets.

Authors:  Christopher C Chang; Carson C Chow; Laurent Cam Tellier; Shashaank Vattikuti; Shaun M Purcell; James J Lee
Journal:  Gigascience       Date:  2015-02-25       Impact factor: 6.524

7.  Efficient Bayesian mixed-model analysis increases association power in large cohorts.

Authors:  Po-Ru Loh; George Tucker; Brendan K Bulik-Sullivan; Bjarni J Vilhjálmsson; Hilary K Finucane; Rany M Salem; Daniel I Chasman; Paul M Ridker; Benjamin M Neale; Bonnie Berger; Nick Patterson; Alkes L Price
Journal:  Nat Genet       Date:  2015-02-02       Impact factor: 38.330

8.  A reference panel of 64,976 haplotypes for genotype imputation.

Authors:  Shane McCarthy; Sayantan Das; Warren Kretzschmar; Olivier Delaneau; Andrew R Wood; Alexander Teumer; Hyun Min Kang; Christian Fuchsberger; Petr Danecek; Kevin Sharp; Yang Luo; Carlo Sidore; Alan Kwong; Nicholas Timpson; Seppo Koskinen; Scott Vrieze; Laura J Scott; He Zhang; Anubha Mahajan; Jan Veldink; Ulrike Peters; Carlos Pato; Cornelia M van Duijn; Christopher E Gillies; Ilaria Gandin; Massimo Mezzavilla; Arthur Gilly; Massimiliano Cocca; Michela Traglia; Andrea Angius; Jeffrey C Barrett; Dorrett Boomsma; Kari Branham; Gerome Breen; Chad M Brummett; Fabio Busonero; Harry Campbell; Andrew Chan; Sai Chen; Emily Chew; Francis S Collins; Laura J Corbin; George Davey Smith; George Dedoussis; Marcus Dorr; Aliki-Eleni Farmaki; Luigi Ferrucci; Lukas Forer; Ross M Fraser; Stacey Gabriel; Shawn Levy; Leif Groop; Tabitha Harrison; Andrew Hattersley; Oddgeir L Holmen; Kristian Hveem; Matthias Kretzler; James C Lee; Matt McGue; Thomas Meitinger; David Melzer; Josine L Min; Karen L Mohlke; John B Vincent; Matthias Nauck; Deborah Nickerson; Aarno Palotie; Michele Pato; Nicola Pirastu; Melvin McInnis; J Brent Richards; Cinzia Sala; Veikko Salomaa; David Schlessinger; Sebastian Schoenherr; P Eline Slagboom; Kerrin Small; Timothy Spector; Dwight Stambolian; Marcus Tuke; Jaakko Tuomilehto; Leonard H Van den Berg; Wouter Van Rheenen; Uwe Volker; Cisca Wijmenga; Daniela Toniolo; Eleftheria Zeggini; Paolo Gasparini; Matthew G Sampson; James F Wilson; Timothy Frayling; Paul I W de Bakker; Morris A Swertz; Steven McCarroll; Charles Kooperberg; Annelot Dekker; David Altshuler; Cristen Willer; William Iacono; Samuli Ripatti; Nicole Soranzo; Klaudia Walter; Anand Swaroop; Francesco Cucca; Carl A Anderson; Richard M Myers; Michael Boehnke; Mark I McCarthy; Richard Durbin
Journal:  Nat Genet       Date:  2016-08-22       Impact factor: 38.330

9.  Biological and clinical insights from genetics of insomnia symptoms.

Authors:  Jacqueline M Lane; Samuel E Jones; Deborah A Lawlor; Martin K Rutter; Michael N Weedon; Richa Saxena; Hassan S Dashti; Andrew R Wood; Krishna G Aragam; Vincent T van Hees; Linn B Strand; Bendik S Winsvold; Heming Wang; Jack Bowden; Yanwei Song; Krunal Patel; Simon G Anderson; Robin N Beaumont; David A Bechtold; Brian E Cade; Mary Haas; Sekar Kathiresan; Max A Little; Annemarie I Luik; Andrew S Loudon; Shaun Purcell; Rebecca C Richmond; Frank A J L Scheer; Barbara Schormair; Jessica Tyrrell; John W Winkelman; Juliane Winkelmann; Kristian Hveem; Chen Zhao; Jonas B Nielsen; Cristen J Willer; Susan Redline; Kai Spiegelhalder; Simon D Kyle; David W Ray; John-Anker Zwart; Ben Brumpton; Timothy M Frayling
Journal:  Nat Genet       Date:  2019-02-25       Impact factor: 38.330

10.  The UK Biobank resource with deep phenotyping and genomic data.

Authors:  Clare Bycroft; Colin Freeman; Desislava Petkova; Gavin Band; Lloyd T Elliott; Kevin Sharp; Allan Motyer; Damjan Vukcevic; Olivier Delaneau; Jared O'Connell; Adrian Cortes; Samantha Welsh; Alan Young; Mark Effingham; Gil McVean; Stephen Leslie; Naomi Allen; Peter Donnelly; Jonathan Marchini
Journal:  Nature       Date:  2018-10-10       Impact factor: 49.962

View more
  3 in total

1.  Rare genetic variants explain missing heritability in smoking.

Authors:  Seon-Kyeong Jang; Luke Evans; Allison Fialkowski; Donna K Arnett; Allison E Ashley-Koch; Kathleen C Barnes; Diane M Becker; Joshua C Bis; John Blangero; Eugene R Bleecker; Meher Preethi Boorgula; Donald W Bowden; Jennifer A Brody; Brian E Cade; Brenda W Campbell Jenkins; April P Carson; Sameer Chavan; L Adrienne Cupples; Brian Custer; Scott M Damrauer; Sean P David; Mariza de Andrade; Carla L Dinardo; Tasha E Fingerlin; Myriam Fornage; Barry I Freedman; Melanie E Garrett; Sina A Gharib; David C Glahn; Jeffrey Haessler; Susan R Heckbert; John E Hokanson; Lifang Hou; Shih-Jen Hwang; Matthew C Hyman; Renae Judy; Anne E Justice; Robert C Kaplan; Sharon L R Kardia; Shannon Kelly; Wonji Kim; Charles Kooperberg; Daniel Levy; Donald M Lloyd-Jones; Ruth J F Loos; Ani W Manichaikul; Mark T Gladwin; Lisa Warsinger Martin; Mehdi Nouraie; Olle Melander; Deborah A Meyers; Courtney G Montgomery; Kari E North; Elizabeth C Oelsner; Nicholette D Palmer; Marinelle Payton; Anna L Peljto; Patricia A Peyser; Michael Preuss; Bruce M Psaty; Dandi Qiao; Daniel J Rader; Nicholas Rafaels; Susan Redline; Robert M Reed; Alexander P Reiner; Stephen S Rich; Jerome I Rotter; David A Schwartz; Aladdin H Shadyab; Edwin K Silverman; Nicholas L Smith; J Gustav Smith; Albert V Smith; Jennifer A Smith; Weihong Tang; Kent D Taylor; Marilyn J Telen; Ramachandran S Vasan; Victor R Gordeuk; Zhe Wang; Kerri L Wiggins; Lisa R Yanek; Ivana V Yang; Kendra A Young; Kristin L Young; Yingze Zhang; Dajiang J Liu; Matthew C Keller; Scott Vrieze
Journal:  Nat Hum Behav       Date:  2022-08-04

2.  A semi-parametric Bayesian model for semi-continuous longitudinal data.

Authors:  Junting Ren; Susan Tapert; Chun Chieh Fan; Wesley K Thompson
Journal:  Stat Med       Date:  2022-03-10       Impact factor: 2.497

3.  Genetic association tests in family samples for multi-category phenotypes.

Authors:  Shuai Wang; James B Meigs; Josée Dupuis
Journal:  BMC Genomics       Date:  2021-12-04       Impact factor: 4.547

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.