Andrew R Wood1, Tonu Esko2, Jian Yang3, Sailaja Vedantam4, Tune H Pers5, Stefan Gustafsson6, Audrey Y Chu7, Karol Estrada8, Jian'an Luan9, Zoltán Kutalik10, Najaf Amin11, Martin L Buchkovich12, Damien C Croteau-Chonka13, Felix R Day9, Yanan Duan14, Tove Fall15, Rudolf Fehrmann16, Teresa Ferreira17, Anne U Jackson18, Juha Karjalainen16, Ken Sin Lo19, Adam E Locke18, Reedik Mägi20, Evelin Mihailov21, Eleonora Porcu22, Joshua C Randall23, André Scherag24, Anna A E Vinkhuyzen25, Harm-Jan Westra16, Thomas W Winkler26, Tsegaselassie Workalemahu27, Jing Hua Zhao9, Devin Absher28, Eva Albrecht29, Denise Anderson30, Jeffrey Baron31, Marian Beekman32, Ayse Demirkan33, Georg B Ehret34, Bjarke Feenstra35, Mary F Feitosa36, Krista Fischer37, Ross M Fraser38, Anuj Goel39, Jian Gong40, Anne E Justice41, Stavroula Kanoni42, Marcus E Kleber43, Kati Kristiansson44, Unhee Lim45, Vaneet Lotay46, Julian C Lui31, Massimo Mangino47, Irene Mateo Leach48, Carolina Medina-Gomez49, Michael A Nalls50, Dale R Nyholt51, Cameron D Palmer4, Dorota Pasko1, Sonali Pechlivanis52, Inga Prokopenko53, Janina S Ried29, Stephan Ripke54, Dmitry Shungin55, Alena Stancáková56, Rona J Strawbridge57, Yun Ju Sung58, Toshiko Tanaka59, Alexander Teumer60, Stella Trompet61, Sander W van der Laan62, Jessica van Setten63, Jana V Van Vliet-Ostaptchouk64, Zhaoming Wang65, Loïc Yengo66, Weihua Zhang67, Uzma Afzal67, Johan Arnlöv68, Gillian M Arscott69, Stefania Bandinelli70, Amy Barrett71, Claire Bellis72, Amanda J Bennett71, Christian Berne73, Matthias Blüher74, Jennifer L Bolton38, Yvonne Böttcher75, Heather A Boyd35, Marcel Bruinenberg76, Brendan M Buckley77, Steven Buyske78, Ida H Caspersen79, Peter S Chines80, Robert Clarke81, Simone Claudi-Boehm82, Matthew Cooper30, E Warwick Daw36, Pim A De Jong83, Joris Deelen32, Graciela Delgado84, Josh C Denny85, Rosalie Dhonukshe-Rutten86, Maria Dimitriou87, Alex S F Doney88, Marcus Dörr89, Niina Eklund90, Elodie Eury66, Lasse Folkersen57, Melissa E Garcia91, Frank Geller35, Vilmantas Giedraitis92, Alan S Go93, Harald Grallert94, Tanja B Grammer84, Jürgen Gräßler95, Henrik Grönberg96, Lisette C P G M de Groot86, Christopher J Groves71, Jeffrey Haessler40, Per Hall96, Toomas Haller37, Goran Hallmans97, Anke Hannemann98, Catharina A Hartman99, Maija Hassinen100, Caroline Hayward101, Nancy L Heard-Costa102, Quinta Helmer103, Gibran Hemani3, Anjali K Henders51, Hans L Hillege104, Mark A Hlatky105, Wolfgang Hoffmann106, Per Hoffmann107, Oddgeir Holmen108, Jeanine J Houwing-Duistermaat109, Thomas Illig110, Aaron Isaacs111, Alan L James112, Janina Jeff46, Berit Johansen79, Åsa Johansson113, Jennifer Jolley114, Thorhildur Juliusdottir17, Juhani Junttila115, Abel N Kho116, Leena Kinnunen44, Norman Klopp110, Thomas Kocher117, Wolfgang Kratzer118, Peter Lichtner119, Lars Lind120, Jaana Lindström44, Stéphane Lobbens66, Mattias Lorentzon121, Yingchang Lu122, Valeriya Lyssenko123, Patrik K E Magnusson96, Anubha Mahajan17, Marc Maillard124, Wendy L McArdle125, Colin A McKenzie126, Stela McLachlan38, Paul J McLaren127, Cristina Menni47, Sigrun Merger82, Lili Milani37, Alireza Moayyeri47, Keri L Monda128, Mario A Morken80, Gabriele Müller129, Martina Müller-Nurasyid130, Arthur W Musk131, Narisu Narisu80, Matthias Nauck132, Ilja M Nolte133, Markus M Nöthen134, Laticia Oozageer135, Stefan Pilz136, Nigel W Rayner137, Frida Renstrom138, Neil R Robertson139, Lynda M Rose7, Ronan Roussel140, Serena Sanna22, Hubert Scharnagl141, Salome Scholtens133, Fredrick R Schumacher142, Heribert Schunkert143, Robert A Scott9, Joban Sehmi67, Thomas Seufferlein118, Jianxin Shi144, Karri Silventoinen145, Johannes H Smit146, Albert Vernon Smith147, Joanna Smolonska148, Alice V Stanton149, Kathleen Stirrups150, David J Stott151, Heather M Stringham18, Johan Sundström120, Morris A Swertz16, Ann-Christine Syvänen152, Bamidele O Tayo153, Gudmar Thorleifsson154, Jonathan P Tyrer155, Suzanne van Dijk156, Natasja M van Schoor157, Nathalie van der Velde158, Diana van Heemst159, Floor V A van Oort160, Sita H Vermeulen161, Niek Verweij48, Judith M Vonk133, Lindsay L Waite28, Melanie Waldenberger162, Roman Wennauer163, Lynne R Wilkens45, Christina Willenborg164, Tom Wilsgaard165, Mary K Wojczynski36, Andrew Wong166, Alan F Wright101, Qunyuan Zhang36, Dominique Arveiler167, Stephan J L Bakker168, John Beilby169, Richard N Bergman170, Sven Bergmann171, Reiner Biffar172, John Blangero72, Dorret I Boomsma173, Stefan R Bornstein95, Pascal Bovet174, Paolo Brambilla175, Morris J Brown176, Harry Campbell38, Mark J Caulfield177, Aravinda Chakravarti178, Rory Collins81, Francis S Collins80, Dana C Crawford179, L Adrienne Cupples180, John Danesh181, Ulf de Faire182, Hester M den Ruijter183, Raimund Erbel184, Jeanette Erdmann164, Johan G Eriksson185, Martin Farrall39, Ele Ferrannini186, Jean Ferrières187, Ian Ford188, Nita G Forouhi9, Terrence Forrester126, Ron T Gansevoort168, Pablo V Gejman189, Christian Gieger29, Alain Golay190, Omri Gottesman46, Vilmundur Gudnason147, Ulf Gyllensten113, David W Haas191, Alistair S Hall192, Tamara B Harris91, Andrew T Hattersley193, Andrew C Heath194, Christian Hengstenberg143, Andrew A Hicks195, Lucia A Hindorff196, Aroon D Hingorani197, Albert Hofman198, G Kees Hovingh199, Steve E Humphries200, Steven C Hunt201, Elina Hypponen202, Kevin B Jacobs203, Marjo-Riitta Jarvelin204, Pekka Jousilahti44, Antti M Jula44, Jaakko Kaprio205, John J P Kastelein199, Manfred Kayser206, Frank Kee207, Sirkka M Keinanen-Kiukaanniemi208, Lambertus A Kiemeney209, Jaspal S Kooner210, Charles Kooperberg40, Seppo Koskinen44, Peter Kovacs74, Aldi T Kraja36, Meena Kumari211, Johanna Kuusisto212, Timo A Lakka213, Claudia Langenberg214, Loic Le Marchand45, Terho Lehtimäki215, Sara Lupoli216, Pamela A F Madden194, Satu Männistö44, Paolo Manunta217, André Marette218, Tara C Matise219, Barbara McKnight220, Thomas Meitinger221, Frans L Moll222, Grant W Montgomery51, Andrew D Morris88, Andrew P Morris223, Jeffrey C Murray224, Mari Nelis37, Claes Ohlsson121, Albertine J Oldehinkel99, Ken K Ong225, Willem H Ouwehand114, Gerard Pasterkamp62, Annette Peters226, Peter P Pramstaller227, Jackie F Price38, Lu Qi228, Olli T Raitakari229, Tuomo Rankinen230, D C Rao231, Treva K Rice232, Marylyn Ritchie233, Igor Rudan234, Veikko Salomaa44, Nilesh J Samani235, Jouko Saramies236, Mark A Sarzynski230, Peter E H Schwarz237, Sylvain Sebert238, Peter Sever239, Alan R Shuldiner240, Juha Sinisalo241, Valgerdur Steinthorsdottir154, Ronald P Stolk133, Jean-Claude Tardif242, Anke Tönjes74, Angelo Tremblay243, Elena Tremoli244, Jarmo Virtamo44, Marie-Claude Vohl245, Philippe Amouyel246, Folkert W Asselbergs247, Themistocles L Assimes105, Murielle Bochud174, Bernhard O Boehm248, Eric Boerwinkle249, Erwin P Bottinger46, Claude Bouchard230, Stéphane Cauchi66, John C Chambers250, Stephen J Chanock251, Richard S Cooper153, Paul I W de Bakker252, George Dedoussis87, Luigi Ferrucci59, Paul W Franks253, Philippe Froguel254, Leif C Groop255, Christopher A Haiman142, Anders Hamsten57, M Geoffrey Hayes116, Jennie Hui256, David J Hunter257, Kristian Hveem108, J Wouter Jukema258, Robert C Kaplan259, Mika Kivimaki211, Diana Kuh166, Markku Laakso212, Yongmei Liu260, Nicholas G Martin51, Winfried März261, Mads Melbye262, Susanne Moebus52, Patricia B Munroe177, Inger Njølstad165, Ben A Oostra263, Colin N A Palmer88, Nancy L Pedersen96, Markus Perola264, Louis Pérusse265, Ulrike Peters40, Joseph E Powell3, Chris Power266, Thomas Quertermous105, Rainer Rauramaa267, Eva Reinmaa37, Paul M Ridker268, Fernando Rivadeneira49, Jerome I Rotter269, Timo E Saaristo270, Danish Saleheen271, David Schlessinger272, P Eline Slagboom32, Harold Snieder133, Tim D Spector47, Konstantin Strauch273, Michael Stumvoll74, Jaakko Tuomilehto274, Matti Uusitupa275, Pim van der Harst276, Henry Völzke106, Mark Walker277, Nicholas J Wareham9, Hugh Watkins39, H-Erich Wichmann278, James F Wilson38, Pieter Zanen279, Panos Deloukas280, Iris M Heid281, Cecilia M Lindgren282, Karen L Mohlke12, Elizabeth K Speliotes283, Unnur Thorsteinsdottir284, Inês Barroso285, Caroline S Fox286, Kari E North287, David P Strachan288, Jacques S Beckmann289, Sonja I Berndt251, Michael Boehnke18, Ingrid B Borecki36, Mark I McCarthy290, Andres Metspalu21, Kari Stefansson284, André G Uitterlinden49, Cornelia M van Duijn291, Lude Franke16, Cristen J Willer292, Alkes L Price293, Guillaume Lettre242, Ruth J F Loos294, Michael N Weedon1, Erik Ingelsson295, Jeffrey R O'Connell296, Goncalo R Abecasis18, Daniel I Chasman268, Michael E Goddard297, Peter M Visscher3, Joel N Hirschhorn298, Timothy M Frayling1. 1. Genetics of Complex Traits, University of Exeter Medical School, University of Exeter, Exeter, UK. 2. 1] Estonian Genome Center, University of Tartu, Tartu, Estonia. [2] Division of Endocrinology, Genetics and Basic and Translational Obesity Research, Boston Children's Hospital, Boston, Massachusetts, USA. [3] Broad Institute of MIT and Harvard University, Cambridge, Massachusetts, USA. [4] Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA. 3. 1] Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia. [2] University of Queensland Diamantina Institute, Translation Research Institute, Brisbane, Queensland, Australia. 4. 1] Division of Endocrinology, Genetics and Basic and Translational Obesity Research, Boston Children's Hospital, Boston, Massachusetts, USA. [2] Broad Institute of MIT and Harvard University, Cambridge, Massachusetts, USA. 5. 1] Division of Endocrinology, Genetics and Basic and Translational Obesity Research, Boston Children's Hospital, Boston, Massachusetts, USA. [2] Broad Institute of MIT and Harvard University, Cambridge, Massachusetts, USA. [3] Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA. [4] Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Lyngby, Denmark. 6. 1] Science for Life Laboratory, Uppsala University, Uppsala, Sweden. [2] Department of Medical Sciences, Molecular Epidemiology, Uppsala University, Uppsala, Sweden. 7. Division of Preventive Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA. 8. 1] Broad Institute of MIT and Harvard University, Cambridge, Massachusetts, USA. [2] Department of Internal Medicine, Erasmus Medical Center, Rotterdam, the Netherlands. [3] Analytic and Translational Genetics Unit, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, USA. 9. Medical Research Council (MRC) Epidemiology Unit, University of Cambridge, Institute of Metabolic Science, Addenbrooke's Hospital, Cambridge, UK. 10. 1] Institute of Social and Preventive Medicine (IUMSP), Centre Hospitalier Universitaire Vaudois (CHUV), Lausanne, Switzerland. [2] Swiss Institute of Bioinformatics, Lausanne, Switzerland. [3] Department of Medical Genetics, University of Lausanne, Lausanne, Switzerland. 11. Genetic Epidemiology Unit, Department of Epidemiology, Erasmus University Medical Center, Rotterdam, the Netherlands. 12. Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA. 13. 1] Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA. [2] Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA. 14. Department of Genetics, Division of Statistical Genomics, Washington University School of Medicine, St. Louis, Missouri, USA. 15. 1] Science for Life Laboratory, Uppsala University, Uppsala, Sweden. [2] Department of Medical Sciences, Molecular Epidemiology, Uppsala University, Uppsala, Sweden. [3] Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden. 16. Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands. 17. Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK. 18. Center for Statistical Genetics, Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA. 19. Montreal Heart Institute, Montreal, Quebec, Canada. 20. 1] Estonian Genome Center, University of Tartu, Tartu, Estonia. [2] Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK. 21. 1] Estonian Genome Center, University of Tartu, Tartu, Estonia. [2] Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia. 22. Istituto di Ricerca Genetica e Biomedica (IRGB), Consiglio Nazionale delle Ricerche, Cagliari, Italy. 23. 1] Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK. [2] Wellcome Trust Sanger Institute, Hinxton, UK. 24. 1] Institute for Medical Informatics, Biometry and Epidemiology (IMIBE), University Hospital Essen, Essen, Germany. [2] Clinical Epidemiology, Integrated Research and Treatment Center, Center for Sepsis Control and Care (CSCC), Jena University Hospital, Jena, Germany. 25. Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia. 26. Department of Genetic Epidemiology, Institute of Epidemiology and Preventive Medicine, University of Regensburg, Regensburg, Germany. 27. Department of Nutrition, Harvard School of Public Health, Harvard University, Boston, Massachusetts, USA. 28. HudsonAlpha Institute for Biotechnology, Huntsville, Alabama, USA. 29. Institute of Genetic Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany. 30. Telethon Institute for Child Health Research, Centre for Child Health Research, University of Western Australia, Perth, Western Australia, Australia. 31. Section on Growth and Development, Program in Developmental Endocrinology and Genetics, Eunice Kennedy Shriver National Institute of Child Health and Human Development, US National Institutes of Health, Bethesda, Maryland, USA. 32. 1] Netherlands Consortium for Healthy Aging (NCHA), Leiden University Medical Center, Leiden, the Netherlands. [2] Department of Molecular Epidemiology, Leiden University Medical Center, Leiden, the Netherlands. 33. 1] Genetic Epidemiology Unit, Department of Epidemiology, Erasmus University Medical Center, Rotterdam, the Netherlands. [2] Department of Human Genetics, Leiden University Medical Center, Leiden, the Netherlands. 34. 1] Center for Complex Disease Genomics, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA. [2] Department of Specialties of Internal Medicine, Division of Cardiology, Geneva University Hospital, Geneva, Switzerland. 35. Department of Epidemiology Research, Statens Serum Institut, Copenhagen, Denmark. 36. Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, USA. 37. Estonian Genome Center, University of Tartu, Tartu, Estonia. 38. Centre for Population Health Sciences, University of Edinburgh, Edinburgh, UK. 39. 1] Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK. [2] Radcliffe Department of Medicine, Division of Cardiovascular Medicine, University of Oxford, Oxford, UK. 40. Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA. 41. Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA. 42. William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, UK. 43. 1] Vth Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Heidelberg, Germany. [2] Department of Internal Medicine II, Ulm University Medical Centre, Ulm, Germany. 44. National Institute for Health and Welfare, Helsinki, Finland. 45. Epidemiology Program, University of Hawaii Cancer Center, Honolulu, Hawaii, USA. 46. Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA. 47. Department of Twin Research and Genetic Epidemiology, King's College London, London, UK. 48. Department of Cardiology, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands. 49. 1] Department of Internal Medicine, Erasmus Medical Center, Rotterdam, the Netherlands. [2] Netherlands Consortium for Healthy Aging (NCHA), Rotterdam, the Netherlands. [3] Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands. 50. Laboratory of Neurogenetics, National Institute on Aging, US National Institutes of Health, Bethesda, Maryland, USA. 51. QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia. 52. Institute for Medical Informatics, Biometry and Epidemiology (IMIBE), University Hospital Essen, Essen, Germany. 53. 1] Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK. [2] Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Oxford, UK. [3] Department of Genomics of Common Disease, School of Public Health, Imperial College London, Hammersmith Hospital, London, UK. 54. 1] Analytic and Translational Genetics Unit, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, USA. [2] Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard University, Cambridge, Massachusetts, USA. 55. 1] Department of Clinical Sciences, Genetic and Molecular Epidemiology Unit, Lund University Diabetes Center, Skåne University Hospital, Malmö, Sweden. [2] Department of Public Health and Clinical Medicine, Unit of Medicine, Umeå University, Umeå, Sweden. [3] Department of Odontology, Umeå University, Umeå, Sweden. 56. Department of Medicine, University of Eastern Finland, Kuopio, Finland. 57. Atherosclerosis Research Unit, Center for Molecular Medicine, Department of Medicine, Karolinska Institutet, Stockholm, Sweden. 58. Division of Biostatistics, Washington University School of Medicine, St. Louis, Missouri, USA. 59. Translational Gerontology Branch, National Institute on Aging, Baltimore, Maryland, USA. 60. Interfaculty Institute for Genetics and Functional Genomics, University Medicine Greifswald, Greifswald, Germany. 61. 1] Department of Cardiology, Leiden University Medical Center, Leiden, the Netherlands. [2] Department of Gerontology and Geriatrics, Leiden University Medical Center, Leiden, the Netherlands. 62. Experimental Cardiology Laboratory, Division of Heart and Lungs, University Medical Center Utrecht, Utrecht, the Netherlands. 63. Department of Medical Genetics, University Medical Center Utrecht, Utrecht, the Netherlands. 64. Department of Endocrinology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands. 65. 1] DZHK (Deutsches Zentrum für Herz-Kreislaufforschung-German Centre for Cardiovascular Research), partner site Greifswald, Greifswald, Germany. [2] Institute of Clinical Chemistry and Laboratory Medicine, University Medicine Greifswald, Greifswald, Germany. [3] Division of Cancer Epidemiology and Genetics, National Cancer Institute, US National Institutes of Health, Bethesda, Maryland, USA. [4] Core Genotyping Facility, SAIC-Frederick, Inc., NCI-Frederick, Frederick, Maryland, USA. 66. 1] CNRS UMR 8199, Lille, France. [2] European Genomic Institute for Diabetes, Lille, France. [3] Université de Lille 2, Lille, France. 67. 1] Ealing Hospital National Health Service (NHS) Trust, Middlesex, UK. [2] Department of Epidemiology and Biostatistics, Imperial College London, London, UK. 68. 1] Science for Life Laboratory, Uppsala University, Uppsala, Sweden. [2] Department of Medical Sciences, Molecular Epidemiology, Uppsala University, Uppsala, Sweden. [3] School of Health and Social Studies, Dalarna University, Falun, Sweden. 69. PathWest Laboratory Medicine of Western Australia, Nedlands, Western Australia, Australia. 70. Geriatric Unit, Azienda Sanitaria Firenze (ASF), Florence, Italy. 71. Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Oxford, UK. 72. Department of Genetics, Texas Biomedical Research Institute, San Antonio, Texas, USA. 73. Department of Medical Sciences, Endocrinology, Diabetes and Metabolism, Uppsala University, Uppsala, Sweden. 74. 1] Integrated Research and Treatment Center (IFB) Adiposity Diseases, University of Leipzig, Leipzig, Germany. [2] Department of Medicine, University of Leipzig, Leipzig, Germany. 75. Integrated Research and Treatment Center (IFB) Adiposity Diseases, University of Leipzig, Leipzig, Germany. 76. LifeLines, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands. 77. Department of Pharmacology and Therapeutics, University College Cork, Cork, Ireland. 78. 1] Department of Statistics and Biostatistics, Rutgers University, Piscataway, New Jersy, USA. [2] Department of Genetics, Rutgers University, Piscataway, New Jersey, USA. 79. Department of Biology, Norwegian University of Science and Technology, Trondheim, Norway. 80. Genome Technology Branch, National Human Genome Research Institute, US National Institutes of Health, Bethesda, Maryland, USA. 81. Clinical Trial Service Unit, Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK. 82. Division of Endocrinology, Diabetes and Metabolism, Ulm University Medical Centre, Ulm, Germany. 83. Department of Radiology, University Medical Center Utrecht, Utrecht, the Netherlands. 84. Vth Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Heidelberg, Germany. 85. Department of Biomedical Informatics, Vanderbilt University, Nashville, Tennessee, USA. 86. Department of Human Nutrition, Wageningen University, Wageningen, the Netherlands. 87. Department of Dietetics-Nutrition, Harokopio University, Athens, Greece. 88. Medical Research Institute, University of Dundee, Ninewells Hospital and Medical School, Dundee, UK. 89. 1] DZHK (Deutsches Zentrum für Herz-Kreislaufforschung-German Centre for Cardiovascular Research), partner site Greifswald, Greifswald, Germany. [2] Department of Internal Medicine B, University Medicine Greifswald, Greifswald, Germany. 90. 1] National Institute for Health and Welfare, Helsinki, Finland. [2] Institute for Molecular Medicine, University of Helsinki, Helsinki, Finland. 91. Laboratory of Epidemiology and Population Sciences, National Institute on Aging, US National Institutes of Health, Bethesda, Maryland, USA. 92. Department of Public Health and Caring Sciences, Geriatrics, Uppsala University, Uppsala, Sweden. 93. Kaiser Permanente, Division of Research, Oakland, California, USA. 94. 1] Institute of Genetic Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany. [2] Research Unit of Molecular Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany. [3] German Center for Diabetes Research (DZD), Neuherberg, Germany. 95. Department of Medicine III, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany. 96. Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden. 97. Unit of Nutritional Research, Department of Public Health and Clinical Medicine, Umeå University, Umeå, Sweden. 98. Institute of Clinical Chemistry and Laboratory Medicine, University Medicine Greifswald, Greifswald, Germany. 99. Department of Psychiatry, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands. 100. Kuopio Research Institute of Exercise Medicine, Kuopio, Finland. 101. MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh, UK. 102. 1] National Heart, Lung, and Blood Institute, Framingham Heart Study, Framingham, Massachusetts, USA. [2] Department of Neurology, Boston University School of Medicine, Boston, Massachusetts, USA. 103. 1] Netherlands Consortium for Healthy Aging (NCHA), Leiden University Medical Center, Leiden, the Netherlands. [2] Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands. [3] Faculty of Psychology and Education, VU University Amsterdam, Amsterdam, the Netherlands. 104. 1] Department of Cardiology, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands. [2] Department of Epidemiology, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands. 105. Department of Medicine, Stanford University School of Medicine, Stanford, California, USA. 106. 1] DZHK (Deutsches Zentrum für Herz-Kreislaufforschung-German Centre for Cardiovascular Research), partner site Greifswald, Greifswald, Germany. [2] Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany. 107. 1] Department of Biomedicine, Division of Medical Genetics, University of Basel, Basel, Switzerland. [2] Department of Genomics, Life and Brain Center, University of Bonn, Bonn, Germany. [3] Institute of Human Genetics, University of Bonn, Bonn, Germany. 108. Department of Public Health and General Practice, Norwegian University of Science and Technology, Trondheim, Norway. 109. 1] Netherlands Consortium for Healthy Aging (NCHA), Leiden University Medical Center, Leiden, the Netherlands. [2] Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, the Netherlands. 110. 1] Research Unit of Molecular Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany. [2] Hannover Unified Biobank, Hannover Medical School, Hannover, Germany. 111. 1] Genetic Epidemiology Unit, Department of Epidemiology, Erasmus University Medical Center, Rotterdam, the Netherlands. [2] Center for Medical Systems Biology, Leiden, the Netherlands. 112. 1] Department of Pulmonary Physiology and Sleep Medicine, Nedlands, Western Australia, Australia. [2] School of Medicine and Pharmacology, University of Western Australia, Crawley, Western Australia, Australia. 113. Department of Immunology, Genetics and Pathology, SciLifeLab, Rudbeck Laboratory, Uppsala University, Uppsala, Sweden. 114. 1] Department of Haematology, University of Cambridge, Cambridge, UK. [2] NHS Blood and Transplant, Cambridge, UK. 115. Department of Medicine, University of Oulu, Oulu, Finland. 116. Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois, USA. 117. Unit of Periodontology, Department of Restorative Dentistry, Periodontology and Endodontology, University Medicine Greifswald, Greifswald, Germany. 118. Department of Internal Medicine I, Ulm University Medical Centre, Ulm, Germany. 119. Institute of Human Genetics, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany. 120. Department of Medical Sciences, Cardiovascular Epidemiology, Uppsala University, Uppsala, Sweden. 121. Centre for Bone and Arthritis Research, Department of Internal Medicine and Clinical Nutrition, Institute of Medicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden. 122. 1] Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA. [2] Genetics of Obesity and Related Metabolic Traits Program, Icahn School of Medicine at Mount Sinai, New York, New York, USA. 123. Steno Diabetes Center A/S, Gentofte Denmark. 124. Service of Nephrology, Department of Medicine, Lausanne University Hospital (CHUV), Lausanne, Switzerland. 125. School of Social and Community Medicine, University of Bristol, Bristol, UK. 126. Tropical Metabolism Research Unit, Tropical Medicine Research Institute, University of the West Indies, Mona, Kingston, Jamaica. 127. 1] Global Health Institute, Department of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland. [2] Institute of Microbiology, University Hospital and University of Lausanne, Lausanne, Switzerland. 128. 1] Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA. [2] Center for Observational Research, Amgen, Inc., Thousand Oaks, California, USA. 129. Center for Evidence-Based Healthcare, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany. 130. 1] Institute of Genetic Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany. [2] Department of Medicine I, University Hospital Großhadern, Ludwig Maximilians Universität, Munich, Germany. [3] Chair of Genetic Epidemiology, Institute of Medical Informatics, Biometry and Epidemiology, Ludwig Maximilians Universität, Neuherberg, Germany. [4] DZHK (Deutsches Forschungszentrum für Herz-Kreislauferkrankungen-German Research Centre for Cardiovascular Research), Munich Heart Alliance, Munich, Germany. 131. Department of Respiratory Medicine, Sir Charles Gairdner Hospital, Nedlands, Western Australia, Australia. 132. 1] DZHK (Deutsches Zentrum für Herz-Kreislaufforschung-German Centre for Cardiovascular Research), partner site Greifswald, Greifswald, Germany. [2] Institute of Clinical Chemistry and Laboratory Medicine, University Medicine Greifswald, Greifswald, Germany. 133. Department of Epidemiology, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands. 134. 1] Department of Genomics, Life and Brain Center, University of Bonn, Bonn, Germany. [2] Institute of Human Genetics, University of Bonn, Bonn, Germany. 135. Ealing Hospital National Health Service (NHS) Trust, Middlesex, UK. 136. 1] Department of Epidemiology and Biostatistics, Institute for Research in Extramural Medicine (EMGO) Institute for Health and Care Research, VU University Medical Center, Amsterdam, the Netherlands. [2] Department of Internal Medicine, Division of Endocrinology and Metabolism, Medical University of Graz, Graz, Austria. 137. 1] Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK. [2] Wellcome Trust Sanger Institute, Hinxton, UK. [3] Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Oxford, UK. 138. Department of Clinical Sciences, Genetic and Molecular Epidemiology Unit, Lund University Diabetes Center, Skåne University Hospital, Malmö, Sweden. 139. 1] Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK. [2] Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Oxford, UK. 140. 1] Diabetology-Endocrinology-Nutrition, Public Hospital System of the City of Paris (AP-HP), Bichat Hospital, Paris, France. [2] INSERM U872, Centre de Recherche des Cordeliers, Paris, France. [3] Paris Diderot University, Paris, France. 141. Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz, Austria. 142. Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, USA. 143. 1] DZHK (Deutsches Forschungszentrum für Herz-Kreislauferkrankungen-German Research Centre for Cardiovascular Research), Munich Heart Alliance, Munich, Germany. [2] Deutsches Herzzentrum München, Technische Universität München, Munich, Germany. 144. National Cancer Institute, Bethesda, Maryland, USA. 145. Department of Sociology, University of Helsinki, Helsinki, Finland. 146. 1] EMGO Institute for Health and Care Research, VU University, Amsterdam, the Netherlands. [2] Department of Psychiatry, Neuroscience Campus, VU University Amsterdam, Amsterdam, the Netherlands. 147. 1] Icelandic Heart Association, Kopavogur, Iceland. [2] University of Iceland, Reykjavik, Iceland. 148. 1] Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands. [2] Department of Epidemiology, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands. 149. Molecular and Cellular Therapeutics, Royal College of Surgeons in Ireland, Dublin, Ireland. 150. 1] Wellcome Trust Sanger Institute, Hinxton, UK. [2] William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, UK. 151. Institute of Cardiovascular and Medical Sciences, Faculty of Medicine, University of Glasgow, Glasgow, UK. 152. 1] Science for Life Laboratory, Uppsala University, Uppsala, Sweden. [2] Department of Medical Sciences, Molecular Medicine, Uppsala University, Uppsala, Sweden. 153. Department of Public Health Sciences, Stritch School of Medicine, Loyola University of Chicago, Maywood, Illinois, USA. 154. deCODE Genetics, Amgen, Inc., Reykjavik, Iceland. 155. Department of Oncology, University of Cambridge, Cambridge, UK. 156. Department of Internal Medicine, Erasmus Medical Center, Rotterdam, the Netherlands. 157. Department of Epidemiology and Biostatistics, Institute for Research in Extramural Medicine (EMGO) Institute for Health and Care Research, VU University Medical Center, Amsterdam, the Netherlands. 158. 1] Department of Internal Medicine, Erasmus Medical Center, Rotterdam, the Netherlands. [2] Section of Geriatrics, Department of Internal Medicine, Academic Medical Center, Amsterdam, the Netherlands. 159. 1] Netherlands Consortium for Healthy Aging (NCHA), Leiden University Medical Center, Leiden, the Netherlands. [2] Department of Gerontology and Geriatrics, Leiden University Medical Center, Leiden, the Netherlands. 160. Department of Child and Adolescent Psychiatry, Psychology, Erasmus University Medical Center, Rotterdam, the Netherlands. 161. 1] Department for Health Evidence, Radboud University Medical Centre, Nijmegen, the Netherlands. [2] Department of Genetics, Radboud University Medical Centre, Nijmegen, the Netherlands. 162. Research Unit of Molecular Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany. 163. Department of Clinical Chemistry, Ulm University Medical Centre, Ulm, Germany. 164. 1] DZHK (Deutsches Forschungszentrum für Herz-Kreislauferkrankungen-German Research Centre for Cardiovascular Research), partner site Hamburg-Lubeck-Kiel, Lubeck, Germany. [2] Institut für Integrative und Experimentelle Genomik, Universität zu Lübeck, Lübeck, Germany. 165. Department of Community Medicine, Faculty of Health Sciences, UiT The Arctic University of Tromsø, Tromsø, Norway. 166. MRC Unit for Lifelong Health and Ageing at University College London, London, UK. 167. Department of Epidemiology and Public Health, University of Strasbourg, Faculty of Medicine, Strasbourg, France. 168. Department of Internal Medicine, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands. 169. 1] PathWest Laboratory Medicine of Western Australia, Nedlands, Western Australia, Australia. [2] Pathology and Laboratory Medicine, University of Western Australia, Perth, Western Australia, Australia. 170. Cedars-Sinai Diabetes and Obesity Research Institute, Los Angeles, California, USA. 171. 1] Swiss Institute of Bioinformatics, Lausanne, Switzerland. [2] Department of Medical Genetics, University of Lausanne, Lausanne, Switzerland. 172. Department of Prosthetic Dentistry, Gerostomatology and Dental Materials, University Medicine Greifswald, Greifswald, Germany. 173. Biological Psychology, VU University Amsterdam, Amsterdam, the Netherlands. 174. 1] Institute of Social and Preventive Medicine (IUMSP), Centre Hospitalier Universitaire Vaudois (CHUV) and University of Lausanne, Lausanne, Switzerland. [2] Ministry of Health, Victoria, Republic of Seychelles. 175. Laboratory Medicine, Hospital of Desio, Department of Health Sciences, University of Milano, Bicocca, Italy. 176. Clinical Pharmacology Unit, University of Cambridge, Addenbrooke's Hospital, Cambridge, UK. 177. 1] Clinical Pharmacology, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, UK. [2] Barts and The London Genome Centre, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, UK. 178. Center for Complex Disease Genomics, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA. 179. 1] Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, Tennessee, USA. [2] Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, USA. 180. 1] National Heart, Lung, and Blood Institute, Framingham Heart Study, Framingham, Massachusetts, USA. [2] Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts, USA. 181. Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK. 182. Division of Cardiovascular Epidemiology, Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden. 183. 1] Experimental Cardiology Laboratory, Division of Heart and Lungs, University Medical Center Utrecht, Utrecht, the Netherlands. [2] Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, the Netherlands. 184. Clinic of Cardiology, West German Heart Centre, University Hospital Essen, Essen, Germany. 185. 1] National Institute for Health and Welfare, Helsinki, Finland. [2] Department of General Practice and Primary Health Care, University of Helsinki, Helsinki, Finland. [3] Unit of General Practice, Helsinki University Central Hospital, Helsinki, Finland. 186. 1] Department of Internal Medicine, University of Pisa, Pisa, Italy. [2] National Research Council (CNR) Institute of Clinical Physiology, University of Pisa, Pisa, Italy. 187. Department of Cardiology, Toulouse University School of Medicine, Rangueil Hospital, Toulouse, France. 188. Robertson Center for Biostatistics, University of Glasgow, Glasgow, UK. 189. NorthShore University HealthSystem, University of Chicago, Evanston, Illinois, USA. 190. Service of Therapeutic Education for Diabetes, Obesity and Chronic Diseases, Geneva University Hospital, Geneva, Switzerland. 191. Department of Medicine, Pharmacology, Pathology, Microbiology and Immunology, Vanderbilt University School of Medicine, Nashville, Tennessee, USA. 192. Leeds MRC Medical Bioinformatics Centre, University of Leeds, Leeds, UK. 193. Institute of Biomedical and Clinical Science, University of Exeter, Exeter, UK. 194. Department of Psychiatry, Washington University School of Medicine, St. Louis, Missouri, USA. 195. Center for Biomedicine, European Academy Bozen, Bolzano (EURAC), Bolzano, Italy (affiliated institute of the University of Lübeck, Lübeck, Germany). 196. Division of Genomic Medicine, National Human Genome Research Institute, US National Institutes of Health, Bethesda, Maryland, USA. 197. Institute of Cardiovascular Science, University College London, London, UK. 198. 1] Netherlands Consortium for Healthy Aging (NCHA), Rotterdam, the Netherlands. [2] Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands. 199. Department of Vascular Medicine, Academic Medical Center, Amsterdam, the Netherlands. 200. Centre for Cardiovascular Genetics, Institute of Cardiovascular Sciences, University College London, London, UK. 201. Cardiovascular Genetics Division, Department of Internal Medicine, University of Utah, Salt Lake City, Utah, USA. 202. 1] School of Population Health, University of South Australia, Adelaide, South Australia, Australia. [2] Sansom Institute for Health Research, University of South Australia, Adelaide, South Australia, Australia. [3] South Australian Health and Medical Research Institute, Adelaide, South Australia, Australia. [4] Centre for Paediatric Epidemiology and Biostatistics, University College London Institute of Child Health, London, UK. 203. 1] Division of Cancer Epidemiology and Genetics, National Cancer Institute, US National Institutes of Health, Bethesda, Maryland, USA. [2] Core Genotyping Facility, SAIC-Frederick, Inc., NCI-Frederick, Frederick, Maryland, USA. 204. 1] Department of Epidemiology and Biostatistics, Imperial College London, London, UK. [2] National Institute for Health and Welfare, Oulu, Finland. [3] MRC Health Protection Agency (HPA) Centre for Environment and Health, School of Public Health, Imperial College London, London, UK. [4] Unit of Primary Care, Oulu University Hospital, Oulu, Finland. [5] Biocenter Oulu, University of Oulu, Oulu, Finland. [6] Institute of Health Sciences, University of Oulu, Oulu, Finland. 205. 1] National Institute for Health and Welfare, Helsinki, Finland. [2] Institute for Molecular Medicine, University of Helsinki, Helsinki, Finland. [3] Hjelt Institute Department of Public Health, University of Helsinki, Helsinki, Finland. 206. 1] Netherlands Consortium for Healthy Aging (NCHA), Rotterdam, the Netherlands. [2] Department of Forensic Molecular Biology, Erasmus Medical Center, Rotterdam, the Netherlands. 207. UK Clinical Research Collaboration Centre of Excellence for Public Health (Northern Ireland), Queens University of Belfast, Belfast, UK. 208. 1] Faculty of Medicine, Institute of Health Sciences, University of Oulu, Oulu, Finland. [2] Unit of General Practice, Oulu University Hospital, Oulu, Finland. 209. 1] Department for Health Evidence, Radboud University Medical Centre, Nijmegen, the Netherlands. [2] Department of Urology, Radboud University Medical Centre, Nijmegen, the Netherlands. 210. 1] Ealing Hospital National Health Service (NHS) Trust, Middlesex, UK. [2] Imperial College Healthcare NHS Trust, London, UK. [3] National Heart and Lung Institute, Imperial College London, London, UK. 211. Department of Epidemiology and Public Health, University College London, London, UK. 212. Department of Medicine, Kuopio University Hospital and University of Eastern Finland, Kuopio, Finland. 213. 1] Kuopio Research Institute of Exercise Medicine, Kuopio, Finland. [2] Department of Physiology, Institute of Biomedicine, University of Eastern Finland, Kuopio Campus, Kuopio, Finland. [3] Department of Clinical Physiology and Nuclear Medicine, Kuopio University Hospital and University of Eastern Finland, Kuopio, Finland. 214. 1] Medical Research Council (MRC) Epidemiology Unit, University of Cambridge, Institute of Metabolic Science, Addenbrooke's Hospital, Cambridge, UK. [2] Department of Epidemiology and Public Health, University College London, London, UK. 215. Department of Clinical Chemistry, Fimlab Laboratories and School of Medicine, University of Tampere, Tampere, Finland. 216. 1] Department of Health Sciences, University of Milano, Milan, Italy. [2] Fondazione Filarete, Milan, Italy. 217. 1] Division of Nephrology and Dialysis, San Raffaele Scientific Institute, Milan, Italy. [2] Università Vita-Salute San Raffaele, Milan, Italy. 218. 1] Institut Universitaire de Cardiologie et de Pneumologie de Québec, Faculty of Medicine, Laval University, Quebec City, Quebec, Canada. [2] Institute of Nutrition and Functional Foods, Laval University, Quebec City, Quebec, Canada. 219. Department of Genetics, Rutgers University, Piscataway, New Jersey, USA. 220. Department of Biostatistics, University of Washington, Seattle, Washington, USA. 221. DZHK (Deutsches Forschungszentrum für Herz-Kreislauferkrankungen-German Research Centre for Cardiovascular Research), Munich Heart Alliance, Munich, Germany. 222. Department of Surgery, University Medical Center Utrecht, Utrecht, the Netherlands. 223. 1] Estonian Genome Center, University of Tartu, Tartu, Estonia. [2] Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK. [3] Department of Biostatistics, University of Liverpool, Liverpool, UK. 224. Department of Pediatrics, University of Iowa, Iowa City, Iowa, USA. 225. 1] Medical Research Council (MRC) Epidemiology Unit, University of Cambridge, Institute of Metabolic Science, Addenbrooke's Hospital, Cambridge, UK. [2] MRC Unit for Lifelong Health and Ageing at University College London, London, UK. 226. 1] Research Unit of Molecular Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany. [2] DZHK (Deutsches Forschungszentrum für Herz-Kreislauferkrankungen-German Research Centre for Cardiovascular Research), Munich Heart Alliance, Munich, Germany. [3] Institute of Epidemiology II, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany. 227. 1] Center for Biomedicine, European Academy Bozen, Bolzano (EURAC), Bolzano, Italy (affiliated institute of the University of Lübeck, Lübeck, Germany). [2] Department of Neurology, General Central Hospital, Bolzano, Italy. 228. 1] Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA. [2] Department of Nutrition, Harvard School of Public Health, Harvard University, Boston, Massachusetts, USA. 229. 1] Department of Clinical Physiology and Nuclear Medicine, Turku University Hospital, Turku, Finland. [2] Research Centre of Applied and Preventive Cardiovascular Medicine, University of Turku, Turku, Finland. 230. Human Genomics Laboratory, Pennington Biomedical Research Center, Baton Rouge, Louisiana, USA. 231. 1] Department of Genetics, Washington University School of Medicine, St. Louis, Missouri, USA. [2] Division of Biostatistics, Washington University School of Medicine, St. Louis, Missouri, USA. [3] Department of Psychiatry, Washington University School of Medicine, St. Louis, Missouri, USA. 232. 1] Division of Biostatistics, Washington University School of Medicine, St. Louis, Missouri, USA. [2] Department of Psychiatry, Washington University School of Medicine, St. Louis, Missouri, USA. 233. Center for Systems Genomics, Pennsylvania State University, University Park, Pennsylvania, USA. 234. 1] Centre for Population Health Sciences, University of Edinburgh, Edinburgh, UK. [2] Croatian Centre for Global Health, Faculty of Medicine, University of Split, Split, Croatia. 235. 1] Department of Cardiovascular Sciences, University of Leicester, Glenfield Hospital, Leicester, UK. [2] National Institute for Health Research (NIHR) Leicester Cardiovascular Biomedical Research Unit, Glenfield Hospital, Leicester, UK. 236. South Carelia Central Hospital, Lappeenranta, Finland. 237. 1] Department of Medicine III, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany. [2] Paul Langerhans Institute Dresden, German Center for Diabetes Research (DZD), Dresden, Germany. 238. Institute of Health Sciences, University of Oulu, Oulu, Finland. 239. International Centre for Circulatory Health, Imperial College London, London, UK. 240. 1] Program for Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, Maryland, USA. [2] Division of Endocrinology, Diabetes and Nutrition, University of Maryland School of Medicine, Baltimore, Maryland, USA. [3] Geriatric Research and Education Clinical Center, Vetrans Administration Medical Center, Baltimore, Maryland, USA. 241. Helsinki University Central Hospital Heart and Lung Center, Department of Medicine, Helsinki University Central Hospital, Helsinki, Finland. 242. 1] Montreal Heart Institute, Montreal, Quebec, Canada. [2] Montreal Heart Institute, Université de Montréal, Montreal, Quebec, Canada. 243. Department of Kinesiology, Laval University, Quebec City, Quebec, Canada. 244. Dipartimento di Scienze Farmacologiche e Biomolecolari, Università di Milano and Centro Cardiologico Monzino, Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS), Milan, Italy. 245. 1] Institute of Nutrition and Functional Foods, Laval University, Quebec City, Quebec, Canada. [2] Department of Food Science and Nutrition, Laval University, Quebec City, Quebec, Canada. 246. Institut Pasteur de Lille, INSERM U744, Université de Lille 2, Lille, France. 247. 1] Institute of Cardiovascular Science, University College London, London, UK. [2] Department of Cardiology, Division of Heart and Lungs, University Medical Center Utrecht, Utrecht, the Netherlands. [3] Durrer Center for Cardiogenetic Research, Interuniversity Cardiology Institute Netherlands-Netherlands Heart Institute, Utrecht, the Netherlands. 248. 1] Division of Endocrinology, Diabetes and Metabolism, Ulm University Medical Centre, Ulm, Germany. [2] Lee Kong Chian School of Medicine, Imperial College London and Nanyang Technological University, Singapore. 249. Health Science Center at Houston, University of Texas, Houston, Texas, USA. 250. 1] Ealing Hospital National Health Service (NHS) Trust, Middlesex, UK. [2] Department of Epidemiology and Biostatistics, Imperial College London, London, UK. [3] Imperial College Healthcare NHS Trust, London, UK. 251. Division of Cancer Epidemiology and Genetics, National Cancer Institute, US National Institutes of Health, Bethesda, Maryland, USA. 252. 1] Department of Medical Genetics, University Medical Center Utrecht, Utrecht, the Netherlands. [2] Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA. [3] Department of Epidemiology, University Medical Center Utrecht, Utrecht, the Netherlands. 253. 1] Department of Nutrition, Harvard School of Public Health, Harvard University, Boston, Massachusetts, USA. [2] Department of Clinical Sciences, Genetic and Molecular Epidemiology Unit, Lund University Diabetes Center, Skåne University Hospital, Malmö, Sweden. [3] Department of Public Health and Clinical Medicine, Unit of Medicine, Umeå University, Umeå, Sweden. 254. 1] Department of Genomics of Common Disease, School of Public Health, Imperial College London, Hammersmith Hospital, London, UK. [2] CNRS UMR 8199, Lille, France. [3] European Genomic Institute for Diabetes, Lille, France. [4] Université de Lille 2, Lille, France. 255. 1] Institute for Molecular Medicine, University of Helsinki, Helsinki, Finland. [2] Lund University Diabetes Centre, Lund University, Malmö, Sweden. [3] Diabetes and Endocrinology Unit, Department of Clinical Science, Lund University, Malmö, Sweden. 256. 1] PathWest Laboratory Medicine of Western Australia, Nedlands, Western Australia, Australia. [2] Pathology and Laboratory Medicine, University of Western Australia, Perth, Western Australia, Australia. [3] School of Population Health, University of South Australia, Adelaide, South Australia, Australia. [4] Sansom Institute for Health Research, University of South Australia, Adelaide, South Australia, Australia. 257. 1] Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA. [2] Department of Nutrition, Harvard School of Public Health, Harvard University, Boston, Massachusetts, USA. [3] Department of Epidemiology, Harvard School of Public Health, Harvard University, Boston, Massachusetts, USA. 258. 1] Department of Cardiology, Leiden University Medical Center, Leiden, the Netherlands. [2] Durrer Center for Cardiogenetic Research, Interuniversity Cardiology Institute Netherlands-Netherlands Heart Institute, Utrecht, the Netherlands. [3] Interuniversity Cardiology Institute of the Netherlands (ICIN), Utrecht, the Netherlands. 259. Department of Epidemiology and Population Health, Albert Einstein College of Medicine. Belfer, New York, USA. 260. Center for Human Genetics, Division of Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, North Carolina, USA. 261. 1] Vth Department of Medicine (Nephrology, Hypertensiology, Endocrinology, Diabetology, Rheumatology), Medical Faculty of Mannheim, University of Heidelberg, Heidelberg, Germany. [2] Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz, Austria. [3] Synlab Academy, Synlab Services, Mannheim, Germany. 262. 1] Department of Epidemiology Research, Statens Serum Institut, Copenhagen, Denmark. [2] Department of Medicine, Stanford University School of Medicine, Stanford, California, USA. 263. 1] Genetic Epidemiology Unit, Department of Epidemiology, Erasmus University Medical Center, Rotterdam, the Netherlands. [2] Center for Medical Systems Biology, Leiden, the Netherlands. [3] Department of Clinical Genetics, Erasmus University Medical Center, Rotterdam, the Netherlands. 264. 1] Estonian Genome Center, University of Tartu, Tartu, Estonia. [2] National Institute for Health and Welfare, Helsinki, Finland. [3] Institute for Molecular Medicine, University of Helsinki, Helsinki, Finland. 265. 1] Institute of Nutrition and Functional Foods, Laval University, Quebec City, Quebec, Canada. [2] Department of Kinesiology, Laval University, Quebec City, Quebec, Canada. 266. Centre for Paediatric Epidemiology and Biostatistics, University College London Institute of Child Health, London, UK. 267. 1] Kuopio Research Institute of Exercise Medicine, Kuopio, Finland. [2] Department of Clinical Physiology and Nuclear Medicine, Kuopio University Hospital and University of Eastern Finland, Kuopio, Finland. 268. 1] Division of Preventive Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA. [2] Harvard Medical School, Boston, Massachusetts, USA. 269. Institute for Translational Genomics and Population Sciences, Los Angeles BioMedical Research Institute at Harbor-University of California, Los Angeles Medical Center, Torrance, California, USA. 270. 1] Finnish Diabetes Association, Tampere, Finland. [2] Pirkanmaa Hospital District, Tampere, Finland. 271. 1] Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK. [2] Center for Non-Communicable Diseases, Karatchi, Pakistan. [3] Department of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA. 272. Laboratory of Genetics, National Institute on Aging, Baltimore, Maryland, USA. 273. 1] Institute of Genetic Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany. [2] Chair of Genetic Epidemiology, Institute of Medical Informatics, Biometry and Epidemiology, Ludwig Maximilians Universität, Neuherberg, Germany. 274. 1] National Institute for Health and Welfare, Helsinki, Finland. [2] Instituto de Investigacion Sanitaria del Hospital Universario La Paz (IdiPAZ), Madrid, Spain. [3] Diabetes Research Group, King Abdulaziz University, Jeddah, Saudi Arabia. [4] Centre for Vascular Prevention, Danube University Krems, Krems, Austria. 275. 1] Department of Public Health and Clinical Nutrition, University of Eastern Finland, Kuopio, Finland. [2] Research Unit, Kuopio University Hospital, Kuopio, Finland. 276. 1] Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands. [2] Department of Cardiology, University Medical Center Groningen, University of Groningen, Groningen, the Netherlands. [3] Durrer Center for Cardiogenetic Research, Interuniversity Cardiology Institute Netherlands-Netherlands Heart Institute, Utrecht, the Netherlands. 277. Institute of Cellular Medicine, Newcastle University, Newcastle, UK. 278. 1] Chair of Epidemiology, Institute of Medical Informatics, Biometry and Epidemiology, Ludwig Maximilians Universität, Munich, Germany. [2] Klinikum Großhadern, Munich, Germany. [3] Institute of Epidemiology I, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany. 279. Department of Pulmonology, University Medical Center Utrecht, Utrecht, the Netherlands. 280. 1] Wellcome Trust Sanger Institute, Hinxton, UK. [2] William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, UK. [3] King Abdulaziz University, Jeddah, Saudi Arabia. 281. 1] Department of Genetic Epidemiology, Institute of Epidemiology and Preventive Medicine, University of Regensburg, Regensburg, Germany. [2] Institute of Genetic Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany. 282. 1] Broad Institute of MIT and Harvard University, Cambridge, Massachusetts, USA. [2] Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK. 283. 1] Department of Internal Medicine, Division of Gastroenterology, University of Michigan, Ann Arbor, Michigan, USA. [2] Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA. 284. 1] deCODE Genetics, Amgen, Inc., Reykjavik, Iceland. [2] Faculty of Medicine, University of Iceland, Reykjavik, Iceland. 285. 1] Wellcome Trust Sanger Institute, Hinxton, UK. [2] University of Cambridge Metabolic Research Laboratories, Institute of Metabolic Science, Addenbrooke's Hospital, Cambridge, UK. [3] NIHR Cambridge Biomedical Research Centre, Institute of Metabolic Science, Addenbrooke's Hospital, Cambridge, UK. 286. National Heart, Lung, and Blood Institute, Framingham Heart Study, Framingham, Massachusetts, USA. 287. 1] Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA. [2] Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA. 288. Division of Population Health Sciences and Education, St George's, University of London, London, UK. 289. 1] Swiss Institute of Bioinformatics, Lausanne, Switzerland. [2] Department of Medical Genetics, University of Lausanne, Lausanne, Switzerland. [3] Service of Medical Genetics, CHUV University Hospital, Lausanne, Switzerland. 290. 1] Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK. [2] Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Oxford, UK. [3] Oxford NIHR Biomedical Research Centre, Oxford University Hospitals NHS Trust, Oxford, UK. 291. 1] Genetic Epidemiology Unit, Department of Epidemiology, Erasmus University Medical Center, Rotterdam, the Netherlands. [2] Netherlands Consortium for Healthy Aging (NCHA), Rotterdam, the Netherlands. [3] Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands. [4] Center for Medical Systems Biology, Leiden, the Netherlands. 292. 1] Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA. [2] Department of Internal Medicine, Division of Cardiovascular Medicine, University of Michigan, Ann Arbor, Michigan, USA. [3] Department of Human Genetics, University of Michigan, Ann Arbor, Michigan, USA. 293. 1] Broad Institute of MIT and Harvard University, Cambridge, Massachusetts, USA. [2] Department of Epidemiology, Harvard School of Public Health, Harvard University, Boston, Massachusetts, USA. [3] Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA. 294. 1] Medical Research Council (MRC) Epidemiology Unit, University of Cambridge, Institute of Metabolic Science, Addenbrooke's Hospital, Cambridge, UK. [2] Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA. [3] Genetics of Obesity and Related Metabolic Traits Program, Icahn School of Medicine at Mount Sinai, New York, New York, USA. [4] Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, New York, USA. 295. 1] Science for Life Laboratory, Uppsala University, Uppsala, Sweden. [2] Department of Medical Sciences, Molecular Epidemiology, Uppsala University, Uppsala, Sweden. [3] Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK. 296. 1] Program for Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, Maryland, USA. [2] Division of Endocrinology, Diabetes and Nutrition, University of Maryland School of Medicine, Baltimore, Maryland, USA. 297. 1] Biosciences Research Division, Department of Primary Industries, Melbourne, Victoria, Australia. [2] Department of Food and Agricultural Systems, University of Melbourne, Melbourne, Victoria, Australia. 298. 1] Division of Endocrinology, Genetics and Basic and Translational Obesity Research, Boston Children's Hospital, Boston, Massachusetts, USA. [2] Broad Institute of MIT and Harvard University, Cambridge, Massachusetts, USA. [3] Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA.
Abstract
Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated ∼2,000, ∼3,700 and ∼9,500 SNPs explained ∼21%, ∼24% and ∼29% of phenotypic variance. Furthermore, all common variants together captured 60% of heritability. The 697 variants clustered in 423 loci were enriched for genes, pathways and tissue types known to be involved in growth and together implicated genes and pathways not highlighted in earlier efforts, such as signaling by fibroblast growth factors, WNT/β-catenin and chondroitin sulfate-related genes. We identified several genes and pathways not previously connected with human skeletal growth, including mTOR, osteoglycin and binding of hyaluronic acid. Our results indicate a genetic architecture for human height that is characterized by a very large but finite number (thousands) of causal variants.
Using genome-wide data from 253,288 individuals, we identified 697 variants at genome-wide significance that together explained one-fifth of the heritability for adult height. By testing different numbers of variants in independent studies, we show that the most strongly associated ∼2,000, ∼3,700 and ∼9,500 SNPs explained ∼21%, ∼24% and ∼29% of phenotypic variance. Furthermore, all common variants together captured 60% of heritability. The 697 variants clustered in 423 loci were enriched for genes, pathways and tissue types known to be involved in growth and together implicated genes and pathways not highlighted in earlier efforts, such as signaling by fibroblast growth factors, WNT/β-catenin and chondroitin sulfate-related genes. We identified several genes and pathways not previously connected with human skeletal growth, including mTOR, osteoglycin and binding of hyaluronic acid. Our results indicate a genetic architecture for human height that is characterized by a very large but finite number (thousands) of causal variants.
Height is a classical polygenic trait that has provided general insights into the genetic architecture of common human traits and diseases, and into the prospects and challenges of different methods used to identify genetic risk factors. Studies consistently estimate that the additive genetic contribution to normal variation in adult height (“narrow sense heritability”) is approximately 80%[1-3]. Previous analysis of genome-wide association studies (GWAS) of adult height showed that common variants together account for 50% of this heritable contribution to height variation[4,5]. The most recent GWAS of adult height identified 180 loci, which together highlighted many genes relevant to human skeletal growth that had not been implicated in previous studies[6]. Common variants in these loci, however, only accounted for 10% of the phenotypic variation (~12% of heritability). Here, we report results from a GWAS meta-analysis of adult height in 253,288 individuals of European ancestry. We show that additive contributions of fewer than 10,000 SNPs (at P<5×10−3) can account for 36% of the heritability of adult height. Variants reaching genome-wide significance (P<5×10−8) in this larger study (697 SNPs) clustered in loci, were substantially enriched for regulatory variants, and implicated multiple known and previously unknown genes and pathways relevant to growth. More broadly, our results provide evidence that increasing GWAS sample sizes to the order of 100,000s, now plausible for many common traits, will likely continue to identify the variants and loci that close the “missing heritability” gap, whilst improving knowledge of the biology of those traits.
Results
The overall analysis strategy is illustrated in Supplementary Figure 1. We first performed a GWAS meta-analysis of adult height using summary statistics from 79 studies consisting of 253,288 individuals of European ancestry (Online Methods). We identified 697 SNPs that reached genome-wide significance (P<5×10−8) using an approximate conditional and joint multiple-SNP (COJO) analysis[7] in GCTA[8] (Online Methods) which takes linkage disequilibrium (LD) between SNPs into account (Supplementary Table 1; Supplementary Figs. 2-3). The 697 SNPs clustered in 423 loci, with a locus defined as one or multiple jointly associated SNPs located within ±1Mb of each other. Most of these 697 SNPs are uncorrelated although those in close physical proximity (e.g. < 1Mb) may be in partial LD (see Supplementary Table 1 for LD between adjacent pairs of the 697 SNPs). The clustering of signals was non-random (empirical enrichment of 1.4 fold, P<1×10−4) with 90, 26 and 31 loci containing 2, 3 and ≥ 4 signals respectively, (Supplementary Note and Supplementary Tables 1 and 2). We observed strong evidence of clustering of association signals within loci across a range of locus sizes, from 100kb to 1.25Mb, but the clustering was almost entirely driven by variants within 250kb of index SNPs (Supplementary Note and Supplementary Table 2). As shown in Figure 1 and Supplementary Figure 4, in some loci, multiple signals cluster tightly around a single gene, whereas in other cases, the clustering of associated variants is likely due to multiple different height-related genes being in close proximity.
Figure 1
Regional association plots for loci with multiple association signals
Panels a to d highlight examples of multiple signals after approximate conditional joint multiple-SNP analysis GCTA-COJO analysis. SNPs are shaded and shaped based on the index SNP with which they are in strongest LD (r2>0.4). Panels a-c show the majority of signals clustering in and around a single gene (ACAN, ADAMTS17, PTCH1, respectively) whereas panel d shows the multiple signals clustering through proximity.
Of the 697 SNPs, 403 were represented on the Metabochip array[9]. Using data from 80,067 individuals genotyped on the Metabochip array from 37 independent studies, we observed very strong evidence of concordance of effect sizes between the Metabochip and GWAS samples (P = 1.9×10−160); and >99% of variants were directionally consistent between Metabochip and GWAS (Online Methods, Supplementary Note, and Supplementary Table 3).We observed a large genome-wide ‘inflation’ factor of the test statistic for association even after we corrected each study’s test statistics by its individual inflation factor (single λGC = 1.94). At least two phenomena could have contributed to this observation. First, as described previously[10], highly polygenic models of inheritance are expected to increase the genomic inflation factor to levels comparable to what we observe. Second, height is particularly susceptible to confounding by population ancestry (stratification), which can also lead to inflation of the test statistics. We addressed these possibilities by comparing our results with those obtained using more stringent corrections for stratification (linear mixed models), and with results obtained in subsets of studies in which a purely family-based analysis was feasible, and by performing a within-family prediction analysis which partitioned the variance in the genetic predictor into the contributions of true associations and population stratification.Our linear mixed model (LMM) analyses, performed in a subset of 15 individual studies comprising 59,380 individuals, provided strong evidence that the inflated statistics were driven predominantly by the highly polygenic nature of the trait. This approach utilizes a genomic relationship matrix (GRM) calculated through genome-wide SNP data to correct for distant relatedness between all pairs of individuals within a study. This resulted in a single λGC of 1.20. This value was entirely consistent with the single λGC of 1.20 obtained from the standard GWAS analysis of the same individuals and a single λGC of 1.94 obtained from the full 253,288 individuals (Supplementary Table 4). Because this approach may be overly conservative for a strongly genetic and highly polygenic trait, each study additionally repeated the analyses for each chromosome using a GRM generated from the remaining 21 chromosomes, or in the case of the largest study (WGHS) repeating the analysis for all odd numbered chromosomes using a GRM generated from the even numbered chromosomes and vice versa. The single λGC inflation factor for this analysis, 1.23, was also entirely consistent with the standard GWAS results (Online Methods, Supplementary Note, and Supplementary Table 4).Our family based analyses also provided strong evidence that the inflated statistics are driven predominantly by the highly polygenic nature of height. We assessed whether variants that reached genome-wide significance after single GC correction replicated in family-based analyses of up to 25,849 samples (effective sample size 14,963, using methods that are immune to stratification (Online Methods, Supplementary Note, and Supplementary Tables 5 and 6). We identified genome-wide significant associations from a meta-analysis that excluded the family-based samples, and tested these associations for replication in the family-based samples; a lower rate of replication than expected could be due to inflation of effect sizes in the discovery sample from the “winner’s curse” and/or stratification. Of 416 genome-wide significant SNPs representing multiple signals selected after exclusion of family-based studies, 371 SNPs had a consistent direction of effect (compared with 208 expected by chance, and 400 expected in the absence of any inflation of estimated effect sizes), and 142 replicated with P<0.05 (compared with 21 expected by chance, and 210 expected in the absence of effect size inflation; Supplementary Table 5). These analyses (particularly the directional consistency) shows that most of the loci represent true associations, but also shows that there is a modest inflation in the effect size estimates, due to stratification and/or the winner’s curse. To distinguish between these possibilities, we repeated this analysis, substituting for the family-based samples a random set of studies with similar total effective sample size. The number of replicating loci was only slightly lower in the family-based cohorts than in the random samples (Supplementary Table 5, 12-17 fewer replications attributable to stratification at different P-value thresholds). This indicates that most of the modest inflation in effect estimates is due to the winner’s curse, that a small amount of inflation is due to residual stratification, and that few (upper limit ~15-25; Supplementary Note and Supplementary Table 5) if any of the loci that reach genome-wide significance after single GC correction are likely to be complete false positives due to stratification (that is, no real association whatsoever with height).
Variance explained by SNPs at different significance levels
Having established that single GC correction is sufficient to identify SNPs that are likely to be truly associated with height, we next performed a series of analyses using GWAS data from five independent validation studies to quantify the fraction of phenotypic variance explained by SNPs selected from the GCTA-COJO analyses[7] of the meta-analysis data, which excluded data from the validation studies, at a range of statistical thresholds, and to quantify the accuracy of predicting height using these selected SNPs (Online Methods). We first developed a new method that uses within-family prediction to partition the variance of the SNP-based predictor into components due to real SNP effects, errors in estimating SNP effects, and population stratification (Online Methods), and applied the method to data on full-sib pairs from three of the five validation studies (Online Methods). Consistently across the three studies, all the partitioned variance components increased as a less stringent significance level was used for SNP selection in the discovery sample and the error variance increased more dramatically than the genetic variance when more SNPs selected at a less significance level were included in the predictor (Fig. 2a-c). We demonstrated the partitioning of variance due to population stratification by the within-family prediction analyses with and without adjusting for principal components (PCs) (Supplementary Fig. 5). The results again confirmed that the impact of population stratification on the top associated SNPs was minor and demonstrated that the variation in the predictor due to true SNP effect, estimation error and population stratification was quantifiable. We next inferred, using these partitioned variance components from the within-family prediction analysis, how well different selected sets of SNPs would predict height in independent samples. We showed that the observed prediction accuracy (squared correlation between phenotype and predictor, R2) in five different population-based cohorts was highly consistent with the values inferred from the within-family based analyses, with prediction accuracy peaking at ~17% using the ~1,900 SNPs reaching P<5×10−5 (Fig. 2d). Finally we estimated variance explained by the selected SNPs in population-based studies using the GCTA-GREML method[4,8] (Fig. 2e). The results showed that ~670 SNPs at P<5×10−8 and ~9,500 SNPs at P<5×10−3 captured ~16% and ~29% of phenotypic variance respectively (Table 1), which was also consistent with the estimates inferred from the within-family prediction analysis. As shown in equation [19], prediction R2 is not equal to the variance explained but a function of the variance of true SNP effects and the error variance in estimating SNP effects, in the absence of population structure. This is demonstrated in Figure 2, where at thresholds below genome-wide significance, variance explained is higher than the prediction accuracy, because the latter is deflated both by imprecise estimates of effect sizes (estimation errors) and by inclusion of SNPs that are not associated with height. The estimate of variance explained by all the HapMap3 (ref. 11) SNPs without SNP selection was ~50% (Table 1), consistent with previous estimates[4,5]. Thus, a group of ~9,500 SNPs (representing <1% of common SNPs) selected at P<5×10−3, explained ~29% of phenotypic variance. Since ~50% of phenotypic variance is explained by all common SNPs, the selected set of SNPs, despite being limited to <1% of common SNPs, accounts for the majority of variance attributable to all common SNPs (29/50 ~ 60%). This set of ~9,500 SNPs strongly clustered with the newly established height loci: 1,704 (19%) variants were located within 250kb of one of the 697 genome-wide associated SNPs, suggesting that a substantial fraction of “missing heritability” is within already identified loci. This clustering of additional variants within identified loci was confirmed in a parallel analysis based on two left-out studies where we observed that SNPs in closer physical proximity with the top associated SNPs explained disproportionally more variance (Online Methods and Supplementary Fig. 6).
Figure 2
Quantifying the variance explained by height associated SNPs at different levels of significance
The SNPs were selected from the approximate conditional and joint multiple SNPs association analysis using GCTA-COJO analysis with the target cohort being excluded from the meta-analysis. a, b, c, Partitioning the variance in the SNP-derived genetic predictor using a within-family analysis. The SNP-based predictor was adjusted by the first 20 PCs. The four variance-covariance components Vg, Ve, Cg and Ce are defined in Online Methods. d, Accuracy of predicting phenotype by the genetic predictor in unrelated individuals. The prediction R2 shown on the y-axis is the squared correlation between phenotype and predictor. The SNP-based predictor was adjusted by the first 20 PCs. The solid line is the average prediction R2 weighted by sample size over the five cohorts. The dashed line is the prediction accuracy inferred from the within-family prediction analysis (Equation 19 in Online Methods). e, The variance explained by the SNPs was estimated by the whole-genome estimation method in GCTA. The phenotype was adjusted by the first 20 PCs. Each error bar represents the standard error of the estimate. The estimates from all the five cohorts (B-PROOF, FRAM, QIMR, TwinGene and WTCCC-T2D) were averaged by the inverse-variance approach. The dashed line is the variance explained inferred from the within-family prediction analysis. In panels d and e, the number shown in each column is the number of SNPs used in the analysis.
Table 1
Estimates of variance explained by SNPs selected at different significance levels
The SNPs were selected by an approximate conditional and joint multiple-SNP analysis (GCTA-COJO) of the summary statistics from the meta-analysis. The target cohort for variance estimation was excluded from the meta-analysis.
Threshold
QIMR (n = 3,924)
FRAM (n = 1,145)
TwinGene (n = 5,668)
WTCCC-T2D (n = 1,914)
B-PROOF (n = 2,555)
[a]Weighted average
[b]Pred.
#SNP
h2g
SE
#SNP
h2g
SE
#SNP
h2g
SE
#SNP
h2g
SE
#SNP
h2g
SE
h2g
SE
h2g
5E-8
675
0.164
0.016
656
0.190
0.040
670
0.159
0.013
679
0.143
0.025
691
0.152
0.021
0.159
0.008
0.149
5E-7
887
0.187
0.017
862
0.210
0.045
866
0.170
0.013
890
0.184
0.028
886
0.162
0.022
0.176
0.009
0.166
5E-6
1245
0.196
0.018
1202
0.207
0.050
1186
0.188
0.014
1256
0.201
0.030
1232
0.175
0.024
0.190
0.009
0.186
5E-5
1950
0.212
0.020
1891
0.183
0.060
1918
0.208
0.015
1985
0.208
0.037
1947
0.194
0.029
0.206
0.010
0.218
5E-4
3754
0.248
0.024
3671
0.239
0.080
3689
0.239
0.017
3771
0.201
0.047
3661
0.248
0.037
0.240
0.013
0.259
5E-3
9693
0.297
0.035
9403
0.171
0.126
9548
0.287
0.025
9677
0.267
0.070
9174
0.341
0.055
0.292
0.018
0.339
[c]HM3
1.08M
0.473
0.086
1.06M
0.313
0.291
1.12M
0.522
0.060
0.97M
0.534
0.170
1.09M
0.463
0.126
0.498
0.044
The estimates from all the five cohorts were averaged by the inverse-variance approach i.e. ;
the predicted variance explained by the selected SNPs (Vg) from the within-family prediction analysis;
SNPs from HapMap3 project[11].
Larger GWAS identifies new biologically relevant genes and pathways
Having shown that ~1% of variants can account for the majority of heritability attributable to common variation, we next considered whether the expanded set of height-associated variants could be used to identify the genomic features and biological pathways of most relevance to normal variation in adult height. To test whether our GWAS could implicate new biology, we used established and novel approaches to test whether the height-associated loci were enriched for functionally relevant variants, genes, pathways, and tissues.As with the 180 variants identified in our previous analysis, the 697 variants were non-randomly distributed with respect to functional and putatively functional regions of the genome (Online Methods). We observed that height associated variants were enriched for non-synonymous SNPs (nsSNPs) (empirical enrichment of 1.2 fold, P=0.02), cis-regulatory effects in blood (empirical enrichment of 1.5 fold, P=0.03), a curated list of genes that underlie monogenic syndromes of abnormal skeletal growth[12] (empirical enrichment 1.4 fold, P=0.013), associations with apparently unrelated complex traits in the NHGRI GWAS catalog (empirical enrichment 2.6 fold, P<1×10−4) and functional chromatin annotations in multiple tissues and cell types (empirical enrichment 1.8 fold, P<1×10−3) (Supplementary Note and Supplementary Tables 7-11).The greater resolution of height associated variants provided by increased sample size, combined with improved gene prioritization and gene set enrichment approaches, identified multiple new tissues, gene sets and specific genes that are highly likely to be involved in the biology of skeletal growth. Specifically, using a variety of established and novel pathway methods, we identified ~3 times as many enriched pathways and prioritized ~5 times as many genes (including genes newly prioritized in previously identified loci) compared to results derived from identical pathway methods to the previous GWAS of 133,000 individuals (Table 2).
Table 2
Comparison of prioritized variants, loci, biology and variance explained from GWASs on human stature with 130,000 individuals (previously published in Lango Allen et al., 2010) and with 250,000 individuals (this paper).
Height GWAS with 130,000 samples (Lango Allen et al., Yang et al)*
Height GWAS with 253,288 samples
SNP based comparisons
GWAS significant SNPS
199
697
Genomic loci[#] (+/− 1Mb)
180
423
Loci[#] with multiple signals
19
147
Secondary associations in loci[#]
19
273
Biological annotation (DEPICT at FDR < 0.05)
Prioritized genes
92
649
Loci[&] with prioritized gene
74 (43%)
422 (75%)
Pruned gene sets and protein-protein complexes[%]
813
2,330
Tissues and cell-types
5
43
Variance explained
GWAS significant SNPs
10%
16%
Deep list of SNPs at 1×10−3
13%
29%
All common SNPs
45%**
50%
Heritability explained
GWAS significant SNPs
12.5%
20%
Deep list of SNPs at 1×10−3
16%
36%
All common SNPs
56%**
62.5%
Counts, numbers and estimates for Lango Allen et al. are taken from respective publication.
Genomic loci defined by distance: +/− 1Mb from index height SNP
Genomic loci defined by LD: r2 > 0.5 from index height SNP
After clumping of similar gene sets and pathways
Yang et al. Nat Genet 42, 565-9 (2010).
We first focused on existing pathway and gene prioritization methods: (1) MAGENTA[13], a method designed to identify gene sets enriched in GWAS data, and (2) GRAIL[14], which uses published literature to highlight connections between likely relevant genes within GWAS loci. As expected, the GRAIL and MAGENTA analyses confirmed several previously identified gene sets and pathways clearly relevant to skeletal growth, but in the larger sample they also provided evidence for additional known and novel genes, gene sets and protein complexes not identified in our previous smaller study (for example, FGF signaling, WNT signaling, osteoglycin, and other genes related to bone or cartilage development) (Supplementary Tables 12-13 and Supplementary Fig. 7).To obtain more detailed insight into height biology, we applied DEPICT, a novel data-driven integrative method that uses gene sets reconstituted based on large scale expression data to prioritize genes and gene sets, and also to identify tissues enriched in highly expressed genes from associated loci (Pers et al. in preparation;
Online Methods and Supplementary Note). The DEPICT analysis highlighted 2,330 reconstituted gene sets (after pruning for high levels of redundancy). These gene sets both confirmed and extended the MAGENTA and GRAIL findings, and identified novel pathways not identified in our previous height GWAS (for example regulation of beta-catenin, biology related to glycosaminoglycans such as chondroitin sulfate and hyaluronic acid, and mTOR signaling) (Supplementary Table 14). Gene sets identified based on 327 strictly novel height variants (>1Mb from the 180 known variants loci) highly resembled gene sets highlighted by the already known 180 loci (Spearman’s rank correlation coefficient between gene set enrichment Z-scores r=0.91, P=2×10−16). Thus, the variants discovered through increased sample size continued to highlight specific and relevant growth-associated gene sets, while the combined analysis of both old and new loci provided the additional power needed to identify new gene sets (Table 3 and Supplementary Table 14).
Table 3
Significantly prioritized novel human growth associated genes
The table lists 20 genes prioritized by DEPICT. Genes are ranked by the number of lines of supporting evidence and the DEPICT P-value (Supplementary Table 16). Because 20 of the 30 top-ranked genes were in a curated list of genes known to cause syndromes of skeletal[12], these “OMIM genes” are not shown here. The top fifteen genes with prior literature support (based on GRAIL) are shown, followed by the top five novel genes. Each gene is accompanied by the significantly enriched reconstituted gene sets in which it appears in (DEPICT gene set enrichment analysis). Abbreviations; (GO – Gene Ontology; MP – Mice Phenotypes from Mouse Genome Informatics database; InWeb – protein-protein interaction complexes; KEGG and REACTOME databases).
Locus (height SNP)
Gene symbol
New locus
Prioritization P-value
Levels of biological annotation
Top ranking reconstituted gene sets
Genes with prior literature support (GRAIL)
rs10748128
FRS2
N
1.0×10−16
7
PI 3K cascade (REACTOME, P=6.2×10−13); Chronic Myeloid Leukemia (KEGG, P=1.6×10−12); Response To Fibroblast Growth Factor Stimulus (GO, P=5.4×10−11);
Short Mandible (MP, P=3.3×10−19); Respiratory System Development (GO, P=3.1×10−17); Abnormal Ulna Morphology (MP, P=1.9×10−15)
rs16860216
SOX8
N
0.016
7
Small Thoracic Cage (MP, P=6.9×10−14); Short Ribs (MP, P=2.7×10−8); Short Sternum (MP, P=6.5×10−7)
rs1199734
LATS2
Y
1.0×10−16
6
Partial Lethality Throughout Fetal Growth And Development (MP, P=1.2×10−18); Growth Factor Binding (GO, P=2.6×10−14); TGFB1 protein complex (InWeb, P=6.3×10−12)
rs12323101
PDS5B
N
1.0×10−16
6
Chromatin Binding (GO, P=6.4×10−17); Nuclear Hormone Receptor Binding (GO, P=2.4×10−12); RBBP4 protein complex (InWeb, P=1.3×10−11); WNT16 protein complex (InWeb, P=1.9×10−8)
rs6746356
SP3
Y
1.0×10−16
6
BCOR protein complex (InWeb, P=2.7×10−17); AFF2 protein complex (InWeb, P=4.5×10−7); Intracellular Steroid Hormone Receptor Signaling Pathway (GO, P=9.0×10−6)
Short Ulna (MP, P=4.7×10−13); Abnormal Joint Morphology (MP, P=8.6×10−11); Regulation Of Chondrocyte Differentiation (GO, P=2.9×10−9)
rs3790086
WWP2
Y
1.0×10−16
5
Cartilage Development (GO, P=2.0×10−19); Chondrocyte Differentiation (GO, P=3.0×10−15); Signaling By Platelet-Derived Growth Factor (REACTOME, P=4.8×10−10)
The DEPICT analysis also prioritized tissues and individual genes. We found that genes within associated height loci were enriched for expression in tissues related to chondrocytes (cartilage, joint capsule, synovial membrane, and joints; P<5.5×10−9, FDR<0.001), and other musculoskeletal, cardiovascular, and endocrine tissue-types (FDR<0.05) (Fig. 3; Supplementary Fig. 8; Supplementary Table 15). We also showed that a subset of the 697 height associated SNPs that represented lead cis-eQTLs in blood defined 75 genes that were collectively enriched for expression in cartilage (P=0.008) (Supplementary Note and Supplementary Table 8).
Figure 3
Tissue enrichment combined with pruned gene set network
Genes within genome-wide significant height associated loci enriched for several relevant tissue annotations as well as gene sets. a, Genes in associated loci tended to be highly expressed in tissues related to chondrocytes and osteoblasts (cartilage, joints, and spine), and other musculoskeletal, cardiovascular and endocrine tissue-types. The analysis was conducted based on the DEPICT method and 37,427 human microarray samples. Tissue annotations are sorted by physiological system and significance. Significantly enriched (FDR<0.05) tissues are color-coded in black. b, Significantly enriched reconstituted gene sets (P-value<1×10−11, FDR<1×10−5) identified by DEPICT. Nodes represent reconstituted gene sets and are color-coded by statistical significance. Edge thickness between nodes is proportional to degree of gene overlap as measured by the Jaccard index. Nodes with gene overlap greater than 25% were collapsed into single meta nodes and marked by blue borders. c, reconstituted gene sets comprised by the Chordate Embryonic Development meta node, which represented several gene sets relevant to human height (e.g. ossification, embryonic skeletal system development, and limb development).
We used DEPICT to prioritize 649 genes (at FDR<0.05) within height-associated loci (Table 3 and Supplementary Table 16). Of these 649 genes, 202 genes (31%) were either significant in the GRAIL analysis (Supplementary Tables 13 and 16) and/or overlapped with a list of abnormal skeletal growth syndromes that we assembled from the OMIM database[12] (n=40; Supplementary Tables 9 and 16). Many other newly prioritized genes had additional supporting evidence (Supplementary Table 16), including specific expression in the growth plate[12], and/or connections to relevant pathways (for example: GLI2 and LAMA5 [hedgehog signaling]; FRS2 [FGF signaling]; AXIN2, NFATC1, CTNNB1, FBXW11, WNT4, WNT5A and VANGL2 [WNT/beta-catenin signaling]; SMAD3 and MTOR [TGF-beta and/or mTOR signaling]; WWP2/miR140, IBSP, SHOX2 and SP3 [required in mice for proper bone and cartilage formation]; CHYS1, DSE and PCOLCE2 [glycosaminoglycan/collagen metabolism]; SCARA3, COPZ2, TBX18, CRISPLD1 and SLIT3 [differential expression in growth plate and predicted to be in highly relevant pathways]).DEPICT also prioritizes genes that are new candidates for playing a role in skeletal growth. The genes newly and strongly implicated in this study included not only genes with obvious relationships to skeletal biology, such as SOX5 and collagen genes, but also genes that have no clear published connection to skeletal growth, and likely represent as yet unknown biology (Table 3 and Supplementary Table 16). DEPICT strongly prioritized genes that do not have published annotations related to growth-related pathways but are predicted to be in gene sets that are both enriched in the associated loci and clearly connected to growth. These include genes newly predicted to be in pathways related to cartilage or bone development (FAM101A, CRISPLD1 and the noncoding RNA LINC00476), collagen or extracellular matrix (GLT8D2, CCDC3, and ZCCHC24), histone demethylation (ATAD2B and TSTD2) and other genes predicted to have skeletal phenotypes but not currently annotated as belonging to relevant pathways (ARSJ, PSKH1, COPZ2, ADAMTS17 and the microRNA cluster MIR17HG). Of note, mutations in both ADAMTS17 and MIR17HG have been identified as causes of syndromic short stature in humans[15,16].As suggested by the prioritization of ADAMTS17 and MIR17HG, it is possible that some of the newly highlighted genes may also underlie new syndromes of abnormal skeletal growth. As a further proof of principle, the second entry on our list of prioritized genes (Table 3 and Supplementary Table 16), CHSY1, was not a known monogenic gene in the OMIM database[12] when we assembled our list, but mutations in this gene have since been shown to cause a syndrome including brachydactyly and short stature[17,18]. Thus, the novel DEPICT method, applied to the larger GWAS data set, not only identified similar biology to GRAIL and MAGENTA but also implicated a large number of additional genes, gene sets and pathways that that are likely important in skeletal biology and human growth.
Discussion
By performing a large GWAS study on adult height, a highly heritable polygenic trait, we have provided answers to several current questions of relevance to the genetic study of polygenic diseases and traits. First, we showed that by conducting larger GWAS, we can identify SNPs that explain a substantial proportion of the heritability attributable to common variants. As hypothesized by Yang et al. (2010), the heritability directly accounted for by variants identified by GWAS and inferred by whole-genome estimation approaches are converging with increasing sample size. The variance explained by genome-wide significant SNPs has increased from 3-5% with discovery samples of ~25,000 (ref. 19) to 10% with a discovery sample size of ~130,000 (ref. 6) to 16% with a discovery sample size of 250,000 (this study), and the variance explained from all captured common SNPs is ~50%[4,5]. The variance explained by genome-wide significant SNPs on a chromosome is also proportional to its length, consistent with the conclusion made by Yang et al.[5] using all SNPs (Supplementary Fig. 9). Our new results show that ~21%, ~24% and ~29% of phenotypic variance in independent validation samples is captured by the best ~2,000, ~3,700 and ~9,500 SNPs respectively selected in the discovery samples (Table 1), and that the correlation between actual and predicted height in independent samples from the same population has increased to 0.41 (maximum prediction R2 = 0.412 = 0.17, Fig. 2d). The results are consistent with a genetic architecture for human height that is characterized by a very large but finite number (thousands) of causal variants, located throughout the genome but clustered in both a biological and genomic manner. Such a genetic architecture may be described as pseudo-infinitesimal, and may characterize many other polygenic traits and diseases. There is also strong evidence of multiple alleles at the same locus segregating in the population and for associated loci to overlap with Mendelian forms, suggesting a large but finite genomic mutational target for height, and effect sizes ranging from minute (<1mm; ~0.01 SDs) to gigantic (>300mm; >3 SDs, in the case of monogenic mutations).It has been argued that the biological information emerging from GWA studies will become less relevant as sample sizes increase, because as thousands of associated variants are discovered, the range of implicated genes and pathways will lose specificity and cover essentially the entire genome[20]. If this were the case, then increasing sample sizes would not help to prioritize follow up studies aimed at identifying and understanding new biology, and the associated loci would blanket the entire genome. Our study provides strong evidence to the contrary: the identification of many 100’s and even 1000’s of associated variants can continue to provide biologically relevant information. In other words, the variants identified in larger sample sizes both display a stronger enrichment of pathways clearly relevant to skeletal growth and prioritize many additional new and relevant genes. Furthermore, the associated variants are often non-randomly and tightly clustered (typically separated by <250 kb), resulting in the frequent presence of multiple associated variants in a locus. The observations that genes and especially pathways are now beginning to be implicated by multiple variants suggests that the larger set of results retain biological specificity but that at some point, a new set of associated variants will largely highlight the same genes, pathways and biological mechanisms as have already been seen. This endpoint (which we have not clearly reached for height) could be considered analogous to reaching “saturation” in model organism mutagenesis screens, where new alleles typically map to previously identified genes[21].We have identified a large number of gene sets and pathways that are enriched for associations with height. Although the number of gene sets and pathways is large, many are overlapping and likely represent multiple annotations of a much smaller set of core biological mechanisms. We also highlight individual genes within associated loci as being relevant to skeletal growth, including candidates for contributing to syndromes of abnormal skeletal growth; for example, we strongly implicated CHSY1, recently identified as an underlying cause of a monogenic syndrome with short stature and brachydactyly[17,18]. The lists of prioritized genes and pathways should therefore provide a rich trove of data for future studies of skeletal growth; to facilitate such studies, we have made our results (including genome-wide association results and complete list of highlighted genes and pathways) publicly available. Based on the results of large genetic studies of height, we anticipate that increasing the number of associated loci for other traits and diseases could yield similarly rich lists that would generate new biological hypotheses and motivate future research into the basis of human biology and disease.
ONLINE METHODS
Genome-wide association study meta-analysis
We combined height summary association statistics from 79 genome-wide association (GWA) studies in a meta-analysis of 253,288 individuals using the same methods and studies as previously described[6] and additional studies as described in Supplementary Tables 17-19. A total of 2,550,858 autosomal SNPs were meta-analyzed using inverse-variance fixed effects method using METAL[22].
GCTA-COJO: conditional and joint multiple SNPs analysis
We used GCTA-COJO analysis[7,8] to select the top associated SNPs. This method uses the summary statistics from the meta-analysis and LD correlations between SNPs estimated from a reference sample to perform a conditional association analysis[7]. The method starts with an initial model of the SNP that shows the strongest evidence of association across the whole genome. It then implements the association analysis conditioning on the selected SNP(s) to search for the top SNPs one-by-one iteratively via a stepwise model selection procedure until no SNP has a conditional P-value that passes the significance level. Finally, all the selected SNPs are fitted jointly in the model for effect size estimation. We used 6,654 unrelated individuals from the ARIC cohort as the reference sample for LD estimation. There were ~3.0M SNPs included in the original meta-analysis. We included in this analysis only the SNPs (~2.48M) on HapMap2 and with sample size > 50,000. We used the genome-wide significance level P<5×10−8 (as reported in Supplementary Table 1).
Metabochip replication
We combined height summary association statistics from 37 independent studies genotyped using Illumina’s Metabochip array[9] in a meta-analysis of 80,067 individuals of European ancestry (Supplementary Tables 20-22). Each study tested association between each genotyped SNP and the same QC procedures, height transformations, adjustment, and inheritance model as described for the GWA analysis. Genomic control correction was applied to results for each study prior to meta-analysis, using a set of 4,427 SNPs associated with QT interval to control study-specific inflation factors. We used the inverse-variance fixed effects meta-analysis method.
Validation – linear mixed model (LMM) based association analysis
Each of 15 studies (59,380 individuals) used genome-wide SNP information to calculate a genomic relationship matrix (GRM) for all pairs of individuals and used this to correct association statistics for cryptic relatedness and population stratification. Each study used a linear mixed model as implemented in the software EMMAX[23]. Meta-analysis was performed as described for the standard GWAS and using a single GC correction. Each study additionally repeated the analyses for each chromosome using a GRM generated from the remaining 21 chromosomes, or in the case of the largest study (WGHS) repeating the analysis for all odd numbered chromosomes using a GRM generated from the even numbered chromosomes and vice versa. Each study then combined association results from the 22 or 2 parts of the genome into one set of data and we repeated the single GC meta-analysis.
Validation – within family (transmission) association analyses
A pure transmission based analysis was performed in seven cohorts for SNPs representing 416 signals of association (Supplementary Note), selected after repeating meta-analysis excluding these studies, with single GC correction. Filtering of low imputation quality SNPs in the studies was followed by inverse variance method of meta-analysis of the family based results. Because of the presence of related individuals, family based studies have lower power at a given sample size. For each study, we calculated the effective sample size (the size of a sample of unrelated individuals that would have the equivalent power; see Supplementary Note and Winkler et al. [24]). Estimation of winner’s curse in our data set was performed by repeating the meta-analysis excluding either the family-based studies or excluding random sets of studies from GIANT matched by effective sample size to the family based studies. Independent genome-wide significant loci were selected from each meta-analysis. Power for replication in the excluded samples was estimated at different P-value thresholds and the deficit in replications (number of replications expected minus number observed) was calculated. The contribution of the winner’s curse to the deficit in replications was estimated as the average deficit across the three sets of random non-family-based cohorts. By subtracting this from the deficit observed for the family-based cohorts, we estimated the lack of replication that could be attributed to stratification (either inflation of effect size for true associations, or false positive associations).
Variance and heritability explained
We used GCTA-COJO analysis (Online Methods) to select the top associated SNPs at a range of stringent significance levels (5×10−3, 5×10−4, 5×10−5, …, 5×10−8) for estimation and prediction analyses. We then quantified the variance explained by those selected SNPs using a three-stage analysis, i.e. within-family prediction, GCTA-GREML analysis and population based prediction, in five validation studies (B-PROOF, FRAM, QIMR, TwinGene and WTCCC-T2D). To avoid sample overlap, we repeated the main GWAS meta-analysis and the multiple-SNP analysis five times, each time excluding one of the five validation studies. This approach ensured complete independence between data used to discover SNPs, and data used to estimate how much variance in height these SNPs explained and how well they predicted height. For the within-family prediction analyses, we selected 1,622, 2,758 and 1,597 pairs of full sibs from the QIMR, TwinGene and FRAM cohorts, respectively, with one sib pair per family. For the whole-genome estimation and prediction analyses, we used GCTA-GRM[8] to estimate the genetic relatedness between individuals and selected unrelated individuals with pairwise genetic relatedness <0.025 in each of the five studies, i.e. B-PROOF (n = 2,555), FRAM (n = 1,145), QIMR (n = 3,627), TwinGene (n = 5,668) and WTCCC-T2D (n = 1,914).
Within-family prediction analysis
We used the SNPs selected from GCTA-COJO analysis to create a genetic predictor (also called “genetic profile score”) for each of all the full sibs using PLINK[25]. We then adjusted the genetic predictor by the first 20 principal components (PCs) generated from the principal component analysis (PCA)[26]. By comparing the predictors within and between families, we partitioned the variance in the predictor analysis into components due to real SNP effects (Vg), errors in estimating SNP effects (Ve), and population structure (Cg + Ce), as described in the Online Methods below.We calculated the weighted average of each of the four (co)variance components over the three cohorts by their sample size, i.e. Σ (Vg()
n)/Σ (n) with the subscript i indicating the cohort and n being the sample size. From the results of these partitioning analyses within families we can infer what the prediction R2 (Equation 19 in Online Methods below) and what the proportion of variance explained by SNPs (i.e. Vg/VP with VP being the phenotypic variance) would be in a sample of unrelated individuals when using the same set of SNPs. We then tested these inferred values in unrelated samples.
GCTA-GREML analysis
We performed the GREML analysis[4] in GCTA[8] to estimate the variance explained by the selected SNPs (h2g) in each of the five validation studies. This method fits the effects of a set of SNPs simultaneously in a model as random effects and estimates the genetic variance captured by all the fitted SNPs without testing the significance of association of any single SNPs. We combined the estimates of h2g from the five studies by the inverse-variance approach, i.e. .
Population-based prediction analysis
We created a genetic predictor using the selected SNPs for the unrelated individuals in each of the five validation studies. We then calculated the squared correlation (R2) between phenotype and predictor in each validation study, and calculated the weighted average of the prediction R2 by the sample size across the five studies, i.e. .
Theory and method to partition the variance in a genetic predictor
Under the assumption of an additive genetic model, the phenotype of a quantitative trait can be written as
where y is the trait phenotype, g is the total genetic effect of all SNPs, x is an indicator variable for SNP genotypes, b is the SNP effect, and ε is the residual.From this model, the additive genetic variance is
with the first component being the expected value of additive genetic variance under linkage equilibrium (LE) and second component being the deviation from the expected value could be caused by linkage disequilibrium (LD), population structure or selection[27].Considering a pair of full siblings in a family, the additive genetic covariance between the sibs is
For full sibs,
for SNPs that are in LD, and for SNPs that are not in LD (as shown by both empirical and simulation results).Let |(SNPs are in LD), and |(SNPs are not in LD but correlated due to population structure)Therefore, the genetic variance is
The genetic covariance between a pair of full-sibs isIf we take a set of SNPs with their effects estimated from GCTA-COJO analysis (Online Methods), and create a predictor using these SNPs in an independent validation sample, we can write the predictor as
where is the estimate of b with with e being the error in estimating b.If we assume b and e are independent and denote and , the variance of the predictor is
The covariance between the predictors of a pair of full-sibs is
The covariance between the true phenotype and the predictor of a same individual is
The covariance between the true phenotype of one sib and the predictor of the other sib isIf we define and Δy = y1 − y2,
We therefore can calculate these four parameters as
where Vg can be interpreted as the variance explained by real SNP effects, Cg is the covariance between predictors attributed to the real effects of SNPs that are not in LD but correlated due to population stratification, Ve is the accumulated variance due to the errors in estimating SNP effects, and Ce is the covariance between predictors attributed to errors in estimating the effects of SNPs that are correlated due to population stratification.To assess the prediction accuracy, we usually perform a regression analysis of the real phenotype against the predictor, i.e.
so that the regression slope is actually
with the regression R2 being
In the absence of population structure,
Variance explained by SNPs in proximity to the top associated SNPs
We performed analyses to quantify the variance explained by SNPs in close physical proximity to the top associated SNPs in 9,500 unrelated individuals (pairwise genetic relatedness < 0.025) from a combined dataset of the QIMR and TwinGene cohorts. As in previous analyses, to avoid sample overlap between discovery and validation studies, we repeated the discovery meta-analysis excluding the QIMR and TwinGene cohorts, and identified 643 genome-wide significant SNPs from the GCTA-COJO analysis of the summary statistics using ARIC data for LD estimation. We used GCTA-GREML analysis[4,8] to quantify the phenotypic variance explained by all the common SNPs (MAF > 0.01) within 100Kb, 500Kb or 1Mb of the 643 genome-wide significant SNPs. We show in Supplementary Figure 6a that there are 104K, 423K and 745K SNPs within 100Kb, 500Kb and 1Mb of the top associated SNPs, which explain 20.8% (s.e. = 1.3%), 25.7% (s.e. = 1.8%) and 29.5% (s.e. = 2.2%) of phenotypic variance, respectively. We then applied a regression-based approach[28] to adjust for LD between SNPs. The estimates of variance explained after LD-adjustment were slightly higher than those without adjustment, and the ratio of between the estimates with and without LD-adjustment was consistently ~1.05 regardless of the window size (Supplementary Fig. 6a). However, the difference is small.We then sought to investigate whether or not there is an enrichment of additional association signals at the top associated loci. We varied the window size from 20Kb to 50Kb, 100Kb, 150Kb, 200Kb, 300Kb, 400Kb, 500Kb, 750Kb and 1Mb, and fitted a two-component model in GCTA-GREML analysis, with the first component being the top associated SNPs and the second component being the rest of SNPs within the window. We found that the per-SNP variance explained excluding the top SNPs (variance explained by the second component divided by the number of SNPs included in this component) decreased with the size of window (Supplementary Fig. 6b), implying that SNPs in closer physical proximity to the top associated SNPs tend to explain disproportionally more variance.
Enrichment of associated SNPs in ENCODE regions, loci containing OMIM genes, eQTLs and nsSNPs
To identify putative causal variants among the height-associated markers, we explored whether the height-associated SNPs were in strong LD (r2>0.8) with non-synonymous coding variants in 1000 Genomes Project CEU Phase 1 data, showed an effect on whole blood gene expression levels, were located within ENCODE-annotated regions, were within loci harboring monogenic growth genes, or had previously been associated with other complex traits in NHGRI GWAS catalog (P<5×10−8) (Supplementary Tables 7-11). To estimate the empirical assessment of enrichment for listed features we used 10,000 permutations of random sets of SNPs matched to the pruned (LD r2>0.1) 628 height-associated SNPs by the number of nearby genes (within a distance of LD r2>0.5), physical distance to nearest gene, and minor allele frequency.
Enrichment of genes in associated loci in known and novel pathways
Data-Driven Expression-Prioritized Integration for Complex Traits (DEPICT) analysis
The DEPICT method (T.H.P. et al., unpublished data; see Geller et al.[29] for an earlier application of DEPICT) relies on pre-computed predictions of gene function based on a heterogeneous panel of 77,840 expression arrays (Fehrmann et al., manuscript in review; ref. 30), 5,984 molecular pathways (based on 169,810 high-confidence experimentally derived protein-protein interactions[31]), 2,473 phenotypic gene sets (based on 211,882 gene-phenotype pairs from the Mouse Genetics Initiative (see URLs)), 737 Reactome pathways[32], 5,083 Gene Ontology terms[14], and 184 KEGG pathways[33]. The method leverages these predictions to extend the functional annotations of genes, including genes that previously had only a few or no functional annotations. DEPICT facilitates the analysis of GWAS data by (1) assessing whether genes in associated loci are enriched in tissue-specific expression, (2) identifying reconstituted gene sets that are enriched in genes from associated loci, and (3) systematically identifying the most likely causal gene(s) at a given locus (see Supplementary Note for a more detailed description of DEPICT). In order to run DEPICT, we first clumped the summary statistics from the meta-analysis using 500kb flanking regions, r2>0.1, and excluded SNPs with P≥5×10−8, which resulted in 628 SNPs. We then mapped genes to each of the 628 best-associated SNPs. For a given SNP, this was accomplished by including all genes that resided within LD r2>0.5 boundaries of that SNP, and always including the nearest gene, to its locus gene set. We used a locus definition that was calibrated using the GWAS data for height levels presented in this paper and optimized capture of known monogenic genes for those traits. We merged overlapping loci, and excluded loci that mapped near or within the major histocompatibility complex locus (chromosome 6, location: 20 to 40 Mb), which resulted in a list of 566 non-overlapping loci that were used as input to DEPICT. HapMap Project Phase II CEU genotype data was used for all LD calculations.
GRAIL and MAGENTA analysis
The GRAIL[14] algorithm was run using the LD pruned (r2>0.1) 628 SNPs without correcting for gene size, and using text-mining data up to December 2006 (default setting). MAGENTA[13] was run with the single genomic control adjusted summary statistics as input using default settings and excluding the HLA region.
Authors: Yun Li; Kathrin Laue; Samia Temtamy; Mona Aglan; L Damla Kotan; Gökhan Yigit; Husniye Canan; Barbara Pawlik; Gudrun Nürnberg; Emma L Wakeling; Oliver W Quarrell; Ingelore Baessmann; Matthew B Lanktree; Mustafa Yilmaz; Robert A Hegele; Khalda Amr; Klaus W May; Peter Nürnberg; A Kemal Topaloglu; Matthias Hammerschmidt; Bernd Wollnik Journal: Am J Hum Genet Date: 2010-12-10 Impact factor: 11.025
Authors: Jose Morales; Latifa Al-Sharif; Dania S Khalil; Jameela M A Shinwari; Prashant Bavi; Rahima A Al-Mahrouqi; Ali Al-Rajhi; Fowzan S Alkuraya; Brian F Meyer; Nada Al Tassan Journal: Am J Hum Genet Date: 2009-11 Impact factor: 11.025
Authors: David Croft; Gavin O'Kelly; Guanming Wu; Robin Haw; Marc Gillespie; Lisa Matthews; Michael Caudy; Phani Garapati; Gopal Gopinath; Bijay Jassal; Steven Jupe; Irina Kalatskaya; Shahana Mahajan; Bruce May; Nelson Ndegwa; Esther Schmidt; Veronica Shamovsky; Christina Yung; Ewan Birney; Henning Hermjakob; Peter D'Eustachio; Lincoln Stein Journal: Nucleic Acids Res Date: 2010-11-09 Impact factor: 16.971
Authors: Frank Geller; Bjarke Feenstra; Lisbeth Carstensen; Tune H Pers; Iris A L M van Rooij; Izabella Baranowska Körberg; Shweta Choudhry; Juha M Karjalainen; Tine H Schnack; Mads V Hollegaard; Wout F J Feitz; Nel Roeleveld; David M Hougaard; Joel N Hirschhorn; Lude Franke; Laurence S Baskin; Agneta Nordenskjöld; Loes F M van der Zanden; Mads Melbye Journal: Nat Genet Date: 2014-08-10 Impact factor: 38.330
Authors: Jian Yang; Teresa Ferreira; Andrew P Morris; Sarah E Medland; Pamela A F Madden; Andrew C Heath; Nicholas G Martin; Grant W Montgomery; Michael N Weedon; Ruth J Loos; Timothy M Frayling; Mark I McCarthy; Joel N Hirschhorn; Michael E Goddard; Peter M Visscher Journal: Nat Genet Date: 2012-03-18 Impact factor: 38.330
Authors: Thomas W Winkler; Felix R Day; Damien C Croteau-Chonka; Andrew R Wood; Adam E Locke; Reedik Mägi; Teresa Ferreira; Tove Fall; Mariaelisa Graff; Anne E Justice; Jian'an Luan; Stefan Gustafsson; Joshua C Randall; Sailaja Vedantam; Tsegaselassie Workalemahu; Tuomas O Kilpeläinen; André Scherag; Tonu Esko; Zoltán Kutalik; Iris M Heid; Ruth J F Loos Journal: Nat Protoc Date: 2014-04-24 Impact factor: 13.491
Authors: Alexander Gusev; Gaurav Bhatia; Noah Zaitlen; Bjarni J Vilhjalmsson; Dorothée Diogo; Eli A Stahl; Peter K Gregersen; Jane Worthington; Lars Klareskog; Soumya Raychaudhuri; Robert M Plenge; Bogdan Pasaniuc; Alkes L Price Journal: PLoS Genet Date: 2013-12-26 Impact factor: 5.917
Authors: Ana Cvejic; Lonneke Haer-Wigman; Jonathan C Stephens; Pim van der Harst; C Ellen van der Schoot; Willem H Ouwehand; Cornelis A Albers; Myrto Kostadima; Peter A Smethurst; Mattia Frontini; Emile van den Akker; Paul Bertone; Ewa Bielczyk-Maczyńska; Samantha Farrow; Rudolf Sn Fehrmann; Alan Gray; Masja de Haas; Vincent G Haver; Gregory Jordan; Juha Karjalainen; Hindrik Hd Kerstens; Graham Kiddle; Heather Lloyd-Jones; Malcolm Needs; Joyce Poole; Aicha Ait Soussan; Augusto Rendon; Klaus Rieneck; Jennifer G Sambrook; Hein Schepers; Herman H W Silljé; Botond Sipos; Dorine Swinkels; Asif U Tamuri; Niek Verweij; Nicholas A Watkins; Harm-Jan Westra; Derek Stemple; Lude Franke; Nicole Soranzo; Hendrik G Stunnenberg; Nick Goldman Journal: Nat Genet Date: 2013-04-07 Impact factor: 38.330
Authors: Fan Liu; Merel A Hamer; Stefanie Heilmann; Christine Herold; Susanne Moebus; Albert Hofman; André G Uitterlinden; Markus M Nöthen; Cornelia M van Duijn; Tamar Ec Nijsten; Manfred Kayser Journal: Eur J Hum Genet Date: 2015-10-28 Impact factor: 4.246