Ji Chen1,2, Cassandra N Spracklen3,4, Gaëlle Marenne2,5, Arushi Varshney6, Laura J Corbin7,8, Jian'an Luan9, Sara M Willems9, Ying Wu3, Xiaoshuai Zhang9,10, Momoko Horikoshi11,12,13, Thibaud S Boutin14, Reedik Mägi15, Johannes Waage16, Ruifang Li-Gao17, Kei Hang Katie Chan18,19,20, Jie Yao21, Mila D Anasanti22, Audrey Y Chu23, Annique Claringbould24, Jani Heikkinen22, Jaeyoung Hong25, Jouke-Jan Hottenga26,27, Shaofeng Huo28, Marika A Kaakinen22,29, Tin Louie30, Winfried März31,32,33, Hortensia Moreno-Macias34, Anne Ndungu12, Sarah C Nelson30, Ilja M Nolte35, Kari E North36, Chelsea K Raulerson3, Debashree Ray37, Rebecca Rohde36, Denis Rybin25, Claudia Schurmann38,39, Xueling Sim40,41,42, Lorraine Southam2,43, Isobel D Stewart9, Carol A Wang44, Yujie Wang36, Peitao Wu25, Weihua Zhang45,46, Tarunveer S Ahluwalia16,47,48, Emil V R Appel49, Lawrence F Bielak50, Jennifer A Brody51, Noël P Burtt52, Claudia P Cabrera53,54, Brian E Cade55,56, Jin Fang Chai40, Xiaoran Chai57,58, Li-Ching Chang59, Chien-Hsiun Chen59, Brian H Chen60, Kumaraswamy Naidu Chitrala61, Yen-Feng Chiu62, Hugoline G de Haan17, Graciela E Delgado33, Ayse Demirkan29,63, Qing Duan3,64, Jorgen Engmann65, Segun A Fatumo66,67,68, Javier Gayán69, Franco Giulianini23, Jung Ho Gong18, Stefan Gustafsson70, Yang Hai71, Fernando P Hartwig7,72, Jing He73, Yoriko Heianza74, Tao Huang75, Alicia Huerta-Chagoya76,77, Mi Yeong Hwang78, Richard A Jensen51, Takahisa Kawaguchi79, Katherine A Kentistou80,81, Young Jin Kim78, Marcus E Kleber33, Ishminder K Kooner46, Shuiqing Lai18, Leslie A Lange82, Carl D Langefeld83, Marie Lauzon21, Man Li84, Symen Ligthart63, Jun Liu63,85, Marie Loh45,86, Jirong Long87, Valeriya Lyssenko88,89, Massimo Mangino90,91, Carola Marzi92,93, May E Montasser94, Abhishek Nag12, Masahiro Nakatochi95, Damia Noce96, Raymond Noordam97, Giorgio Pistis98, Michael Preuss38,99, Laura Raffield3, Laura J Rasmussen-Torvik100, Stephen S Rich101,102, Neil R Robertson11,12, Rico Rueedi103,104, Kathleen Ryan94, Serena Sanna24,98, Richa Saxena105,106,107, Katharina E Schraut80,81, Bengt Sennblad108, Kazuya Setoh79, Albert V Smith109,110, Thomas Sparsø49, Rona J Strawbridge111,112, Fumihiko Takeuchi113, Jingyi Tan21, Stella Trompet97,114, Erik van den Akker115,116,117, Peter J van der Most35, Niek Verweij118,119, Mandy Vogel120, Heming Wang55,56, Chaolong Wang121,122, Nan Wang123,124, Helen R Warren53,54, Wanqing Wen87, Tom Wilsgaard125, Andrew Wong126, Andrew R Wood1, Tian Xie35, Mohammad Hadi Zafarmand127,128, Jing-Hua Zhao129, Wei Zhao50, Najaf Amin63,85, Zorayr Arzumanyan21, Arne Astrup130, Stephan J L Bakker131, Damiano Baldassarre132,133, Marian Beekman115, Richard N Bergman134, Alain Bertoni135, Matthias Blüher136, Lori L Bonnycastle137, Stefan R Bornstein138, Donald W Bowden139, Qiuyin Cai73, Archie Campbell140,141, Harry Campbell80, Yi Cheng Chang59,142,143, Eco J C de Geus26,27, Abbas Dehghan63, Shufa Du144, Gudny Eiriksdottir110, Aliki Eleni Farmaki145,146, Mattias Frånberg112, Christian Fuchsberger96, Yutang Gao147, Anette P Gjesing49, Anuj Goel12,148, Sohee Han78, Catharina A Hartman149, Christian Herder150,151,152, Andrew A Hicks96, Chang-Hsun Hsieh153,154, Willa A Hsueh155, Sahoko Ichihara156, Michiya Igase157, M Arfan Ikram63, W Craig Johnson30, Marit E Jørgensen47,158, Peter K Joshi80, Rita R Kalyani159, Fouad R Kandeel160, Tomohiro Katsuya161,162, Chiea Chuen Khor122, Wieland Kiess120, Ivana Kolcic163, Teemu Kuulasmaa164, Johanna Kuusisto165, Kristi Läll15, Kelvin Lam21, Deborah A Lawlor7,8, Nanette R Lee166,167, Rozenn N Lemaitre51, Honglan Li168, Shih-Yi Lin169,170, Jaana Lindström171, Allan Linneberg172,173, Jianjun Liu122,174, Carlos Lorenzo175, Tatsuaki Matsubara176, Fumihiko Matsuda79, Geltrude Mingrone177, Simon Mooijaart97, Sanghoon Moon78, Toru Nabika178, Girish N Nadkarni38, Jerry L Nadler179, Mari Nelis15, Matt J Neville11,180, Jill M Norris181, Yasumasa Ohyagi182, Annette Peters93,183,184, Patricia A Peyser50, Ozren Polasek163,185, Qibin Qi186, Dennis Raven149, Dermot F Reilly187, Alex Reiner188, Fernando Rivideneira189, Kathryn Roll21, Igor Rudan190, Charumathi Sabanayagam57,191, Kevin Sandow21, Naveed Sattar192, Annette Schürmann93,193, Jinxiu Shi194, Heather M Stringham41,42, Kent D Taylor21, Tanya M Teslovich195, Betina Thuesen172, Paul R H J Timmers80,196, Elena Tremoli133, Michael Y Tsai197, Andre Uitterlinden189, Rob M van Dam40,174,198, Diana van Heemst97, Astrid van Hylckama Vlieg17, Jana V van Vliet-Ostaptchouk35, Jagadish Vangipurapu199, Henrik Vestergaard49,200, Tao Wang186, Ko Willems van Dijk201,202,203, Tatijana Zemunik204, Gonçalo R Abecasis42, Linda S Adair144,205, Carlos Alberto Aguilar-Salinas206,207,208, Marta E Alarcón-Riquelme209,210, Ping An211, Larissa Aviles-Santa212, Diane M Becker213, Lawrence J Beilin214, Sven Bergmann103,104,215, Hans Bisgaard16, Corri Black216, Michael Boehnke41,42, Eric Boerwinkle217,218, Bernhard O Böhm219,220, Klaus Bønnelykke16, D I Boomsma26,27, Erwin P Bottinger38,221,222, Thomas A Buchanan124,223,224, Mickaël Canouil225,226, Mark J Caulfield53,54, John C Chambers45,46,86,227,228, Daniel I Chasman23,229, Yii-Der Ida Chen21, Ching-Yu Cheng57,191, Francis S Collins137, Adolfo Correa230, Francesco Cucca98, H Janaka de Silva231, George Dedoussis232, Sölve Elmståhl233, Michele K Evans234, Ele Ferrannini235, Luigi Ferrucci236, Jose C Florez107,237,238, Paul W Franks89,239, Timothy M Frayling1, Philippe Froguel225,226,240, Bruna Gigante241, Mark O Goodarzi242, Penny Gordon-Larsen144,205, Harald Grallert92,93, Niels Grarup49, Sameline Grimsgaard125, Leif Groop243,244, Vilmundur Gudnason110,245, Xiuqing Guo21, Anders Hamsten112, Torben Hansen49, Caroline Hayward196, Susan R Heckbert246, Bernardo L Horta72, Wei Huang194, Erik Ingelsson247, Pankow S James248, Marjo-Ritta Jarvelin249,250,251,252, Jost B Jonas253,254,255, J Wouter Jukema114,256, Pontiano Kaleebu257, Robert Kaplan186,188, Sharon L R Kardia50, Norihiro Kato113, Sirkka M Keinanen-Kiukaanniemi258,259, Bong-Jo Kim78, Mika Kivimaki260, Heikki A Koistinen261,262,263, Jaspal S Kooner46,227,228,264, Antje Körner120, Peter Kovacs136,265, Diana Kuh126, Meena Kumari266, Zoltan Kutalik104,267, Markku Laakso165, Timo A Lakka268,269,270, Lenore J Launer61, Karin Leander271, Huaixing Li28, Xu Lin28, Lars Lind272, Cecilia Lindgren12,273,274, Simin Liu18, Ruth J F Loos38,99, Patrik K E Magnusson275, Anubha Mahajan12,276, Andres Metspalu15, Dennis O Mook-Kanamori17,277, Trevor A Mori214, Patricia B Munroe53,54, Inger Njølstad125, Jeffrey R O'Connell94, Albertine J Oldehinkel149, Ken K Ong9, Sandosh Padmanabhan278, Colin N A Palmer279, Nicholette D Palmer139, Oluf Pedersen49, Craig E Pennell44, David J Porteous140,280, Peter P Pramstaller96, Michael A Province211, Bruce M Psaty51,246,281, Lu Qi282, Leslie J Raffel283, Rainer Rauramaa270, Susan Redline55,56, Paul M Ridker23,284, Frits R Rosendaal17, Timo E Saaristo285,286, Manjinder Sandhu287, Jouko Saramies288, Neil Schneiderman289, Peter Schwarz93,138,290, Laura J Scott41,42, Elizabeth Selvin37, Peter Sever264, Xiao-Ou Shu87, P Eline Slagboom115, Kerrin S Small90, Blair H Smith291, Harold Snieder35, Tamar Sofer238,292, Thorkild I A Sørensen7,8,49,293, Tim D Spector90, Alice Stanton294, Claire J Steves90,295, Michael Stumvoll136, Liang Sun28, Yasuharu Tabara79, E Shyong Tai40,174,296, Nicholas J Timpson7,8, Anke Tönjes136, Jaakko Tuomilehto297,298,299, Teresa Tusie77,300, Matti Uusitupa301, Pim van der Harst24,118, Cornelia van Duijn63,85, Veronique Vitart196, Peter Vollenweider302, Tanja G M Vrijkotte127, Lynne E Wagenknecht303, Mark Walker304, Ya X Wang254, Nick J Wareham9, Richard M Watanabe123,124,224, Hugh Watkins12,148, Wen B Wei305, Ananda R Wickremasinghe306, Gonneke Willemsen26,27, James F Wilson80,196, Tien-Yin Wong57,191, Jer-Yuarn Wu59, Anny H Xiang307, Lisa R Yanek213, Loïc Yengo308, Mitsuhiro Yokota309, Eleftheria Zeggini2,43,310, Wei Zheng87, Alan B Zonderman61, Jerome I Rotter21, Anna L Gloyn11,12,180,311, Mark I McCarthy11,12,180,312,276, Josée Dupuis25, James B Meigs107,238,313, Robert A Scott9, Inga Prokopenko22,29, Aaron Leong229,314,315, Ching-Ti Liu25, Stephen C J Parker6,316, Karen L Mohlke3, Claudia Langenberg9, Eleanor Wheeler2,9, Andrew P Morris12,317,318,319, Inês Barroso320,321,322. 1. Exeter Centre of Excellence for Diabetes Research (EXCEED), Genetics of Complex Traits, University of Exeter Medical School, University of Exeter, Exeter, UK. 2. Department of Human Genetics, Wellcome Sanger Institute, Cambridge, UK. 3. Department of Genetics, University of North Carolina, Chapel Hill, NC, USA. 4. Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA, USA. 5. Inserm, Univ Brest, EFS, UMR 1078, GGB, Brest, France. 6. Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA. 7. MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK. 8. Department of Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK. 9. MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK. 10. Department of Biostatistics, School of Public Health, Shandong University, Jinan, China. 11. Oxford Centre for Diabetes, Endocrinology and Metabolism, Radcliffe Department of Medicine, University of Oxford, Oxford, UK. 12. Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK. 13. Laboratory for Genomics of Diabetes and Metabolism, RIKEN Centre for Integrative Medical Sciences, Yokohama, Japan. 14. Medical Research Council Human Genetics Unit, Institute for Genetics and Molecular Medicine, Edinburgh, UK. 15. Estonian Genome Center, Institute of Genomics, University of Tartu, Tartu, Estonia. 16. COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte Hospital, University of Copenhagen, Copenhagen, Denmark. 17. Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, the Netherlands. 18. Department of Epidemiology, Brown University School of Public Health, Brown University, Providence, RI, USA. 19. Department of Biomedical Sciences, City University of Hong Kong, Hong Kong SAR, China. 20. Department of Electrical Engineering, City University of Hong Kong, Hong Kong SAR, China. 21. The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA. 22. Department of Metabolism, Digestion and Reproduction, Imperial College London, London, UK. 23. Division of Preventive Medicine, Brigham and Women's Hospital, Boston, MA, USA. 24. Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands. 25. Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA. 26. Department of Biological Psychology, Faculty of Behaviour and Movement Sciences, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands. 27. Amsterdam Public Health Research Institute, Amsterdam University Medical Center, Amsterdam, the Netherlands. 28. CAS Key Laboratory of Nutrition, Metabolism and Food Safety, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China. 29. Section of Statistical Multi-omics, Department of Clinical and Experimental Research, University of Surrey, Guildford, UK. 30. Department of Biostatistics, University of Washington, Seattle, WA, USA. 31. SYNLAB Academy, SYNLAB Holding Deutschland GmbH, Mannheim, Germany. 32. Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University Graz, Graz, Austria. 33. Vth Department of Medicine (Nephrology, Hypertensiology, Rheumatology, Endocrinology, Diabetology), Medical Faculty Mannheim, Heidelberg University, Mannheim, Baden-Württemberg, Germany. 34. Department of Economics, Metropolitan Autonomous University, Mexico City, Mexico. 35. Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands. 36. CVD Genetic Epidemiology Computational Laboratory, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, USA. 37. Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA. 38. The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA. 39. HPI Digital Health Center, Digital Health and Personalized Medicine, Hasso Plattner Institute, Potsdam, Germany. 40. Saw Swee Hock School of Public Health, National Univeristy of Singapore and National University Health System, Singapore, Singapore. 41. Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA. 42. Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI, USA. 43. Institute of Translational Genomics, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany. 44. School of Medicine and Public Health, College of Health, Medicine and Wellbeing, The University of Newcastle, Newcastle, New South Wales, Australia. 45. Department of Epidemiology and Biostatistics, Imperial College London, London, UK. 46. Department of Cardiology, Ealing Hospital, London North West Healthcare NHS Trust, London, UK. 47. Steno Diabetes Center Copenhagen, Gentofte, Denmark. 48. The Bioinformatics Centre, Department of Biology, University of Copenhagen, Copenhagen, Denmark. 49. Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark. 50. Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA. 51. Department of Medicine, Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA. 52. Metabolism Program, Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, USA. 53. Department of Clinical Pharmacology, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, UK. 54. NIHR Barts Cardiovascular Biomedical Research Centre, Queen Mary University of London, London, UK. 55. Department of Medicine, Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA. 56. Department of Medicine, Sleep Medicine, Harvard Medical School, Boston, MA, USA. 57. Ocular Epidemiology, Singapore Eye Research Institute, Singapore National Eye Centre, Singapore, Singapore. 58. Department of Ophthalmology, National University of Singapore and National University Health System, Singapore, Singapore. 59. Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan. 60. Department of Epidemiology, The Herbert Wertheim School of Public Health and Human Longevity Science, University of California San Diego, La Jolla, CA, USA. 61. Laboratory of Epidemiology and Population Sciences, National Institute on Aging, National Institutes of Health, Baltimore, MD, USA. 62. Institute of Population Health Sciences, National Health Research Institutes, Miaoli, Taiwan. 63. Department of Epidemiology, Erasmus Medical Center, Rotterdam, the Netherlands. 64. Department of Statistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA. 65. Institute of Cardiovascular Science, University College London, London, UK. 66. Uganda Medical Informatics Centre (UMIC), MRC/UVRI and London School of Hygiene & Tropical Medicine (Uganda Research Unit), Entebbe, Uganda. 67. London School of Hygiene & Tropical Medicine, London, UK. 68. H3Africa Bioinformatics Network (H3ABioNet) Node, Centre for Genomics Research and Innovation, NABDA/FMST, Abuja, Nigeria. 69. Bioinfosol, Sevilla, Spain. 70. Molecular Epidemiology and Science for Life Laboratory, Department of Medical Sciences, Uppsala University, Uppsala, Sweden. 71. Department of Statistics, The University of Auckland, Science Center, Auckland, New Zealand. 72. Postgraduate Program in Epidemiology, Federal University of Pelotas, Pelotas, Brazil. 73. Department of Medicine, Epidemiology, Vanderbilt University Medical Center, Nashville, TN, USA. 74. Department of Epidemiology, Tulane University Obesity Research Center, Tulane University, New Orleans, LA, USA. 75. Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China. 76. Molecular Biology and Genomic Medicine Unit, National Council for Science and Technology, Mexico City, Mexico. 77. Molecular Biology and Genomic Medicine Unit, National Institute of Medical Sciences and Nutrition, Mexico City, Mexico. 78. Division of Genome Science, Department of Precision Medicine, National Institute of Health, Cheongju, South Korea. 79. Center for Genomic Medicine, Kyoto University Graduate School of Medicine, Kyoto, Japan. 80. Centre for Global Health Research, Usher Institute, University of Edinburgh, Edinburgh, UK. 81. Centre for Cardiovascular Sciences, Queen's Medical Research Institute, University of Edinburgh, Edinburgh, UK. 82. Department of Medicine, Divison of Biomedical Informatics and Personalized Medicine, University of Colorado Anschutz Medical Campus, Denver, CO, USA. 83. Department of Biostatistics and Data Science, Wake Forest School of Medicine, Winston-Salem, NC, USA. 84. Department of Medicine, Division of Nephrology and Hypertension, University of Utah, Salt Lake City, UT, USA. 85. Nuffield Department of Population Health, University of Oxford, Oxford, UK. 86. Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore. 87. Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt University Medical Center, Nashville, TN, USA. 88. Department of Clinical Science, Center for Diabetes Research, University of Bergen, Bergen, Norway. 89. Department of Clinical Sciences, Lund University Diabetes Centre, Lund University, Malmo, Sweden. 90. Department of Twin Research and Genetic Epidemiology, School of Life Course Sciences, King's College London, London, UK. 91. NIHR Biomedical Research Centre, Guy's and St Thomas' NHS Foundation Trust, London, UK. 92. Institute of Epidemiology, Research Unit of Molecular Epidemiology, Helmholtz Zentrum München Research Center for Environmental Health, Neuherberg, Germany. 93. German Center for Diabetes Research (DZD), Neuherberg, Germany. 94. Department of Medicine, Division of Endocrinology, Diabetes and Nutrition, University of Maryland School of Medicine, Baltimore, MD, USA. 95. Public Health Informatics Unit, Department of Integrated Sciences, Nagoya University Graduate School of Medicine, Nagoya, Japan. 96. Institute for Biomedicine, Eurac Research, Bolzano, Italy. 97. Department of Internal Medicine, Section of Gerontology and Geriatrics, Leiden University Medical Center, Leiden, the Netherlands. 98. Istituto di Ricerca Genetica e Biomedica (IRGB), Consiglio Nazionale delle Ricerche (CNR), Monserrato, Italy. 99. The Mindich Child Health and Development Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA. 100. Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA. 101. Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA. 102. Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA. 103. Department of Computational Biology, University of Lausanne, Lausanne, Switzerland. 104. Swiss Institute of Bioinformatics, Lausanne, Switzerland. 105. Center for Genomic Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA. 106. Department of Anesthesia, Critical Care and Pain Medicine, Massachusetts General Hospital, Boston, MA, USA. 107. Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, USA. 108. Department of Cell and Molecular Biology, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Uppsala University, Uppsala, Sweden. 109. Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA. 110. Icelandic Heart Association, Kopavogur, Iceland. 111. Institute of Health and Wellbeing, University of Glasgow, Glasgow, UK. 112. Department of Medicine Solna, Cardiovascular Medicine, Karolinska Institutet, Stockholm, Sweden. 113. National Center for Global Health and Medicine, Tokyo, Japan. 114. Department of Cardiology, Leiden University Medical Center, Leiden, the Netherlands. 115. Department of Biomedical Data Sciences, Molecular Epidemiology, Leiden University Medical Center, Leiden, the Netherlands. 116. Department of Pattern Recognition and Bioinformatics, Delft University of Technology, Delft, the Netherlands. 117. Department of Biomedical Data Sciences, Leiden Computational Biology Center, Leiden University Medical Center, Leiden, the Netherlands. 118. Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands. 119. Genomics PLC, Oxford, UK. 120. Center of Pediatric Research, University Children's Hospital Leipzig, University of Leipzig Medical Center, Leipzig, Germany. 121. Department of Epidemiology and Biostatistics, School of Public Health, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China. 122. Genome Institute of Singapore, Agency for Science, Technology and Research, Singapore, Singapore. 123. Department of Preventive Medicine, Keck School of Medicine of University of Southern California, Los Angeles, CA, USA. 124. University of Southern California Diabetes and Obesity Research Institute, Keck School of Medicine of University of Southern California, Los Angeles, CA, USA. 125. Department of Community Medicine, Faculty of Health Sciences, UIT the Arctic University of Norway, Tromsø, Norway. 126. MRC Unit for Lifelong Health and Ageing at University College London, London, UK. 127. Department of Public Health, Amsterdam Public Health Research Institute, Amsterdam University Medical Center, Amsterdam, the Netherlands. 128. Department of Clinical Epidemiology, Biostatistics, and Bioinformatics, Amsterdam Public Health Research Institute, Amsterdam University Medical Center, Amsterdam, the Netherlands. 129. Department of Public Health and Primary Care, School of Clinical Medicine, University of Cambridge, Cambridge, UK. 130. Department of Nutrition, Exercise, and Sports, Faculty of Science, University of Copenhagen, Copenhagen, Denmark. 131. Department of Internal Medicine, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands. 132. Department of Medical Biotechnology and Translational Medicine, University of Milan, Milan, Italy. 133. Centro Cardiologico Monzino, IRCCS, Milan, Italy. 134. Diabetes and Obesity Research Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA. 135. Department of Epidemiology and Prevention, Division of Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, NC, USA. 136. Medical Department III-Endocrinology, Nephrology, Rheumatology, University of Leipzig Medical Center, Leipzig, Germany. 137. Medical Genomics and Metabolic Genetics Branch, National Human Genome Research Institute, National Institues of Health, Bethesda, MD, USA. 138. Department for Prevention and Care of Diabetes, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany. 139. Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA. 140. Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh, UK. 141. Usher Institute, University of Edinburgh, Edinburgh, UK. 142. Department of Internal Medicine, National Taiwan University Hospital, Taipei, Taiwan. 143. Graduate Institute of Medical Genomics and Proteomics, National Taiwan University, Taipei, Taiwan. 144. Department of Nutrition, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, USA. 145. Department of Population Science and Experimental Medicine, Institute of Cardiovascular Science, University College London, London, UK. 146. Department of Nutrition and Dietetics, School of Health Science and Education, Harokopio University of Athens, Athens, Greece. 147. Department of Epidemiology, Shanghai Cancer Institute, Shanghai, China. 148. Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, UK. 149. Department of Psychiatry, Interdisciplinary Center Psychopathy and Emotion Regulation, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands. 150. Institute for Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University Düsseldorf, Düsseldorf, Germany. 151. Division of Endocrinology and Diabetology, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany. 152. German Center for Diabetes Research (DZD), Düsseldorf, Germany. 153. Internal Medicine, Endocrine and Metabolism, Tri-Service General Hospital, Taipei, Taiwan. 154. School of Medicine, National Defense Medical Center, Taipei, Taiwan. 155. Internal Medicine, Endocrinology, Diabetes and Metabolism, Diabetes and Metabolism Research Center, The Ohio State University Wexner Medical Center, Columbus, OH, USA. 156. Department of Environmental and Preventive Medicine, Jichi Medical University School of Medicine, Shimotsuke, Japan. 157. Department of Anti-aging Medicine, Ehime University Graduate School of Medicine, Toon, Japan. 158. National Institute of Public Health, University of Southern Denmark, Odense, Denmark. 159. Department of Medicine, Endocrinology, Diabetes and Metabolism, Johns Hopkins University School of Medicine, Baltimore, MD, USA. 160. Clinical Diabetes, Endocrinology and Metabolism, Translational Research and Cellular Therapeutics, Beckman Research Institute of the City of Hope, Duarte, CA, USA. 161. Department of Clinical Gene Therapy, Osaka University Graduate School of Medicine, Suita, Japan. 162. Department of Geriatric and General Medicine, Osaka University Graduate School of Medicine, Suita, Japan. 163. Department of Public Health, University of Split School of Medicine, Split, Croatia. 164. Institute of Biomedicine, Bioinformatics Center, Univeristy of Eastern Finland, Kuopio, Finland. 165. Department of Medicine, University of Eastern Finland and Kuopio University Hospital, Kuopio, Finland. 166. USC-Office of Population Studies Foundation, University of San Carlos, Cebu City, the Philippines. 167. Department of Anthropology, Sociology and History, University of San Carlos, Cebu City, the Philippines. 168. State Key Laboratory of Oncogene and Related Genes and Department of Epidemiology, Shanghai Cancer Institute, Renji Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China. 169. Center for Geriatrics and Gerontology, Taichung Veterans General Hospital, Taichung, Taiwan. 170. National Defense Medical Center, National Yang-Ming University, Taipei, Taiwan. 171. Diabetes Prevention Unit, National Institute for Health and Welfare, Helsinki, Finland. 172. Center for Clinical Research and Prevention, Bispebjerg and Frederiksberg Hospital, Copenhagen, Denmark. 173. Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark. 174. Yong Loo Lin School of Medicine, National University of Singapore and National University Health System, Singapore, Singapore. 175. Department of Medicine, University of Texas Health Sciences Center, San Antonio, TX, USA. 176. Department of Internal Medicine, Aichi Gakuin University School of Dentistry, Nagoya, Japan. 177. Department of Diabetes, Diabetes, and Nutritional Sciences, James Black Centre, King's College London, London, UK. 178. Department of Functional Pathology, Shimane University School of Medicine, Izumo, Japan. 179. Department of Medicine and Pharmacology, New York Medical College School of Medicine, Valhalla, NY, USA. 180. Oxford NIHR Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK. 181. Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA. 182. Department of Geriatric Medicine and Neurology, Ehime University Graduate School of Medicine, Toon, Japan. 183. Institute of Epidemiology, Helmholtz Zentrum München Research Center for Environmental Health, Neuherberg, Germany. 184. Institute for Medical Information Processing, Biometry and Epidemiology, Ludwig-Maximilians University Munich, Munich, Germany. 185. Gen-Info, Zagreb, Croatia. 186. Department of Epidemiology and Population Health, Albert Einstein College of Medicine, New York, NY, USA. 187. Genetics and Pharmacogenomics, Merck Sharp & Dohme, Kenilworth, NJ, USA. 188. Department of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA. 189. Department of Internal Medicine, Erasmus Medical Center, Rotterdam, the Netherlands. 190. Centre for Global Health, The Usher Institute, University of Edinburgh, Edinburgh, UK. 191. Ophthalmology & Visual Sciences Academic Clinical Program (Eye ACP), Duke-NUS Medical School, Singapore, Singapore. 192. BHF Glasgow Cardiovascular Research Centre, Institute of Cardiovascular and Medical Sciences, University of Glasgow, Glasgow, UK. 193. Department of Experimental Diabetology, German Institute of Human Nutrition Potsdam-Rehbruecke, Nuthetal, Germany. 194. Department of Genetics, Shanghai-MOST Key Laboratory of Health and Disease Genomics, Chinese National Human Genome Center at Shanghai (CHGC) and Shanghai Academy of Science & Technology (SAST), Shanghai, China. 195. Sarepta Therapeutics, Cambridge, MA, USA. 196. Medical Research Council Human Genetics Unit, Institute for Genetics and Cancer, University of Edinburgh, Edinburgh, UK. 197. Department of Laboratory Medicine and Pathology, University of Minnesota, Minneapolis, MN, USA. 198. Department of Nutrition, Harvard T. H. Chan School of Public Health, Boston, MA, USA. 199. Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland, Kuopio, Finland. 200. Department of Medicine, Bornholms Hospital, Rønne, Denmark. 201. Department of Internal Medicine, Division of Endocrinology, Leiden University Medical Center, Leiden, the Netherlands. 202. Laboratory for Experimental Vascular Medicine, Leiden University Medical Center, Leiden, the Netherlands. 203. Department of Human Genetics, Leiden University Medical Center, Leiden, the Netherlands. 204. Department of Human Biology, University of Split School of Medicine, Split, Croatia. 205. Carolina Population Center, University of North Carolina, Chapel Hill, NC, USA. 206. Department of Endocrinology and Metabolism, Instituto Nacional de Ciencias Medicas y Nutricion, Mexico City, Mexico. 207. Unidad de Investigación de Enfermedades Metabólicas, Instituto Nacional de Ciencias Médicas y Nutrición and Tec Salud, Mexico City, Mexico. 208. Instituto Tecnológico y de Estudios Superiores de Monterrey Tec Salud, Monterrey, Mexico. 209. Department of Medical Genomics, Pfizer/University of Granada/Andalusian Government Center for Genomics and Oncological Research (GENYO), Granada, Spain. 210. Institute for Environmental Medicine, Chronic Inflammatory Diseases, Karolinska Institutet, Solna, Sweden. 211. Department of Genetics, Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO, USA. 212. Clinical and Health Services Research, National Institute on Minority Health and Health Disparities, Bethesda, MD, USA. 213. Department of Medicine, General Internal Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA. 214. Medical School, Royal Perth Hospital Unit, University of Western Australia, Perth, Western Australia, Australia. 215. Department of Integrative Biomedical Sciences, University of Cape Town, Cape Town, South Africa. 216. Aberdeen Centre for Health Data Science, School of Medicine, Medical Sciences and Nutrition, University of Aberdeen, Aberdeen, UK. 217. Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA. 218. Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA. 219. Division of Endocrinology and Diabetes, Graduate School of Molecular Endocrinology and Diabetes, University of Ulm, Ulm, Germany. 220. LKC School of Medicine, Nanyang Technological University, Singapore and Imperial College London, UK, Singapore, Singapore. 221. Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA. 222. Digital Health Center, Hasso Plattner Institut, University Potsdam, Potsdam, Germany. 223. Department of Medicine, Keck School of Medicine of University of Southern California, Los Angeles, CA, USA. 224. Department of Physiology and Neuroscience, Keck School of Medicine of University of Southern California, Los Angeles, CA, USA. 225. INSERM UMR 1283/CNRS UMR 8199, European Institute for Diabetes (EGID), Université de Lille, Lille, France. 226. INSERM UMR 1283/CNRS UMR 8199, European Institute for Diabetes (EGID), Institut Pasteur de Lille, Lille, France. 227. Imperial College Healthcare NHS Trust, Imperial College London, London, UK. 228. MRC-PHE Centre for Environment and Health, Imperial College London, London, UK. 229. Harvard Medical School, Boston, MA, USA. 230. Department of Medicine, Jackson Heart Study, University of Mississippi Medical Center, Jackson, MS, USA. 231. Department of Medicine, Faculty of Medicine, University of Kelaniya, Ragama, Sri Lanka. 232. Department of Nutrition and Dietetics, School of Health Science and Education, Harokopio University of Athens, Kallithea, Greece. 233. Department of Clinical Sciences, Lund University, Malmö, Sweden. 234. Laboratory of Epidemiology and Population Sciences, National Institute on Aging Intramural Research Program, National Institutes of Health, Baltimore, MD, USA. 235. CNR Institute of Clinical Physiology, Pisa, Italy. 236. Intramural Research Program, National Institute of Aging, Baltimore, MD, USA. 237. Diabetes Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA. 238. Department of Medicine, Harvard Medical School, Boston, MA, USA. 239. Department of Public Health and Clinical Medicine, Umeå University, Umeå, Sweden. 240. Department of Genomics of Common Disease, Imperial College London, London, UK. 241. Department of Medicine, Cardiovascular Medicine, Karolinska Institutet, Stockholm, Sweden. 242. Department of Medicine, Division of Endocrinology, Diabetes and Metabolism, Cedars-Sinai Medical Center, Los Angeles, CA, USA. 243. Diabetes Centre, Lund University, Lund, Sweden. 244. Finnish Institute of Molecular Medicine, Helsinki University, Helsinki, Finland. 245. Faculty of Medicine, School of Health Sciences, University of Iceland, Reykjavik, Iceland. 246. Department of Epidemiology, Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA. 247. Department of Medicine, Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford University, Stanford, CA, USA. 248. Division of Epidemiology and Community Health, University of Minnesota, Minneapolis, MN, USA. 249. Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK. 250. Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland. 251. Unit of Primary Health Care, Oulu Univerisity Hospital, OYS, Oulu, Finland. 252. Department of Life Sciences, College of Health and Life Sciences, Brunel University London, London, UK. 253. Department of Ophthalmology, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany. 254. Beijing Institute of Ophthalmology, Beijing Ophthalmology and Visual Science Key Lab, Beijing Tongren Eye Center, Beijing Tongren Hospital, Capital Medical University, Beijing, China. 255. Institute of Molecular and Clinical Ophthalmology Basel IOB, Basel, Switzerland. 256. Netherlands Heart Institute, Utrecht, the Netherlands. 257. MRC/UVRI and LSHTM (Uganda Research Unit), Entebbe, Uganda. 258. Faculty of Medicine, Institute of Health Sciences, University of Oulu, Oulu, Finland. 259. Unit of General Practice, Oulu University Hospital, Oulu, Finland. 260. Department of Epidemiology and Public Health, University College London, London, UK. 261. Department of Public Health Solutions, Finnish Institute for Health and Welfare, Helsinki, Finland. 262. Department of Medicine, University of Helsinki and Helsinki University Central Hospital, Helsinki, Finland. 263. Minerva Foundation Institute for Medical Research, Helsinki, Finland. 264. National Heart and Lung Institute, Imperial College London, London, UK. 265. IFB Adiposity Diseases, University of Leipzig Medical Center, Leipzig, Germany. 266. Institute for Social and Economic Research, University of Essex, Colchester, UK. 267. University Institute of Primary Care and Public Health, Division of Biostatistics, University of Lausanne, Lausanne, Switzerland. 268. Institute of Biomedicine, School of Medicine, University of Eastern Finland, Kuopio, Finland. 269. Department of Clinical Physiology and Nuclear Medicine, Kuopio University Hospital, Kuopio, Finland. 270. Foundation for Research in Health Exercise and Nutrition, Kuopio Research Institute of Exercise Medicine, Kuopio, Finland. 271. Institute of Environmental Medicine, Cardiovascular and Nutritional Epidemiology, Karolinska Institutet, Stockholm, Sweden. 272. Department of Medical Sciences, University of Uppsala, Uppsala, Sweden. 273. Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK. 274. Nuffield Department of Women's and Reproductive Health, University of Oxford, Oxford, UK. 275. Department of Medical Epidemiology and Biostatistics and the Swedish Twin Registry, Karolinska Institutet, Stockholm, Sweden. 276. Genentech, South San Francisco, CA, USA. 277. Department of Public Health and Primary Care, Leiden University Medical Center, Leiden, the Netherlands. 278. Institute of Cardiovascular and Medical Sciences, University of Glasgow, Glasgow, UK. 279. Division of Population Health and Genomics, School of Medicine, University of Dundee, Ninewells Hospital and Medical School, Dundee, UK. 280. Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK. 281. Department of Health Services, Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA. 282. Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, USA. 283. Department of Pediatrics, Genetic and Genomic Medicine, University of California, Irvine, Irvine, CA, USA. 284. Havard Medical School, Boston, MA, USA. 285. Tampere, Finnish Diabetes Association, Tampere, Finland. 286. Pirkanmaa Hospital District, Tampere, Finland. 287. Department of Medicine, University of Cambridge, Cambridge, UK. 288. South Karelia Central Hospital, Lappeenranta, Finland. 289. Department of Psychology, University of Miami, Miami, FL, USA. 290. Paul Langerhans Institute Dresden of the Helmholtz Center Munich, University Hospital and Faculty of Medicine, Dresden, Germany. 291. Division of Population Health and Genomics, Ninewells Hospital and Medical School, University of Dundee, Dundee, UK. 292. Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA. 293. Department of Public Health, Section of Epidemiology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark. 294. Department of Molecular and Cellular Therapeutics, Royal College of Surgeons in Ireland, Dublin, Ireland. 295. Department of Ageing and Health, Guy's and St Thomas' NHS Foundation Trust, London, UK. 296. Cardiovascular and Metabolic Disease Signature Research Program, Duke-NUS Medical School, Singapore, Singapore. 297. Department of Public Health Solutions, National Institute for Health and Welfare, Helsinki, Finland. 298. Department of Public Health, University of Helsinki, Helsinki, Finland. 299. Saudi Diabetes Research Group, King Abdulaziz University, Jeddah, Saudi Arabia. 300. Department of Genomic Medicine and Environmental Toxicology, Instituto de Investigaciones Biomedicas, Universidad Nacional Autonoma de Mexico, Mexico City, Mexico. 301. Department of Public Health and Clinical Nutrition, University of Eastern Finland, Kuopio, Finland. 302. Department of Medicine, Internal Medicine, Lausanne University Hospital (CHUV), Lausanne, Switzerland. 303. Department of Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, NC, USA. 304. Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK. 305. Beijing Tongren Eye Center, Beijing Key Laboratory of Intraocular Tumor Diagnosis and Treatment, Beijing Ophthalmology & Visual Sciences Key Lab, Beijing Tongren Hospital, Capital Medical University, Beijing, China. 306. Department of Public Health, Faculty of Medicine, University of Kelaniya, Ragama, Sri Lanka. 307. Department of Research and Evaluation, Kaiser Permanente of Southern California, Pasadena, CA, USA. 308. Institute for Molecular Bioscience, The University of Queensland, St Lucia, Queensland, Australia. 309. Kurume University School of Medicine, Kurume, Japan. 310. TUM School of Medicine, Technical University of Munich and Klinikum Rechts der Isar, Munich, Germany. 311. Department of Pediatrics, Division of Endocrinology, Stanford School of Medicine, Stanford, CA, USA. 312. Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK. 313. Department of Medicine, Division of General Internal Medicine, Massachusetts General Hospital, Boston, MA, USA. 314. Department of Medicine, General Internal Medicine, Massachusetts General Hospital, Boston, MA, USA. 315. Department of Medicine, Diabetes Unit and Endocrine Unit, Massachusetts General Hospital, Boston, MA, USA. 316. Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA. 317. Centre for Genetics and Genomics Versus Arthritis, Division of Musculoskeletal and Dermatological Sciences, The University of Manchester, Manchester, UK. 318. Centre for Musculoskeletal Research, Division of Musculoskeletal and Dermatological Sciences, The University of Manchester, Manchester, UK. 319. Department of Biostatistics, University of Liverpool, Liverpool, UK. 320. Exeter Centre of Excellence for Diabetes Research (EXCEED), Genetics of Complex Traits, University of Exeter Medical School, University of Exeter, Exeter, UK. ines.barroso@exeter.ac.uk. 321. Department of Human Genetics, Wellcome Sanger Institute, Cambridge, UK. ines.barroso@exeter.ac.uk. 322. MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK. ines.barroso@exeter.ac.uk.
Abstract
Glycemic traits are used to diagnose and monitor type 2 diabetes and cardiometabolic health. To date, most genetic studies of glycemic traits have focused on individuals of European ancestry. Here we aggregated genome-wide association studies comprising up to 281,416 individuals without diabetes (30% non-European ancestry) for whom fasting glucose, 2-h glucose after an oral glucose challenge, glycated hemoglobin and fasting insulin data were available. Trans-ancestry and single-ancestry meta-analyses identified 242 loci (99 novel; P < 5 × 10-8), 80% of which had no significant evidence of between-ancestry heterogeneity. Analyses restricted to individuals of European ancestry with equivalent sample size would have led to 24 fewer new loci. Compared with single-ancestry analyses, equivalent-sized trans-ancestry fine-mapping reduced the number of estimated variants in 99% credible sets by a median of 37.5%. Genomic-feature, gene-expression and gene-set analyses revealed distinct biological signatures for each trait, highlighting different underlying biological pathways. Our results increase our understanding of diabetes pathophysiology by using trans-ancestry studies for improved power and resolution.
Glycemic traits are used to diagnose and monitor type 2 diabetes and cardiometabolic health. To date, most genetic studies of glycemic traits have focused on individuals of European ancestry. Here we aggregated genome-wide association studies comprising up to 281,416 individuals without diabetes (30% non-European ancestry) for whom fasting glucose, 2-h glucose after an oral glucose challenge, glycated hemoglobin and fasting insulin data were available. Trans-ancestry and single-ancestry meta-analyses identified 242 loci (99 novel; P < 5 × 10-8), 80% of which had no significant evidence of between-ancestry heterogeneity. Analyses restricted to individuals of European ancestry with equivalent sample size would have led to 24 fewer new loci. Compared with single-ancestry analyses, equivalent-sized trans-ancestry fine-mapping reduced the number of estimated variants in 99% credible sets by a median of 37.5%. Genomic-feature, gene-expression and gene-set analyses revealed distinct biological signatures for each trait, highlighting different underlying biological pathways. Our results increase our understanding of diabetes pathophysiology by using trans-ancestry studies for improved power and resolution.
Fasting glucose (FG), 2h-glucose post-challenge (2hGlu), and glycated hemoglobin (HbA1c) are glycemic traits used to diagnose diabetes[1]. In addition, HbA1c is the most commonly used biomarker to monitor glucose control in patients with diabetes. Fasting insulin (FI) reflects a combination of insulin secretion and insulin resistance, both components of type 2 diabetes (T2D), and insulin clearance[2]. Collectively, all four glycemic traits are useful to better understand T2D pathophysiology[3-5] and cardiometabolic outcomes[6].To date, genome-wide association studies (GWAS) and analysis of Metabochip and exome arrays have identified >120 loci associated with glycemic traits in individuals without diabetes[7-15]. However, despite considerable differences in the prevalence of T2D risk factors across ancestries[16-18], most glycemic trait GWAS have insufficient representation of individuals of non-European ancestry. Additionally, they have limited resolution for fine-mapping of causal variants and for effector transcript identification. Here, we present large-scale trans-ancestry meta-analyses of GWAS for four glycemic traits in individuals without diabetes. We aimed to identify additional glycemic trait-associated loci; investigate the portability of loci and genetic scores across ancestries; leverage differences in effect allele frequency (EAF), effect size, and linkage disequilibrium (LD) across diverse populations to conduct fine-mapping and aid causal variant/effector transcript identification; and compare the genetic architecture of glycemic traits to further identify the cell-types and target tissues most influenced by these traits which inform T2D pathophysiology.
Results
Study design and definitions
To identify loci associated with glycemic traits FG, 2hGlu, FI, and HbA1c, we aggregated GWAS in up to 281,416 individuals without diabetes, ~30% of whom were of non-European ancestry [13% East Asian, 7% Hispanic, 6% African-American, 3% South Asian, and 2% sub-Saharan African (Ugandan data only available for HbA1c)]. Each cohort imputed data to the 1000 Genomes Project reference panel[19] (phase 1 v3, March 2012, or later; Methods, Supplementary Table 1, Extended Data Figure 1, Supplementary Note). Up to ~49.3 million variants were directly genotyped or imputed, with between 38.6 million (2hGlu) and 43.5 million variants (HbA1c) available for analysis after exclusions based on minor allele count (MAC < 3) and imputation quality (imputation r2 or INFO score <0.40) in each cohort. FG, 2hGlu and FI analyses were adjusted for BMI[15] but for simplicity they are abbreviated as FG, 2hGlu and FI (Methods).
Extended Data Fig. 1
Flow diagram of this study
The figure shows the data, key methods and main analyses included in this effort.
We first performed trait-specific fixed-effect meta-analyses within each ancestry using METAL[20] (Methods). We defined “single-ancestry lead” variants as the strongest trait-associated variants (P<5x10-8) within a 1Mb region in an ancestry (Table 1). Within each ancestry and each autosome, we used approximate conditional analyses in GCTA[21,22], to identify “single-ancestry index variants” (P<5x10-8) that exert conditionally distinct effects on the trait (Table 1, Methods, Supplementary Note). This approach identified 124 FG, 15 2hGlu, 48 FI and 139 HbA1c variants that were significant in at least one ancestry (Supplementary Table 2).
Table 1
Glossary of terms - This study combined analyses of trait-associations across multiple correlated glycemic traits and across multiple ancestries, which has presented challenges in our ability to apply commonly used terms with clarity. For this reason, we define below terms often used in the field with variable meaning, as well as definitions of new terms used in this study.
Term
Definition
EA (Effect allele)
The effect allele was that defined by METAL based on trans-ancestry FG results and aligned such that the same allele was kept as the effect allele across all ancestries and traits, irrespective of its allele frequency or effect size for that particular ancestry and trait, in this way the effect allele is not necessarily the trait-increasing allele.
Single-ancestry lead variant
Variant with the smallest p-value amongst all with P < 5x10-8, within a 1Mb region, based on analysis of a single trait in a single ancestry.
Single-ancestry index variants
Variants identified by GCTA analysis of each autosome, and that appear to exert conditionally distinct effects on a given trait in a given ancestry (P < 5x10-8). As defined, these include the single-ancestry lead variants.
Trans-ancestry lead variant
Variant identified by trans-ethnic meta-analysis of a given trait that has the strongest association for that trait (log10BF > 6, which is broadly equivalent to P < 5x10-8) within a 1Mb region.
Single-ancestry locus
1Mb region centred on a single-ancestry lead variant which does not contain a lead variant identified in the trans-ancestry meta-analysis (i.e., does not contain a trans-ancestry lead variant).
Signal
Conditionally independent association between a trait and a set of variants in LD with each other and which is noted by the corresponding index variant.
Trans-ancestry locus
A genomic interval that contains trans-ancestry trait-specific lead variants, with/out additional single-ancestry index variants, for one or more traits. This region is defined by starting at the telomere of each chromosome and selecting the first single-ancestry index variant or trans-ancestry lead variant for any trait. If other trans-ancestry lead variants or single-ancestry index variants mapped within 500kb of the first signal, then they were merged into the same locus. This process was repeated until there were no more signals within 500kb of the previous variant. A 500kb interval was added to the beginning of the first signal, and the end of the last signal to establish the final boundary of the trans-ancestry locus (Extended Data Figure 2). As defined, a trans-ancestry locus may not have a single lead trans-ancestry variant, but may instead contain multiple trans-ancestry lead variants, one for each trait.
Next, we conducted trait-specific trans-ancestry meta-analyses using MANTRA (Methods, Supplementary Table 1, Supplementary Note) to identify genome-wide significant “trans-ancestry lead variants”, defined as the most significant trait-associated variant across all ancestries (log10 Bayes Factor [BF] >6, equivalent to P<5x10-8)[23] (Table 1, Methods). Here, we present trans-ancestry results as our primary results (Supplementary Table 2).Causal variants are expected to affect related glycemic traits and may be shared across ancestries. Therefore, we combined all single-ancestry lead variants, single-ancestry index variants, and/or trans-ancestry lead variants (for any trait) mapping within 500Kb of each other, into a single “trans-ancestry locus” bounded by 500Kb flanking sequences (Table 1, Extended Data Figure 2). As defined, a trans-ancestry locus may contain multiple causal variants affecting one or more glycemic traits, exerting their effect in one or more ancestry.
Extended Data Fig. 2
Locus diagram
Trans-ancestry locus A contains a trans-ancestry lead variant for one glycemic trait represented by the blue diamond, and another single-ancestry index variant for another glycemic trait represented by the orange triangle. Single-ancestry locus B contains a single-ancestry lead variant represented by the purple square. The orange, blue and purple bars represent a +/- 500Kb window around the orange, blue, and purple variants, respectively. The black bars indicate the full locus window where trans-ancestry locus A contains trans-ancestry lead and single-ancestry index variants for two traits and single-ancestry locus B has a single-ancestry lead variant for a single trait.
Glycemic trait locus discovery
Trans-ancestry meta-analyses identified 235 trans-ancestry loci, of which 59 contained lead variants for more than one trait. In addition, we identified seven “single-ancestry loci” that did not contain any trans-ancestry lead variants (Table 1, Supplementary Table 2). Of the 242 combined loci, 99 (including 6 of the 7 single-ancestry) had not been previously associated with any of the four glycemic traits or with T2D, at the time of analysis (Figure 1, Supplementary Table 3, Supplementary note). However, based on recent East Asian and trans-ancestry T2D GWAS meta-analyses[23-27], the lead variants at 27/99 novel glycemic trait loci have strong evidence of association with T2D (P<10-4; 13 loci with P<5x10-8), suggesting they are also important in T2D pathophysiology (Supplementary Tables 2 and 4).
Figure 1
Summary of all 242 loci identified in this study.
235 trans-ancestry loci are shown in orange (novel) or black (established) along with seven single-ancestry loci (blue) represented by nearest gene. Each locus is mapped to corresponding chromosome (outer segment). Each set of rows shows the results from the trans-ancestry analysis (orange) and each of the ancestries: European (purple), African American (tan), East Asian (grey), South Asian (green), Hispanic (yellow), sub-Saharan African (Ugandan-pink). Loci with a corresponding type 2 diabetes signal are represented by red circles in the middle of the plot.
Of the six single-ancestry novel loci, three were unique to non-European ancestry individuals (Supplementary Table 3). An African American association for FI (lead variant rs12056334) near LOC100128993 (an uncharacterized RNA gene; Supplementary Note), an African American association for FG (lead variant rs61909476) near ETS1 and a Hispanic association for FG (lead variant rs12315677) within PIK3C2G (Supplementary Table 3). Despite broadly similar EAF across ancestries, rs61909476 was significantly associated with FG only in African American individuals (EAF ~7%, b=0.0812 mmol/l, SE=0.01 mmol/l, P=3.9×10-8 vs EAF 10-17%, b=0-0.002 mmol/l, se=0.003-0.017 mmol/l, P=0.44-0.95 in all other ancestries, Supplementary table 2, Supplementary note). The nearest gene, ETS1, encodes a transcription factor that is expressed in mouse pancreatic β-cells, and its overexpression decreases glucose-stimulated insulin secretion in mouse islets[28]. Located within the PIK3C2G gene, rs12315677 has an 84% EAF in Hispanic (70-94% in other ancestries) and is significantly associated with FG in this ancestry alone (b=0.0387 mmol/l, SE=0.0075 mmol/l, P=4.0×10-8 vs b=-0.0128-0.010 mmol/l, SE=0.003-0.018 mmol/l, P=0.14-0.76 in all other ancestries, Supplementary note). In mice, deletion of Pik3c2g leads to a phenotype characterized by reduced glycogen storage in the liver, hyperlipidemia, adiposity, and insulin resistance with increasing age, or after a high fat diet[29]. Instances of similar EAFs but differing effect sizes between populations, could be due to genotype-by-environment or other epistatic effects. Alternatively, lower imputation accuracy in smaller sample sizes could deflate effect sizes, although imputation quality for these variants was good (average r2=0.81). Finally, the variants detected here may be in LD with ancestry-specific causal variants not interrogated here that differ in frequency across ancestries. However, we could not find evidence of rarer alleles in the cognate populations from the 1000G project (Supplementary Table 5). The final three single-ancestry loci were identified in individuals of European ancestry (Supplementary note).Next, by rescaling the standard errors of allelic effect sizes to artificially boost the sample size of the European meta-analysis to match that of trans-ancestry meta-analysis, we determined that 21 of the novel trans-ancestry loci would not have been discovered with an equivalent sample size comprised exclusively of European ancestry individuals (Supplementary note). Their discovery was due to the higher EAF and/or larger effect size in non-European ancestry populations. In particular, two loci (near LINC00885 and MIR4278) contain East Asian and African American single-ancestry lead variants, respectively, suggesting that these specific ancestries may be driving the trans-ancestry discovery (Supplementary Tables 2-3). Combined with the three single-ancestry non-European loci described above, our results show that 24% (24/99) of novel loci were discovered due to the contribution of non-European ancestry participants, strengthening the argument for expanding genetic studies in diverse populations.
Allelic architecture of glycemic traits
Single-ancestry and trans-ancestry results combined increased the number of established loci for FG to 102 (182 signals, 53 novel loci), FI to 66 (95 signals, 49 novel loci), 2hGlu to 21 (28 signals, 11 novel loci), and HbA1c to 127 (218 signals, 62 novel loci) (Supplementary Table 2), with significant overlap across traits (Extended Data Figure 3). We also detected (P<0.05 or log10BF>0) the vast majority (~90%) of previously established glycemic signals, 70-88% of which attained genome-wide significance (Supplementary Note, Supplementary Table 6). Given that analyses for FG, FI, and 2hGlu were performed adjusted for BMI, we confirmed that collider bias did not influence >98% of signals discovered (Supplementary note)[31]. As expected, given the greater power due to increased sample sizes, new association signals tended to have smaller effect sizes and/or EAFs in European ancestry individuals compared to established signals (Extended Data Figure 4).
Extended Data Fig. 3
Venn diagram
Overlap of TA loci between traits.
Extended Data Fig. 4
Allele frequency versus effect size
Allele frequency versus effect size for all signals detected through the trans-ancestry meta-analyses, for each of the four traits. Frequency and effect size are from the European meta-analyses. The power curves were computed based on the European sample size for each trait, and the mean (m) and standard deviation (sd) computed on the FENLAND study: FG, m=4.83 mmol/l, sd=0.68; FI, m=3.69 mmol/l, sd=0.60; 2hGlu, m=5.30 mmol/l, sd=1.74; HbA1c, m=5.55%, sd=0.48.
Characterization of lead variants across ancestries
To better understand the transferability of trans-ancestry lead variants across ancestries, we investigated the pairwise EAF correlation and the pairwise summarized heterogeneity of effect sizes between ancestries[32] (Methods, Supplementary Note). Consistent with population history and evolution, these results demonstrated considerable EAF correlation (ρ2>0.70) between European and Hispanic, European and South Asian, and Hispanic and South Asian populations, consistent across all four traits, and between African Americans and Ugandans for HbA1c (Extended Data Figure 5). Despite significant EAF correlations, some pairwise comparisons exhibited strong evidence for effect size heterogeneity between ancestries that was less consistent between traits (Extended Data Figure 5). However, sensitivity analyses demonstrated that, across all comparisons, the evidence for heterogeneity is driven by a small number of variants, with between 81.5% (for HbA1c) and 85.7% of trans-ancestry lead variants (for FG) showing no evidence for trans-ancestry heterogeneity (P>0.05) (Supplementary Note).
Extended Data Fig. 5
EAF correlation and heterogeneity test
Pearson correlation of EAF on the lower tri-angle and p-value of one-side heterogeneity test without multiple testing corrections on the upper tri-angle of the trans-ancestry lead variants associated with each trait between ancestries. Correlations > 0.7 are in bold.
Trait variance explained by associated loci
The trait variance explained by genome-wide significant loci was assessed using the single-ancestry variants only or a combination of single-ancestry and trans-ancestry variants (Supplementary Table 7) with betas extracted from the relevant single-ancestry meta-analysis results (Methods). The variance explained was assessed by linear regression in a subset of the contributing cohorts (Methods, Supplementary Tables 8-11). In general, the approach that explained the most variance was to begin with the trans-ancestry lead variants that had P<0.1 in the relevant single-ancestry meta-analysis, then add in all single-ancestry variants that were not in LD with the trans-ancestry variants (LD r2<0.1) (List C, Supplementary Tables 8-11, Figure 2). Using this approach, the mean variance in the trait distribution explained was between 0.7% (2hGlu in EUR) and 6% (HbA1c in AA). The European-based estimates explained more variance relative to previous estimates of 2.8% for FG and 1.7% for HbA1c[33] (Supplementary Note).
Figure 2
Trait variance explained by associated loci.
The boxplots show the maximum, first quartile, median, third quartile and minimum of trait variance explained when using a genetic score with single-ancestry lead and index variants (EUR, AA, EAS, HISP and SAS) or a combination of individual trait trans-ancestry lead variants and single-ancestry lead and index variants (TA+EUR, TA+AA, TA+EAS, TA+HISP and TA+SAS). Variance explained for each trait (FG, FI and HbA1c) in each ancestry is shown on different panels and in different colors. Data points represent the variance explained in individual cohorts used in this analysis. R2 was estimated in 1 to 11 cohorts with sample sizes ranging from 489 to 9,758 (Supplementary Tables 8-11).
Transferability of EUR ancestry-derived polygenic scores
To investigate the transferability of polygenic scores across ancestries we used the PRS-CSauto software[34] to first build polygenic scores for each glycemic trait based on European ancestry data. However, the training set for 2hGlu was too small so this trait was excluded. To build the polygenic scores (PGS), for each trait we first removed five of the largest European cohorts from the European ancestry meta-analysis. These five cohorts were meta-analyzed and used as our European ancestry test dataset, for each trait. The remaining European ancestry cohorts were also meta-analyzed and used as the training dataset, from which we derived a PGS for each trait (Methods). We used PRS-CSauto to revise the effect size estimates for the variants in the score (obtained from the training European datasets) based on the LD of the test population. PRS-CSauto does not have LD reference panels for South Asian or Hispanic ancestry and as such we were unable to test the transferability of the PGS into those populations. The “gtx” package[35] (Methods) was used to obtain the R2 for each test population (Figure 3, Supplementary Table 12). Consistent with other complex traits[36], the European ancestry-derived PGS had greater predictive power into test data of European ancestry than other ancestry groups.
Figure 3
Transferability of PGS across ancestries.
For each trait, the barplots represent trait variance explained when using a European ancestry-derived PGS in European, East Asian and African American test datasets. Variance explained (the height of each bar) for each trait (FG, FI and HbA1c) in each ancestry is shown on different panels and in different colors.
Fine-mapping
We fine-mapped, 231 trans-ancestry and six single-ancestry autosomal loci (Supplementary Table 2, Supplementary note). Using FINEMAP with ancestry-specific LD and an average LD matrix across ancestries, we conducted fine-mapping both within (161 loci with single-ancestry lead variants) and across ancestries (231 loci) for each trait (Methods). Because 59 of the 231 trans-ancestry loci were associated with more than one trait, we conducted trans-ancestry fine-mapping for a total of 305 locus-trait associations. Of these 305 locus-trait combinations, FINEMAP estimated the presence of a single causal variant at 186 loci (61%), while multiple distinct causal variants were implicated at 126 loci (39%), for a total of 464 causal variants (Figure 4A).
Figure 4
Trans-ancestry fine-mapping.
A) Number of plausible causal variants at each locus-trait association derived from FINEMAP. B) Number of variants within each 99% credible set. Twenty-one locus-trait associations at 19 loci were mapped to a single variant in the 99% credible set. C) Fine-mapping resolution. For each of the 98 locus-trait associations with a predicted single causal variant in both trans-ancestry and single-ancestry analyses, the number of variants included in the 99% credible set in the single-ancestry fine-mapping (x axis; logarithmic scale) is plotted against those in the trans-ancestry fine-mapping (y axis; logarithmic scale). Trans-ancestry and single-ancestry fine-mapping were based on the same set of variants. After removing eight locus-trait associations with one variant in the 99% credible sets in both trans-ancestry and single-ancestry analyses, there were 18 locus-trait associations (in grey) where trans-ancestry fine-mapping did not improve the resolution of fine-mapping results (i.e. number of variants in the 99% credible set did not decrease). Of the 72 locus-trait associations with improved trans-ancestry fine-mapping resolution (blue and red) further analyses in European fine-mapping emulating the total sample size in trans-ancestry fine-mapping demonstrated that 34 locus-trait associations (in red) were improved because of both total sample size and differences across ancestries, while 38 locus-trait associations (in blue) were only improved due to increased sample size in the original trans-ancestry fine-mapping analysis.
Credible sets for causal variants
At each locus, we next constructed credible sets (CS) for each causal variant that account for >=99% of the posterior probability of association (PPA). We identified 21 locus-trait associations (at 19 loci) for which the 99% CS included a single variant, and we highlight four examples (Methods, Supplementary Note, Figure 4B, Supplementary Table 13).At MTNR1B and SIX3 we identified, respectively, rs10830963 (PPA>0.999, for both HbA1c and FG) and rs12712928 (PPA=0.997, for FG) as the likely causal variants. At both loci previous studies confirm these variants affect transcriptional activity[37,38,39] (Supplementary note). At a locus near PFKM associated with HbA1c, trans-ancestry fine-mapping identified rs12819124 (PPA>0.999) as the likely causal variant. This variant has been previously associated with mean corpuscular hemoglobin[40], suggesting an effect on HbA1c via the red blood cell (RBC, Supplementary note). At HBB, we identifed rs334 (PPA>0.999; Glu7Val) as the likely causal variant associated with HbA1c. rs334 is a causal variant of sickle cell anemia[41], previously associated with urinary albumin-to-creatinine ratio in Caribbean Hispanic individuals[42], severe malaria in a Tanzanian study population[43], hematocrit and mean corpuscular volume in Hispanic/Latino populations[44], and RBC distribution in Ugandan individuals[45], all pointing to a variant effect on HbA1c via non-glycemic pathways.The remaining locus-trait associations with a single variant in the 99% CS (Supplementary Table 13) point to variants that could be prioritized for functional follow-up to elucidate impact on glycemic trait physiology.At an additional 156 locus-trait associations trans-ancestry fine-mapping identified 99% CS with 50 or fewer variants (Figure 4B, Supplementary Table 13). Consistent with the potential for >1 causal variant in a locus, 74 locus-trait associations contained 88 variants with PPA>0.90 that are strong candidate causal variants (Supplementary Table 14). For example, 10 are coding variants including several missense such as the HBB Glu7Val mentioned above, GCKR Leu446Pro, RREB1 Asp1771Asn, G6PC2 Pro324Ser, GLP1R Ala316Thr, and TMPRSS6 Val736Ala, each of which have been proposed or shown to affect gene function[12,46-50]. We additionally identified AMPD3 Val311Leu (PPA=0.989) and TMC6 Trp125Arg (PPA>0.999) variants associated with HbA1c which were previously detected in an exome array analysis but had not been fine-mapped with certainty due to the absence of backbone GWAS data[30]. Our fine-mapping now suggest these variants are likely causal and identify their cognate genes as effector transcripts.Finally, we evaluated the resolution obtained in the trans-ancestry versus single-ancestry fine-mapping (Methods, Supplementary Note). We compared the number of variants in 99% CS across 98 locus-trait associations which, as suggested by FINEMAP, had a single causal variant in both trans-ancestry and single-ancestry analyses. Fine-mapping within and across ancestries was conducted using the same set of variants. At 8 of 98 locus-trait associations single-ancestry fine-mapping identified a single variant in the CS. In addition, at 72 of the 98 locus-trait associations, the number of variants in the 99% CS was smaller in the trans-ancestry fine-mapping (Figure 4C), which likely reflects the larger sample size and differences in LD structure, EAFs, and effect sizes across diverse populations. To quantify the estimated improvement in fine-mapping resolution attributable to the multi-ancestry GWAS, we then compared 99% CS sizes from the trans-ancestry fine-mapping to single-ancestry-specific data emulating the same total sample size by rescaling the standard errors (Methods). Of the 72 locus-trait associations with estimated improved fine-mapping in trans-ancestry analysis, resolution at 38 (53%) was improved because of the larger sample size in the trans-ancestry fine-mapping analysis (Figure 4C), and this estimated improved resolution would likely have been obtained in a European-only fine-mapping effort with equivalent sample size. However, at 34 (47%) loci, the inclusion of samples from multiple diverse populations yielded the estimated improved resolution. On average, ancestry differences led to a reduction in the median number of variants in the 99% CS from 24 to 15 variants (37.5% median reduction; Figure 4C), demonstrating the value of conducting fine-mapping across ancestries.
HbA1c Signal Classification
HbA1c-associated variants can exert their effects on HbA1c levels through both glycemic and non-glycemic pathways [7,51] and their correct classification can affect T2D diagnostic accuracy[7,52]. Using prior association results for other glycemic, RBC, and iron traits, and a fuzzy clustering approach we classified variants into their most likely mode of action (Methods, Supplementary note). Of the 218 HbA1c-associated variants, 27 (12%) could not be characterized due to missing data and 23 (11%) could not be classified into a “known” class (Supplementary note). The remaining signals were classified as principally: a) glycemic (n=53; 24%), b) affecting iron levels/metabolism (n=12; 6%), or c) RBC traits (n=103; 47%). A genetic risk score (GRS) composed of all HbA1c-associated signals was strongly associated with T2D risk (OR=2.4, 95% CI 2.3-2.5, P=2.7x10-298). However, when using partitioned GRSs composed of these different classes of variants (Methods), we found the T2D association was mainly driven by variants influencing HbA1c through glycemic pathways (OR=2.6, 95% CI 2.5-2.8, P=2.3x10-250), with weaker evidence of association (despite the larger number of variants in the GRS) and a more modest risk (OR=1.4, 95% CI 1.2-1.7, P=4.7x10-4) imparted by signals in the mature RBC cluster that were not glycemic (i.e. where those specific variants had P>0.05 for FI, 2hGlu and FG) (Extended Data Figure 6, Supplementary note). This contrasts our previous finding where we found no significant association between a risk score of non-glycemic variants and T2D[7]. Our current results could be partly driven by T2D cases being diagnosed based on HbA1c levels that may be influenced by the non-glycemic signals, or by glycemic effects not captured by FI, 2hGlu or FG measures.
Extended Data Fig. 6
Forest plot of T2D GRS from HbA1c variants
The p-value on the right side is from the two-side test without multiple testing corrections. Vertical points of each diamond represent the point estimate of the odds ratio. The horizontal points of each diamond represent the 95% confidence interval of the odds ratio. Figure shows the association results between HbA1c-associated variants built into a GRS for T2D by taking each HbA1c-associated variant and using a weight that corresponds to its T2D effect size (logOR) based on analysis by the DIAGRAM consortium. The overall GRS is subsequently partitioned according to the HbA1c signal classification. The overall and partitioned GRS were tested for association with T2D based on data from UK biobank.
Biological signatures of glycemic trait associated loci
To better understand distinct and shared biological signatures underlying variant-trait associations, we conducted genomic feature enrichment, eQTL co-localization, and tissue and gene-set enrichment analyses across all four traits.
Epigenomic landscape of trait-associated variants
We explored the genomic context underlying glycemic trait loci by computing overlap enrichment for annotations such as coding, conserved regions, and super enhancers merged across multiple cell types[53-55] using the GREGOR tool[56]. We observed that FG, FI and HbA1c signals (Supplementary Table 7) were significantly (P<8.4x10-4, Bonferroni threshold for 59 annotations) enriched in evolutionarily conserved regions (Fig 5A, Extended Data Figure 7, Supplementary Table 15).
Figure 5
Epigenomic landscape of trait-associated variants.
A: Enrichment of GWAS variants to overlap genomic regions including ‘Static Annotations’ which are common or ‘static’ across cell types and ‘Stretch Enhancers’ which are identified in each tissue/cell type. The numbers of signals for each trait are indicated in parentheses. Enrichment was calculated using GREGOR [56]. One-sided test for significance (red) is determined after Bonferroni correction to account for 59 total annotations tested for each trait; nominal significance (P<0.05) is indicated in yellow. B: Enrichment for HbA1c GWAS signals partitioned into “hard” Glycemic and Red Blood Cell cluster (signals from “hard” mature Red Blood Cell and reticulocyte clusters together) to overlap annotations including StrEs in Islets and the blood-derived leukemia cell line K562, respectively (additional partitioned results in Supplementary Table 17). C: Individual FI GWAS signals that drive enrichment in Adipose and Skeletal Muscle StrEs. D, E: Genome browser shots of FI GWAS signals – intronic region of the COL4A2 gene (D) and an inter-genic region ~25kb from LINC01214 gene (E) showing GWAS SNPs (lead and LD r2>0.8 proxies), ATAC-seq signal tracks and chromatin state annotations in different tissues/cell types.
Extended Data Fig. 7
Enrichment of glycemic trait associated GWAS variants to overlap genomic annotations using GREGOR
Figure shows enrichment for 59 total static and stretch enhancer annotations considered. One-side test significance (red) is determined after Bonferroni correction to account for 59 total annotations tested for each trait; nominal significance (P<0.05) is indicated in yellow.
We then considered epigenomic landscapes defined in individual cell/tissue types. Previously, stretch enhancers (StrE, enhancer chromatin states ≥3kb in length) in pancreatic islets were shown to be highly cell-specific and strongly enriched with T2D risk signals[57]. Considering StrEs across 31 cell-types[39], FG and 2hGlu signals showed the highest enrichment in islets (FG: fold-enrichment=4.70, P=2.7x10-24; 2hGlu: fold-enrichment=5.51, P=3.6x10-4
Figure 5A, Supplementary Table 16), highlighting the importance of islets for these traits. FI signals were enriched in skeletal muscle (fold-enrichment=3.17, P=7.8x10-6) and adipose StrEs (fold-enrichment=3.27, P=1.8x10-7) consistent with these tissues as targets of insulin action (Figure 5A). StrEs in individual cell types showed higher enrichment than super enhancers merged across cell types, highlighting the importance of cell-specific analyses (Figure 5A). HbA1c signals were enriched in StrEs of multiple cell types and tissues, but have the strongest enrichment in K562 leukemia derived cells (fold-enrichment=3.24, P=1.2x10-7, Figure 5A). Among the “hard” glycemic and red blood cell (mature + reticulocyte) HbA1c signals, glycemic signals were enriched in islet StrEs (fold-enrichment=3.96, P=3.7x10-16) while red blood cell signals were enriched in K562 StrEs (fold-enrichment=7.5, P=2.08x10-14, Figure 5B, Supplementary Table 17). These analyses suggest that these glycemic trait-associated variants influence the function of tissue-specific enhancers.Independent analyses with fGWAS[58] and GARFIELD[59] yielded consistent results (Extended Data Figures 8 and 9, Supplementary Tables 16 and 18). Notably, FI signals at a lenient threshold of P<10-5 were enriched in liver StrEs using GARFIELD (odds ratio=1.92, P=1.7x10-4) (Extended Data Figure 9A). This suggests that liver regulatory annotations are relevant for FI GWAS signals, but that we lack power to detect significant enrichment using the genome-wide significant loci and the current set of reference annotations.
Extended Data Fig. 8
Enrichment of glycemic trait associated GWAS variants to overlap genomic annotations using fGWAS
Figure shows log2(Fold Enrichment) of GWAS variants to overlap 59 static and stretch enhancer annotations calculated. Significant enrichment (red) is considered if the 95% confidence intervals (shown by the error bars) do not overlap 0.
Extended Data Fig. 9
Enrichment of glycemic trait associated GWAS variants to overlap genomic annotations using GARFIELD
Figure shows the beta or effect size (log odds ratio) for GWAS variants to overlap 59 static and stretch enhancer annotations. GWAS variants were included at two significance thresholds, 1e-05 (A) and 1e-08 (B). One-side test significance (red) is determined after Bonferroni correction to account for effective annotations tested for each trait reported by GARFIELD (see supplementary note); nominal significance (P<0.05) is indicated in yellow. The 95% confidence intervals are shown by the error bars.
We next explored the 27 loci driving the FI enrichment in adipose and skeletal muscle, 11 of which overlapped StrEs in both tissues (Figure 5C). At the COL4A2 locus, variants within an intronic region overlap StrEs in adipose tissue, skeletal muscle, and a human skeletal muscle myoblast (HSMM) cell line that are not shared across other cell/tissue types. Among these, rs9555695 (in the 99% CS) also overlaps accessible chromatin regions in adipose (Figure 5D). At a narrow signal with no proxy variants (LD r2>0.7 in Europeans), the lead trans-ancestry variant rs62271373 (PPA = 0.94) located in an intergenic region ~25kb from the LINC01214 gene overlaps StrEs specific to adipose and HSMM and an active enhancer chromatin state in skeletal muscle (Figure 5E). Collectively, the tissue-specific epigenomic signatures at GWAS signals provide an opportunity to nominate tissues where these variants are likely to be active. This map may help future efforts to deconvolute GWAS signals into tissue-specific disease pathology.
Co-localization of GWAS and eQTLs
Among the 99 novel glycemic trait loci, we identified co-localized eQTLs at 34 loci in blood, pancreatic islets, subcutaneous or visceral adipose, skeletal muscle, or liver, providing suggestive evidence of causal genes (Supplementary Table 19). The co-localized eQTLs include several genes previously reported at glycemic trait loci: ADCY5, CAMK1D, IRS1, JAZF1, and KLF14
[60-62]. For some additional loci, the co-localized genes have prior evidence for a role in glycemic regulation. For example, the lead trans-ancestry variant and likely causal variant, rs1799815 (PPA=0.993), associated with FI is the strongest variant associated with expression of INSR, encoding the insulin receptor, in subcutaneous adipose from METSIM (P=2x10-9) and GTEx (P=5x10-6). The A allele at rs1799815 is associated with higher FI and lower expression of INSR, consistent with the relationship between insulin resistance and reduced INSR function[63]. In a second example, rs841572, the trans-ancestry lead variant associated with FG, has the highest PPA (PPA=0.535) among the 20 variants in the 99% CS and is in strong LD (r2=0.87) with the lead eQTL variant (rs841576, also in the 99% CS) associated with SLC2A1 expression in blood (eQTLGen P=1x10-8). SLC2A1, also known as GLUT1, encodes the major glucose transporter in brain, placenta, and erythrocytes, and is responsible for glucose entry into the brain[64]. rs841572-A is associated with lower FG and lower SLC2A1 expression. While rare missense variants in SLC2A1 are an established cause of seizures and epilepsy[65], our data suggest that SLC2A1 variants also affect plasma glucose levels within a population. These co-localized signals provide possible regulatory mechanisms for variant effects on genes to influence glycemic traits.The co-localized eQTLs also provide new insights into the mechanisms at glycemic trait loci. For example, rs9884482 (in the 99% CS) is associated with FI and TET2 expression in subcutaneous adipose (P=2x10-20); rs9884482 is in high LD (r2=0.96 in Europeans) with the lead TET2 eQTL variant (rs974801). TET2 encodes a DNA-demethylase that can affect transcriptional repression [66]. Adipose Tet2 expression is reduced in diet-induced insulin resistance in mice[67], and knockdown of Tet2 blocked adipogenesis[67,68]. Consistently, in human adipose tissue, rs9884482-C was associated with lower TET2 expression and higher FI. In a second example, rs617948 is associated with HbA1c (in the 99% CS) and is the lead variant associated with C2CD2L expression in blood (eQTLGen P=3x10-96). C2CD2L, also known as TMEM24, regulates pulsatile insulin secretion and facilitates release of insulin pool reserves[69,70]. rs617948-G was associated with higher HbA1c and lower C2CD2L, providing evidence for a role of this insulin secretion protein in glucose homeostasis. Our HbA1c “soft” clustering assigned this signal to both the “unknown” (0.51 probability) and “reticulocyte” (0.42 probability) clusters. rs617948 is strongly associated with HbA1c (P<6.8x10-8), but not with FG, FI or 2hGlu (P>0.05, Supplementary Table 20, Supplementary Note). This suggests an effect of this variant on reticulocyte biology, and on insulin secretion, potentially influencing HbA1c levels through different tissues, and providing a plausible explanation for the classification as “unknown”.
Tissue Expression
Consistent with effector transcript expression analysis using GTEx data[30], we found significant differences in tissue expression across the glycemic trait signals. FG signals were enriched for genes expressed in the pancreas (FDR<0.05), while there were an insufficient number of significant associations in 2hGlu to identify enrichment for any tissue or cell type at FDR<0.2 threshold. FI signals were enriched for connective tissue and cells (which includes adipose tissue), endocrine glands, blood cells, and muscles (FDR<0.2) and HbA1c signals were significantly enriched for genes expressed in the pancreas, hemic, and immune system (FDR<0.05) (Figure 6, Supplementary Table 21). Consistent with previous analysis[30], FI-enrichment for connective tissue was driven by adipose tissue (subcutaneous and visceral), while the newly described enrichment with endocrine glands was driven by the adrenal glands and cortex (Supplementary Table 21). Beyond enrichment for genes expressed in glycemic-related tissues, HbA1c signals were enriched with genes expressed in blood, consistent with the role of RBC in this trait and our previous results[30].
Figure 6
Tissues and cell types significantly enriched for genes within glycemic-associated loci.
Top panel FG-associated loci, middle panel FI-associated loci, bottom panel Hba1c-associated loci. FDR thresholds are shown in red (q<0.05), orange (q<0.2), black (q≥0.2).
The association between FI signals and genes expressed in adrenal glands is notable, suggesting a possible direct role for these genes in insulin resistance. These genes might influence cortisol levels, which could contribute to insulin resistance and FI levels through impaired insulin receptor signaling in peripheral tissues, as well as influencing body fat distribution, stimulate lipolysis, and other indirect mechanisms[71,72].
Gene-set Analyses
Next, we performed gene-set analysis using DEPICT (Methods). In keeping with previous results[30], we found distinct gene-sets enriched (FDR<0.05) for each glycemic trait except 2hGlu, which had insufficient associations to have power in this analysis. FG-associated variants highlighted gene-sets involved in metabolism and gene-sets involved in general cellular function such as “cytoplasmic vesicle membrane” and “circadian clock”” (Figure 7A). In contrast, in addition to metabolism-related gene-sets, FI-associated variants highlighted pathways related to growth, cancer and reproduction (Figure 7B). This is consistent with the role of insulin as a mitogenic hormone, and with epidemiological links between insulin and certain types of cancer[73] and reproductive disorders such as polycystic ovary syndrome[74]. HbA1c-associated variants highlighted many gene-sets (Figure 7C), including those linked to metabolism and hematopoiesis, again recapitulating our postulated effects of variants on glucose and RBC biology. Additional pathways from HbA1c-associated variants also highlighted previous “CREBP PPI” and lipid biology related to T2D[75] and HbA1c[76], respectively, and potential new biology through which variants may influence HbA1c.
Figure 7
Gene-set enrichment analyses.
Results from affinity-propagation clustering of significantly enriched gene-sets (FDR<0.05) identified by DEPICT for A) FG, B) FI, and C) HbA1c. Each node is a meta gene-set which is represented by an exemplar gene-set within the meta gene-set. For example, in B. “chronic myeloid leukemia “ is an exemplar gene-set representing a much broader meta gene-set relating to cancer and represented in the zoomed in section on the right. Similarities between the meta gene-sets are represented by Pearson correlation coefficients (r>0.3). The nodes are colored according to the minimum gene-set enrichment p-value for gene-sets in that meta gene-set.. PPI=protein-protein interaction network.
Discussion
Here we describe a large glycemic trait meta-analysis of GWAS for which 30% of the population was composed of East Asian, Hispanic, African-American, South Asian and sub-Saharan African participants. This effort identified 242 loci (235 trans-ancestry and seven single-ancestry), which jointly explain between 0.7% (2hGlu in European ancestry individuals) and 6% (HbA1c in African American ancestry individuals) of the variance in glycemic traits in any given ancestry. While 114/242 loci are associated with T2D (P<10-4; 83 loci with P<5x10-8, Supplementary Table 4), absence of strong evidence of association at the remaining loci (P≥10-4) suggests that for alleles more frequent than 5% we can exclude T2D ORs≥1.07 with 80% power (alpha=5x10-8; and ORs≥1.05 for alpha=10-4) given a current study of 228,499 T2D cases and 1,178,783 controls[27]. We identified 486 signals associated with glycemic traits, of which eight have MAF<1%, and 45 have 1%<=MAF<5% in all ancestries, highlighting that 89% of signals identified are common in at least one ancestry studied.A key aim of our study was to evaluate the added advantage of including population diversity in genetic discovery and fine-mapping efforts. Beyond the larger sample size included in the trans-ancestry meta-analysis, we were able to estimate the contribution of non-European ancestry data in locus discovery and fine-mapping resolution. We found that 24 of the 99 newly discovered loci owe their discovery to the inclusion of East Asian, Hispanic, African-American, South Asian and sub-Saharan African participant data, due to differences in EAF and effect sizes across ancestries.Comparison of 295 trans-ancestry lead variants (315 locus-trait associations) across ancestries demonstrated that between 81.5% (HbA1c) and 85.7% (FG) of the trans-ancestry lead variants had no evidence of trans-ancestry heterogeneity in allelic effects (P>0.05).Given sample size and power limitations, genome-wide significant trait-associated variants in a single-ancestry explain only a modest proportion of trait variance in that ancestry (Figure 2). We demonstrate that trans-ancestry lead variants explain more trait variance than the ancestry-specific variants (Figure 2). This shows that even though some trans-ancestry lead variants are not genome-wide significant in all ancestries, they contribute to the genetic architecture of the trait in most ancestries.We evaluated for the first time the transferability of European ancestry-derived glycemic trait PGS into other ancestries. Consistent with other traits[36,77,78], we confirm that European ancestry-derived PGS perform much worse when the test dataset is from a different ancestry. Each trait-specific PGS improves trait variance explained by between 3.5-fold (HbA1c) and 6-fold (FG) in the European dataset (Figure 3, Supplementary Table 12) compared to a score built only from trans-ancestry lead variants and European index variants (Figure 2, Supplementary tables 9-12).Despite development of approaches to derive polygenic risk scores[79], we note the difficulty in using summary level data to build a PGS in one ancestry and then apply it in test datasets of different ancestry. While PRS-CSauto[34] is able to use summary level data, revision of the effect size estimates to account for LD required reference panels that matched the ancestry of the test dataset. However, the current software lacks appropriate reference panels for many ancestries, precluding its broad application. Future developments of trans-ancestry PGS are required for improved cross-ancestry performance.We show that fine-mapping resolution is improved in trans-ancestry, compared to single-ancestry fine-mapping efforts. In ~50% of our loci, we showed that the improvement was due to differences in EAF, effect size, or LD structure between ancestries, and not just due to the overall increased sample size available for trans-ancestry fine-mapping. By performing trans-ancestry fine-mapping, and co-localizing GWAS signals with eQTL signals and coding variants, we identified new candidate causal genes. Altogether, these results motivate continued expansion of genetic and genomic efforts in diverse populations to improve understanding of these traits in groups disproportionally affected by T2D.Given data on four different glycemic traits and their utility to diagnose and monitor T2D and metabolic health, we also sought to characterize biological features underlying these traits. We show that despite significant sharing of loci across the four traits, each trait is also characterized by unique features based on StrE, gene expression and gene-set signatures. Combining genetic data from these traits with T2D data will further elucidate pathways driving normal physiology and pathophysiology, and help further develop useful predictive scores for disease classification and management[4,5].
Online Methods
Study design and participants
This study included trait data from four glycemic traits: fasting glucose (FG), fasting insulin (FI), 2hr post-challenge glucose (2hGlu), and glycated hemoglobin (HbA1c). The total number of contributing cohorts ranged from 41 (2hGlu) to 131 (FG), and the maximum sample size for each trait ranged from 85,916 (2hGlu) to 281,416 (FG) (Supplementary Table 1). Ancestry was initially defined at the cohort level, but within each cohort ancestry was confirmed with genetic data with ancestry outliers removed (Supplementary Table 1). Overall, European ancestry (EUR) participants dominated the sample size for all traits, representing between 68.0% (HbA1c) to 73.8% (2hGlu) of the overall sample size. African Americans (AA) represented between 1.7% (2hGlu) to 5.9% (FG) of participants; individuals of Hispanic ancestry (HISP) represented between 6.8% (FG) to 14.6% (2hGlu) of participants; individuals of East-Asian ancestry (EAS) represented between 9.9% (2hGlu) to 15.4% (HbA1c) of participants; and South-Asian ancestry (SAS) individuals represented between 0% (no contribution to 2hGlu) to 4.4% (HbA1c) of participants. Data from Ugandan participants were only available for the HbA1c analysis and represented 2% of participants.
Phenotypes
Analyses included data for FG and 2hGlu measured in mmol/l, FI measured in pmol/l, and HbA1c in % [where possible, studies reported HbA1c as a National Glycohemoglobin Standardization Program (NGSP) percent]. Similar to previous MAGIC efforts[7], individuals were excluded if they had type 1 or type 2 diabetes (defined by physician diagnosis); reported use of diabetes-relevant medication(s); or had a FG ≥7 mmol/L, 2hGlu ≥11.1mmol/L, or HbA1c ≥ 6.5%, as detailed in Supplementary Table 1. 2hGlu measures were obtained 120 minutes after a glucose challenge in an oral glucose tolerance test (OGTT). Measures for FG and FI taken from whole blood were corrected to plasma level using the correction factor 1.13[80].
Genotyping, quality control, and imputation
Each participating cohort performed study-level quality control, imputation, and association analyses following a shared analysis plan. Cohorts were genotyped using commercially available genome-wide arrays or the Illumina CardioMetabochip (Metabochip) array (Supplementary Table 1)[81]. Prior to imputation, each cohort performed stringent sample and variant quality control (QC) to ensure only high-quality variants were kept in the genotype scaffold for imputation. Sample quality control checks included removing samples with low call rate < 95%, extreme heterozygosity, sex mismatch with X chromosome variants, duplicates, first- or second-degree relatives (unless by design), or ancestry outliers. Following sample QC, cohorts applied variant QC thresholds for call rate (< 95%), Hardy-Weinberg Equilibrium (HWE) P < 1x10-6, and minor allele frequency (MAF). Full details of QC thresholds and exclusions by participating cohort are available in Supplementary Table 1.Imputation was performed up to the 1000 Genomes Project phase 1 (v3) cosmopolitan reference panel[82], with a small number of cohorts imputing up to the 1000 Genomes phase 3 panel[19] or population-specific reference panels (Supplementary Table 1).
Study level association analyses
Each of the glycemic traits (FG, natural log FI, and 2hGlu) were regressed on BMI (except HbA1c), study-specific covariates, and principal components (unless implementing a linear mixed model). Analyses for FG, FI, and 2hGlu were adjusted for BMI as we had previously shown this did not materially affect results for FG and 2hGlu but improved our ability to detect FI-associated loci[15]. For simplicity, we refer to the traits as FG, FI and 2hGlu. For a discussion on collider bias see Supplementary Note section 2c. Both the raw and rank-based inverse normal transformed residuals from the regression were tested for association with genetic variants using SNPTEST[23] or Mach2Qtl[83,84]. Poorly imputed variants, defined as imputation r2 < 0.4 or INFO score < 0.4, were excluded from downstream analyses (Supplementary Table 1). Following study level QC, approximately 12,229,036 variants (GWAS cohorts) and 1,999,204 variants (Metabochip cohorts) were available for analysis (Supplementary Table 1).
Centralized quality control
Each contributing cohort shared their summary statistic results with the central analysis group who performed additional QC using EasyQC[85]. Allele frequency estimates were compared to estimates from 1000Gp1 reference panel[82], and variants were excluded from downstream analyses if there was a minor allele frequency difference > 0.2 for AA, EUR, HISP, and EAS populations against AFR, EUR, MXL, and ASN populations from 1000 Genomes Phase 1, respectively, or a minor allele frequency difference > 0.4 for SAS against EUR populations. At this stage, additional variants were excluded from each cohort file if they met one of the following criteria: were tri-allelic; had a minor allele count (MAC) < 3; demonstrated a standard error of the effect size ≥ 10; or were missing an effect estimate, standard error, or imputation quality. All data that survived QC (approximately 12,186,053 variants from GWAS cohorts and 1,998,657 variants from Metabochip cohorts) were available for downstream meta-analyses.
Single-ancestry meta-analyses
Single-ancestry meta-analyses were performed within each ancestry group using the fixed-effects inverse variance meta-analysis implemented in METAL[20]. We applied a double-genomic control (GC) correction[15,86] to both the study-specific GWAS results and the single-ancestry meta-analysis results. Study-specific Metabochip results were GC-corrected using 4,973 SNPs included on the Metabochip array for replication of associations with QT-interval, a phenotype not correlated with our glycemic traits[15].
Identification of single-ancestry index variants
To identify distinct association index variants across each chromosome within each ancestry (Table 1), we performed approximate conditional analyses implemented in GCTA[21] using the --cojo-slct option (autosomes) and distance-based clumping (X chromosome). Linkage disequilibrium (LD) correlations for GCTA were estimated from a representative cohort from each ancestry: WGHS (EUR); CHNS (EAS); SINDI (SAS); BioMe (AA); SOL (HISP) and Uganda (for itself). The results from GCTA were comparable when using alternative cohorts for the LD reference. For any index variant with a QC flag which caused reason for concern, we performed manual inspection of forest plots to decide whether the signal was likely to be real (Supplementary note). Among 335 single-ancestry index variants across all traits, this manual inspection was done for 40 signals of which 32 passed and 8 failed after inspection. Thus, a total of 327 single-ancestry index variants passed and 8 failed.
Trans-ancestry meta-analyses
To leverage power across all ancestries, we also conducted trait-specific trans-ancestry meta-analysis by combining the single-ancestry meta-analysis results using MANTRA (Supplementary note)[87].We defined log10Bayes’ Factor (BF) > 6 as genome-wide significant, approximately comparable to P < 5x10-8.
Manual curation of trans-ancestry lead variants
To ensure trans-ancestry lead variants were robust, we performed manual inspection of forest plots by at least two authors, for any variants with flags indicating possible QC issues (Supplementary note). Of 463 trans-ancestry lead variants across all traits, 184 passed without inspection, 131 passed after inspection, and 148 failed after inspection.
Comparison of TA lead variants across ancestries
For each pair of ancestries, we calculated Pearson’s correlation in EAFs for each trans-ancestry lead variant. The pairwise summarized heterogeneity of effect sizes between ancestries was then tested using the joint F-test of heterogeneity[32]. The test statistic is the sum of Cochran Q-statistics for heterogeneity across all trans-ancestry signals. Under the null hypothesis, the statistics follows the χ2 distribution with n degrees of freedom, where n is the number of the trans-ancestry lead variants.
LD-pruned variant lists
Several downstream analyses (for example, genomic feature enrichment, genetic scores, and estimation of variance explained by associated variants) require independent LD-pruned variants (r2<0.1) to avoid double-counting variants which might otherwise be in LD with each other and that do not provide additional “independent” evidence. Therefore, for these analyses we generated different lists of either TA or single-ancestry LD pruned (r2<0.1) variants, keeping in each case the variant with the strongest evidence of association (Supplementary Table 7). Subsequently, we combined TA and single-ancestry variant lists and conducted further LD pruning. For some analyses, we took the TA pruned variant list and added single-ancestry signals if the LD r2<0.1, while for others we started with the single-ancestry pruned lists and supplemented with TA lead variants if the LD r2<0.1. One exception was the list used for eQTL co-localizations, which included all single-ancestry European signals (without LD pruning) and supplemented with any additional TA lead variants (starting from the variants with the most significant P-values) in EUR LD r2<0.1 with any of the variants already in list, and that reached at least P<1x10-5 in the European ancestry meta-analysis.To determine how much of the phenotypic variance of each trait could be explained by the corresponding trait-associated loci, variants were combined in a series of weighted genetic scores (GS). The analysis was performed in a subset of the cohorts included in the discovery GWAS (with representation from each ancestry) and in a smaller number of independent cohorts (European ancestry only). Up to three different GS were derived per trait (and for each ancestry) in order to evaluate the potential for the trans-ancestry meta-GWAS identified loci to provide additional information above and beyond that contributed by the ancestry-specific meta-analysis results. These GS comprised: List A - single-ancestry signals; List B - single-ancestry signals plus trans-ancestry signals; and List C - trans-ancestry signals plus single-ancestry signals (Supplementary Table 7). In the case of the European ancestry cohorts that contributed to the GWAS, we employed the method of Nolte et al.[33] to adjust the effect sizes (betas) from the GWAS for the contribution of that cohort, providing sets of cohort-specific effect sizes that were then used to generate the GS. The association between each GS and its corresponding trait was tested by linear regression and the adjusted R2 from the model extracted as an estimate of the variance explained.
Transferability of polygenic scores (PGS) across ancestries
We used the PRS-CSauto[34] software to first build European ancestry-derived PGS for each glycemic trait (FG, FI, 2hGlu, HbA1c) on the basis of summary statistics. However, PRS-CSauto does not perform well when the training dataset is relatively small and the genetic architecture is sparse[34]. Consequently, 2hGlu was excluded from this analysis. For each trait, to obtain European ancestry training and test datasets, we first removed all cohorts only genotyped on the Metabochip which were not included in this analysis. From the remaining cohorts we then removed five of the largest European cohorts contributing to the respective European ancestry meta-analysis. For each trait, these five cohorts were meta-analyzed and used as the European ancestry test dataset. Subsequently, the remaining European ancestry cohorts were also meta-analyzed and used as the European ancestry training dataset. For each of the other ancestries, cohorts only genotyped on the Metabochip were also removed, and the remaining cohorts were meta-analyzed, and used as the non-European ancestry test datasets. Variants with MAF<0.05 or missing in over half of the individuals in the training dataset were removed[34,88]. The PGS for each trait was built using PRS-CSauto with default settings[34] with the effect size estimates based on the European training dataset being revised based on an LD reference panel matching the test dataset. The proportion of the trait variance explained by the European ancestry-derived PGS (R2) was estimated using the R package “gtx”[89] based on the revised effect sizes and summary statistics from the test dataset for each ancestry.Of the 242 loci identified in this study, 237 were autosomal loci which we took forward for fine-mapping (Supplementary Table 2). We used the Bayesian fine-mapping method FINEMAP[90] (version 1.1) to refine association signals and attempt to identify likely causal variants at each locus. FINEMAP estimates the maximum number of causal variants at each locus, calculates the posterior probability of each variant being causal, and proposes the most likely configuration of causal variants. The posterior probabilities of the configurations in each locus were used to construct 99% credible sets.We performed both single-ancestry and trans-ancestry fine-mapping. In both analyses, only data from cohorts genotyped on GWAS arrays were used, and analyses were limited to trans-ancestry lead variants and other single-ancestry lead variants present in at least 90% of the samples for each trait. For the single-ancestry fine-mapping, FINEMAP estimates the number of causal variants in a region up to a maximum number, which we set to be two plus the number of distinct signals identified from the GCTA signal selection. FINEMAP uses single-ancestry and trait-specific z-scores from the fixed-effect meta-analysis in METAL[20] and an ancestry-specific LD reference, which we created from a subset of cohorts (combined sample size > 30% of the sample size for that ancestry), weighting each cohort by sample size. In the trans-ancestry fine-mapping, FINEMAP was similarly used to estimate the number of causal variants starting with two, and trait-specific z-scores and LD maps were generated from the sample size weighted average of those used in the single-ancestry fine-mapping. The maximum number of causal variants was iteratively increased by one until it was larger than the number of causal variants supported by data (Bayes factor), which was the estimated maximum number of causal variants used in the final run of fine-mapping analysis.To compare fine-mapping results obtained from the single-ancestry and trans-ancestry efforts, analyses were limited to fine-mapping regions with evidence for a single likely causal variant in both, enabling a straightforward comparison of credible sets (Supplementary note). To ensure any difference in the fine-mapping results was not driven by different sets of variants being present in the different analyses, we repeated the single-ancestry fine-mapping limited to the same set of variants used in the trans-ancestry fine-mapping. The fine-mapping resolution was assessed based on comparisons of the 99% credible sets in terms of number of variants included in the set, and length of the region. To assess whether the improvement in the trans-ancestry fine-mapping was due to differences in LD, increased sample size, or both, we repeated the trans-ancestry fine-mapping mimicking the sample size present in the single-ancestry fine-mapping by dividing the standard errors by the square root of the sample size ratio and compared the results with those from the single-ancestry fine-mapping.
Functional Annotation of trait-associated variants
HbA1c signal classification
There were 218 HbA1c-associated signals from either the single-ancestry (i.e. all GCTA-signals from any ancestry) or trans-ancestry meta-analyses. To classify these signals in terms of their likely mode of action (i.e., glycemic, erythrocytic, or other [7]), we examined association summary statistics for the lead variants at the 218 signals in other large European datasets for 19 additional traits: three glycemic traits from this study (FG, 2hGlu and FI); seven mature red blood cell (RBC) traits[91,92] (red blood cell count, mean corpuscular volume, hematocrit, mean corpuscular hemoglobin, mean corpuscular hemoglobin concentration, hemoglobin concentration and red cell distribution width); five reticulocyte traits (reticulocyte count, reticulocyte fraction of red cells, immature fraction of reticulocytes, high light scatter reticulocyte count and high light scatter percentage of red cells)[91,92], and four iron traits (serum iron, transferrin, transferrin saturation and ferritin)[93]. Of the 218 HbA1c signals, data were available for the lead (n=183) or proxy (European LD r2 > 0.8, n = 8) variants at 191 signals.The additional traits were clustered using hierarchical clustering to ensure biologically related traits would cluster together (Supplementary note). We then used a non-negative matrix factorization (NMF)[94] process to cluster the HbA1c signals. Each cluster was labelled as glycemic, reticulocyte, mature RBC, or iron related based on the strength of association of signals in the cluster to the glycemic, reticulocyte, mature RBC and iron traits (Supplementary note). To verify that our cluster naming was correct, we used HbA1c association results conditioned on either FG or iron traits, or type 2 diabetes association results (Supplementary note).
HbA1c genetic risk scores (GRSs) and T2D risk
We constructed GRS for each cluster of HbA1c-associated signals (based on hard clustering) and tested the association of each cluster with T2D risk using samples from the UK Biobank. Pairs of HbA1c signals in LD (EUR r2>0.10) were LD pruned by removing the signal with the less significant P-value of association with HbA1c. The GRS for each cluster was calculated based on the logarithm of odds ratios from the latest T2D study summary statistics[95] and UK Biobank genotypes imputed to the Haplotype Reference Consortium[19]. From 487,409 UK Biobank samples (age between 46 and 82 years, and 55% female), we excluded participants for the following reasons: 373 with mismatched sex; 9 not used in the kinship calculation; 78,365 non-European ancestry individuals; and 138,504 with missing T2D status, age, or sex information. We further removed 26,896 related participants (kinship > 0.088, preferentially removing individuals with the largest number of relatives and controls where a T2D case was related to a control). T2D cases were defined by: (i) a history of diabetes without metformin or insulin treatment, (ii) self-reported diagnosis of T2D, or (iii) diagnosis of T2D in a national registry (N = 17,022, age between 47 and 79 years, and 36% female). Controls were participants without a history of T2D (N = 226,240, age between 46 and 82 years, and 56% female). We tested for association between each GRS and T2D using logistic regression including covariates for age, sex, and the first five principal components. Significance of association was evaluated by a bootstrap approach to incorporate the variance of each HbA1c associated signal in the T2D summary data. To do this, we generated the GRS of each cluster 200 times by resampling the logarithm of odds ratio of each signal with T2D. For each non-glycemic class that had a GRS significantly associated with T2D, we performed sensitivity analyses to evaluate whether the association was driven from variants that also belonged to a glycemic cluster when using a soft clustering approach (the signals were classified as also glycemic in the soft clustering or had an association P ≤ 0.05 with any of the three glycemic traits).
Chromatin states
To identify genetic variants within association signals that overlapped predicted chromatin states, we used a previously published, 13 chromatin state model that included 31 diverse tissues, including pancreatic islets, skeletal muscle, adipose, and liver[39]. Briefly, this model was generated from cell/tissue ChIP-seq data for H3K27ac, H3K27me3, H3K36me3, H3K4me1, and H3K4me3, and input control from a diverse set of publicly available data[53,57,96,97] using the ChromHMM program[98]. As reported previously[39], StrEs were defined as contiguous enhancer chromatin state (Active Enhancer 1 and 2, Genic Enhancer and Weak Enhancer) segments longer than 3kb[57].
Enrichment of genetic variants in genomic features
We used GREGOR (version 1.2.1) to calculate the enrichment of GWAS variants overlapping static and StrEs[56]. For calculating the enrichment of glycemic trait-associated variants in these annotations, we used the filtered list of trait-associated variants as described above (Supplementary Table 7) as input. For calculating the enrichment of sub-classified HbA1c variants, we included the list of loci characterized as Glycemic, another list of loci characterized as Reticulocyte or mature Red Blood Cell, collectively representing the red blood cell fraction, along with lists of iron related or unclassified loci (Supplementary Table 17). We used the following parameters in GREGOR enrichment analyses: European r2 threshold (for inclusion of variants in LD with the lead variant) = 0.8, LD window size = 1 Mb, and minimum neighbour number = 500.We used fGWAS (version 0.3.6)[58] to calculate enrichment of glycemic trait-associated variants in static and StrE annotations using summary level GWAS results. We used the default fGWAS parameters for enrichment analyses for individual annotations for each trait. For each annotation, the model provided the natural log of maximum likelihood estimate of the enrichment parameter. Annotations were considered as significantly enriched if the log2 (parameter estimate) and respective 95% confidence intervals were above zero or significantly depleted if the log2 (parameter estimate) and respective 95% confidence intervals were below zero.We tested enrichment of trait-associated variants in static and StrE annotations with GARFIELD (v2)[59]. We formatted annotation overlap files as required by the tool; prepared input data at two GWAS thresholds - of 1x10-5 and a more stringent 1x10-8 by pruning and clumping with default parameters (garfield-prep-chr script). We calculated enrichment in each individual annotation using garfield-test.R with –c option set to 0. We also calculated the effective number of annotations using the garfield-Meff-Padj.R script. We used the effective number of annotations for each trait to obtain Bonferroni corrected significance thresholds for enrichment for each trait.
eQTL analyses
To aid in the identification of candidate casual genes at the European-only and trans-ancestry association signals, we examined whether any of the lead variants associated with glycemic traits (Supplementary Table 7) were also associated with expression level (FDR < 5%) of nearby transcripts located within 1 Mb in existing eQTL data sets of blood, subcutaneous adipose, visceral adipose, skeletal muscle, and pancreatic islet samples[60,61,99-102]. LD was estimated from the collected cohort pairwise LD information, where available, else from the European samples in 1000G phase 3. GWAS and eQTL signals likely co-localize when the GWAS variant and the variant most strongly associated with the expression level of the corresponding transcript (eSNP) exhibit high pairwise LD (r2 > 0.8; 1000 Genomes Phase 3, EUR). At these signals, we conducted reciprocal conditional analyses to test association between the GWAS variant and transcript level when the eSNP was also included in the model, and vice versa. We report GWAS and eQTL signals as co-localized if the association for the eSNP was not significant (FDR ≥ 5%) when conditioned on the GWAS variant; we also report signals from the eQTLGen whole blood meta-analysis data that meet only the LD threshold because conditional analysis was not possible.
Tissue and gene-set analysis
We performed enrichment analysis using DEPICT (Data-driven Expression-Prioritized Integration for Complex Traits) version 3, specifically developed for 1000 Genomes Project imputed meta-analysis data[103] to identify cell types and tissues in which genes at trait-associated variants were strongly expressed, and to detect enrichment of gene-sets or pathways. DEPICT data included human gene expression data for 19,987 genes in 10,968 reconstituted gene sets, and 209 tissues/cell types. Because gene expression data in DEPICT is based on European samples and LD, we selected trait-associated variants with P<10-5 in the European meta-analysis and tested for enrichment of signals in each reconstituted gene-set, and each tissue or cell type. Enrichment results with a false discovery rate (FDR)<0.05 were considered significant. We ran DEPICT based on association results for all traits among: (i) cohorts with genome-wide data, or (ii) all cohorts (genome-wide and Metabochip cohorts). Because results were broadly consistent between the two approaches, we present results from the analysis that contained all cohorts as it had greater statistical power.
Statistics and reproducibility
Sample size
No statistical method was used to predetermine sample size. We aimed to bring together the largest possible sample size with GWAS data from individuals of diverse ancestries (European, Hispanic, African American, East Asian, South Asian and sub-Saharan African) without diabetes and with data for one or more of the following traits: fasting glucose, fasting insulin, 2hr post-challenge glucose, and glycated hemoglobin. The sample sizes were 281,416 (FG), 213,650 (FI), 215,977 (HbA1c) and 85,916 (2hGlu) (Supplementary Table 1).Our sample size was sufficiently powered to detect common variant associations with each of the glycemic traits and was able to detect associations at 242 loci.
Randomization/Blinding
This is a study of continuous traits therefore there were no experiments to randomize and there was no “outcome” to which investigators needed to be blinded to.
Data exclusions
Prior to conducting this study, we identified reasons for which data should be excluded from the analysis at either the cohort or summary level; these exclusions are as follows. Sample quality control checks included removing samples with low call rate < 95%, extreme heterozygosity, sex mismatch with X chromosome variants, duplicates, first- or second-degree relatives (unless by design), or ancestry outliers. Following sample QC, cohorts applied variant QC thresholds for call rate (< 95%), Hardy-Weinberg Equilibrium (HWE) P < 1x10-6, and minor allele frequency (MAF). Full details of QC thresholds and exclusions by participating cohort are available in Supplementary Table 1. Each contributing cohort shared their summary statistic results with the central analysis group who performed additional QC using EasyQC. Allele frequency estimates were compared to estimates from 1000Gp1 reference panel, and variants were excluded from downstream analyses if there was a minor allele frequency difference > 0.2 for AA, EUR, HISP, and EAS populations against AFR, EUR, MXL, and ASN populations from 1000 Genomes Phase 1, respectively, or a minor allele frequency difference > 0.4 for SAS against EUR populations. At this stage, additional variants were excluded from each cohort file if they met one of the following criteria: were tri-allelic; had a minor allele count (MAC) < 3; demonstrated a standard error of the effect size ≥ 10; imputation r2 < 0.4 or INFO score < 0.4; or were missing an effect estimate, standard error, or imputation quality.
Flow diagram of this study
The figure shows the data, key methods and main analyses included in this effort.
Locus diagram
Trans-ancestry locus A contains a trans-ancestry lead variant for one glycemic trait represented by the blue diamond, and another single-ancestry index variant for another glycemic trait represented by the orange triangle. Single-ancestry locus B contains a single-ancestry lead variant represented by the purple square. The orange, blue and purple bars represent a +/- 500Kb window around the orange, blue, and purple variants, respectively. The black bars indicate the full locus window where trans-ancestry locus A contains trans-ancestry lead and single-ancestry index variants for two traits and single-ancestry locus B has a single-ancestry lead variant for a single trait.
Venn diagram
Overlap of TA loci between traits.
Allele frequency versus effect size
Allele frequency versus effect size for all signals detected through the trans-ancestry meta-analyses, for each of the four traits. Frequency and effect size are from the European meta-analyses. The power curves were computed based on the European sample size for each trait, and the mean (m) and standard deviation (sd) computed on the FENLAND study: FG, m=4.83 mmol/l, sd=0.68; FI, m=3.69 mmol/l, sd=0.60; 2hGlu, m=5.30 mmol/l, sd=1.74; HbA1c, m=5.55%, sd=0.48.
EAF correlation and heterogeneity test
Pearson correlation of EAF on the lower tri-angle and p-value of one-side heterogeneity test without multiple testing corrections on the upper tri-angle of the trans-ancestry lead variants associated with each trait between ancestries. Correlations > 0.7 are in bold.
Forest plot of T2D GRS from HbA1c variants
The p-value on the right side is from the two-side test without multiple testing corrections. Vertical points of each diamond represent the point estimate of the odds ratio. The horizontal points of each diamond represent the 95% confidence interval of the odds ratio. Figure shows the association results between HbA1c-associated variants built into a GRS for T2D by taking each HbA1c-associated variant and using a weight that corresponds to its T2D effect size (logOR) based on analysis by the DIAGRAM consortium. The overall GRS is subsequently partitioned according to the HbA1c signal classification. The overall and partitioned GRS were tested for association with T2D based on data from UK biobank.
Enrichment of glycemic trait associated GWAS variants to overlap genomic annotations using GREGOR
Figure shows enrichment for 59 total static and stretch enhancer annotations considered. One-side test significance (red) is determined after Bonferroni correction to account for 59 total annotations tested for each trait; nominal significance (P<0.05) is indicated in yellow.
Enrichment of glycemic trait associated GWAS variants to overlap genomic annotations using fGWAS
Figure shows log2(Fold Enrichment) of GWAS variants to overlap 59 static and stretch enhancer annotations calculated. Significant enrichment (red) is considered if the 95% confidence intervals (shown by the error bars) do not overlap 0.
Enrichment of glycemic trait associated GWAS variants to overlap genomic annotations using GARFIELD
Figure shows the beta or effect size (log odds ratio) for GWAS variants to overlap 59 static and stretch enhancer annotations. GWAS variants were included at two significance thresholds, 1e-05 (A) and 1e-08 (B). One-side test significance (red) is determined after Bonferroni correction to account for effective annotations tested for each trait reported by GARFIELD (see supplementary note); nominal significance (P<0.05) is indicated in yellow. The 95% confidence intervals are shown by the error bars.
Authors: Mark O Goodarzi; Jinrui Cui; Yii-Der I Chen; Willa A Hsueh; Xiuqing Guo; Jerome I Rotter Journal: Am J Physiol Endocrinol Metab Date: 2011-05-31 Impact factor: 4.310
Authors: Geoffrey A Walford; Stefan Gustafsson; Denis Rybin; Alena Stančáková; Han Chen; Ching-Ti Liu; Jaeyoung Hong; Richard A Jensen; Ken Rice; Andrew P Morris; Reedik Mägi; Anke Tönjes; Inga Prokopenko; Marcus E Kleber; Graciela Delgado; Günther Silbernagel; Anne U Jackson; Emil V Appel; Niels Grarup; Joshua P Lewis; May E Montasser; Claes Landenvall; Harald Staiger; Jian'an Luan; Timothy M Frayling; Michael N Weedon; Weijia Xie; Sonsoles Morcillo; María Teresa Martínez-Larrad; Mary L Biggs; Yii-Der Ida Chen; Arturo Corbaton-Anchuelo; Kristine Færch; Juan Miguel Gómez-Zumaquero; Mark O Goodarzi; Jorge R Kizer; Heikki A Koistinen; Aaron Leong; Lars Lind; Cecilia Lindgren; Fausto Machicao; Alisa K Manning; Gracia María Martín-Núñez; Gemma Rojo-Martínez; Jerome I Rotter; David S Siscovick; Joseph M Zmuda; Zhongyang Zhang; Manuel Serrano-Rios; Ulf Smith; Federico Soriguer; Torben Hansen; Torben J Jørgensen; Allan Linnenberg; Oluf Pedersen; Mark Walker; Claudia Langenberg; Robert A Scott; Nicholas J Wareham; Andreas Fritsche; Hans-Ulrich Häring; Norbert Stefan; Leif Groop; Jeff R O'Connell; Michael Boehnke; Richard N Bergman; Francis S Collins; Karen L Mohlke; Jaakko Tuomilehto; Winfried März; Peter Kovacs; Michael Stumvoll; Bruce M Psaty; Johanna Kuusisto; Markku Laakso; James B Meigs; Josée Dupuis; Erik Ingelsson; Jose C Florez Journal: Diabetes Date: 2016-07-14 Impact factor: 9.461
Authors: Anubha Mahajan; Xueling Sim; Hui Jin Ng; Alisa Manning; Manuel A Rivas; Heather M Highland; Adam E Locke; Niels Grarup; Hae Kyung Im; Pablo Cingolani; Jason Flannick; Pierre Fontanillas; Christian Fuchsberger; Kyle J Gaulton; Tanya M Teslovich; N William Rayner; Neil R Robertson; Nicola L Beer; Jana K Rundle; Jette Bork-Jensen; Claes Ladenvall; Christine Blancher; David Buck; Gemma Buck; Noël P Burtt; Stacey Gabriel; Anette P Gjesing; Christopher J Groves; Mette Hollensted; Jeroen R Huyghe; Anne U Jackson; Goo Jun; Johanne Marie Justesen; Massimo Mangino; Jacquelyn Murphy; Matt Neville; Robert Onofrio; Kerrin S Small; Heather M Stringham; Ann-Christine Syvänen; Joseph Trakalo; Goncalo Abecasis; Graeme I Bell; John Blangero; Nancy J Cox; Ravindranath Duggirala; Craig L Hanis; Mark Seielstad; James G Wilson; Cramer Christensen; Ivan Brandslund; Rainer Rauramaa; Gabriela L Surdulescu; Alex S F Doney; Lars Lannfelt; Allan Linneberg; Bo Isomaa; Tiinamaija Tuomi; Marit E Jørgensen; Torben Jørgensen; Johanna Kuusisto; Matti Uusitupa; Veikko Salomaa; Timothy D Spector; Andrew D Morris; Colin N A Palmer; Francis S Collins; Karen L Mohlke; Richard N Bergman; Erik Ingelsson; Lars Lind; Jaakko Tuomilehto; Torben Hansen; Richard M Watanabe; Inga Prokopenko; Josee Dupuis; Fredrik Karpe; Leif Groop; Markku Laakso; Oluf Pedersen; Jose C Florez; Andrew P Morris; David Altshuler; James B Meigs; Michael Boehnke; Mark I McCarthy; Cecilia M Lindgren; Anna L Gloyn Journal: PLoS Genet Date: 2015-01-27 Impact factor: 5.917
Authors: Momoko Horikoshi; Reedik Mӓgi; Martijn van de Bunt; Ida Surakka; Antti-Pekka Sarin; Anubha Mahajan; Letizia Marullo; Gudmar Thorleifsson; Sara Hӓgg; Jouke-Jan Hottenga; Claes Ladenvall; Janina S Ried; Thomas W Winkler; Sara M Willems; Natalia Pervjakova; Tõnu Esko; Marian Beekman; Christopher P Nelson; Christina Willenborg; Steven Wiltshire; Teresa Ferreira; Juan Fernandez; Kyle J Gaulton; Valgerdur Steinthorsdottir; Anders Hamsten; Patrik K E Magnusson; Gonneke Willemsen; Yuri Milaneschi; Neil R Robertson; Christopher J Groves; Amanda J Bennett; Terho Lehtimӓki; Jorma S Viikari; Johan Rung; Valeriya Lyssenko; Markus Perola; Iris M Heid; Christian Herder; Harald Grallert; Martina Müller-Nurasyid; Michael Roden; Elina Hypponen; Aaron Isaacs; Elisabeth M van Leeuwen; Lennart C Karssen; Evelin Mihailov; Jeanine J Houwing-Duistermaat; Anton J M de Craen; Joris Deelen; Aki S Havulinna; Matthew Blades; Christian Hengstenberg; Jeanette Erdmann; Heribert Schunkert; Jaakko Kaprio; Martin D Tobin; Nilesh J Samani; Lars Lind; Veikko Salomaa; Cecilia M Lindgren; P Eline Slagboom; Andres Metspalu; Cornelia M van Duijn; Johan G Eriksson; Annette Peters; Christian Gieger; Antti Jula; Leif Groop; Olli T Raitakari; Chris Power; Brenda W J H Penninx; Eco de Geus; Johannes H Smit; Dorret I Boomsma; Nancy L Pedersen; Erik Ingelsson; Unnur Thorsteinsdottir; Kari Stefansson; Samuli Ripatti; Inga Prokopenko; Mark I McCarthy; Andrew P Morris Journal: PLoS Genet Date: 2015-07-01 Impact factor: 5.917
Authors: Alisa K Manning; Marie-France Hivert; Robert A Scott; Jonna L Grimsby; Nabila Bouatia-Naji; Han Chen; Denis Rybin; Ching-Ti Liu; Lawrence F Bielak; Inga Prokopenko; Najaf Amin; Daniel Barnes; Gemma Cadby; Jouke-Jan Hottenga; Erik Ingelsson; Anne U Jackson; Toby Johnson; Stavroula Kanoni; Claes Ladenvall; Vasiliki Lagou; Jari Lahti; Cecile Lecoeur; Yongmei Liu; Maria Teresa Martinez-Larrad; May E Montasser; Pau Navarro; John R B Perry; Laura J Rasmussen-Torvik; Perttu Salo; Naveed Sattar; Dmitry Shungin; Rona J Strawbridge; Toshiko Tanaka; Cornelia M van Duijn; Ping An; Mariza de Andrade; Jeanette S Andrews; Thor Aspelund; Mustafa Atalay; Yurii Aulchenko; Beverley Balkau; Stefania Bandinelli; Jacques S Beckmann; John P Beilby; Claire Bellis; Richard N Bergman; John Blangero; Mladen Boban; Michael Boehnke; Eric Boerwinkle; Lori L Bonnycastle; Dorret I Boomsma; Ingrid B Borecki; Yvonne Böttcher; Claude Bouchard; Eric Brunner; Danijela Budimir; Harry Campbell; Olga Carlson; Peter S Chines; Robert Clarke; Francis S Collins; Arturo Corbatón-Anchuelo; David Couper; Ulf de Faire; George V Dedoussis; Panos Deloukas; Maria Dimitriou; Josephine M Egan; Gudny Eiriksdottir; Michael R Erdos; Johan G Eriksson; Elodie Eury; Luigi Ferrucci; Ian Ford; Nita G Forouhi; Caroline S Fox; Maria Grazia Franzosi; Paul W Franks; Timothy M Frayling; Philippe Froguel; Pilar Galan; Eco de Geus; Bruna Gigante; Nicole L Glazer; Anuj Goel; Leif Groop; Vilmundur Gudnason; Göran Hallmans; Anders Hamsten; Ola Hansson; Tamara B Harris; Caroline Hayward; Simon Heath; Serge Hercberg; Andrew A Hicks; Aroon Hingorani; Albert Hofman; Jennie Hui; Joseph Hung; Marjo-Riitta Jarvelin; Min A Jhun; Paul C D Johnson; J Wouter Jukema; Antti Jula; W H Kao; Jaakko Kaprio; Sharon L R Kardia; Sirkka Keinanen-Kiukaanniemi; Mika Kivimaki; Ivana Kolcic; Peter Kovacs; Meena Kumari; Johanna Kuusisto; Kirsten Ohm Kyvik; Markku Laakso; Timo Lakka; Lars Lannfelt; G Mark Lathrop; Lenore J Launer; Karin Leander; Guo Li; Lars Lind; Jaana Lindstrom; Stéphane Lobbens; Ruth J F Loos; Jian'an Luan; Valeriya Lyssenko; Reedik Mägi; Patrik K E Magnusson; Michael Marmot; Pierre Meneton; Karen L Mohlke; Vincent Mooser; Mario A Morken; Iva Miljkovic; Narisu Narisu; Jeff O'Connell; Ken K Ong; Ben A Oostra; Lyle J Palmer; Aarno Palotie; James S Pankow; John F Peden; Nancy L Pedersen; Marina Pehlic; Leena Peltonen; Brenda Penninx; Marijana Pericic; Markus Perola; Louis Perusse; Patricia A Peyser; Ozren Polasek; Peter P Pramstaller; Michael A Province; Katri Räikkönen; Rainer Rauramaa; Emil Rehnberg; Ken Rice; Jerome I Rotter; Igor Rudan; Aimo Ruokonen; Timo Saaristo; Maria Sabater-Lleal; Veikko Salomaa; David B Savage; Richa Saxena; Peter Schwarz; Udo Seedorf; Bengt Sennblad; Manuel Serrano-Rios; Alan R Shuldiner; Eric J G Sijbrands; David S Siscovick; Johannes H Smit; Kerrin S Small; Nicholas L Smith; Albert Vernon Smith; Alena Stančáková; Kathleen Stirrups; Michael Stumvoll; Yan V Sun; Amy J Swift; Anke Tönjes; Jaakko Tuomilehto; Stella Trompet; Andre G Uitterlinden; Matti Uusitupa; Max Vikström; Veronique Vitart; Marie-Claude Vohl; Benjamin F Voight; Peter Vollenweider; Gerard Waeber; Dawn M Waterworth; Hugh Watkins; Eleanor Wheeler; Elisabeth Widen; Sarah H Wild; Sara M Willems; Gonneke Willemsen; James F Wilson; Jacqueline C M Witteman; Alan F Wright; Hanieh Yaghootkar; Diana Zelenika; Tatijana Zemunik; Lina Zgaga; Nicholas J Wareham; Mark I McCarthy; Ines Barroso; Richard M Watanabe; Jose C Florez; Josée Dupuis; James B Meigs; Claudia Langenberg Journal: Nat Genet Date: 2012-05-13 Impact factor: 38.330
Authors: Josée Dupuis; Claudia Langenberg; Inga Prokopenko; Richa Saxena; Nicole Soranzo; Anne U Jackson; Eleanor Wheeler; Nicole L Glazer; Nabila Bouatia-Naji; Anna L Gloyn; Cecilia M Lindgren; Reedik Mägi; Andrew P Morris; Joshua Randall; Toby Johnson; Paul Elliott; Denis Rybin; Gudmar Thorleifsson; Valgerdur Steinthorsdottir; Peter Henneman; Harald Grallert; Abbas Dehghan; Jouke Jan Hottenga; Christopher S Franklin; Pau Navarro; Kijoung Song; Anuj Goel; John R B Perry; Josephine M Egan; Taina Lajunen; Niels Grarup; Thomas Sparsø; Alex Doney; Benjamin F Voight; Heather M Stringham; Man Li; Stavroula Kanoni; Peter Shrader; Christine Cavalcanti-Proença; Meena Kumari; Lu Qi; Nicholas J Timpson; Christian Gieger; Carina Zabena; Ghislain Rocheleau; Erik Ingelsson; Ping An; Jeffrey O'Connell; Jian'an Luan; Amanda Elliott; Steven A McCarroll; Felicity Payne; Rosa Maria Roccasecca; François Pattou; Praveen Sethupathy; Kristin Ardlie; Yavuz Ariyurek; Beverley Balkau; Philip Barter; John P Beilby; Yoav Ben-Shlomo; Rafn Benediktsson; Amanda J Bennett; Sven Bergmann; Murielle Bochud; Eric Boerwinkle; Amélie Bonnefond; Lori L Bonnycastle; Knut Borch-Johnsen; Yvonne Böttcher; Eric Brunner; Suzannah J Bumpstead; Guillaume Charpentier; Yii-Der Ida Chen; Peter Chines; Robert Clarke; Lachlan J M Coin; Matthew N Cooper; Marilyn Cornelis; Gabe Crawford; Laura Crisponi; Ian N M Day; Eco J C de Geus; Jerome Delplanque; Christian Dina; Michael R Erdos; Annette C Fedson; Antje Fischer-Rosinsky; Nita G Forouhi; Caroline S Fox; Rune Frants; Maria Grazia Franzosi; Pilar Galan; Mark O Goodarzi; Jürgen Graessler; Christopher J Groves; Scott Grundy; Rhian Gwilliam; Ulf Gyllensten; Samy Hadjadj; Göran Hallmans; Naomi Hammond; Xijing Han; Anna-Liisa Hartikainen; Neelam Hassanali; Caroline Hayward; Simon C Heath; Serge Hercberg; Christian Herder; Andrew A Hicks; David R Hillman; Aroon D Hingorani; Albert Hofman; Jennie Hui; Joe Hung; Bo Isomaa; Paul R V Johnson; Torben Jørgensen; Antti Jula; Marika Kaakinen; Jaakko Kaprio; Y Antero Kesaniemi; Mika Kivimaki; Beatrice Knight; Seppo Koskinen; Peter Kovacs; Kirsten Ohm Kyvik; G Mark Lathrop; Debbie A Lawlor; Olivier Le Bacquer; Cécile Lecoeur; Yun Li; Valeriya Lyssenko; Robert Mahley; Massimo Mangino; Alisa K Manning; María Teresa Martínez-Larrad; Jarred B McAteer; Laura J McCulloch; Ruth McPherson; Christa Meisinger; David Melzer; David Meyre; Braxton D Mitchell; Mario A Morken; Sutapa Mukherjee; Silvia Naitza; Narisu Narisu; Matthew J Neville; Ben A Oostra; Marco Orrù; Ruth Pakyz; Colin N A Palmer; Giuseppe Paolisso; Cristian Pattaro; Daniel Pearson; John F Peden; Nancy L Pedersen; Markus Perola; Andreas F H Pfeiffer; Irene Pichler; Ozren Polasek; Danielle Posthuma; Simon C Potter; Anneli Pouta; Michael A Province; Bruce M Psaty; Wolfgang Rathmann; Nigel W Rayner; Kenneth Rice; Samuli Ripatti; Fernando Rivadeneira; Michael Roden; Olov Rolandsson; Annelli Sandbaek; Manjinder Sandhu; Serena Sanna; Avan Aihie Sayer; Paul Scheet; Laura J Scott; Udo Seedorf; Stephen J Sharp; Beverley Shields; Gunnar Sigurethsson; Eric J G Sijbrands; Angela Silveira; Laila Simpson; Andrew Singleton; Nicholas L Smith; Ulla Sovio; Amy Swift; Holly Syddall; Ann-Christine Syvänen; Toshiko Tanaka; Barbara Thorand; Jean Tichet; Anke Tönjes; Tiinamaija Tuomi; André G Uitterlinden; Ko Willems van Dijk; Mandy van Hoek; Dhiraj Varma; Sophie Visvikis-Siest; Veronique Vitart; Nicole Vogelzangs; Gérard Waeber; Peter J Wagner; Andrew Walley; G Bragi Walters; Kim L Ward; Hugh Watkins; Michael N Weedon; Sarah H Wild; Gonneke Willemsen; Jaqueline C M Witteman; John W G Yarnell; Eleftheria Zeggini; Diana Zelenika; Björn Zethelius; Guangju Zhai; Jing Hua Zhao; M Carola Zillikens; Ingrid B Borecki; Ruth J F Loos; Pierre Meneton; Patrik K E Magnusson; David M Nathan; Gordon H Williams; Andrew T Hattersley; Kaisa Silander; Veikko Salomaa; George Davey Smith; Stefan R Bornstein; Peter Schwarz; Joachim Spranger; Fredrik Karpe; Alan R Shuldiner; Cyrus Cooper; George V Dedoussis; Manuel Serrano-Ríos; Andrew D Morris; Lars Lind; Lyle J Palmer; Frank B Hu; Paul W Franks; Shah Ebrahim; Michael Marmot; W H Linda Kao; James S Pankow; Michael J Sampson; Johanna Kuusisto; Markku Laakso; Torben Hansen; Oluf Pedersen; Peter Paul Pramstaller; H Erich Wichmann; Thomas Illig; Igor Rudan; Alan F Wright; Michael Stumvoll; Harry Campbell; James F Wilson; Richard N Bergman; Thomas A Buchanan; Francis S Collins; Karen L Mohlke; Jaakko Tuomilehto; Timo T Valle; David Altshuler; Jerome I Rotter; David S Siscovick; Brenda W J H Penninx; Dorret I Boomsma; Panos Deloukas; Timothy D Spector; Timothy M Frayling; Luigi Ferrucci; Augustine Kong; Unnur Thorsteinsdottir; Kari Stefansson; Cornelia M van Duijn; Yurii S Aulchenko; Antonio Cao; Angelo Scuteri; David Schlessinger; Manuela Uda; Aimo Ruokonen; Marjo-Riitta Jarvelin; Dawn M Waterworth; Peter Vollenweider; Leena Peltonen; Vincent Mooser; Goncalo R Abecasis; Nicholas J Wareham; Robert Sladek; Philippe Froguel; Richard M Watanabe; James B Meigs; Leif Groop; Michael Boehnke; Mark I McCarthy; Jose C Florez; Inês Barroso Journal: Nat Genet Date: 2010-01-17 Impact factor: 38.330
Authors: Antigone S Dimas; Vasiliki Lagou; Adam Barker; Joshua W Knowles; Reedik Mägi; Marie-France Hivert; Andrea Benazzo; Denis Rybin; Anne U Jackson; Heather M Stringham; Ci Song; Antje Fischer-Rosinsky; Trine Welløv Boesgaard; Niels Grarup; Fahim A Abbasi; Themistocles L Assimes; Ke Hao; Xia Yang; Cécile Lecoeur; Inês Barroso; Lori L Bonnycastle; Yvonne Böttcher; Suzannah Bumpstead; Peter S Chines; Michael R Erdos; Jurgen Graessler; Peter Kovacs; Mario A Morken; Narisu Narisu; Felicity Payne; Alena Stancakova; Amy J Swift; Anke Tönjes; Stefan R Bornstein; Stéphane Cauchi; Philippe Froguel; David Meyre; Peter E H Schwarz; Hans-Ulrich Häring; Ulf Smith; Michael Boehnke; Richard N Bergman; Francis S Collins; Karen L Mohlke; Jaakko Tuomilehto; Thomas Quertemous; Lars Lind; Torben Hansen; Oluf Pedersen; Mark Walker; Andreas F H Pfeiffer; Joachim Spranger; Michael Stumvoll; James B Meigs; Nicholas J Wareham; Johanna Kuusisto; Markku Laakso; Claudia Langenberg; Josée Dupuis; Richard M Watanabe; Jose C Florez; Erik Ingelsson; Mark I McCarthy; Inga Prokopenko Journal: Diabetes Date: 2013-12-02 Impact factor: 9.461
Authors: Eleanor Wheeler; Aaron Leong; Ching-Ti Liu; Marie-France Hivert; Rona J Strawbridge; Clara Podmore; Man Li; Jie Yao; Xueling Sim; Jaeyoung Hong; Audrey Y Chu; Weihua Zhang; Xu Wang; Peng Chen; Nisa M Maruthur; Bianca C Porneala; Stephen J Sharp; Yucheng Jia; Edmond K Kabagambe; Li-Ching Chang; Wei-Min Chen; Cathy E Elks; Daniel S Evans; Qiao Fan; Franco Giulianini; Min Jin Go; Jouke-Jan Hottenga; Yao Hu; Anne U Jackson; Stavroula Kanoni; Young Jin Kim; Marcus E Kleber; Claes Ladenvall; Cecile Lecoeur; Sing-Hui Lim; Yingchang Lu; Anubha Mahajan; Carola Marzi; Mike A Nalls; Pau Navarro; Ilja M Nolte; Lynda M Rose; Denis V Rybin; Serena Sanna; Yuan Shi; Daniel O Stram; Fumihiko Takeuchi; Shu Pei Tan; Peter J van der Most; Jana V Van Vliet-Ostaptchouk; Andrew Wong; Loic Yengo; Wanting Zhao; Anuj Goel; Maria Teresa Martinez Larrad; Dörte Radke; Perttu Salo; Toshiko Tanaka; Erik P A van Iperen; Goncalo Abecasis; Saima Afaq; Behrooz Z Alizadeh; Alain G Bertoni; Amelie Bonnefond; Yvonne Böttcher; Erwin P Bottinger; Harry Campbell; Olga D Carlson; Chien-Hsiun Chen; Yoon Shin Cho; W Timothy Garvey; Christian Gieger; Mark O Goodarzi; Harald Grallert; Anders Hamsten; Catharina A Hartman; Christian Herder; Chao Agnes Hsiung; Jie Huang; Michiya Igase; Masato Isono; Tomohiro Katsuya; Chiea-Chuen Khor; Wieland Kiess; Katsuhiko Kohara; Peter Kovacs; Juyoung Lee; Wen-Jane Lee; Benjamin Lehne; Huaixing Li; Jianjun Liu; Stephane Lobbens; Jian'an Luan; Valeriya Lyssenko; Thomas Meitinger; Tetsuro Miki; Iva Miljkovic; Sanghoon Moon; Antonella Mulas; Gabriele Müller; Martina Müller-Nurasyid; Ramaiah Nagaraja; Matthias Nauck; James S Pankow; Ozren Polasek; Inga Prokopenko; Paula S Ramos; Laura Rasmussen-Torvik; Wolfgang Rathmann; Stephen S Rich; Neil R Robertson; Michael Roden; Ronan Roussel; Igor Rudan; Robert A Scott; William R Scott; Bengt Sennblad; David S Siscovick; Konstantin Strauch; Liang Sun; Morris Swertz; Salman M Tajuddin; Kent D Taylor; Yik-Ying Teo; Yih Chung Tham; Anke Tönjes; Nicholas J Wareham; Gonneke Willemsen; Tom Wilsgaard; Aroon D Hingorani; Josephine Egan; Luigi Ferrucci; G Kees Hovingh; Antti Jula; Mika Kivimaki; Meena Kumari; Inger Njølstad; Colin N A Palmer; Manuel Serrano Ríos; Michael Stumvoll; Hugh Watkins; Tin Aung; Matthias Blüher; Michael Boehnke; Dorret I Boomsma; Stefan R Bornstein; John C Chambers; Daniel I Chasman; Yii-Der Ida Chen; Yduan-Tsong Chen; Ching-Yu Cheng; Francesco Cucca; Eco J C de Geus; Panos Deloukas; Michele K Evans; Myriam Fornage; Yechiel Friedlander; Philippe Froguel; Leif Groop; Myron D Gross; Tamara B Harris; Caroline Hayward; Chew-Kiat Heng; Erik Ingelsson; Norihiro Kato; Bong-Jo Kim; Woon-Puay Koh; Jaspal S Kooner; Antje Körner; Diana Kuh; Johanna Kuusisto; Markku Laakso; Xu Lin; Yongmei Liu; Ruth J F Loos; Patrik K E Magnusson; Winfried März; Mark I McCarthy; Albertine J Oldehinkel; Ken K Ong; Nancy L Pedersen; Mark A Pereira; Annette Peters; Paul M Ridker; Charumathi Sabanayagam; Michele Sale; Danish Saleheen; Juha Saltevo; Peter Eh Schwarz; Wayne H H Sheu; Harold Snieder; Timothy D Spector; Yasuharu Tabara; Jaakko Tuomilehto; Rob M van Dam; James G Wilson; James F Wilson; Bruce H R Wolffenbuttel; Tien Yin Wong; Jer-Yuarn Wu; Jian-Min Yuan; Alan B Zonderman; Nicole Soranzo; Xiuqing Guo; David J Roberts; Jose C Florez; Robert Sladek; Josée Dupuis; Andrew P Morris; E-Shyong Tai; Elizabeth Selvin; Jerome I Rotter; Claudia Langenberg; Inês Barroso; James B Meigs Journal: PLoS Med Date: 2017-09-12 Impact factor: 11.069
Authors: Miriam S Udler; Jaegil Kim; Marcin von Grotthuss; Sílvia Bonàs-Guarch; Joanne B Cole; Joshua Chiou; Michael Boehnke; Markku Laakso; Gil Atzmon; Benjamin Glaser; Josep M Mercader; Kyle Gaulton; Jason Flannick; Gad Getz; Jose C Florez Journal: PLoS Med Date: 2018-09-21 Impact factor: 11.069
Authors: Elise J Needham; Janne R Hingst; Benjamin L Parker; Kaitlin R Morrison; Guang Yang; Johan Onslev; Jonas M Kristensen; Kurt Højlund; Naomi X Y Ling; Jonathan S Oakhill; Erik A Richter; Bente Kiens; Janni Petersen; Christian Pehmøller; David E James; Jørgen F P Wojtaszewski; Sean J Humphrey Journal: Nat Biotechnol Date: 2021-12-02 Impact factor: 54.908
Authors: Fernando Riveros-Mckay; David Roberts; Emanuele Di Angelantonio; Bing Yu; Nicole Soranzo; John Danesh; Elizabeth Selvin; Adam S Butterworth; Inês Barroso Journal: Diabetes Date: 2022-02-01 Impact factor: 9.461
Authors: Jie Zheng; Min Xu; Venexia Walker; Jinqiu Yuan; Roxanna Korologou-Linden; Jamie Robinson; Peiyuan Huang; Stephen Burgess; Shiu Lun Au Yeung; Shan Luo; Michael V Holmes; George Davey Smith; Guang Ning; Weiqing Wang; Tom R Gaunt; Yufang Bi Journal: Diabetologia Date: 2022-07-29 Impact factor: 10.460
Authors: Edgar G Dorsey-Trevino; Varinderpal Kaur; Josep M Mercader; Jose C Florez; Aaron Leong Journal: J Clin Endocrinol Metab Date: 2022-08-18 Impact factor: 6.134
Authors: José Manuel Sánchez-Maldonado; Ricardo Collado; Antonio José Cabrera-Serrano; Rob Ter Horst; Fernando Gálvez-Montosa; Inmaculada Robles-Fernández; Verónica Arenas-Rodríguez; Blanca Cano-Gutiérrez; Olivier Bakker; María Inmaculada Bravo-Fernández; Francisco José García-Verdejo; José Antonio López López; Jesús Olivares-Ruiz; Miguel Ángel López-Nevot; Laura Fernández-Puerta; José Manuel Cózar-Olmo; Yang Li; Mihai G Netea; Manuel Jurado; Jose Antonio Lorente; Pedro Sánchez-Rovira; María Jesús Álvarez-Cubero; Juan Sainz Journal: Cancers (Basel) Date: 2022-05-12 Impact factor: 6.575
Authors: Neil Murphy; Mingyang Song; Nikos Papadimitriou; Robert Carreras-Torres; Claudia Langenberg; Richard M Martin; Konstantinos K Tsilidis; Inês Barroso; Ji Chen; Timothy M Frayling; Caroline J Bull; Emma E Vincent; Michelle Cotterchio; Stephen B Gruber; Rish K Pai; Polly A Newcomb; Aurora Perez-Cornago; Franzel J B van Duijnhoven; Bethany Van Guelpen; Pavel Vodicka; Alicja Wolk; Anna H Wu; Ulrike Peters; Andrew T Chan; Marc J Gunter Journal: J Natl Cancer Inst Date: 2022-05-09 Impact factor: 11.816