
Using the UK Biobank as a global reference of worldwide populations: application to measuring ancestry diversity from GWAS summary statistics.

Abstract

MOTIVATION: Measuring genetic diversity is an important problem because increasing genetic diversity is key to making new genetic discoveries, while also being a major source of confounding to be aware of in genetics studies.
RESULTS: Using the UK Biobank data, a prospective cohort study with deep genetic and phenotypic data collected on almost 500,000 individuals from across the United Kingdom, we carefully define 21 distinct ancestry groups from all four corners of the world. These ancestry groups can serve as a global reference of worldwide populations, with a handful of applications. Here we develop a method that uses allele frequencies and principal components derived from these ancestry groups to effectively measure ancestry proportions from allele frequencies of any genetic dataset.
AVAILABILITY: This method is implemented in function snp_ancestry_summary of R package bigsnpr.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Year: 2022 PMID: 35604078 PMCID: PMC9237724 DOI: 10.1093/bioinformatics/btac348

Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.931

1 Introduction

Several projects have focused on providing genetic data from diverse populations, such as the HapMap project, the 1000 genomes project (1KG), the Simons genome diversity project and the human genome diversity project (1000 Genomes Project Consortium ; Bergström ; International HapMap 3 Consortium ; Mallick ). However, these datasets do not contain many individuals per population and therefore are not large enough for some purposes, such as accurately estimating allele frequencies for diverse worldwide populations. The UK Biobank (UKBB) project is a prospective cohort study with deep genetic and phenotypic data collected on almost 500 000 individuals from across the UK. Despite being a cohort from the UK, this dataset is so large that it includes individuals that were born in all four corners of the world. Therefore, the UKBB can serve as a global reference of worldwide populations when used in its entirety, i.e. without discarding valuable multiancestry genetic data.

2 Implementation

Here, we carefully use information on self-reported ancestry, country of birth and genetic similarity to define 21 distinct ancestry groups from the UKBB to be used as global reference populations, which is the first innovation of this paper. These include nine groups with genetic ancestries from Europe, four from Africa, three from South Asia, three from East Asia, one from the Middle East and one from South America (which are later merged into 18 groups in Table 1). The detailed procedure used to construct these reference ancestry groups is presented in the Supplementary Materials. As a direct application of these groups, we propose a new method to estimate global ancestry proportions from a cohort based on its allele frequencies only (i.e. summary statistics). Arriaga-MacKenzie previously proposed the Summix method, which finds the convex combination of ancestry proportions α (non-negative and summing to 1) that minimizes the following problem:

$$\sum_{j=1}^{M} \left( f_j^{(0)} - \sum_{k=1}^{K} \alpha_k f_j^{(k)} \right)^2$$

where M is the number of variants, K the number of reference populations, $f_j^{(k)}$ is the frequency of variant j in population k and $f_j^{(0)}$ is the frequency of variant j in the cohort of interest. Arriaga-MacKenzie used the five continental 1KG populations as reference.
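To make the least-squares problem above concrete, here is a minimal sketch in plain Python. It is not the Summix implementation: it assumes a simple projected-gradient solver, the function names (`project_to_simplex`, `fit_ancestry_proportions`) are hypothetical, and the allele frequencies are made-up toy values chosen so that the cohort is an exact 70/30 mixture of the first two reference populations.

```python
def project_to_simplex(v):
    """Euclidean projection of v onto {a : a_k >= 0, sum_k a_k = 1}."""
    u = sorted(v, reverse=True)
    cumulative, theta = 0.0, 0.0
    for i, u_i in enumerate(u, start=1):
        cumulative += u_i
        t = (cumulative - 1.0) / i
        if u_i - t > 0:  # holds for a prefix of the sorted values
            theta = t
    return [max(x - theta, 0.0) for x in v]


def fit_ancestry_proportions(ref_freqs, cohort_freqs, n_iter=2000):
    """Minimize sum_j (f0_j - sum_k alpha_k * f_kj)^2 over the simplex.

    ref_freqs[k][j] is the frequency of variant j in reference population k;
    cohort_freqs[j] is the frequency of variant j in the cohort of interest.
    """
    n_pop, n_var = len(ref_freqs), len(cohort_freqs)
    alpha = [1.0 / n_pop] * n_pop
    # Conservative step size: 1 / (upper bound on the largest Hessian eigenvalue).
    step = 1.0 / (2.0 * sum(f * f for row in ref_freqs for f in row))
    for _ in range(n_iter):
        residual = [
            sum(alpha[k] * ref_freqs[k][j] for k in range(n_pop)) - cohort_freqs[j]
            for j in range(n_var)
        ]
        gradient = [
            2.0 * sum(residual[j] * ref_freqs[k][j] for j in range(n_var))
            for k in range(n_pop)
        ]
        alpha = project_to_simplex([a - step * g for a, g in zip(alpha, gradient)])
    return alpha


# Toy reference frequencies (3 populations x 6 variants), purely illustrative.
ref_freqs = [
    [0.9, 0.1, 0.8, 0.2, 0.5, 0.3],
    [0.1, 0.9, 0.3, 0.7, 0.5, 0.6],
    [0.4, 0.5, 0.1, 0.9, 0.9, 0.1],
]
# Cohort simulated as 70% of population 0 and 30% of population 1.
cohort_freqs = [0.7 * a + 0.3 * b for a, b in zip(ref_freqs[0], ref_freqs[1])]

alpha = fit_ancestry_proportions(ref_freqs, cohort_freqs)
print([round(a, 3) for a in alpha])
```

With millions of variants a dedicated quadratic programming solver is the more practical choice; the iterative scheme here is only meant to show what the constraints and the objective look like.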

Table 1.

Reference populations with their size (N), and corresponding ancestry proportions (in %) inferred from the proposed snp_ancestry_summary method, for several GWAS summary statistics

Ancestry group	N	BBJ	FinnGen	Perú	Qatar	Africa	GERA	PAGE	BrCa	PrCa	CAD	Body fat	COVID	Eczema	Epilepsy	Urate
Africa (West)	735					30	1.9	27.7	0.3	0.3	2.2	0.7	4	0.2	0.7	2.2
Africa (South)	449					70	0.9	5.9	0.2		1.2	0.3	1.2	0.5	0.3	3.5
Africa (East)	276				13						0.1		0.3			1.9
Africa (North)	268				22			0.5					0.1

Middle East	523				64.6							0.2				1.3

Ashkenazi	1975						4.4	0.5	0.2	1.8	0.4	0.8	0.4	0.6	1.8
Italy	345						4.6		3.1	1.2	9.7	5.5			3.4	0.8
Europe (East)	667						10.5		6.9	11.3	10.5	11.4	13.2	11.7	13.9	10.8
Finland	143 (+ 99)		100				2.4	0.7	9.7	13	5.9	8.8	14.8	12.8	6.5	2
Europe (North West)	4416						59.9	5.6	68.5	64.5	51.8	59.8	61.4	70.9	68	46
Europe (South West)	603						3.5	15.8	4.7	4.5				2.1	2.1

South America	473 (+ 84)			100			4.6	25.4	1.5	0.8	1.6	0.4	1.8	0.5

Sri Lanka	372				0.4		0.4		3.4	1.7	4.7	4.2	1.8			1.9
Pakistan	400							1.1			7	4
Bangladesh	223 (+ 86)							1.6

Asia (East)	961						3.5	1.2	1.2	0.7	2.5	1.2	0.1	0.1	3.1
Japan	240 (+ 104)	100					2.2	9.4	0.4		2.4	2.8	0.7	0.7	0.3	29.8
Philippines	295						1.5	4.6					0.2

Note: Because they are very close ancestry groups, we merge a posteriori the ancestry coefficients α from ‘Ireland’, ‘United Kingdom’ and ‘Scandinavia’ into a single ‘Europe (North West)’ group, and similarly those from ‘Europe (North East)’ and ‘Europe (South East)’ into a single ‘Europe (East)’ group. Citations for the allele frequencies used: the BBJ (Sakaue ), FinnGen (Kurki ), GWAS in Peruvians (Asgari ), GWAS in Qataris (Thareja ), GWAS in Sub-Saharan Africans (Africa; Chen ), GERA (Hoffmann ), PAGE (Wojcik ), breast cancer (BrCa; Michailidou ), prostate cancer (PrCa; Schumacher ), coronary artery disease (CAD; Nikpay ), body fat percentage (Lu ), COVID-19 (The COVID-19 Host Genetics Initiative, 2021), eczema (Paternoster ), epilepsy (The International League Against Epilepsy Consortium on Complex Epilepsies, 2018) and serum urate (Tin ). Several of these GWAS summary statistics have been downloaded through the NHGRI-EBI GWAS Catalog (MacArthur ).

Here, we provide reference allele frequencies for 5 816 590 genetic variants across 21 diverse ancestry groups (which are later merged into 18 groups in Table 1). Moreover, we rely on the projection of our reference allele frequencies onto the PCA (principal component analysis) space computed from the corresponding UKBB (and 1KG) individuals, and also make these principal component (PC) loadings available for download. Instead, we then minimize

$$\sum_{l=1}^{L} \left( p_l^{(0)} - \sum_{k=1}^{K} \alpha_k p_l^{(k)} \right)^2$$

with the same convex constraints on ancestry proportions α, and where L is the number of PCs, K the number of reference populations, $p_l^{(k)}$ is the projection of allele frequencies from population k onto PC l and $p_l^{(0)}$ is the (corrected) projection of allele frequencies from the cohort of interest onto PC l.
Note that we need to correct for the shrinkage when projecting a new dataset (here the allele frequencies from the GWAS summary statistics) onto the PC space (Privé ). Finding the ancestry proportions in the PCA space (rather than using the allele frequencies directly) provides more power to distinguish between close populations, which is the second innovation of this paper. This enables us to use more reference populations in order to get a more fine-grained measure of genetic diversity. The steps required by the proposed method are then:

1. read all summary statistics datasets into R, i.e. the reference allele frequencies and corresponding PC loadings we provide for download as well as the GWAS summary statistics containing the allele frequencies of interest;
2. match variants and alleles between summary statistics and the reference allele frequencies we provide;
3. project allele frequencies onto the PCA space (matrix multiplication);
4. solve the final (small) quadratic programming problem, by relying on R package quadprog (Turlach ).

Steps 3 and 4 are now implemented in function snp_ancestry_summary in our R package bigsnpr (Privé ). Step 2 can be performed using existing function snp_match. A tutorial is provided at https://privefl.github.io/bigsnpr/articles/ancestry.html. All these steps are very fast and overall require a few minutes only for GWAS summary statistics with millions of variants.
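The matching and projection steps (2 and 3) can be sketched in a few lines of plain Python. This is an illustration under stated assumptions, not the logic of snp_match or snp_ancestry_summary: the record layout, the function names `match_frequencies` and `project_onto_pcs`, and the toy values are hypothetical; strand flips are ignored; and the shrinkage correction is assumed to be a per-PC multiplicative factor applied to the cohort projection.

```python
def match_frequencies(sumstats, reference):
    """Align cohort allele frequencies with the reference alleles.

    A variant is kept if its position is in the reference and its two alleles
    agree, either in the same order or swapped (in which case the frequency f
    becomes 1 - f). Anything else is dropped. Strand flips are not handled.
    """
    ref_by_position = {(r["chr"], r["pos"]): r for r in reference}
    matched_refs, matched_freqs = [], []
    for s in sumstats:
        r = ref_by_position.get((s["chr"], s["pos"]))
        if r is None:
            continue
        if (s["a0"], s["a1"]) == (r["a0"], r["a1"]):
            freq = s["freq"]
        elif (s["a0"], s["a1"]) == (r["a1"], r["a0"]):
            freq = 1.0 - s["freq"]
        else:
            continue
        matched_refs.append(r)
        matched_freqs.append(freq)
    return matched_refs, matched_freqs


def project_onto_pcs(freqs, loadings, correction=None):
    """Project frequencies onto PCs: p_l = sum_j loadings[j][l] * freqs[j].

    `correction` (assumed: one multiplicative factor per PC) compensates for
    the shrinkage of projected data; leave it as None for reference groups.
    """
    n_pcs = len(loadings[0])
    projected = [
        sum(loadings[j][l] * freqs[j] for j in range(len(freqs)))
        for l in range(n_pcs)
    ]
    if correction is not None:
        projected = [p * c for p, c in zip(projected, correction)]
    return projected


# Toy data with two PCs; all values are made up for illustration.
reference = [
    {"chr": 1, "pos": 100, "a0": "A", "a1": "G", "loadings": [0.5, -0.2]},
    {"chr": 1, "pos": 200, "a0": "C", "a1": "T", "loadings": [-0.1, 0.4]},
    {"chr": 2, "pos": 300, "a0": "G", "a1": "T", "loadings": [0.3, 0.3]},
]
sumstats = [
    {"chr": 1, "pos": 100, "a0": "A", "a1": "G", "freq": 0.2},  # same alleles
    {"chr": 1, "pos": 200, "a0": "T", "a1": "C", "freq": 0.7},  # swapped alleles
    {"chr": 2, "pos": 300, "a0": "G", "a1": "A", "freq": 0.4},  # allele mismatch
    {"chr": 3, "pos": 400, "a0": "A", "a1": "C", "freq": 0.5},  # not in reference
]

matched_refs, matched_freqs = match_frequencies(sumstats, reference)
loadings = [r["loadings"] for r in matched_refs]
cohort_projection = project_onto_pcs(matched_freqs, loadings, correction=[1.0, 1.5])
print(len(matched_freqs), cohort_projection)
```

The projected cohort vector and the projected reference vectors (obtained the same way, without correction) are then the inputs to the small quadratic programming problem of step 4.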

3 Results

We download several genome-wide association study (GWAS) summary statistics for which allele frequencies are reported and apply this new method to them. We first apply function snp_ancestry_summary to more homogeneous samples as an empirical validation; when applying it to the Biobank Japan (BBJ; Japanese cohort), FinnGen (Finnish), a Peruvian cohort, a Qatari cohort and a Sub-Saharan African cohort, the ancestry proportions obtained match expectations (Table 1). We then compare our estimates with reported ancestries for more diverse cohorts. For example, PAGE is composed of 44.6% Hispanic-Latinos, 34.7% African-Americans, 9.4% Asians, 7.9% Native Hawaiians and 3.4% of some other ancestries (self-reported), whereas our estimates are 25.4% South American, 22.6% European (including 15.8% from South-West Europe), 34.1% African, 2.7% South Asian, 10.6% East Asian and 4.6% Filipino. GWAS summary statistics from either European ancestries or more diverse ancestries all have a substantial proportion estimated from European ancestry groups, while ancestries from other continents are still largely underrepresented (Table 1).

We then perform three secondary analyses. First, we compare the results obtained previously in Table 1 with the results we would get without using the PCA projection of allele frequencies (i.e. equivalent to the Summix method). The resulting ancestry proportions are presented in Supplementary Table S1 and are clearly less precise for BBJ and FinnGen. Second, we compare previous results with the ones obtained using a smaller number of variants, by randomly sampling 100 000 variants to run the proposed method. The resulting ancestry proportions are presented in Supplementary Table S2 and are highly consistent with the ones from Table 1, showing that 100 000 overlapping variants are enough to run the proposed method. Third, we also infer ancestry proportions for all 345 individuals of the Simons genome diversity project (Mallick ) using the reference allele frequencies we provide and two methods: either our proposed method, with the genotypes of an individual divided by 2 in place of allele frequencies, or the projection analysis of ADMIXTURE (-P, Shringarpure ). Results are very consistent between the two methods, and are overall as expected, further validating the proposed ancestry groups and the proposed method to infer ancestry proportions, which seems very precise even at the individual level.
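The continental totals quoted for PAGE can be recovered by summing the fine-grained estimates in the PAGE column of Table 1. The short check below does this in Python; the assignment of ancestry groups to broader labels follows the text, and the variable names are only illustrative.

```python
# PAGE column of Table 1 (ancestry proportions in %); empty cells are omitted.
page_estimates = {
    "Africa (West)": 27.7, "Africa (South)": 5.9, "Africa (North)": 0.5,
    "Ashkenazi": 0.5, "Finland": 0.7,
    "Europe (North West)": 5.6, "Europe (South West)": 15.8,
    "South America": 25.4,
    "Pakistan": 1.1, "Bangladesh": 1.6,
    "Asia (East)": 1.2, "Japan": 9.4,
    "Philippines": 4.6,
}

# Broader labels used in the text when summarizing the PAGE estimates.
continental_groups = {
    "African": ["Africa (West)", "Africa (South)", "Africa (North)"],
    "European": ["Ashkenazi", "Finland", "Europe (North West)", "Europe (South West)"],
    "South American": ["South America"],
    "South Asian": ["Pakistan", "Bangladesh"],
    "East Asian": ["Asia (East)", "Japan"],
    "Filipino": ["Philippines"],
}

totals = {
    label: round(sum(page_estimates[group] for group in members), 1)
    for label, members in continental_groups.items()
}
for label, value in totals.items():
    print(f"{label}: {value}%")
```

The same kind of a posteriori summation is what merges closely related reference groups, such as ‘Ireland’, ‘United Kingdom’ and ‘Scandinavia’, into a single reported group.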

4 Discussion

Here, we have identified an unprecedentedly large and diverse set of ancestry groups within a single cohort, the UKBB. Using allele frequencies and PCs derived from these ancestry groups, we show how to effectively measure diversity from GWAS summary statistics reporting allele frequencies. Measuring genetic diversity is an important problem because increasing genetic diversity is key to making new genetic discoveries, while also being a major source of confounding to be aware of in genetics studies. Our work has limitations, though. First, it is unknown whether we can effectively capture any existing ancestry as a combination of the 21 reference populations we defined. For example, it seems that Native Hawaiians in the PAGE study are partly captured by the ‘Philippines’ ancestry group we define. Second, with the 21 ancestry groups we define, we probably capture a large proportion of the genetic diversity in Europe, but more fine-grained diversity in other continents may still be lacking. Third, when using the allele frequencies reported in the GWAS summary statistics, it is not clear whether they were computed from all individuals (i.e. before performing any quality control and filtering), and, for meta-analyses of binary traits, whether they were computed as an average weighted by total or by effective sample sizes. Despite these limitations, we envision that the ancestry groups we define here will have many useful applications. The presented method that uses these groups could, for example, be used to automatically report ancestry proportions in the GWAS Catalog (MacArthur ). These ancestry groups could also be used for assigning ancestry in other cohorts using the PC projection from this study (Privé ), assessing the portability of polygenic scores (Privé ) or deriving linkage disequilibrium references matching GWAS summary statistics from diverse ancestries.
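The third limitation can be illustrated with a small numerical sketch. It assumes the commonly used definition of the effective sample size of a case-control study, 4 / (1 / n_cases + 1 / n_controls), and two hypothetical studies with made-up frequencies and sizes; none of these numbers come from the datasets analysed here.

```python
def effective_sample_size(n_cases, n_controls):
    """Commonly used effective sample size of a case-control study."""
    return 4.0 / (1.0 / n_cases + 1.0 / n_controls)


def weighted_frequency(freqs, weights):
    """Weighted average of per-study allele frequencies."""
    return sum(f * w for f, w in zip(freqs, weights)) / sum(weights)


# Two hypothetical studies contributing to a meta-analysis of a binary trait.
studies = [
    {"freq": 0.30, "n_cases": 1000, "n_controls": 9000},  # unbalanced design
    {"freq": 0.10, "n_cases": 2000, "n_controls": 2000},  # balanced design
]

freqs = [s["freq"] for s in studies]
n_total = [s["n_cases"] + s["n_controls"] for s in studies]
n_eff = [effective_sample_size(s["n_cases"], s["n_controls"]) for s in studies]

freq_by_total = weighted_frequency(freqs, n_total)
freq_by_effective = weighted_frequency(freqs, n_eff)
print(round(freq_by_total, 4), round(freq_by_effective, 4))
```

In this toy setting the two weightings give noticeably different meta-analysed frequencies, so the ancestry proportions inferred from them could differ as well when the contributing studies have different ancestries.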

Software and code availability

The newest version of R package bigsnpr can be installed from GitHub (see https://github.com/privefl/bigsnpr) and a recent enough version can be installed from CRAN. A tutorial on ancestry proportions and ancestry grouping is available at https://privefl.github.io/bigsnpr/articles/ancestry.html. The set of reference allele frequencies for 5 816 590 genetic variants across 21 diverse ancestry groups defined here can be downloaded at https://figshare.com/ndownloader/files/31620968 and PC loadings for all variants across 16 PCs at https://figshare.com/ndownloader/files/31620953. All code used for this paper is available at https://github.com/privefl/freq-ancestry/tree/main/code. We have extensively used R packages bigstatsr and bigsnpr (Privé ) for analyzing large genetic data, packages from the future framework (Bengtsson, 2021) for easy scheduling and parallelization of analyses on the high-performance computing cluster and packages from the tidyverse suite (Wickham ) for shaping and visualizing results.

24 in total

1. A positively selected FBN1 missense variant reduces height in Peruvian individuals.

Authors: Samira Asgari; Yang Luo; Ali Akbari; Gillian M Belbin; Xinyi Li; Daniel N Harris; Martin Selig; Eric Bartell; Roger Calderon; Kamil Slowikowski; Carmen Contreras; Rosa Yataco; Jerome T Galea; Judith Jimenez; Julia M Coit; Chandel Farroñay; Rosalynn M Nazarian; Timothy D O'Connor; Harry C Dietz; Joel N Hirschhorn; Heinner Guio; Leonid Lecca; Eimear E Kenny; Esther E Freeman; Megan B Murray; Soumya Raychaudhuri
Journal: Nature Date: 2020-05-13 Impact factor: 49.962

2. A cross-population atlas of genetic associations for 220 human phenotypes.

Authors: Saori Sakaue; Masahiro Kanai; Yosuke Tanigawa; Juha Karjalainen; Mitja Kurki; Seizo Koshiba; Akira Narita; Takahiro Konuma; Kenichi Yamamoto; Masato Akiyama; Kazuyoshi Ishigaki; Akari Suzuki; Ken Suzuki; Wataru Obara; Ken Yamaji; Kazuhisa Takahashi; Satoshi Asai; Yasuo Takahashi; Takao Suzuki; Nobuaki Shinozaki; Hiroki Yamaguchi; Shiro Minami; Shigeo Murayama; Kozo Yoshimori; Satoshi Nagayama; Daisuke Obata; Masahiko Higashiyama; Akihide Masumoto; Yukihiro Koretsune; Kaoru Ito; Chikashi Terao; Toshimasa Yamauchi; Issei Komuro; Takashi Kadowaki; Gen Tamiya; Masayuki Yamamoto; Yusuke Nakamura; Michiaki Kubo; Yoshinori Murakami; Kazuhiko Yamamoto; Yoichiro Kamatani; Aarno Palotie; Manuel A Rivas; Mark J Daly; Koichi Matsuda; Yukinori Okada
Journal: Nat Genet Date: 2021-09-30 Impact factor: 38.330

3. Portability of 245 polygenic scores when derived from the UK Biobank and applied to 9 ancestry groups from the same cohort.

Authors: Florian Privé; Hugues Aschard; Shai Carmi; Lasse Folkersen; Clive Hoggart; Paul F O'Reilly; Bjarni J Vilhjálmsson
Journal: Am J Hum Genet Date: 2022-01-06 Impact factor: 11.043

4. Summix: A method for detecting and adjusting for population structure in genetic summary data.

Authors: Ian S Arriaga-MacKenzie; Gregory Matesi; Samuel Chen; Alexandria Ronco; Katie M Marker; Jordan R Hall; Ryan Scherenberg; Mobin Khajeh-Sharafabadi; Yinfei Wu; Christopher R Gignoux; Megan Null; Audrey E Hendricks
Journal: Am J Hum Genet Date: 2021-06-21 Impact factor: 11.025

5. Multi-ancestry genome-wide association study of 21,000 cases and 95,000 controls identifies new risk loci for atopic dermatitis.

Authors: Lavinia Paternoster; Marie Standl; Johannes Waage; Hansjörg Baurecht; Melanie Hotze; David P Strachan; John A Curtin; Klaus Bønnelykke; Chao Tian; Atsushi Takahashi; Jorge Esparza-Gordillo; Alexessander Couto Alves; Jacob P Thyssen; Herman T den Dekker; Manuel A Ferreira; Elisabeth Altmaier; Patrick Ma Sleiman; Feng Li Xiao; Juan R Gonzalez; Ingo Marenholz; Birgit Kalb; Maria Pino Yanes; Cheng-Jian Xu; Lisbeth Carstensen; Maria M Groen-Blokhuis; Cristina Venturini; Craig E Pennell; Sheila J Barton; Albert M Levin; Ivan Curjuric; Mariona Bustamante; Eskil Kreiner-Møller; Gabrielle A Lockett; Jonas Bacelis; Supinda Bunyavanich; Rachel A Myers; Anja Matanovic; Ashish Kumar; Joyce Y Tung; Tomomitsu Hirota; Michiaki Kubo; Wendy L McArdle; A J Henderson; John P Kemp; Jie Zheng; George Davey Smith; Franz Rüschendorf; Anja Bauerfeind; Min Ae Lee-Kirsch; Andreas Arnold; Georg Homuth; Carsten O Schmidt; Elisabeth Mangold; Sven Cichon; Thomas Keil; Elke Rodríguez; Annette Peters; Andre Franke; Wolfgang Lieb; Natalija Novak; Regina Fölster-Holst; Momoko Horikoshi; Juha Pekkanen; Sylvain Sebert; Lise L Husemoen; Niels Grarup; Johan C de Jongste; Fernando Rivadeneira; Albert Hofman; Vincent Wv Jaddoe; Suzanne Gma Pasmans; Niels J Elbert; André G Uitterlinden; Guy B Marks; Philip J Thompson; Melanie C Matheson; Colin F Robertson; Janina S Ried; Jin Li; Xian Bo Zuo; Xiao Dong Zheng; Xian Yong Yin; Liang Dan Sun; Maeve A McAleer; Grainne M O'Regan; Caoimhe Mr Fahy; Linda E Campbell; Milan Macek; Michael Kurek; Donglei Hu; Celeste Eng; Dirkje S Postma; Bjarke Feenstra; Frank Geller; Jouke Jan Hottenga; Christel M Middeldorp; Pirro Hysi; Veronique Bataille; Tim Spector; Carla Mt Tiesler; Elisabeth Thiering; Badri Pahukasahasram; James J Yang; Medea Imboden; Scott Huntsman; Natàlia Vilor-Tejedor; Caroline L Relton; Ronny Myhre; Wenche Nystad; Adnan Custovic; Scott T Weiss; Deborah A Meyers; Cilla Söderhäll; Erik Melén; Carole Ober; Benjamin A Raby; Angela Simpson; Bo Jacobsson; John W Holloway; Hans Bisgaard; Jordi Sunyer; Nicole M Probst Hensch; L Keoki Williams; Keith M Godfrey; Carol A Wang; Dorret I Boomsma; Mads Melbye; Gerard H Koppelman; Deborah Jarvis; Wh Irwin McLean; Alan D Irvine; Xue Jun Zhang; Hakon Hakonarson; Christian Gieger; Esteban G Burchard; Nicholas G Martin; Liesbeth Duijts; Allan Linneberg; Marjo-Riitta Jarvelin; Markus M Noethen; Susanne Lau; Norbert Hübner; Young-Ae Lee; Mayumi Tamari; David A Hinds; Daniel Glass; Sara J Brown; Joachim Heinrich; David M Evans; Stephan Weidinger
Journal: Nat Genet Date: 2015-10-19 Impact factor: 38.330

6. A large electronic-health-record-based genome-wide study of serum lipids.

Authors: Thomas J Hoffmann; Elizabeth Theusch; Tanushree Haldar; Dilrini K Ranatunga; Eric Jorgenson; Marisa W Medina; Mark N Kvale; Pui-Yan Kwok; Catherine Schaefer; Ronald M Krauss; Carlos Iribarren; Neil Risch
Journal: Nat Genet Date: 2018-03-05 Impact factor: 38.330

7. Insights into human genetic variation and population history from 929 diverse genomes.

Authors: Shane A McCarthy; Ruoyun Hui; Mohamed A Almarri; Yali Xue; Richard Durbin; Chris Tyler-Smith; Anders Bergström; Qasim Ayub; Petr Danecek; Yuan Chen; Sabine Felkel; Pille Hallast; Jack Kamm; Hélène Blanché; Jean-François Deleuze; Howard Cann; Swapan Mallick; David Reich; Manjinder S Sandhu; Pontus Skoglund; Aylwyn Scally
Journal: Science Date: 2020-03-20 Impact factor: 47.728

8. Efficient analysis of large datasets and sex bias with ADMIXTURE.

Authors: Suyash S Shringarpure; Carlos D Bustamante; Kenneth Lange; David H Alexander
Journal: BMC Bioinformatics Date: 2016-05-23 Impact factor: 3.169

9. New loci for body fat percentage reveal link between adiposity and cardiometabolic disease risk.

Authors: Yingchang Lu; Felix R Day; Stefan Gustafsson; Martin L Buchkovich; Jianbo Na; Veronique Bataille; Diana L Cousminer; Zari Dastani; Alexander W Drong; Tõnu Esko; David M Evans; Mario Falchi; Mary F Feitosa; Teresa Ferreira; Åsa K Hedman; Robin Haring; Pirro G Hysi; Mark M Iles; Anne E Justice; Stavroula Kanoni; Vasiliki Lagou; Rui Li; Xin Li; Adam Locke; Chen Lu; Reedik Mägi; John R B Perry; Tune H Pers; Qibin Qi; Marianna Sanna; Ellen M Schmidt; William R Scott; Dmitry Shungin; Alexander Teumer; Anna A E Vinkhuyzen; Ryan W Walker; Harm-Jan Westra; Mingfeng Zhang; Weihua Zhang; Jing Hua Zhao; Zhihong Zhu; Uzma Afzal; Tarunveer Singh Ahluwalia; Stephan J L Bakker; Claire Bellis; Amélie Bonnefond; Katja Borodulin; Aron S Buchman; Tommy Cederholm; Audrey C Choh; Hyung Jin Choi; Joanne E Curran; Lisette C P G M de Groot; Philip L De Jager; Rosalie A M Dhonukshe-Rutten; Anke W Enneman; Elodie Eury; Daniel S Evans; Tom Forsen; Nele Friedrich; Frédéric Fumeron; Melissa E Garcia; Simone Gärtner; Bok-Ghee Han; Aki S Havulinna; Caroline Hayward; Dena Hernandez; Hans Hillege; Till Ittermann; Jack W Kent; Ivana Kolcic; Tiina Laatikainen; Jari Lahti; Irene Mateo Leach; Christine G Lee; Jong-Young Lee; Tian Liu; Youfang Liu; Stéphane Lobbens; Marie Loh; Leo-Pekka Lyytikäinen; Carolina Medina-Gomez; Karl Michaëlsson; Mike A Nalls; Carrie M Nielson; Laticia Oozageer; Laura Pascoe; Lavinia Paternoster; Ozren Polašek; Samuli Ripatti; Mark A Sarzynski; Chan Soo Shin; Nina Smolej Narančić; Dominik Spira; Priya Srikanth; Elisabeth Steinhagen-Thiessen; Yun Ju Sung; Karin M A Swart; Leena Taittonen; Toshiko Tanaka; Emmi Tikkanen; Nathalie van der Velde; Natasja M van Schoor; Niek Verweij; Alan F Wright; Lei Yu; Joseph M Zmuda; Niina Eklund; Terrence Forrester; Niels Grarup; Anne U Jackson; Kati Kristiansson; Teemu Kuulasmaa; Johanna Kuusisto; Peter Lichtner; Jian'an Luan; Anubha Mahajan; Satu Männistö; Cameron D Palmer; Janina S Ried; Robert A Scott; Alena Stancáková; Peter J Wagner; Ayse Demirkan; Angela Döring; Vilmundur Gudnason; Douglas P Kiel; Brigitte Kühnel; Massimo Mangino; Barbara Mcknight; Cristina Menni; Jeffrey R O'Connell; Ben A Oostra; Alan R Shuldiner; Kijoung Song; Liesbeth Vandenput; Cornelia M van Duijn; Peter Vollenweider; Charles C White; Michael Boehnke; Yvonne Boettcher; Richard S Cooper; Nita G Forouhi; Christian Gieger; Harald Grallert; Aroon Hingorani; Torben Jørgensen; Pekka Jousilahti; Mika Kivimaki; Meena Kumari; Markku Laakso; Claudia Langenberg; Allan Linneberg; Amy Luke; Colin A Mckenzie; Aarno Palotie; Oluf Pedersen; Annette Peters; Konstantin Strauch; Bamidele O Tayo; Nicholas J Wareham; David A Bennett; Lars Bertram; John Blangero; Matthias Blüher; Claude Bouchard; Harry Campbell; Nam H Cho; Steven R Cummings; Stefan A Czerwinski; Ilja Demuth; Rahel Eckardt; Johan G Eriksson; Luigi Ferrucci; Oscar H Franco; Philippe Froguel; Ron T Gansevoort; Torben Hansen; Tamara B Harris; Nicholas Hastie; Markku Heliövaara; Albert Hofman; Joanne M Jordan; Antti Jula; Mika Kähönen; Eero Kajantie; Paul B Knekt; Seppo Koskinen; Peter Kovacs; Terho Lehtimäki; Lars Lind; Yongmei Liu; Eric S Orwoll; Clive Osmond; Markus Perola; Louis Pérusse; Olli T Raitakari; Tuomo Rankinen; D C Rao; Treva K Rice; Fernando Rivadeneira; Igor Rudan; Veikko Salomaa; Thorkild I A Sørensen; Michael Stumvoll; Anke Tönjes; Bradford Towne; Gregory J Tranah; Angelo Tremblay; André G Uitterlinden; Pim van der Harst; Erkki Vartiainen; Jorma S Viikari; Veronique Vitart; Marie-Claude Vohl; Henry Völzke; Mark Walker; Henri Wallaschofski; Sarah Wild; James F Wilson; Loïc Yengo; D Timothy Bishop; Ingrid B Borecki; John C Chambers; L Adrienne Cupples; Abbas Dehghan; Panos Deloukas; Ghazaleh Fatemifar; Caroline Fox; Terrence S Furey; Lude Franke; Jiali Han; David J Hunter; Juha Karjalainen; Fredrik Karpe; Robert C Kaplan; Jaspal S Kooner; Mark I McCarthy; Joanne M Murabito; Andrew P Morris; Julia A N Bishop; Kari E North; Claes Ohlsson; Ken K Ong; Inga Prokopenko; J Brent Richards; Eric E Schadt; Tim D Spector; Elisabeth Widén; Cristen J Willer; Jian Yang; Erik Ingelsson; Karen L Mohlke; Joel N Hirschhorn; John Andrew Pospisilik; M Carola Zillikens; Cecilia Lindgren; Tuomas Oskari Kilpeläinen; Ruth J F Loos
Journal: Nat Commun Date: 2016-02-01 Impact factor: 14.919

10. The Simons Genome Diversity Project: 300 genomes from 142 diverse populations.

Authors: Swapan Mallick; Heng Li; Mark Lipson; Iain Mathieson; Melissa Gymrek; Fernando Racimo; Mengyao Zhao; Niru Chennagiri; Susanne Nordenfelt; Arti Tandon; Pontus Skoglund; Iosif Lazaridis; Sriram Sankararaman; Qiaomei Fu; Nadin Rohland; Gabriel Renaud; Yaniv Erlich; Thomas Willems; Carla Gallo; Jeffrey P Spence; Yun S Song; Giovanni Poletti; Francois Balloux; George van Driem; Peter de Knijff; Irene Gallego Romero; Aashish R Jha; Doron M Behar; Claudio M Bravi; Cristian Capelli; Tor Hervig; Andres Moreno-Estrada; Olga L Posukh; Elena Balanovska; Oleg Balanovsky; Sena Karachanak-Yankova; Hovhannes Sahakyan; Draga Toncheva; Levon Yepiskoposyan; Chris Tyler-Smith; Yali Xue; M Syafiq Abdullah; Andres Ruiz-Linares; Cynthia M Beall; Anna Di Rienzo; Choongwon Jeong; Elena B Starikovskaya; Ene Metspalu; Jüri Parik; Richard Villems; Brenna M Henn; Ugur Hodoglugil; Robert Mahley; Antti Sajantila; George Stamatoyannopoulos; Joseph T S Wee; Rita Khusainova; Elza Khusnutdinova; Sergey Litvinov; George Ayodo; David Comas; Michael F Hammer; Toomas Kivisild; William Klitz; Cheryl A Winkler; Damian Labuda; Michael Bamshad; Lynn B Jorde; Sarah A Tishkoff; W Scott Watkins; Mait Metspalu; Stanislav Dryomov; Rem Sukernik; Lalji Singh; Kumarasamy Thangaraj; Svante Pääbo; Janet Kelso; Nick Patterson; David Reich
Journal: Nature Date: 2016-09-21 Impact factor: 49.962

1 in total

1. Identifying and correcting for misspecifications in GWAS summary statistics and polygenic scores.

Authors: Florian Privé; Julyan Arbel; Hugues Aschard; Bjarni J Vilhjálmsson
Journal: HGG Adv Date: 2022-08-18
