Literature DB >> 22057162

A robust clustering algorithm for identifying problematic samples in genome-wide association studies.

Céline Bellenguez¹, Amy Strange, Colin Freeman, Peter Donnelly, Chris C A Spencer.

Abstract

SUMMARY: High-throughput genotyping arrays provide an efficient way to survey single nucleotide polymorphisms (SNPs) across the genome in large numbers of individuals. Downstream analysis of the data, for example in genome-wide association studies (GWAS), often involves statistical models of genotype frequencies across individuals. The complexities of the sample collection process and the potential for errors in the experimental assay can lead to biases and artefacts in an individual's inferred genotypes. Rather than attempting to model these complications, it has become a standard practice to remove individuals whose genome-wide data differ from the sample at large. Here we describe a simple, but robust, statistical algorithm to identify samples with atypical summaries of genome-wide variation. Its use as a semi-automated quality control tool is demonstrated using several summary statistics, selected to identify different potential problems, and it is applied to two different genotyping platforms and sample collections. AVAILABILITY: The algorithm is written in R and is freely available at www.well.ox.ac.uk/chris-spencer CONTACT: chris.spencer@well.ox.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities: Disease Gene Species

Mesh：

Year: 2011 PMID： 22057162 PMCID： PMC3244763 DOI： 10.1093/bioinformatics/btr599

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

1 INTRODUCTION

The advent of new technologies, which can simultaneously genotype hundreds of thousands of single nucleotide polymorphisms (SNPs) across the genome, has permitted large-scale studies of human genetic variation. A major application of these technologies is to undertake genome-wide association studies (GWAS) to identify SNPs that correlate with phenotypes such as disease. An important step in providing convincing evidence of association is to argue that the observed correlation is not an artefact of either the sampling strategy (for example, hidden population structure) or systematic biases in inferring genotypes (for example, differences in call rates). In doing so, it has become standard practice to calculate summaries of genome-wide variation that are not expected to vary systematically between study individuals, and then to identify and remove outlying individuals. Under the correct statistical model, losing data (that is collected at some expense) nearly always results in reduced statistical power to detect real effects. However, when the model fails to capture the data generating process, inclusion of outlying individuals often leads to an increase in false positives. Exclusion of individuals prior to analysis is a trade-off between loss of power due to reduced sample size, and the benefit of controlling the number of false positives. The typical approach to identify potentially problematic samples is to calculate summary statistics of genome-wide data and then, by visualizing their distributions across individuals, to manually choose a threshold based on their values for the majority of the data. To automate this process requires an algorithm to infer the distribution of ‘normal’ study individuals, therefore allowing inference of outliers. For the approach to be applicable in many settings (different summary statistics, genotyping platforms and sample collections) requires a robust model for the outlying individuals.

2 METHODS

Inference of outliers

We implemented a simple mixture model to identify individuals with atypical genome-wide patterns of diversity as measured by m summary statistics of their genotypes or SNP assay intensities data X; S1(X),…, S(X)(i=1,…, n with n the number of individuals). To do so, we assume each individual is either ‘normal’ or an ‘outlier’, which we index by Z∈{0, 1}, and use a Bayesian approach to infer the posterior probability of each individual's membership to the two classes. As summary statistics are averages of many (typically over 500,000) SNPs or assays, the central limit theorem should apply to these statistics across individuals. We consider the distribution of the m summary statistics to be sufficiently well described by independent Gaussian distributions in both the normal and outlier class so that Having observed the summary statistics, our knowledge of which individuals are outliers is given by the posterior distribution where P(Z, μ, σ2) is the prior distribution. Integrals of this form arise commonly in Bayesian statistics, and it is often not possible to compute them directly. However, there are efficient Monte Carlo methods to sample from the distribution of the unobserved labels Z and the model parameters, μ and σ2, conditional on the observed data S(X): We used Gibbs sampling to obtain T samples from this joint posterior distribution. The posterior probability of the i-th individual being an outlier is then estimated as where Z( is the class membership of the i-th individual for the sample t and I is the indicator function. An individual is then considered as an outlier if its estimated posterior probability of being an outlier is >50%. This approach easily generalizes to correlated summary statistics. Here we consider only two summary statistics jointly, but the model could be extended to more. Information on either the distribution of summary statistics for normal individuals, perhaps from previous analysis, or the fraction of individuals which are outlying can both be specified through prior distributions. See Supplementary Material for details.

Approximation for robust detection of outliers

To facilitate the use of Gibbs sampling, we used conjugate priors for the model parameters, except for the variance of the outlier class. To ensure identifiability, we assume that the SD of the outliers (for which Z=1) is factor λ larger than the SD of the normal individuals (for which Z=0) so that: The parameter λ is fixed a priori and controls the stringency of the outlier classification. Using this hard prior assumption, the variance of outlier class is completely determined by the variance of the normal class. We made the additional assumption that the percentage of outlier samples is small, so that all the information about the variance of the normal class is assumed to come from the normal individuals: This assumption adds robustness to the model: the distribution of the outliers will have little impact on the model fit which, because of the light tails of the Guassian distribution, can be heavily influenced by outlying observations. The approximation is similar in motivation to the concept of trimmed-likelihoods, where the likelihood is computed after trimming the least likely observations (Hadi and Luceno, 1997) or perhaps also to contamination models, where the influence of the outliers goes to zero.

3 APPLICATION

We applied the clustering approach independently to four different control datasets genotyped as part of the Wellcome Trust Case Control Consortium 2 (WTCCC2). These comprised 2918 samples from the 1958 Birth Cohort (58C) and 2530 National Blood Service controls (UKBS) genotyped on the Affymetrix Genome-Wide Human SNP 6.0 and the Illumina custom Human 1.2M-Duo chips. We considered four different quality control criteria, based on summaries of each individual's genotypes or probe intensities: Results are shown in Figure 1 and Supplementary Figures S1–S3 for, 58C and UKBS samples genotyped on Affymetrix and Illumina platforms, respectively. As well as being statistically principled, in practice, it is helpful that, once the prior distributions have been specified, identification of outliers is automatic. Empirically, it appears to make sensible inference for a range of normal and outlier distributions, suggesting it is useful for quality control in GWAS [successfully applied in, for example Genetic Analysis of Psoriasis Consortium & the WTCCC2 (2010); The International Multiple Sclerosis Genetics Consortium & the WTCCC2 (2011); The UK IBD Genetics Consortium & the WTCCC2 (2009)] and perhaps in other settings.

Fig. 1.

Outlier identification for 2918 58C samples genotyped on Affymetrix Genome-Wide Human SNP 6.0. ‘Normal’ individuals are coloured from black to grey, with darker colours denoting higher density of individuals. Outliers are coloured from orange to red, with redder colours denoting higher posterior probability of being an outlier. The 99% confidence ellipse of the inferred distribution of ‘normal’ individuals is shown as a dashed grey line.

Genotyping bias: genome-wide heterozygosity (the fraction of heterozygote calls) and call rate (the proportion of missing genotypes). Indicative of assay failure, contamination or inbreeeding. Ancestry: projection of individual's genotypes onto two axes of variation which differentiate individuals with European, Asian and African ancestry. Indicative of individuals with atypical ancestry with respect to the majority of the sample. Intensity: genome-wide average of the probe intensities which target the two alleles at each autosomal SNP. Indicative of partial assay failure or insufficient normalization. Gender: for females and males separately, the mean probe intensities across SNPs on chromosome X. Indicative of incorrect gender assignment. Outlier identification for 2918 58C samples genotyped on Affymetrix Genome-Wide Human SNP 6.0. ‘Normal’ individuals are coloured from black to grey, with darker colours denoting higher density of individuals. Outliers are coloured from orange to red, with redder colours denoting higher posterior probability of being an outlier. The 99% confidence ellipse of the inferred distribution of ‘normal’ individuals is shown as a dashed grey line. Funding: This work was supported by Wellcome Trust awards 090532/Z/09/Z, 075491/Z/04/B and 084575/Z/08/Z. PD was supported in part by a Royal Society Wolfson Merit Award and CCAS by a Nuffield Department of Medicine Scientific Leadership Fellowship. Conflict of Interest: none declared.

3 in total

1. A genome-wide association study identifies new psoriasis susceptibility loci and an interaction between HLA-C and ERAP1.

Authors: Amy Strange; Francesca Capon; Chris C A Spencer; Jo Knight; Michael E Weale; Michael H Allen; Anne Barton; Gavin Band; Céline Bellenguez; Judith G M Bergboer; Jenefer M Blackwell; Elvira Bramon; Suzannah J Bumpstead; Juan P Casas; Michael J Cork; Aiden Corvin; Panos Deloukas; Alexander Dilthey; Audrey Duncanson; Sarah Edkins; Xavier Estivill; Oliver Fitzgerald; Colin Freeman; Emiliano Giardina; Emma Gray; Angelika Hofer; Ulrike Hüffmeier; Sarah E Hunt; Alan D Irvine; Janusz Jankowski; Brian Kirby; Cordelia Langford; Jesús Lascorz; Joyce Leman; Stephen Leslie; Lotus Mallbris; Hugh S Markus; Christopher G Mathew; W H Irwin McLean; Ross McManus; Rotraut Mössner; Loukas Moutsianas; Asa T Naluai; Frank O Nestle; Giuseppe Novelli; Alexandros Onoufriadis; Colin N A Palmer; Carlo Perricone; Matti Pirinen; Robert Plomin; Simon C Potter; Ramon M Pujol; Anna Rautanen; Eva Riveira-Munoz; Anthony W Ryan; Wolfgang Salmhofer; Lena Samuelsson; Stephen J Sawcer; Joost Schalkwijk; Catherine H Smith; Mona Ståhle; Zhan Su; Rachid Tazi-Ahnini; Heiko Traupe; Ananth C Viswanathan; Richard B Warren; Wolfgang Weger; Katarina Wolk; Nicholas Wood; Jane Worthington; Helen S Young; Patrick L J M Zeeuwen; Adrian Hayday; A David Burden; Christopher E M Griffiths; Juha Kere; André Reis; Gilean McVean; David M Evans; Matthew A Brown; Jonathan N Barker; Leena Peltonen; Peter Donnelly; Richard C Trembath
Journal: Nat Genet Date: 2010-10-17 Impact factor: 38.330

2. Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis.

Authors: Stephen Sawcer; Garrett Hellenthal; Matti Pirinen; Chris C A Spencer; Nikolaos A Patsopoulos; Loukas Moutsianas; Alexander Dilthey; Zhan Su; Colin Freeman; Sarah E Hunt; Sarah Edkins; Emma Gray; David R Booth; Simon C Potter; An Goris; Gavin Band; Annette Bang Oturai; Amy Strange; Janna Saarela; Céline Bellenguez; Bertrand Fontaine; Matthew Gillman; Bernhard Hemmer; Rhian Gwilliam; Frauke Zipp; Alagurevathi Jayakumar; Roland Martin; Stephen Leslie; Stanley Hawkins; Eleni Giannoulatou; Sandra D'alfonso; Hannah Blackburn; Filippo Martinelli Boneschi; Jennifer Liddle; Hanne F Harbo; Marc L Perez; Anne Spurkland; Matthew J Waller; Marcin P Mycko; Michelle Ricketts; Manuel Comabella; Naomi Hammond; Ingrid Kockum; Owen T McCann; Maria Ban; Pamela Whittaker; Anu Kemppinen; Paul Weston; Clive Hawkins; Sara Widaa; John Zajicek; Serge Dronov; Neil Robertson; Suzannah J Bumpstead; Lisa F Barcellos; Rathi Ravindrarajah; Roby Abraham; Lars Alfredsson; Kristin Ardlie; Cristin Aubin; Amie Baker; Katharine Baker; Sergio E Baranzini; Laura Bergamaschi; Roberto Bergamaschi; Allan Bernstein; Achim Berthele; Mike Boggild; Jonathan P Bradfield; David Brassat; Simon A Broadley; Dorothea Buck; Helmut Butzkueven; Ruggero Capra; William M Carroll; Paola Cavalla; Elisabeth G Celius; Sabine Cepok; Rosetta Chiavacci; Françoise Clerget-Darpoux; Katleen Clysters; Giancarlo Comi; Mark Cossburn; Isabelle Cournu-Rebeix; Mathew B Cox; Wendy Cozen; Bruce A C Cree; Anne H Cross; Daniele Cusi; Mark J Daly; Emma Davis; Paul I W de Bakker; Marc Debouverie; Marie Beatrice D'hooghe; Katherine Dixon; Rita Dobosi; Bénédicte Dubois; David Ellinghaus; Irina Elovaara; Federica Esposito; Claire Fontenille; Simon Foote; Andre Franke; Daniela Galimberti; Angelo Ghezzi; Joseph Glessner; Refujia Gomez; Olivier Gout; Colin Graham; Struan F A Grant; Franca Rosa Guerini; Hakon Hakonarson; Per Hall; Anders Hamsten; Hans-Peter Hartung; Rob N Heard; Simon Heath; Jeremy Hobart; Muna Hoshi; Carmen Infante-Duarte; Gillian Ingram; Wendy Ingram; Talat Islam; Maja Jagodic; Michael Kabesch; Allan G Kermode; Trevor J Kilpatrick; Cecilia Kim; Norman Klopp; Keijo Koivisto; Malin Larsson; Mark Lathrop; Jeannette S Lechner-Scott; Maurizio A Leone; Virpi Leppä; Ulrika Liljedahl; Izaura Lima Bomfim; Robin R Lincoln; Jenny Link; Jianjun Liu; Aslaug R Lorentzen; Sara Lupoli; Fabio Macciardi; Thomas Mack; Mark Marriott; Vittorio Martinelli; Deborah Mason; Jacob L McCauley; Frank Mentch; Inger-Lise Mero; Tania Mihalova; Xavier Montalban; John Mottershead; Kjell-Morten Myhr; Paola Naldi; William Ollier; Alison Page; Aarno Palotie; Jean Pelletier; Laura Piccio; Trevor Pickersgill; Fredrik Piehl; Susan Pobywajlo; Hong L Quach; Patricia P Ramsay; Mauri Reunanen; Richard Reynolds; John D Rioux; Mariaemma Rodegher; Sabine Roesner; Justin P Rubio; Ina-Maria Rückert; Marco Salvetti; Erika Salvi; Adam Santaniello; Catherine A Schaefer; Stefan Schreiber; Christian Schulze; Rodney J Scott; Finn Sellebjerg; Krzysztof W Selmaj; David Sexton; Ling Shen; Brigid Simms-Acuna; Sheila Skidmore; Patrick M A Sleiman; Cathrine Smestad; Per Soelberg Sørensen; Helle Bach Søndergaard; Jim Stankovich; Richard C Strange; Anna-Maija Sulonen; Emilie Sundqvist; Ann-Christine Syvänen; Francesca Taddeo; Bruce Taylor; Jenefer M Blackwell; Pentti Tienari; Elvira Bramon; Ayman Tourbah; Matthew A Brown; Ewa Tronczynska; Juan P Casas; Niall Tubridy; Aiden Corvin; Jane Vickery; Janusz Jankowski; Pablo Villoslada; Hugh S Markus; Kai Wang; Christopher G Mathew; James Wason; Colin N A Palmer; H-Erich Wichmann; Robert Plomin; Ernest Willoughby; Anna Rautanen; Juliane Winkelmann; Michael Wittig; Richard C Trembath; Jacqueline Yaouanq; Ananth C Viswanathan; Haitao Zhang; Nicholas W Wood; Rebecca Zuvich; Panos Deloukas; Cordelia Langford; Audrey Duncanson; Jorge R Oksenberg; Margaret A Pericak-Vance; Jonathan L Haines; Tomas Olsson; Jan Hillert; Adrian J Ivinson; Philip L De Jager; Leena Peltonen; Graeme J Stewart; David A Hafler; Stephen L Hauser; Gil McVean; Peter Donnelly; Alastair Compston
Journal: Nature Date: 2011-08-10 Impact factor: 49.962

3. Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region.

Authors: Jeffrey C Barrett; James C Lee; Charles W Lees; Natalie J Prescott; Carl A Anderson; Anne Phillips; Emma Wesley; Kirstie Parnell; Hu Zhang; Hazel Drummond; Elaine R Nimmo; Dunecan Massey; Kasia Blaszczyk; Timothy Elliott; Lynn Cotterill; Helen Dallal; Alan J Lobo; Craig Mowat; Jeremy D Sanderson; Derek P Jewell; William G Newman; Cathryn Edwards; Tariq Ahmad; John C Mansfield; Jack Satsangi; Miles Parkes; Christopher G Mathew; Peter Donnelly; Leena Peltonen; Jenefer M Blackwell; Elvira Bramon; Matthew A Brown; Juan P Casas; Aiden Corvin; Nicholas Craddock; Panos Deloukas; Audrey Duncanson; Janusz Jankowski; Hugh S Markus; Christopher G Mathew; Mark I McCarthy; Colin N A Palmer; Robert Plomin; Anna Rautanen; Stephen J Sawcer; Nilesh Samani; Richard C Trembath; Anath C Viswanathan; Nicholas Wood; Chris C A Spencer; Jeffrey C Barrett; Céline Bellenguez; Daniel Davison; Colin Freeman; Amy Strange; Peter Donnelly; Cordelia Langford; Sarah E Hunt; Sarah Edkins; Rhian Gwilliam; Hannah Blackburn; Suzannah J Bumpstead; Serge Dronov; Matthew Gillman; Emma Gray; Naomi Hammond; Alagurevathi Jayakumar; Owen T McCann; Jennifer Liddle; Marc L Perez; Simon C Potter; Radhi Ravindrarajah; Michelle Ricketts; Matthew Waller; Paul Weston; Sara Widaa; Pamela Whittaker; Panos Deloukas; Leena Peltonen; Christopher G Mathew; Jenefer M Blackwell; Matthew A Brown; Aiden Corvin; Mark I McCarthy; Chris C A Spencer; Antony P Attwood; Jonathan Stephens; Jennifer Sambrook; Willem H Ouwehand; Wendy L McArdle; Susan M Ring; David P Strachan
Journal: Nat Genet Date: 2009-11-15 Impact factor: 38.330

3 in total

38 in total

1. Biomarkers of kidney function and cognitive ability: A Mendelian randomization study.

Authors: Erin L Richard; Linda K McEvoy; Steven Y Cao; Eyal Oren; John E Alcaraz; Andrea Z LaCroix; Rany M Salem
Journal: J Neurol Sci Date: 2021-09-14 Impact factor: 4.553

2. Integrative analysis of metabolite GWAS illuminates the molecular basis of pleiotropy and genetic correlation.

Authors: Courtney J Smith; Nasa Sinnott-Armstrong; Anna Cichońska; Heli Julkunen; Eric B Fauman; Peter Würtz; Jonathan K Pritchard
Journal: Elife Date: 2022-09-08 Impact factor: 8.713

3. Genome-wide association study implicates HLA-C*01:02 as a risk factor at the major histocompatibility complex locus in schizophrenia.

Authors:
Journal: Biol Psychiatry Date: 2012-08-09 Impact factor: 13.382

4. Heritability of Atrial Fibrillation.

Authors: Lu-Chen Weng; Seung Hoan Choi; Derek Klarin; J Gustav Smith; Po-Ru Loh; Mark Chaffin; Carolina Roselli; Olivia L Hulme; Kathryn L Lunetta; Josée Dupuis; Emelia J Benjamin; Christopher Newton-Cheh; Sekar Kathiresan; Patrick T Ellinor; Steven A Lubitz
Journal: Circ Cardiovasc Genet Date: 2017-12

5. Common variants in the HLA-DRB1-HLA-DQA1 HLA class II region are associated with susceptibility to visceral leishmaniasis.

Authors: Michaela Fakiola; Amy Strange; Heather J Cordell; E Nancy Miller; Matti Pirinen; Zhan Su; Anshuman Mishra; Sanjana Mehrotra; Gloria R Monteiro; Gavin Band; Céline Bellenguez; Serge Dronov; Sarah Edkins; Colin Freeman; Eleni Giannoulatou; Emma Gray; Sarah E Hunt; Henio G Lacerda; Cordelia Langford; Richard Pearson; Núbia N Pontes; Madhukar Rai; Shri P Singh; Linda Smith; Olivia Sousa; Damjan Vukcevic; Elvira Bramon; Matthew A Brown; Juan P Casas; Aiden Corvin; Audrey Duncanson; Janusz Jankowski; Hugh S Markus; Christopher G Mathew; Colin N A Palmer; Robert Plomin; Anna Rautanen; Stephen J Sawcer; Richard C Trembath; Ananth C Viswanathan; Nicholas W Wood; Mary E Wilson; Panos Deloukas; Leena Peltonen; Frank Christiansen; Campbell Witt; Selma M B Jeronimo; Shyam Sundar; Chris C A Spencer; Jenefer M Blackwell; Peter Donnelly
Journal: Nat Genet Date: 2013-01-06 Impact factor: 38.330

6. Genome-wide association study identifies a variant in HDAC9 associated with large vessel ischemic stroke.

Authors: Céline Bellenguez; Steve Bevan; Andreas Gschwendtner; Chris C A Spencer; Annette I Burgess; Matti Pirinen; Caroline A Jackson; Matthew Traylor; Amy Strange; Zhan Su; Gavin Band; Paul D Syme; Rainer Malik; Joanna Pera; Bo Norrving; Robin Lemmens; Colin Freeman; Renata Schanz; Tom James; Deborah Poole; Lee Murphy; Helen Segal; Lynelle Cortellini; Yu-Ching Cheng; Daniel Woo; Michael A Nalls; Bertram Müller-Myhsok; Christa Meisinger; Udo Seedorf; Helen Ross-Adams; Steven Boonen; Dorota Wloch-Kopec; Valerie Valant; Julia Slark; Karen Furie; Hossein Delavaran; Cordelia Langford; Panos Deloukas; Sarah Edkins; Sarah Hunt; Emma Gray; Serge Dronov; Leena Peltonen; Solveig Gretarsdottir; Gudmar Thorleifsson; Unnur Thorsteinsdottir; Kari Stefansson; Giorgio B Boncoraglio; Eugenio A Parati; John Attia; Elizabeth Holliday; Chris Levi; Maria-Grazia Franzosi; Anuj Goel; Anna Helgadottir; Jenefer M Blackwell; Elvira Bramon; Matthew A Brown; Juan P Casas; Aiden Corvin; Audrey Duncanson; Janusz Jankowski; Christopher G Mathew; Colin N A Palmer; Robert Plomin; Anna Rautanen; Stephen J Sawcer; Richard C Trembath; Ananth C Viswanathan; Nicholas W Wood; Bradford B Worrall; Steven J Kittner; Braxton D Mitchell; Brett Kissela; James F Meschia; Vincent Thijs; Arne Lindgren; Mary Joan Macleod; Agnieszka Slowik; Matthew Walters; Jonathan Rosand; Pankaj Sharma; Martin Farrall; Cathie L M Sudlow; Peter M Rothwell; Martin Dichgans; Peter Donnelly; Hugh S Markus
Journal: Nat Genet Date: 2012-02-05 Impact factor: 38.330

7. Multi-ethnic genome-wide association study for atrial fibrillation.

Authors: Carolina Roselli; Mark D Chaffin; Lu-Chen Weng; Stefanie Aeschbacher; Gustav Ahlberg; Christine M Albert; Peter Almgren; Alvaro Alonso; Christopher D Anderson; Krishna G Aragam; Dan E Arking; John Barnard; Traci M Bartz; Emelia J Benjamin; Nathan A Bihlmeyer; Joshua C Bis; Heather L Bloom; Eric Boerwinkle; Erwin B Bottinger; Jennifer A Brody; Hugh Calkins; Archie Campbell; Thomas P Cappola; John Carlquist; Daniel I Chasman; Lin Y Chen; Yii-Der Ida Chen; Eue-Keun Choi; Seung Hoan Choi; Ingrid E Christophersen; Mina K Chung; John W Cole; David Conen; James Cook; Harry J Crijns; Michael J Cutler; Scott M Damrauer; Brian R Daniels; Dawood Darbar; Graciela Delgado; Joshua C Denny; Martin Dichgans; Marcus Dörr; Elton A Dudink; Samuel C Dudley; Nada Esa; Tonu Esko; Markku Eskola; Diane Fatkin; Stephan B Felix; Ian Ford; Oscar H Franco; Bastiaan Geelhoed; Raji P Grewal; Vilmundur Gudnason; Xiuqing Guo; Namrata Gupta; Stefan Gustafsson; Rebecca Gutmann; Anders Hamsten; Tamara B Harris; Caroline Hayward; Susan R Heckbert; Jussi Hernesniemi; Lynne J Hocking; Albert Hofman; Andrea R V R Horimoto; Jie Huang; Paul L Huang; Jennifer Huffman; Erik Ingelsson; Esra Gucuk Ipek; Kaoru Ito; Jordi Jimenez-Conde; Renee Johnson; J Wouter Jukema; Stefan Kääb; Mika Kähönen; Yoichiro Kamatani; John P Kane; Adnan Kastrati; Sekar Kathiresan; Petra Katschnig-Winter; Maryam Kavousi; Thorsten Kessler; Bas L Kietselaer; Paulus Kirchhof; Marcus E Kleber; Stacey Knight; Jose E Krieger; Michiaki Kubo; Lenore J Launer; Jari Laurikka; Terho Lehtimäki; Kirsten Leineweber; Rozenn N Lemaitre; Man Li; Hong Euy Lim; Henry J Lin; Honghuang Lin; Lars Lind; Cecilia M Lindgren; Marja-Liisa Lokki; Barry London; Ruth J F Loos; Siew-Kee Low; Yingchang Lu; Leo-Pekka Lyytikäinen; Peter W Macfarlane; Patrik K Magnusson; Anubha Mahajan; Rainer Malik; Alfredo J Mansur; Gregory M Marcus; Lauren Margolin; Kenneth B Margulies; Winfried März; David D McManus; Olle Melander; Sanghamitra Mohanty; Jay A Montgomery; Michael P Morley; Andrew P Morris; Martina Müller-Nurasyid; Andrea Natale; Saman Nazarian; Benjamin Neumann; Christopher Newton-Cheh; Maartje N Niemeijer; Kjell Nikus; Peter Nilsson; Raymond Noordam; Heidi Oellers; Morten S Olesen; Marju Orho-Melander; Sandosh Padmanabhan; Hui-Nam Pak; Guillaume Paré; Nancy L Pedersen; Joanna Pera; Alexandre Pereira; David Porteous; Bruce M Psaty; Sara L Pulit; Clive R Pullinger; Daniel J Rader; Lena Refsgaard; Marta Ribasés; Paul M Ridker; Michiel Rienstra; Lorenz Risch; Dan M Roden; Jonathan Rosand; Michael A Rosenberg; Natalia Rost; Jerome I Rotter; Samir Saba; Roopinder K Sandhu; Renate B Schnabel; Katharina Schramm; Heribert Schunkert; Claudia Schurman; Stuart A Scott; Ilkka Seppälä; Christian Shaffer; Svati Shah; Alaa A Shalaby; Jaemin Shim; M Benjamin Shoemaker; Joylene E Siland; Juha Sinisalo; Moritz F Sinner; Agnieszka Slowik; Albert V Smith; Blair H Smith; J Gustav Smith; Jonathan D Smith; Nicholas L Smith; Elsayed Z Soliman; Nona Sotoodehnia; Bruno H Stricker; Albert Sun; Han Sun; Jesper H Svendsen; Toshihiro Tanaka; Kahraman Tanriverdi; Kent D Taylor; Maris Teder-Laving; Alexander Teumer; Sébastien Thériault; Stella Trompet; Nathan R Tucker; Arnljot Tveit; Andre G Uitterlinden; Pim Van Der Harst; Isabelle C Van Gelder; David R Van Wagoner; Niek Verweij; Efthymia Vlachopoulou; Uwe Völker; Biqi Wang; Peter E Weeke; Bob Weijs; Raul Weiss; Stefan Weiss; Quinn S Wells; Kerri L Wiggins; Jorge A Wong; Daniel Woo; Bradford B Worrall; Pil-Sung Yang; Jie Yao; Zachary T Yoneda; Tanja Zeller; Lingyao Zeng; Steven A Lubitz; Kathryn L Lunetta; Patrick T Ellinor
Journal: Nat Genet Date: 2018-06-11 Impact factor: 38.330

8. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer's disease.

Authors: J C Lambert; C A Ibrahim-Verbaas; D Harold; A C Naj; R Sims; C Bellenguez; A L DeStafano; J C Bis; G W Beecham; B Grenier-Boley; G Russo; T A Thorton-Wells; N Jones; A V Smith; V Chouraki; C Thomas; M A Ikram; D Zelenika; B N Vardarajan; Y Kamatani; C F Lin; A Gerrish; H Schmidt; B Kunkle; M L Dunstan; A Ruiz; M T Bihoreau; S H Choi; C Reitz; F Pasquier; C Cruchaga; D Craig; N Amin; C Berr; O L Lopez; P L De Jager; V Deramecourt; J A Johnston; D Evans; S Lovestone; L Letenneur; F J Morón; D C Rubinsztein; G Eiriksdottir; K Sleegers; A M Goate; N Fiévet; M W Huentelman; M Gill; K Brown; M I Kamboh; L Keller; P Barberger-Gateau; B McGuiness; E B Larson; R Green; A J Myers; C Dufouil; S Todd; D Wallon; S Love; E Rogaeva; J Gallacher; P St George-Hyslop; J Clarimon; A Lleo; A Bayer; D W Tsuang; L Yu; M Tsolaki; P Bossù; G Spalletta; P Proitsi; J Collinge; S Sorbi; F Sanchez-Garcia; N C Fox; J Hardy; M C Deniz Naranjo; P Bosco; R Clarke; C Brayne; D Galimberti; M Mancuso; F Matthews; S Moebus; P Mecocci; M Del Zompo; W Maier; H Hampel; A Pilotto; M Bullido; F Panza; P Caffarra; B Nacmias; J R Gilbert; M Mayhaus; L Lannefelt; H Hakonarson; S Pichler; M M Carrasquillo; M Ingelsson; D Beekly; V Alvarez; F Zou; O Valladares; S G Younkin; E Coto; K L Hamilton-Nelson; W Gu; C Razquin; P Pastor; I Mateo; M J Owen; K M Faber; P V Jonsson; O Combarros; M C O'Donovan; L B Cantwell; H Soininen; D Blacker; S Mead; T H Mosley; D A Bennett; T B Harris; L Fratiglioni; C Holmes; R F de Bruijn; P Passmore; T J Montine; K Bettens; J I Rotter; A Brice; K Morgan; T M Foroud; W A Kukull; D Hannequin; J F Powell; M A Nalls; K Ritchie; K L Lunetta; J S Kauwe; E Boerwinkle; M Riemenschneider; M Boada; M Hiltuenen; E R Martin; R Schmidt; D Rujescu; L S Wang; J F Dartigues; R Mayeux; C Tzourio; A Hofman; M M Nöthen; C Graff; B M Psaty; L Jones; J L Haines; P A Holmans; M Lathrop; M A Pericak-Vance; L J Launer; L A Farrer; C M van Duijn; C Van Broeckhoven; V Moskvina; S Seshadri; J Williams; G D Schellenberg; P Amouyel
Journal: Nat Genet Date: 2013-10-27 Impact factor: 38.330

9. Determinants of penetrance and variable expressivity in monogenic metabolic conditions across 77,184 exomes.

Authors: Julia K Goodrich; Moriel Singer-Berk; Rachel Son; Abigail Sveden; Jordan Wood; Eleina England; Joanne B Cole; Ben Weisburd; Nick Watts; Lizz Caulkins; Peter Dornbos; Ryan Koesterer; Zachary Zappala; Haichen Zhang; Kristin A Maloney; Andy Dahl; Carlos A Aguilar-Salinas; Gil Atzmon; Francisco Barajas-Olmos; Nir Barzilai; John Blangero; Eric Boerwinkle; Lori L Bonnycastle; Erwin Bottinger; Donald W Bowden; Federico Centeno-Cruz; John C Chambers; Nathalie Chami; Edmund Chan; Juliana Chan; Ching-Yu Cheng; Yoon Shin Cho; Cecilia Contreras-Cubas; Emilio Córdova; Adolfo Correa; Ralph A DeFronzo; Ravindranath Duggirala; Josée Dupuis; Ma Eugenia Garay-Sevilla; Humberto García-Ortiz; Christian Gieger; Benjamin Glaser; Clicerio González-Villalpando; Ma Elena Gonzalez; Niels Grarup; Leif Groop; Myron Gross; Christopher Haiman; Sohee Han; Craig L Hanis; Torben Hansen; Nancy L Heard-Costa; Brian E Henderson; Juan Manuel Malacara Hernandez; Mi Yeong Hwang; Sergio Islas-Andrade; Marit E Jørgensen; Hyun Min Kang; Bong-Jo Kim; Young Jin Kim; Heikki A Koistinen; Jaspal Singh Kooner; Johanna Kuusisto; Soo-Heon Kwak; Markku Laakso; Leslie Lange; Jong-Young Lee; Juyoung Lee; Donna M Lehman; Allan Linneberg; Jianjun Liu; Ruth J F Loos; Valeriya Lyssenko; Ronald C W Ma; Angélica Martínez-Hernández; James B Meigs; Thomas Meitinger; Elvia Mendoza-Caamal; Karen L Mohlke; Andrew D Morris; Alanna C Morrison; Maggie C Y Ng; Peter M Nilsson; Christopher J O'Donnell; Lorena Orozco; Colin N A Palmer; Kyong Soo Park; Wendy S Post; Oluf Pedersen; Michael Preuss; Bruce M Psaty; Alexander P Reiner; Cristina Revilla-Monsalve; Stephen S Rich; Jerome I Rotter; Danish Saleheen; Claudia Schurmann; Xueling Sim; Rob Sladek; Kerrin S Small; Wing Yee So; Timothy D Spector; Konstantin Strauch; Tim M Strom; E Shyong Tai; Claudia H T Tam; Yik Ying Teo; Farook Thameem; Brian Tomlinson; Russell P Tracy; Tiinamaija Tuomi; Jaakko Tuomilehto; Teresa Tusié-Luna; Rob M van Dam; Ramachandran S Vasan; James G Wilson; Daniel R Witte; Tien-Yin Wong; Noël P Burtt; Noah Zaitlen; Mark I McCarthy; Michael Boehnke; Toni I Pollin; Jason Flannick; Josep M Mercader; Anne O'Donnell-Luria; Samantha Baxter; Jose C Florez; Daniel G MacArthur; Miriam S Udler
Journal: Nat Commun Date: 2021-06-09 Impact factor: 17.694

10. Dense genotyping of immune-related disease regions identifies nine new risk loci for primary sclerosing cholangitis.

Authors: Jimmy Z Liu; Johannes Roksund Hov; Trine Folseraas; Eva Ellinghaus; Simon M Rushbrook; Nadezhda T Doncheva; Ole A Andreassen; Rinse K Weersma; Tobias J Weismüller; Bertus Eksteen; Pietro Invernizzi; Gideon M Hirschfield; Daniel Nils Gotthardt; Albert Pares; David Ellinghaus; Tejas Shah; Brian D Juran; Piotr Milkiewicz; Christian Rust; Christoph Schramm; Tobias Müller; Brijesh Srivastava; Georgios Dalekos; Markus M Nöthen; Stefan Herms; Juliane Winkelmann; Mitja Mitrovic; Felix Braun; Cyriel Y Ponsioen; Peter J P Croucher; Martina Sterneck; Andreas Teufel; Andrew L Mason; Janna Saarela; Virpi Leppa; Ruslan Dorfman; Domenico Alvaro; Annarosa Floreani; Suna Onengut-Gumuscu; Stephen S Rich; Wesley K Thompson; Andrew J Schork; Sigrid Næss; Ingo Thomsen; Gabriele Mayr; Inke R König; Kristian Hveem; Isabelle Cleynen; Javier Gutierrez-Achury; Isis Ricaño-Ponce; David van Heel; Einar Björnsson; Richard N Sandford; Peter R Durie; Espen Melum; Morten H Vatn; Mark S Silverberg; Richard H Duerr; Leonid Padyukov; Stephan Brand; Miquel Sans; Vito Annese; Jean-Paul Achkar; Kirsten Muri Boberg; Hanns-Ulrich Marschall; Olivier Chazouillères; Christopher L Bowlus; Cisca Wijmenga; Erik Schrumpf; Severine Vermeire; Mario Albrecht; John D Rioux; Graeme Alexander; Annika Bergquist; Judy Cho; Stefan Schreiber; Michael P Manns; Martti Färkkilä; Anders M Dale; Roger W Chapman; Konstantinos N Lazaridis; Andre Franke; Carl A Anderson; Tom H Karlsen
Journal: Nat Genet Date: 2013-04-21 Impact factor: 38.330