RegScan: a GWAS tool for quick estimation of allele effects on continuous traits and their combinations.

Toomas Haller, Mart Kals, Tõnu Esko, Reedik Mägi, Krista Fischer.

Abstract

Genome-wide association studies are becoming computationally more demanding as the amount of data grows. Combinatorial traits can increase the data dimensions beyond the computational capabilities of current tools. We addressed this issue by creating an application for quick association analysis that is ten to hundreds of times faster than the leading fast methods. Our tool (RegScan) is designed to perform basic linear regression analysis with continuous traits as fast as possible on large data sets. RegScan specifically targets association analysis of combinatorial traits in metabolomics. It can both generate and analyze the combinatorial traits efficiently. RegScan is capable of analyzing any number of traits together without the need to specify each trait individually. The main goal of the article is to show that RegScan can be the preferred analytical tool when large amounts of data need to be analyzed quickly using the allele frequency test. AVAILABILITY: Precompiled RegScan (all major platforms), source code, user guide and examples are freely available at www.biobank.ee/regscan. REQUIREMENTS: Qt 4.4.3 or newer for dynamic compilations.

Keywords: GWAS; combinatorial traits; continuous traits; genome-wide analysis; linear regression; metabolomics

Year: 2013 PMID: 24008273 PMCID: PMC4293375 DOI: 10.1093/bib/bbt066

Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622

INTRODUCTION

Genome-Wide Association Studies (GWAS) have successfully identified common variants of the human genome associated with common diseases and traits. International research consortia have uncovered the effects of thousands of genetic markers in complex traits and diseases [1]. Traditionally, linear regression is used to study the association of marker frequencies and continuous traits, and the P-value of association is the main metric for initial filtering [2]. It is becoming commonplace to study tens of thousands of individuals, tens of millions of markers and the combinations of various continuous traits (so-called combinatorial traits), mostly ratios, leading to a large number of combinations to be tested. The number of traits is especially large in metabolomics studies, where the trait ratios are considered interesting owing to their potential to shed light on metabolic pathways [3-6]. These analyses are often limited by the available computational resources. Yet the GWAS consortia, among others, are looking for ways to study combinatorial traits efficiently. The field is in need of a tool with a strong emphasis on speed. We created a tool (RegScan) that considerably accelerates association studies. RegScan performs linear regression analysis (allele frequency test) and identifies the statistically significant associations between markers and traits as fast as possible. Its computational speed benefit is best used in studying large numbers of combinatorial traits.

IMPLEMENTATION

RegScan is a command line application written primarily in C++/Qt [7]. It uses the least squares approach to fit the linear regression model without introducing new modifications to the standard technique. The fast execution speed as compared with the other tools is achieved entirely by using special efficient computational techniques. In some functions, this means resorting to C-style programming instead of C++. Computational overhead is reduced as much as possible by avoiding the slower C++/Qt functions where execution speed is rate limiting and relying on Qt framework only for higher-level functions. Every function is created with the goal to minimize the number of elementary operations and choosing methods that require less time. The order of conducting individual steps is carefully considered and the fastest combinations are favored. The standard techniques of efficient programming, such as reducing time complexity, are honored wherever possible. The main speed gain is derived from an original data file reading mechanism making efficient use of C language function fread(). Reading large data from file and writing it into the file is generally the most time-consuming step. RegScan reduces the number of individual read/write events by reading data in large chunks in the binary mode and then efficiently parsing the data in the memory to retrieve the information. The parsing step is carried out by an original method that sequentially compares each byte in the stream with all integers to block-wise reconstruct the original values. Computational speed gain becomes more pronounced with the increasing number of traits because analyzing additional traits involves no additional significant read events. Care is taken to ensure that the files are read no more than once and the information that is often referred to is stored in the memory early on during runtime. The operations are always carried out with the minimal number of significant digits to save time. 
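The chunked-read idea described above can be illustrated with a short Python sketch. It is not RegScan's implementation (which is written in C/C++ around fread() and uses its own byte-wise parser); the function name `iter_rows`, the default chunk size and the assumption of a whitespace-separated numeric text file are all hypothetical and chosen only for illustration.

```python
import io


def iter_rows(stream, chunk_size=1 << 20):
    """Yield each line of a binary stream as a list of floats.

    The data are read in large binary chunks and split in memory, so the
    number of read calls depends on chunk_size, not on the number of lines.
    """
    tail = b""
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        lines = (tail + chunk).split(b"\n")
        tail = lines.pop()  # the last piece may be an incomplete line
        for line in lines:
            if line.strip():
                yield [float(tok) for tok in line.split()]
    if tail.strip():  # final line without a trailing newline
        yield [float(tok) for tok in tail.split()]


# A tiny in-memory "file"; the very small chunk size forces lines to be
# reassembled across chunk boundaries.
demo = io.BytesIO(b"0 1 0\n0.5 0.5 0\n1 0 0\n")
rows = list(iter_rows(demo, chunk_size=4))
```

With a real file, the same generator could be fed with `open(path, "rb")`; every trait analyzed against a parsed row then reuses the values already held in memory, which is the reason the text gives for the per-trait cost falling as more traits are added.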
Approximations are used if they do not sacrifice analytical quality. Look-up tables are preferred to runtime calculations. If the value that is looked up is not present in the table, fast and accurate interpolation techniques are used to fill in the missing value. This technique is used in the P-value calculation. Additional methods of accelerating the analysis include setting various restrictive filters before the analysis to bypass the calculations that are not of interest to the user. These optional filters are fully controlled by the user and are explained in the RegScan User Manual. RegScan uses allele dosage to perform linear regression analysis. It works with the Oxford GEN/SAMPLE file format [8] and can also read gzip-compressed GEN files. RegScan can automatically generate all possible combinatorial traits and channel them into fast linear regression analysis. It is capable of handling missing phenotypes and has adequate error-catching mechanisms. Although RegScan is designed for the analysis of combinatorial traits, their use is not a requirement. The general workflow of RegScan is presented in Figure 1. The analytical pipeline also uses R (script provided with RegScan) for adjustments and transformations. Effect size, P-value, standard error (SE) of slope and minor allele count can be used to filter the results during runtime if needed. The output can be additionally filtered and analyzed with RegScan to detect true positives or create subsets. RegScan includes a collection of analytical scripts that provide functions additional to the association analysis by linear regression: (i) counting and extracting markers associated with combinatorial traits or traits associated with markers, (ii) applying an unlimited number of filters to identify associations of interest or extract any subset from the results, (iii) evaluating the statistical significance of the combinatorial trait–marker associations (elaborated below). 
If needed, RegScan can be efficiently used to prepare condensed data sets for other tools for the types of analyses not supported by RegScan, as it is capable of quickly assessing the associations.
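The allele frequency test itself is an ordinary least-squares fit of the trait on the allele dosage. The following Python sketch shows the standard formulas under stated assumptions: dosage is taken as the expected count of the second allele computed from GEN-style probability triples, missing phenotypes are represented by `None`, and the P-value uses a normal approximation to the t statistic for brevity. The function names are hypothetical and the code is not taken from RegScan.

```python
import math


def dosages_from_gen_probs(probs):
    """Convert flat (P_AA, P_AB, P_BB) triples to expected B-allele counts."""
    return [probs[i + 1] + 2.0 * probs[i + 2] for i in range(0, len(probs), 3)]


def regress(dosage, trait):
    """Least-squares fit of trait on dosage, skipping missing (None) traits.

    Returns (beta, se, p), or None if the fit is not defined. The two-sided
    P-value is a large-sample normal approximation (an assumption here).
    """
    pairs = [(x, y) for x, y in zip(dosage, trait) if y is not None]
    n = len(pairs)
    if n < 3:
        return None
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    if sxx == 0.0:  # no variation in dosage among the individuals kept
        return None
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    beta = sxy / sxx                      # slope (effect size)
    alpha = mean_y - beta * mean_x        # intercept
    rss = sum((y - alpha - beta * x) ** 2 for x, y in pairs)
    se = math.sqrt(rss / (n - 2) / sxx)   # standard error of the slope
    z = abs(beta) / se if se > 0.0 else math.inf
    p = math.erfc(z / math.sqrt(2.0))
    return beta, se, p


probs = [1, 0, 0,  0, 1, 0,  0, 0, 1,  0, 1, 0,  0.5, 0.5, 0]
dosage = dosages_from_gen_probs(probs)
beta, se, p = regress(dosage, [1.0, 2.1, 2.9, None, 1.6])
```

Because the dosage sums depend only on the marker, they can be reused when several traits with the same set of non-missing individuals are tested against the same marker.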

Figure 1:

General RegScan workflow. Creating combinatorial traits and adjustments/transformations are optional (dashed boxes).

SPEED TESTING

SNPTEST 2.4.1 [9] and QuickTest 0.97 [10] were used as references in the tests, as these two widely used computational tools perform linear regression analysis and output all three commonly used statistical parameters: P-value, effect size (β) and SE. We conducted tests to estimate (i) the analysis speed gain of RegScan over the reference tools and (ii) computational speed as a function of data size and various settings. The results are presented in Supplementary Data. Briefly, RegScan was always the fastest of the software packages compared, and QuickTest the second fastest. QuickTest was therefore chosen for computational speed comparison tests. A typical RegScan analysis was 10× faster than QuickTest on a 2.3 GHz processor (Scientific Linux 6.3) with one trait and various numbers of individuals (Figure 2). RegScan analyzed our test data set (TDS, 38.02 million markers, 3315 individuals) and one trait in 3.4 h (0.34 ms/marker) as opposed to the QuickTest time of 36.2 h. The analysis speed per trait increased significantly when multiple traits were analyzed in one go. Experiments with 6216 traits led to computational times <10 min/trait with the TDS. The linear regression analysis proceeded at 0.011 ms/trait/marker (30× faster than with a single trait).

Figure 2:

Analysis time (RegScan versus QuickTest) with 1 million markers, one trait and variable number of individuals (750–3315). (A) Relative speed gain of RegScan over QuickTest; slope = 10.14. (B) Computational speed of RegScan and QuickTest as a function of the number of individuals. A colour version of this figure is available at BIB online: http://bib.oxfordjournals.org.

The computational times can be additionally shortened by allowing RegScan to allocate more random-access memory (RAM). Further significant speed gain results from setting various restrictive filters such as a higher minor allele count level, lower SE level, etc. Combining all methods of reducing computational time can result in analysis speed that is several orders of magnitude faster than with the other common GWAS tools. Computational speed can be of greatest importance in certain situations such as creating large databases of associations by brute force or studying combinatorial traits. For example, a data set with only 112 metabolite concentrations will create 6216 pair-wise combinations. When analyzed individually with other tools and the TDS (see above) the time spent will be >26 processor years. Analyzing these traits individually with RegScan would take about 2.6 processor years. However, when fed into the RegScan analysis all at once, these traits will require only about one processor month, or even less when applying additional optional RegScan filters.
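The scale of the combinatorial problem follows from simple counting: n single traits give n(n − 1)/2 unordered pairs, which is 6216 for n = 112, the same number of traits as in the speed test above. The Python sketch below builds pairwise ratio traits and reproduces the all-at-once time estimate from the reported 0.011 ms/trait/marker. The function name `ratio_traits`, the dictionary layout and the use of `None` for missing values are illustrative assumptions, not RegScan's file format or interface.

```python
from itertools import combinations


def ratio_traits(phenotypes):
    """Build all pairwise ratio traits from a dict of trait name -> values.

    A ratio is left missing (None) for an individual when either value is
    missing or the denominator is zero.
    """
    ratios = {}
    for a, b in combinations(sorted(phenotypes), 2):
        ratios[f"{a}/{b}"] = [
            None if x is None or y is None or y == 0 else x / y
            for x, y in zip(phenotypes[a], phenotypes[b])
        ]
    return ratios


# Number of unordered trait pairs for 112 metabolites: 112 * 111 / 2.
n_pairs = len(list(combinations(range(112), 2)))

# Time estimate when all ratios are analyzed in one run, using the
# figures reported in the text (38.02 million markers, 0.011 ms/trait/marker).
markers = 38.02e6
ms_per_trait_marker = 0.011
days_all_at_once = n_pairs * markers * ms_per_trait_marker / 1000.0 / 86400.0
```

The estimate comes out at roughly 30 days, in line with the "about one processor month" quoted above.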

QUANTITATIVE COMPARISON OF RESULTS

An allele frequency test was performed with RegScan and the reference tools on a data set of 40 765 randomly chosen markers from the 1000 Genomes reference panel and 873 individuals to quantitatively compare the results [11]. The results for P-value, effect size (β) and SE agreed well between all tools (Table 1). The somewhat larger, albeit minor, P-value differences between RegScan and the other tools originated from a different computational method used—interpolation of precomputed values—and from the rounding effects (Table 2). The tests, however, confirmed that (i) the RegScan results do not differ significantly from the other commonly used tools for the allele frequency test, and (ii) SNPTEST and QuickTest results differed from each other to an extent similar to their differences from the RegScan results.
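To illustrate why interpolating precomputed values produces small but non-zero P-value deviations, here is a minimal Python sketch of a look-up table with linear interpolation. The grid spacing, the use of the normal distribution, the log10 scale and the names `STEP`, `TABLE` and `p_from_table` are all assumptions made for this example; the source does not describe the layout of RegScan's actual table.

```python
import math

STEP = 0.01  # grid spacing of the test statistic (hypothetical choice)

# Precomputed log10 of the two-sided normal tail probability for
# |z| = 0.00, 0.01, ..., 30.00.
TABLE = [math.log10(math.erfc(i * STEP / math.sqrt(2.0))) for i in range(3001)]


def p_from_table(z):
    """Two-sided P-value for statistic z by linear interpolation in TABLE."""
    pos = min(abs(z), 30.0) / STEP
    lo = min(int(pos), len(TABLE) - 2)   # index of the grid point below
    frac = pos - lo                      # position between grid points
    log_p = TABLE[lo] + frac * (TABLE[lo + 1] - TABLE[lo])
    return 10.0 ** log_p


exact = math.erfc(5.005 / math.sqrt(2.0))
approx = p_from_table(5.005)
relative_error = abs(approx / exact - 1.0)
```

In this sketch the interpolated value differs from the directly computed one by a small relative amount that depends on the grid spacing, which is the kind of minor discrepancy the comparison attributes to interpolation and rounding.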

Table 1:

Pearson correlation coefficients between P-, β and SE values computed by SNPTEST (ST), QuickTest (QT) and RegScan (RS) based on 40 765 random markers

Parameter	RS versus QT	RS versus ST	QT versus ST
P-values	0.999998	0.999951	0.99995
β	1	0.999999	0.999999
SE	1	1	1

Table 2:

Deviation (%) between P-, β and SE values computed by SNPTEST (ST), QuickTest (QT) and RegScan (RS) based on 40 765 random markers

Parameter	RS versus QT	RS versus ST	QT versus ST
Mean deviation of P-values (%)	0.119	0.114	0.006
P-values with >5% deviation (%)	0.000	0.010	0.010
P-values with >1% deviation (%)	2.956	2.951	0.010
Mean deviation of β (%)	0.018	0.017	0.036
β values with >5% deviation (%)	0.373	0.383	0.010
β values with >1% deviation (%)	1.820	1.828	0.010
Mean deviation of SE (%)	0.004	0.004	0.0002
SE values with >5% deviation (%)	0.000	0.007	0.007
SE values with >1% deviation (%)	0.000	0.010	0.010

The deviation (%) is calculated as the mean of the deviations of all markers (each calculated as the larger value divided by the smaller value times 100).
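The two comparison metrics in Tables 1 and 2 can be sketched in Python as follows. The footnote defines the per-marker deviation as the larger value divided by the smaller value times 100; this sketch assumes that the tabulated percentages are the excess of that ratio over 100% (an interpretation, not stated in the source), and it compares absolute values. All function names are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def percent_deviation(a, b):
    """Larger/smaller ratio, as a percentage in excess of 100 (assumption).

    Both values must be non-zero.
    """
    big, small = max(abs(a), abs(b)), min(abs(a), abs(b))
    return (big / small - 1.0) * 100.0


def summarize(xs, ys):
    """Summary statistics of the kind reported for each pair of tools."""
    devs = [percent_deviation(x, y) for x, y in zip(xs, ys)]
    n = len(devs)
    return {
        "pearson": pearson(xs, ys),
        "mean_deviation": sum(devs) / n,
        "share_over_5pct": 100.0 * sum(d > 5.0 for d in devs) / n,
        "share_over_1pct": 100.0 * sum(d > 1.0 for d in devs) / n,
    }


summary = summarize([1.0, 2.0, 4.0], [1.0, 2.2, 4.0])
```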

EXAMPLES AND METHOD VALIDATION

We conducted proof-of-principle tests with 38.02 million 1000 Genomes-imputed markers [11], 873 individuals and 44 clinical blood traits together with all of their ratios to validate RegScan. The details are in Supplementary Data. In the first experiment, we tested whether RegScan was capable of detecting associations well established by other tools in other studies. For bilirubin, we identified the top three published markers with RegScan P-values $<10^{-50}$ [1]. This shows that RegScan can function as a general GWAS tool. In the second test, we studied combinatorial traits involving plasma iron levels. For these combinatorial traits, we detected 20 markers that associated with trait ratios involving iron concentration at a genome-wide significance level. The candidates were ranked by RegScan based on a score that takes into account the P-values of the corresponding single traits. This score allows filtering of the hits based on statistical significance and is called here the Reliability Score, RS (Supplementary Data). The RS indicates how much stronger the association between the combinatorial trait and a given marker is compared with the corresponding single traits. The RS is calculated as $\mathrm{RS} = P_{\mathrm{smaller\_single}} / P_{\mathrm{combinatorial}}$, where $P_{\mathrm{smaller\_single}}$ is the P-value of association for the single trait that yielded the lower P-value of the two single traits and $P_{\mathrm{combinatorial}}$ is the P-value of association for the combinatorial trait. As an example, if $P(A) = 10^{-6}$, $P(B) = 10^{-2}$ and $P(A/B) = 10^{-10}$, then $\mathrm{RS}(A/B) = 10^{-6}/10^{-10} = 10^{4}$. We suggest that this simple score is effective in identifying the associations that are likely to be biologically more relevant. RegScan has functions to compute the RS and filter the results based on its value. To validate the use of the RS in studying the combinatorial traits, we conducted simulations that represent theoretical 'real-life situations'. 
We generated data for the scenario where the genetic marker affects a phenotype ratio as well as the scenario where the marker has linear effects on one or both phenotypes, but not on their ratio. The study indicated that the RS is able to identify the correct model in >95% of cases (details in Supplementary Data). A sample GWAS with the above data set was performed for blood serum urate concentration and all trait ratios that contained urate concentration. The relatively small number of individuals used in this example was sufficient to identify a known region in chromosome 4 [12]. However, using all trait ratios exposed at least four additional genomic regions (data not shown) that could be involved in urate metabolism in combination with some of the other 43 traits tested (Figure 3). This example highlights the power of using combinatorial traits by RegScan in detecting new candidate genomic regions for trait associations [5, 13].
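The Reliability Score defined above is a single division, and the worked example from the text can be reproduced with a few lines of Python. The function names, the tuple layout of a hit and the threshold value are hypothetical; RegScan's own RS functions and their options are described in its user guide, not here.

```python
def reliability_score(p_a, p_b, p_ratio):
    """RS = smaller single-trait P-value divided by the ratio-trait P-value."""
    return min(p_a, p_b) / p_ratio


def filter_by_rs(hits, min_rs):
    """Keep hits (marker, trait, P(A), P(B), P(A/B)) with RS >= min_rs."""
    return [h for h in hits if reliability_score(h[2], h[3], h[4]) >= min_rs]


# Example from the text: P(A) = 1e-6, P(B) = 1e-2, P(A/B) = 1e-10.
rs = reliability_score(1e-6, 1e-2, 1e-10)  # approximately 1e4

hits = [
    ("m1", "A/B", 1e-6, 1e-2, 1e-10),  # ratio much stronger than either trait
    ("m2", "A/C", 1e-9, 1e-3, 1e-8),   # single trait A already explains the hit
]
kept = filter_by_rs(hits, min_rs=10.0)
```

An RS well above 1 means the ratio associates more strongly than either constituent trait; an RS below 1, as for the second hit, suggests the association is driven by a single trait. For P-values small enough to underflow to zero in floating point, the same score could be computed as a difference of log10 P-values instead.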

Figure 3:

Manhattan plot showing the chromosome regions associated with blood plasma urate concentration (A), and with combinatorial traits involving urate concentration (B) as determined by RegScan.

QUALITATIVE COMPARISON WITH THE OTHER TOOLS

The main advantages of RegScan include the following:

Speed. Carrying out simple linear regression analysis as fast as possible is the primary goal of RegScan, as it opens doors to studying large data sets. Unlike the reference tools tested, RegScan allows an automatic analysis of any number of traits at the same time. The user does not have to specify individual phenotypes to be analyzed. All phenotypes present in the input files are automatically analyzed against all markers present. This avoids the need to treat each trait separately and leads to major computational speedup.

Easy creation of combinatorial traits. RegScan can conveniently convert phenotype files into combinatorial phenotype files.

Two types of output files. In addition to the standard output listing statistical parameters for each marker, a summary information file can be created that finds the strongest-associating trait from among all traits tested for each marker. This is done based on (a) the statistical parameter selected by the user, or (b) the maximal effect size.

Post-run data analysis for combinatorial traits. Several functions allow studying the association analysis results. We introduced a simple, yet useful, method (RS) for identifying associations with combinatorial traits based on the statistical parameters of the corresponding single traits.

The main disadvantage of RegScan is the absence of higher-level analytical functions in addition to the allele frequency association analysis. RegScan also relies on external R scripts for data adjustments.

CONCLUSIONS

RegScan’s main focus is to find marker–trait associations in metabolomics in the context of combinatorial traits. Another predicted use is marker associations with gene expression. RegScan addresses the main obstacle in these studies—the heavy computational burden to find the main associations. RegScan is currently lacking several common analytical options. Our intent is to develop RegScan into a full-capability GWAS tool based on the user feedback.

SUPPLEMENTARY DATA

Supplementary data are available online at http://bib.oxfordjournals.org/.

KEY POINTS

Depending on the data size, RegScan performs association analysis between markers and continuous traits ten to several hundred times faster than the other GWAS tools. Analyses that used to take weeks or months now take days.

RegScan can automatically generate and analyze combinatorial traits; it can analyze any number of traits in one go.

RegScan provides functions for filtering and additional analysis of the association analysis results; it introduces the concept of RS to study the combinatorial traits.

RegScan is designed for metabolomics GWAS but is not limited to that.

9 in total

1. Genome-wide association studies and systems biology: together at last.

Authors: Mika Ala-Korpela; Antti J Kangas; Michael Inouye
Journal: Trends Genet Date: 2011-10-20 Impact factor: 11.639

2. Methods for testing association between uncertain genotypes and quantitative traits.

Authors: Zoltán Kutalik; Toby Johnson; Murielle Bochud; Vincent Mooser; Peter Vollenweider; Gérard Waeber; Dawn Waterworth; Jacques S Beckmann; Sven Bergmann
Journal: Biostatistics Date: 2010-06-11 Impact factor: 5.899

3. Genome-wide association study identifies multiple loci influencing human serum metabolite levels.

Authors: Johannes Kettunen; Taru Tukiainen; Antti-Pekka Sarin; Alfredo Ortega-Alonso; Emmi Tikkanen; Leo-Pekka Lyytikäinen; Antti J Kangas; Pasi Soininen; Peter Würtz; Kaisa Silander; Danielle M Dick; Richard J Rose; Markku J Savolainen; Jorma Viikari; Mika Kähönen; Terho Lehtimäki; Kirsi H Pietiläinen; Michael Inouye; Mark I McCarthy; Antti Jula; Johan Eriksson; Olli T Raitakari; Veikko Salomaa; Jaakko Kaprio; Marjo-Riitta Järvelin; Leena Peltonen; Markus Perola; Nelson B Freimer; Mika Ala-Korpela; Aarno Palotie; Samuli Ripatti
Journal: Nat Genet Date: 2012-01-29 Impact factor: 38.330

4. A new multipoint method for genome-wide association studies by imputation of genotypes.

Authors: Jonathan Marchini; Bryan Howie; Simon Myers; Gil McVean; Peter Donnelly
Journal: Nat Genet Date: 2007-06-17 Impact factor: 38.330

5. Human metabolic individuality in biomedical and pharmaceutical research.

Authors: So-Youn Shin; Ann-Kristin Petersen; Nicole Soranzo; Christian Gieger; Karsten Suhre; Robert P Mohney; David Meredith; Brigitte Wägele; Elisabeth Altmaier; Panos Deloukas; Jeanette Erdmann; Elin Grundberg; Christopher J Hammond; Martin Hrabé de Angelis; Gabi Kastenmüller; Anna Köttgen; Florian Kronenberg; Massimo Mangino; Christa Meisinger; Thomas Meitinger; Hans-Werner Mewes; Michael V Milburn; Cornelia Prehn; Johannes Raffler; Janina S Ried; Werner Römisch-Margl; Nilesh J Samani; Kerrin S Small; H-Erich Wichmann; Guangju Zhai; Thomas Illig; Tim D Spector; Jerzy Adamski
Journal: Nature Date: 2011-08-31 Impact factor: 49.962

6. On the allelic spectrum of human disease.

Authors: D E Reich; E S Lander
Journal: Trends Genet Date: 2001-09 Impact factor: 11.639

7. A genome-wide perspective of genetic variation in human metabolism.

Authors: Thomas Illig; Christian Gieger; Guangju Zhai; Werner Römisch-Margl; Rui Wang-Sattler; Cornelia Prehn; Elisabeth Altmaier; Gabi Kastenmüller; Bernet S Kato; Hans-Werner Mewes; Thomas Meitinger; Martin Hrabé de Angelis; Florian Kronenberg; Nicole Soranzo; H-Erich Wichmann; Tim D Spector; Jerzy Adamski; Karsten Suhre
Journal: Nat Genet Date: 2009-12-27 Impact factor: 38.330

8. Genetics meets metabolomics: a genome-wide association study of metabolite profiles in human serum.

Authors: Christian Gieger; Ludwig Geistlinger; Elisabeth Altmaier; Martin Hrabé de Angelis; Florian Kronenberg; Thomas Meitinger; Hans-Werner Mewes; H-Erich Wichmann; Klaus M Weinberger; Jerzy Adamski; Thomas Illig; Karsten Suhre
Journal: PLoS Genet Date: 2008-11-28 Impact factor: 5.917

9. Genome-wide association analyses identify 18 new loci associated with serum urate concentrations.

Authors: Anna Köttgen; Eva Albrecht; Alexander Teumer; Veronique Vitart; Jan Krumsiek; Claudia Hundertmark; Giorgio Pistis; Daniela Ruggiero; Conall M O'Seaghdha; Toomas Haller; Qiong Yang; Toshiko Tanaka; Andrew D Johnson; Zoltán Kutalik; Albert V Smith; Julia Shi; Maksim Struchalin; Rita P S Middelberg; Morris J Brown; Angelo L Gaffo; Nicola Pirastu; Guo Li; Caroline Hayward; Tatijana Zemunik; Jennifer Huffman; Loic Yengo; Jing Hua Zhao; Ayse Demirkan; Mary F Feitosa; Xuan Liu; Giovanni Malerba; Lorna M Lopez; Pim van der Harst; Xinzhong Li; Marcus E Kleber; Andrew A Hicks; Ilja M Nolte; Asa Johansson; Federico Murgia; Sarah H Wild; Stephan J L Bakker; John F Peden; Abbas Dehghan; Maristella Steri; Albert Tenesa; Vasiliki Lagou; Perttu Salo; Massimo Mangino; Lynda M Rose; Terho Lehtimäki; Owen M Woodward; Yukinori Okada; Adrienne Tin; Christian Müller; Christopher Oldmeadow; Margus Putku; Darina Czamara; Peter Kraft; Laura Frogheri; Gian Andri Thun; Anne Grotevendt; Gauti Kjartan Gislason; Tamara B Harris; Lenore J Launer; Patrick McArdle; Alan R Shuldiner; Eric Boerwinkle; Josef Coresh; Helena Schmidt; Michael Schallert; Nicholas G Martin; Grant W Montgomery; Michiaki Kubo; Yusuke Nakamura; Toshihiro Tanaka; Patricia B Munroe; Nilesh J Samani; David R Jacobs; Kiang Liu; Pio D'Adamo; Sheila Ulivi; Jerome I Rotter; Bruce M Psaty; Peter Vollenweider; Gerard Waeber; Susan Campbell; Olivier Devuyst; Pau Navarro; Ivana Kolcic; Nicholas Hastie; Beverley Balkau; Philippe Froguel; Tõnu Esko; Andres Salumets; Kay Tee Khaw; Claudia Langenberg; Nicholas J Wareham; Aaron Isaacs; Aldi Kraja; Qunyuan Zhang; Philipp S Wild; Rodney J Scott; Elizabeth G Holliday; Elin Org; Margus Viigimaa; Stefania Bandinelli; Jeffrey E Metter; Antonio Lupo; Elisabetta Trabetti; Rossella Sorice; Angela Döring; Eva Lattka; Konstantin Strauch; Fabian Theis; Melanie Waldenberger; H-Erich Wichmann; Gail Davies; Alan J Gow; Marcel Bruinenberg; Ronald P Stolk; Jaspal S Kooner; Weihua Zhang; Bernhard R 
Winkelmann; Bernhard O Boehm; Susanne Lucae; Brenda W Penninx; Johannes H Smit; Gary Curhan; Poorva Mudgal; Robert M Plenge; Laura Portas; Ivana Persico; Mirna Kirin; James F Wilson; Irene Mateo Leach; Wiek H van Gilst; Anuj Goel; Halit Ongen; Albert Hofman; Fernando Rivadeneira; Andre G Uitterlinden; Medea Imboden; Arnold von Eckardstein; Francesco Cucca; Ramaiah Nagaraja; Maria Grazia Piras; Matthias Nauck; Claudia Schurmann; Kathrin Budde; Florian Ernst; Susan M Farrington; Evropi Theodoratou; Inga Prokopenko; Michael Stumvoll; Antti Jula; Markus Perola; Veikko Salomaa; So-Youn Shin; Tim D Spector; Cinzia Sala; Paul M Ridker; Mika Kähönen; Jorma Viikari; Christian Hengstenberg; Christopher P Nelson; James F Meschia; Michael A Nalls; Pankaj Sharma; Andrew B Singleton; Naoyuki Kamatani; Tanja Zeller; Michel Burnier; John Attia; Maris Laan; Norman Klopp; Hans L Hillege; Stefan Kloiber; Hyon Choi; Mario Pirastu; Silvia Tore; Nicole M Probst-Hensch; Henry Völzke; Vilmundur Gudnason; Afshin Parsa; Reinhold Schmidt; John B Whitfield; Myriam Fornage; Paolo Gasparini; David S Siscovick; Ozren Polašek; Harry Campbell; Igor Rudan; Nabila Bouatia-Naji; Andres Metspalu; Ruth J F Loos; Cornelia M van Duijn; Ingrid B Borecki; Luigi Ferrucci; Giovanni Gambaro; Ian J Deary; Bruce H R Wolffenbuttel; John C Chambers; Winfried März; Peter P Pramstaller; Harold Snieder; Ulf Gyllensten; Alan F Wright; Gerjan Navis; Hugh Watkins; Jacqueline C M Witteman; Serena Sanna; Sabine Schipf; Malcolm G Dunlop; Anke Tönjes; Samuli Ripatti; Nicole Soranzo; Daniela Toniolo; Daniel I Chasman; Olli Raitakari; W H Linda Kao; Marina Ciullo; Caroline S Fox; Mark Caulfield; Murielle Bochud; Christian Gieger
Journal: Nat Genet Date: 2012-12-23 Impact factor: 38.330
