Literature DB >> 29059333

The AraGWAS Catalog: a curated and standardized Arabidopsis thaliana GWAS catalog.

Matteo Togninalli^1,2, Ümit Seren³, Dazhe Meng^3,4, Joffrey Fitz⁵, Magnus Nordborg³, Detlef Weigel⁵, Karsten Borgwardt^1,2, Arthur Korte⁶, Dominik G Grimm^1,2.

Abstract

The abundance of high-quality genotype and phenotype data for the model organism Arabidopsis thaliana enables scientists to study the genetic architecture of many complex traits at an unprecedented level of detail using genome-wide association studies (GWAS). GWAS have been a great success in A. thaliana and many SNP-trait associations have been published. With the AraGWAS Catalog (https://aragwas.1001genomes.org) we provide a publicly available, manually curated and standardized GWAS catalog for all publicly available phenotypes from the central A. thaliana phenotype repository, AraPheno. All GWAS have been recomputed on the latest imputed genotype release of the 1001 Genomes Consortium using a standardized GWAS pipeline to ensure comparability between results. The catalog includes currently 167 phenotypes and more than 222 000 SNP-trait associations with P < 10-4, of which 3887 are significantly associated using permutation-based thresholds. The AraGWAS Catalog can be accessed via a modern web-interface and provides various features to easily access, download and visualize the results and summary statistics across GWAS.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2018 PMID： 29059333 PMCID： PMC5753280 DOI： 10.1093/nar/gkx954

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

INTRODUCTION

Genome-wide association studies (GWAS) have become an indispensable tool for elucidating genotype–phenotype relationships. GWAS correlates genomic markers with phenotypic differences in a population and reports the likelihood of the association (for a more detailed description see e.g. (1)). A plethora of significant associations have been reported for many traits in different organisms, including rice (2), tomatoes (3), fruit flies (4), mice (5), humans (6,7) and Arabidopsis (8). The latter, Arabidopsis thaliana, is the prime model system in plant biology (9) and an excellent species for conducting genome-wide association studies: many different phenotypes ranging from flowering time to ion concentrations or disease resistance have been collected (8,10–13). Arabidopsis thaliana is a natural inbred, which means that all lines have nearly complete homozygous genomes. This enables the collection of many different phenotypes of genetically identical plants, as well as a re-analysis of the phenotypes with updated genotypic data. To structure this wealth of phenotypic data, we recently published AraPheno, a database for phenotypes collected in A. thaliana (14). Here, we take the next important step by creating the AraGWAS catalog, a central resource for all genetic associations found through GWAS in A. thaliana. Its purpose is to report the respective GWAS results in a comprehensible way and make them accessible by the community, similar to the NHGRI-EBI GWAS Catalog in humans (15,16). GWAS in A. thaliana have been routinely performed using 214 000 markers generated with hybridization technology (17). Nowadays, full genome information for over 1000 different natural inbred lines is available (13). To enable a comparative analysis between GWAS results, we re-calculated all GWAS for the available phenotypes from AraPheno using a best-practice pipeline and the latest available version of the genotype data. To declare a threshold for significant association, we used a permutation-based threshold that takes the phenotypic distribution into account and differs for different phenotypes. The outcome is a catalog of standardized GWAS results for all A. thaliana phenotypes. To summarize, A. thaliana provides one of the best and most extensive collections of population scale phenotype and genotype data, which facilitates the use of GWAS. To make the rapidly growing amount of GWAS results available for the community and to prevent the GWAS results from being fragmented across different websites, we created the AraGWAS Catalog. This resource will not only be an assortment of different GWAS results, but ensures that all results have been calculated with a standardized pipeline using the identical release of genomic data. This framework will easily promote comparative analyses across different phenotypes.

DATABASE CONTENT AND USAGE

The AraGWAS Catalog is a publicly available and manually curated database for standardized GWAS results for the model organism A. thaliana. The primary purpose of the AraGWAS Catalog is to provide a comprehensive collection and overview of all SNP-trait associations in A. thaliana. GWAS stored in the AraGWAS Catalog have been recomputed using a standardized GWAS pipeline (see section ‘Standardized GWAS Pipeline’) using fully imputed genotype data for 2029 A. thaliana lines from the 1001 Genomes Consortium (13) and all publicly available phenotypes from the central A. thaliana phenotype repository, AraPheno (14). The AraGWAS Catalog will be updated regularly, either when new phenotypes are submitted to the AraPheno database or when new improved genotypes are released (13). The catalog will only contain recomputed associations based on our standardized GWAS pipeline to ensure maximum comparability between different traits and experiments. Table 1 provides a detailed overview of the data stored in the catalog. To account for different phenotypic distributions, permutation-based significance thresholds are computed and reported for every phenotype in the catalog.

Table 1.

AraGWAS Catalog data content and summary statistics as of 15 September 2017

Data content		Data statistics
General statistics
Studies		167
Sig. SNP-Trait Associations at P < 10⁻⁴		222 983 (1 197 588)
Sig. SNP-Trait Associations at Bonferroni threshold		9527 (133 017)
Sig. SNP-Trait Associations at permutation-based threshold		3887 (28 003)
Top 10 genes with most sig. associations
Gene	Short description	N hits
AT4G02850	Phenazine biosynthesis PhzC/PhzF family protein	176
AT4G02930	GTP binding Elongation factor Tu family protein	172
AT4G02830	hypothetical protein	97
AT4G02790	GTP-binding family protein	87
AT1G12230	Aldolase superfamily protein	84
AT4G02800	GRIP/coiled-coil protein	73
AT4G02920	hypothetical protein	70
AT1G12210	RPS5-like 1	69
AT5G63200	tetratricopeptide repeat (TPR)-containing protein	62
AT4G02860	Phenazine biosynthesis PhzC/PhzF protein	57

Numbers of associated hits are filtered by minor allele count (MAC) > 5. Numbers in parenthesis include all markers without filtering for allele count. The number of associated loci per gene are extracted from the ‘Top Genes’ table and are based on permutation-based thresholds and MAC > 5 (https://aragwas.1001genomes.org/#/top-genes). Various data-centric views are implemented so that users can obtain details about the data stored in the catalog. The ‘GWAS Hitmap’ provides a high level overview of the highest associated hits of different regions along one of the five chromosomes (Figure 1). The columns of the interactive hitmap illustrate the five chromosome of A. thaliana, while each of the rows is an individual study/phenotype. Each single dot of the 25 dots per chromosome illustrates the top associated hit within the focal region of the chromosome. Regions are created using a sliding window with a 250 kbp size. The color of the dot highlights the strength of the association, where red is more strongly associated than yellow. The histogram at the top of each chromosome column shows the genome position based distribution of the top associated hits per region for the focal chromosome. This high level view should help users to quickly gain an overview about the associated hits and their distribution within and between chromosomes. We plan to enhance this view with more advanced interactions, so that users can order phenotypes by ontologies or correlations. This might help to detect interesting patterns across studies and chromosomes and thus help identify pleiotropic effects.

Figure 1.

The HitMap is a detailed GWAS heatmap illustrating the 25 top associated hits per chromosomal region and phenotype. Each column represents a chromosome and each row a phenotype/study. Regions in each chromosome are created using a sliding window with a 250 kbp size. Each dot illustrates the top associated hit within the focal region. The color (yellow to red) indicates of the strength of the association. In the ‘Top Associations’ view users can obtain a list of all associated hits (P < 10−4) across all GWA studies and traits stored in the catalog. Associations in that table are linked to additional information, such as variant annotations generated using SnpEff (18). Different filters are provided to filter the list by different minor allele frequencies (MAF), minor allele counts (MAC), chromosomes or variant annotations (e.g. non-synonymous variants). Each entry in the ‘Top Association’ table contains links to detailed views about the study, phenotype or the gene the associated variant was found in. The ‘Top Genes’ view summaries all associated hits detected in genes (or in close proximity to genes) and groups the results by gene name. Additional information about the genes can be obtained by clicking on the gene name, which redirects the user to a gene centric view. Dynamic visualizations, such as filtered Manhattan plots and gene location plots (Figure 2) help users to explore the region around the focal gene and provide detailed information about annotations from SnpEff (18) or gene descriptions (extracted from the TAIR10 GFF3 file from the TAIR resource http://www.arabidopsis.org/download). Further, the gene centric view provides a set of different filters to narrow down the information about variants found within or in close proximity to the focal gene.

Figure 2.

Screenshot of the gene centric view. The Manhattan plot shows all associated variants for the focal region and all available traits. Further, gene locations are displayed at the bottom with interactive elements. Detailed gene descriptions are available when hovering with the cursor over a certain gene (https://aragwas.1001genomes.org/#/gene/AT2G22540). The ‘GWA Studies’ view provides a table of all available GWA studies, including phenotype descriptions and phenotype ontologies (https://bioportal.bioontology.org/ontologies/PTO?p=summary), sorted according to the number of significantly associated hits above the permutation-based threshold. Phenotype ontologies provide a controlled vocabulary to describe phenotypic traits in plants and enables functional grouping of different phenotypes. Phenotypic and trait ontology information is automatically fetched from the AraPheno database using the REST API of AraPheno (14). The GWA study table is sortable and provides a link to the detailed ‘Study’ view. More detailed information about a specific GWA study can be obtained by clicking on the study name. The detailed study view summaries general information about the performed GWAS (Figure 3A), such as the phenotype used for the study (including a link to the original publication and to AraPheno, where the phenotype data and metadata can be found), the genotype version, number of samples, as well as summary statistics (Figure 3B) and the associated hits (Figure 3C). Again, different filtering options are implement to filter the list of associated hits (Figure 3D). Interactive Manhattan plots are rendered in a separate tab of the GWAS view. Users who click on a specific point in the Manhattan plot will be redirected to the ‘Detailed Gene’ view described above.

Figure 3.

Screenshot of detailed study view. (A) Brief description about study related information with links to the phenotype and publication. (B) Various summary statistics about SNP type, impact, annotation and MAF. (C) Sorted list of associated markers and (D) various filters to narrow down the list of associated hits. The AraGWAS Catalog provides access to a detailed FAQ, tutorials and guided tours that should help novice users to familiarize themselves. AraGWAS is hosted under the 1001 Genomes Organization (http://1001genomes.org), and its framework is available as open source (see ‘Implementation’ section).

STANDARDIZED GWAS PIPELINE

GWAS is being performed on all phenotypes in the AraPheno database separately. For the analysis, the untransformed mean values for each genotype have been used. For the genotype data, the latest version from the 1001 genomes project have been used. Combining these data with existing SNP chip data (17) we obtained a SNP-Matrix for 2029 accessions on 10 709 466 segregating markers. Missing data have been imputed using BEAGLE v.3.0 (19) with standard parameters. GWAS were performed using a mixed model correcting for population structure in a two-step procedure: first all markers were analyzed using a fast approximation of the mixed model (EMMAX, 20). Second, the top 100 markers where re-analyzed using the full model (EMMA, 21). The kinship matrix has been pre-calculated using all accessions, removing alleles with a minor allele frequency below 5%. The R scripts used to perform GWAS are available at https://github.com/arthurkorte/GWAS. The genotype data are available at www.1001genomes.org. To obtain the permutation-based threshold for individual phenotypes, the phenotypic data have been permuted to keep the data structure, but loose the genotype–phenotype connection. We reported the 5% permutation-based threshold per trait. This procedure provides a more realistic significance threshold which depends on the phenotypic distribution. It is noteworthy, that this procedure will lead to different significant thresholds for the different traits, where the permutation-based threshold is sometimes more and sometimes less stringent than the classical Bonferroni threshold. An example of the first, where the permutation-based threshold leads to fewer associations, is the phenotype ‘YEL’ (https://aragwas.1001genomes.org/#/study/28, published in (8)). Here, the yellowing of the leaves—as an indicator of chlorosis—has been measured. The phenotypic values are binary and zero-inflated (https://arapheno.1001genomes.org/phenotype/28/), which is a scenario that is prone to inflated P-values in the analysis. The very low value of the permutation-based threshold reflects this and, despite the inflated P-values, makes it possible to identify significant associations. An example for the latter case, where the permutation-based threshold reports more significant associations than the classical Bonferroni threshold is the analysis of the calcium concentration in the leaves (https://aragwas.1001genomes.org/#/study/51, published in (8)). Here, the phenotypic values are nicely distributed (https://arapheno.1001genomes.org/phenotype/51), and the Bonferroni threshold reports an overly stringent threshold. It is important to take the different thresholds into account if one is interested in a gene-centric analysis across studies (compare the score and significance of markers in Figure 2). In sum, over all studies, the number of significant association under a permutation-based threshold is markedly reduced compared to the number of significant association using Bonferroni threshold (see Table 1). However, the use of our standardized pipeline might lead to false negative associations as well, especially if the original analyses have been performed with different genotype data or different GWAS approaches (for example, transformation of the phenotype or multivariate models). Still, the use of our standardized pipeline enables comparative analyses of the results, which would be problematic otherwise.

IMPLEMENTATION

The frontend and backend of the AraGWAS Catalog are based on modern web-based technologies. The web-application frontend is based on HTML5 and Javascript and is a fully functional single-page application (SPA) relying on the open-source Vue.js framework (https://vuejs.org/) and enriched with data visualization libraries such as the open-source D3.js (https://d3js.org) and google charts (https://developers.google.com/chart/), developed by Google. To enable a more user-friendly interface and a smoother user experience, the Material Design system (https://material.io/), originally developed by Google, was used via a semantic component open-source framework (https://vuetifyjs.com/). The backend consists of two databases linked by Django (https://www.djangoproject.com/), a popular open-source framework based on Python. The two databases in the backend of the AraGWAS Catalog can be accessed through a RESTful API, as described in the REST documentation of the web-application (https://aragwas.1001genomes.org/docs/). The first database stores phenotypes and studies information in a Relational Database Management System (RDBMS). The second database indexes genes and associations and is based on the elasticsearch engine (https://www.elastic.co/), an extremely fast open-source search engine that allows for quick retrieval of a large number of associations. The REST endpoints are accessible thanks to Django REST framework (http://www.django-rest-framework.org/), an open-source toolkit based on Django and to elasticsearch-dsl (https://elasticsearch-dsl.readthedocs.io/), an open-source library for high-level elasticsearch queries from Python. The AraGWAS Catalog is automatically deployed using docker (https://www.docker.com), an open-source and popular software containerization platform coupled with Jenkins (https://jenkins.io/), an open-source automation server that deploys the latest version of the website directly from GitHub. The code for AraGWAS Catalog is open-source and freely available on GitHub (https://github.com/1001genomes/AraGWAS).

ACCESSIBILITY AND CITABILITY OF DATA IN THE ARAGWAS CATALOG

The AraGWAS Catalog provides a Representational State Transfer (REST) web service that allows users to programmatically query and access data from the AraGWAS Catalog. The REST API endpoints are language-independent and can be accessed via URL extensions. In addition, we also support Core API, a robust and intuitive way to interact with the REST endpoints using machine readable schemas (http://www.django-rest-framework.org/tutorial/7-schemas-and-client-libraries/). Using the REST endpoints, users can directly obtain data for their custom offline analyses, such as associated hits for a specific gene or details of a GWA study in JSON format. A rich set of various filters allows users to directly access certain information without the need to download entire datasets. For any users who are interested in accessing the AraGWAS REST endpoints from their custom web-service, we can grant access to our REST API for their domains upon request. A thorough documentation on how to use these endpoints is provided in the online FAQ (https://aragwas.1001genomes.org/#/faq) and the REST documentation (https://aragwas.1001genomes.org/docs/). Some examples of how to use the REST interface are provided below. Example 1: Users can use the following URL to obtain a list of all available studies in JSON format using their custom browser: https://aragwas.1001genomes.org/api/studies/ The same result can also be obtained using the following command line command: $:> curl https://aragwas.1001genomes.org/api/studies/ In addition, the programming language Python can be used: Example 2: To search the AraGWAS Catalog for flowering related studies or phenotypes, users can visit the following URL: https://aragwas.1001genomes.org/api/search/search_results/flowering/ The final parameter can be replaced with any query term, such as the name of gene: https://aragwas.1001genomes.org/api/search/search_results/AT4G02760/ or the PubMed ID of a publication: https://aragwas.1001genomes.org/api/search/search_results/PMC3023908/ Example 3: Several filters are available that can be added to URLs to narrow down the search space. For example, if a user who would like to obtain a list of all significantly non-synonymous associated SNPs for chromosome 1 and 2 can use the following URL: https://aragwas.1001genomes.org/api/associations/?chr=1&chr=2&significant=p&annotation=ns To add filters, users must extend the URL with a ‘?’ followed by the individual filters, which are separated using the ‘&’ sign. In our example we added ‘chr = 1&chr = 2’ to restrict the search space to chromosome 1 and 2. The parameter ‘significant = p’ indicates that we are only interested in significantly associated hits after multiple hypothesis correction using permutation-based thresholds. The final parameter, ‘annotation = ns’, indicates that only non-synonymous SNPs should be returned. All filters and its options can be found in the online FAQ (https://aragwas.1001genomes.org/#/faq). In addition to the REST endpoints, data from the AraGWAS Catalog are also downloadable through the web-interface. Entire study results (i.e. summary statistics) are available in HDF5 format. Filtered lists with associated positions per study or gene can be download in CSV format. Phenotypes used for GWAS in the AraGWAS Catalog are linked to the AraPheno database (14), as well as to the original publication in which the phenotype was published for the first time. This way users can easily find detailed information about the linked phenotype, the original publication, as well as options to download the phenotypic data in various formats directly from AraPheno (see (14)). To enable the citability of individual GWAS results from the AraGWAS Catalog, we provide DOIs for all recomputed GWA studies, similar to what is done in AraPheno (14).

CONCLUSIONS AND FUTURE DIRECTIONS

The AraGWAS Catalog is the first comprehensive, manually curated and standardized database to collect the results of GWAS. This database is not only an important and useful resource for the Arabidopsis community, but sets a standard for other species. The public availability of high-quality genotypes and phenotypes in A. thaliana offers—due to the inbred nature of the plant—a unique opportunity to systematically re-compute and analyze the GWAS results using a best-practice pipeline. It enables researchers to analyze and compare standardized results of GWAS on different or related traits. These so-called horizontal GWAS analyses affords unique opportunities to detect seemingly unrelated functions of a gene due to pleiotropic effects or discover traits with a shared genetic basis. For this purpose, the AraGWAS Catalog provides a selection of different methods to facilitate the exploration of these high dimensional data. The catalog offers a sophisticated and fast search API to query the database and to extract information for specific associations, genes or traits. Interactive visualizations empower the user to easily maneuver the data and uncover interesting patterns. Persistent DOIs support unique referencing of GWAS results from the AraGWAS Catalog. However, one must note that SNP-trait associations in the AraGWAS Catalog might differ from the original publication in which the results have been reported. This has several reasons: SNP-trait associations in the AraGWAS Catalog are based (i) on a standardized ‘best practice’ GWAS pipeline and (ii) on a newly imputed genotype release, combining the latest version from the 1001 Genomes Project (13) and the SNP chip data from (17), leading to a SNP-Matrix for 2029 accessions on 10 709 466 segregating markers. It is important to stretch that this procedure enables more reliable results and comparable downstream analyses and should not be considered as disrespect of previously published works. So far, more than 167 traits and 222 000 SNP-trait associations have been integrated into the AraGWAS Catalog, and it will be updated regularly, either when new phenotypes are submitted to AraPheno (14) or when new genotypes are released by the 1001 Genomes Consortium. To keep track of changes of SNP-trait associations between genotype versions, we plan to integrate a versioning system, by means of which scientists can clearly distinguish between different versions and releases of the AraGWAS Catalog. In the future we plan to extend the catalog with novel methods to facilitate the comparison of GWAS results stored in AraGWAS. Although, this is already possible to some extent in the multi-species cloud platform easyGWAS (20), in the AraGWAS Catalog it will be tailored toward A. thaliana to provide a maximum degree of comparability and to leverage specific A. thaliana resources and information, which might not be available for other species. For AraGWAS, a permutation-based standardized GWAS pipeline has been used to compute univariate associations using a linear mixed model (21,22) on dichotomous and continuous traits. However, recent advances in machine learning make it possible for the first time to also use generalized linear mixed models for dichotomous traits (23), which we plan to include into future releases of AraGWAS. In addition to the results of univariate GWAS, we also plan to include SNP-trait associations from multi-locus GWAS (24–26) and multi-trait GWAS (27,28). Despite the recent advances in these fields, generating these results still is a computationally intensive task, especially if re-computations have to be done on a regular basis (e.g. when new genotypes are released). Furthermore, we would like to extend the AraGWAS catalog in the future to also include GWAS results based on hybrid A. thaliana lines and phenotypes (29), as well as results using other types of structural variants or methylated sites (30).

29 in total

1. Genetic architecture of nonadditive inheritance in Arabidopsis thaliana hybrids.

Authors: Danelle K Seymour; Eunyoung Chae; Dominik G Grimm; Carmen Martín Pizarro; Anette Habring-Müller; François Vasseur; Barbara Rakitsch; Karsten M Borgwardt; Daniel Koenig; Detlef Weigel
Journal: Proc Natl Acad Sci U S A Date: 2016-11-01 Impact factor: 11.205

2. Genome-wide association analysis identifies susceptibility loci for migraine without aura.

Authors: Tobias Freilinger; Verneri Anttila; Boukje de Vries; Rainer Malik; Mikko Kallela; Gisela M Terwindt; Patricia Pozo-Rosich; Bendik Winsvold; Dale R Nyholt; Willebrordus P J van Oosterhout; Ville Artto; Unda Todt; Eija Hämäläinen; Jèssica Fernández-Morales; Mark A Louter; Mari A Kaunisto; Jean Schoenen; Olli Raitakari; Terho Lehtimäki; Marta Vila-Pueyo; Hartmut Göbel; Erich Wichmann; Cèlia Sintas; Andre G Uitterlinden; Albert Hofman; Fernando Rivadeneira; Axel Heinze; Erling Tronvik; Cornelia M van Duijn; Jaakko Kaprio; Bru Cormand; Maija Wessman; Rune R Frants; Thomas Meitinger; Bertram Müller-Myhsok; John-Anker Zwart; Markus Färkkilä; Alfons Macaya; Michel D Ferrari; Christian Kubisch; Aarno Palotie; Martin Dichgans; Arn M J M van den Maagdenberg
Journal: Nat Genet Date: 2012-06-10 Impact factor: 38.330

3. The Drosophila melanogaster Genetic Reference Panel.

Authors: Trudy F C Mackay; Stephen Richards; Eric A Stone; Antonio Barbadilla; Julien F Ayroles; Dianhui Zhu; Sònia Casillas; Yi Han; Michael M Magwire; Julie M Cridland; Mark F Richardson; Robert R H Anholt; Maite Barrón; Crystal Bess; Kerstin Petra Blankenburg; Mary Anna Carbone; David Castellano; Lesley Chaboub; Laura Duncan; Zeke Harris; Mehwish Javaid; Joy Christina Jayaseelan; Shalini N Jhangiani; Katherine W Jordan; Fremiet Lara; Faye Lawrence; Sandra L Lee; Pablo Librado; Raquel S Linheiro; Richard F Lyman; Aaron J Mackey; Mala Munidasa; Donna Marie Muzny; Lynne Nazareth; Irene Newsham; Lora Perales; Ling-Ling Pu; Carson Qu; Miquel Ràmia; Jeffrey G Reid; Stephanie M Rollmann; Julio Rozas; Nehad Saada; Lavanya Turlapati; Kim C Worley; Yuan-Qing Wu; Akihiko Yamamoto; Yiming Zhu; Casey M Bergman; Kevin R Thornton; David Mittelman; Richard A Gibbs
Journal: Nature Date: 2012-02-08 Impact factor: 49.962

4. An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations.

Authors: Vincent Segura; Bjarni J Vilhjálmsson; Alexander Platt; Arthur Korte; Ümit Seren; Quan Long; Magnus Nordborg
Journal: Nat Genet Date: 2012-06-17 Impact factor: 38.330

5. Epigenomic Diversity in a Global Collection of Arabidopsis thaliana Accessions.

Authors: Taiji Kawakatsu; Shao-Shan Carol Huang; Florian Jupe; Eriko Sasaki; Robert J Schmitz; Mark A Urich; Rosa Castanon; Joseph R Nery; Cesar Barragan; Yupeng He; Huaming Chen; Manu Dubin; Cheng-Ruei Lee; Congmao Wang; Felix Bemm; Claude Becker; Ryan O'Neil; Ronan C O'Malley; Danjuma X Quarless; Nicholas J Schork; Detlef Weigel; Magnus Nordborg; Joseph R Ecker
Journal: Cell Date: 2016-07-14 Impact factor: 66.850

6. Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines.

Authors: Susanna Atwell; Yu S Huang; Bjarni J Vilhjálmsson; Glenda Willems; Matthew Horton; Yan Li; Dazhe Meng; Alexander Platt; Aaron M Tarone; Tina T Hu; Rong Jiang; N Wayan Muliyati; Xu Zhang; Muhammad Ali Amer; Ivan Baxter; Benjamin Brachi; Joanne Chory; Caroline Dean; Marilyne Debieu; Juliette de Meaux; Joseph R Ecker; Nathalie Faure; Joel M Kniskern; Jonathan D G Jones; Todd Michael; Adnane Nemri; Fabrice Roux; David E Salt; Chunlao Tang; Marco Todesco; M Brian Traw; Detlef Weigel; Paul Marjoram; Justin O Borevitz; Joy Bergelson; Magnus Nordborg
Journal: Nature Date: 2010-03-24 Impact factor: 49.962

7. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations.

Authors: Danielle Welter; Jacqueline MacArthur; Joannella Morales; Tony Burdett; Peggy Hall; Heather Junkins; Alan Klemm; Paul Flicek; Teri Manolio; Lucia Hindorff; Helen Parkinson
Journal: Nucleic Acids Res Date: 2013-12-06 Impact factor: 16.971

8. AraPheno: a public database for Arabidopsis thaliana phenotypes.

Authors: Ümit Seren; Dominik Grimm; Joffrey Fitz; Detlef Weigel; Magnus Nordborg; Karsten Borgwardt; Arthur Korte
Journal: Nucleic Acids Res Date: 2016-10-24 Impact factor: 16.971

9. Efficient network-guided multi-locus association mapping with graph cuts.

Authors: Chloé-Agathe Azencott; Dominik Grimm; Mahito Sugiyama; Yoshinobu Kawahara; Karsten M Borgwardt
Journal: Bioinformatics Date: 2013-07-01 Impact factor: 6.937

Review 10. The advantages and limitations of trait analysis with GWAS: a review.

Authors: Arthur Korte; Ashley Farlow
Journal: Plant Methods Date: 2013-07-22 Impact factor: 4.993

26 in total

1. Arabidopsis natural variation in insect egg-induced cell death reveals a role for LECTIN RECEPTOR KINASE-I.1.

Authors: Raphaël Groux; Elia Stahl; Caroline Gouhier-Darimont; Envel Kerdaffrec; Pedro Jimenez-Sandoval; Julia Santiago; Philippe Reymond
Journal: Plant Physiol Date: 2021-02-25 Impact factor: 8.340

2. Neighbor GWAS: incorporating neighbor genotypic identity into genome-wide association studies of field herbivory.

Authors: Yasuhiro Sato; Eiji Yamamoto; Kentaro K Shimizu; Atsushi J Nagano
Journal: Heredity (Edinb) Date: 2021-01-29 Impact factor: 3.821

3. Using precision phenotyping to inform de novo domestication.

Authors: Alisdair R Fernie; Saleh Alseekh; Jie Liu; Jianbing Yan
Journal: Plant Physiol Date: 2021-07-06 Impact factor: 8.340

4. The 2018 Nucleic Acids Research database issue and the online molecular biology database collection.

Authors: Daniel J Rigden; Xosé M Fernández
Journal: Nucleic Acids Res Date: 2018-01-04 Impact factor: 16.971

5. TASUKE+: a web-based platform for exploring genome-wide association studies results and large-scale resequencing data.

Authors: Masahiko Kumagai; Daiki Nishikawa; Yoshihiro Kawahara; Hironobu Wakimoto; Ryutaro Itoh; Norio Tabei; Tsuyoshi Tanaka; Takeshi Itoh
Journal: DNA Res Date: 2019-12-01 Impact factor: 4.458

6. Accurate and adaptive imputation of summary statistics in mixed-ethnicity cohorts.

Authors: Matteo Togninalli; Damian Roqueiro; Karsten M Borgwardt
Journal: Bioinformatics Date: 2018-09-01 Impact factor: 6.937

7. Fluctuating selection on migrant adaptive sodium transporter alleles in coastal Arabidopsis thaliana.

Authors: Silvia Busoms; Pirita Paajanen; Sarah Marburger; Sian Bray; Xin-Yuan Huang; Charlotte Poschenrieder; Levi Yant; David E Salt
Journal: Proc Natl Acad Sci U S A Date: 2018-12-07 Impact factor: 11.205

8. Genome-Wide Identification of Splicing Quantitative Trait Loci (sQTLs) in Diverse Ecotypes of Arabidopsis thaliana.

Authors: Waqas Khokhar; Musa A Hassan; Anireddy S N Reddy; Saurabh Chaudhary; Ibtissam Jabre; Lee J Byrne; Naeem H Syed
Journal: Front Plant Sci Date: 2019-10-03 Impact factor: 5.753

9. AraPheno and the AraGWAS Catalog 2020: a major database update including RNA-Seq and knockout mutation data for Arabidopsis thaliana.

Authors: Matteo Togninalli; Ümit Seren; Jan A Freudenthal; J Grey Monroe; Dazhe Meng; Magnus Nordborg; Detlef Weigel; Karsten Borgwardt; Arthur Korte; Dominik G Grimm
Journal: Nucleic Acids Res Date: 2020-01-08 Impact factor: 16.971

10. Functional variants of DOG1 control seed chilling responses and variation in seasonal life-history strategies in Arabidopsis thaliana.

Authors: Alejandra Martínez-Berdeja; Michelle C Stitzer; Mark A Taylor; Miki Okada; Exequiel Ezcurra; Daniel E Runcie; Johanna Schmitt
Journal: Proc Natl Acad Sci U S A Date: 2020-01-21 Impact factor: 11.205