| Literature DB >> 29059333 |
Matteo Togninalli1,2, Ümit Seren3, Dazhe Meng3,4, Joffrey Fitz5, Magnus Nordborg3, Detlef Weigel5, Karsten Borgwardt1,2, Arthur Korte6, Dominik G Grimm1,2.
Abstract
The abundance of high-quality genotype and phenotype data for the model organism Arabidopsis thaliana enables scientists to study the genetic architecture of many complex traits at an unprecedented level of detail using genome-wide association studies (GWAS). GWAS have been a great success in A. thaliana and many SNP-trait associations have been published. With the AraGWAS Catalog (https://aragwas.1001genomes.org) we provide a publicly available, manually curated and standardized GWAS catalog for all publicly available phenotypes from the central A. thaliana phenotype repository, AraPheno. All GWAS have been recomputed on the latest imputed genotype release of the 1001 Genomes Consortium using a standardized GWAS pipeline to ensure comparability between results. The catalog includes currently 167 phenotypes and more than 222 000 SNP-trait associations with P < 10-4, of which 3887 are significantly associated using permutation-based thresholds. The AraGWAS Catalog can be accessed via a modern web-interface and provides various features to easily access, download and visualize the results and summary statistics across GWAS.Entities:
Mesh:
Year: 2018 PMID: 29059333 PMCID: PMC5753280 DOI: 10.1093/nar/gkx954
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
AraGWAS Catalog data content and summary statistics as of 15 September 2017
| Data content | Data statistics | |
|---|---|---|
| General statistics | ||
| Studies | 167 | |
| Sig. SNP-Trait Associations at | 222 983 (1 197 588) | |
| Sig. SNP-Trait Associations at Bonferroni threshold | 9527 (133 017) | |
| Sig. SNP-Trait Associations at permutation-based threshold | 3887 (28 003) | |
| Top 10 genes with most sig. associations | ||
|
|
|
|
| AT4G02850 | Phenazine biosynthesis PhzC/PhzF family protein | 176 |
| AT4G02930 | GTP binding Elongation factor Tu family protein | 172 |
| AT4G02830 | hypothetical protein | 97 |
| AT4G02790 | GTP-binding family protein | 87 |
| AT1G12230 | Aldolase superfamily protein | 84 |
| AT4G02800 | GRIP/coiled-coil protein | 73 |
| AT4G02920 | hypothetical protein | 70 |
| AT1G12210 | RPS5-like 1 | 69 |
| AT5G63200 | tetratricopeptide repeat (TPR)-containing protein | 62 |
| AT4G02860 | Phenazine biosynthesis PhzC/PhzF protein | 57 |
Numbers of associated hits are filtered by minor allele count (MAC) > 5. Numbers in parenthesis include all markers without filtering for allele count. The number of associated loci per gene are extracted from the ‘Top Genes’ table and are based on permutation-based thresholds and MAC > 5 (https://aragwas.1001genomes.org/#/top-genes).
Figure 1.The HitMap is a detailed GWAS heatmap illustrating the 25 top associated hits per chromosomal region and phenotype. Each column represents a chromosome and each row a phenotype/study. Regions in each chromosome are created using a sliding window with a 250 kbp size. Each dot illustrates the top associated hit within the focal region. The color (yellow to red) indicates of the strength of the association.
Figure 2.Screenshot of the gene centric view. The Manhattan plot shows all associated variants for the focal region and all available traits. Further, gene locations are displayed at the bottom with interactive elements. Detailed gene descriptions are available when hovering with the cursor over a certain gene (https://aragwas.1001genomes.org/#/gene/AT2G22540).
Figure 3.Screenshot of detailed study view. (A) Brief description about study related information with links to the phenotype and publication. (B) Various summary statistics about SNP type, impact, annotation and MAF. (C) Sorted list of associated markers and (D) various filters to narrow down the list of associated hits.