| Literature DB >> 20336072 |
Susanna Atwell1, Yu S Huang, Bjarni J Vilhjálmsson, Glenda Willems, Matthew Horton, Yan Li, Dazhe Meng, Alexander Platt, Aaron M Tarone, Tina T Hu, Rong Jiang, N Wayan Muliyati, Xu Zhang, Muhammad Ali Amer, Ivan Baxter, Benjamin Brachi, Joanne Chory, Caroline Dean, Marilyne Debieu, Juliette de Meaux, Joseph R Ecker, Nathalie Faure, Joel M Kniskern, Jonathan D G Jones, Todd Michael, Adnane Nemri, Fabrice Roux, David E Salt, Chunlao Tang, Marco Todesco, M Brian Traw, Detlef Weigel, Paul Marjoram, Justin O Borevitz, Joy Bergelson, Magnus Nordborg.
Abstract
Although pioneered by human geneticists as a potential solution to the challenging problem of finding the genetic basis of common human diseases, genome-wide association (GWA) studies have, owing to advances in genotyping and sequencing technology, become an obvious general approach for studying the genetics of natural variation and traits of agricultural importance. They are particularly useful when inbred lines are available, because once these lines have been genotyped they can be phenotyped multiple times, making it possible (as well as extremely cost effective) to study many different traits in many different environments, while replicating the phenotypic measurements to reduce environmental noise. Here we demonstrate the power of this approach by carrying out a GWA study of 107 phenotypes in Arabidopsis thaliana, a widely distributed, predominantly self-fertilizing model plant known to harbour considerable genetic variation for many adaptively important traits. Our results are dramatically different from those of human GWA studies, in that we identify many common alleles of major effect, but they are also, in many cases, harder to interpret because confounding by complex genetics and population structure make it difficult to distinguish true associations from false. However, a-priori candidates are significantly over-represented among these associations as well, making many of them excellent candidates for follow-up experiments. Our study demonstrates the feasibility of GWA studies in A. thaliana and suggests that the approach will be appropriate for many other organisms.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20336072 PMCID: PMC3023908 DOI: 10.1038/nature08800
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 49.962
Figure 1The number of associations identified using different p-value thresholds for each phenotype
For each phenotype, the numbers of distinct peaks of association significant at nominal p-value thresholds of 10−4, 10−5,…, 10−9 are shown. The number of SNPs (out of 250,000) that would be expected to exceed each threshold is shown for comparison. a, No correction for population structure (non-parametric Wilcoxon Test). b, Correction for population structure (parametric mixed model [EMMA]).
Figure 2GWA analysis of hypersensitive response to the bacterial elicitor AvrRpm1
a, Genome-wide p-values from Fisher’s Exact Test. The horizontal dashed line corresponds to a nominal 5% significance-threshold with Bonferroni-correction for 250,000 tests. b, Magnification of the genomic region surrounding RPM1, the position (and extent) of which is indicated by the vertical blue line.
Figure 3Candidate SNPs are over-represented among strong associations
GWA analysis of the FT10 phenotype: negative log p-values from the Wilcoxon test are plotted against those from EMMA. Points corresponding to SNPs within 20 kb of a candidate gene are shown in red; the rest are shown in blue. The enrichment of the former over the latter in different parts of the distribution is shown.
Figure 4Association with FLC expression at the top of chromosome 4 near FRI
The p-values are from EMMA; the position of FRI is indicated by a vertical yellow line. a, Single-SNP tests. b, Col-allele of FRI (blue dot) is added as co-factor in the model. c, Ler-allele of FRI (red dot) is added as co-factor in the model. d, Both alleles added as co-factors in the model.