| Literature DB >> 22701078 |
Xianran Li1, Chengsong Zhu, Cheng-Ting Yeh, Wei Wu, Elizabeth M Takacs, Katherine A Petsch, Feng Tian, Guihua Bai, Edward S Buckler, Gary J Muehlbauer, Marja C P Timmermans, Michael J Scanlon, Patrick S Schnable, Jianming Yu.
Abstract
The complex genomes of many economically important crops present tremendous challenges to understand the genetic control of many quantitative traits with great importance in crop production, adaptation, and evolution. Advances in genomic technology need to be integrated with strategic genetic design and novel perspectives to break new ground. Complementary to individual-gene-targeted research, which remains challenging, a global assessment of the genomic distribution of trait-associated SNPs (TASs) discovered from genome scans of quantitative traits can provide insights into the genetic architecture and contribute to the design of future studies. Here we report the first systematic tabulation of the relative contribution of different genomic regions to quantitative trait variation in maize. We found that TASs were enriched in the nongenic regions, particularly within a 5-kb window upstream of genes, which highlights the importance of polymorphisms regulating gene expression in shaping the natural variation. Consistent with these findings, TASs collectively explained 44%-59% of the total phenotypic variation across maize quantitative traits, and on average, 79% of the explained variation could be attributed to TASs located in genes or within 5 kb upstream of genes, which together comprise only 13% of the genome. Our findings suggest that efficient, cost-effective genome-wide association studies (GWAS) in species with complex genomes can focus on genic and promoter regions.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22701078 PMCID: PMC3514673 DOI: 10.1101/gr.140277.112
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Distribution of maize SNPs across different genomic annotation sets
Figure 1.Genic or nongenic TASs for each dissected QTL region and the proportion of genic SNPs among all tested in the region. Genic region is defined as from the transcription start site to the end of 3′ UTR. With the merged data set, the probability of a TAS being genic or nongenic is significantly independent of whether more genic or nongenic SNPs in the target region are tested (P < 0.05) for all five traits. Each row represents a single QTL region for the indicated trait. Genic and nongenic TASs are equally likely to be identified for QTLs with large or small effects, which are sorted in descending order within each trait. (DTS) Days to silking.
Figure 2.Linkage disequilibrium (LD) among the top five associated SNPs within each target region and the LD among all tested SNPs. A strong LD among top five associated SNPs and a plateau beyond 500–1,000 bp indicate that genomic regions signaled by TASs are well supported.
Figure 3.Distribution of TASs and tested SNPs for five quantitative traits across genomic annotation sets. (A) Nongenic versus genic (genic region is defined as from the transcription start site to the end of 3′ UTR); (B) different annotation sets; and (C) synonymous versus nonsynonymous. Genic plus 5-kb upstream regions comprise only 13% of the maize genome but account for 71% of TASs. Nongenic and promoter 5kb regions are overrepresented among TASs. The nonsynonymous set is underrepresented among TASs. Note that the nongenic region includes promoter 5kb, which in turn includes promoter 1kb, and the genic region includes the untranslated region (UTR), coding region (CDS), and intron. Stars denote that the proportion of TASs from the annotation set significantly differs from that of tested SNPs. (Arrow) Transcription start site.
Figure 4.Phenotypic variation explained by genic and nongenic TASs. (A) Genic TASs versus all other TASs; (B) genic region plus promoter 1kb versus all other TASs; and (C) genic region plus promoter 5kb versus all other TASs. Phenotypic variation explained by all QTLs is shown for comparison. TASs located within genic region plus upstream 5 kb comprise only 13% of the maize genome but explain a large proportion (67%–91%) of the phenotypic variation captured by all TASs.
Figure 5.Contribution from an individual TAS to the phenotypic variation is determined by allele frequency and genetic effect size. (A) Leaf length in millimeters; (B) leaf width in millimeters; (C) leaf angle in degrees; (D) days to anthesis (DTA); (E) days to silking (DTS); and (F) combined results for all traits with standardized genetic effects.