| Literature DB >> 28099470 |
Cheng He1,2, Junjie Fu2, Jie Zhang1, Yongxiang Li2, Jun Zheng2, Hongwei Zhang2, Xiaohong Yang1, Jianhua Wang1, Guoying Wang2.
Abstract
The extensive genetic variation present in maize (Zea mays) germplasm makes it possible to detect signatures of positive artificial selection that occurred during temperate and tropical maize improvement. Here we report an analysis of 532,815 polymorphisms from a maize association panel consisting of 368 diverse temperate and tropical inbred lines. We developed a gene-oriented approach adapting exonic polymorphisms to identify recently selected alleles by comparing haplotypes across the maize genome. This analysis revealed evidence of selection for more than 1100 genomic regions during recent improvement, and included regulatory genes and key genes with visible mutant phenotypes. We find that selected candidate target genes in temperate maize are enriched in biosynthetic processes, and further examination of these candidates highlights two cases, sucrose flux and oil storage, in which multiple genes in a common pathway can be cooperatively selected. Finally, based on available parallel gene expression data, we hypothesize that some genes were selected for regulatory variations, resulting in altered gene expression.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28099470 PMCID: PMC5242465 DOI: 10.1371/journal.pone.0169806
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Genome-wide analysis of selected genomic regions in maize.
Genome-wide XP-EHH scores for temperate (A) and tropical (B) selected regions. The cutoff lines represent an empirical cutoff for the top 1%. Distributions of region size (C) and gene counts (D) within selected regions in tropical and temperate maize lines.
Genomic regions selected in tropical and temperate maize lines.
| Region number | Mean gene number | Mean region size | Coverage of genome | |
|---|---|---|---|---|
| 730 | 5.29 | 95.2kb | 3.38% | |
| 421 | 3.19 | 71.1kb | 1.45% |
a Number of selected regions.
b Average number of genes in the selected regions.
c Average physical size of the selected regions.
d The percentage of all selected regions in the maize genome.
The top 1% of genes in the selected genomic regions in temperate and tropical maize.
| Super classes of genes | Background | Tropical candidate genes | Temperate candidate genes | |||||
|---|---|---|---|---|---|---|---|---|
| number | % | number | % | number | % | |||
| PK | 1228 | 3.11% | 21 | 4.76% | 1.58E-02 | 40 | 4.48% | 5.31E-03 |
| TF | 2836 | 7.19% | 42 | 9.52% | 1.24E-02 | 67 | 7.51% | 4.79E-02 |
| UPS | 1205 | 3.06% | 16 | 3.63% | 8.06E-02 | 34 | 3.81% | 3.12E-02 |
a Protein Kinases.
b Transcription Factors.
c Ubiquitin-Proteasome System.
d P-values of the hypergeometric-test.
Genes showing differences in expression in the maize genome and selected genomic regions.
| Filtered gene set | Tropical candidate genes | Temperate candidate genes | ||||||
|---|---|---|---|---|---|---|---|---|
| number | % | number | % | number | % | |||
| 32681 | 416 | 727 | ||||||
| 13529 | 41.4% | 202 | 48.6% | 3.01E-03 | 363 | 49.9% | 3.01E-06 | |
| 9473 | 29.0% | 146 | 35.1% | 6.14E-03 | 262 | 36.0% | 2.89E-05 | |
a Genes in which the expression differences between tropical and temperate populations are significant at P-values <0.05.
b Genes in which the expression difference between tropical and temperate maize is significant at P-values <0.01.
c P-values of the chi-square test.
Selected key genes in genomic regions selected during improvement in temperate maize.
| Gene | Chromosome | Position | Annotation | Selected region | Expression difference | |||
|---|---|---|---|---|---|---|---|---|
| Direction | XP-EHH score | TMP expression | TPC expression | |||||
| GRMZM2G442658 | 1 | 273983252–2739871476 | alcohol dehydrogenase1, adh1 | TPC | 1.40767 | 333.57 | 294.86 | 0.05 |
| GRMZM2G087095 | 1 | 277310667–277348765 | Zea mays MADS24, zmm24 | TMP | -1.65527 | 1.40 | 2.23 | 6.06E-6 |
| GRMZM2G104546 | 2 | 172843794–172863857 | aspartate kinase homoserine dehydrogenase2, akh2 | TMP | -1.79858 | 11.84 | 11.77 | 0.56 |
| GRMZM2G174784 | 2 | 5514479–5518846 | AP2-EREBP-transcription factor 197, ereb197 | TPC | 1.53975 | 1.19 | 1.29 | 0.44 |
| GRMZM2G026643 | 3 | 27551288–27558978 | outer cell layer1, ocl1 | TPC | 1.66092 | 4.01 | 3.47 | 4.29E-3 |
| GRMZM2G109383 | 5 | 10855539–10861668 | phosphoglucomutase2, pgm2 | TMP | -1.29784 | 130.18 | 136.06 | 0.08 |
| GRMZM5G883855 | 5 | 16092084–16099014 | ameiotic1, am1 | TMP | -1.33957 | 1.67 | 0.98 | 0.80 |
| GRMZM2G087233 | 5 | 19437605–19439848 | QM1 homolog1, qm1 | TMP | -1.35767 | 126.61 | 139.05 | 0.16 |
| GRMZM2G139300 | 5 | 169454598–169459090 | cell wall invertase1, incw1 | TMP | -1.68728 | 8.53 | 6.46 | 2.14E-3 |
| GRMZM2G100018 | 6 | 121390345–121391179 | delta zein structural18, dzs18 | TMP | -1.4945 | 4813.49 | 548.13 | 1.04E-7 |
| GRMZM2G079440 | 6 | 137142553–137144384 | dehydrin1, dhn1 | TPC | 1.57388 | 10.79 | 8.74 | 0.07 |
| GRMZM2G034647 | 8 | 120060780–120063628 | cyclin1, cyc1 | TMP | -1.49137 | 1.64 | 1.63 | 0.48 |
| GRMZM2G152908 | 9 | 122220190–122226863 | sucrose synthase1, sus1 | TMP | -1.44781 | 267.39 | 251.33 | 8.69E-3 |
| GRMZM2G067546 | 9 | 131161707–131169929 | vacuolar sorting receptor homolog1, vacs1 | TMP | -2.21364 | 31.73 | 28.33 | 2.78E-3 |
| GRMZM2G305046 | 9 | 139187765–139188857 | histone2A1, his2a1 | TMP | -1.86109 | 145.66 | 117.16 | 0.04 |
| GRMZM2G102161 | 9 | 146679028–146687322 | Zea mays MADS8, zmm8 | TPC | 1.74751 | 41.95 | 46.38 | 0.13 |
| GRMZM2G171365 | 9 | 153812731–153832523 | MADS1, mads1 | TPC | 2.26930 | 16.48 | 26.62 | 2.86E-8 |
| GRMZM2G147266 | 10 | 1279130–1283746 | ferritin homolog2, fer2 | TMP | -1.42697 | 1.75 | 1.20 | 0.03 |
| GRMZM5G803874 | 10 | 138485066–138487763 | inhibitor of striate1, isr1 | TMP | -1.64537 | 1.08 | 1.09 | 0.25 |
a Each candidate gene is annotated according to MaizeGDB (http://www.maizegdb.org).
b Directions of selected regions. TMP means that this gene is in temperate selected region and TPC means in tropical selected region.
c XP-EHH score is the most significant score of selected region.
d The median gene expression in temperate population.
e The median gene expression in tropical population.
f The P-value of the t-test.
* selected key genes (top 1%).
** strongly selected key genes (top 0.5%).
Fig 2Selected genes in the sucrose catabolism pathway.
(A) Diagram showing the sucrose catabolism pathway with six genes selected during maize improvement. (B) Expression levels of six selected sucrose catabolism genes in tropical and temperate maize lines. The double stars represent highly significant expression differences between tropical and temperate lines. (C) XP-EHH scores and haplotypes of SNPs in one selected gene (GRMZM152908; sus1).
Fig 3Selected genes in the triacylglycerol (TAG) biosynthesis pathway.
Three genes (GRMZM2G083195, GRMZM2G079109, and GRMZM2G061885) that encode enzymes in the TAG biosynthesis pathway (GPAT, LPAAT, and PDAT) were strongly selected in temperate maize. The asterisks indicate candidate genes identified in a previous GWAS study of maize oil [27].
Candidate selected genes involved in oil biosynthesis in selected genomic regions in temperate maize.
| Gene | Chromosome | Position | Annotation | Selected region | |
|---|---|---|---|---|---|
| Level | XP-EHH score | ||||
| GRMZM2G003720 | 2 | 48997817–49000991 | acyl-CoA thioesterase, | 0.01 | -1.41008 |
| GRMZM2G061885 | 2 | 184692047–184698536 | phospholipid:diacylglycerol acyltransferase, | 0.005 | -1.46748 |
| GRMZM2G079109 | 2 | 194036492–194041722 | 1-acylglycerol-3-phosphate O-acyltransferase, | 0.005 | -1.68848 |
| GRMZM2G083195 | 3 | 178181704–178185282 | glycerol-3-phosphate O-acyltransferase, | 0.005 | -1.63522 |
| GRMZM2G003501 | 4 | 166828995–166830916 | very-long-chain 3-ketoacyl-CoA synthase | 0.005 | -1.51649 |
| GRMZM2G167576 | 9 | 36849526–36850583 | 0.005 | -1.56933 | |
| GRMZM2G152105 | 9 | 122514778–122520703 | β-ketoacyl-acyl-carrier-protein synthase I | 0.01 | -1.40285 |
a Candidate genes were annotated according to MaizeGDB (http://www.maizegdb.org)
b Significance levels for genes in all selected candidate regions
c XP-EHH score is the most significant score of the selected region
d This gene is a candidate gene identified in a previous GWAS study [27]