| Literature DB >> 30548709 |
Junling Pang1, Junjie Fu2, Na Zong1, Jing Wang1, Dandan Song1, Xia Zhang1, Cheng He2, Ting Fang2, Hongwei Zhang2, Yunliu Fan1, Guoying Wang2, Jun Zhao1.
Abstract
In maize, kernel traits strongly impact overall grain yields, and it is known that sophisticated spatiotemporal programs of gene expression coordinate kernel development, so advancing our knowledge of kernel development can help efforts to improve grain yields. Here, using phenotype, genotype and transcriptomics data of maize kernels at 5 and 15 days after pollination (DAP) for a large association mapping panel, we employed multiple quantitative genetics approaches-genome-wide association studies (GWAS) as well as expression quantitative trait loci (eQTL) and quantitative trait transcript (QTT) analyses-to gain insights about molecular genetic basis of kernel development in maize. This resulted in the identification of 137 putative kernel length-related genes at 5 DAP, of which 43 are located in previously reported QTL regions. Strikingly, we identified an eQTL that overlaps the locus encoding a maize homolog of the recently described m6 A methylation reader protein ECT2 from Arabidopsis; this putative epi eQTL is associated with 53 genes and may represent a master epi-transcriptomic regulator of kernel development. Notably, among the genes associated with this epi eQTL, 10 are for the main storage proteins in the maize endosperm (zeins) and two are known regulators of zein expression or endosperm development (Opaque2 and ZmICE1). Collectively, beyond cataloging and characterizing genomic attributes of a large number of eQTL associated with kernel development in maize, our study highlights how an eQTL approach can bolster the impact of both GWAS and QTT studies and can drive insights about the basic biology of plants.Entities:
Keywords: Zea mays L; complex trait; expression quantitative trait loci; kernel development; m6A RNA modification
Mesh:
Year: 2019 PMID: 30548709 PMCID: PMC6850110 DOI: 10.1111/tpj.14193
Source DB: PubMed Journal: Plant J ISSN: 0960-7412 Impact factor: 6.417
Figure 1Overview of single nucleotide polymorphisms (SNPs) and expression quantitative trait loci (eQTL) results for kernels at 5 days after pollination (DAP). (a) The number of SNPs called from 5 DAP and 15 DAP (Fu et al., 2013), and their overlap. (b) Concordance rate of the SNPs shared by the two developmental stages. (c) The start position of the 18 377 eQTL‐associated genes (y‐axis) against the position of the most significant SNPs in their eQTL regions (x‐axis). (d) Display of the eQTL results for the gene as an example. The upper panel shows the position of the eQTL and the association signals of all the SNPs located in this eQTL region, while the linkage disequilibrium (LD) plot for all the SNPs included is shown in the lower panel. Information for the most significant SNP is highlighted in the blue box. (e, f) The expression divergence for different haplotypes based on the most significant SNP in 2. (e) Three random samples were chosen for each haplotype, and the abundance of mapped reads is shown for the same coverage range. (f) The overall expression divergence of the two haplotypes of displayed as a box plot.
Figure 2Identification of kernel length‐related genes from integrated analysis of expression quantitative trait loci (eQTL), quantitative trait transcript (QTT) and genome‐wide association studies (GWAS) analyses. (a) GWAS signals of kernel length using different association models: mixed linear model (MLM; top) and general linear model (GLM; bottom). The dashed horizontal lines represent the significant P‐value cut‐off (1.29E‐6). (b) Illustration of the integrative approach used to identify genes related to kernel length at 5 days after pollination (DAP). The combination of eQTL and QTT reveals kernel length‐related genes at the expressional level, and then these genes were filtered by considering the GWAS signals. MAF, minor allele frequency. (c) Correlation of the expression level of GRMZM2G099074 with kernel length. (d) Comparison of kernel length between the two haplotypes based on the most significantly associated SNP of GRMZM2G099074. The kernel length data were used from previous studies (Liu et al., 2016).
List of selected kernel length‐related genes at 5 DAP
| Gene | Functional annotation | eQTL | QTT | GWAS | QTL ref. | ||||
|---|---|---|---|---|---|---|---|---|---|
| SNP (A1/A2) | Mean exp (A1/A2) |
| Cor |
| Mean KL (A1/A2) |
| |||
| GRMZM2G143782 | mab23; Speckle‐type POZ | S6_93485430 (G/A) | 13.2/32.5 | < 2.2E‐16 | 0.33 | 2.1E‐04 | 8.7/9.2 | 1.0E‐07 | (Li |
| GRMZM2G446426 | mads52; MADS‐box transcription factor | S3_38744838 (C/T) | 18.2/8.0 | 1.6E‐12 | −0.33 | 2.2E‐04 | 8.7/9.2 | 1.7E‐08 | (Wang |
| GRMZM2G161545 | Ubiquitin‐conjugating enzyme | S4_146090569 (G/C) | 8.5/5.1 | 2.2E‐16 | −0.30 | 1.0E‐03 | 8.8/9.4 | 5.5E‐07 | (Li |
| GRMZM2G164835 | NA | S6_93657368 (C/G) | 8.7/10.6 | 2.3E‐06 | 0.23 | 1.1E‐03 | 8.7/9.5 | 6.9E‐11 | (Li |
| GRMZM2G122965 | Putative kinesin member | S1_47911257 (G/C) | 7.3/8.3 | 2.6E‐02 | 0.27 | 1.7E‐03 | 8.6/9.1 | 9.5E‐08 | (Li |
| GRMZM2G165746 | Putative mitochondrial substrate/solute carrier | S3_60257567 (A/C) | 21.0/11.2 | 1.6E‐12 | 0.28 | 2.3E‐03 | 9.0/8.6 | 5.8E‐07 | (Li |
| GRMZM2G144615 | F‐box containing protein | S3_9041378 (C/T) | 21.6/17.5 | 6.3E‐15 | −0.26 | 2.7E‐03 | 8.7/9.3 | 9.9E‐09 | (Wang |
| GRMZM2G300924 | ereb170; AP2/EREBP transcription factor | S5_189817208 (C/A) | 2.8/3.9 | 2.0E‐02 | 0.25 | 3.1E‐03 | 8.7/9.4 | 1.2E‐08 | (Li |
| GRMZM2G123355 | Glycosyltransferase 6 | S4_191222267 (G/T) | 13.7/21.7 | 4.3E‐07 | 0.23 | 4.6E‐03 | 8.8/9.5 | 4.5E‐09 | (Wang |
| GRMZM2G037177 | NA | S7_169965607 (G/T) | 101.6/86.1 | 6.2E‐09 | −0.25 | 4.8E‐03 | 8.8/9.3 | 1.2E‐06 | (Veldboom |
| GRMZM2G093035 | NA | S6_13921812 (G/A) | 3.8/7.0 | 7.9E‐08 | 0.25 | 4.9E‐03 | 8.8/9.3 | 5.6E‐07 | (Li |
| GRMZM2G067555 | NA | S4_198308777 (A/C) | 4.3/2.2 | 1.6E‐12 | −0.23 | 5.3E‐03 | 8.7/9.3 | 1.7E‐07 | (Wang |
| GRMZM2G148867 | Putative glutaredoxin | S1_201782255 (A/G) | 10.3/8.4 | 1.6E‐12 | −0.23 | 5.6E‐03 | 8.7/9.1 | 1.1E‐06 | (Li |
| GRMZM2G154029 | HLH DNA‐binding protein | S9_121002306 (C/T) | 13.0/34.6 | 1.2E‐12 | 0.25 | 6.5E‐03 | 8.7/9.4 | 5.2E‐08 | (Li |
| GRMZM2G176133 | 50S ribosomal protein L33 | S6_158610330 (T/C) | 39.3/34.1 | 6.0E‐05 | −0.23 | 6.5E‐03 | 8.6/9.2 | 4.4E‐10 | (Wang |
| GRMZM2G001648 | NA | S4_23004680 (G/A) | 3.4/34.4 | 7.4E‐11 | 0.24 | 7.3E‐03 | 8.8/9.6 | 3.9E‐07 | (Li |
| GRMZM2G092925 | Protein HVA22 | S1_198967213 (C/T) | 4.1/16.8 | 1.7E‐12 | 0.23 | 7.8E‐03 | 8.7/9.5 | 1.4E‐09 | (Li |
| GRMZM2G043943 | Pectinesterase | S6_160526879 (A/G) | 9.3/24.8 | 2.2E‐07 | 0.21 | 8.0E‐03 | 8.8/9.4 | 5.1E‐07 | (Wang |
| GRMZM2G058900 | NA | S6_93993703 (A/G) | 11.6/8.6 | < 2.2E‐16 | −0.22 | 8.1E‐03 | 8.8/9.3 | 3.8E‐07 | (Li |
| GRMZM2G082855 | pzb01110; Leucine‐rich repeat receptor‐like protein | S9_24080734 (T/G) | 7.1/4.7 | 1.1E‐11 | 0.20 | 8.7E‐03 | 9.0/8.5 | 7.9E‐08 | (Liu |
| GRMZM2G126808 | hb45; Homeobox protein | S1_29544058 (C/A) | 12.2/24.8 | 4.9E‐14 | 0.22 | 8.9E‐03 | 8.7/9.4 | 1.3E‐09 | (Wang |
| GRMZM2G012126 | NA | S1_29263923 (G/C) | 13.5/5.7 | < 2.2E‐16 | 0.19 | 1.1E‐02 | 9.1/8.6 | 4.7E‐09 | (Wang |
| GRMZM2G180622 | Sarcoplasmic reticulum histidine‐rich calcium‐binding | S3_28621143 (G/A) | 13.9/5.1 | 1.6E‐12 | 0.20 | 1.2E‐02 | 9.0/8.5 | 1.4E‐07 | (Wang |
| GRMZM2G147966 | NA | S7_117728836 (C/T) | 22.1/28.1 | 1.9E‐02 | 0.21 | 1.2E‐02 | 8.7/9.4 | 1.3E‐07 | (Li |
| GRMZM5G874568 | NA | S6_91797273 (C/G) | 2.9/34.6 | 1.6E‐12 | 0.21 | 1.3E‐02 | 8.7/9.3 | 2.1E‐09 | (Li |
| GRMZM2G139650 | Phytoene dehydrogenase‐like | S5_101336590 (A/G) | 12.0/18.5 | 1.6E‐12 | −0.20 | 1.4E‐02 | 9.0/8.5 | 5.8E‐07 | (Li |
| GRMZM2G353785 | Ribosomal subunit | S7_147108286 (A/G) | 9.2/5.2 | 1.6E‐12 | 0.20 | 1.5E‐02 | 9.0/8.5 | 2.9E‐07 | (Liu |
| GRMZM2G154394 | NA | S4_5399845 (T/C) | 7.1/9.1 | 3.7E‐07 | −0.22 | 1.7E‐02 | 8.9/8.4 | 1.3E‐06 | (Li |
| GRMZM2G333875 | Putative kinase | S6_88865767 (T/C) | 8.6/14.1 | 7.7E‐08 | 0.17 | 2.1E‐02 | 8.7/9.3 | 1.0E‐08 | (Li |
| GRMZM2G000481 | Putative tRNA synthetase | S4_34371927 (G/A) | 3.3/7.2 | 1.6E‐12 | 0.21 | 2.2E‐02 | 8.7/9.3 | 2.7E‐10 | (Li |
| GRMZM2G069146 | ereb115; AP2/EREBP transcription factor | S7_141174723 (G/C) | 37.8/17.1 | 2.1E‐12 | 0.17 | 2.3E‐02 | 9.1/8.6 | 1.3E‐06 | (Liu |
| GRMZM2G551402 | Putative ubiquitin protease | S4_201441462 (T/C) | 14.0/16.1 | 1.2E‐04 | 0.16 | 2.3E‐02 | 8.6/9.1 | 8.9E‐08 | (Wang |
| GRMZM2G053083 | NA | S7_147098815 (G/C) | 37.3/57.3 | 5.2E‐14 | −0.15 | 2.5E‐02 | 9.0/8.5 | 9.1E‐07 | (Liu |
| GRMZM2G104449 | NA | S4_200716340 (C/T) | 8.5/9.6 | 3.9E‐05 | −0.19 | 2.8E‐02 | 8.7/9.1 | 9.4E‐07 | (Wang |
| GRMZM2G166873 | bZIP transcription factor | S8_162663083 (T/G) | 14.7/15.6 | 4.2E‐02 | 0.19 | 2.9E‐02 | 8.6/9.1 | 1.2E‐06 | (Veldboom |
| GRMZM2G034598 | Beta‐hexosaminidase | S6_34455108 (C/G) | 16.6/21.9 | 1.2E‐10 | −0.19 | 3.1E‐02 | 9.0/8.5 | 4.2E‐08 | (Li |
| GRMZM2G352607 | Lipid‐binding protein | S9_44283747 (C/T) | 7.7/5.7 | 3.3E‐11 | −0.19 | 3.2E‐02 | 8.8/9.5 | 3.2E‐08 | (Li |
| GRMZM2G070716 | NADH‐ubiquinone oxidoreductase B16.6 subunit | S1_25369971 (G/A) | 15.5/27.5 | 1.6E‐12 | 0.16 | 3.3E‐02 | 8.7/9.2 | 1.4E‐07 | (Wang |
| GRMZM2G157536 | NA | S6_155435074 (C/G) | 13.1/21.8 | 2.3E‐06 | 0.17 | 3.8E‐02 | 8.7/9.2 | 1.1E‐06 | (Veldboom |
| GRMZM2G102657 | Glycosyltransferase‐related family protein | S9_132170051 (C/T) | 1.4/8.4 | 1.6E‐12 | 0.19 | 4.1E‐02 | 8.7/9.3 | 1.6E‐07 | (Chen |
| GRMZM5G899300 | bub3; Mitotic checkpoint | S9_20608662 (T/C) | 42.4/50.2 | 5.0E‐08 | 0.15 | 4.4E‐02 | 8.7/9.1 | 9.8E‐07 | (Liu |
| GRMZM2G009045 | NA | S7_125138530(C/A) | 5.0/9.1 | 4.2E‐04 | 0.14 | 4.8E‐02 | 8.7/9.2 | 3.3E‐07 | (Li |
| GRMZM2G172322 | gsr1; Glutathione reductase | S1_12985707 (C/G) | 10.3/13.8 | 1.8E‐12 | −0.15 | 4.9E‐02 | 9.0/8.5 | 1.1E‐06 | (Wang |
All of the SNP names begin with ‘S’ and then followed by the chromosome number and the genomic position (AGPv3) separated with ‘_’; A1 and A2 represent the two alleles of SNP.Cor, correlation; eQTL, expression quantitative trait loci; Exp, expression; GWAS, genome‐wide association studies; KL, kernel length; QTT, quantitative trait transcript; Ref., reference; SNP, single nucleotide polymorphism.
Figure 3Integration and comparison of the expression quantitative trait loci (eQTLs) identified from 5 days after pollination (DAP) and 15 DAP maize kernels. (a) Schematic diagram showing the integration of the eQTL regions by retaining the maximum region for any intersecting eQTLs of the two stages. (b) The genomic distribution of the merged eQTL dataset. (c) Venn diagram to show the number of stage‐shared and stage‐specific eQTLs. (d) Positional distribution of the stage‐shared and stage‐specific eQTLs relative to the transcriptional start site (TSS) of their associated genes. The x‐axis shows the distance of eQTLs to the annotated TSS of their associated genes. The y‐axis represents the eQTL density per 5‐kb bin. Red dots and lines highlight the distribution pattern of stage‐specific eQTLs, and blue ones represent stage‐shared eQTLs. (e) The allelic effect of stage‐shared eQTLs. Each point represents an eQTL‐gene association that was significant in both stages. For each eQTL, the lead SNP with the most significant association was chosen as a representative example. The axes show the expression divergence (fold change) between the major and minor alleles of the associated genes at both stages. The eQTLs with inverse regulatory effects between the two stages are shown in red. (f) One eQTL example with allelic effect to its associated gene. The expression levels of the two alleles of GRMZM2G059580 are shown for both 5 DAP and 15 DAP, and inverse regulatory trends are presented with box plots.
Figure 4A proposed epieQTL associated with multiple genes at 15 days after pollination (DAP). (a) The genomic position of one 15‐DAP‐specific expression quantitative trait loci (eQTL; chr7: 8310530…8315224) associated with 53 genes. Information for the two haplotypes based on the most significantly associated single nucleotide polymorphism (SNP; S7_8311142) is highlighted. (b) The phenotypic divergence (t‐test) of the two haplotypes for kernel length, kernel thickness and 100 grain weight. The kernel phenotypic data were used from previous studies (Liu et al., 2016). (c) The genomic distribution of all the 53 target genes associated with this eQTL. Genes that are annotated as functional or putative zein genes are highlighted in blue. (d) The phylogenetic tree of GRMZM2G144726 with other YTH domain‐containing genes in both human and Arabidopsis. GRMZM2G144726 was grouped with ECT1‐4 as the ‘ECT1‐4‐subclade’. (e) Multiple sequence alignment of the YTH domain for the ECT1‐4‐subclade genes with the YTH‐domain‐containing genes in human. The boxes highlight the conserved regions among them; the amino acids with red background are highly conserved sites.