| Literature DB >> 32424146 |
Zhenhai Cui1,2, Haixiao Dong2,3, Ao Zhang1, Yanye Ruan1, Siqi Jiang1, Yan He4, Zhiwu Zhang5.
Abstract
The husk-the leaf-like outer covering of maize ear-has multiple functions, including protecting the ear from diseases infection and dehydration. In previous studies, we genotyped an association panel of 508 inbred lines genotyped with a total of ~550,000 SNPs (Illumina 50 K SNP Chip and RNA-seq). Genome-Wide Association Studies (GWAS) were conducted on four husk traits: husk length (HL), husk layer number (HN), husk thickness (HT), and husk width (HW). Minimal associations were identified and none of them passed the P-value threshold after a Bonferroni multiple-test correction using a single locus test in framework of mixed linear model. In this study, we doubled the number of SNPs (~1,250,000 in total) by adding GBS and 600 K SNP Chip. GWAS, performed with the recently developed multiple loci model (BLINK), revealed six genetic loci associated with HN and HT above the Bonferroni multiple-test threshold. Five candidate genes were identified based on the linkage disequilibrium with these loci, including GRMZM2G381691 and GRMZM2G012416. These two genes were up-regulation and down-regulation in all husk related tissues, respectively. GRMZM2G381691 associated with HT encoded a CCT domain protein, which expressed higher in tropical than temperate maize. GRMZM2G012416 associated with HN encoded an Armadillo (ARM) repeat protein, which regulated GA signal pathway. These associated SNPs and candidate genes paved a path to understand the genetic architecture of husk in maize.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32424146 PMCID: PMC7235265 DOI: 10.1038/s41598-020-65164-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Manhattan plots with dense and sparse markers by BLINK and MLM of husk layer number and husk thickness in maize. The Manhattan plots on the left panel display the P-values of the SNPs, categorized by chromosome and position along the chromosome, associated with Husk layer Number (HN) and Husk Thickness (HT). The P-values were calculated using BLINK and MLM software. The physical positions of all significant SNPs are displayed as the vertical dashed lines and the candidate genes for the husk traits are listed at the top of each line. The horizontal green line represents the Bonferroni multiple test threshold corresponding to a type I error of 1% with dense markers and sparse markers (p < 7.98 × 10−9 and p < 2.11 × 10−8). Six SNPs (purple circles) were identified as significantly associated with husk thickness and were located in or near the five candidate genes. (a) Mahattan plot of HN with dense markers by BLINK. (b) Mahattan plot of HN with dense markers by MLM. (c) Mahattan plot of HN with sparse markers by BLINK. (d) Mahattan plot of HN with sparse markers by MLM. (e) Mahattan plot of HT with dense markers by BLINK. (f) Mahattan plot of HT with dense markers by MLM. (g) Mahattan plot of HT with sparse markers by BLINK. (h) Mahattan plot of HT with sparse markers by MLM.
Attributes of the five SNPs associated with husk layer number and husk thickness.
| SNP | Traita | Chr | Position (bp) | Alleleb | PVEc | MAFd | Sequencing platforme | |
|---|---|---|---|---|---|---|---|---|
| SNP1 | HN | 1 | 48923085 | C/ | 6.14 | 0.06 | 3.63E-9 | 600 K |
| SNP2 | HN | 4 | 204094290 | A/ | 7.34 | 0.26 | 4.93E-10 | 600 K |
| Totalf | 20.85 | |||||||
| SNP3 | HT | 2 | 21007733 | T/ | 8.79 | 0.11 | 6.15E-11 | GBS, RNA-seq |
| SNP4 | HT | 2 | 221361730 | T/ | 10.39 | 0.06 | 1.09E-11 | GBS |
| SNP5 | HT | 10 | 94253633 | C/ | 13.17 | 0.18 | 1.19E-09 | 600 K |
| Totalf | 57.33 |
aHusk layer Number (HN) and Husk Thickness (HT).
bMajor/minor allele, underlined bases are the favorable alleles.
cPercentage of phenotypic variation explained by the additive effect of the single significant SNP (PVE).
dMinor Allele Frequency (MAF).
eGBS = genotyping-by-sequencing; 600 K = SNP array with ~600 K markers; RNA-seq = RNA sequencing.
fTotal percentage of phenotypic variation explained by all significant SNPs.
Figure 2Violin plots of allelic effects of five SNPs associated with husk traits. In a violin plot, the inner red box represents the interquartile range. The central white dot represents the median value. The outer gray shape on each side represents all measured data points and the thickness represents the probability density of the data. The P-values of the two allelic effects of the four husk traits [Husk Length (HL), Husk layer Number (HN), Husk Thickness (HT), Husk Width (HW)] are exhibited above each small plot. *Significant at P ≤ 0.05; **Significant at P ≤ 0.01; ***Significant at P ≤ 0.001.
Figure 3Linkage disequilibrium (LD) decay within one million bp surrounding the five candidate QTNs. The physical position of five significant SNPs (purple circles) associated with husk layer number and husk thickness were defined as zero in x axis. The physical position of five candidate genes were displayed with green arrows.
Annotation of the candidate genes related to the five SNPs associated with husk traits.
| Candidate genes | Traita | Chr | Gene interval(bp) | Distance from the related SNP to the edge of the gene (bp)b | Annotation |
|---|---|---|---|---|---|
| GRMZM2G003984 | HN | 1 | 48953687–48999879 | +30602 | Lon protease |
| GRMZM2G012416 | HN | 4 | 204054112–204064316 | −29974 | ARM repeat superfamily protein |
| AC212835.3_FG007 | HT | 2 | 21007540–21010895 | Located | Poly(U)-specific endoribonuclease-B |
| GRMZM2G057159 | HT | 2 | 221366850–221371447 | +5120 | Subtilase family protein |
| GRMZM2G381691 | HT | 10 | 94248710–94251264 | −52369 | CCT domain protein |
aHusk layer Number (HN) and Husk Thickness (HT).
bThe positive (+) and negative (−) values represent related SNPs location in the 5′ and 3′ direction, respectively, of their candidate gene.
Figure 4Variances of SNPs for P-value associated with husk traits and MAF in five candidate genes. On the half above, the P-values were calculated using BLINK software. The horizontal black dashed line represents the Bonferroni multiple test threshold corresponding to a type I error of 1% (p < 7.98 × 10−9). On the half blow, the y axis represents the Minor Allele Frequency (MAF). The x axis represents the physical position of each candidate gene.
Figure 5Heat-map of tissue-specific expression patterns of the five candidate genes. The in-silico gene expression of the six candidate genes were extracted for husk and other 13 tissues. The gene expressions were represented by the normalized Reads Per Kilobase per Million (RPKM). The log2 transformation of the ratio of gene expression in husk against the gene expression in other 13 tissues were hierarchically clustered in two dimensions (gene and tissue) and displayed using heat map. The expression of GRMZM2G012416 in husk were lower than all other tissues.