| Literature DB >> 26334534 |
Juliet D Tang1, Andy Perkins2, W Paul Williams3, Marilyn L Warburton4.
Abstract
BACKGROUND: Aflatoxin is a potent carcinogen that can contaminate grain infected with the fungus Aspergillus flavus. However, resistance to aflatoxin accumulation in maize is a complex trait with low heritability. Here, two complementary analyses were performed to better understand the mechanisms involved. The first coupled results of a genome-wide association study (GWAS) that accounted for linkage disequilibrium among single nucleotide polymorphisms (SNPs) with gene-set enrichment for a pathway-based approach. The rationale was that the cumulative effects of genes in a pathway would give insight into genetic differences that distinguish resistant from susceptible lines of maize. The second involved finding non-pathway genes close to the most significant SNP-trait associations with the greatest effect on reducing aflatoxin in multiple environments. Unlike conventional GWAS, the latter analysis emphasized multiple aspects of SNP-trait associations rather than just significance and was performed because of the high genotype x environment variability exhibited by this trait.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26334534 PMCID: PMC4558830 DOI: 10.1186/s12864-015-1874-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Decision tree to find the tagSNP and gene for the GWAS results based on linkage disequilibrium values. The tagSNP is at the terminal branch of the tree, SNP1 is the reference SNP, SNP2 Block is a block of SNPs linked to SNP1 (R2 > 0.8), and SNP2 is a found SNP within the SNP2 Block based on the decisions made by the algorithm. The values of the association effect and significance (p) were obtained from the GWAS. Once a tagSNP was identified, it was assumed that the gene(s) causing the association was within 1 Kb
Fig. 2Analysis pipeline that coupled the GWAS, linkage disequilibrium, and pathway analyses. The outcome or size of the data set following each step is indicated. Assoc, association; LD, linkage disequilibrium; Q + K, genetic marker-based kinship matrix plus population structure files for the mixed linear model analysis implemented by TASSEL; PW, pathways
Summary of the gene-set enrichment analysis for pathways with FDR < 0.2
| MaizeCyc ID | PW Name | NES |
|
| Genesa |
|---|---|---|---|---|---|
| PWY-735 | jasmonic acid biosynthesis | 4.55 | 2.63E-6 | 0.001 | 28 |
| PWY-6124 | inosine-5′-phosphate biosynthesis II | 3.18 | 0.0007 | 0.109 | 8 |
| PWY-5136 | fatty acid β-oxidation II (core pathway) | 2.85 | 0.0022 | 0.163 | 25 |
| ARGASEDEG-PWY | arginine degradation I (arginase pathway) | 2.79 | 0.0026 | 0.163 | 14 |
| FOLSYN-PWY-1 | superpathway of tetrahydrofolate biosynthesis | 2.68 | 0.0037 | 0.163 | 15 |
| PWY-3001 | isoleucine biosynthesis I | 2.64 | 0.0041 | 0.163 | 65 |
| SULFATE-CYS-PWY | superpathway of sulfate assimilation and cysteine biosynthesis | 2.61 | 0.0046 | 0.163 | 24 |
| BRANCHED-CHAIN-AA-SYN-PWY | superpathway of leucine, valine, and isoleucine biosynthesis | 2.55 | 0.0054 | 0.163 | 38 |
| PWY-5409 | divinyl ether biosynthesis II | 2.53 | 0.0056 | 0.163 | 7 |
| PWY-6435 | 4-hydroxybenzoate biosynthesis V | 2.53 | 0.0057 | 0.163 | 20 |
| PWY-3461 | tyrosine biosynthesis II | 2.50 | 0.0063 | 0.163 | 7 |
| PWY-561 | superpathway of glyoxylate cycle | 2.46 | 0.0069 | 0.163 | 59 |
| PWY-6040 | chlorogenic acid biosynthesis II | 2.43 | 0.0076 | 0.163 | 5 |
| PWY-4221-1 | superpathway of pantothenate and coenzyme A biosynthesis II | 2.43 | 0.0076 | 0.163 | 43 |
| THRESYN-PWY | threonine biosynthesis | 2.40 | 0.0083 | 0.165 | 41 |
| LEUSYN-PWY | leucine biosynthesis | 2.37 | 0.0090 | 0.167 | 21 |
| PWY-5481 | pyruvate fermentation to lactate | 2.33 | 0.0098 | 0.172 | 8 |
ID identifier, PW pathway, NES normalized enrichment score
aThe number of genes that were mapped to a pathway and contributed to the enrichment score calculation
Fig. 3Graphs of the running enrichment score calculation for (a) PWY-735 (JA biosynthesis pathway) and (b) ETHYL-PWY (ethylene biosynthesis pathway). Genes were ranked in ascending order by their effect scores. Hash marks at the top of the graph denote the ranks of genes in the pathway. The pathway enrichment score coincided with the maximum running enrichment score and is marked by the dashed vertical line
Fig. 4Relative positions of the genes in PWY-735 on the maize chromosomes. Thick vertical lines depict chromosomes 1–10 (left to right). Genes with notations were associated with a tagSNP and contributed to the enrichment score calculation. Notations: *gene had a negative effect value, **gene had p < 0.1 and effect < −0.1, † gene had a positive effect value. Numbers to the left of the chromosome refer to the pathway step catalyzed. Genes that lacked notations did not contribute to the enrichment score because they lacked polymorphisms among the lines in the association panel or lacked sequenced reads from the region. See text for abbreviations of gene function and EC number of the reaction catalyzed
Annotation and QTL evidence for genes from the JA biosynthesis pathway with the most significant associations and greatest effectsa on lowering aflatoxin accumulation
| tagSNP |
| Rb | Effect | Gene Identifier | MaizeCyc Annotation | Stepc | Bin | QTLd |
|---|---|---|---|---|---|---|---|---|
| S1_188114572 | 0.059 | 0.01 | –0.23 | GRMZM5G822593 | lipoxygenase13 | 1 | 1.06 | mqtl1_5 mqtl1_6 |
| S2_45193435 | 0.047 | 0.01 | –0.24 | GRMZM2G104843 | lipoxygenase8 | 1 | 2.04 | mqtl2_2 |
| S3_168834972 | 0.076 | 0.03 | –0.36 | GRMZM2G156861 | lipoxygenase1 | 1 | 3.05 | mqtl3_3 |
| S4_230398767 | 0.007 | 0.02 | –0.37 | GRMZM2G168404 | hydroperoxide dehydratase | 2 | 4.09 | mqtl4_8 |
| S5_31133128 | 0.002 | 0.04 | –0.60 | GRMZM2G110201 | acetyl-CoA C-acyltransferase | 9 | 5.03 | Mp715 |
| S6_124147817 | 0.065 | 0.01 | –0.15 | GRMZM2G002959 | acyl-CoA dehydrogenase | 6 | 6.05 | None |
| S6_138501372 | 0.062 | 0.02 | –0.20 | GRMZM2G117357 | enoyl-CoA hydratase | 7 | 6.05 | None |
| S8_171705472 | 0.047 | 0.03 | –0.27 | GRMZM2G106250 | 3-hydroxyacyl-CoA dehydrogenase | 8 | 8.08 | None |
aSubset had association p < 0.1 and effect < −0.1
bProportion of the phenotypic variation accounted for by the tagSNP
c Pathway step described in the text
dData for QTL (resistant parent listed) and meta-QTL (mqtl) for resistance to Aspergillus species were compiled from Mideros et al. Supplementary Table 8 [5] and Warburton and Williams [31]. TagSNP was in or within 2 Mb of QTL
Annotation and QTL evidence for individual genes with the most consistent associations for lowering aflatoxin resistancea
| Marker |
| Rb | Effect | Geneid | TAIR Annotationc | Bin | QTLd |
|---|---|---|---|---|---|---|---|
| S1_45772203 | 0.0051 | 0.07 | –0.69 | GRMZM2G135359 | LRRPK family protein | 1.03 | mqtl1_3 |
| S2_23556305 | 0.0009 | 0.10 | –0.77 | GRMZM2G078279 | expansin B3 | 2.03 | mqtl2_1 |
| S2_37133737 | 0.0001 | 0.07 | –0.57 | GRMZM2G057637 | None | 2.04 | None |
| S3_23493748 | 0.0017 | 0.06 | –0.83 | GRMZM2G041352 | endonuclease 2 | 3.04 | mqtl3_2 |
| S4_238436842 | 0.0009 | 0.07 | –0.61 | GRMZM2G174481 | NAD(P)-OR superfamily protein | 4.10 | mqtl4_9 |
| S5_193538196 | 0.0004 | 0.09 | –0.76 | GRMZM2G165601 | None | 5.05 | None |
| S5_213916936 | 0.0061 | 0.07 | –0.75 | GRMZM2G469142 | AlgP | 5.08 | mqtl5_6 |
| S6_5158838 | 0.0023 | 0.08 | –0.54 | GRMZM2G125081 | LRRPK family protein | 6.00 | None |
| S6_5158838 | 0.0023 | 0.08 | –0.54 | GRMZM2G125138 | glycine-rich protein | 6.00 | None |
| S6_161709273 | 0.0040 | 0.07 | –0.82 | GRMZM2G121208 | RTE1 | 6.06 | mqtl6_8 |
| S9_154403351 | 0.0002 | 0.08 | –0.62 | GRMZM2G092741 | AP-2, α subunit | 9.07 | None |
| S9_154245175 | 0.0042 | 0.06 | –0.68 | GRMZM2G303312 | ECT4 | 9.07 | None |
| S10_2433643 | 0.0003 | 0.08 | –0.61 | GRMZM2G151903 | MATE efflux family protein | 10.00 | None |
aTwo sequential filters applied. Filter 1 = p < 0.01, effect < −0.2, and R2 > 0.04. Filter 2 = R2 > 0.06 and effect < −0.5
bProportion of the phenotypic variation accounted for by the tagSNP
cAbbreviations: TAIR, The Arabidopsis Information Resource; LRRPK, leucine-rich repeat protein kinase; NAD(P)-OR, NAD(P)-linked oxidoreductase; AlgP, alginate regulatory protein; RTE1, reversion-to-ethylene sensitivity1; AP, adaptor protein complex; ECT4, evolutionarily conserved C-terminal region 4; MATE, multidrug and toxic compound extrusion. The AlgP annotation was from rice
dData for QTL (resistant parent listed) and meta-QTL (mqtl) number for resistance to Aspergillus species were compiled from Mideros et al. Supplementary Table 8 [5]and Warburton and Williams [31]. TagSNP was either in or within 2 Mb of QTL