| Literature DB >> 25475173 |
Yang Bian, Qin Yang, Peter J Balint-Kurti, Randall J Wisser, James B Holland1.
Abstract
BACKGROUND: A previous study reported a comprehensive quantitative trait locus (QTL) and genome wide association study (GWAS) of southern leaf blight (SLB) resistance in the maize Nested Association Mapping (NAM) panel. Since that time, the genomic resources available for such analyses have improved substantially. An updated NAM genetic linkage map has a nearly six-fold greater marker density than the previous map and the combined SNPs and read-depth variants (RDVs) from maize HapMaps 1 and 2 provided 28.5 M genomic variants for association analysis, 17 fold more than HapMap 1. In addition, phenotypic values of the NAM RILs were re-estimated to account for environment-specific flowering time covariates and a small proportion of lines were dropped due to genotypic data quality problems. Comparisons of original and updated QTL and GWAS results confound the effects of linkage map density, GWAS marker density, population sample size, and phenotype estimates. Therefore, we evaluated the effects of changing each of these parameters individually and in combination to determine their relative impact on marker-trait associations in original and updated analyses.Entities:
Mesh:
Year: 2014 PMID: 25475173 PMCID: PMC4300987 DOI: 10.1186/1471-2164-15-1068
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Inputs for joint linkage mapping QTL analysis and GWAS models
| Joint linkage mapping QTL models | |||
|---|---|---|---|
| QTL Model | Phenotype | No. markers in linkage map | No. NAM RILs |
| 1a | originalb | 1106 | 4694 |
| 2 | original | 1106 | 4354 |
| 3 | updated | 1106 | 4694 |
| 4 | updated | 1106 | 4354 |
| 5 | original | 7386 | 4354 |
| 6 | updated | 7386 | 4354 |
| 7 | updated | 7386 | 4413 |
|
| |||
|
|
|
|
|
| Aa | 1 | 1.6 M HM1 SNPs | 1E-4 |
| B | 1 | 1.6 M HM1 SNPs | 1E-7 |
| C | 1 | 28.2 M HM1/2 SNPs + 0.2 M RDVs | 1E-7 |
| D | 7 | 1.6 M HM1 SNPs | 1E-7 |
| E | 7 | 28.2 M HM1/2 SNPs + 0.2 M RDVs | 1E-7 |
aQTL model 1 and GWAS model A results were previously published [8].
bOriginal phenotype inputs were RIL BLUPs across 3 environments, based on a model with a common flowering time covariate; updated phenotype inputs were RIL BLUPs across 3 environments based on a model with environment-specific flowering time covariates.
Physical and genetic positions for QTL peaks and support intervals (SI) mapped using updated phenotypes and linkage map (model 7) and comparison to QTL previously reported by Kump et al. [8] (model 1)
| Chr | Peak position (AGP v2 bp) | Peak position (cM) | SI Map Position (cM) | Step included | SI overlapped | Distance between QTL peaks (cM) |
|---|---|---|---|---|---|---|
| 1 | 90,443,174 | 85.2 | 84.2 - 86.2 | 3 | y | 0.8 |
| 1 | 218,082,692 | 127.6 | 125.6 - 129.6 | 8 | y | 1.9 |
| 1 | 251,723,948 | 146.8 | 144.8 - 147.8 | 18 | y | 1.0 |
| 1 | 283,549,061 | 177 | 175 - 179 | 22 | n | 29.2 |
| 2 | 7,180,393 | 21.4 | 19.4 - 23.4 | 21 | n | 20.2 |
| 2 | 36,838,070 | 65.4 | 64.4 - 66.4 | 5 | y | 2.3 |
| 2 | 206,275,294 | 111.2 | 110.2 - 113.2 | 23 | n | 30.8 |
| 3 | 5,348,237 | 19.6 | 18.6 - 22.6 | 13 | y | 5.9 |
| 3 | 16,246,035 | 43.4 | 41.4 - 44.4 | 1 | n | 6.6 |
| 3 | 31,533,927 | 54.4 | 53.4 - 55.4 | 10 | c | 4.4 |
| 3 | 170,374,260 | 79.4 | 78.4 - 84.4 | 14 | n | 12.9 |
| 3 | 214,568,867 | 124.4 | 123.4 - 126.4 | 26 | c | 6.8 |
| 3 | 219,498,075 | 134 | 132 - 137 | 6 | y | 2.8 |
| 4 | 1,892,489 | 2.4 | -3.6 - 7.4 | 29 | y | 4.6 |
| 4 | 141,422,921 | 59.8 | 54.8 - 60.8 | 20 | c | 7.9 |
| 4 | 181,935,681 | 94 | 91 - 102 | 17 | n | 24.2 |
| 5 | 15,138,119 | 45.2 | 44.2 - 49.2 | 33 | y | 8.2 |
| 5 | 36,905,989 | 58 | 55 - 59 | 11 | y | 4.6 |
| 5 | 158,046,075 | 74.2 | 72.2 - 77.2 | 19 | y | 3.0 |
| 5 | 200,161,433 | 106.6 | 104.6 - 109.6 | 27 | n | 18.1 |
| 5 | 214,137,041 | 144.4 | 140.4 - 154.4 | 31 | n | 19.7 |
| 6 | 6,929,855 | -0.6 | -1.4 - 0.4 | 9 | y | 0.4 |
| 6 | 144,806,691 | 56.6 | 53.6 - 58.6 | 28 | n | 56.8 |
| 7 | 8,270,900 | 31.2 | 28.2 - 34.2 | 30 | y | 3.0 |
| 7 | 142,429,440 | 74.8 | 73.8 - 76.8 | 25 | c | 5.2 |
| 8 | 36,978,572 | 51.2 | 50.2 - 53.2 | 4 | y | 1.4 |
| 8 | 118,435,453 | 63.2 | 61.2 - 67.2 | 24 | n | 10.6 |
| 8 | 166,705,312 | 102.2 | 101.2 - 104.2 | 15 | y | 1.3 |
| 9 | 16,361,287 | 28.4 | 27.4 - 29.4 | 7 | y | 0.1 |
| 9 | 109,899,486 | 52.6 | 51.6 - 54.6 | 2 | n | 5.6 |
| 10 | 2,040,278 | -1.6 | -2.6 - 3.4 | 32 | n | 38.3 |
| 10 | 75,958,884 | 37.8 | 34.8 - 38.8 | 12 | y | 1.1 |
| 10 | 135,354,773 | 59.8 | 58.8 - 60.8 | 16 | n | 23.1 |
‘Step included’ denotes the regression model building step in which each QTL was selected for inclusion in model 7. ‘SI overlapped’ indicates if models 1 and 7 QTL SI overlapped: y for overlapped, n for not overlapped and c for very close (within 1.5 cM) but not overlapped. ‘Distance between QTL’ indicates the cM distance between the peaks of nearest QTL from models 1 and 7.
Figure 1Heat map of additive effect estimates of 25 founder parent alleles for QTLs of SLB resistance relative to B73. QTL are identified by their genetic locations in the consensus genetic map (7386-marker map); effect estimates for each parental allele are indicated by color blocks. Negative cM values for markers indicate that they are distal to the first marker from the original NAM linkage map on that chromosome.
Figure 2QTL positions on the ten maize chromosomes from seven joint linkage mapping models. Positions in cM are based on 7386-marker map. QTL bar heights are proportional to their partial R , blue-colored margins denote the QTL support intervals, and black spikes denote the QTL peak positions. For the definitions of model inputs, see Table 1.
Input changes in joint linkage mapping QTL modeling affect the values of the chromosome-specific residuals more profoundly than the corresponding predicted phenotypic values
| Model | 1 | 2 | 3 | 4 | 5 | 6 |
|---|---|---|---|---|---|---|
| 1 | 4694 RILs, original BLUPs, 1106 map | 0.88 | 0.91 | 0.81 | 0.81 | 0.77 |
| 2 | SZ | 4354 RILs, original BLUPs, 1106 map | 0.85 | 0.87 | 0.82 | 0.79 |
| 3 | PH | PH, SZ | 4694 RILs, updated BLUPs, 1106 map | 0.82 | 0.79 | 0.79 |
| 4 | PH, SZ | PH | SZ | 4354 RILs, updated BLUPs, 1106 map | 0.80 | 0.77 |
| 5 | GN, SZ | GN | GN, PH, SZ | GN, PH | 4354 RILs, original BLUPs, 7386 map | 0.80 |
| 6 | GN, PH, SZ | GN, PH | GN, SZ | GN | PH | 4354 RILs, original BLUPs, 7386 map |
Upper diagonal shows average correlation of residual outcome for each chromosome for 15 pairwise model comparisons. Lower diagonal shows the input(s) that differed in each pair of model comparison. “GN”, “PH”, and “SZ” denote the different genotype inputs: GN, linkage map (1106- vs. 7386-marker map); PH, phenotype inputs (original vs. updated BLUPs); and SZ, sample size (4354 vs. 4694 RILs), respectively. Diagonal shows the three inputs for each model.
Figure 3Manhattan plots from genome-wide association analysis for SLB across the ten chromosome pairs of maize. The dashed horizontal line in red depicts the resample model inclusion probability (RMIP) thresholds of 0.05 and 0.25. Eighteen candidate genes underlying the most robust GWAS hits (RMIP ≥0.25) located in QTL support intervals are indicated (Table 4).
Highly significant GWAS variants (RMIP ≥ 0.25) and their adjacent candidate genes
| Chromosome | Physical position (AGPv2) | Allele a | Effect b |
| RMIP | Genic position | Inside QTL c | Nearest gene ID | Position of nearest gene (AGPv2) | Annotation | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Start | End | ||||||||||
| 1 | 59772038 | T/C | 0.09 | 9.4E-10 | 0.35 | Intergenicd | N | GRMZM2G134671 | 59764838 | 59769739 | CCT/B-box zinc finger protein |
| 1 | 90059294 | C/T | 0.08 | 8.5E-14 | 0.35 | Intergenic | Y | GRMZM5G806839 | 90108398 | 90107598 | AP2 domain containing protein |
| 2 | 34246559 | A/C | 0.11 | 1.8E-16 | 0.39 | Intergenic | N | GRMZM5G876621 | 34361966 | 34343251 | IBR domain-containing protein |
| 2 | 36938074 | G/A | 0.10 | 1.4E-22 | 0.53 | 3’UTR | Y | GRMZM2G022627 | 36936010 | 36938218 | Transducin/WD40 repeat-like superfamily protein |
| 2 | 204757213 | G/A | 0.11 | 4.5E-09 | 0.41 | Intergenic | N | GRMZM2G142932 | 204771189 | 204767728 | Basic helix-loop-helix (bHLH) DNA-binding protein |
| 2 | 207520001 | CNV- | 0.11 | 4.4E-09 | 0.28 | Intergenic | Y | GRMZM2G062156 | 207635000 | 207637178 | Transporter family protein |
| 3 | 16574125 | T/C | 0.12 | 1.8E-26 | 0.43 | Non-synonymous_codinge | Y | GRMZM2G463580 | 16575869 | 16572429 | Leucine-rich repeat transmembrane protein kinase |
| 3 | 16575842 | C/G | 0.15 | 9.7E-14 | 0.46 | 5’UTR | Y | ||||
| 3 | 32885733 | C/T | 0.19 | 9.1E-16 | 0.9 | Intergenic | Y | GRMZM2G132936 | 32957454 | 32949608 | RNA recognition motif (RRM)-containing protein |
| 3 | 219885527 | T/C | -0.11 | 3.1E-12 | 0.28 | Intronic | Y | GRMZM2G074572 | 219900229 | 219884844 | Stomatal cytokinesis defective/SCD1 protein |
| 3 | 220658469 | A/C | -0.08 | 6.0E-12 | 0.25 | Intronic | Y | GRMZM2G130375 | 220651805 | 220659525 | Beta galactosidase 1 |
| 4 | 2046350 | CGG/--- | -0.08 | 3.8E-09 | 0.26 | Non-synonymous_coding | Y | GRMZM2G131442 | 2044016 | 2047386 | MYB domain protein 112 |
| 5 | 164110001 | CNV- | 0.10 | 1.5E-11 | 0.42 | Intergenic | Y | GRMZM2G111872 | 164120892 | 164124546 | Heavy metal-associated domain containing protein |
| 6 | 7001531 | A/C | -0.08 | 1.1E-15 | 0.4 | Intergenic | Y | GRMZM2G127342 | 7036171 | 7039824 | Lysine histidine transporter 1 (LHT1) |
| 7 | 7958250 | +/- | -0.08 | 1.9E-08 | 0.3 | Intergenic | Y | GRMZM2G021149 | 7954180 | 7955128 | LSM domain containing protein |
| 7 | 142997161 | G/A | 0.09 | 7.3E-09 | 0.28 | Intergenic | Y | GRMZM2G091919 | 143003673 | 143001781 | protein kinase superfamily protein |
| 8 | 22229116 | G/A | 0.13 | 1.3E-08 | 0.35 | Intergenic | N | GRMZM2G325612 | 22249167 | 22245141 | Cytokinin oxidase 5 |
| 8 | 62217745 | T/A | 0.10 | 6.3E-14 | 0.34 | Intergenic | Y | GRMZM2G130101 | 62267220 | 62273955 | Unknown |
| 8 | 171334544 | A/G | -0.06 | 1.2E-09 | 0.26 | Intronic | N | GRMZM2G476009 | 171332654 | 171335508 | DNA-directed RNA polymerases subunit RPABC1 |
| 9 | 16263773 | A/G | 0.17 | 9.9E-09 | 0.33 | Intergenic | Y | GRMZM2G301860 | 16253405 | 16252231 | AP2 domain containing protein |
| 9 | 16317865 | G/A | 0.10 | 1.4E-15 | 0.59 | Down_stream | Y | GRMZM2G099363 | 16320573 | 16318197 | Caffeoyl-CoA O-methyltransferase |
| 9 | 106051436 | G/A | 0.08 | 1.7E-05 | 0.45 | Intergenic | Y | GRMZM2G107886 | 106201175 | 106203143 | CCT/B-box zinc finger protein |
| 9 | 139547728 | C/G | -0.11 | 3.4E-08 | 0.32 | Intergenic | N | GRMZM2G402015 | 139591514 | 139589475 | Plant U-box 13, spotted leaf 11 |
| 10 | 3090835 | G/A | -0.13 | 6.2E-10 | 0.47 | Intergenic | Y | GRMZM2G068465 | 3066458 | 3062645 | Cytochrome P450 |
| 10 | 64647379 | G/A | 0.06 | 5.7E-09 | 0.53 | Intergenic | Y | NA | |||
| 10 | 134501142 | C/T | 0.06 | 4.4E-08 | 0.77 | Intronic | Y | GRMZM2G063972 | 134495450 | 134504022 | Heat shock protein DnaJ |
aAlleles reported as: “B73 allele/alternate allele”. “CNV-” represents the read depth of a line is significantly lower than B73.
bThe mean effect of each significant SNP across data subsamples.
cSNPs located within model 7 QTL SI are indicated as “Y” and “N” otherwise.
dVariants more than 500 bp away from an annotated gene.
eVariation lies in the coding region and results in an amino acid change.
Concordance of variants associated with SLB resistance at resample model inclusion probabilities (RMIPs) of 0.05 and 0.25 from five GWAS analyses
| Comparisons | GWAS models a | RMIP ≥ 0.05 | RMIP ≥ 0.25 | ||||||
|---|---|---|---|---|---|---|---|---|---|
| No. of all significant SNPs b | Proportion of overlapped SNPs (%) c | No. of all significant SNPs | Proportion of overlapped SNPs (%) c | ||||||
| 10-kb window | 100-kb window | 200-kb window | 10-kb window | 100-kb window | 200-kb window | ||||
| GWAS marker density | B vs. C | 326 (151,175) | 26 | 36 | 38 | 45 (25,20) | 13 | 13 | 13 |
| D vs. E | 393 (201,192) | 22 | 34 | 39 | 54 (28,26) | 11 | 11 | 15 | |
| Genetic map density | C vs. E | 367 (175,192) | 14 | 21 | 23 | 46 (20,26) | 9 | 9 | 9 |
| B vs. D | 352 (151,201) | 25 | 29 | 35 | 53 (25,28) | 26 | 28 | 28 | |
| GWAS and genetic marker density | C vs. D | 376 (175,201) | 13 | 21 | 22 | 48 (20,28) | 4 | 8 | 8 |
| B vs. E | 343 (151,192) | 7 | 13 | 15 | 51 (25,26) | 4 | 4 | 4 | |
| GWAS and genetic marker density and | A vs. E | 437 (245,192) | 6 | 12 | 15 | 52 (26,26) | 4 | 4 | 4 |
aSee Table 1 for details of each GWAS model.
bTotal number of all the significant SNPs from a pair of analyses is shown outside of parentheses and numbers of significant SNPs from each analysis considered separately are inside the parenthesis.
cProportion of overlapped SNPs was estimated as the total number of overlapped SNPs from a pair of analyses/total number of all the significant SNPs from the two analyses.
Comparisons show the input(s) that differed in each pair of GWAS comparisons.