Literature DB >> 29398937

Genome-wide association mapping for flowering and maturity in tropical soybean: implications for breeding strategies.

Rodrigo Iván Contreras-Soto1,2,3, Freddy Mora4, Fabiane Lazzari5, Marco Antônio Rott de Oliveira6, Carlos Alberto Scapim1, Ivan Schuster5.   

Abstract

Knowledge of the genetic architecture of flowering and maturity is needed to develop effective breeding strategies in tropical soybean. The aim of this study was to identify haplotypes across multiple environments that contribute to flowering time and maturity, with the purpose of selecting desired alleles, but maintaining a minimal impact on yield-related traits. For this purpose, a genome-wide association study (GWAS) was undertaken to identify genomic regions that control days to flowering (DTF) and maturity (DTM) using a soybean association mapping panel genotyped for single nucleotide polymorphism (SNP) markers. Complementarily, yield-related traits were also assessed to discuss the implications for breeding strategies. To detect either stable or specific associations, the soybean cultivars (N = 141) were field-evaluated across eight tropical environments of Brazil. Seventy-two and forty associations were significant at the genome-wide level relating respectively to DTM and DTF, in two or more environments. Haplotype-based GWAS identified three haplotypes (Gm12_Hap12; Gm19_Hap42 and Gm20_Hap32) significantly co-associated with DTF, DTM and yield-related traits in single and multiple environments. These results indicate that these genomic regions may contain genes that have pleiotropic effects on time to flowering, maturity and yield-related traits, which are tightly linked with multiple other genes with high rates of linkage disequilibrium.

Entities:  

Keywords:  linkage disequilibrium; pleiotropy; quantitative trait loci; tropical soybean

Year:  2017        PMID: 29398937      PMCID: PMC5790042          DOI: 10.1270/jsbbs.17024

Source DB:  PubMed          Journal:  Breed Sci        ISSN: 1344-7610            Impact factor:   2.086


Introduction

Flowering, maturity and plant height are key complex traits determining soybean productivity and adaptability (Cober and Morrison 2010, Zhang ). Most of these traits have been studied through correlation with yield to improve the understanding of their relationship to yield components (Fox , Li , Mansur ). Moreover, to improve relevant agronomic traits in breeding programs, where large populations are evaluated every year, genotyping with a small number of markers would be more feasible (Schuster 2011). Consequently, it is desirable to identify molecular markers in genetically superior progenies or exotic plant introduction with favorable alleles, which should be successfully introgressed using marker assisted selection (MAS) (Fox ). Yield-quantitative trait loci (QTL) are often detected within the context of specific soybean breeding populations and environments, since some conditions in any given environment, geographic region or year can change the grain yield (Guzman , Orf ). According to Palomeque , studies have identified QTLs associated with traits of interest that appear to be independent of the environment but dependent on the genetic background in which they found. The difficulty of identifying yield-QTL effective for MAS across a wide range of genetic and/or environmental contexts might be addressable by using preliminary yield trials to model target haplotypes within each context and then immediately selecting inbred lines that target genotypes in real time (Sebastian ). Sebastian demonstrated that using MAS with haplotypes to improve grain yield is possible if focused within a specific genetic and environmental context. In addition, the context-specific approach has already been adopted as a major component of MAS strategies known commercially as Accelerated Yield Technology (AYT) at Pioneer Hi-Bred International. Genome-wide association studies (GWAS) using individual Single Nucleotide Polymorphism (SNPs) and haplotype information have been used to improve agronomical traits in soybean (Contreras-Soto , Hao , Zhang ). A haplotype block is a genomic region in which two or more polymorphic loci (i.e., SNP) in close proximity tend to be inherited together with high probability (Abdel-Shafy ). These blocks are believed to be caused by recombination hotspots with extremely rare recombination within stretches of DNA, where the enclosed SNPs consequently segregate together from one generation to the next, acting as combined multi-site alleles (Greenspan and Geiger 2004). The combination of SNP alleles in a haplotype block on one chromosome covers the observed variation and can have higher linkage disequilibrium (LD) with the allele of a QTL than individual SNP alleles that are used to construct the haplotype (Abdel-Shafy ). Furthermore, haplotype association is likely to be more powerful in the presence of LD (Garner and Slatkin 2003). Lorenz used simulated phenotype data to show that the use of SNP-based haplotypes can increase power over the use of single-SNP markers in GWAS. Using haplotypes for QTL mapping could compensate for the bi-allelic limitation of SNPs, and substantially improve the efficiency of QTL mapping (Yang ). According to Song , highly selfing species, such as soybean, are in many ways uniquely suitable for haplotype block mapping. Therefore, the aim of this study was to identify haplotypes across multiple environments that contribute to time to flowering and maturity in tropical soybean, with the view to improve the selection of desired alleles for these traits, but with minimal impact on yield.

Material and Methods

Plant material and field evaluation

The association panel of this study consisted of 141 cultivars of tropical soybean (Supplemental Table 1), which were field evaluated in five locations that represent eight environments of Brazil: Cascavel (24°52′54.9″S 53°32′30.4″W) in the growing seasons 2012/2013, 2013/2014 and 2014/2015 (Cas12/13, Cas13/14 and Cas14/15, respectively); Palotina (24°21′06.5″S 53°45′24.9″W) in the growing season 2014/2015 (Pal14/15); Primavera do Leste (15°34′37.6″S 54°20′41.8″W) in the growing season 2012/2013 (Pri12/13), Rio Verde (17°45′49.0″S 51°01′49.3″W) in the growing season 2013/2014 and 2014/2015 (Rio13/14 and Rio14/15), and Sorriso (12°32′43.6″S 55°42′41.8″W) in the growing season 2014/2015 (Sorr14/15). These locations were chosen on the basis of their diversity of latitude and altitude. Field trials were arranged in a complete block design with two replications. Fertilizer and field management practices recommended for optimum soybean production were used according to Embrapa (2011).

Phenotypic data analysis

Seed yield (SY), 100-seed weight (SW), plant height (PH), number of days to flowering (DTF) and maturity (DTM) were measured in the 141 soybean cultivars across the eight environments. Flowering dates were recorded when 50% of plants in a plot had open flowers. DTF was measured by counting days from emergence to flowering, when approximately 50% of plants per plot had at least one open flower (R1), and DTM was measured by counting the days from planting to the date when plants had 95% of their pods dry (R8 on the scale of Fehr and Caviness 1977). Field data were analyzed on the basis of the following mixed linear model: where μ is the total mean, gi is the genetic effect of the ith genotype, lj is the effect of the jth environment, (gl)ij is the interaction effect between the ith genotype and the jth environment (G × E), bk(j) is the random block effect within the jth environment, and eijk is a random error following . Adjusted entry means (AEM) were calculated for each of the 141 entries (ith genotype: gi) with the LSMEANS option of MIXED procedure, and these were used as a dependent variable in the posterior association analysis. AEM (denoted as M) was where μ̂ and ĝ are the generalized least-squares estimates of μ and g, respectively. To estimate AEM for all cultivars at each of the eight environments, g was regarded as fixed and b as random, as proposed by Stich . The Restricted Likelihood Ratio Test (RLRT) was calculated to confirm the heterogeneity of residual variance (across environments) using the GLIMMIX procedure in SAS, according to the following: where MHV and MCV are the models with heterogeneous and common (homogenous) variances, respectively. The asymptotic distribution of the RLRT statistic is Chi-square with p degrees of freedom (), where p is the difference in the number of parameters included in the MHV and MCV models (in this case P = 7). Consequently, error variances were assumed to be heterogeneous among locations, and these were computed using the COVTEST homogeneity option, with RANDOM _residual_ statement and GROUP option in the GLIMMIX procedure (Mora ). Analysis of Deviance (ANODEV) was conducted to evaluate the significance of the effects of the five traits across environments by using the MIXED procedure in SAS (Nelder and Wedderburn 1972). The PROC CORR procedure was used to analyze Pearson correlations among variables by environment since G × E interactions were significant. Broad-sense heritability (h2) for the five traits at each environment was estimated as the proportion of genetic variance () over the total variance (), according to the formula:

Association panel, SNP genotyping and population structure

Cultivars were genotyped for 6,000 single nucleotide polymorphisms (SNPs) using the Illumina BARCSoySNP6K BeadChip, corresponding to a subset of SNPs from the SoySNP50K BeadChip (Song ). Genotyping was conducted by Deoxi Biotechnology Ltda. ® in Aracatuba, Sao Paulo, Brazil. A total of 3,780 SNP markers, including polymorphic and non-redundant SNPs, SNP markers with greater than 10% minor allele frequency (MAF) and missing data values lower than 25% were used for subsequent analysis, with heterozygous markers treated as missing data. Haplotype blocks were constructed using the Solid Spine method implemented in the software Haploview (Barrett ), and have been previously reported by Contreras-Soto (Supplemental Table 2). This method considers that the first and last markers in a block are in strong LD with all intermediate markers, thereby providing more robust block boundaries. A cutoff of 1% was used, meaning that if addition of a SNP to a block resulted in a recombinant allele at a frequency exceeding 1%, then that SNP was not included in the block. Then, these LD blocks were used to conduct the haplotype-based GWAS. A Bayesian model-based method was used to infer population structure using 3,780 SNPs, implemented in the program InStruct (Gao ). Posterior probabilities were estimated using five independent runs of the Markov Chain Monte Carlo (MCMC) sampling algorithm for the numbers of genetically differentiated groups (k) varying from 2 to 10, without prior population information. The MCMC chains were run for a burn-in of 5,000, followed by 50,000 iterations. The convergence of the log likelihood was determined by the value of the Gelman-Rubin statistic. The best estimate of k was determined according to the lowest value of the average log(Likelihood) and Deviance Information Criterion (DIC) values among the simulated groups (Gao ), as defined by Spiegelhalter . where D̄ is a Bayesian measure of model fit that is defined as the posterior expectation of the deviance (D̄ = E/y [−2· ln f (y/θ)]) ; pD is the effective number of parameters, which measures the complexity of the model.

SNP-based GWAS

AEM of each cultivar were used to perform SNP-based and haplotype-based GWAS for SY, PH, SW, DTF and DTM. To consider the effects of population structure and genetic relatedness among the cultivars, the following unified mixed-model (Cappa , Yu ) of association was employed (in matrix form): where y is a vector of adjusted phenotypic observations; α is a vector of SNP effects (fixed); v is a vector of population structure effects (fixed); u is a vector of polygene background effects (random); and ɛ is a vector of residual effects. S, Q and Z are incidence matrices for a, v, and u, respectively. According to Yu , the variances of u and ɛ are and , respectively. K and R are the kinship and residual variance matrices, respectively. This is a structured association model (Q model), which considers the genetic structure of the association panel included in the association mixed model. The kinship coefficient matrix (K) that explains the most likely identity by state of each allele between cultivars was estimated using the program TASSEL (Bradbury , Endelman and Jannink 2012). Mixed linear models with Q and K by themselves and MLM considering Q + K models were also run in TASSEL (Bradbury , Yu ). The Bayesian information criterion (BIC) (Schwarz 1978) was used for model selection, which is defined as: where L is the restricted maximum likelihood for a determined model, p the number of parameters to be estimated in the model, and n the sample size. BIC values were computed using the TASSEL program following Yu .

Haplotype-based GWAS

Haplotype-based GWAS was performed on the basis of LD information. Haplotype-based association mapping was performed by using adjusted phenotypes (y) as the dependent traits and the information of haplotype blocks in the model, as follows: Where 1 is a vector of n ones, with n representing the number of soybean cultivars, H is the incidence matrix of haplotype genotypes for the individuals at the i-th haplotype locus; The element of H(Hij) is equal to the number of the i-th copies of haplotypes-blocks carried by the j-th cultivar. For this analysis u represents the polygenic gene effect or kinship matrix (K) with variance and the residual effects e with variance . A limit of detection (LOD) value higher than 3 was used as the threshold P-value for haplotype-trait associations according to Hwang . Then, only the significant haplotypes were used to estimate the phenotypic variance explained by haplotypes. The percentage of variation explained by the haplotype-based method was calculated using a simple regression performed in TASSEL as follows: Where LR is the Likelihood Ratio; n represents the number of observations (i.e., number of soybean cultivars); logLM and logL0 are the likelihood functions of the reduced and the intercept-only models, respectively (Sun ). The Chi-square test was performed to check phenotypic differences among haplotype blocks using the CONTRAST option of the GENMOD procedure in SAS (SAS Institute, Inc., Cary, NC). Additionally, the genomic regions or SNPs in haplotypes blocks identified in this study were compared to the genomic locations of QTLs previously reported for the traits under study. Genes, QTLs and markers annotated in Glyma1.01 and NCBI RefSeq gene models in SoyBase (www.soybase.org) were used as references.

Results

Phenotypic analysis, heritability and correlation between traits

Analysis of deviance indicated that the effects of genotype (G), environment (E) and their interaction (G × E) were statistically significant (χ2 > 0.01) for all traits under study (Supplemental Table 3). Highly significant differences were observed among traits and environments (Supplemental Figs. 1–5). On average, PH ranged from 38.27 cm (Rio13/14) to 103.45 cm (Cas12/13). SY and SW data ranged from 670.23 kg ha−1 (Rio13/14) to 3319.00 kg ha−1 (Cas14/15) and 11.96 g (Rio13/14) to 15.50 g (Rio14/15), respectively. As expected, DTF and DTM varied widely, ranging from 30 (Pri12/13) to 47 (Cas13/14) days, and 88 (Pri12/13) to 133 (Cas14/15) days, respectively (Table 1). The high phenotypic variability was confirmed by analysis of deviance, which revealed that all traits were severely influenced by environmental factors, showing significant G × E interaction (Supplemental Table 3). Over the eight environments, SY was moderately heritable with a value of 56%, whereas SW, DTM, PH and DTF showed high heritabilities: 81.7%, 91.7%, 93.4% and 94.6%, respectively.
Table 1

Descriptive statistics of phenotypic variation, heritability (h2) across environments and variance components (G and G × E) of seed yield (SY), seed weight (SW), plant height (PH), days to maturity (DTM) and flowering (DTF) of 141 cultivars of soybean evaluated in eight environments

TraitEnvironmentMeanSDMinMaxGG × Eh2 (%)
SY (kg ha−1)Cas12/132457.59820.92806.006563.007506835105556.7
Pri12/131910.82767.17233.004372.00
Cas13/141863.71623.59125.005127.00
Rio13/14670.23305.47128.001780.00
Cas14/153319.001297.25176.007149.00
Pal14/151442.93667.79299.003669.00
Rio14/151559.18814.61136.004284.00
Sorr14/151775.69800.04152.004916.00
Mean

SW (100seed gr)Cas12/1312.082.317.9025.501.502.0581.7
Pri12/1312.591.999.0025.80
Cas13/1413.492.267.9025.00
Rio13/1411.961.698.2023.90
Cas14/1512.333.156.3019.40
Pal14/1512.301.927.6018.40
Rio14/1515.501.9310.3021.00
Sorr14/1514.781.9610.1021.80
Mean

PH (cm)Cas12/13103.4519.8955.00220.00209.30101.8393.4
Pri12/1348.3611.9820.0090.00
Cas13/1497.5919.5445.00205.00
Rio13/1438.2711.5920.0075.00
Cas14/1590.3424.4530.00180.00
Pal14/1574.2523.0430.00130.00
Rio14/1546.1713.7220.0095.00
Sorr14/1555.5218.5023.00100.00
Mean

DTF (days)Cas12/1346.1610.3828.0080.0044.6318.4294.6
Pri12/1330.295.9024.0052.00
Cas13/1447.759.4129.0082.00
Rio13/1440.497.3128.0077.00
Cas14/1546.5810.8526.0076.00
Pal14/1546.766.8932.0070.00
Rio14/1537.397.2624.0054.00
Sorr14/1531.426.0925.0046.00
Mean
Cas12/13126.3315.39104.00256.0082.0053.5791.7
Pri12/1388.839.8440.00172.00
Cas13/14124.8915.6697.00248.00

DTM (days)Rio13/1499.0213.7182.00182.00
Cas14/15133.8910.59106.00164.00
Pal14/15119.596.48106.00138.00
Rio14/15104.989.3880.00123.00
Sorr14/1598.275.5675.00123.00
Mean

G × E = Genotype × Environment interaction.

G = Genotype.

Analysis of phenotypic correlation was conducted by environment since residual heterogeneity was observed among the environments and the G × E interaction was significant for all traits. In most of the environments, significant and positive phenotypic correlations were observed between SY and SW, with correlation coefficients ranging from 0.15 (Pri12/13; P-value < 10−2) to 0.58 (Cas14/15; P-value < 10−4), and with no correlation between SY and SW in Pal14/15 and Sorr14/15. SY and SW showed different patterns of phenotypic correlation with DTF and DTM across environments. The same was observed among SY and SW with PH. In most of the environments, PH and SW showed negative correlations, although non-significant at the 0.05 level. However, PH, DTF and DTM were low to highly positively correlated traits, and statistically different of zero (P-value < 10−4), with correlation coefficients ranging from 0.13 for PH and DTF at Rio14/15 (P-value < 10−2) to 0.84 for DTM and DTF at Rio13/14 (P-value < 10−4) (Supplemental Table 4).

Genome-wide association across environments and traits

According to the deviance information criterion (from the posterior Bayesian clustering analysis), the most probable number of subpopulations was nine (Supplemental Fig. 6). The results based on Bayesian information criterion (BIC) consistently showed a better fit for the Q + K model over either Q or K alone (Supplemental Table 5). In total, 33, 29, 57, 72 and 40 linkage disequilibrium blocks were significantly associated with SY, SW, PH, DTM and DTF, respectively (Tables 2–6, Supplemental Tables 6–10). The haplotypes blocks explained considerable phenotypic variation: 17.6% to 96.8%, 13.6% to 33.2%, 45.2% to 99.4%, 12.7% to 59.9% and 12.9% to 42.7% for SY, SW, PH, DTM and DTF, respectively (Tables 2–6).
Table 2

Haplotype block associated with seed yield in 141 cultivars of tropical soybean

EnvPosition (bp)SNHap_IDHapAHFSYaR2 (%)Nearby genes or QTLs

ChrStartEnd
Cas13/14938523430389066603Gm9_Hap22aCCC292126.2a21.3DNA-binding proteinRHL1-like
Gm9_Hap22bCTC31861.8ab
Gm9_Hap22cTTC611761.7b
Gm9_Hap22dTTT201717.9b

Cas14/1512562221060522894Gm12_Hap12aTAAC553509.0a41.4uncharacterizedLOC102667945
Gm12_Hap12bTAAT373354.1a
Gm12_Hap12cCGGT282323.0b

Pal14/151944965128453705946Gm19_Hap42aAATxAA341815.1a96.8Beta-fructofuranosidase insoluble isoenzyme 1-like
Gm19_Hap42bGCCGGG881219.3a
Gm19_Hap42cACCGGG2374.2b
Gm19_Hap42dAATGAA

Pal14/1510396267343601826Gm10_Hap8aTATxTA161999.6a17.6uncharacterizedLOC100499780
Gm10_Hap8bCCGCTA81522.3b
Gm10_Hap8cCCGCCG301516.2bc
Gm10_Hap8dTCTxTA341227.9bcd
Gm10_Hap8eCCGCCA221162.2bcd
Gm10_Hap8fTCTCTA
Gm10_Hap8gTATCTA
Gm10_Hap8hTCGCTA3

Pal14/151945478438456430733Gm19_Hap43aATA311848.2a50.6Intergenic
Gm19_Hap43bACG21113.0b
Gm19_Hap43cGCG891112.9b
Gm19_Hap43dGTA2

Pal14/1511446264548061735Gm11_Hap11aCCxAA311699.9a45.6Probable 125 kDa kinesin-related protein-like
Gm11_Hap11bTATCA61548.8ab
Gm11_Hap11cCCTAC211144.4bc
Gm11_Hap11dTATAA101060.8bc

Sorr14/155562171457944603Gm5_Hap7aCAC182027.8a18.7uncharacterizedLOC100818074
Gm5_Hap7bCGT211956.5a
Gm5_Hap7cTAT781743.5a

Sorr14/1515562171457944603Gm15_Hap11aTCC72024.1a30.1uncharacterizedLOC100785341
Gm15_Hap11bCCC921898.9ab
Gm15_Hap11cTTA271510.8b

Env: Environment; Chr: Chromosome; SN: Number of SNPs by haplotype; Hap_ID: Haplotype ID; HapA: Allelic haplotypes; HF: Haplotype frequency; SY: mean for seed yield (kg*ha−1) of haplotypes at each environment.

= Different letter means statistical differences.

Table 3

Haplotype block associated with 100-seed weight in 141 cultivars of tropical soybean

EnvPosition (pb)SNHap_IDHapAHFSWaR2 (%)Nearby genes or QTLs

ChrStartEnd
Pri12/1311487588049714522Gm11_Hap12aCT1914.0a27.6Probable Xaa-Pro aminopeptidase P-like
Gm11_Hap12bTC6112.5b
Gm11_Hap12cCC1411.9b
Gm11_Hap12dTT2211.4b

Pri12/1311507472052482572Gm11_Hap13aAA6511.9a13.6Syntaxin-112 like
Gm11_Hap13bGA1812.1a

Cas13/1411507472052482572Gm11_Hap12aGA1813.4a15.7Syntaxin-112 like
Gm11_Hap12bAA6512.9a

Pal14/151332225680323476963Gm13_Hap41aGAC5113.0a33.2Glyma13g205300
Gm13_Hap41bGGT3112.5bGlyma13g207600
Gm13_Hap41cAAT2811.1bGlyma13g207900

Pal14/151331956416321544614Gm13_Hap42aGTAG714.0a22.3Glyma13g209500
Gm13_Hap42bGCGG2312.7a
Gm13_Hap42cGTGG4412.7a
Gm13_Hap42dGTAA612.6a
Gm13_Hap42eATAA2510.9ab
Gm13_Hap42fACAG110.3ab

Rio14/15942458021427907384Gm9_Hap27aTTTA1816.1a20.2Auxin-responsive protein IAA8-like
Gm9_Hap27bGCTA7615.8a
Gm9_Hap27cGCCG3114.3b

Sorr14/152854438088194944Gm2_Hap22aAATG416.3a17.9Auxilin-like protein 1-like
Gm2_Hap22bACCA2415.9ab
Gm2_Hap22cAATA1514.7bc
Gm2_Hap22dGCTG7414.5bc

Sorr14/1511530340158002174Gm11_Hap14aCATC2116.4a23.1Intergenic
Gm11_Hap14bTCTC4514.5b
Gm11_Hap14cTATT2414.5b
Gm11_Hap14dTCCC1214.2b
Gm11_Hap14eTCTT714.1b

Env: Environment; Chr: Chromosome; SN: Number of SNPs by haplotype; Hap_ID: Haplotype ID; HapA: Allelic haplotypes; HF: Haplotype frequency; SW: mean for 100-seed weight (g/100seed) of haplotypes at each environment.

= Different letter means statistical differences.

Table 4

Haplotype block associated with plant height in 141 cultivars of tropical soybean

EnvPosition (bp)NSHap_IDHapAHFPHaR2 (%)Nearby genes or QTLs

ChrStartEnd
Cas12/131945478438456430733Gm19_Hap43dGTA2121.3a52.0Intergenic
Gm19_Hap43aATA31115.4a
Gm19_Hap43cGCG8998.9a
Gm19_Hap43bACG290.0a

Cas12/131944965128453705946Gm19_Hap42aAATxAA31114.7a99.1Sd yld 11-6 Pl ht 4-2Pl ht 13-8Dt1 gene (GmTFL1)Sat_286*
Gm19_Hap42bGCCGGG8899.2b
Gm19_Hap42cACCGGG283.8b

Cas12/131336964799370507363Gm13_Hap53aGGA8120.6a23.9uncharacterizedLOC102670348
Gm13_Hap53bGGG45102.4ab
Gm13_Hap53cAAA53100.9b
Gm13_Hap53dGAA491.3bc

Pri12/131945478438456430733Gm19_Hap43dGTA255.0a63.4Intergenic
Gm19_Hap43aATA3153.9a
Gm19_Hap43cGCG8946.7a
Gm19_Hap43bACG243.8a

Pri12/1314802776185276216Gm14_Hap21aCGGGTA363.8a46.1Intergenic
Gm14_Hap21bCGGGGA2854.3a
Gm14_Hap21cTTTAGA1649.5ab
Gm14_Hap21dCGTATA748.2ab
Gm14_Hap21eTTTATA3947.9b
Gm14_Hap21fTTTAGG1344.6b
Gm14_Hap21gCGTAGA1

Cas13/141945478438456430733Gm19_Hap43aATA31109.3a51.6Intergenic
Gm19_Hap43dGTA2106.3a
Gm19_Hap43cGCG8993.5b
Gm19_Hap43bACG283.8b

Cas13/141944965128453705946Gm19_Hap42aAATxAA31108.8a99.1*
Gm19_Hap42bGCCGGG8893.7b
Gm19_Hap42cACCGGG275.0b

Cas14/151944965128453705946Gm19_Hap42aAATxAA31103.8a99.1*
Gm19_Hap42bGCCGGG8884.6b
Gm19_Hap42cACCGGG263.8b

Cas14/151945478438456430733Gm19_Hap43dGTA2111.3a55.5Intergenic
Gm19_Hap43aATA31105.3a
Gm19_Hap43cGCG8984.4b
Gm19_Hap43bACG282.5b

Pal14/151944965128453705946Gm19_Hap42aAATxAA3194.8a99.4*
Gm19_Hap42bGCCGGG8863.2b
Gm19_Hap42cACCGGG232.6b

Pal14/151945478438456430733Gm19_Hap43aATA3194.2a79.0Intergenic
Gm19_Hap43dACG269.0ab
Gm19_Hap43cGCG8963.2b
Gm19_Hap43dGTA2

Rio14/151945478438456430733Gm19_Hap43dGTA266.3a64.7Intergenic
Gm19_Hap43aATA3155.8b
Gm19_Hap43cGCG8941.4b
Gm19_Hap43bACG233.8bc

Rio14/151944965128453705946Gm19_Hap42aAATxAA3154.7a98.2*
Gm19_Hap42cACCGGG243.8ab
Gm19_Hap42bGCCGGG8841.2b

Sorr14/151945478438456430733Gm19_Hap43dGTA268.9a45.2Intergenic
Gm19_Hap43aATA3168.1a
Gm19_Hap43cGCG8950.5ab
Gm19_Hap43bACG244.5ab

Sorr14/151944965128453705946Gm19_Hap42aAATxAA3167.1a69.1*
Gm19_Hap42cACCGGG257.5ab
Gm19_Hap42bGCCGGG8850.4b

Env: Environment; Chr: Chromosome; SN: Number of SNPs by haplotype; Hap_ID: Haplotype ID; HapA: Allelic haplotypes; HF: Haplotype frequency; PH: mean for plant height (cm) of haplotypes at each environment.

= Different letter means statistical differences.

Table 5

Haplotype block associated with days to maturity in 141 cultivars of tropical soybean

EnvPosition (bp)NSHap_IDHapAHFDTMaR2 (%)Nearby genes or QTLs

ChrStartEnd
Cas12/132045458003458577616Gm20_Hap32aGGGGGC1150.5a59.9LOC100789709 splicing factor U2AF-associated protein 2-like
Gm20_Hap32bGGGGAA1140.5b
Gm20_Hap32cGAGGGA8132.9b
Gm20_Hap32dGGGGGA22126.8b
Gm20_Hap32eGAGGGC67126.2b
Gm20_Hap32fAAAAAA19116.3bc

Pri12/135562171457944603Gm5_Hap7aCGT2198.7a17.4Intergenic
Gm5_Hap7bTAT7887.4b
Gm5_Hap7cCAC1887.1b

Pri12/139602776360790422Gm9_Hap13aCA1494.7a15.5Intergenic
Gm9_Hap13bTA5387.6b
Gm9_Hap13cCC5087.0b

Pri12/13943818290441048103Gm9_Hap30aCCA1089.3a18.2Transcription initiation factor TFIID subunit 1-like
Gm9_Hap30bCTA5987.5a
Gm9_Hap30cCCG1687.4a
Gm9_Hap30dTTA3387.2a

Pri12/13149910518502063475Gm1_Hap17aTxATA1595.5a53.4Intergenic
Gm1_Hap17bGGGGC888.9ab
Gm1_Hap17cTGGGC8387.4ab

Pri12/13445298627454352982Gm4_Hap25aCC4789.2a21.3Intergenic
Gm4_Hap25bCT1689.1a
Gm4_Hap25cTT4784.8b

Pri12/1311736858074057142Gm11_Hap18aTA3789.3a14.1Intergenic
Gm11_Hap18bCx9487.2a

Pri12/135244098429114456Gm5_Hap2aGGAAAA391.3a19.8LOC100813996 transportin-3-like
Gm5_Hap2bGGGGAC4189.6a
Gm5_Hap2cGGGAGC1187.8a
Gm5_Hap2dTAAAAA5787.3a
Gm5_Hap2eGAGAGC1

Pri12/13283179510336384Gm2_Hap3aTAAT1795.8a19.2ATG8i protein
Gm2_Hap3bCGAC1389.2a
Gm2_Hap3cTGAT3188.2a
Gm2_Hap3dCGCT1987.9a
Gm2_Hap3eCGCC1986.9a
Gm2_Hap3fCGAT785.4a

Pri12/13943003730433153383Gm9_Hap28aTTA1491.9a18.2uncharacterized LOC100793859
Gm9_Hap28bCCG9087.8ab
Gm9_Hap28cCTA387.6ab
Gm9_Hap28dTTG687.6ab
Gm9_Hap28eTCG1184.6b

Cas13/14716625092169795865Gm7_Hap33aATGTT22127.7a20.1Intergenic
Gm7_Hap33bGCACT45123.7a
Gm7_Hap33cGCACC54122.5a

Cas13/14941903227423700937Gm9_Hap26aGxTTCTA55126.2a31.1Intergenic
Gm9_Hap26bAATTTTA29124.9ab
Gm9_Hap26cGAxGCCC11120.3ab
Gm9_Hap26dGAxGCTC15113.8b

Cas14/15240565506408134664Gm2_Hap48aCAAT14141.1a21.8uncharacterized LOC100819417
Gm2_Hap48bCGGC88136.1b
Gm2_Hap48cCGAT5127.9bc
Gm2_Hap48dAAAT13121.8bc

Pal14/15213674975141615583Gm2_Hap33aAAC5123.8a21.8Intergenic
Gm2_Hap33bACC10120.6a
Gm2_Hap33cACA12119.9a
Gm2_Hap33dGCA55119.9a
Gm2_Hap33eGAC1119.0a
Gm2_Hap33fAAA22116.3a

Rio14/151630267608305194265Gm16_Hap26aGGGCG111106.6a34.1Intergenic
Gm16_Hap26bAATAA1897.7b

Gm9_Hap19aAC32111.9a
Rio14/15932388671326952422Gm9_Hap19bGC47107.5b12.7Intergenic
Gm9_Hap19cAT3599.1c

Rio14/1519732245473585322Gm19_Hap10aGG61109.8a23.9Intergenic
Gm19_Hap10bAG11108.0a
Gm19_Hap10cAA4898.9b

Rio14/1519811519884365293Gm19_Hap11aGCT10109.8a27.5Intergenic
Gm19_Hap11bGCC61109.7a
Gm19_Hap11cTTC18102.0b
Gm19_Hap11dTTT2597.3b

Rio14/15447740685482223932Gm4_Hap31aCA2109.5a13.8Intergenic
Gm4_Hap31bCG109107.6a
Gm4_Hap31cTA1698.4b

Env: Environment; Chr: Chromosome; SN: Number of SNPs by haplotype; Hap_ID: Haplotype ID; HapA: Allelic haplotypes; HF: Haplotype frequency; DTM: mean for days to maturity (days) of haplotypes at each environment.

= Different letter means statistical differences.

Table 6

Haplotype block associated with days to flowering in 141 cultivars of tropical soybean

EnvPosition (bp)NSHap_IDHapAHFDTFaR2 (%)Nearby genes or QTLs

ChrStartEnd
Cas12/1312562221060522894Gm12_Hap12cCGGT2853.9a34.6uncharacterizedLOC102667945*
Gm12_Hap12aTAAC5545.6b
Gm12_Hap12bTAAT3743.7b

Cas13/1412562221060522894Gm12_Hap12cCGGT2854.9a41.9*
Gm12_Hap12aTAAC5546.7b
Gm12_Hap12bTAAT3745.8b

Cas13/1417879492790081734Gm17_Hap10aGCCG6751.6a38.7Intergenic
Gm17_Hap10bAATA4842.1b

Cas13/141549446994495212492Gm15_Hap45aCC6152.8a18.5LOC100804065 cysteine synthase-like
Gm15_Hap45bAC549.0ab
Gm15_Hap45cCT245.5ab
Gm15_Hap45dAT6143.8b

Rio13/141214306367147759305Gm12_Hap21aTTCAT4043.2a42.7Intergenic
Gm12_Hap21bCCTGG7939.5b

Rio13/149615581064700914Gm9_Hap14aGGCA1946.5a26.5Intergenic
Gm9_Hap14bGACA3940.0a
Gm9_Hap14cAATG4338.9a
Gm9_Hap14dAACA738.0a
Gm9_Hap14eAATA735.6b

Cas14/15650711282509364495Gm6_Hap52aTTGCG856.6a26.1Intergenic
Gm6_Hap52bCTGCG2049.8ab
Gm6_Hap52cTGGCG1048.6ab
Gm6_Hap52dCGGCG3948.6ab
Gm6_Hap52eTTGTA647.4ab
Gm6_Hap52fTTATA2640.1b

Cas14/151238680709389709002Gm12_Hap35aAG562.2a12.9LOC102660802micronuclear linker histone polyprotein-like
Gm12_Hap35bAT6548.4b
Gm12_Hap35cCG845.9bc
Gm12_Hap35dCT4343.9c

Cas14/152041883051422975774Gm20_Hap27aGCGG1152.7a32.9Intergenic
Gm20_Hap27bACGG1950.2a
Gm20_Hap27cATTA9145.9a

Rio14/15241787747420880454Gm2_Hap51aGGCG1142.6a18.1APO protein 3, mitochondrial-like**
Gm2_Hap51bGATA4441.6a
Gm2_Hap51cTGTA634.0b
Gm2_Hap51dTATA5833.9b

Sorr14/15241787747420880454Gm2_Hap51aGGCG1136.2a19.8**
Gm2_Hap51bGATA4434.4a
Gm2_Hap51dTATA5828.8b
Gm2_Hap51cTGTA628.6b

Env: Environment; Chr: Chromosome; SN: Number of SNPs by haplotype; Hap_ID: Haplotype ID; HapA: Allelic haplotypes; HF: Haplotype frequency; DTF: mean for days to flowering (days) of haplotypes at each environment.

= Different letter means statistical differences.

For SY, thirty-three haplotype blocks were effectively associated across environments. These haplotypes were identified on chromosomes 5, 9, 10, 11, 12, 15 and 19, and showed uncharacterized gene annotation or were located in intergenic regions (Table 2). The haplotype region Gm12_Hap12 encompasses a genomic region of 420 kb and contain the satt568 and satt442 markers, which are related to the seed protein 28-2 and 28-3 QTLs, respectively (Liang , Yang ) (Fig. 1). Interestingly, QTLs related to reproductive stage and pod maturity were located near this haplotype. The SSR marker satt192, related to seed glycitein 9-7, was found within Gm12_Hap12. The SNPs located at this chromosomal location were confirmed as an exclusive haplotype region because they were associated with seed yield at Cas14/15, which correspond to Cascavel in the growing season 2014/15.
Fig. 1

Candidate region for major-effect loci: ss715613192 ss715613203, ss715613207 and ss715613219 on Gm12_Hap12 associated with SY, DTF and pod maturity in soybean. In the top panel, the QTLs and the proposed genomic region Glyma12g075600 annotated as a double-stranded RNA-binding protein 2-like, which encodes a Ribonuclease III protein (BT097697). Glyma12g075700 is another gene close to this haplotype region that encodes a senescence regulator protein in soybean. SSR markers related to seed protein and glycitein, pod maturity and reproductive stage. The bottom panel depicts a haplotype region of 412 kb associated with SY (intensity of black color indicates the r2, and higher intensity means higher r2).

In Cascavel (Cas14/15), the haplotypes Gm12_Hap12a and Gm12_Hap12b were significantly different than haplotype Gm12_Hap12c. On average, Gm12_Hap12a and Gm12_Hap12b produced 3509.0 kg ha−1 and 3354.1 kg ha−1, while haplotype Gm12_Hap12c yielded 2323.0 kg ha−1, 34% and 31% lower than the haplotypes Gm12_Hap12a and Gm12_Hap12b, respectively. These haplotypes were well distributed in our association mapping panel (TAAT 32%, TAAC 45% and CGGT 23%) (Table 2). For SW, twenty-nine haplotype blocks were significantly associated across environments and chromosomes (Table 3). Particularly, the Gm13_Hap41 was associated with one QTL related to SW, seed weight 40-1 (Rossi ), and two QTLs for Pod maturity 20-1 and Lodging 27-6 (Li , Rossi ) (Fig. 2).
Fig. 2

Candidate region for major-effect loci: ss715615228, ss715615235, ss715615249 and ss715615257 located on haplotype Gm13_Hap41 and loci ss715615266, ss715615278 and ss715615281 located on haplotype Gm13_Hap42. Gm13_Hap41 was associated with SW, lodging and pod maturity in soybean. In the top panel, the QTLs and the proposed genomic regions are Glyma13g205300, Glyma13g207600 and Glyma13g207900, which encode an unknown protein, a nuclear transcription factor Y (subunit Gamma), and a dihydroxy-acid dehydratase, respectively. Additionally, Gm13_Hap42 were associated with SW and was identified as annotated gene Glyma13g209500, which encodes a 60S ribosomal protein. The bottom panel depicts haplotype regions of 197 kb and 122 kb associated with the aforementioned traits (intensity of black color indicates the r2, and higher intensity means higher r2).

Most of the SNPs effectively associated with PH across environments were located on chromosome 19, including haplotype regions Gm19_Hap42 and Gm19_Hap43. These haplotypes were consistent across all environments. Gm19_Hap42 is a region containing the Determinate stem 1 gene (Dt1 or GmTFL1) (Cober ), found 18.6 kb upstream of the peak SNP ss715635425, which has been previously associated with PH and days to maturity in soybean (Contreras-Soto , Zhang ). In addition, other yield QTLs have previously been identified in this region, including seed yield 11-6, and plant height 13-8 and 4-2 (Lee , Specht ) (Table 4, Fig. 3). Therefore, this QTL region should be considered as a relevant QTL responsible for PH.
Fig. 3

Candidate region for major-effect loci: ss715635403, ss715635425, ss715635433, ss715635454 and ss715635468 located on Gm19_Hap42 and loci ss715635494, ss715635506 and ss715635520 located on Gm19_Hap43. Gm19_Hap42 was associated with PH, SY and SCN in soybean. In the top panel, the QTLs and the putative genomic region are Glyma19g37890 (Dt1 or GmFLT1), which determines stem growth habit in soybean, Glyma19g194500, which encodes an abscisic acid-insensitive protein, Glyma19g38160, which encodes a beta-fructofuranosidase isoenzyme and Glyma19g196000, which encodes a spindly related enzyme. The bottom panel depicts haplotype regions of 494 kb (Gm19_Hap42) and 163 kb (Gm19_Hap43) associated with the aforementioned traits (intensity of black color indicates the r2, and higher intensity means higher r2).

For PH, interesting or discriminant haplotypes were located in our association mapping panel, i.e., the haplotype Gm19_Hap43c (GCG), which was associated in most of the environments and showed significant differences with the haplotype responsible for tallest plants (Gm19_Hap43d) in Cas13/14 and Cas14/15. On average, soybean plants with this haplotype showed heights of 93.5 and 84.4 cm of height in Cas13/14 and Cas14/15, respectively, and represented 72% of the total panel (Table 4). However, in Pal14/15, the haplotype Gm19_Hap43a (ATA) was significantly different than the haplotype responsible for smaller plants (Gm19_Hap43c), and consequently produced higher seed yield plants with significant differences among the others haplotypes (Gm19_Hap43a =1848.2 kg ha−1, yielding 28% more than the mean of Pal14/15) (Tables 2, 4). The haplotype Gm19_Hap42a should differentiate indeterminate growth type in soybean cultivars, whereas Gm19_Hap42b should differentiate determinate soybean cultivars. Gm19_Hap42b showed significant differences with the haplotype responsible for the tallest plants (Gm19_Hap42a) at environments Cas12/13, Cas13/14, Cas14/15, Pal14/15, Rio14/15 and Sorr14/15 (Table 4). Interestingly, in Pal14/15 for SY, this haplotype was not significantly different from the plants that yielded more (Table 2). For DTM, seventy-two haplotypes were associated across six environments. Of these, forty-two were located on intergenic regions and did not contain putative genes related to DTM. Specifically, some yield loci have previously been associated at the haplotype genomic region Gm20_Hap32: seed yield 12-3 and 15-15, plant height 14-1 and 26-15, and seed weight 36-5 (Han , Kabelka , Sun , Yuan ) (Table 5, Fig. 4).
Fig. 4

Candidate region for major-effect loci: ss715638618, ss715638624, ss715638629, ss715638643, ss715638650 and ss715638667 are located on Gm20_Hap32 and associated with DTM. Additionally, QTLs for SY, SW and PH were identified. In the top panel, the QTLs and the proposed genomic regions include Glyma20g218800, Glyma20g220600, Glyma20g221200, Glyma20g222500, Glyma20g222000 and Glyma20g224000 which encode a splicing factor U2AF-associated protein, beta catenin-related armadillo repeat-containing, GDSL Esterase/Lipase, serine/threonine-protein phosphatase PP1 isozyme 2-related, At-hook motif nuclear-localized protein 19-related and Trihelix transcription factor GTL2, respectively. The bottom panel depicts a haplotype region of 399 kb associated with the aforementioned traits (intensity of black color indicates the r2, and higher intensity means higher r2).

For DTF, forty haplotypes were associated across six environments. Most of these were located on intergenic regions of different chromosomes and showed no relationship with genes or markers. The haplotype Gm12_Hap12 was associated with DTF in the environments Cas12/13 and Cas13/14, and, interestingly, the same haplotype was associated with SY (Tables 2, 6). Particularly, for DTF and SY, the haplotypes Gm12_Hap12a (TAAC) and Gm12_Hap12b (TAAT) showed significant differences with Gm12_Hap12c (54 and 55 days, respectively). In fact, these haplotypes showed the lowest days to flowering (precocity) (46 and 47 days, and 44 and 46 days in Cas12/13 and Cas13/14, respectively) and the highest yielding plants when compared with Gm12_Hap12c (Table 6).

Discussion

Phenotypic variation and correlation between traits

The heritability values observed in our panel indicate that much of the phenotypic variation was genetic. Heritability for SY (56%) was moderately high but smaller compared to Kim (66%) and similar to Fox (59%). On the other hand, the heritabilities for SW, PH, DTM and DTF were high and similar to those estimated by Hao for SW and Zhang for PH, DTM and DTF. SY had a positive significant correlation with SW at six of the eight environments. Previous reports have also shown a significant positive correlation for SY and SW in soybean (Hao , Recker ). For SY and PH, more positive than negative phenotypic correlations were observed. In addition, as suggested by Zhang , the results based on multiple environments indicate that PH is a key factor for yield. On the other hand, most of the environments showed negative phenotypic correlations between SW and PH. However, Recker showed a significant positive phenotypic correlation between these traits. At the moment, it is difficult to identify the potential relationship between these traits. Our results confirmed the inconsistent pattern of observed phenotypic correlation between seed yield and other important agronomic traits in soybean (Kim ). The correlation among flowering-related traits with PH revealed the high phenotypic correlations between PH, DTM and DTF across multiple environments, suggesting close relationships among these traits.

Haplotype by environment interaction

The present study showed that some haplotype associations were location and year specific; however, stable haplotypes across environments was also found. According to Palomeque , QTLs for a specific trait are not always stable across environments and/or genetic backgrounds. The lack of validation in a different genetic background across environments could imply that these QTLs were not stable or that epistatic effects could be influencing the results. Another possibility is the presence of QTL by environment interactions, which represents a major challenge in genetic determinants of complex traits. On the other hand, for plant height, strong and consistent genomic regions within haplotypes across environments were identified (i.e., Gm19_Hap42; Gm19_Hap43). For example, in Cascavel environments, the same haplotype region (Gm19_Hap42) was associated with plant height in the 2012/13, 2013/14 and 2014/15 growing seasons (Cas12/13, Cas13/14 and Cas14/15), and explained most phenotypic variation (99.14%). Specifically, the haplotypes Gm19_Hap42a (AATxAA) and Gm19_Hap42b (GCCGGG) may help in marker-assisted selection of indeterminate and determinate growth habit soybean cultivars, respectively. QTLs controlling plant height are spread over all 20 chromosomes (Soybase 2016); however, this QTL region could be considered a relevant QTL responsible for PH (Contreras-Soto ). In fact, Zhang previously reported this region as associated with PH and DTM in soybean. In soybean, stem growth habit is regulated by an epistatic interaction between two genes, Dt1 and Dt2 (Bernard 1972). The present study reported the haplotype association with the Dt1 gene (Table 4, Fig. 3), which maintains the indeterminate growth habit (dt1dt1 plants are fully determinate); however was not identified the Dt2 gene, which in the presence of Dt1, produces semideterminate plants. Additionally, our study reported a seed yield QTL in this region. As plant height is one of the major factors determining yield potential in soybean, Gm19_Hap42 (with its large effect on plant height) may also affect soybean yield substantially, as previously reported by Zhang . In addition, Kato suggest that the indeterminate growth habit is an advantageous characteristic in breeding for high yield of early maturing soybean varieties. The present study identified the Dt1 gene associated to plant height (indeterminate plants) and co-associated to yield QTL, suggesting that this genomic region would also be responsible for high yield in soybean; however fine-mapping and cross-validation of the genes localized near of this haplotype should be performed.

Co-associated haplotype genomic regions among yield and flowering traits

For several traits, some molecular markers located at candidate genomic regions were co-localized on the same haplotype block. The co-association of a single gene or two linked genes to multiple traits that are phenotypically related has been previously reported (Sun ). On Chromosome 19 (haplotype Gm19_Hap42), four QTL regions for plant height, seed yield, SCN (soybean cyst nematode) and terminal flower harbored three genes related to TERMINAL FLOWER 1 (TFL1), Basic leucine zipper (bZIP) transcription factor family protein and a beta-fructofuranosidase insoluble isoenzyme 1-like. TFL1 is an ortholog of the Antirrhinum CENTRORADIALIS (CEN) and acts as a floral repressor by preventing the expression of LFY and AP1 (Bradley , Liu ). This gene corresponds to the Dt1 locus, which controls soybean growth habit (Tian ) and has been designated GmTFL1 (Glyma19g37890). GmTFL1 transcripts have been shown to accumulate in shoot apical meristems during early vegetative growth in both determinate and indeterminate growth habit soybeans; however, GmTFL1 transcripts are abruptly lost after flowering in determinate lines while remaining in indeterminate ones (Liu ). Consequently, this generates the difference of main stem nodes and flowering periods between indeterminate and determinate plants. Additionally, on the same haplotype region, the SSR Sat_286 has been identified and has exhibited a high accuracy in discrimination tests for growth habit in soybean (Vicente ). The LOC100789709 gene on chromosome 20 (Gm20_Hap32), described as a splicing factor U2AF-associated protein, was related to DTM in soybean. This gene is a homolog of atU2AF in Arabidopsis thaliana. Wang and Brendel (2006) demonstrated that altered expression levels of atU2AF35a or atU2AF35b causes pleiotropic phenotypes in flowering time, leaf morphology, flower, and silique shape in A. thaliana; specifically, pleiotropic phenotypes have been observed in mutants and transgenic lines. Homozygous atU2AF35a T-DNA insertion plants and atU2AF35b transgenic plants showed late flowering under both long and short day conditions. In fact, the altered expression of this gene may also affect days to flowering and maturity in soybean, confirming the haplotype association with this latter trait. Additionally, in this candidate region, some loci controlling grain yield have previously been associated: seed yield 12-3 and 15-15, plant height 14-1 and 26-15, and seed weight 36-5 (Han , Kabelka , Sun , Yuan ). These results suggest that the morphological correlations between yield components and time to flowering and maturity traits are related on a genetic basis, suggesting gene pleiotropy and high rates of linkage disequilibrium (Chen and Lubberstedt 2010). On chromosome 12 the haplotype Gm12_Hap12 was significantly associated with SY, DTF and DTM traits in all environments under study. This result may suggest that this region contains a single gene that has pleiotropic effects and is tightly linked with multiple genes. Recker evaluated multiple environments to show that SY and DTM are positively correlated, while SY was not significantly correlated with DTF. In the present study, variable correlation results were obtained at individual environments, e.g., for SY and DTM: r = −0.65 (at Cas14/15-Cascavel) to r = 0.39 and 0.26 (at Pri12/13-Primavera do Leste and Sorr14/15-Sorriso, respectively); For SY and DTF: r = −0.44 (at Cas12/13-Cascavel) to r = 0.46 (at Rio14/15-Rio Verde). As such, these results should be interpreted at the environment level considering that these traits exhibit QTL-by-environment interactions. In Cascavel, the haplotype Gm12_Hap12 should be used to improve yield and precocity in the current soybean program. Specifically, the haplotypes Gm12_Hap12a and Gm12_Hap12b showed significant differences from Gm12_Hap12c for DTF and SY. In fact, these haplotypes showed the lowest days to flowering (precocity) (46 and 47 days, and 44 and 46 days, respectively) and the highest yield plants when compared with Gm12_Hap12c. Furthermore, the fine mapping of such regions could help to discern the specific genetic elements controlling these traits. For instance, in this genomic region, two annotated (candidate) genes were identified (Glyma12g075600 and Glyma12g075700), which, in fact, should be validated. Finally, the results of this study suggest that the BARCSoySNP6K BeadChip and haplotype-based genome-wide association are valuable sources of information for discovering genomic regions that control quantitative traits in soybean. This research identified useful associated markers that have not been previously reported and that were detected in multiple environments. This will facilitate assessing and validating causal genetic variation of complex quantitative traits and may eventually be used to accelerate the optimization of molecular breeding. However, as with any molecular markers, we emphasize that the identified haplotypes should be validated before large-scale use.
  33 in total

Review 1.  On selecting markers for association studies: patterns of linkage disequilibrium between two and three diallelic loci.

Authors:  Chad Garner; Montgomery Slatkin
Journal:  Genet Epidemiol       Date:  2003-01       Impact factor: 2.135

2.  Model-based inference of haplotype block variation.

Authors:  Gideon Greenspan; Dan Geiger
Journal:  J Comput Biol       Date:  2004       Impact factor: 1.479

Review 3.  Molecular basis of trait correlations.

Authors:  Yongsheng Chen; Thomas Lübberstedt
Journal:  Trends Plant Sci       Date:  2010-06-09       Impact factor: 18.313

4.  TASSEL: software for association mapping of complex traits in diverse samples.

Authors:  Peter J Bradbury; Zhiwu Zhang; Dallas E Kroon; Terry M Casstevens; Yogesh Ramdoss; Edward S Buckler
Journal:  Bioinformatics       Date:  2007-06-22       Impact factor: 6.937

5.  Variation explained in mixed-model association mapping.

Authors:  G Sun; C Zhu; M H Kramer; S-S Yang; W Song; H-P Piepho; J Yu
Journal:  Heredity (Edinb)       Date:  2010-02-10       Impact factor: 3.821

6.  Control of inflorescence architecture in Antirrhinum.

Authors:  D Bradley; R Carpenter; L Copsey; C Vincent; S Rothstein; E Coen
Journal:  Nature       Date:  1996-02-29       Impact factor: 49.962

7.  QTLs for resistance to Phomopsis seed decay are associated with days to maturity in soybean (Glycine max).

Authors:  Suli Sun; Moon Young Kim; Kyujung Van; Yin-Won Lee; Baodu Li; Suk-Ha Lee
Journal:  Theor Appl Genet       Date:  2013-05-24       Impact factor: 5.699

8.  Genetic basis of soybean adaptation to North American vs. Asian mega-environments in two independent populations from Canadian × Chinese crosses.

Authors:  M Eugenia Rossi; James H Orf; Li-Jun Liu; Zhimin Dong; Istvan Rajcan
Journal:  Theor Appl Genet       Date:  2013-04-18       Impact factor: 5.699

9.  A genome-wide association study of seed protein and oil content in soybean.

Authors:  Eun-Young Hwang; Qijian Song; Gaofeng Jia; James E Specht; David L Hyten; Jose Costa; Perry B Cregan
Journal:  BMC Genomics       Date:  2014-01-02       Impact factor: 3.969

10.  Impacts of population structure and analytical models in genome-wide association studies of complex traits in forest trees: a case study in Eucalyptus globulus.

Authors:  Eduardo P Cappa; Yousry A El-Kassaby; Martín N Garcia; Cintia Acuña; Nuno M G Borralho; Dario Grattapaglia; Susana N Marcucci Poltri
Journal:  PLoS One       Date:  2013-11-25       Impact factor: 3.240

View more
  6 in total

1.  Genome-wide association study of inflorescence length of cultivated soybean based on the high-throughout single-nucleotide markers.

Authors:  Jinyang Wang; Xue Zhao; Wei Wang; Yingfan Qu; Weili Teng; Lijuan Qiu; Hongkun Zheng; Yingpeng Han; Wenbin Li
Journal:  Mol Genet Genomics       Date:  2019-02-09       Impact factor: 3.291

2.  Characterization and quantitative trait locus mapping of late-flowering from a Thai soybean cultivar introduced into a photoperiod-insensitive genetic background.

Authors:  Fei Sun; Meilan Xu; Cheolwoo Park; Maria Stefanie Dwiyanti; Atsushi J Nagano; Jianghui Zhu; Satoshi Watanabe; Fanjiang Kong; Baohui Liu; Tetsuya Yamada; Jun Abe
Journal:  PLoS One       Date:  2019-12-05       Impact factor: 3.240

3.  Genomic regions associated with physiological, biochemical and yield-related responses under water deficit in diploid potato at the tuber initiation stage revealed by GWAS.

Authors:  Paula Díaz; Felipe Sarmiento; Boby Mathew; Agim Ballvora; Teresa Mosquera Vásquez
Journal:  PLoS One       Date:  2021-11-08       Impact factor: 3.240

4.  Genetic Diversity and Selection Footprints in the Genome of Brazilian Soybean Cultivars.

Authors:  Heitor Calux Mendonça; Luiz Filipe Protasio Pereira; João Vitor Maldonado Dos Santos; Anderson Rotter Meda; Gustavo César Sant' Ana
Journal:  Front Plant Sci       Date:  2022-03-30       Impact factor: 5.753

5.  Construction of prediction models for growth traits of soybean cultivars based on phenotyping in diverse genotype and environment combinations.

Authors:  Andi Madihah Manggabarani; Takuyu Hashiguchi; Masatsugu Hashiguchi; Atsushi Hayashi; Masataka Kikuchi; Yusdar Mustamin; Masaru Bamba; Kunihiro Kodama; Takanari Tanabata; Sachiko Isobe; Hidenori Tanaka; Ryo Akashi; Akihiro Nakaya; Shusei Sato
Journal:  DNA Res       Date:  2022-06-25       Impact factor: 4.477

6.  Thirteen years under arid conditions: exploring marker-trait associations in Eucalyptus cladocalyx for complex traits related to flowering, stem form and growth.

Authors:  Osvin Arriagada; Antonio Teixeira do Amaral Junior; Freddy Mora
Journal:  Breed Sci       Date:  2018-06-29       Impact factor: 2.086

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.