Literature DB >> 31504706

Gene-set association and epistatic analyses reveal complex gene interaction networks affecting flowering time in a worldwide barley collection.

Tianhua He1, Camilla Beate Hill1, Tefera Tolera Angessa1, Xiao-Qi Zhang1, Kefei Chen2, David Moody3, Paul Telfer4, Sharon Westcott5, Chengdao Li1,5,6.   

Abstract

Single-marker genome-wide association studies (GWAS) have successfully detected associations between single nucleotide polymorphisms (SNPs) and agronomic traits such as flowering time and grain yield in barley. However, the analysis of individual SNPs can only account for a small proportion of genetic variation, and can only provide limited knowledge on gene network interactions. Gene-based GWAS approaches provide enormous opportunity both to combine genetic information and to examine interactions among genetic variants. Here, we revisited a previously published phenotypic and genotypic data set of 895 barley varieties grown in two years at four different field locations in Australia. We employed statistical models to examine gene-phenotype associations, as well as two-way epistasis analyses to increase the capability to find novel genes that have significant roles in controlling flowering time in barley. Genetic associations were tested between flowering time and corresponding genotypes of 174 putative flowering time-related genes. Gene-phenotype association analysis detected 113 genes associated with flowering time in barley, demonstrating the unprecedented power of gene-based analysis. Subsequent two-way epistasis analysis revealed 19 pairs of gene×gene interactions involved in controlling flowering time. Our study demonstrates that gene-based association approaches can provide higher capacity for future crop improvement to increase crop performance and adaptation to different environments.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Experimental Biology.

Entities:  

Keywords:  Barley; GWAS; epistasis; flowering time; gene-set association analysis; heritability; next-generation sequencing; phenology; target capture; target enrichment

Mesh:

Year:  2019        PMID: 31504706      PMCID: PMC6812734          DOI: 10.1093/jxb/erz332

Source DB:  PubMed          Journal:  J Exp Bot        ISSN: 0022-0957            Impact factor:   6.992


Introduction

Barley (Hordeum vulgare L.) is one the most important cereal crops in the world and is cultivated both in highly productive agricultural regions and in marginal environments prone to adverse conditions (Baum ). As a particularly resilient crop compared with other cereals such as wheat and rice, barley has the ability to adapt to biotic and abiotic stresses, holding a great deal of potential to increase production in marginal areas to sustain food security (Tester and Langridge, 2011). It is vital that barley flowers within a particular time window in a given environment to maximize yield, while minimizing exposure to frost, heat, and drought stress during the growing season (Maurer ). It is also known that genes controlling phenology including flowering time (FT) overlap with many grain yield-related genes (Hill ). Sharma identified a total of 96 quantitative trait loci (QTLs) mapped for grain yield in a nested association mapping population, the majority of which also co-localized with known genes controlling FT. Harnessing the power of genomic tools to manipulate FT for barley improvement is of considerable importance to meet the food and feed demands of the future. Understanding the genetic basis of FT including the interactions between different FT genes has the potential to considerably enhance genetic improvement and future barley breeding. Insights gained from model plants such as Arabidopsis thaliana made it possible to explore the function of gene orthologues and related pathways in barley, but not all genes and gene networks discovered in A. thaliana are conserved across the plant kingdom. For example, monocot-specific genes and gene networks, including species-specific flowering gene networks in rice, have been reported (Xue ; Matsubara ). Rapid advancements in genome sequencing technologies including reduced representation sequencing approaches, combined with high-throughput genotyping and the availability of a high-quality reference genome, now allow for an unprecedented view into complex genetic architectures in barley (Waugh ; Huang ; Mayer ; Mascher ; Sharma ; Hill ). Genome-wide association studies (GWAS) have emerged as powerful tools for identifying genetic variants associated with crop plant phenotypes (Pasam ; Yano ; Fang ). Commonly used single-marker GWAS approaches test each single nucleotide polymorphism (SNP) individually for the association with a trait, which has delivered considerable insight into the genetic control of traits (Yang ). However, only the most significant SNPs in the genome are taken into account with the single-marker approach, thus can often explain only a small proportion of the genetic variation. In fact, single SNP variants explained <10% of phenotypic variation for the majority of complex phenotypes (Manolio ). Moreover, single SNP analyses consider only the effect of individual SNP and often examine additive models only, while most quantitative traits are polygenic and thus also determined by gene×gene interactions (epistasis). Epistasis is known to play a crucial role in regulation of many complex traits in plants, animals, and humans (Doust ; Phillips, 2008). Different theoretical frameworks and statistical methodologies for epistasis analysis have been developed to improve the detection of genes responsible for complex human diseases (as reviewed in Wei ). However, models that take multiple SNP markers into account are still not widely adopted and have only recently been applied to plants, including crops, to identify novel candidate genes and gene networks controlling complex agronomic traits. For example, FT is a crucial yet complex trait of interest in barley and other agronomically important crops (Hill and Li, 2016); several studies have reported gene×gene interactions affecting FT in different plant species (Caicedo ; Durand ; Maurer ). Mathew observed genomic regions with main or higher order epistatic effects overlapping with known candidate genes that were reported previously in barley and closely related species for FT. In sorghum, it is known that Maturity locus 1 (Ma1) represses expression of the floral activator Early heading date 1 (Ehd1), which activates FT to produce florigen for floral induction (Rooney and Aydin, 1999). Li revealed a significant interaction between the QTL harbouring Ma1 and the QTL harbouring FT through epistasis analysis. The reported gene×gene interactions are consistent with the networking system proposed for the control of the timing of flowering (Blázquez, 2000; Valverde ; Imaizumi and Kay, 2006). To overcome these limitations, gene-set analysis (GSA) has emerged as a more powerful approach than single SNP analysis (Nam ). GSA has several advantages. First, GSA can aggregate effects of many SNPs with weak associations. Although individual SNPs may show little or no effect, their interactions may have a non-linear effect if an unbiased analysis for interactions within combinations of SNPs is performed (Wang ; Mooney and Wilmot, 2015; Pers, 2016). Secondly, GSA takes allelic heterogeneity into consideration (i.e. different SNPs within a gene linking to a similar phenotype) which is usually not possible in the single SNP GWAS test (Zöllner and Pritchard, 2005; Guan and Stephens, 2011; Jiang ). Thirdly, GSA could capture local epistatic interactions between SNPs within a gene and therefore potentially increase prediction accuracies (Zhang ; Jiang ). As FT is believed to be controlled by a complex interacting gene network probably influenced by the effects of sets of genes (Hill and Li, 2016, and reference therein), testing associations between a phenotype and the cumulative effect of genes may identify more functionally relevant candidate genes with higher accuracy than single SNP GWAS. Here, we revisited previously published data sets: (i) phenotypic data of 895 barley varieties grown over two years in four different field locations with varying seasonal temperature and rainfall conditions in Western Australia’s South West; and (ii) genotypic data obtained from the targeted resequencing of 174 putative phenology-related genes and gene orthologues (Hill et al., 2019a, b). Building on the previous study, here we aimed to achieve higher statistical power to detect significant genes and gene networks that influence FT in barley by expanding single SNP GWAS analysis to gene-based analysis and epistasis analysis. By taking only SNPs detected within gene-coding regions of putative FT-related genes into account, we first re-calculated the narrow-sense SNP-based heritability of awn emergence as an equivalent to FT (Alqudah and Schnurbusch, 2017). We then re-assessed the association of individual SNPs and FT by standardizing and averaging FT across multiple locations and experimental years. We further grouped SNPs from the same genes into distinct gene sets and tested the association of each gene set with FT. Finally, we identified interacting SNP pairs using a two-way epistasis analysis and determined an expanded and improved gene interaction network which regulates FT in barley.

Materials and methods

Plant material, phenotypic data, and genes enriched in SNPs

Plant material, phenotypic data, and phenology gene-enriched genetic variants were previously reported in detail in Hill et al. (2019a, b). Briefly, 952 barley accessions from 41 countries in Europe, Asia, North and South America, Africa, and Australia were initially selected to represent the global diversity for phenology genes in barley. These accessions represent the entire spectrum of cultivated barley, including two- and six-row genotypes, and winter and spring growth habits. These accessions were grown in 2015 and 2016 at four locations in Western Australia which significantly differ in rainfall and temperature during the growing season. Awn emergence, defined as the number of days from sowing to the first awn emergence above the flag leaf (Z49) (Zadoks ), was recorded as an equivalent to FT (Alqudah and Schnurbusch, 2017). A total of 2758 SNPs were enriched from 174 putative genes that are related to phenology and the development of meristem and inflorescences. Full details of field experiments, targeted resequencing of phenology genes, and SNP discovery and filtering were provided in Hill et al. (2019a, b).

Data preparation

The original measurement of days to Z49 for each accession was transformed to standardized FT (FTD) separately for each growing environment and year using the formula: We then averaged FTD across four locations and two years for each barley variety to minimize the random effect, while not shrinking the genetic effects (Piepho ). Barley accessions or SNP loci with >10% missing data were excluded from analysis. For the remaining missing SNP data in the data set, we inspected each missing datum individually and replaced the missing data manually with the most likely allelic combinations with consideration of linkage equilibrium and allelic state of the individual in other SNP loci. After the filtering, 895 barley accessions and 2758 SNPs remained for heritability estimation and GWAS analysis.

Estimation of narrow-sense SNP-based heritability

A genome-based restricted maximum likelihood method (GREML-LDMS) was used to estimate the heritability of FT using all filtered SNPs. GREML-LDMS corrects linkage disequilibrium biases in the estimated SNP-based heritability (Yang ). To calculate narrow-sense heritability from SNP data, h2SNP, we first computed linkage disequilibrium (LD) scores between SNPs with the block size of 100 kb using the computer software package GCTA (Yang ). We used the GREML (a function within GCTA) to estimate the proportion of variance in a phenotype explained by all SNPs (i.e. the SNP-based heritability), following an LD score regression approach as detailed in Yang . h2SNP was estimated both with and without additional data descriptors (growth habit, row type, and origin of the barley accessions) fitted as fixed effects.

Genome-wide association analysis

We used a linear mixed model (LMM) for GWAS analysis as implemented in the Factored Spectrally Transformed Linear Mixed Models (FaST-LMM) package to perform single SNP, gene-set GWAS, and epistasis analysis (Lippert ; Listgarten ; Widmer ). GWAS are often confounded by population substructure and sample relatedness. LMMs are a powerful and established tool for studying genotype–phenotype relationships. LMMs can capture confounders (e.g. population substructure and family relatedness) of GWAS simultaneously, without requiring prior knowledge of whether the confounders are present or not (Lippert ). Its computational efficiency also makes it feasible for an exhaustive search for gene×gene interactions (Lippert ; Widmer ). For GWAS analysis, we calculated the first five principal eigenvectors from principal components analysis (PCA) using GCTA (Yang ) and subsequently included them as covariates in the model as fixed effects for association analysis. GWAS analysis was conducted using the Python-based program FaST-LMM (Listgarten ) following the developers’ instructions (available from http://microsoftgenomics.github.io/FaST-LMM/). Genetic data were formatted into the binary Plink ped input file format (*.bed, *.bim, and *.fam) using Plink 2.0 (Chang ). For single SNP association analysis, we used the average FTD (see Equation 1) of each filtered barley accession as the phenotypic data, all filtered SNPs as genetic data, and the first five principal eigenvectors from the PCA as the covariate. For GSA, we first grouped the SNPs into 174 gene sets with each set of SNPs corresponding to one gene (each gene set had an average of 18 SNPs ranging from 1 to 167). The algorithm as employed in FaST-LMM uses two random effects—one to capture the confounder’s effect and the other to reflect the set association signal—to correct for confounder, and uncovers signal not recoverable by single-SNP GWAS analysis (Listgarten ). For epistasis testing, one SNP (the first polymorphic SNP locus) was taken from each gene, as such a filtering approach significantly reduces the required statistical power for multiple testing. The GWAS analysis was then used to test whether pairs of SNPs taken together explain a higher proportion of variance than the sum of the individual effects of each SNP analysed separately (Widmer ). Because the SNPs were enriched from putative genes that were reported to be associated with FT in barley, A. thaliana, and other cereal crops, we adopted a less stringent threshold than Bonferroni correction to define the significance in GWAS. We instead used the Holm’s sequential Bonferroni correction (Holm, 1979) with a significance threshold at P<0.05 to determine significant SNPs, gene sets, and SNP pairs with epistatic interaction. Sequential Bonferroni correction is an adjusted Bonferroni correction depending on rank to maximize the statistical power in GWAS whilst being stringent. ANOVA was implemented using SPSS (Statistical Package for the Social Sciences, SPSS Inc., Chicago, Il, USA) software, and P<0.05 was used as the statistically significant threshold.

Regulatory connections between flowering genes

Interacting network of flowering genes was constructed using STRING, a database of known and predicted gene–gene (protein–protein) interactions (Szklarczyk ). In STRING, each protein–protein interaction is assigned a score, as an indicator of confidence of a true interaction. A score of 0.7 was used to assign high confidence when retaining the interaction. Connections between the networks of each key gene were achieved by connecting the overlapping genes and epistatic interactions as revealed in the epistatic analysis.

Results

Flowering time and environmental influence

All 895 barley accessions were grown across multienvironment field trials, conducted over four geographical locations and two years in Western Australia. Significant phenotypic differences of agronomic and phenological traits measured were present for the set of barley genotypes grown in the field at different geographical locations in WA in the 2015 and 2016 growing seasons (Table 1; with more details in Hill ). Average time to flowering for the 895 accessions ranged from 65 d to 85 d, with median time from 72 d to 111 d, after sowing across the trial environments. The range in FTs for all accessions evaluated varied from 42–94 d to 63–136 d across the environments. Geraldton in the North of WA is characterized by a hot and dry environment with a short growing season, with the lowest median number of days to Z49 recorded for any environment (72 d), with a range of 46–89 d recorded in 2015. The trial environments at Esperance (ESP) in Southern WA have a longer, wetter, and cooler growing season, and thus recorded the longest maximum days to Z49 (146 d) in 2016.
Table 1.

Locations and experimental years, major climatic factors, and flowering time mean (days to Z49)

Location (year) Tmin–Tmax (°C) Tmean (°C)Rainfall (mm)Global solar radiation (MJ m−2)Growth period (d)Days to Z49 median (range)
Geraldton (2015)2–4017.3189.818.0318272 (46–89)
Geraldton (2016)3–4115.1355.417.5021080 (44–91)
Katanning (2015)4–3614.0550.014.63208105 (69–132)
Katanning (2016)-3–3812.9256.216.87244105 (75–131)
Esperance (2015)1–4114.7318.212.35203104 (60–136)
Esperance (2016)3–3713.6343.612.86201110 (72–146)
Merredin (2016)-1–3714.1181.416.39191111 (80–131)

Tmin/Tmax/Tmean: minimum/maximum/mean temperature during the growing season. Environmental data were taken over 200 d since the sowing date during the growth period for comparisons. Modified from Hill

Locations and experimental years, major climatic factors, and flowering time mean (days to Z49) Tmin/Tmax/Tmean: minimum/maximum/mean temperature during the growing season. Environmental data were taken over 200 d since the sowing date during the growth period for comparisons. Modified from Hill The average range to Z49 between the earliest and the latest flowering types was 59 d, showing the considerable genetic difference in controlling the switch from vegetative growth to reproduction among the tested barley accessions. Variation of FT within barley accessions in different environments is strongly influenced by average temperature during the growth period. Growing season average temperature explained 62.9% (P=0.0001) of variance in FT across environments of four locations and two years (Fig. 1), while minimum/maximum temperature, global solar radiation, and rainfall during the growth period had no significant influence on FT (P>0.05). The trial environments received an optimum rainfall throughout the two growing seasons at all four locations.
Fig. 1.

Mean temperature influencing flowering time (days to Z49) in barley in seven experimental environmental sets across four locations in two years. r and p represent correlation coefficient and probability, respectively, assuming a linear relationship between flowering and temperature. Whiskers are standard deviations.

Mean temperature influencing flowering time (days to Z49) in barley in seven experimental environmental sets across four locations in two years. r and p represent correlation coefficient and probability, respectively, assuming a linear relationship between flowering and temperature. Whiskers are standard deviations. Barley accessions with contrasting growth habit (spring or winter type) had similar standardized days to Z49 (FTD), as did the barley accessions with different row type (P>0.05). However, average FTD of barley accessions with different origin was significantly different (ANOVA P<0.0001) (Fig. 2). Barley accessions with different origin also had unequal variances in FTD (ANOVA, F=9.117, df=44.2, P<0.0001).
Fig. 2.

Phenology of barley accessions with contrasting growth habits, row types, and geographic origin of accessions. Asterisk indicates significant difference in ANOVA. Numbers in parentheses indicate the number of samples, and only the samples positively identified were included.

Phenology of barley accessions with contrasting growth habits, row types, and geographic origin of accessions. Asterisk indicates significant difference in ANOVA. Numbers in parentheses indicate the number of samples, and only the samples positively identified were included.

Gene-set GWAS analysis of flowering time

After filtering (<10% missingness) and pruning to only SNPs located within gene-coding regions of the 174 targeted phenology genes, 895 barley accessions and 2758 SNP markers were retained. Genetic variation of the 2758 SNPs across the barley accessions were not structured by row type, nor by growth habit, nor by geographic origin, confirming previous findings (Hill et al., 2019a, b). Narrow-sense heritability as estimated from all SNP (h2SNP) was estimated at 0.395±0.048. Specifying the origin of each barley line in the analysis as a fixed effect increased h2SNP to 0.503±0.056, while including growth habit or row type as fixed effects did not improve the estimation of heritability. Average temperature, as one of the most significant environmental factors, explained 3.8% of the variance in FTD between barley accessions which was a small yet significant (P<0.001) amount. Using the sequential Bonferroni correction and significance threshold of P<0.05, GWAS analysis was performed using FaST-LMM (Listgarten ), and 170 SNP loci were found to be associated with FTD across all environments (see the Materials and methods). Systematic biases in GWAS were low, indicated by a λ GC, the genomic inflation factor, close to 1.1 (Winkler ) (Supplementary Fig. S1 at JXB online). These SNPs were located within the gene-coding regions of 32 genes on six chromosomes, with no significant SNPs detected on chromosomes 4H (Fig. 3). The subsequent GSA revealed 113 gene sets, corresponding to 113 putative genes, among the 174 genes that were previously shown as flowering-related genes in cereal crop species or in A. thaliana (Hill et al., 2019a, b), associated with FTD in the barley accessions. Those significant genes are located on all seven chromosomes (Fig. 4) and from all flowering pathways (Table 2): photoperiod and circadian clock (34 genes), meristem response and development (27 genes), gibberellin signalling and metabolism (19 genes), grain development (15 genes), vernalization regulation (14 genes), and light perception and signalling (10 genes). Among the 170 significant SNPs as detected in single SNP GWAS, 167 SNPs and the 29 corresponding genes they belong to were also detected as part of gene sets to be significantly associated with FTD (Table 2).
Fig. 3.

Manhattan plot of single SNP GWAS showing significant SNPs that are associated with flowering time in barley accessions. Significant SNPs are shown with larger symbols above the red dashed line as the significance threshold. Significance was determined by sequential Bonferroni correction at P<0.05.

Fig. 4.

Manhattan plot of gene-set GWAS showing significant genes that are associated with phenology in the barley accessions. Significant genes are shown above the red dashed line as the significance threshold as determined by sequential Bonferroni correction at P<0.05.

Table 2.

Genes, their annotation, and associated flowering pathways in barley, as revealed to be significantly associated with flowering time through gene-set analysis

Putative gene nameAnnotation (Hv_IBSC_PGSB_r1_HighConf)Gene ID (Hv_IBSC_PGSB_r1 _HighConf)Flowering pathway
HvADA2 Transcriptional adapter 2HORVU5Hr1G095400Vernalization
HvAGL1 MADS-box transcription factor TaAGL1HORVU6Hr1G002330Vernalization and autonomous pathways
HvAGL32 MADS-box transcription factor 31HORVU2Hr1G098930Meristem response and development
HvAGLG1 MADS-box transcription factor 34HORVU5Hr1G095710Meristem response and development
HvAP2 AP2-like ethylene-responsive transcription factorHORVU2Hr1G113880Meristem response and development
HvARF2 auxin response factor 2HORVU3Hr1G096510Grain size and reproductive development
HvBB E3 ubiquitin ligase BIG BROTHERHORVU4Hr1G055690Grain development
HvBM1 MADS-box transcription factor 47HORVU4Hr1G077850Meristem response and development
HvBM16 MADS-box transcription factor 16HORVU7Hr1G091210Meristem response and development
HvBM3 MADS-box transcription factor 18HORVU0Hr1G003020Meristem response and development
HvBM5 (HvVRN-H1) MADS-box transcription factor 14HORVU5Hr1G095630Vernalization
HvBM8 MADS-box transcription factor 15HORVU2Hr1G063800Meristem response and development
HvBM9 MADS-box transcription factor 7HORVU7Hr1G054220Meristem response and development
HvCBF10A ethylene-responsive element binding factor 13HORVU5Hr1G080430Vernalization
HvCBF14 Ethylene-responsive element binding factor 14HORVU5Hr1G080350Vernalization
HvCBF2A Dehydration-responsive element-binding protein 1BHORVU5Hr1G080310Vernalization
HvCBF3 C-repeat-binding factor 4HORVU5Hr1G080420Vernalization
HvCBF4A Dehydration-responsive element-binding protein 1BHORVU5Hr1G080300Vernalization
HvCBF6 C-repeat-binding factor 4HORVU5Hr1G080450Vernalization
HvCBF8A C-repeat binding factor 3-like proteinHORVU2Hr1G041090Vernalization
HvCBF9 Dehydration-responsive element-binding protein 1BHORVU5Hr1G080230Vernalization
HvCCA1 circadian clock-associated 1HORVU7Hr1G070870Photoperiod and circadian clock
HvCDF1 DOF zinc finger protein 1HORVU2Hr1G017290Photoperiod and circadian clock
HvCEN Protein TERMINAL FLOWER 1HORVU2Hr1G072750Meristem response and development
HvCIGARP GRAS family transcription factorHORVU2Hr1G043780Gibberellin signalling and metabolism
HvCIGARP-2 SCARECROW-like 1HORVU3Hr1G091250Gibberellin signalling and metabolism
HvCK2B casein kinase II beta subunit 4HORVU1Hr1G055250Photoperiod and circadian clock
HvCKX Cytokinin dehydrogenase 2HORVU3Hr1G027460Grain development
HvCMF4 CCT motif family proteinHORVU4Hr1G084020Photoperiod and circadian clock
HvCMF6b Zinc finger protein CONSTANS-LIKE 4HORVU1Hr1G095410Photoperiod and circadian clock
HvCO11 Zinc finger protein CONSTANS-LIKE 16HORVU6Hr1G073170Photoperiod and circadian clock
HvCO2 receptor kinase 3HORVU6Hr1G072620Photoperiod and circadian clock
HvCO8 CONSTANS-like 5HORVU7Hr1G027560Photoperiod and circadian clock
HvCOP1 Erect panicle 2 proteinHORVU2Hr1G031030Grain development
HvCry1a cryptochrome 1HORVU6Hr1G049950Light perception and signalling
HvCry2 cryptochrome 2HORVU6Hr1G058740Light perception and signalling
HvCYP1 Cytochrome P450 superfamily proteinHORVU2Hr1G081650Grain development
HvDRF1 Ethylene-responsive transcription factor 4HORVU1Hr1G060490Meristem response and development
HvDRF2 Ethylene-responsive transcription factor 4HORVU6Hr1G050500Meristem response and development
HvEFS Histone-lysine N-methyltransferase 2AHORVU2Hr1G000940Photoperiod and circadian clock
HvELF3 Early flowering 3HORVU1Hr1G094980Photoperiod and circadian clock
HvELF4-like4 ELF4-like 4HORVU5Hr1G060000Photoperiod and circadian clock
HvELF7 RNA polymerase II-associated factor 1 homologHORVU3Hr1G001430Photoperiod and circadian clock
HvFCA FCA-A1HORVU5Hr1G050820Photoperiod and circadian clock
HvFD Lysine-specific histone demethylase 1 homolog 3HORVU2Hr1G096300Meristem response and development
HvFT1 FLOWERING LOCUS T 1HORVU7Hr1G024610Photoperiod and circadian clock
HvFT2 Protein FLOWERING LOCUS THORVU3Hr1G027590Photoperiod and circadian clock
HvFT3 Protein FLOWERING LOCUS THORVU1Hr1G076420Photoperiod and circadian clock
HvFT5 Protein FLOWERING LOCUS THORVU4Hr1G090390Photoperiod and circadian clock
HvFTL5 Protein FLOWERING LOCUS THORVU2Hr1G084540Photoperiod and circadian clock
HvGA20ox1 gibberellin 20 oxidase 1HORVU5Hr1G124120Gibberellin signalling and metabolism
HvGA20ox2 gibberellin 20-oxidase 2HORVU3Hr1G090980Gibberellin signalling and metabolism
HvGA20ox2-2 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily proteinHORVU1Hr1G070710Gibberellin signalling and metabolism
HvGA20ox2-2 1-aminocyclopropane-1-carboxylate oxidase 1HORVU2Hr1G114980Gibberellin signalling and metabolism
HvGA20ox2-3 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily proteinHORVU4Hr1G013840Gibberellin signalling and metabolism
HvGA20ox3 gibberellin 20 oxidase 2HORVU3Hr1G089980Gibberellin signalling and metabolism
HvGA2betadiox7 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily proteinHORVU3Hr1G117870Gibberellin signalling and metabolism
HvGA3ox1 gibberellin 3-oxidase 1HORVU2Hr1G118350Gibberellin signalling and metabolism
HvGA3ox2 gibberellin 3-oxidase 2HORVU3Hr1G022840Gibberellin signalling and metabolism
HvGARMP Scarecrow-like transcription factor PAT1HORVU4Hr1G071670Gibberellin signalling and metabolism
HvGID1 Gibberellin receptor GID1HORVU1Hr1G060810Gibberellin signalling and metabolism
HvGID1L2-3 alpha/beta-Hydrolases superfamily proteinHORVU5Hr1G068140Gibberellin signalling and metabolism
HvGID1L2-4 alpha/beta-Hydrolases superfamily proteinHORVU5Hr1G069040Gibberellin signalling and metabolism
HvGID1L2-5 alpha/beta-Hydrolases superfamily proteinHORVU5Hr1G098770Gibberellin signalling and metabolism
HvGID1L2-8 AcetylesteraseHORVU4Hr1G015550Gibberellin signalling and metabolism
HvGRP7a Histone-lysine N-methyltransferaseHORVU4Hr1G003060Photoperiod and circadian clock
HvGW7 unknown functionHORVU2Hr1G032710Grain development
HvHYL alpha/beta-Hydrolases superfamily proteinHORVU0Hr1G004410Gibberellin signalling and metabolism
HvLFY1 Floricaula/leafy homologHORVU2Hr1G102590Meristem response and development
HvLNG1 unknown functionHORVU2Hr1G063820Grain development
HvLUX1 Two-component response regulator ARR1HORVU3Hr1G114970Circadian clock
HvMADS25-2 MADS-box transcription factor 25HORVU7Hr1G023940Meristem response and development
HvMADS25-3 MADS-box transcription factor 25HORVU7Hr1G024000Meristem response and development
HvMADS26 MADS-box transcription factor 26HORVU7Hr1G076310Meristem response and development
HvMADS68 MADS-box transcription factor family proteinHORVU4Hr1G032440Meristem response and development
HvMADS75 MADS-box transcription factor family proteinHORVU5Hr1G110470Meristem response and development
HvNAT Acyl-CoA N-acyltransferases (NAT) superfamily proteinHORVU7Hr1G113480Grain development
HvNHL NHL domain-containing proteinHORVU6Hr1G045970Grain development
HvPAF Phytochrome A-associated F-box proteinHORVU1Hr1G058630Light perception and signalling
HvPFT1 Mediator of RNA polymerase II transcription subunit 25HORVU5Hr1G054650Light perception and signalling
HvPhyA phytochrome AHORVU4Hr1G008610Light perception and signalling
HvPhyB phytochrome BHORVU4Hr1G053400Light perception and signalling
HvPhyC phytochrome CHORVU5Hr1G095530Light perception and signalling
HvPI MADS-box transcription factor 4HORVU1Hr1G063620Meristem response and development
HvPI-2 MADS-box transcription factor 2HORVU3Hr1G091000Meristem response and development
HvPIF4 Transcription factor EBHORVU5Hr1G011780Light perception and signalling
HvPPD-H1 pseudo-response regulator 7HORVU2Hr1G013400Photoperiod and circadian clock
HvPRR59 Two-component response regulator-like APRR5HORVU4Hr1G021010Photoperiod and circadian clock
HvPRR73 pseudo-response regulator 7HORVU4Hr1G057550Photoperiod and circadian clock
HvPRR95 Two-component response regulator-like PRR95HORVU5Hr1G081620Photoperiod and circadian clock
HvRLPK Leucine-rich receptor-like protein kinase family proteinHORVU4Hr1G079040Grain development
HvRNG Protein SIP5HORVU6Hr1G044080Grain development
HvSCPL33 Carboxypeptidase Y homolog AHORVU3Hr1G033550Grain development
HvSHP1 MADS-box transcription factor 13HORVU1Hr1G023620Meristem response and development
HvSP1 Protein NRT1/ PTR FAMILY 4.3HORVU4Hr1G015640Grain development
HvSPL11 squamosa promoter binding protein-like 2HORVU6Hr1G031450Photoperiod and circadian clock
HvSPL12 squamosa promoter-binding-like protein 3HORVU6Hr1G019700Photoperiod and circadian clock
HvSPL14 squamosa promoter-binding-like protein 17HORVU0Hr1G020810Photoperiod and circadian clock
HvSPL3 squamosa promoter binding protein-like 8HORVU6Hr1G030490Meristem response and development
HvSS1 strictosidine synthase-like 3HORVU5Hr1G091230Grain development
HvSTK MADS-box transcription factor 21HORVU1Hr1G064150Meristem response and development
HvTEM1 AP2/B3 transcription factor family proteinHORVU3Hr1G010100Photoperiod and circadian clock
HvTFL1 Protein TERMINAL FLOWER 1HORVU5Hr1G042230Meristem response and development
HvTOC1 Two-component response regulator-like PRR1HORVU6Hr1G057630Photoperiod and circadian clock
HvTT16 MADS-box transcription factor 29HORVU6Hr1G032220Meristem response and development
HvTUBA3 tubulin alpha-4 chainHORVU4Hr1G009520Meristem response and development
HvVEL1 Protein VERNALIZATION INSENSITIVE 3HORVU6Hr1G022770Vernalization
HvVIN3 Protein VERNALIZATION INSENSITIVE 3HORVU7Hr1G099250Vernalization
HvWPSRLK Mitochondrial transcription termination factor family proteinHORVU2Hr1G061060Grain development
HvWRKY61 WRKY DNA-binding protein 3HORVU5Hr1G028340Grain development
HvZCCTc Zinc finger protein CONSTANS-LIKE 4HORVU1Hr1G056120Vernalization
HvZTLa Kelch repeat-containing F-box family proteinHORVU7Hr1G099010Photoperiod and circadian clock
HvZTLb Adagio-like protein 1HORVU6Hr1G022330Photoperiod and circadian clock
a HvAG1 MADS-box transcription factor 3HORVU3Hr1G026650Meristem response and development
a HvCK2A Protein kinase superfamily proteinHORVU0Hr1G030500Photoperiod and circadian clock
a HvPAP2 Auxin-responsive protein IAA17HORVU3Hr1G031460Light perception and signalling
b HvBM7 MADS-box transcription factor 1HORVU4Hr1G067680Meristem response and development
b HvCO1 B-Box-type zinc finger transcription factorHORVU7Hr1G043030Photoperiod and circadian clock
b HvCry1b cryptochrome 1HORVU2Hr1G079220Light perception and signalling
b HvEDL2 EID1-like F-box protein 2HORVU2Hr1G034270Photoperiod and circadian clock
b HvGA2ox3 gibberellin 2-oxidaseHORVU3Hr1G072810Gibberellin signalling and metabolism

Significance was determined by sequential Bonferroni correction (P<0.05). The detailed list with chromosome position is in table S1 in Hill . Annotation and Gene ID follows Hv_IBSC_PGSB_r1_HighConf.

Significant only in single SNP GWAS analysis.

Significant only in epistasis analysis.

Manhattan plot of single SNP GWAS showing significant SNPs that are associated with flowering time in barley accessions. Significant SNPs are shown with larger symbols above the red dashed line as the significance threshold. Significance was determined by sequential Bonferroni correction at P<0.05. Manhattan plot of gene-set GWAS showing significant genes that are associated with phenology in the barley accessions. Significant genes are shown above the red dashed line as the significance threshold as determined by sequential Bonferroni correction at P<0.05. Genes, their annotation, and associated flowering pathways in barley, as revealed to be significantly associated with flowering time through gene-set analysis Significance was determined by sequential Bonferroni correction (P<0.05). The detailed list with chromosome position is in table S1 in Hill . Annotation and Gene ID follows Hv_IBSC_PGSB_r1_HighConf. Significant only in single SNP GWAS analysis. Significant only in epistasis analysis.

Epistatic effects of genes associated with flowering time

Two-way (interaction of two SNPs) epistasis analysis revealed 19 pairs of SNPs (sequential Bonferroni corrected P<0.05), among the overall 30 276 pairs between each of the phenology genes studied here, interacting to influence FT. Depending on the combination of SNPs in their allelic state, 12 pairs significantly promoted earlier flowering (–8 d), and seven pairs were linked with later flowering (+10 d) when compared with average FT (Table 3). A homozygote at an alternative state (‘GG’ versus ‘TT’ in the reference genome) in HvELF7 (an RNA polymerase II-associated factor 1 homologue gene) interacted with six other SNPs promoting earlier flowering, while the HvGA2ox3 (a gibberellin 2-oxidase gene) homozygote at an alternative state (‘AA’ versus ‘GG’ in the reference genome) interacts with other genes to delay flowering in barley (Fig. 5). For example, cultivar ‘UWA2Rsel9506’ which has genotype ‘GG’ in HvELF7 tends to flower earlier when HvCO1 (a zinc finger protein CONSTANS-LIKE gene) has genotype ‘GG’ across all experimental locations. Seven accessions (‘07T741’, ‘B559’, ‘B751’, ‘Han 85-222’, ‘I92-562’, ‘ICB104039’, and ‘Lao Wu Hu Xu Mai’) with HvGA2ox having genotype ‘AA’ and HvCKX (a cytokinin dehydrogenase gene) having genotype ‘GG’ usually flower later across our trials. When homozygous in an alternative state (‘GG’ versus ‘CC’ in the reference genome), HvPhyB (a phytochrome B gene) interacts with two other genes (HvNHL, an NHL domain-containing protein gene, and HvTOC, a two-component response regulator-like PRR1 gene) to promote early flowering, while when in the heterozygous state, this gene interacts with other genes (HvSPL3, a squamosa promoter-binding protein-like gene) to promote late flowering. Eight out of the 13 genes revealed to have epistatic interactions were also significant in the SNP-set GWAS analysis, while the remaining five were defined as insignificant both in the single SNP and gene-set GWAS analyses (Table 2).
Table 3.

SNP–SNP interaction in determining flowering time in barley as revealed by epistasis analysis

Gene_1FTDGene_2FTDGene interactionFTD
Gene interactions to promote early flowering
HvCBF8A (CC)0.64±0.12 HvELF7 (GG)0.52±0.09CC–GG0.48±0.09
HvCO1 (GG)0.63±0.12 HvELF7 (GG)0.52±0.09GG–GG0.48±0.10
HvCry1b (TT)0.64±0.12 HvELF7 (GG)0.52±0.09TT–GG0.48±0.09
HvBM7 (CC)0.64±0.12 HvELF7 (GG)0.52±0.09CC–GG0.48±0.09
HvPhyB (CC)0.65±0.12 HvELF7 (GG)0.52±0.09CC–GG0.48±0.09
HvFT1 (CC)0.64±0.13 HvELF7 (GG)0.52±0.09CC–GG0.48±0.09
HvCK2B (GG)0.64±0.12 HvCO1 (CC)0.64±0.12GG–CC0.55±0.13
HvCO1 (GG)0.63±0.12 HvZCCTc (CC)0.60±0.13GG–CC0.58±0.11
HvPhyB (GG)0.60±0.13 HvNHL (GG)0.61±0.12GG–GG0.56±0.08
HvPhyB (GG)0.60±0.13 HvTOC1 (TT)0.62±0.13GG–TT0.56±0.08
HvPhyA (AA)0.61±0.13 HvZTLa (GG)0.64±0.12AA–GG0.57±0.13
HvCBF8A (TT)0.59±0.12 HvEDL2 (TT)0.64±0.13TT–TT0.58±0.10
Gene interactions to delay flowering
HvPhyA (GG)0.64±0.12 HvZTLb (GG)0.65±0.13GG–GG0.66±0.11
HvPhyB (CC)0.64±0.12 HvSPL3 (CC)0.61±0.12CC–CC0.69±0.11
HvSLN1 (CC)0.67±0.12 HvCO8 (TT)0.70±0.16CC–TT0.74±0.14
HvCKX (CC)0.70±0.15 HvGA2ox3 (AA)0.69±0.15CC–AA0.77±0.15
HvFT2 (GG)0.70±0.16 HvGA2ox3 (AA)0.69±0.15GG–AA0.77±0.15
HvFT2 (GG)0.70±0.16 HvCBF6 (TT)0.69±0.15GG–TT0.77±0.15
HvCBF6 (TT)0.69±0.15 HvCKX (CC)0.70±0.15TT–CC0.77±0.15

Flowering time (days to Z49) was standardized to 0–1 as FTD (see the Materials and methods). Letters in parentheses indicate the genotype of the first SNP of the gene. FTD is presented as mean ±SD. Note that the average FTD across all samples was 0.64±0.12

Fig. 5.

Significant flowering genes and their regulatory connections in barley (Hordeum vulgare L.). Putative gene name and gene IDs were from Ensembl Plants Hordeum vulgare Genome assembly 082214v1 that was archived in STRING (Szklarczyk ). The interactions, including type and effects, were based direct (physical) and indirect (functional) associations from computational prediction and knowledge transfer between organisms, as implemented in STRING (Szklarczyk ).

SNP–SNP interaction in determining flowering time in barley as revealed by epistasis analysis Flowering time (days to Z49) was standardized to 0–1 as FTD (see the Materials and methods). Letters in parentheses indicate the genotype of the first SNP of the gene. FTD is presented as mean ±SD. Note that the average FTD across all samples was 0.64±0.12 Significant flowering genes and their regulatory connections in barley (Hordeum vulgare L.). Putative gene name and gene IDs were from Ensembl Plants Hordeum vulgare Genome assembly 082214v1 that was archived in STRING (Szklarczyk ). The interactions, including type and effects, were based direct (physical) and indirect (functional) associations from computational prediction and knowledge transfer between organisms, as implemented in STRING (Szklarczyk ).

Gene interaction network in regulation of flowering time

Using key genes involved in flowering regulation in barley as recorded in the comprehensive protein–protein interaction database ‘STRING’ and also including additional candidate genes as revealed in our gene–gene interactions (epistatic interaction) analysis, we constructed a complex gene regulatory network (Fig. 5). The network involved 18 genes that were identified as significant in the above gene-based associated analysis. These genes are known to have roles in light signalling (e.g. HvPhyC), photoperiod response (HvPpD-H1), circadian clock (HvELF3), and development of the inflorescence meristem (HvCEN). Twenty-one genes were uncharacterized in the Hordeum vulgare genome assembly 082214v1.

Discussion

We have previously identified 429 functional alleles within the coding regions of 95 genes associated with FT in barley using single-marker GWAS (Hill ). In this study, by expanding to GSA and epistasis analysis, we achieved higher statistical power, and with potentially high accuracy, to detect significant genes and gene networks that influence FT in barley. We have identified 121 genes that have been associated with FT in barley, including 26 that have not been described in barley in previous research. All 121 genes have been previously described in dicot A. thaliana, and monocot cereal crops (e.g. rice, maize, and sorghum), indicating that many of the flowering genes are conserved across angiosperms including dicots and monocots (Blümel ). FT genes involved in the photoperiod, vernalization, circadian clock, and gibberellin biosynthesis pathways were previously studied in barley (Turner ; Wang ; Maurer ; Mathew ). Our GSA detected essential genes involved in the key flowering pathways and confirmed that these genes were indeed controlling the FT in the barley accessions with a broad geographic origin. We note that our SNPs have been enriched from putative flowering genes; it is highly likely that there are additional genes, and gene interactions between flowering genes and other genes that may not directly be involved in flowering, influencing flowering in barley. Further research into the genetic mechanism of flowering in barley should expand to include genome-wide genetic variants. Our gene-set association analysis detected key photoperiod response genes controlling FT. The photoperiod response gene Photoperiod 1 (Ppd-H1), located at chromosome 2H, is a pseudoresponse regulator gene. This gene has previously been identified as one essential gene for providing adaptation to photoperiod in barley by flowering induction under long days (Turner ). It is known that the Ppd-H1 dominant allele induces early flowering in wild and winter barley varieties, while recessive ppd-H1 delays flowering in spring barleys (Turner ; Jones ). The second photoperiod gene Ppd-H2, also known as HvFT3 in barley, located on chromosome 1H, was shown to regulate FT under short days (Börner ; Wang ). GSA identified HvCEN as a significant flowering gene, corroborating the report from Comadran . TFL1, the homologue of HvCEN, is a key regulator of FT by controlling the development of the inflorescence meristem in A. thaliana (Hanano and Goto, 2011). HvCEN and associated QTLs were also reported to be associated with components of grain yield traits in barley (Comadran ; Pasam and Sharma, 2014; Sharma ). Saade reported that the HvCEN locus promoted early FT, and resulted in higher grain yield, under salt stress conditions. Among the three light receptor phytochrome genes—HvPhyA, HvPhyB, and HvPhyC—identified as associated with FT in our GSA, HvPhyC has previously been reported as an essential component in photoperiodic flowering in barley (Faure ; Nishida ; Pankin ; Hill ). As phytochromes are involved in plants’ ability to intercept and translate light signals, they play a crucial role in modulating and regulating growth and development (Mathews, 2010). The HvPHYC gene was reported to interact with several other photoperiod response genes under different photoperiods (Pankin ). Meanwhile, existing evidence suggests that variation at the HvPHYC locus has no pleiotropic effects on important agronomic traits and starch pasting properties (Nishida ; Pankin ). As such, Ibrahim suggested that HvPHYC can be used effectively in barley breeding programmes to manipulate FT for yield improvement for varieties in stressful growing conditions. Circadian clock-controlled mechanisms enable plants to measure changes of photoperiod as a cue for seasonal changes in their environment and therefore control developmental transitions, such as from vegetative growth to initiating flowering (Shim ). Previous reports identified HvELF3 as one of the key genes affecting the circadian clock (Faure ; Zakhrabekova ), which was also confirmed in this study. The HvELF3 locus regulates flowering under the influence of photoperiod (Boden ). In A. thaliana, it is known that ELF3, LUX, and ELF4 form a protein complex, termed the evening complex (EC). This complex represses the expression of PRR9 and LUX (two core circadian components in A. thaliana) through binding to LUX-binding sites (reviewed in Shim ). Huang recently reported that the PhyB–ELF3 complex forms one of the signalling hubs that connects red light signalling with the circadian clock. It is not clear whether the circadian clock-controlled mechanisms involving ELF3, ELF4, LUX, PRR9, and PhyB operate in the same way in barley as in A. thaliana. However, HvELF3, HvELF4, HvLUX, HvPRR9, and HvPhyB were all identified as significant in controlling FT in our gene-set test. It is known that the early flowering of some barley genotypes is closely linked to gibberellin biosynthesis (Boden ). We identified 19 genes related to gibberellin biosynthesis [e.g. HvGA20ox1 (GA20 oxidase 1)] as significant flowering genes in the barley accessions we investigated. Our findings corroborate with the notion that gibberellin is an important signal in flower development in barley. In A. thaliana, paclobutrazol—a gibberellin biosynthesis inhibitor—significantly reduces the long hypocotyl and petiole phenotypes of Arabidopsis elf3 mutants (Filo ). As discussed above, ELF3 is a key gene in a tripartite transcriptional complex, the EC. Filo further suggested that the role of the EC in the regulation of gibberellin biosynthesis and flowering in dicots is shared with monocots and is a highly conserved mechanism for growth control. As such, mechanisms of the circadian clock-controlled pathway linking regulation of gibberellin biosynthesis and flowering induction, as reported in A. thaliana, may provide a useful template for exploring clock-controlled mechanisms in barley. Fourteen genes that were reportedly involved in the vernalization pathway have been identified in the GSA. The interaction of Vrn-H1, Vrn-H2, and Vrn-H3 has been reported as an important mechanism controlling flowering in response to vernalization in barley (von Zitzewitz ). HvBM5 (equivalent to HvVrn-H1), a MADS-box transcription factor gene, was identified as a significant flowering gene, and was also previously reported to promote the transition from the vegetative to the reproductive phase (Hemming ). In the interaction, HvVrn-H1 represses the expression of Vrn-H2 (a zinc-finger CONSTANS); in turn, that represses Vrn-H3 in regulating flowering as the response to vernalization (Yan et al., 2003, 2004). Vrn-H3 (equivalent to HvFT1) in barley was thought to be a central integrator of different FT pathways (Yan ). Yan also reported that the Vrn-H3 gene in both barley and wheat is responsible for natural allelic variation in vernalization requirement. Five FT genes (HvFT1, HvFT2, HvFT4, HvFT5, and HvFTL5) were identified as significantly influencing FT in this study. These genes were observed to play different roles in their response to photoperiod, while HvFT1 has an essential role in the transition from the vegetative growth to reproductive stage (Alqudah ). We identified 22 genes involving 19 two-way epistatic interactions in either promoting early flowering or delaying flowering. Epistatic interactions have previously been reported in barley. Yan et al. (2004, 2006) have previously reported significant two-way epistasis between vernalization genes VRN-H1 (syn. HvBM5) and VRN-H3 (syn. HvFT1), and between Vrn-H1 and Vrn-H2, to play an essential role in FT regulation in barley. Griffiths postulated that FT genes HvGI, Vrn-H2, Vrn-H1, and HvCO1 could be involved in two-way epistatic interactions. Cuesta-Marcos proposed that Vrn-H1 (HvBM5), Vrn-H2, Vrn-H3 (HvFT1), and Vrn4 could interact to determine vernalization sensitivity in barley. A few of the epistatic interactions revealed in this study could be linked to previously reported interaction in barley or A. thaliana. For example, the interaction of homozygous HvFT2 and HvGA2ox3 delayed flowering in our study, which is consistent with the previous report by Filo . Our results also demonstrate the extensive epistatic interactions controlling the FT between genes involved in response to photoperiod, circadian clock pathway genes, response to vernalization, and gibberellin biosynthesis (Fig. 5). HvELF7, a homologue of the RNA polymerase II-associated factor 1 gene, is notable. This gene interacted with six other genes involved with photoperiod and vernalization to induce flowering up to 10 d earlier. Its effects were consistent across our experimental locations and years, implying that its role is probably independent of environmental impacts. HvCO1 is another key gene identified in this study. HvCO1 and HvCO8 were involved in four epistatic interactions in influencing flowering. It is known that CONSTANS (CO) plays a crucial role in the photoperiodic regulation of flowering in A. thaliana (Kim ). At least eight homologues of CO-like genes (HvCO1–HvCO8) were identified in barley, but their roles in controlling the FT pathway are not clear (Griffiths ; Cockram ). Our findings for HvCO1 and other genes involved in photoperiodic regulation and vernalization could provide some testing hypothesis of the role of CO in the regulation of flowering in barley. Interestingly, HvCO1 was involved in epistatic interactions promoting early flowering, while HvCO8 interacting with HvSLN1 delayed flowering in the studied barley accessions, implying the possible different roles that different homologues of CO-like genes may play in the regulation of FT in barley. The broad epistatic interactions in the regulation of FT in barley as revealed in our study suggest the presence of other functional networks of genes involved in controlling FT. Based on the fact that more genes and their interactions were identified as important in regulating FT in barley, this study added more details to the gene regulatory network that Hill and Li (2016) proposed. Our results on epistatic interaction and proposed gene regulatory networks could provide further insight to refine the current model of the regulatory network controlling flowering in barley and other cereal crops (e.g. Woods ), while further studies, such as with knock-out accessions, may validate the observed interaction effects and regulatory network. The main environmental factors that influence FT include the ambient temperature and day length. In sorghum, temperature explained 69.4% of the variation in average FT in different environments (Li ). Similarly, our study found that 62.9% of the variation in FT of a barley line was due to variation in average temperature in the growing locations. Gene and environment interactions explained 3.85% (P<0.001) of variance for FT. This figure, although much less than that by the average temperature, was found to be highly significant. FT in barley is highly heritable. Broad-sense heritability of FT was estimated at 88% in wild barley (Herzig ). In maize, the genetic architecture of FT is predominantly determined by small additive loci with few environmental interactions, and FT is also highly heritable (h2 >0.85) (Buckler ). Our estimate of heritability from 2758 SNPs was 0.503 if the origin of the experimental accessions was included as a fixed effect, while it was only 0.395 if the origin was not specified. Previously, we reported that peak SNPs at the identified loci explained 31–78% of the phenotypic variance for phenology in different environments (Hill ). Both our current and previous estimates of heritability seem to be low, which could be explained by four aspects. (i) There may be more genes that are essential parts of the network regulating FT in barley yet to be captured in our study. For example, Bouché curated a database containing 306 genes that were reported to have functions and interactions within the flowering pathways in A. thaliana, while we analysed 174 putative genes. (ii) Causal SNPs related to FT could be located far from the known gene in its regulatory regions; therefore, SNP enrichment based on genes could fail to capture the effect. (iii) The epistatic effect could be more extensive because of the existence of a complex regulation network in controlling flowering. (iv) Broader sampling to include samples from broader genetic background and origin could be required. Future research that builds on the insights generated from this study, and with the aim of finding the missing heritability and the genes that are important in regulating FT in barley, will help to decipher the genetic mechanism of flowering regulation, and therefore facilitate barley breeding programmes to increase performance and grain yield under optimal cultivation conditions as well as under stress. GWAS has been a powerful tool to connect genomic variation (SNPs) to complex phenotype, while pinpointing the actual genes underlying biology is still not straightforward. Our previous research (Hill ) demonstrated that targeted enrichment of SNPs from function-related genes combined with GWAS could provide great opportunities to associate DNA variations with complex phenotypes in plants. In this study, we further demonstrated that GSA could provide higher power to detect genetic association than the analysis of SNPs individually. We suggest that GSA is particularly useful for dissecting the genetic determinants of complex traits such as FT, as it is likely that many SNPs with small effects contribute to these complex traits, while their effects are difficult to detect when testing SNPs individually (Holmans, 2010). Our research also shows that the incorporation of analysis of gene interaction and gene-set GWAS offers great promise in the characterization of the biological pathway of genetic determination of complex traits. It should be noted that, despite the power to connect sequence diversity to complex traits, GSA has its limits. First, GWAS analysis so far revealed that most of the significant SNPs fall within the category of non-protein coding, and many are a distance away from the known gene (Maurano ); it is not clear how far the flanking sequencing of each gene should be included in the mapping of SNPs to a gene set (Fridley and Biernacka, 2011). Further, as with the single SNP GWAS analysis, GSA reveals the genetic changes to be correlated with a particular phenotype; this does not mean that genes identified by the studies control the phenotype, which needs to be tested in controlled experiments.

Supplementary data

Supplementary data are available at JXB online. Fig. S1. Q–Q plots showing the low level of systematic biases in genome-wide association study (GWAS) results. Click here for additional data file.
  79 in total

1.  Coalescent-based association mapping and fine mapping of complex trait loci.

Authors:  Sebastian Zöllner; Jonathan K Pritchard
Journal:  Genetics       Date:  2004-10-16       Impact factor: 4.562

2.  Natural variation in Ghd7 is an important regulator of heading date and yield potential in rice.

Authors:  Weiya Xue; Yongzhong Xing; Xiaoyu Weng; Yu Zhao; Weijiang Tang; Lei Wang; Hongju Zhou; Sibin Yu; Caiguo Xu; Xianghua Li; Qifa Zhang
Journal:  Nat Genet       Date:  2008-05-04       Impact factor: 38.330

3.  Gene set analysis for interpreting genetic studies.

Authors:  Tune H Pers
Journal:  Hum Mol Genet       Date:  2016-08-10       Impact factor: 6.150

4.  Epistatic interaction between Arabidopsis FRI and FLC flowering time genes generates a latitudinal cline in a life history trait.

Authors:  Ana L Caicedo; John R Stinchcombe; Kenneth M Olsen; Johanna Schmitt; Michael D Purugganan
Journal:  Proc Natl Acad Sci U S A       Date:  2004-10-25       Impact factor: 11.205

Review 5.  Breeding technologies to increase crop production in a changing world.

Authors:  Mark Tester; Peter Langridge
Journal:  Science       Date:  2010-02-12       Impact factor: 47.728

6.  GSA-SNP: a general approach for gene set analysis of polymorphisms.

Authors:  Dougu Nam; Jin Kim; Seon-Young Kim; Sangsoo Kim
Journal:  Nucleic Acids Res       Date:  2010-05-25       Impact factor: 16.971

7.  Quality control and conduct of genome-wide association meta-analyses.

Authors:  Thomas W Winkler; Felix R Day; Damien C Croteau-Chonka; Andrew R Wood; Adam E Locke; Reedik Mägi; Teresa Ferreira; Tove Fall; Mariaelisa Graff; Anne E Justice; Jian'an Luan; Stefan Gustafsson; Joshua C Randall; Sailaja Vedantam; Tsegaselassie Workalemahu; Tuomas O Kilpeläinen; André Scherag; Tonu Esko; Zoltán Kutalik; Iris M Heid; Ruth J F Loos
Journal:  Nat Protoc       Date:  2014-04-24       Impact factor: 13.491

8.  An exhaustive epistatic SNP association analysis on expanded Wellcome Trust data.

Authors:  Christoph Lippert; Jennifer Listgarten; Robert I Davidson; Scott Baxter; Hoifung Poon; Hoifung Poong; Carl M Kadie; David Heckerman
Journal:  Sci Rep       Date:  2013-01-22       Impact factor: 4.379

9.  Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index.

Authors:  Jian Yang; Andrew Bakshi; Zhihong Zhu; Gibran Hemani; Anna A E Vinkhuyzen; Sang Hong Lee; Matthew R Robinson; John R B Perry; Ilja M Nolte; Jana V van Vliet-Ostaptchouk; Harold Snieder; Tonu Esko; Lili Milani; Reedik Mägi; Andres Metspalu; Anders Hamsten; Patrik K E Magnusson; Nancy L Pedersen; Erik Ingelsson; Nicole Soranzo; Matthew C Keller; Naomi R Wray; Michael E Goddard; Peter M Visscher
Journal:  Nat Genet       Date:  2015-08-31       Impact factor: 38.330

10.  Genomic and environmental determinants and their interplay underlying phenotypic plasticity.

Authors:  Xin Li; Tingting Guo; Qi Mu; Xianran Li; Jianming Yu
Journal:  Proc Natl Acad Sci U S A       Date:  2018-06-11       Impact factor: 11.205

View more
  6 in total

1.  Leaf thickness of barley: genetic dissection, candidate genes prediction and its relationship with yield-related traits.

Authors:  Zhi Zheng; Haiyan Hu; Shang Gao; Hong Zhou; Wei Luo; Udaykumar Kage; Chunji Liu; Jizeng Jia
Journal:  Theor Appl Genet       Date:  2022-03-29       Impact factor: 5.699

2.  Genomic structural equation modelling provides a whole-system approach for the future crop breeding.

Authors:  Tianhua He; Tefera Tolera Angessa; Camilla Beate Hill; Xiao-Qi Zhang; Kefei Chen; Hao Luo; Yonggang Wang; Sakura D Karunarathne; Gaofeng Zhou; Cong Tan; Penghao Wang; Sharon Westcott; Chengdao Li
Journal:  Theor Appl Genet       Date:  2021-05-31       Impact factor: 5.699

3.  Comparative de novo transcriptome analysis identifies salinity stress responsive genes and metabolic pathways in sugarcane and its wild relative Erianthus arundinaceus [Retzius] Jeswiet.

Authors:  P Vignesh; C Mahadevaiah; R Parimalan; R Valarmathi; S Dharshini; Singh Nisha; G S Suresha; S Swathi; H K Mahadeva Swamy; V Sreenivasa; K Mohanraj; G Hemaprabha; Ram Bakshi; C Appunu
Journal:  Sci Rep       Date:  2021-12-31       Impact factor: 4.379

4.  Multi-environment genome -wide association mapping of culm morphology traits in barley.

Authors:  Gianluca Bretani; Salar Shaaf; Alessandro Tondelli; Luigi Cattivelli; Stefano Delbono; Robbie Waugh; William Thomas; Joanne Russell; Hazel Bull; Ernesto Igartua; Ana M Casas; Pilar Gracia; Roberta Rossi; Alan H Schulman; Laura Rossini
Journal:  Front Plant Sci       Date:  2022-09-23       Impact factor: 6.627

5.  The role of epistasis and its interaction with environment in fine-tuning heading time in barley.

Authors:  Nikolai K Christov
Journal:  J Exp Bot       Date:  2020-01-23       Impact factor: 6.992

6.  Genome-wide association and epistatic interactions of flowering time in soybean cultivar.

Authors:  Kyoung Hyoun Kim; Jae-Yoon Kim; Won-Jun Lim; Seongmun Jeong; Ho-Yeon Lee; Youngbum Cho; Jung-Kyung Moon; Namshin Kim
Journal:  PLoS One       Date:  2020-01-22       Impact factor: 3.240

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.