Literature DB >> 31125482

Strong temporal dynamics of QTL action on plant growth progression revealed through high-throughput phenotyping in canola.

Dominic Knoch1, Amine Abbadi2, Fabian Grandke3, Rhonda C Meyer1, Birgit Samans3, Christian R Werner3, Rod J Snowdon3, Thomas Altmann1.   

Abstract

A major challenge of plant biology is to unravel the genetic basis of complex traits. We took advantage of recent technical advances in high-throughput phenotyping in conjunction with genome-wide association studies to elucidate genotype-phenotype relationships at high temporal resolution. A diverse Brassica napus population from a commercial breeding programme was analysed by automated non-invasive phenotyping. Time-resolved data for early growth-related traits, including estimated biovolume, projected leaf area, early plant height and colour uniformity, were established and complemented by fresh and dry weight biomass. Genome-wide SNP array data provided the framework for genome-wide association analyses. Using time point data and relative growth rates, multiple robust main effect marker-trait associations for biomass and related traits were detected. Candidate genes involved in meristem development, cell wall modification and transcriptional regulation were detected. Our results demonstrate that early plant growth is a highly complex trait governed by several medium and many small effect loci, most of which act only during short phases. These observations highlight the importance of taking the temporal patterns of QTL/allele actions into account and emphasize the need for detailed time-resolved analyses to effectively unravel the complex and stage-specific contributions of genes affecting growth processes that operate at different developmental phases.
© 2019 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

Entities:  

Keywords:  Brassica; biomass; genome-wide association studies; growth dynamics; high-throughput phenotyping; vegetative development

Mesh:

Year:  2019        PMID: 31125482      PMCID: PMC6920335          DOI: 10.1111/pbi.13171

Source DB:  PubMed          Journal:  Plant Biotechnol J        ISSN: 1467-7644            Impact factor:   9.803


Introduction

Canola/rapeseed (Brassica napus L., AACC, 2n = 38) is the leading oilseed crop in Canada, Australia, China and Europe and second globally. Its oils have diverse uses, including food, industrial feedstock and as an environmentally friendly renewable energy source (Lu et al., 2011). Early plant growth and biomass formation are crucial traits for productivity and yield, and plant biomass has been shown to be correlated with canola yield at the mature stage (Zhao et al., 2016). Moreover, early‐stage growth is of special importance for young seedlings to provide efficient ground coverage and to avoid competition with weeds in the fields. Today, hybrid varieties dominate the seed market due to their superior early vigour, yield potential and yield stability. Heterosis manifests at a very early stage of seedling development in canola (Basunanda et al., 2010) and plays a key role in field establishment; hence, a better understanding of early plant growth is of great importance for breeding. The availability of the Brassica napus reference genome sequence (Chalhoub et al., 2014) and a 60K SNP genotyping array (Clarke et al., 2016) has enabled genomic studies to greatly improve our understanding of the genetic basis underlying key agronomic traits. Vegetative plant biomass accumulation and growth are under complex genetic control and are strongly influenced by the environment (Shi et al., 2009; Zhao et al., 2016). Thus, dissecting the genetic basis of vegetative plant growth, early plant height and biomass production is of high relevance to fundamental research and to crop improvement strategies. Previous studies applied quantitative trait locus (QTL) mapping and genome‐wide association analyses to identify QTL/alleles for growth (Yong et al., 2015), yield (Luo et al., 2017; Radoev et al., 2008) and yield‐related traits (Cai et al., 2016; Chen et al., 2007; Dong et al., 2018; Yang et al., 2012). In some cases, genes underlying these QTL were also identified (Li et al., 2018; Liu et al., 2015; Zeng et al., 2011). However, most of these studies have focused on single time points, although gene expression patterns are known to change during developmental progression. Technological advances have resulted in the availability of high‐throughput phenotyping (HTP) offering a non‐invasive, image‐based method to analyse complex traits (Barabaschi et al., 2016). Consequently, many aspects of plant growth and morphological traits have been studied in depth for diverse model and crop plants, including Arabidopsis (Granier et al., 2006; Hartmann et al., 2011; Tisné et al., 2013), maize (Cabrera‐Bosquet et al., 2016; Junker et al., 2015; Zhang et al., 2017), rice (Hairmansis et al., 2014; Schilling et al., 2015; Yang et al., 2014), barley (Honsdorf et al., 2014; Neumann et al., 2015) and rapeseed (Fanourakis et al., 2014; Hatzig et al., 2015; Kjaer and Ottosen, 2015). These new platforms and techniques allow the efficient generation of multiple time point measurements and to assess plant growth and development over time. Furthermore, time is introduced as an additional dimension to association studies. In Arabidopsis, previous analyses of projected leaf area at 12 different time points, parameters derived from growth models and end‐point biomass data revealed time‐specific and general QTL affecting growth dynamics (Bac‐Molenaar et al., 2015). Similar observations were made regarding temporal patterns of biomass accumulation in barley (Neumann et al., 2017), plant development and height in triticale (Busemeyer et al., 2013; Würschum et al., 2014a; Würschum et al., 2014b) and temporal expression of tiller number in wheat (Ren et al., 2018). Dynamic QTL for plant height and for stress‐responsive and several root traits at different developmental stages was recently reported in upland cotton (Liang et al., 2014; Pauli et al., 2016; Shang et al., 2016). In triticale, genetic dynamics underlying biomass yield were assessed in three developmental stages (Liu et al., 2014). Interestingly, besides detecting QTL active in all stages, some QTL contributed only in one or two of the stages to biomass development. Moreover, a recent study in maize assessed the genetics of growth dynamics at 11 different developmental time points and reported main effect QTL and epistatic interactions with different patterns of expression and reversing allelic effects (Muraya et al., 2017). In B. napus, dynamic QTL for plant height were described that showed opposite genetic effects in different periods/stages and experiments (Wang et al., 2015). However, multiple time point analyses to uncover the genetic basis for biomass and growth as dynamic traits have so far not been addressed in canola. In summary, the studies mentioned above highlight the need to investigate QTL/allele effects by time‐series data to efficiently elucidate growth processes and to detect stage‐specific loci that would likely be missed by analysing single or end‐point data only. Hence, here we investigated a genetically diverse population of 477 spring‐type B. napus lines from a canola breeding programme by daily automated high‐throughput phenotyping and performed genome‐wide association analyses throughout an early vegetative phase to address the following questions: (i) Which key genomic regions are associated with growth‐related traits and relative growth rates in the early phase of vegetative plant development? (ii) To what extent do identified regions contribute to trait variance? (iii) Can we resolve dynamic, stage‐specific contributions of loci for early growth by a time course analysis? and (iv) Are we able to nominate candidate genes that might be causal for the observed marker–trait associations?

Experimental procedures

Genetic material and plant cultivation conditions

The experimental materials consisted of a total of 477 genotypes (Data S1) from a diverse population of spring‐type B. napus canola with double‐low seed quality (low erucic acid, low glucosinolate content). Plants were cultivated under controlled environmental conditions in an incomplete randomized block design (Data S2) in four glasshouse experiments in spring and winter 2014. Data of an additional experiment with a selection of 120 hybrids were included in the calculation of the BLUEs but not in the GWAS, as no array data are available. Experiments were carried out in the IPK phenotyping facility for large plants (Junker et al., 2015) comprising a cultivation, transportation and imaging system with 396 mobile carriers. Each genotype was replicated three times. A container with nine plants comprised one replicate. Four lines were included as checks in higher replication (‘Achat’ n = 12, ‘Campino’ (CR 3430) n = 12 and the two male sterile testers ‘M1’ and ‘M2’ each n = 6 per experiment, respectively) in all cultivations. Plants were grown in large 25‐litre square containers (Bamaplast S.r.l., Massa e Cozzile, Italy) in red substrate 2 (Klasmann‐Deilmann GmbH, Geeste, Germany) to provide enough space for the plants to grow and to avoid pot size effects (Poorter et al., 2012), and covered with a blue rubber mat to facilitate image analysis. A controlled climate regime was applied, on the one hand to mimic natural field‐like conditions and on the other hand to ensure consistency of conditions among the experiments. Temperatures were kept constant with 10 °C (dark phase) and 15 °C (light phase) during the entire growth period, and the natural radiation was supplemented by additional illumination of 205–245 μmol/m2/s PAR using SonT Agro high‐pressure sodium lamp. The light period was set to 16‐h light from 06:00 h to 22:00 h. These conditions correspond to a typical early spring in central Europe. Relative air humidity was kept at a minimum of 65%. Initially, nine plants per container were cultivated. To ensure homogenous plant density, two seeds per position were sown, but were thinned to one seedling per position at 5 days after sowing (DAS). Before sowing, seeds were stratified for three days at 4 °C on moist filter paper in Petri dishes to trigger uniform germination. At 14 DAS, four plants per container were sampled to provide enough material for subsequent molecular/biochemical analyses. The remaining five plants were grown until 28 DAS. Watering was performed with an automated balance/watering station by target weight of the containers to maintain 80% field capacity, pH 5.5. Plants were shuffled each day by one row and every second day by one block (11 neighbouring carriers in one row) in the system to minimize position effects.

High‐throughput plant phenotyping

Over a duration of three weeks (between 6 DAS and 28 DAS), plants were subjected to a daily imaging routine involving automated capturing of top and side view images. Two types of illumination and camera systems in the IPK automated non‐invasive plant phenotyping system for large plants were used as described in Junker et al. (2015). Visible light (VIS) and static fluorescence (FLUO) image data were acquired. Each carrier was imaged with two camera systems with four/three side views taken at (0°, 45°, 180° and 225°) from 6 to 13 DAS and (0°, 45° and 135°) from 15 to 27 DAS. Shoot fresh weight (g) was determined on the basis of all five plants by cutting the shoots directly above the ground level and by weighing using a medium‐scale balance at 28 DAS. Dry weight was measured after drying the plant material for 3 days at 80 °C.

Automated phenotypic data analysis

Automated image analysis was performed using the IPK Integrated Analysis Platform, IAP version 2.07 (Klukas et al., 2014) implementing a customized pipeline combining top and side view images, and including image preprocessing, segmentation and feature extraction. Estimated biovolume, ‘combined geometry vis volume iap (voxel)’, was estimated combining information from top and side view images (Junker et al., 2015). Projected leaf area ‘top geometry vis area (px2)’ was derived from VIS top view, early plant height ‘side geometry fluo height (px)’ from FLUO side view, and plant colour uniformity ‘side intensity vis lab a stddev’ from VIS side view images, respectively. Colour uniformity is given as the standard deviation of the a‐values in the L*a*b* colour space of the plant pixels. The lower this value, the more uniform is the plant colour. Leaf colour differs between young and old leaves, and therefore, this trait may act as a proxy for the range of maturation stages of leaves within a given plant and thus of its rate of development. Traits represent information obtained from the analysis of whole containers, including nine plants at early and five plants at later stages, respectively.

Data normalization and statistical analyses

All statistical analyses were performed in R version 3.4.2 software environment for statistical computing (R Core Team, 2018) and graphics and RStudio version 1.1.419. Image‐derived traits were obtained from 6 DAS to 13 DAS and from 15 DAS to 27 DAS. An outlier correction was performed in a combined approach of manual exclusion (carriers with insufficiently germinated plants) and a threshold‐based filtering procedure (median ± 3 standard deviations) for each experiment, day and trait separately. We performed a single‐step analysis of the phenotypic data. Best linear unbiased estimators (BLUEs, Data S3) were estimated in R {lme4} (Bates et al., 2015) based on a linear mixed model for each image‐derived phenotypic trait and each day separately (Eq. 1) or in case of end‐point biomass data (Eq. 2). In the models, Y denotes the phenotypic value of a trait for each genotype, G represents the fixed effect of the genotype, E the random effect of the Experiment, GxE the genotype–experiment interaction, C the random effect of the included checks, CxE the check–experiment interaction, P the position in the pot, PxE the position–experiment interaction and e the residual error (errors were assumed to be normally, independently and identically distributed). Broad‐sense heritabilities (H 2) for each trait were estimated by Eq. 3, where and denote the variance components of the genotype and the residual variance, respectively, and n 0 the number of experiments or in case of the end‐point biomass data the number of plant replicates per genotype (He et al., 2016; Nakagawa and Schielzeth, 2010). Variance components and were extracted from the mixed linear models (Eq. 1 or Eq. 2) in R {lme4} assuming that all effects were random effects.

Calculation of absolute change and relative growth rates

Absolute change rates (ACRs) and relative growth rates (RGRs) were calculated using previously published procedures (Hunt, 1990). Relative growth rates (RGRs) were determined for the estimated biovolume, projected leaf area and early plant height (Eq. 4). To compensate for a potential growth bias due to the applied plant rotation/shift in image acquisition, growth rates were calculated with minute precision, as image acquisition date and time were documented. In addition, absolute change rates (ACRs) were calculated (Eq. 5) for plant colour uniformity. BLUEs for ACR and RGRs were subsequently estimated as described above (Eq. 1).

Reference genome version and gene annotations

To ensure the unique positioning of as many markers as possible, an enhanced version of the Brassica napus cv. Darmor‐bzh v4.1 reference genome assembly (Chalhoub et al., 2014) was used, generated by incorporating long read information (NRGene, DeNovoMAGIC™; unpublished data from David Edwards, University of Western Australia) into the pseudomolecules. Transcripts were predicted de novo using a MAKER pipeline with AUGUSTUS and SNAP. Subsequently, the transcriptome was annotated by mapping the transcript sequences on: (i) the B. napus Darmor‐bzh v.4.1, (ii) a concatenated Brassica AC‐genome assembly comprising the B. rapa v1.5 (Wang et al., 2011) and the B. oleracea TO1000 (Parkin et al., 2014), and (iii) the Arabidopsis thaliana TAIR10 transcriptomes, respectively, using the basic local alignment search tool (BLAST). Transcripts were counted as hit if they reached a minimum similarity of 80% over 40% of the target transcript. If annotations in multiple genomes were obtained, they were prioritized in the order B. napus, B. oleracea/rapa, A. thaliana. The B. napus Darmor‐bzh v.4.1, B. rapa Chiifu‐401–42 and B. oleracea TO1000 transcriptomes were functionally annotated using Blast2GO (Conesa et al., 2005) version 3.0.8 with default settings. The Arabidopsis TAIR transcriptome annotations were downloaded from the TAIR homepage (https://www.arabidopsis.org/).

Genotype data

All 477 genotypes were genotyped using the Brassica Infinium 60k genotyping array (Illumina Inc., San Diego, CA, USA) as described previously by Jan et al. (2016). Raw data were initially filtered to exclude SNPs without positional information in the Brassica rapa and Brassica oleracea genomes. SNP genotypes were called using R and the package {gsrc} (Grandke et al., 2017). To identify copy number variations (CNVs), the SNP positions together with the signal intensity values were used to define blocks of similar intensity. If the blocks' values exceed the applied thresholds, they are classified as deletions or duplications. This set of copy number variations, also generated with the package {gsrc}, complemented the obtained SNP data. Subsequently, probe oligonucleotide sequences were mapped to the enhanced Brassica napus cv. Darmor‐bzh reference genome assembly using the basic local alignment search tool (BLAST) with 95% similarity over a length of 50 bp. Markers showing multiple BLAST hits in the genome were removed. For genome‐wide association studies (GWAS), SNPs were coded in numerical format (0 = AA, 1 = AB, 2 = BB) using R {GAPIT} (Lipka et al., 2012; Tang et al., 2016). CNVs were included as 0 = normal, 2 = deletion/duplication, whereby the reciprocal events, either duplications or deletions, were treated as missing values. For CNVs, positions were shifted by ±1 bp to avoid identical marker positions. Furthermore, markers with minor allele frequencies (MAF) smaller than 0.01 and markers with more than 10% missing values or more than 25% heterozygous alleles were removed. A total of 16 311 markers comprising 13 201 unique, single‐copy SNPs, 3106 deletions and 4 duplications remaining after filtering were used for analyses (Data S4). Population structure was analysed using the programme STRUCTURE, version 2.3.4 (Pritchard et al., 2000), marker data of all 477 lines and the ‘admixture’ model. Population clustering for K = 1–10 was performed with a burn‐in period of 10 000, 10 000 MCMC replications and three iterations per K. The lambda parameter was inferred and adjusted to λ = 0.304. The mean Ln probability [L(K)] and population clustering for K = 2–5 are shown in Figure S9.

Genome‐wide association studies (GWAS)

Recently, a new method for genome‐wide association studies, FarmCPU (Fixed and random model Circulating Probability Unification), has been proposed by Liu et al. (2016a), which controls false positives and effectively reduces false negatives. The method iteratively performs marker tests with pseudoquantitative trait nucleotides (QTNs) as covariates in a fixed‐effects model and optimization on pseudo‐QTNs in a random‐effects model. To some extent, this process is capable to remove the confounding between testing markers and kinship, to prevent overfitting of the model and to control false positives simultaneously. Genome‐wide association study (GWAS) analyses were conducted in R version 3.4.3 {FarmCPU} on BLUEs of the traits of the 477 canola lines using the filtered set of 16 111 numerically coded SNP (n = 13 201) and CNV (n = 3110) markers. Analyses were performed in RStudio on a CentOS 7.2 Linux server (HP ProLiant DL580 Gen9 with HP D3600 Array, 4x Intel Xeon E7‐8880v3@2.3 GHz processors, 144 cores, 1TB RAM, 2x480 GB SSD, 2x 600GB SAS, 12x 8TB SAS). As the programme does not allow for missing marker information in the numeric genotype input file, missing data were replaced by the heterozygous allele. Kinship was calculated using the FARM‐CPU algorithm. Principal component analysis was performed on the centred genotype data using the pca function in R {pcaMethods} (Stacklies et al., 2007). The first ten principal components (PCs) were calculated, and the first four PCs were included into the GWAS model to correct for hidden population structure. The maxLoop parameter was increased to 100, and the optimal threshold for P‐value selection of the model in the first iteration was estimated by the FarmCPU.P.Threshold function and set to 0.00001 for all traits. Subsequently, P‐values of marker–trait associations were adjusted for multiple comparisons using FDR (Benjamini and Hochberg, 1995). Only associations with adjusted P‐values smaller 0.1 were considered as statistically significant and used for further analyses. The phenotypic variance explained (PVE%) by a significant marker was estimated in R (Eq. 6). The sum of squares (SS) and residuals (e) were extracted from the ANOVA fitted with a linear model incorporating the phenotypic values and all significant markers in decreasing order of their P‐value.

LD analysis and candidate gene identification

Pairwise linkage disequilibrium (LD) was analysed for each chromosome in R {LDheatmap} (Shin et al., 2006) for the SNP marker data across all 477 canola lines. Linkage disequilibrium decay was calculated in R for both subgenomes separately (Hill and Weir, 1988; Marroni et al., 2011; Remington et al., 2001), as larger differences between the A and C subgenome have been reported (Wu et al., 2016). Candidate gene regions were defined as LD blocks harbouring a significant trait‐associated marker in which flanking markers had strong LD (r 2 > 0.6), and were extended to the left and right unrelated marker, respectively. All genes within the respective LD block were considered for candidate gene identification. For significant markers outside of LD blocks, the 100 kb flanking regions on either side were searched for candidate genes as suggested by Zhou et al. (2017a). Candidate genes were prioritized according to their annotation and gene ontology (GO). A comprehensive list of all genes within the intervals and selected candidate genes for all evaluated traits can be found in Data S5.

Results and discussion

Capturing growth dynamics by high‐throughput phenotyping

In the present study, a diverse spring‐type canola breeding population consisting of 477 genotypes with ‘double‐low’ seed quality (low erucic acid, low glucosinolate content) was investigated at an early vegetative growth phase. We applied automated high‐throughput phenotyping daily using the previously described IPK phenotyping platform for large plants (Junker et al., 2015) and performed image analysis with our in‐house image analysis pipeline (IAP) to derive estimations of growth‐related traits at multiple time points (Klukas et al., 2014). The IPK systems have previously been shown to be capable of efficiently tracing plant growth in various species including Arabidopsis (Junker et al., 2015; Tschiersch et al., 2017), rice (Schilling et al., 2015), barley (Chen et al., 2014; Neumann et al., 2015), maize (Muraya et al., 2017) and rapeseed (Pommerrenig et al., 2018). Examples of acquired raw plant images are provided in Figure S1. After quality checks, estimates of biovolume, projected leaf area, early plant height as well as colour uniformity were obtained for 21 consecutive time points from 6 to 27 DAS, covering approximately the first growth phase of canola development from completely unfolded cotyledons to four or more unfolded leaves. All four traits showed broad phenotypic variation resulting in medium to high coefficients of variation (Data S3), with highest values for biovolume and lowest values for colour uniformity. Biovolume and projected leaf area displayed exponential increases over time, while early plant height increased in a linear manner. Colour uniformity increased during the first days, but remained at a rather constant level during the later phase (Figure S2a–d). Image‐derived phenotypes were complemented by manually determined end‐point fresh weight (FW) and dry weight (DW) values at 28 DAS (Figure S3). Both fresh weight and dry weight were strongly correlated (r = 0.969, Figure S3) and highly correlated with the image‐derived biovolume estimates at the latest time point, with r = 0.929 and r = 0.926 for FW and DW at 27 DAS, respectively (Data S6). These high correlations indicate that biovolume estimates can serve as a suitable proxy for the actual plant biomass. To assess the repeatability and quality of the phenotypic data, broad‐sense heritabilities (H 2) were estimated (Figure S4, Data S3). Over the whole experiment, H2 for image‐derived phenotypes ranged between 0.528 (early plant height at 15 DAS) and 0. 874 (projected leaf area at 26 DAS). High H 2 values of 0.895 and 0.878 were also obtained for fresh weight and dry weight, respectively, facilitating the temporal analysis of trait relationships and forming a solid basis for genetic analyses.

Genomic data, copy number variations and population structure

Genotyping of the 477 lines was performed on the Brassica 60k SNP Infinium consortium array (Illumina Inc., San Diego, CA, USA) as described previously by Jan et al. (2016). In addition to single nucleotide polymorphisms (SNPs), copy number variation (CNV) and presence–absence variation (PAV) can provide complementary and valuable information, potentially associated with phenotypic changes (Stein et al., 2017). To make use of this additional source of genetic information, SNPs and CNVs and PAVs were called in a combined approach from the array data as previously described (Grandke et al., 2017). A total of 16 311 markers comprising 13 201 unique, single‐copy SNPs, 3106 deletions and four duplications (Data S4) were jointly used in the subsequent genome‐wide association study. Pairwise marker LD matrices (R 2) were calculated for each chromosome and LD decay derived for both subgenomes (A & C) separately based on the SNP data (Figure S5). In concordance with previous studies (Wu et al., 2016; Zhou et al., 2017b), a faster LD decay was detected in the A subgenome compared to the C subgenome, with half‐decay values of approximately 400 kb and 3.9 Mb determined for the A and C subgenome, respectively. In addition, multiple larger genomic regions of high LD (R 2 > 0.6) were detected, especially on the C‐subgenome chromosomes (Data S7), pointing to conserved regions preferentially selected during the process of breeding or the presence of larger structural variations within the population compared to the reference genome. A principal component analysis (PCA) of the population was performed using the combined SNP and CNV data sets. The first ten principal components explain a cumulative variance of approx. 40% (PC1: 16.9%, PC2: 4.4%, PC3: 3.6%, PC4: 3.3%, PC5: 2.8%, PC6: 2.0%, PC7: 1.8%, PC8: 1.8%, PC9: 1.6% and PC10: 1.5%). The PCA indicates the existence of population structure (Figure 1), coinciding with the known affiliation of the lines to the three breeding pools within the population (Jan et al., 2016). As breeding pools do not necessarily reflect the genetic structure of the population, we additionally performed a population structure analysis using STRUCTURE (Pritchard et al., 2000). The analysis indicates the presence of two larger population groups and several potential subpopulations (Figure S9). The first three clusters coincide to a substantial degree with the breeding pools, but many individuals show pronounced admixture. As a consequence, the first four principal components, each accounting for more than 3% of the total variance, were included as covariates into the GWAS analysis, as recommended by the developers of the {FarmCPU} R package.
Figure 1

Visualization of breeding pools by principal component analysis (PCA). PCA was performed on 477 canola lines using a panel of 13 201 SNP and 3110 CNV markers. Proportions of explained variance of principal components (PCs) 1, 2 and 3 are indicated on the axes. Different colours correspond to canola breeding pools from which the investigated lines were selected.

Visualization of breeding pools by principal component analysis (PCA). PCA was performed on 477 canola lines using a panel of 13 201 SNP and 3110 CNV markers. Proportions of explained variance of principal components (PCs) 1, 2 and 3 are indicated on the axes. Different colours correspond to canola breeding pools from which the investigated lines were selected.

Predominantly small and medium effect marker–trait associations contribute to variation in growth

BLUEs of image‐derived phenotype data for projected leaf area, estimated biovolume, early plant height and colour uniformity at 21 time points, as well as manually determined biomass (FW and DW) at 28 DAS, were used for genome‐wide association studies using Fixed and random model Circulating Probability Unification, R {FarmCPU} (Liu et al., 2016b). This method features a low rate of false‐positive associations and a fast runtime, which, in combination with parallelization, allows the analysis of multiple phenotypic traits in a reasonable period of time. Although relatively new, the method was already applied successfully in several different studies, for example to identify genetic loci for drought tolerance in maize (Li et al., 2016a), plant height in maize (Hu et al., 2017), salt tolerance in cowpea (Ravelombola et al., 2017), seed traits in soybean (Wang et al., 2018), or tolerance to preharvest sprouting and low falling numbers in wheat (Martinez et al., 2018). For manually determined biomass, 22 significant marker–trait associations (MTAs) were detected at P‐value(FDR) ≤ 0.1 (Figure 2 and Table 1), with thirteen and nine MTAs for fresh weight and dry weight, respectively. Although FW and DW were highly correlated (r = 0.969), these traits are not redundant, as similar FW values may break down into different contributions of FW components. Despite the high phenotypic correlation, only three shared MTAs for FW and DW, one on chromosome A10 and two on C02, were identified. We compared the 22 significant MTAs for fresh weight and dry weight to a list of 771 previously described QTL obtained from 13 publications analysing 45 growth, yield and quality‐related traits (Körber et al., 2015, 2016; Li et al., 2016b; Li et al., 2014, 2011, 2017; Liu et al., 2016a; Lu et al., 2016; Luo et al., 2015; Sun et al., 2016; Tang et al., 2015; Wang et al., 2016; Zheng et al., 2017). The marker ‘Bn‐scaff_18702_1‐p589589’ has been shown to be associated with plant height (Tang et al., 2015). Seven other MTAs were in proximity (±500 kb, based on NRGene marker positions) to previously described QTL: ‘Bn‐A04‐p4409752’ close to a QTL for stem dry weight (Lu et al., 2016); ‘Bn‐A10‐p11817272’ close to a QTL for plant height (Sun et al., 2016) and QTL for branch angle (Li et al., 2017); ‘Bn‐A07‐p9632473’ colocalized with a QTL for flowering time (Wang et al., 2016); ‘Bn‐A08‐p16771030’ close to QTL for biomass yield and stem dry weight (Lu et al., 2016) and a QTL for branching angle (Li et al., 2017); ‘Bn‐A10‐p10672359’ in proximity to a QTL for biomass yield, a QTL for stem dry weight (Lu et al., 2016) and a QTL for plant height (Sun et al., 2016); ‘Bn‐A10‐p13343454’ close to another QTL for branching angle (Li et al., 2017); ‘Bn‐scaff_21312_1‐p895326’ close to QTL for stem dry weight, a QTL for biomass yield (Lu et al., 2016) and a QTL for plant height (Li et al., 2016b).
Figure 2

Manhattan and quantile–quantile plots for end‐point vegetative biomass. Genome‐wide marker–trait associations for end‐point biomass determined at 28 DAS. a Manhattan plot (left) and quantile–quantile plot (right) for fresh weight (FW). b Manhattan plot (left) and quantile–quantile plot (right) for dry weight (DW). GWAS was performed in R {FarmCPU} on BLUEs estimated using three replicates (carriers) with five plants each. Significant marker–trait associations (MTAs) are shown with marker IDs. MTAs with P‐values(FDR) ≤ 0.05 or 0.1 are indicated by red and orange dots, respectively.

Table 1

Information about markers associated with end‐point biomass

TraitMarker_IDChromosomePosition (bp)MAF P‐value P‐value (FDR) EffectPVE%*
Fresh weightBn‐A04‐p4409752A045 462 5870.49371.89E‐080.00021.07628.64
Fresh weightBn‐A01‐p7850092A051 595 5850.04308.62E‐070.0028−2.67574.01
Fresh weightBn‐A07‐p9632473A0715 644 8700.27151.05E‐050.0191−0.91275.07
Fresh weightBn‐A07‐p5114831A085 664 0050.36375.54E‐080.00030.91701.75
Fresh weightBn‐A08‐p16771030A0826 455 0710.29664.72E‐090.0001−1.23571.37
Fresh weightBn‐A10‐p10672359A1010 601 8450.30199.28E‐060.01910.81440.02
Fresh weightBn‐A10‐p13343454A1012 120 3570.21171.48E‐070.0006−1.15022.16
Fresh weightBn‐scaff_16116_1‐p487063_delC0225 078 4530.06293.01E‐050.0377−0.87490.33
Fresh weightBn‐scaff_18702_1‐p589589C0227 593 7100.06398.45E‐060.0191−1.44391.21
Fresh weightBn‐scaff_16545_1‐p862530 C0250 263 1200.48741.05E‐050.0191−0.88231.51
Fresh weightBn‐scaff_21705_1‐p175010 C0254 034 0640.33441.60E‐050.02610.86041.33
Fresh weightBn‐scaff_16200_1‐p503123C0717 183 6550.31762.52E‐050.03421.51860.57
Fresh weightBn‐scaff_16197_1‐p3022518C0841 613 0710.28091.92E‐050.02850.74340.44
Dry weightBn‐A04‐p9426523A0413 935 8290.19081.71E‐050.0466−0.07052.11
Dry weightBn‐A10‐p11817272A072 411 9210.20021.63E‐060.0066−0.07554.85
Dry weightBn‐A10‐p13343454 A1012 120 3570.21173.05E‐050.0709−0.06233.56
Dry weightBn‐scaff_16804_1‐p178142C029 108 1490.11221.07E‐070.0009−0.11865.08
Dry weightBn‐scaff_16545_1‐p862530 C0250 263 1200.48744.81E‐050.0872−0.04110.00
Dry weightBn‐scaff_21705_1‐p175010 C0254 034 0640.33444.03E‐070.00220.07361.48
Dry weightBn‐scaff_21312_1‐p895326C0311 220 9630.03988.64E‐090.00010.27355.91
Dry weightBn‐scaff_15766_1‐p117110C0714 697 0100.29046.13E‐060.02000.12301.47
Dry weightBn‐A10‐p10083397_delC0959 994 6010.02733.70E‐050.07550.12830.89

Estimated percentage of phenotypic variance explained by the marker.

Common MTAs shared between fresh weight and dry weight.

Manhattan and quantile–quantile plots for end‐point vegetative biomass. Genome‐wide marker–trait associations for end‐point biomass determined at 28 DAS. a Manhattan plot (left) and quantile–quantile plot (right) for fresh weight (FW). b Manhattan plot (left) and quantile–quantile plot (right) for dry weight (DW). GWAS was performed in R {FarmCPU} on BLUEs estimated using three replicates (carriers) with five plants each. Significant marker–trait associations (MTAs) are shown with marker IDs. MTAs with P‐values(FDR) ≤ 0.05 or 0.1 are indicated by red and orange dots, respectively. Information about markers associated with end‐point biomass Estimated percentage of phenotypic variance explained by the marker. Common MTAs shared between fresh weight and dry weight. Genome‐wide association analyses performed for data measured at all 21 time points with the moderate threshold (P‐value(FDR) ≤ 0.1) revealed a total of 787 MTAs, including 191 associations for estimated biovolume, 200 MTAs for projected leaf area, 182 MTAs for early plant height and 192 MTAs for colour uniformity, respectively. There were no substantial differences in the number of associations between the A and the C subgenomes. The majority of detected associations could be attributed to unique, single‐copy SNP markers (84% of all associations). A substantial number of CNVs (deletions and duplications) also showed trait associations independently of the two SNP alleles (Grandke et al., 2017; Mason et al., 2017). In particular, segmental deletions caused by widespread homoeologous exchanges (Hurgobin et al., 2018; Samans et al., 2017) were associated with trait variation as previously described (Hatzig et al., 2018; Qian et al., 2016; Schiessl et al., 2017; Stein et al., 2017). To reduce the list to robust candidate regions, detected MTAs were further filtered to retain only loci showing significant associations for at least three consecutive time points (Data S8). Most of the detected MTAs explained only a small percentage of phenotypic variance (<5 PVE%, Figure S6) and were randomly distributed over the subgenomes. Only 40 (3.8%) marker–trait associations with larger effects (>5 PVE%) were detected, for example Bn‐A04‐p4409752 explaining up to 8.64% PVE of biomass (fresh weight). These findings strengthen the hypothesis that biomass accumulation and growth‐related traits are mostly governed by small effect loci and their interactions.

Dynamic genetic patterns and time interval‐specific QTL for early vegetative growth

The time‐resolved design of the phenotyping experiments enabled us to track the effects of individual markers over the course of 21 days of early growth between 6 and 27 DAS. Previous studies in Arabidopsis (Bac‐Molenaar et al., 2015) and maize (Muraya et al., 2017) also addressed the dynamics of growth, but few studies provided such a high temporal resolution at a daily basis. Markers that displayed sequentially significant association with our measured phenotypic traits for multiple consecutive time points were evaluated in more detail. In summary, 14, 9, 4 and 3 MTAs for projected leaf area, estimated biovolume, early plant height and colour uniformity were detected to be significant at three consecutive days, respectively (Figure 3, Data S8). In accordance with the theory of developmental genetics that genes are expressed selectively at different developmental stages, our data indicate that plant growth is the cumulative result of the interaction of various different genes and that the contributing sets of growth factors change during plant development. In contrast to a previous study in Arabidopsis, which revealed time‐specific and general QTL affecting growth dynamics (Bac‐Molenaar et al., 2015), in the present study in canola only time‐specific associations were detected. The longest interval of significance was found for marker ‘Bn‐scaff_16361_1‐p2350469’ on chromosome C08 associated with projected leaf area between 16 and 27 DAS. The nature of these dynamic, time‐specific patterns with their phenotypic plasticity suggests that they are under the control of dynamic genetic regulation. The beneficial effect of an allele of an early QTL might lose its benefit with progression of development and another allele of a later QTL might take up the beneficial effect. Remarkably, the same marker, ‘Bn‐scaff_16804_1‐p178142’ on chromosome C02, was found to be associated with both, the projected leaf area at 25–27 DAS and end‐point dry weight. Many associations with effects at earlier time points would likely not have been identified if biomass‐associated traits had only been evaluated as integrated effects at the end of the experiment. As a result, underlying genes might not be uncovered or the genetic value of the loci might be underestimated.
Figure 3

Dynamic associations detectable during cultivation from 6 to 27 DAS. GWAS was performed on BLUEs of a projected leaf area, b estimated biovolume, c early plant height and d plant colour uniformity in R/package {FarmCPU}. Different colours indicate markers with P‐value(FDR) ≤0.1 at three consecutive days, with the colour gradient corresponding to the temporal pattern. DAS denotes days after sowing. BLUEs were estimated using three replicates (carriers) with nine and five plants for 6 to 13 DAS and 15 to 27 DAS, respectively. No data were recorded at 14 DAS due to sampling of shoot material.

Dynamic associations detectable during cultivation from 6 to 27 DAS. GWAS was performed on BLUEs of a projected leaf area, b estimated biovolume, c early plant height and d plant colour uniformity in R/package {FarmCPU}. Different colours indicate markers with P‐value(FDR) ≤0.1 at three consecutive days, with the colour gradient corresponding to the temporal pattern. DAS denotes days after sowing. BLUEs were estimated using three replicates (carriers) with nine and five plants for 6 to 13 DAS and 15 to 27 DAS, respectively. No data were recorded at 14 DAS due to sampling of shoot material. To further address the dynamic nature of these traits, relative growth rates (RGRs) for projected leaf area, estimated biovolume and early plant height, as well as absolute change rates (ACRs), for colour uniformity were calculated over 15 intervals of three day durations to integrate the effects over longer periods (Figure S2e,f). Highest relative growth rates, especially for plant height, were detectable in the beginning of the cultivation and show a decreasing trend over time attributed either to an actual decrease in growth or to a bias due to overlapping leaves. Absolute change rates for colour uniformity were more stable than the relative growth rates during development. Growth rates were subsequently mapped with the same approach as the single time point data. It is remarkable to note that (beyond achievements in previous studies) GWAS was successfully applied here to RGR traits of multiple successive time intervals resulting in the detection of a total of 268 significant associations, with 100 MTAs for biovolume RGRs, 76 MTAs for leaf area RGRs, 73 MTAs for plant height RGRs and 19 MTAs for the colour uniformity changes detected for individual intervals. This can be attributed to the statistical power achieved in the present study through the large dimension and the setup of the experiments assessing each of the 477 analysed genotypes replicated in three of the four large‐scale glasshouse experiments performed under controlled environmental conditions with nine (6 to 13 DAS) or five (15 to 27 DAS) individuals, respectively, per replicate. To focus on particularly robust MTAs, the growth rate associations were further filtered for at least two consecutive significant intervals, as it has been done previously for the absolute trait values at individual time points. For colour uniformity ACRs, no consecutive significant associations were found. Two MTAs for leaf area RGRs at intermediate growth intervals, four MTAs for biovolume RGRs distributed evenly over the entire examined growth period and two MTAs for plant height at a very early phase were detected (Figure 4). The substantially lower number of RGR MTAs active at two consecutive intervals vs. the total number of RGR MTAs may indicate that the majority of effects are restricted to very narrow time windows. Since RGR MTAs address the acute action of the genetic loci at the assessed time point, while the MTAs of absolute trait values reflect the cumulative effects of the loci that happened during the entire growth period up to the time point of measurement, it is not surprising that the number of detected RGR MTAs is generally lower than the number of MTAs of absolute trait MTAs and that there is only minor overlap between the MTAs of the two types of traits.
Figure 4

Dynamic associations detectable for relative growth rates. GWAS was performed on BLUEs of a relative growth rates for projected leaf area, b relative growth rates for estimated biovolume and c relative growth rates for early plant height in R/package {FarmCPU}. Different colours indicate markers with P‐value(FDR) ≤0.1 at two consecutive intervals. DAS denotes days after sowing. BLUEs were estimated using three replicates (carriers) with nine and five plants for 6 to 13 DAS and 15 to 27 DAS, respectively. No data were recorded at 14 DAS due to sampling of shoot material.

Dynamic associations detectable for relative growth rates. GWAS was performed on BLUEs of a relative growth rates for projected leaf area, b relative growth rates for estimated biovolume and c relative growth rates for early plant height in R/package {FarmCPU}. Different colours indicate markers with P‐value(FDR) ≤0.1 at two consecutive intervals. DAS denotes days after sowing. BLUEs were estimated using three replicates (carriers) with nine and five plants for 6 to 13 DAS and 15 to 27 DAS, respectively. No data were recorded at 14 DAS due to sampling of shoot material. A recent study analysed the genetic architecture of biomass accumulation in spring barley (Neumann et al., 2017) by image analysis and described temporal patterns similar to the findings for absolute trait MTAs in the present study. Muraya et al. (2017) detected MTA effects on RGR for a subset of the strongest absolute trait MTAs and described the reversal of allelic effects over time for markers associated with relative growth rates. Similar observations were made in the present study on canola: allelic effects of loci did not only increase and decrease with time, tending to diminish after a certain interval, but for a substantial fraction of MTAs (16/30 for absolute trait MTAs and even 8/8 for RGR MTAs) allele effects reversed over time (Figures S7 and S8). As most dynamic growth/biomass‐associated QTL actions tended to persist for periods of only a few days during early growth (a particularly remarkable pattern was observed for the RGR of estimated biovolume; Figure 4b), it might be hypothesized that these QTL are associated with the initiation or development of new leaves. Manual analysis of leaf number for a subset of 30 lines at the different time points indicated that new leaves emerge on average in three‐ to four‐day intervals, coinciding with the observed pattern of dynamic growth QTL. To verify this initial observation, more in‐depth analyses will be necessary that will require robust high‐throughput quantification of leaf number in the acquired images. While promising advances in image analyses have been achieved in this direction, for example by ‘CVPPP challenges’ (Pape and Klukas, 2015; Scharr et al., 2016), further developments will be necessary to use automated image analyses towards this goal. If the hypothesis of different QTL being involved in initiation and development of successive leaves can be supported, it indicates the exciting possibility that formation of each leaf (or more generally every organ) may be controlled by a distinct genetic programme triggered through certain leaf‐specific loci.

Shared associations and novel candidate genes for growth dynamics

The purpose of our study was to reveal dynamic growth QTL patterns by a time‐resolved association analysis. Our findings highlight the need for stage‐specific investigations in future studies to identify genes operating at different developmental phases. Muraya et al. (2017) proposed that genes corresponding to dynamic QTL are either selectively expressed at different growth stages or their functions are required or growth‐limiting only in certain developmental phases. Among the 36 identified markers displaying temporal dynamic patterns, nine were shared between different traits. In particular, ‘Bn‐A10‐p13343454’ showed association with projected leaf area, estimated biovolume and early plant height, as well as with fresh weight and dry weight. The marker ‘Bn‐scaff_21312_1‐p895326’ was associated with projected leaf area, biovolume and dry weight, while ‘Bn‐scaff_16804_1‐p178142’ was shared between projected leaf area and dry weight. The other six markers: ‘Bn‐A02‐p24543172_del’, ‘Bn‐A04‐p2218115’, ‘Bn‐scaff_15911_1‐p571842’, ‘Bn‐scaff_16361_1‐p2350469’, ‘Bn‐scaff_17831_1‐p292580_del’ and ‘Bn‐scaff_20947_1‐p146783_del’ were associated with both projected leaf area and biovolume. From these nine markers, a promising subset of five was selected based on the number of associations and traits for detailed analysis. Candidate genes were identified in the corresponding regions on chromosomes A04, A10, C02, C03 and C08 by an LD‐based confidence interval approach (Table 2, Figure 5 and Figure S10). Genes were selected within LD blocks (r2 ≥ 0.6) as exemplarily shown for candidate region 5 on chromosome C08 (Figure 5), where the significantly associated marker ‘Bn‐scaff_16361_1‐p2350469’ forms an LD block with four of its neighbouring SNPs. The block spans a region of 368 kb and contains 72 genes, of which seven were selected as putative candidates based on their annotation: the citrate synthase CSY2; the MADS‐box transcription factor SHP1; PAR2 involved in the brassinosteroid‐mediated signalling pathway; the pectinesterase PME35 implicated in cell wall modification; the bHLH transcription factor PIF5; the tetrapyrrole‐binding protein GUN4 which regulates chlorophyll synthesis and the flowering time control protein FPA also annotated to be involved in cell differentiation. In case of the absence of detectable LD, genes were selected in the 100 kb flanking regions on either side of the significant marker as suggested by Zhou et al. (2017a).
Table 2

List of candidate regions and selected candidate genes

IntervalMarkerChr.Pos. (bp)LD block* Interval start (bp)Interval stop (bp)Interval size (bp)Number of genesNumber of MTAsTraitsSelected candidates Arabidopsis homologue/putative function
1Bn‐A04‐p2218115A042 103 821no2 003 8212 203 821200 0004414

leaf area

biovolume

Bra014695

BnaA04g02550D

BnaA04g02600D

BnaC05g07680D

ARR17/ two‐component response regulator

WRKY55/ transcription factor, WRKY

ANAC064/ transcription factor, NAC domain

ANAC064/ transcription factor, NAC domain

2Bn‐A10‐p13343454A1012 120 357no12 020 35712 220 357200 0005416

FW,

DW,

leaf area,

biovolume,

plant height

BnaA10g18330D

BnaA10g18440D

BnaA10g18480D

BnaA10g18530D

BnaA10g18590D

BnaA10g18600D

BnaA10g18650D

NIK1/ protein phosphorylation

BZIP3/ transcription factor, basic‐leucine zipper

SEPALLATA1/ transcription factor, MADS‐box

RRT1/ O‐fucosyltransferase, pectin biosynthetic process

RGP2/ UDP‐arabinose mutase, cell wall biogenesis

COBRA‐like protein/ cell wall biogenesis

LONGIFOLIA1/ regulation of cell growth

3Bn‐scaff_16804_1‐p178142C029 108 149yes8 497 70610 116 8201 619 1141489

DW,

leaf area,

biovolume,

plant height,

plant colour uniformity

BnaC02g11320D

BnaC02g11400D BnaC02g11890D

BnaC02g44440D

BnaC02g44470D

BnaC02g11520D

BnaC02g11970D

BnaC02g12210D

BnaC02g12340D

BnaC03g72190D

SMAX1/ hydrolase, seedling development

COL5/ transcription factor, zinc finger (B‐box type)

AT1G50890/ cell growth

ZEP2/ transcription factor, zinc finger

IAA33/ auxin‐activated signalling pathway

AIL5/ transcription factor, postembryonic development

AT5G56960/ transcription factor, bHLH

GULLO4/ oxidoreductase activity, cell wall biogenesis

EXP14/ alpha‐expansion, cell growth

CRK/ CDPK‐related kinase, ABA‐activated signalling pathway

4Bn‐scaff_21312_1‐p895326C0311 220 963no11 120 96311 320 963200 0004316

DW,

leaf area,

biovolume

BnaC03g18580D

BnaC03g18800D

IAA13/ auxin‐activated signalling pathway

SPL3/ transcription factor, SBP‐box

5Bn‐scaff_16361_1‐p2350469C0840 120 568yes39 843 45640 211 817368 3617214

leaf area,

biovolume

BnaC08g29460D

BnaC08g29530D

BnaC08g29560D

BnaCnng47940D

BnaC08g29580D

BnaC08g48840D

BnaC08g29740D

CSY2/ citrate synthase

SHP1/ transcription factor, MADS‐box

PAR2/ brassinosteroid‐mediated signalling pathway

PME35/ pectinesterase, cell wall modification

PIF5/ transcription factor, bHLH

GUN4/ tetrapyrrole‐binding, chlorophyll biosynthetic process

FPA/ flowering time control, cell differentiation

In case of the absence of an LD block, flanking 100 kb regions on either side of the associated marker were screened for candidate genes.

Best match using BLAST of transcript sequences to the B. napus Darmor‐bzh v.4.1, the concatenated Brassica AC and the A. thaliana TAIR10 transcriptomes; a full list of de novo annotated transcripts within the five intervals, BLAST results, descriptions and functional annotations (BLAST2GO) is available in Data S5.

Closest homologue in Arabidopsis thaliana; putative function (selection) obtained from the Brassica (BRAD) and the (TAIR) databases.

Figure 5

Manhattan plot for a representative MTA in the candidate region 5 on Chr. C08 with selected candidate genes and correlations between markers. The Manhattan plot describes genome‐wide marker–trait associations for the candidate region 5 on chromosome C08. The trait ‘projected leaf area at 21 DAS’ is shown as a representative trait for the 14 traits associated with the marker ‘Bn‐scaff_16361_1‐p2350469’ (Data S8). The significant associated SNP is indicated by a red dot. Grey dots represent surrounding nonsignificant markers in the region. Please note that the FarmCPU GWAS method, which iteratively uses fixed‐ and random‐effects models and pseudo‐QTN as covariates, results in a different appearance of the Manhattan plots. Significant associations are illustrated by ‘helicopters’ rather than ‘skyscrapers’. For reasons of clarity and comprehensibility, the zoom‐in of the candidate region was extended to the next flanking SNP markers (‘Bn‐A04‐p1895018’ and ‘Bn‐A04‐p2094818’). Red triangles indicate the positions of selected candidate genes (Table 2). The LD heatmap in the bottom section shows the correlations (r 2) between surrounding SNP markers. The markers ‘Bn‐scaff_16361_1‐p2350469’, ‘Bn‐scaff_16361_1‐p2354073’, ‘Bn‐scaff_16361_1‐p2400621’, ‘Bn‐scaff_16361_1‐p2401475’ and ‘Bn‐scaff_16361_1‐p2402567’ form an LD block (r 2 ≥ 0.6).

List of candidate regions and selected candidate genes leaf area biovolume Bra014695 BnaA04g02550D BnaA04g02600D BnaC05g07680D ARR17/ two‐component response regulator WRKY55/ transcription factor, WRKY ANAC064/ transcription factor, NAC domain ANAC064/ transcription factor, NAC domain FW, DW, leaf area, biovolume, plant height BnaA10g18330D BnaA10g18440D BnaA10g18480D BnaA10g18530D BnaA10g18590D BnaA10g18600D BnaA10g18650D NIK1/ protein phosphorylation BZIP3/ transcription factor, basic‐leucine zipper SEPALLATA1/ transcription factor, MADS‐box RRT1/ O‐fucosyltransferase, pectin biosynthetic process RGP2/ UDP‐arabinose mutase, cell wall biogenesis COBRA‐like protein/ cell wall biogenesis LONGIFOLIA1/ regulation of cell growth DW, leaf area, biovolume, plant height, plant colour uniformity BnaC02g11320D BnaC02g11400D BnaC02g11890D BnaC02g44440D BnaC02g44470D BnaC02g11520D BnaC02g11970D BnaC02g12210D BnaC02g12340D BnaC03g72190D SMAX1/ hydrolase, seedling development COL5/ transcription factor, zinc finger (B‐box type) AT1G50890/ cell growth ZEP2/ transcription factor, zinc finger IAA33/ auxin‐activated signalling pathway AIL5/ transcription factor, postembryonic development AT5G56960/ transcription factor, bHLH GULLO4/ oxidoreductase activity, cell wall biogenesis EXP14/ alpha‐expansion, cell growth CRK/ CDPK‐related kinase, ABA‐activated signalling pathway DW, leaf area, biovolume BnaC03g18580D BnaC03g18800D IAA13/ auxin‐activated signalling pathway SPL3/ transcription factor, SBP‐box leaf area, biovolume BnaC08g29460D BnaC08g29530D BnaC08g29560D BnaCnng47940D BnaC08g29580D BnaC08g48840D BnaC08g29740D CSY2/ citrate synthase SHP1/ transcription factor, MADS‐box PAR2/ brassinosteroid‐mediated signalling pathway PME35/ pectinesterase, cell wall modification PIF5/ transcription factor, bHLH GUN4/ tetrapyrrole‐binding, chlorophyll biosynthetic process FPA/ flowering time control, cell differentiation In case of the absence of an LD block, flanking 100 kb regions on either side of the associated marker were screened for candidate genes. Best match using BLAST of transcript sequences to the B. napus Darmor‐bzh v.4.1, the concatenated Brassica AC and the A. thaliana TAIR10 transcriptomes; a full list of de novo annotated transcripts within the five intervals, BLAST results, descriptions and functional annotations (BLAST2GO) is available in Data S5. Closest homologue in Arabidopsis thaliana; putative function (selection) obtained from the Brassica (BRAD) and the (TAIR) databases. Manhattan plot for a representative MTA in the candidate region 5 on Chr. C08 with selected candidate genes and correlations between markers. The Manhattan plot describes genome‐wide marker–trait associations for the candidate region 5 on chromosome C08. The trait ‘projected leaf area at 21 DAS’ is shown as a representative trait for the 14 traits associated with the marker ‘Bn‐scaff_16361_1‐p2350469’ (Data S8). The significant associated SNP is indicated by a red dot. Grey dots represent surrounding nonsignificant markers in the region. Please note that the FarmCPU GWAS method, which iteratively uses fixed‐ and random‐effects models and pseudo‐QTN as covariates, results in a different appearance of the Manhattan plots. Significant associations are illustrated by ‘helicopters’ rather than ‘skyscrapers’. For reasons of clarity and comprehensibility, the zoom‐in of the candidate region was extended to the next flanking SNP markers (‘Bn‐A04‐p1895018’ and ‘Bn‐A04‐p2094818’). Red triangles indicate the positions of selected candidate genes (Table 2). The LD heatmap in the bottom section shows the correlations (r 2) between surrounding SNP markers. The markers ‘Bn‐scaff_16361_1‐p2350469’, ‘Bn‐scaff_16361_1‐p2354073’, ‘Bn‐scaff_16361_1‐p2400621’, ‘Bn‐scaff_16361_1‐p2401475’ and ‘Bn‐scaff_16361_1‐p2402567’ form an LD block (r 2 ≥ 0.6). A comprehensive list of all thus identified candidate genes for all evaluated traits can be found in Data S5. Among the 361 genes, 30 genes were selected as particularly interesting candidates based on their annotation and gene ontology (GO). Nine of these genes have annotations related to meristem development and cell growth, including Sepallata1 (BnaA10g18480D), Longifolia1 (BnaA10g18650D), Squamosa promoter binding 4 (BnaC03g18800D) and Shatterproof1 (BnaC08g29530D). Several other genes are putatively involved in flowering time or cell wall biogenesis and modification, or were annotated as transcription factors. Although candidate genes need to be further analysed and validated in follow‐up studies involving temporally and spatially resolved assessment of gene activity, the described dynamic QTL represent a well exploitable resource to deepen our knowledge of early plant growth and biomass accumulation. Further examination of the associated markers of the five candidate regions revealed that the allele distribution differs between the three breeding pools: For example, for ‘Bn‐A04‐p2218115’, the minor allele is underrepresented in breeding pool 2 and completely absent in breeding pool 3. In contrast, the minor allele of ‘Bn‐scaff_21312_1‐p895326’ is nearly absent in breeding pool 1 and 2, but although only present in the heterozygous state, is highly overrepresented in breeding pool 3. The introgression of beneficial alleles underlying dynamic QTL, absent or underrepresented in conventional breeding pools, on the one side, as well as selection and stacking of beneficial alleles on the other side might help to enhance genetic gain for complex traits towards further improvement of growth performance in canola breeding. Moreover, it broadens the selection basis by introducing the factor temporal dynamics, facilitating marker‐assisted selection to breed high vigour cultivars.

Author contributions

TA, RCM and DK designed the experiments. AA provided seed material. DK performed the experiments and analysed the data. FG and CRW analysed and provided genotype data. BS and RJS provided the reference assembly and transcript annotations. DK wrote the manuscript. RJS, RCM and TA managed the project, advised on interpretation and obtained the funding. All authors read and edited the manuscript. Figure S1 Example for acquired raw image data. Figure S2 Overview of phenotypic data. Figure S3 Biomass distribution and correlation with image‐derived traits. Figure S4 Heritability of phenotypic traits. Figure S5 LD‐decay in the A and C subgenomes. Figure S6 Phenotypic variance explained (PVE%) by detected MTAs. Figure S7 Allele effects of dynamic associations. Figure S8 Allele effects of dynamic associations for relative growth rates. Figure S9 Population structure analysis. Figure S10 Manhattan plots for representative associations in the candidate regions with selected candidate genes and correlations between markers. Data S1 List of canola lines utilized in this study. Data S2 Overview of experimental design. Data S3 Phenotypic data (BLUEs), heritabilities and coefficients of variation. Data S4 Genotype dataset (SNP and CNV markers). Data S5 List of genes in candidate regions. Data S6 Correlations of FW, DW and biomass‐related traits. Data S7 Pairwise LD matrices for all chromosomes. Data S8 List of all associations detected during the cultivation from 6 to 28 DAS. Click here for additional data file.
  88 in total

1.  gsrc: an R package for genome structure rearrangement calling.

Authors:  Fabian Grandke; Rod Snowdon; Birgit Samans
Journal:  Bioinformatics       Date:  2017-02-15       Impact factor: 6.937

2.  GAPIT Version 2: An Enhanced Integrated Tool for Genomic Association and Prediction.

Authors:  You Tang; Xiaolei Liu; Jiabo Wang; Meng Li; Qishan Wang; Feng Tian; Zhongbin Su; Yuchun Pan; Di Liu; Alexander E Lipka; Edward S Buckler; Zhiwu Zhang
Journal:  Plant Genome       Date:  2016-07       Impact factor: 4.089

3.  Dynamic QTL and epistasis analysis on seedling root traits in upland cotton.

Authors:  Qingzhi Liang; Pengbo Li; Cheng Hu; Hua Hua; Zhaohu Li; Yihua Rong; Kunbo Wang; Jinping Hua
Journal:  J Genet       Date:  2014-04       Impact factor: 1.166

4.  Genome-wide association mapping of growth dynamics detects time-specific and general quantitative trait loci.

Authors:  Johanna A Bac-Molenaar; Dick Vreugdenhil; Christine Granier; Joost J B Keurentjes
Journal:  J Exp Bot       Date:  2015-04-28       Impact factor: 6.992

5.  Unravelling the complex trait of harvest index in rapeseed (Brassica napus L.) with association mapping.

Authors:  Xiang Luo; Chaozhi Ma; Yao Yue; Kaining Hu; Yaya Li; Zhiqiang Duan; Ming Wu; Jinxing Tu; Jinxiong Shen; Bin Yi; Tingdong Fu
Journal:  BMC Genomics       Date:  2015-05-12       Impact factor: 3.969

6.  Adult plant development in triticale (× triticosecale wittmack) is controlled by dynamic genetic patterns of regulation.

Authors:  Tobias Würschum; Wenxin Liu; Katharina V Alheit; Matthew R Tucker; Manje Gowda; Elmar A Weissmann; Volker Hahn; Hans Peter Maurer
Journal:  G3 (Bethesda)       Date:  2014-09-18       Impact factor: 3.154

7.  Genome-Wide SNP Markers Based on SLAF-Seq Uncover Breeding Traces in Rapeseed (Brassica napus L.).

Authors:  Qinghong Zhou; Can Zhou; Wei Zheng; Annaliese S Mason; Shuying Fan; Caijun Wu; Donghui Fu; Yingjin Huang
Journal:  Front Plant Sci       Date:  2017-04-28       Impact factor: 5.753

8.  Genome-Wide Association Study Provides Insight into the Genetic Control of Plant Height in Rapeseed (Brassica napus L.).

Authors:  Chengming Sun; Benqi Wang; Lei Yan; Kaining Hu; Sheng Liu; Yongming Zhou; Chunyun Guan; Zhenqian Zhang; Jiana Li; Jiefu Zhang; Song Chen; Jing Wen; Chaozhi Ma; Jinxing Tu; Jinxiong Shen; Tingdong Fu; Bin Yi
Journal:  Front Plant Sci       Date:  2016-07-27       Impact factor: 5.753

9.  Transcriptome and methylome profiling reveals relics of genome dominance in the mesopolyploid Brassica oleracea.

Authors:  Isobel A P Parkin; Chushin Koh; Haibao Tang; Stephen J Robinson; Sateesh Kagale; Wayne E Clarke; Chris D Town; John Nixon; Vivek Krishnakumar; Shelby L Bidwell; France Denoeud; Harry Belcram; Matthew G Links; Jérémy Just; Carling Clarke; Tricia Bender; Terry Huebert; Annaliese S Mason; J Chris Pires; Guy Barker; Jonathan Moore; Peter G Walley; Sahana Manoli; Jacqueline Batley; David Edwards; Matthew N Nelson; Xiyin Wang; Andrew H Paterson; Graham King; Ian Bancroft; Boulos Chalhoub; Andrew G Sharpe
Journal:  Genome Biol       Date:  2014-06-10       Impact factor: 13.583

10.  Genome-wide association study dissects the genetic architecture of seed weight and seed quality in rapeseed (Brassica napus L.).

Authors:  Feng Li; Biyun Chen; Kun Xu; Jinfeng Wu; Weilin Song; Ian Bancroft; Andrea L Harper; Martin Trick; Shengyi Liu; Guizhen Gao; Nian Wang; Guixin Yan; Jiangwei Qiao; Jun Li; Hao Li; Xin Xiao; Tianyao Zhang; Xiaoming Wu
Journal:  DNA Res       Date:  2014-02-07       Impact factor: 4.458

View more
  11 in total

Review 1.  Advanced high-throughput plant phenotyping techniques for genome-wide association studies: A review.

Authors:  Qinlin Xiao; Xiulin Bai; Chu Zhang; Yong He
Journal:  J Adv Res       Date:  2021-05-12       Impact factor: 10.479

Review 2.  Crop adaptation to climate change as a consequence of long-term breeding.

Authors:  Rod J Snowdon; Benjamin Wittkop; Tsu-Wei Chen; Andreas Stahl
Journal:  Theor Appl Genet       Date:  2020-11-22       Impact factor: 5.699

Review 3.  Integrating High-Throughput Phenotyping and Statistical Genomic Methods to Genetically Improve Longitudinal Traits in Crops.

Authors:  Fabiana F Moreira; Hinayah R Oliveira; Jeffrey J Volenec; Katy M Rainey; Luiz F Brito
Journal:  Front Plant Sci       Date:  2020-05-26       Impact factor: 5.753

Review 4.  Conventional and Molecular Techniques from Simple Breeding to Speed Breeding in Crop Plants: Recent Advances and Future Outlook.

Authors:  Sunny Ahmar; Rafaqat Ali Gill; Ki-Hong Jung; Aroosha Faheem; Muhammad Uzair Qasim; Mustansar Mubeen; Weijun Zhou
Journal:  Int J Mol Sci       Date:  2020-04-08       Impact factor: 5.923

5.  Multi-omics-based prediction of hybrid performance in canola.

Authors:  Dominic Knoch; Christian R Werner; Rhonda C Meyer; David Riewe; Amine Abbadi; Sophie Lücke; Rod J Snowdon; Thomas Altmann
Journal:  Theor Appl Genet       Date:  2021-02-01       Impact factor: 5.699

6.  Genome-wide association study identifies QTL for eight fruit traits in cultivated tomato (Solanum lycopersicum L.).

Authors:  Minkyung Kim; Thuy Tien Phan Nguyen; Joon-Hyung Ahn; Gi-Jun Kim; Sung-Chur Sim
Journal:  Hortic Res       Date:  2021-09-01       Impact factor: 6.793

7.  Construction of a Quantitative Genomic Map, Identification and Expression Analysis of Candidate Genes for Agronomic and Disease-Related Traits in Brassica napus.

Authors:  Nadia Raboanatahiry; Hongbo Chao; Jianjie He; Huaixin Li; Yongtai Yin; Maoteng Li
Journal:  Front Plant Sci       Date:  2022-03-11       Impact factor: 5.753

8.  Multiple Loci Control Variation in Plasticity to Foliar Shade Throughout Development in Arabidopsis thaliana.

Authors:  James Ta; Christine Palmer; Marcus Brock; Matthew Rubin; Cynthia Weinig; Julin Maloof; Daniel Runcie
Journal:  G3 (Bethesda)       Date:  2020-11-05       Impact factor: 3.154

Review 9.  Opportunities and limits of controlled-environment plant phenotyping for climate response traits.

Authors:  Anna Langstroff; Marc C Heuermann; Andreas Stahl; Astrid Junker
Journal:  Theor Appl Genet       Date:  2021-07-24       Impact factor: 5.699

10.  Exploiting High-Throughput Indoor Phenotyping to Characterize the Founders of a Structured B. napus Breeding Population.

Authors:  Jana Ebersbach; Nazifa Azam Khan; Ian McQuillan; Erin E Higgins; Kyla Horner; Venkat Bandi; Carl Gutwin; Sally Lynne Vail; Steve J Robinson; Isobel A P Parkin
Journal:  Front Plant Sci       Date:  2022-01-05       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.