Literature DB >> 32401769

Use of multiple traits genomic prediction, genotype by environment interactions and spatial effect to improve prediction accuracy in yield data.

Hsin-Yuan Tsai1,2, Fabio Cericola3, Vahid Edriss4, Jeppe Reitan Andersen4, Jihad Orabi4, Jens Due Jensen4, Ahmed Jahoor4,5, Luc Janss1, Just Jensen1.   

Abstract

Genomic selection has been extensively implemented in plant breeding schemes. Genomic selection incorporates dense genome-wide markers to predict the breeding values for important traits based on information from genotype and phenotype records on traits of interest in a reference population. To date, most relevant investigations have been performed using single trait genomic prediction models (STGP). However, records for several traits at once are usually documented for breeding lines in commercial breeding programs. By incorporating benefits from genetic characterizations of correlated phenotypes, multiple trait genomic prediction (MTGP) may be a useful tool for improving prediction accuracy in genetic evaluations. The objective of this study was to test whether the use of MTGP and including proper modeling of spatial effects can improve the prediction accuracy of breeding values in commercial barley and wheat breeding lines. We genotyped 1,317 spring barley and 1,325 winter wheat lines from a commercial breeding program with the Illumina 9K barley and 15K wheat SNP-chip (respectively) and phenotyped them across multiple years and locations. Results showed that the MTGP approach increased correlations between future performance and estimated breeding value of yields by 7% in barley and by 57% in wheat relative to using the STGP approach for each trait individually. Analyses combining genomic data, pedigree information, and proper modeling of spatial effects further increased the prediction accuracy by 4% in barley and 3% in wheat relative to the model using genomic relationships only. The prediction accuracy for yield in wheat and barley yield trait breeding, were improved by combining MTGP and spatial effects in the model.

Entities:  

Year:  2020        PMID: 32401769      PMCID: PMC7219756          DOI: 10.1371/journal.pone.0232665

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Wheat (Triticum aestivum L.) and barley (Hordeum vulgare L.) are two of the earliest domesticated crop species and are ranked as the first and fourth most-grown cereals worldwide, respectively [1-4]. Approximately 75% of barley’s global production is used as an ingredient in animal feed with the remaining 25% used for alcoholic and non-alcoholic beverages and a variety of other foodstuffs. Due to barley’s diploid genome architecture and its ability to self-fertilize, barley is considered an ideal model species for cereal genetic research [5]. Most wheat varieties are tetraploid (durum) or hexaploid (bread), but a few diploid varieties also exist. Due to their importance in food production, a high quality assembly of the entire genome sequence for barley is publicly available [1]. In contrast, the first genome assembly for wheat became available only recently [4], enhancing the opportunities for plant breeders to advance genome-assisted crop improvements and discover quantitative trait loci (QTLs) of commercial interest. Previous researchers have indicated that most traits of commercial importance in barley and wheat (e.g., yield) can likely be explained by many QTLs, each of which provide small contributions to total genetic variance [6,7]. This architecture has significantly restricted the application of traditional marker-assisted selection techniques, particularly for economically important traits with a highly polygenic architecture. The concept of genomic selection (GS) proposed by Meuwissen et al. [8] was developed to incorporate whole-genome marker data in selection programs to accumulate single nucleotide polymorphisms (SNP) or haplotype effects that can accurately predict future performance of potential new lines. As such, genomic prediction (GP) is now utilized to predict the breeding values of individuals based on a sufficient number of molecular markers and a training population (TP) that is genotyped and phenotyped for traits of interest. The performances of phenotypes in a validated population (VP) can then be predicted by exploiting dense molecular markers (or QTLs) that are associated with traits in the TP. For commercial breeding programs, large scale phenotyping and genotyping of breeding lines in the TP can lead to the development of promising statistical models for variance component estimation and for predicting breeding values using established approaches (e.g., REML [9] and BLUP [10]). In contrast to animal breeding approaches, the utilization of genomic approaches in plant breeding has been developed only recently [11]. Several methods in statistical genetics have been developed that benefit from genetic correlations between traits [12-14]. Univariate analysis, also known as single trait genomic prediction (STGP), is currently the most common method used in plant breeding schemes (e.g., in cassava [15], wheat [16-18], barley [19], rye [20], and rice [21]). However, for most commercial plant breeding programs, breeders have collected data on several phenotypes, which enable them to take advantage of genetic and phenotypic correlations among traits. Such multiple trait genomic prediction (MTGP) methods have recently been extensively examined [15,17,18,22,23]. The MTGP approach was originally developed to exploit information gained from correlated indicator traits [13]. Results have generally indicated that MTGP can increase the accuracy of genetic evaluations, especially when traits with high genetic correlations are involved in the analyses [13,15,17,20,22-24]. These findings agree with expected advantages of indirect selection [25]. Compared with traditional pedigree-based breeding methods and STGP, MTGP will likely be able to provide an ideal alternative for characterizing a higher number of candidate genes for selection and at lower cost, especially for traits that are labor intensive to evaluate or require a long time before they are expressed (e.g., baking quality or resistance to pests). For several economic traits of spring wheat, studies have shown that correlations between observed phenotypes and estimated breeding values are higher when the genomic prediction model involves both genomic and pedigree information than when pedigree alone is used [26]. In general, commercial plant breeders usually have phenotypic records across multiple generations for traits of economic importance. In this study, we used data from multiple plots of F5 generations and analyzed those results jointly with records from replicated experiments of F6 generations from a variety of field locations. Because testing conditions are not necessarily identical for each generation, it may be necessary to treat records from different generations as being different, but correlated traits. This approach might considerably increase selection accuracy and further increase the genetic gain achieved per generation [15,17]. The major aims of this study were to: (1) compare the predictive ability for both genomic information and spatial effect in breeding lines of winter wheat and spring barley, (2) evaluate the prediction accuracy underlying STGP and MTGP methods, and (3) apply F5 and F6 data in the MTGP analysis (as multiple training populations) to predict the future yield in breeding lines of winter wheat and spring barley.

Materials and methods

Field experiment and phenotypes

Our field experiment was performed by Nordic Seed A/S (Galten, Denmark). In total, we tested 1,317 spring barley (H. vulgare) and 1,325 winter wheat (T. aestivum) breeding lines. We tested each line in two consecutive years and at three locations every year (Fig 1). The three locations tested were Dyngby, Holeby, and Skive (for first year only) in Denmark.
Fig 1

Trial plan of spring barley and winter wheat field growth experiments.

‘B’ is the number of lines in spring barley in each corresponding set and ‘W’ is the number of lines in winter wheat. Each set contains data from two consecutive years. For instance, set 1 contained data from 2013 to 2014, set 2 contained data from 2014 to 2015, and so on. The green box represents data we included in the test, whereas the white box in Set 4 represents data still under collection at time of analysis, and not yet included in the test. The figure was adapted from and originally drawn by Andrea Bellucci (pers. comm.).

Trial plan of spring barley and winter wheat field growth experiments.

‘B’ is the number of lines in spring barley in each corresponding set and ‘W’ is the number of lines in winter wheat. Each set contains data from two consecutive years. For instance, set 1 contained data from 2013 to 2014, set 2 contained data from 2014 to 2015, and so on. The green box represents data we included in the test, whereas the white box in Set 4 represents data still under collection at time of analysis, and not yet included in the test. The figure was adapted from and originally drawn by Andrea Bellucci (pers. comm.). We nested multiple trials within the three test locations, and tested plots using a randomized complete block design [27] within each trial and each trial contained the same number of breeding lines. For barley, every trial comprised 22 lines and 3 checks, with 3 replicates in the first year and 2 replicates in the second year. For wheat, every trial had 21 lines and 4 checks, with 2 replicates in each year. For each trial, lines from any given family were sown in a randomized order, in each replicate, next to each other in the field. Based on the size of the family, a trial consisted of one or more families, and if the last family to be sown was more numerous than the remaining available plots, they were sown in the next trial. Therefore, many families had members in at least two different trials. In general, there were 3–5 full-siblings in each trial. Yield data for the F5 and F6 generations were collected in this study. Every year, we made a new set of crosses and every set contained approximately 330 unique single seed descent lines in F5, which were then used to produce the F6 line. The number of recorded plots in F6 were slightly different in spring barley and winter wheat, as details in Table 1. The yield data were measured as kg grain per 8.25-m2 plot in both spring barley and winter wheat breeding lines for F5 and F6, respectively.
Table 1

Descriptive statistics for spring barley and winter wheat phenotypic records.

SpeciesTraitUnitsNo. of PlotsMean (SD)Min.Max.
BarleyYield F6kg grain /8.25m2 per plot153766.60 (0.8)4.29.4
Yield F513176.11 (1.0)3.78.0
WheatYield F6133298.62 (0.9)3.914.8
Yield F513259.68 (1.8)4.113.4
The phenotypes of trait and pedigree information of every line were recorded by Nordic Seed A/S (Galten, Denmark). The three farms used are owned by Nordic Seed A/S, and they, therefore did not need any further permission to use the land. The three farms are legal for farming use, and not located on any national parks or other protected areas of land or sea.

Genotypes

We used the Illumina 9K barley SNP-chip and the 15K wheat SNP-chip to genotype all breeding lines. After quality control procedures, 4,056 SNPs in spring barley and 11,154 SNPs in winter wheat remained for analysis using the following two filters: (1) a minor allele frequency of <0.01 and (2) a missing SNP frequency per line value of >0.02. There were 2,841 SNPs in spring barley and 9,290 SNPs in winter wheat mapped to existing linkage groups according to the genome assembly [1,4], whereas 1,215 SNPs in spring barley and 1,864 SNPs in winter wheat had unknown positions in the genome.

Statistical methods

Pedigree relationship matrices were constructed based on the pedigree information of spring barley and winter wheat using the tabular approach [28,29], which assumed that parental lines have nine cycles of self-fertilization. Genomic relationship matrices (G) were generated for spring barley and winter wheat, using the first method of VanRaden (2008) [30], with G = ZZ’ / 2∑p (1-p), where the matrix Z was calculated as (M–P). M is a matrix of minor allele counts (0, 1, and 2) with m columns (one for each marker) and n rows (one for each line). P is a matrix containing allele frequencies, with column j defined as l2(p − 0.5), wherein l is a vector of ones, and p is the frequency of the second allele at corresponding locus j. After quality control procedures, the percentage of missing values was about 1% for both species in the genotype file before the genomic relationship matrices were constructed. The mean imputation approach was then applied to assign any missing genotypes [30]. All the missing genotypes were imputed while constructing the genomic relationship matrices. We performed a principle coordinate analysis (PCoA) (Fig 3) on the genomic relationship matrix using the built-in R function [31]. We used univariate and multivariate linear mixed models to obtain REML estimates of the variance components of traits using the DMU multivariate mixed model package [32].
Fig 3

Principal coordinate analysis of (a) spring barley and (b) winter wheat.

Model used for yield traits of F5 and F6 generations

We developed the following models for the analyses. Model 1 was developed for yield for both F5 and F6 generations using only genomic information (G). As yield data were both available for F5 and F6 in spring barley and winter wheat, the univariate and multivariate analyses were applied using Model 1: The b is the fixed factor comprising year, location, and trial (YLT), whereas the g is the genomic information. In addition, to estimate effects from pedigree information and spatial effects, we also developed Model 2 for F5 and F6 yields. The b and g terms are described by Model 2: where the a term corresponds to additive genetic effects using pedigree information for the covariance structure, the s term is a spatial effect variable to account for local spatial variation of experiments in the field. For the models described above, where y is a vector of observations for one trait, X is a design matrix for the fixed effect, and the b term is the vector of fixed effects, including combined effects of year, location, and trial (YLT). Z comprises the design matrices of random effects and the g term is a vector of additive genetic effects with , wherein represents genomic variance and G is the genomic relationship matrix. The distribution of , then represents the additive genetic variance and A is the pedigree relationship matrix. The s term is a vector of spatial effect with , which contains the X and Y coordinates of plots in the F5 test (n = 2), and eight surrounding plots and plot itself in the F6 test (n = 9), as illustrated in Fig 2. The e term is a vector of random residuals with .
Fig 2

Illustration of spatial effects employed in the F5 and F6 test.

In the F5 test, we fitted X- and Y-coordinates as the spatial effect in the model, whereas for the F6 test, we included its eight surrounding plots as well in the spatial effect (as a moving average). The figure was adapted from and originally drawn by Andrea Bellucci (pers. comm.).

Illustration of spatial effects employed in the F5 and F6 test.

In the F5 test, we fitted X- and Y-coordinates as the spatial effect in the model, whereas for the F6 test, we included its eight surrounding plots as well in the spatial effect (as a moving average). The figure was adapted from and originally drawn by Andrea Bellucci (pers. comm.). For multivariate analysis, we modeled two traits together to estimate all effects, including the marker effects. The testing combinations are detailed in Fig 4. Taking Model 1 as an example, y represents yield for F6 and y represents yield for F5. The year, location, and trial (YLT) serves as a fixed factor represented by b in the model. The terms X and Z are the designed matrices of the fixed factor and random factor, respectively. The g term is the genomic information, as described in the statistical model section. We assumed the residual covariance to be zero because yield in F5 and F6 generation were statistically independent (as they were collected from different years and generations). where with , wherein H is the variance and covariance matrix of the genomic breeding values of the two traits, and for with , R is the residual variance and covariance matrix of the two traits. Residual co-variance did not exist when we performed yield calculations for F5 and F6 generations, because the traits were collected from different years. When there were missing data for one of the traits, the residual variance was equal to for the observed trait.
Fig 4

Comparison of MTGP and STGP approaches for predicting yield in the F6 generation of winter wheat and spring barley.

For MTGP, we used a training population, including F5 as Trait I, Sets 1, 2, 3 for yield by F6 as Trait II, and Set 4 to predict yield of F6 (as a validation population). For STGP, we used Sets 1, 2, and 3 for yield by F6 data as the training population to predict Set 4 for yield by F6 (as a validation population). Fig 4c shows the corresponding models used for MTGP and STGP, respectively. The corresponding models are described in statistical model section in material and methods.

We used the variances to estimate the heritability of line means. The total phenotypic variance () of line means was: Heritability was estimated as: where d(G) is the mean diagonal element of the genomic relationship matrix, n is the number of surrounding plots considered in the spatial effect, r is the number of replicates of corresponding effects for each genotype when estimating line heritability, and r was one (1.0) when estimating the narrow-sense plot heritability [33] based on the data of a single plot. The narrow-sense plot heritability was used to consider the random effects from the plot itself, whereas the line heritability was used to calculate the mean of effects from records across all replicates based on the same breeding line [6]. Line heritability is higher than plot heritability when there are more replicates in the experiment.

Cross-validation and predictive ability

For our multivariate analysis (MTGP) of the F5 and F6 yield dataset in particular, we used four sets (Sets 1, 2, 3, and 4) in F5 as the first trait, and Sets 1, 2, and 3 in F6 as the second trait to predict the yield performance of Set 4 in F6. This strategy helped us test the feasibility of using a multivariate analysis for predicting future traits of interest in the coming year. For univariate analysis (STGP), we used Sets 1, 2, and 3 for yield by F6 data as the training population to predict Set 4 for yield by F6 (as a validation population). We estimated the predictive ability for future yield performance [ρ(ӯ, ĝ)] by calculating the correlation between the average of phenotypic records corrected for the fixed effect (ӯ) and genomic predicted breeding values (ĝ). The accuracy of predicting additive breeding values we calculated as the predictive ability divided by the square root of heritability of line means: ρ(ӯ, ĝ)/h.

Results

Genomic relationship analysis among breeding lines

The first two principal components of PCoA explained 69% (Axis 1) and 13% (Axis 2) of the total variance in genomic relationships for spring barley, and 83% (Axis 1) and 10% (Axis 2) for winter wheat (Fig 3). In general, most lines were highly genetically associated with others. Based on genomic information, PCoA indicated that there were clearly identifiable groups in spring barley and winter wheat, implying that certain lines were from the same groups. For example, there were some lines from Set 2 segregating in the left area of the PCoA plot in barley, whereas Set 3 also segregated in left area of the PCoA plot in wheat. However, in general, although the PCoA plot showed that there were only two major genetic clusters in both species, we also found that certain lines came from different crosses, sets, and parents. The heat-map of genomic relationship (using a similar dataset) also highlights the same results for both grain species [6,7].

Descriptive statistics and variance components

We studied yield traits in spring barley and winter wheat commercial breeding lines. The number of plots and phenotype statistics for each trait are listed in Table 1. The heritability and variance component estimates of traits are given in Table 2. Heritability (using the genomic-based method) of yield in F5 was 9% for spring barley and 41% for winter wheat, whereas the heritability of yield in F6 was 24% for spring barley, and 33% for winter wheat (Table 2).
Table 2

Variance components, narrow-sense plot heritability, and correlation estimation of traits using model 1.

The column for and are given by 10−2 as base unit.

SpeciesTraitsσg2(x 10−2)σe2(x 10−2)plot1 h2line h2Cor_G3
BarleyYield F50.36.60.090.090.7
Yield F61.75.70.240.75
WheatYield F52.97.80.410.410.72
Yield F67.622.80.330.76

1 The plot heritability. For yield F5, we only have one plot in F5, so rn in the denominator is always one (see Model 4) and the plot heritability is equal to line heritability. For other traits, we have multiple plots from the same breeding line, so we obtained more information based on the same breeding line. Therefore, line heritability is higher than plot heritability. See more descriptions in Model 4.

2 Line heritability.

3 The environmental correlation was set as independent between yield F6 and F5 because their records were collected in different years and locations. Therefore, only genetic correlations (Cor_G) are provided for yield traits.

Variance components, narrow-sense plot heritability, and correlation estimation of traits using model 1.

The column for and are given by 10−2 as base unit. 1 The plot heritability. For yield F5, we only have one plot in F5, so rn in the denominator is always one (see Model 4) and the plot heritability is equal to line heritability. For other traits, we have multiple plots from the same breeding line, so we obtained more information based on the same breeding line. Therefore, line heritability is higher than plot heritability. See more descriptions in Model 4. 2 Line heritability. 3 The environmental correlation was set as independent between yield F6 and F5 because their records were collected in different years and locations. Therefore, only genetic correlations (Cor_G) are provided for yield traits.

STGP versus MTGP

For multiple trait analysis, we used yield in F5 as the first trait and yield in F6 as the second trait to predict the future yield performance for the spring barley and winter wheat breeding lines. Overall, the prediction accuracies of bivariate analyses were higher than for univariate analyses in all scenarios (Fig 4). In using a bivariate analysis, we improved predictive accuracies by about 7% in spring barley and by about 57% in winter wheat varying from STGP and MTGP. The MTGP model that combined pedigree, the genomic relationship matrix, and spatial effects showed higher prediction accuracy than using the genomic relationship matrix only.

Comparison of MTGP and STGP approaches for predicting yield in the F6 generation of winter wheat and spring barley.

For MTGP, we used a training population, including F5 as Trait I, Sets 1, 2, 3 for yield by F6 as Trait II, and Set 4 to predict yield of F6 (as a validation population). For STGP, we used Sets 1, 2, and 3 for yield by F6 data as the training population to predict Set 4 for yield by F6 (as a validation population). Fig 4c shows the corresponding models used for MTGP and STGP, respectively. The corresponding models are described in statistical model section in material and methods.

Discussion

The goal of this study was to utilize pedigree information, genomic information, and genetic covariance between associated traits to increase the accuracy of prediction of economically important traits in cereal breeding programs. Our main findings were that the prediction accuracy of yield performance clearly increased when we modeled both yield F5 and F6 data simultaneously in the analysis. Furthermore, the prediction accuracy calculated from test data involving both pedigree, genomic and spatial information was clearly higher than data obtained from genomic information.

Genetic correlation is critical for improving accuracy in MTGP

Theoretically, genetic correlation can arise mainly by pleiotropy or, less commonly, by linkage disequilibrium [34]. A high genetic correlation between two traits does not imply that both traits are highly heritable, but neither does a high phenotypic correlation [25]. Several studies using both real and simulated data have suggested that the genetic correlation between genetically-linked traits is important for multivariate genomic selection to be advantageous [13,15,20,23,25,35]. Therefore, genetic correlations between traits of interest have been recently exploited to increase the statistical power for detecting segregating QTLs [36,37] and to improve accuracy in genomic predictions in plant breeding programs [20,23,35]. To date, there have been only a few published multiple trait studies using field data for plant breeding [15-17,20,23,38]. In a simulation study, Jia and Jannink [23] reported that for two traits with no genetic correlation, the prediction accuracy of STGP was equivalent to or even better than the accuracy of MTGP. In the current study, the genetic correlation between yield in F5 and in F6 data was approximately 0.7 in both spring barley and winter wheat breeding lines. Because we collected phenotypes from different years and locations, the environmental effects on yield in F5 and F6 were independent. Our results showed that MTGP outperformed STGP by 7% of yield in spring barley and 57% in winter wheat. A similar improvement rate (60%) using MTGP was also reported from pine breeding data [23]. Notably, the predictive ability for spring barley was generally higher than it was for winter wheat, but the relative improvement was not as dramatic as it was for winter wheat. For winter wheat, the predictive ability for yield was 0.23 for F6 generation in our single trait analysis, and 0.37 using yield data from F5 and F6 generations in the multiple trait analysis. Because the predictive ability for spring barley was 0.48 using yield data from F6 with the STGP model, but 0.51 using MTGP, the result clearly shows that, for estimating yield performance, the prediction accuracy of the STGP model for the spring barley line was better than was the MTGP model for the winter wheat breeding lines.

Yield heritability difference between F5 and F6

For spring barley and winter wheat, our results showed that the heritability of yield in F5 and F6 differed slightly. One reason to cause the differences could be due to the smaller plot size and lower sowing density for F5. In addition, the F5 data were tested on one location with one replicate only (compared with F6, there were multiple tested locations and plots), this may cause that the genetic effects included both general additive genetic effects plus specific additive genetic effects due to GxE effect between genotypes and the one location used. These effects cannot be separated for F5 data, compared with F6 data. As such, the above reason cloud lead to the heritability difference between yield F5 and F6.

Genomic information boosts the prediction accuracy

To our knowledge, there are only few major QTLs segregating identified (such as, the Mlo locus), at least for the economically important traits we investigated in this study. A review by Bernardo [39] stated that approximately 10,000 QTLs have been identified by QTL mapping experiments on twelve major crop species. However, only a few QTLs have been exploited in marker-assisted selection in practical breeding schemes, which indicates that most economically important traits in spring barley and winter wheat are highly polygenic in nature. Thus, if sufficient genomic information is available (e.g., segregated SNPs across an entire genome), then genomic predictions can be an efficient tool for capturing genetic variances, much more efficient than relying on pedigree records in plant breeding. Previous studies that applied genomic-based BLUP (GBLUP) approaches show consistent prediction accuracies across various genetic architectures under simulated scenarios [40]. Additionally, Jia and Jannink [23] indicated that multiple trait GBLUP performed equally as well as Bayesian models (Bayes A and Bayes Cpi) when the traits were controlled by a polygenic genetic architecture. Both authors suggested that BLUP is likely an ideal option for modelling the traits we investigated. In this study, our model involved both genomic and pedigree information, the prediction accuracies were slightly higher than using genomic information only. This result suggests that our evaluation involving pedigree information was less accurate than using a genomic-based method. On the other hand, GBLUP is potentially not as robust as the Bayesian model when there are outliers involved (e.g., the disease traits in spring barley investigated in this study deviated from the normal distribution). The prediction accuracy reported in this study was sufficiently high (e.g., prediction accuracy > 0.5) for genomic breeders to make selection decisions on favored traits earlier in the breeding cycle, which would enable them to maximize genetic gains [17].

Clear genetic grouping observed in commercial spring barley and winter wheat breeding lines

Although our principal components analysis (PCoA) indicated that the genetic relationship and degree of variation between all lines in both species we examined differed slightly, the PCoA clearly showed that there were some segregating groups among all breeding lines, thus implying that many lines had strong genomic relationships in certain genetic clusters.

Future perspectives

Simulation studies based on STGP have suggested that when a high SNP marker density is used, a substantial improvement in prediction accuracy can be expected in genomic evaluations [41]. Our study used a full set of marker genotypes as well as the total available population in the MTGP model. However, genotyping cost is still a major concern in plant breeding, especially for commercial breeders. Therefore, although our approach has been tested using simulation scenarios [13], the effect of marker density and optimization of TP size may require further investigation based on real data. In addition, non-standard phenotypes, such as those obtained from metabolomics data, may assist practitioners in boosting correlations in MTGP. For example, some investigations have involved metabolomics data in multiple trait analyses, aiming to improve accuracy in plant breeding schemes (and in animal breeding) [17,42-45]. Although STGP usually achieves a predictive ability similar to MTGP in some cases (e.g., soybeans [38], bread wheat [17], durum wheat [18], and African cassava [15]), our study suggests that the predictive ability of certain traits can be improved using MTGP (based on winter wheat and spring barley breeding lines and the large number of lines we included in our study). As such, cereal breeders can apply MTGP, combined with GxE effect, to improve predictive ability for selecting high yielding cultivars with improved resistance and quality by exploiting genetic correlation between the traits.

Conclusion

Our study showed that the MTGP approach is better than STGP for predicting yield traits in spring barley and winter wheat breeding lines when we included yield in F5 and yield in F6 in the evaluation. We also found that a model fitting pedigree, genomic and spatial information will have better prediction accuracy than using genomic information only. To conclude, prediction accuracy clearly increased in both species when we modelled yield data from F5 and F6 generations with MTGP, GxE, and spatial effects in the model. Thus, breeders can use the genetic relationship between traits to predict future trait performance, with considerably improved accuracy, by including genetically related traits using multivariate genomic prediction approaches. (GENOTYPE) Click here for additional data file. (YIELD) Click here for additional data file. (PHENOTYPE) Click here for additional data file. (GENOTYPE) Click here for additional data file. (R) Click here for additional data file. (DIR) Click here for additional data file. (DIR) Click here for additional data file. (DOCX) Click here for additional data file. (XLSX) Click here for additional data file. (XLSX) Click here for additional data file.
  32 in total

1.  Efficient methods to compute genomic predictions.

Authors:  P M VanRaden
Journal:  J Dairy Sci       Date:  2008-11       Impact factor: 4.034

2.  Systematic procedures for calculating inbreeding coefficients.

Authors:  L O EMIK; C E TERRILL
Journal:  J Hered       Date:  1949-02       Impact factor: 2.645

3.  Genomic and metabolic prediction of complex heterotic traits in hybrid maize.

Authors:  Christian Riedelsheimer; Angelika Czedik-Eysenberg; Christoph Grieder; Jan Lisec; Frank Technow; Ronan Sulpice; Thomas Altmann; Mark Stitt; Lothar Willmitzer; Albrecht E Melchinger
Journal:  Nat Genet       Date:  2012-01-15       Impact factor: 38.330

4.  Genomic prediction unifies animal and plant breeding programs to form platforms for biological discovery.

Authors:  John M Hickey; Tinashe Chiurugwi; Ian Mackay; Wayne Powell
Journal:  Nat Genet       Date:  2017-08-30       Impact factor: 38.330

5.  Metabolomic prediction of yield in hybrid rice.

Authors:  Shizhong Xu; Yang Xu; Liang Gong; Qifa Zhang
Journal:  Plant J       Date:  2016-08-29       Impact factor: 6.417

6.  Differentially penalized regression to predict agronomic traits from metabolites and markers in wheat.

Authors:  Jane Ward; Mariann Rakszegi; Zoltán Bedő; Peter R Shewry; Ian Mackay
Journal:  BMC Genet       Date:  2015-02-26       Impact factor: 2.797

7.  Association mapping and genomic prediction for resistance to sudden death syndrome in early maturing soybean germplasm.

Authors:  Yong Bao; James E Kurle; Grace Anderson; Nevin D Young
Journal:  Mol Breed       Date:  2015-05-17       Impact factor: 2.589

8.  Genomic Prediction of Seed Quality Traits Using Advanced Barley Breeding Lines.

Authors:  Nanna Hellum Nielsen; Ahmed Jahoor; Jens Due Jensen; Jihad Orabi; Fabio Cericola; Vahid Edriss; Just Jensen
Journal:  PLoS One       Date:  2016-10-26       Impact factor: 3.240

9.  Accuracies of univariate and multivariate genomic prediction models in African cassava.

Authors:  Uche Godfrey Okeke; Deniz Akdemir; Ismail Rabbi; Peter Kulakow; Jean-Luc Jannink
Journal:  Genet Sel Evol       Date:  2017-12-04       Impact factor: 4.297

10.  Prediction of complex human traits using the genomic best linear unbiased predictor.

Authors:  Gustavo de Los Campos; Ana I Vazquez; Rohan Fernando; Yann C Klimentidis; Daniel Sorensen
Journal:  PLoS Genet       Date:  2013-07-11       Impact factor: 5.917

View more
  7 in total

1.  Comparison of single-trait and multi-trait genomic predictions on agronomic and disease resistance traits in spring wheat.

Authors:  Kassa Semagn; José Crossa; Jaime Cuevas; Muhammad Iqbal; Izabela Ciechanowska; Maria Antonia Henriquez; Harpinder Randhawa; Brian L Beres; Reem Aboukhaddour; Brent D McCallum; Anita L Brûlé-Babel; Amidou N'Diaye; Curtis Pozniak; Dean Spaner
Journal:  Theor Appl Genet       Date:  2022-06-23       Impact factor: 5.574

Review 2.  Advances in integrated genomic selection for rapid genetic gain in crop improvement: a review.

Authors:  C Anilkumar; N C Sunitha; Narayana Bhat Devate; S Ramesh
Journal:  Planta       Date:  2022-09-23       Impact factor: 4.540

3.  Genetic architecture and genomic predictive ability of apple quantitative traits across environments.

Authors:  Michaela Jung; Beat Keller; Morgane Roth; Maria José Aranzana; Annemarie Auwerkerken; Walter Guerra; Mehdi Al-Rifaï; Mariusz Lewandowski; Nadia Sanin; Marijn Rymenants; Frédérique Didelot; Christian Dujak; Carolina Fonti Forcada; Andrea Knauf; François Laurens; Bruno Studer; Hélène Muranty; Andrea Patocch
Journal:  Hortic Res       Date:  2022-02-19       Impact factor: 7.291

4.  Improvement of genomic prediction in advanced wheat breeding lines by including additive-by-additive epistasis.

Authors:  Miguel Angel Raffo; Pernille Sarup; Xiangyu Guo; Huiming Liu; Jeppe Reitan Andersen; Jihad Orabi; Ahmed Jahoor; Just Jensen
Journal:  Theor Appl Genet       Date:  2022-01-01       Impact factor: 5.574

5.  Multi-trait genomic prediction using in-season physiological parameters increases prediction accuracy of complex traits in US wheat.

Authors:  Dipendra Shahi; Jia Guo; Sumit Pradhan; Jahangir Khan; Muhsin Avci; Naeem Khan; Jordan McBreen; Guihua Bai; Matthew Reynolds; John Foulkes; Md Ali Babar
Journal:  BMC Genomics       Date:  2022-04-12       Impact factor: 3.969

6.  Integrating a growth degree-days based reaction norm methodology and multi-trait modeling for genomic prediction in wheat.

Authors:  Miguel Angel Raffo; Pernille Sarup; Jeppe Reitan Andersen; Jihad Orabi; Ahmed Jahoor; Just Jensen
Journal:  Front Plant Sci       Date:  2022-09-02       Impact factor: 6.627

Review 7.  Harnessing adult-plant resistance genes to deploy durable disease resistance in crops.

Authors:  Eric Dinglasan; Sambasivam Periyannan; Lee T Hickey
Journal:  Essays Biochem       Date:  2022-09-30       Impact factor: 7.258

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.