Literature DB >> 31267147

Training set determination for genomic selection.

Jen-Hsiang Ou1, Chen-Tuo Liao2.   

Abstract

KEY MESSAGE: A new optimality criterion is proposed to determine a training set for genomic selection, which is derived from Pearson's correlation between GEBVs and phenotypic values of a test set. R functions are provided to generate the optimal training set. For a specified test set, we develop a highly efficient algorithm to determine an optimal subset from a large candidate set in which the individuals have been genotyped but not phenotyped yet. The chosen subset serves as a training set to be phenotyped, and then a genomic selection (GS) model is built based on its phenotype and genotype data. In this study, we consider the additive effects whole-genome regression model and adopt ridge regression estimation for marker effects in the GS model. The resulting GS model is then employed to predict genomic estimated breeding values (GEBVs) for the individuals of the test set, which have been genotyped only. We propose a new optimality criterion to determine the required training set, which is derived directly from Pearson's correlation between GEBVs and phenotypic values of the test set. Pearson's correlation is the standard measure for prediction accuracy of a GS model. Our proposed methods can be applied to data with the varying degree of population structure. All the R functions for implementing our training set determination algorithms are available from the R package TSDFGS. The algorithms are illustrated with two datasets which have strong (rice genome dataset) and mild (wheat genome dataset) population structures. Our methods are shown to be advantageous over existing ones, mainly because they fully use the genomic relationship between the test set and the training set by taking into account both the variance and bias for predicting GEBVs.

Entities:  

Mesh:

Year:  2019        PMID: 31267147     DOI: 10.1007/s00122-019-03387-0

Source DB:  PubMed          Journal:  Theor Appl Genet        ISSN: 0040-5752            Impact factor:   5.699


  14 in total

1.  Efficient methods to compute genomic predictions.

Authors:  P M VanRaden
Journal:  J Dairy Sci       Date:  2008-11       Impact factor: 4.034

2.  Maximizing the reliability of genomic selection by optimizing the calibration set of reference individuals: comparison of methods in two diverse groups of maize inbreds (Zea mays L.).

Authors:  R Rincent; D Laloë; S Nicolas; T Altmann; D Brunel; P Revilla; V M Rodríguez; J Moreno-Gonzalez; A Melchinger; E Bauer; C-C Schoen; N Meyer; C Giauffret; C Bauland; P Jamin; J Laborde; H Monod; P Flament; A Charcosset; L Moreau
Journal:  Genetics       Date:  2012-08-03       Impact factor: 4.562

3.  Genome-wide prediction of traits with different genetic architecture through efficient variable selection.

Authors:  Valentin Wimmer; Christina Lehermeier; Theresa Albrecht; Hans-Jürgen Auinger; Yu Wang; Chris-Carolin Schön
Journal:  Genetics       Date:  2013-08-09       Impact factor: 4.562

4.  Training set determination for genomic selection.

Authors:  Jen-Hsiang Ou; Chen-Tuo Liao
Journal:  Theor Appl Genet       Date:  2019-07-02       Impact factor: 5.699

5.  Bayesian optimization for genomic selection: a method for discovering the best genotype among a large number of candidates.

Authors:  Ryokei Tanaka; Hiroyoshi Iwata
Journal:  Theor Appl Genet       Date:  2017-10-06       Impact factor: 5.699

6.  Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa.

Authors:  Keyan Zhao; Chih-Wei Tung; Georgia C Eizenga; Mark H Wright; M Liakat Ali; Adam H Price; Gareth J Norton; M Rafiqul Islam; Andy Reynolds; Jason Mezey; Anna M McClung; Carlos D Bustamante; Susan R McCouch
Journal:  Nat Commun       Date:  2011-09-13       Impact factor: 14.919

7.  Optimization of genomic selection training populations with a genetic algorithm.

Authors:  Deniz Akdemir; Julio I Sanchez; Jean-Luc Jannink
Journal:  Genet Sel Evol       Date:  2015-05-06       Impact factor: 4.297

8.  Training set optimization under population structure in genomic selection.

Authors:  Julio Isidro; Jean-Luc Jannink; Deniz Akdemir; Jesse Poland; Nicolas Heslot; Mark E Sorrells
Journal:  Theor Appl Genet       Date:  2014-11-01       Impact factor: 5.699

9.  Predicting genomic selection efficiency to optimize calibration set and to assess prediction accuracy in highly structured populations.

Authors:  R Rincent; A Charcosset; L Moreau
Journal:  Theor Appl Genet       Date:  2017-08-09       Impact factor: 5.699

10.  Design of training populations for selective phenotyping in genomic prediction.

Authors:  Deniz Akdemir; Julio Isidro-Sánchez
Journal:  Sci Rep       Date:  2019-02-05       Impact factor: 4.379

View more
  7 in total

1.  Building a Calibration Set for Genomic Prediction, Characteristics to Be Considered, and Optimization Approaches.

Authors:  Simon Rio; Alain Charcosset; Tristan Mary-Huard; Laurence Moreau; Renaud Rincent
Journal:  Methods Mol Biol       Date:  2022

Review 2.  Genomic Prediction: Progress and Perspectives for Rice Improvement.

Authors:  Jérôme Bartholomé; Parthiban Thathapalli Prakash; Joshua N Cobb
Journal:  Methods Mol Biol       Date:  2022

3.  Training set determination for genomic selection.

Authors:  Jen-Hsiang Ou; Chen-Tuo Liao
Journal:  Theor Appl Genet       Date:  2019-07-02       Impact factor: 5.699

4.  Combining genetic resources and elite material populations to improve the accuracy of genomic prediction in apple.

Authors:  Xabi Cazenave; Bernard Petit; Marc Lateur; Hilde Nybom; Jiri Sedlak; Stefano Tartarini; François Laurens; Charles-Eric Durel; Hélène Muranty
Journal:  G3 (Bethesda)       Date:  2022-03-04       Impact factor: 3.542

5.  Identification of superior parental lines for biparental crossing via genomic prediction.

Authors:  Ping-Yuan Chung; Chen-Tuo Liao
Journal:  PLoS One       Date:  2020-12-03       Impact factor: 3.240

6.  Genomic Prediction of Grain Yield in a Barley MAGIC Population Modeling Genotype per Environment Interaction.

Authors:  Damiano Puglisi; Stefano Delbono; Andrea Visioni; Hakan Ozkan; İbrahim Kara; Ana M Casas; Ernesto Igartua; Giampiero Valè; Angela Roberta Lo Piero; Luigi Cattivelli; Alessandro Tondelli; Agostino Fricano
Journal:  Front Plant Sci       Date:  2021-05-24       Impact factor: 5.753

7.  Selection of parental lines for plant breeding via genomic prediction.

Authors:  Ping-Yuan Chung; Chen-Tuo Liao
Journal:  Front Plant Sci       Date:  2022-07-27       Impact factor: 6.627

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.