Literature DB >> 29057969

Controlling the Overfitting of Heritability in Genomic Selection through Cross Validation.

Zhenyu Jia1.   

Abstract

In genomic selection (GS), all the markers across the entire genome are used to conduct marker-assisted selection such that each quantitative trait locus of complex trait is in linkage disequilibrium with at least one marker. Although GS improves estimated breeding values and genetic gain, in most GS models genetic variance is estimated from training samples with many trait-irrelevant markers, which leads to severe overfitting in the calculation of trait heritability. In this study, we demonstrated overfitting heritability due to the inclusion of trait-irrelevant markers using a series of simulations, and such overfitting can be effectively controlled by cross validation experiment. In the proposed method, the genetic variance is simply the variance of the genetic values predicted through cross validation, the residual variance is the variance of the differences between the observed phenotypic values and the predicted genetic values, and these two resultant variance components are used for calculating the unbiased heritability. We also demonstrated that the heritability calculated through cross validation is equivalent to trait predictability, which objectively reflects the applicability of the GS models. The proposed method can be implemented with the Mixed Procedure in SAS or with our R package "GSMX" which is publically available at https://cran.r-project.org/web/packages/GSMX/index.html .

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 29057969      PMCID: PMC5651917          DOI: 10.1038/s41598-017-14070-z

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


  18 in total

Review 1.  Review of statistical methods for QTL mapping in experimental crosses.

Authors:  K W Broman
Journal:  Lab Anim (NY)       Date:  2001 Jul-Aug       Impact factor: 12.625

2.  Bayesian shrinkage estimation of quantitative trait loci parameters.

Authors:  Hui Wang; Yuan-Ming Zhang; Xinmin Li; Godfred L Masinde; Subburaman Mohan; David J Baylink; Shizhong Xu
Journal:  Genetics       Date:  2005-03-21       Impact factor: 4.562

Review 3.  Genomic selection.

Authors:  M E Goddard; B J Hayes
Journal:  J Anim Breed Genet       Date:  2007-12       Impact factor: 2.380

4.  An expectation-maximization algorithm for the Lasso estimation of quantitative trait locus effects.

Authors:  S Xu
Journal:  Heredity (Edinb)       Date:  2010-01-06       Impact factor: 3.821

5.  Multiple-trait genomic selection methods increase genetic value prediction accuracy.

Authors:  Yi Jia; Jean-Luc Jannink
Journal:  Genetics       Date:  2012-10-19       Impact factor: 4.562

6.  A comment on the simple regression method for interval mapping.

Authors:  S Xu
Journal:  Genetics       Date:  1995-12       Impact factor: 4.562

Review 7.  A review on SNP and other types of molecular markers and their use in animal genetics.

Authors:  Alain Vignal; Denis Milan; Magali SanCristobal; André Eggen
Journal:  Genet Sel Evol       Date:  2002 May-Jun       Impact factor: 4.297

8.  Mapping quantitative trait loci controlling milk production in dairy cattle by exploiting progeny testing.

Authors:  M Georges; D Nielsen; M Mackinnon; A Mishra; R Okimoto; A T Pasquino; L S Sargeant; A Sorensen; M R Steele; X Zhao
Journal:  Genetics       Date:  1995-02       Impact factor: 4.562

9.  Potential benefits of genomic selection on genetic gain of small ruminant breeding programs.

Authors:  F Shumbusho; J Raoul; J M Astruc; I Palhiere; J M Elsen
Journal:  J Anim Sci       Date:  2013-06-04       Impact factor: 3.159

10.  Gains in QTL detection using an ultra-high density SNP map based on population sequencing relative to traditional RFLP/SSR markers.

Authors:  Huihui Yu; Weibo Xie; Jia Wang; Yongzhong Xing; Caiguo Xu; Xianghua Li; Jinghua Xiao; Qifa Zhang
Journal:  PLoS One       Date:  2011-03-03       Impact factor: 3.240

View more
  4 in total

1.  NOREVA: enhanced normalization and evaluation of time-course and multi-class metabolomic data.

Authors:  Qingxia Yang; Yunxia Wang; Ying Zhang; Fengcheng Li; Weiqi Xia; Ying Zhou; Yunqing Qiu; Honglin Li; Feng Zhu
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

2.  Accurate diagnosis of atopic dermatitis by combining transcriptome and microbiota data with supervised machine learning.

Authors:  Ziyuan Jiang; Jiajin Li; Nahyun Kong; Jeong-Hyun Kim; Bong-Soo Kim; Min-Jung Lee; Yoon Mee Park; So-Yeon Lee; Soo-Jong Hong; Jae Hoon Sul
Journal:  Sci Rep       Date:  2022-01-07       Impact factor: 4.379

3.  Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients.

Authors:  Alejandro Reina Reina; José M Barrera; Bernardo Valdivieso; María-Eugenia Gas; Alejandro Maté; Juan C Trujillo
Journal:  Sci Rep       Date:  2022-04-06       Impact factor: 4.379

4.  Integration of DNA Methylation and Transcriptome Data Improves Complex Trait Prediction in Hordeum vulgare.

Authors:  Pernille Bjarup Hansen; Anja Karine Ruud; Gustavo de Los Campos; Marta Malinowska; Istvan Nagy; Simon Fiil Svane; Kristian Thorup-Kristensen; Jens Due Jensen; Lene Krusell; Torben Asp
Journal:  Plants (Basel)       Date:  2022-08-24
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.