Literature DB >> 26174666

Evaluation of random forest regression for prediction of breeding value from genomewide SNPs.

Rupam Kumar Sarkar1, A R Rao, Prabina Kumar Meher, T Nepolean, T Mohapatra.   

Abstract

Genomic prediction is meant for estimating the breeding value using molecular marker data which has turned out to be a powerful tool for efficient utilization of germplasm resources and rapid improvement of cultivars. Model-based techniques have been widely used for prediction of breeding values of genotypes from genomewide association studies. However, application of the random forest (RF), a model-free ensemble learning method, is not widely used for prediction. In this study, the optimum values of tuning parameters of RF have been identified and applied to predict the breeding value of genotypes based on genomewide single-nucleotide polymorphisms (SNPs), where the number of SNPs (P variables) is much higher than the number of genotypes (n observations) (P » n). Further, a comparison was made with the model-based genomic prediction methods, namely, least absolute shrinkage and selection operator (LASSO), ridge regression (RR) and elastic net (EN) under P » n. It was found that the correlations between the predicted and observed trait response were 0.591, 0.539, 0.431 and 0.587 for RF, LASSO, RR and EN, respectively, which implies superiority of the RF over the model-based techniques in genomic prediction. Hence, we suggest that the RF methodology can be used as an alternative to the model-based techniques for the prediction of breeding value at genome level with higher accuracy.

Mesh:

Year:  2015        PMID: 26174666     DOI: 10.1007/s12041-015-0501-5

Source DB:  PubMed          Journal:  J Genet        ISSN: 0022-1333            Impact factor:   1.166


  9 in total

1.  A comparison of dairy cattle breeding designs that use genomic selection.

Authors:  M Lillehammer; T H E Meuwissen; A K Sonesson
Journal:  J Dairy Sci       Date:  2011-01       Impact factor: 4.034

Review 2.  Genomic selection in plant breeding: from theory to practice.

Authors:  Jean-Luc Jannink; Aaron J Lorenz; Hiroyoshi Iwata
Journal:  Brief Funct Genomics       Date:  2010-02-15       Impact factor: 4.241

3.  Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers.

Authors:  José Crossa; Gustavo de Los Campos; Paulino Pérez; Daniel Gianola; Juan Burgueño; José Luis Araus; Dan Makumbi; Ravi P Singh; Susanne Dreisigacker; Jianbing Yan; Vivi Arief; Marianne Banziger; Hans-Joachim Braun
Journal:  Genetics       Date:  2010-09-02       Impact factor: 4.562

4.  Genetic architecture of maize kernel composition in the nested association mapping and inbred association panels.

Authors:  Jason P Cook; Michael D McMullen; James B Holland; Feng Tian; Peter Bradbury; Jeffrey Ross-Ibarra; Edward S Buckler; Sherry A Flint-Garcia
Journal:  Plant Physiol       Date:  2011-12-01       Impact factor: 8.340

5.  Development of highly polymorphic SNP markers from the complexity reduced portion of maize [Zea mays L.] genome for use in marker-assisted breeding.

Authors:  Jafar A Mammadov; Wei Chen; Ruihua Ren; Reetal Pai; Wesley Marchione; Feyruz Yalçin; Hanneke Witsenboer; Thomas W Greene; Steven A Thompson; Siva P Kumpatla
Journal:  Theor Appl Genet       Date:  2010-04-18       Impact factor: 5.699

6.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

7.  An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests.

Authors:  Carolin Strobl; James Malley; Gerhard Tutz
Journal:  Psychol Methods       Date:  2009-12

8.  Understanding and using quantitative genetic variation.

Authors:  William G Hill
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2010-01-12       Impact factor: 6.237

9.  SNP markers and their impact on plant breeding.

Authors:  Jafar Mammadov; Rajat Aggarwal; Ramesh Buyyarapu; Siva Kumpatla
Journal:  Int J Plant Genomics       Date:  2012-12-18
  9 in total
  7 in total

1.  Genome-Enabled Prediction Methods Based on Machine Learning.

Authors:  Edgar L Reinoso-Peláez; Daniel Gianola; Oscar González-Recio
Journal:  Methods Mol Biol       Date:  2022

2.  A Linear Regression Predictor for Identifying N6-Methyladenosine Sites Using Frequent Gapped K-mer Pattern.

Authors:  Y Y Zhuang; H J Liu; X Song; Y Ju; H Peng
Journal:  Mol Ther Nucleic Acids       Date:  2019-10-10       Impact factor: 8.886

3.  Would large dataset sample size unveil the potential of deep neural networks for improved genome-enabled prediction of complex traits? The case for body weight in broilers.

Authors:  Tiago L Passafaro; Fernando B Lopes; João R R Dórea; Mark Craven; Vivian Breen; Rachel J Hawken; Guilherme J M Rosa
Journal:  BMC Genomics       Date:  2020-11-09       Impact factor: 3.969

4.  Deep polygenic neural network for predicting and identifying yield-associated genes in Indonesian rice accessions.

Authors:  Nicholas Dominic; Tjeng Wawan Cenggoro; Arif Budiarto; Bens Pardamean
Journal:  Sci Rep       Date:  2022-08-15       Impact factor: 4.996

5.  Multi-tissue transcriptome-wide association studies.

Authors:  Nastasiya F Grinberg; Chris Wallace
Journal:  Genet Epidemiol       Date:  2020-12-28       Impact factor: 2.135

6.  A zero altered Poisson random forest model for genomic-enabled prediction.

Authors:  Osval Antonio Montesinos-López; Abelardo Montesinos-López; Brandon A Mosqueda-Gonzalez; José Cricelio Montesinos-López; José Crossa; Nerida Lozano Ramirez; Pawan Singh; Felícitas Alejandra Valladares-Anguiano
Journal:  G3 (Bethesda)       Date:  2021-02-09       Impact factor: 3.154

7.  Investigation of prediction accuracy and the impact of sample size, ancestry, and tissue in transcriptome-wide association studies.

Authors:  James J Fryett; Andrew P Morris; Heather J Cordell
Journal:  Genet Epidemiol       Date:  2020-03-19       Impact factor: 2.344

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.