Literature DB >> 21749471

Dimension reduction and variable selection for genomic selection: application to predicting milk yield in Holsteins.

N Long1, D Gianola, G J M Rosa, K A Weigel.   

Abstract

Genome-assisted prediction of genetic merit of individuals for a quantitative trait requires building statistical models that can handle data sets consisting of a massive number of markers and many fewer observations. Numerous regression models have been proposed in which marker effects are treated as random variables. Alternatively, multivariate dimension reduction techniques [such as principal component regression (PCR) and partial least-squares regression (PLS)] model a small number of latent components which are linear combinations of original variables, thereby reducing dimensionality. Further, marker selection has drawn increasing attention in genomic selection. This study evaluated two dimension reduction methods, namely, supervised PCR and sparse PLS, for predicting genomic breeding values (BV) of dairy bulls for milk yield using single-nucleotide polymorphisms (SNPs). These two methods perform variable selection in addition to reducing dimensionality. Supervised PCR preselects SNPs based on the strength of association of each SNP with the phenotype. Sparse PLS promotes sparsity by imposing some penalty on the coefficients of linear combinations of original SNP variables. Two types of supervised PCR (I and II) were examined. Method I was based on single-SNP analyses, whereas method II was based on multiple-SNP analyses. Supervised PCR II was clearly better than supervised PCR I in predictive ability when evaluated on SNP subsets of various sizes, and sparse PLS was in between. Supervised PCR II and sparse PLS attained similar predictive correlations when the size of the SNP subset was below 1000. Supervised PCR II with 300 and 500 SNPs achieved correlations of 0.54 and 0.59, respectively, corresponding to 80 and 87% of the correlation (0.68) obtained with all 32 518 SNPs in a PCR model. The predictive correlation of supervised PCR II reached a plateau of 0.68 when the number of SNPs increased to 3500. Our results demonstrate the potential of combining dimension reduction and variable selection for accurate and cost-effective prediction of genomic BV.
© 2011 Blackwell Verlag GmbH.

Mesh:

Year:  2011        PMID: 21749471     DOI: 10.1111/j.1439-0388.2011.00917.x

Source DB:  PubMed          Journal:  J Anim Breed Genet        ISSN: 0931-2668            Impact factor:   2.380


  12 in total

1.  Exploring the areas of applicability of whole-genome prediction methods for Asian rice (Oryza sativa L.).

Authors:  Akio Onogi; Osamu Ideta; Yuto Inoshita; Kaworu Ebana; Takuma Yoshioka; Masanori Yamasaki; Hiroyoshi Iwata
Journal:  Theor Appl Genet       Date:  2014-10-24       Impact factor: 5.699

2.  Genomic selection using principal component regression.

Authors:  Caroline Du; Julong Wei; Shibo Wang; Zhenyu Jia
Journal:  Heredity (Edinb)       Date:  2018-05-01       Impact factor: 3.821

3.  A comparison of principal component regression and genomic REML for genomic prediction across populations.

Authors:  Christos Dadousis; Roel F Veerkamp; Bjørg Heringstad; Marcin Pszczola; Mario P L Calus
Journal:  Genet Sel Evol       Date:  2014-11-05       Impact factor: 4.297

4.  Enlarging a training set for genomic selection by imputation of un-genotyped animals in populations of varying genetic architecture.

Authors:  Eduardo C G Pimentel; Monika Wensch-Dorendorf; Sven König; Hermann H Swalve
Journal:  Genet Sel Evol       Date:  2013-04-26       Impact factor: 4.297

Review 5.  Whole-genome regression and prediction methods applied to plant and animal breeding.

Authors:  Gustavo de Los Campos; John M Hickey; Ricardo Pong-Wong; Hans D Daetwyler; Mario P L Calus
Journal:  Genetics       Date:  2012-06-28       Impact factor: 4.562

6.  Accuracy of Genomic Prediction in Switchgrass (Panicum virgatum L.) Improved by Accounting for Linkage Disequilibrium.

Authors:  Guillaume P Ramstein; Joseph Evans; Shawn M Kaeppler; Robert B Mitchell; Kenneth P Vogel; C Robin Buell; Michael D Casler
Journal:  G3 (Bethesda)       Date:  2016-04-07       Impact factor: 3.154

7.  Genomic Prediction of Breeding Values Using a Subset of SNPs Identified by Three Machine Learning Methods.

Authors:  Bo Li; Nanxi Zhang; You-Gan Wang; Andrew W George; Antonio Reverter; Yutao Li
Journal:  Front Genet       Date:  2018-07-04       Impact factor: 4.599

8.  Incorporating Prior Knowledge of Principal Components in Genomic Prediction.

Authors:  Sayed M Hosseini-Vardanjani; Mohammad M Shariati; Hossein Moradi Shahrebabak; Mojtaba Tahmoorespur
Journal:  Front Genet       Date:  2018-08-02       Impact factor: 4.599

9.  SNP variable selection by generalized graph domination.

Authors:  Shuzhen Sun; Zhuqi Miao; Blaise Ratcliffe; Polly Campbell; Bret Pasch; Yousry A El-Kassaby; Balabhaskar Balasundaram; Charles Chen
Journal:  PLoS One       Date:  2019-01-24       Impact factor: 3.240

10.  Integrating Molecular Markers and Environmental Covariates To Interpret Genotype by Environment Interaction in Rice (Oryza sativa L.) Grown in Subtropical Areas.

Authors:  Eliana Monteverde; Lucía Gutierrez; Pedro Blanco; Fernando Pérez de Vida; Juan E Rosas; Victoria Bonnecarrère; Gastón Quero; Susan McCouch
Journal:  G3 (Bethesda)       Date:  2019-05-07       Impact factor: 3.154

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.