Literature DB >> 27540725

Multi-task Gaussian process for imputing missing data in multi-trait and multi-environment trials.

Tomoaki Hori1, David Montcho2, Clement Agbangla3, Kaworu Ebana4, Koichi Futakuchi2, Hiroyoshi Iwata5.   

Abstract

KEY MESSAGE: A method based on a multi-task Gaussian process using self-measuring similarity gave increased accuracy for imputing missing phenotypic data in multi-trait and multi-environment trials. Multi-environmental trial (MET) data often encounter the problem of missing data. Accurate imputation of missing data makes subsequent analysis more effective and the results easier to understand. Moreover, accurate imputation may help to reduce the cost of phenotyping for thinned-out lines tested in METs. METs are generally performed for multiple traits that are correlated to each other. Correlation among traits can be useful information for imputation, but single-trait-based methods cannot utilize information shared by traits that are correlated. In this paper, we propose imputation methods based on a multi-task Gaussian process (MTGP) using self-measuring similarity kernels reflecting relationships among traits, genotypes, and environments. This framework allows us to use genetic correlation among multi-trait multi-environment data and also to combine MET data and marker genotype data. We compared the accuracy of three MTGP methods and iterative regularized PCA using rice MET data. Two scenarios for the generation of missing data at various missing rates were considered. The MTGP performed a better imputation accuracy than regularized PCA, especially at high missing rates. Under the 'uniform' scenario, in which missing data arise randomly, inclusion of marker genotype data in the imputation increased the imputation accuracy at high missing rates. Under the 'fiber' scenario, in which missing data arise in all traits for some combinations between genotypes and environments, the inclusion of marker genotype data decreased the imputation accuracy for most traits while increasing the accuracy in a few traits remarkably. The proposed methods will be useful for solving the missing data problem in MET data.

Entities:  

Mesh:

Year:  2016        PMID: 27540725     DOI: 10.1007/s00122-016-2760-9

Source DB:  PubMed          Journal:  Theor Appl Genet        ISSN: 0040-5752            Impact factor:   5.699


  17 in total

1.  Genomic-assisted prediction of genetic value with semiparametric procedures.

Authors:  Daniel Gianola; Rohan L Fernando; Alessandra Stella
Journal:  Genetics       Date:  2006-04-28       Impact factor: 4.562

2.  Integrating environmental covariates and crop modeling into the genomic selection framework to predict genotype by environment interactions.

Authors:  Nicolas Heslot; Deniz Akdemir; Mark E Sorrells; Jean-Luc Jannink
Journal:  Theor Appl Genet       Date:  2013-11-22       Impact factor: 5.699

3.  Genotypic stability and adaptability in tropical maize based on AMMI and GGE biplot analysis.

Authors:  M Balestre; R G Von Pinho; J C Souza; R L Oliveira
Journal:  Genet Mol Res       Date:  2009-11-03

4.  Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers.

Authors:  José Crossa; Gustavo de Los Campos; Paulino Pérez; Daniel Gianola; Juan Burgueño; José Luis Araus; Dan Makumbi; Ravi P Singh; Susanne Dreisigacker; Jianbing Yan; Vivi Arief; Marianne Banziger; Hans-Joachim Braun
Journal:  Genetics       Date:  2010-09-02       Impact factor: 4.562

5.  Multi-environment multi-QTL association mapping identifies disease resistance QTL in barley germplasm from Latin America.

Authors:  Lucia Gutiérrez; Silvia Germán; Silvia Pereyra; Patrick M Hayes; Carlos A Pérez; Flavio Capettini; Andres Locatelli; Natalia M Berberian; Esteban E Falconi; Rigoberto Estrada; Dario Fros; Victor Gonza; Hernan Altamirano; Julio Huerta-Espino; Edgar Neyra; Gisella Orjeda; Sergio Sandoval-Islas; Ravi Singh; Kelly Turkington; Ariel J Castro
Journal:  Theor Appl Genet       Date:  2014-12-30       Impact factor: 5.699

6.  Fine definition of the pedigree haplotypes of closely related rice cultivars by means of genome-wide discovery of single-nucleotide polymorphisms.

Authors:  Toshio Yamamoto; Hideki Nagasaki; Jun-ichi Yonemaru; Kaworu Ebana; Maiko Nakajima; Taeko Shibaya; Masahiro Yano
Journal:  BMC Genomics       Date:  2010-04-27       Impact factor: 3.969

7.  A mixed-model quantitative trait loci (QTL) analysis for multiple-environment trial data using environmental covariables for QTL-by-environment interactions, with an example in maize.

Authors:  Martin P Boer; Deanne Wright; Lizhi Feng; Dean W Podlich; Lang Luo; Mark Cooper; Fred A van Eeuwijk
Journal:  Genetics       Date:  2007-10-18       Impact factor: 4.562

8.  Genome wide association study for drought, aflatoxin resistance, and important agronomic traits of maize hybrids in the sub-tropics.

Authors:  Ivan D Barrero Farfan; Gerald N De La Fuente; Seth C Murray; Thomas Isakeit; Pei-Cheng Huang; Marilyn Warburton; Paul Williams; Gary L Windham; Mike Kolomiets
Journal:  PLoS One       Date:  2015-02-25       Impact factor: 3.240

9.  A reaction norm model for genomic selection using high-dimensional genomic and environmental data.

Authors:  Diego Jarquín; José Crossa; Xavier Lacaze; Philippe Du Cheyron; Joëlle Daucourt; Josiane Lorgeou; François Piraux; Laurent Guerreiro; Paulino Pérez; Mario Calus; Juan Burgueño; Gustavo de los Campos
Journal:  Theor Appl Genet       Date:  2013-12-12       Impact factor: 5.699

Review 10.  Kernel-based whole-genome prediction of complex traits: a review.

Authors:  Gota Morota; Daniel Gianola
Journal:  Front Genet       Date:  2014-10-16       Impact factor: 4.599

View more
  5 in total

1.  3D-MICE: integration of cross-sectional and longitudinal imputation for multi-analyte longitudinal clinical data.

Authors:  Yuan Luo; Peter Szolovits; Anand S Dighe; Jason M Baron
Journal:  J Am Med Inform Assoc       Date:  2018-06-01       Impact factor: 4.497

2.  Context-Aware Time Series Imputation for Multi-Analyte Clinical Data.

Authors:  Kejing Yin; Liaoliao Feng; William K Cheung
Journal:  J Healthc Inform Res       Date:  2020-10-18

3.  A Combined Interpolation and Weighted K-Nearest Neighbours Approach for the Imputation of Longitudinal ICU Laboratory Data.

Authors:  Sebastian Daberdaku; Erica Tavazzi; Barbara Di Camillo
Journal:  J Healthc Inform Res       Date:  2020-03-02

4.  Advantages and limitations of multiple-trait genomic prediction for Fusarium head blight severity in hybrid wheat (Triticum aestivum L.).

Authors:  Albert W Schulthess; Yusheng Zhao; C Friedrich H Longin; Jochen C Reif
Journal:  Theor Appl Genet       Date:  2017-12-02       Impact factor: 5.699

5.  Exploiting mutual information for the imputation of static and dynamic mixed-type clinical data with an adaptive k-nearest neighbours approach.

Authors:  Erica Tavazzi; Sebastian Daberdaku; Rosario Vasta; Andrea Calvo; Adriano Chiò; Barbara Di Camillo
Journal:  BMC Med Inform Decis Mak       Date:  2020-08-20       Impact factor: 2.796

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.