Literature DB >> 34448888

Predicting phenotypes from genetic, environment, management, and historical data using CNNs.

Jacob D Washburn1, Emre Cimen2,3, Guillaume Ramstein2,4, Timothy Reeves2, Patrick O'Briant2, Greg McLean5, Mark Cooper5, Graeme Hammer5, Edward S Buckler2,6.   

Abstract

KEY MESSAGE: Convolutional Neural Networks (CNNs) can perform similarly or better than standard genomic prediction methods when sufficient genetic, environmental, and management data are provided. Predicting phenotypes from genetic (G), environmental (E), and management (M) conditions is a long-standing challenge with implications to agriculture, medicine, and conservation. Most methods reduce the factors in a dataset (feature engineering) in a subjective and potentially oversimplified manner. Deep neural networks such as Multilayer Perceptrons (MPL) and Convolutional Neural Networks (CNN) can overcome this by allowing the data itself to determine which factors are most important. CNN models were developed for predicting agronomic yield from a combination of replicated trials and historical yield survey data. The results were more accurate than standard methods when tested on held-out G, E, and M data (r = 0.50 vs. r = 0.43), and performed slightly worse than standard methods when only G was held out (r = 0.74 vs. r = 0.80). Pre-training on historical data increased accuracy compared to trial data alone. Saliency map analysis indicated the CNN has "learned" to prioritize many factors of known agricultural importance.
© 2021. This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply.

Entities:  

Mesh:

Year:  2021        PMID: 34448888     DOI: 10.1007/s00122-021-03943-7

Source DB:  PubMed          Journal:  Theor Appl Genet        ISSN: 0040-5752            Impact factor:   5.699


  21 in total

1.  Best linear unbiased estimation and prediction under a selection model.

Authors:  C R Henderson
Journal:  Biometrics       Date:  1975-06       Impact factor: 2.571

2.  TASSEL: software for association mapping of complex traits in diverse samples.

Authors:  Peter J Bradbury; Zhiwu Zhang; Dallas E Kroon; Terry M Casstevens; Yogesh Ramdoss; Edward S Buckler
Journal:  Bioinformatics       Date:  2007-06-22       Impact factor: 6.937

Review 3.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

Review 4.  Genomic Selection in Plant Breeding: Methods, Models, and Perspectives.

Authors:  José Crossa; Paulino Pérez-Rodríguez; Jaime Cuevas; Osval Montesinos-López; Diego Jarquín; Gustavo de Los Campos; Juan Burgueño; Juan M González-Camacho; Sergio Pérez-Elizalde; Yoseph Beyene; Susanne Dreisigacker; Ravi Singh; Xuecai Zhang; Manje Gowda; Manish Roorkiwal; Jessica Rutkoski; Rajeev K Varshney
Journal:  Trends Plant Sci       Date:  2017-09-28       Impact factor: 18.313

5.  Lack of ethylene does not affect reproductive success and synergid cell death in Arabidopsis.

Authors:  Wenhao Li; Qiyun Li; Mohan Lyu; Zhijuan Wang; Zihan Song; Shangwei Zhong; Hongya Gu; Juan Dong; Thomas Dresselhaus; Sheng Zhong; Li-Jia Qu
Journal:  Mol Plant       Date:  2021-11-03       Impact factor: 21.949

6.  A reaction norm model for genomic selection using high-dimensional genomic and environmental data.

Authors:  Diego Jarquín; José Crossa; Xavier Lacaze; Philippe Du Cheyron; Joëlle Daucourt; Josiane Lorgeou; François Piraux; Laurent Guerreiro; Paulino Pérez; Mario Calus; Juan Burgueño; Gustavo de los Campos
Journal:  Theor Appl Genet       Date:  2013-12-12       Impact factor: 5.699

7.  Maize Genomes to Fields: 2014 and 2015 field season genotype, phenotype, environment, and inbred ear image datasets.

Authors:  Naser AlKhalifah; Darwin A Campbell; Celeste M Falcon; Jack M Gardiner; Nathan D Miller; Maria Cinta Romay; Ramona Walls; Renee Walton; Cheng-Ting Yeh; Martin Bohn; Jessica Bubert; Edward S Buckler; Ignacio Ciampitti; Sherry Flint-Garcia; Michael A Gore; Christopher Graham; Candice Hirsch; James B Holland; David Hooker; Shawn Kaeppler; Joseph Knoll; Nick Lauter; Elizabeth C Lee; Aaron Lorenz; Jonathan P Lynch; Stephen P Moose; Seth C Murray; Rebecca Nelson; Torbert Rocheford; Oscar Rodriguez; James C Schnable; Brian Scully; Margaret Smith; Nathan Springer; Peter Thomison; Mitchell Tuinstra; Randall J Wisser; Wenwei Xu; David Ertl; Patrick S Schnable; Natalia De Leon; Edgar P Spalding; Jode Edwards; Carolyn J Lawrence-Dill
Journal:  BMC Res Notes       Date:  2018-07-09

8.  A CNN-RNN Framework for Crop Yield Prediction.

Authors:  Saeed Khaki; Lizhi Wang; Sotirios V Archontoulis
Journal:  Front Plant Sci       Date:  2020-01-24       Impact factor: 5.753

9.  The effect of artificial selection on phenotypic plasticity in maize.

Authors:  Joseph L Gage; Diego Jarquin; Cinta Romay; Aaron Lorenz; Edward S Buckler; Shawn Kaeppler; Naser Alkhalifah; Martin Bohn; Darwin A Campbell; Jode Edwards; David Ertl; Sherry Flint-Garcia; Jack Gardiner; Byron Good; Candice N Hirsch; Jim Holland; David C Hooker; Joseph Knoll; Judith Kolkman; Greg Kruger; Nick Lauter; Carolyn J Lawrence-Dill; Elizabeth Lee; Jonathan Lynch; Seth C Murray; Rebecca Nelson; Jane Petzoldt; Torbert Rocheford; James Schnable; Patrick S Schnable; Brian Scully; Margaret Smith; Nathan M Springer; Srikant Srinivasan; Renee Walton; Teclemariam Weldekidan; Randall J Wisser; Wenwei Xu; Jianming Yu; Natalia de Leon
Journal:  Nat Commun       Date:  2017-11-07       Impact factor: 14.919

10.  Deep learning versus parametric and ensemble methods for genomic prediction of complex phenotypes.

Authors:  Rostam Abdollahi-Arpanahi; Daniel Gianola; Francisco Peñagaricano
Journal:  Genet Sel Evol       Date:  2020-02-24       Impact factor: 4.297

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.