| Literature DB >> 30547185 |
Guillaume P Ramstein1, Sarah E Jensen2, Edward S Buckler3,2,4.
Abstract
In the past, plant breeding has undergone three major transformations and is currently transitioning to a new technological phase, Breeding 4. This phase is characterized by the development of methods for biological design of plant varieties, including transformation and gene editing techniques directed toward causal loci. The application of such technologies will require to reliably estimate the effect of loci in plant genomes by avoiding the situation where the number of loci assayed (p) surpasses the number of plant genotypes (n). Here, we discuss approaches to avoid this curse of dimensionality (n ≪ p), which will involve analyzing intermediate phenotypes such as molecular traits and component traits related to plant morphology or physiology. Because these approaches will rely on novel data types such as DNA sequences and high-throughput phenotyping images, Breeding 4 will call for analyses that are complementary to traditional quantitative genetic studies, being based on machine learning techniques which make efficient use of sequence and image data. In this article, we will present some of these techniques and their application for prioritizing causal loci and developing improved varieties in Breeding 4.Entities:
Mesh:
Year: 2018 PMID: 30547185 PMCID: PMC6439136 DOI: 10.1007/s00122-018-3267-3
Source DB: PubMed Journal: Theor Appl Genet ISSN: 0040-5752 Impact factor: 5.699
Fig. 1Timeline of plant breeding phases. Breeding 1, selection with unknown loci; Breeding 2, selection by controlled crosses; Breeding 3, marker-assisted selection; Breeding 4, ideotype-based selection and transformation
Fig. 2The shift in statistical framework between Breeding 3 and Breeding 4. Breeding 4 will aim to avoid the curse of dimensionality (n ≪ p) which precludes the inference of causal loci, especially for traits controlled by many causal loci. Machine learning techniques such as neural networks will be useful under this novel framework, to model effects of DNA polymorphisms on endophenotypes and predict component traits from high-throughput phenotyping (HTP) data