Literature DB >> 33822175

Digital insights: bridging the phenotype-to-genotype divide.

Matthew F McCabe1, Mark Tester2.   

Abstract

Entities:  

Keywords:  Digital agriculture; drones; genome-wide association studies; genotype; machine learning; phenotype; remote sensing; unmanned aerial vehicles

Year:  2021        PMID: 33822175      PMCID: PMC8023183          DOI: 10.1093/jxb/erab108

Source DB:  PubMed          Journal:  J Exp Bot        ISSN: 0022-0957            Impact factor:   6.992


× No keyword cloud information.
The convergence of autonomous platforms for field-based phenotyping with advances in machine learning for big data analytics and rapid sequencing for genome description herald the promise of new insights and discoveries in the plant sciences. With growing populations driving an increased demand for food over the next few decades (Foley ), combined with changes in climate that present as threat multipliers (Wheeler and von Braun, 2013), there is an immediate need to develop tools and techniques for enhancing the performance of our agricultural systems (Tester and Langridge, 2010). Digital agriculture is at the forefront of this effort (Shepherd ). In broad terms, this encompasses the collection, analysis, and interpretation of data across the food production system, using a variety of approaches to turn gathered information into actionable knowledge. At its core, digital agriculture is simply a data mining activity, where the output is enhanced knowledge on crop condition or physiological behaviour, as well as insights into on-farm responses to environmental changes. The expectation is that a data-driven approach to agricultural production has the potential to deliver a more sustainable utilization and management of resources, while increasing the output of farms across diverse locations and environments (Basso and Antle, 2020). A key element of digital agriculture is the procurement of high-resolution spatial and temporal data in a timely and useable manner. Indeed, the promise of remote sensing for improving the characterization and description of this field has been evident for many decades (McCabe ). Unfortunately, it is one that has remained largely unfulfilled, principally as a consequence of inadequate spatial detail. However, the recent rise of unmanned aerial vehicles (UAVs) has witnessed a revolution in the way spatial information can be obtained and interpreted, offering capacity for on-demand sensing with high spatio-temporal coverage (Maes and Steppe, 2019). For UAVs, a wide range of sensing payloads have become available (see Box 1), ranging from traditional optical red–green–blue (RGB) systems, to advanced hyperspectral and LIDAR sensors, offering the possibility of derivable metrics that can be processed in near real-time and at spatially explicit scales. With such flexibility, unique fusions of optical, thermal, and multi- and hyperspectral sensors facilitate the retrieval of numerous land surface parameters and variables, spanning canopy structural properties, vegetation health and condition, soil and leaf temperatures, and even higher order variables such as evaporation and transpiration (Manfreda ). Given its capacity for local-to-global coverage, remote sensing has an obvious role in advancing precision agriculture and plant phenotyping studies (Weiss ). However, satellite-based observations are constrained by issues of temporal frequency and spatial resolution, as well as the latency with which data can be obtained. Although commercial high-resolution satellite data are increasingly available for precision agricultural applications (Houborg and McCabe, 2018), it is not yet at the resolution to deliver the type of plant-to-leaf scale retrievals needed for phenotyping studies (McCabe ). On the other hand, the increasing availability of flight-stable and user-friendly unmanned aerial vehicle (UAV) platforms that can be combined with lightweight miniaturized sensors has dramatically expanded the capacity for these types of focused investigations (Mahlein, 2016; Yang ). UAVs offer the possibility of ultra-high (subcentimetre) resolution sensing with on-demand sampling, realizing a considerable advantage over satellite-based sensing. An expanding array of sensing systems include combinations of thermal (Khanal et al., 2017), multispectral, and hyperspectral sensors (Aasen ), which facilitate the retrieval of a variety of vegetation indices, pigment-based retrievals, and plant structural properties (Berni ). Other sensing systems include active LIDAR for detailed structural mapping (Madec ) and even fluorescence systems that provide a means for monitoring photosynthesis and stress (Zarco-Tejada ). In combination with field-based sampling and laboratory-based analysis, UAV and related phenotyping technologies are poised to deliver knowledge advances not just to scientists but, more importantly, to farmers and breeders (Hickey ). As a consequence of these developments, we are increasingly awash with rich geospatial datasets. Yet data availability is only one aspect of the knowledge equation. To realize impact, we must also develop the tools and techniques that can turn this information into usable and useful products. New and improved retrieval algorithms, advanced data-fusion and data-harmonization approaches, and, increasingly, machine learning techniques, are all required. However, so too are more fundamental assessments that relate these varied observations to detailed in situ collections, providing a basis to drive improvements in process descriptions and allowing new mechanistic relationships to be developed, as well as improved understanding of underlying biological responses to environmental change. The opportunity to exploit remotely sensed features in the field, and then use these to identify key genetic determinants of phenotypes in the laboratory, represents one of the latest and (potentially) most impactful of frontiers (Shi ). In the case of Han , remote sensing-driven insights support a detailed genetics-based investigation of a key phenotypic characteristic, namely differential floral opening and closing times. Using a multitemporal collection of UAV-derived RGB imagery (and supported by some prior ground-based photo and video sequences), the authors were able to produce a training–testing dataset to be used as input into several machine learning approaches, with the ultimate aim of automatically classifying the spatio-temporal behaviour of numerous individual lettuce plants within a field, and then to link the observed response to subsequent genetic analyses. Although RGB imagery was used in this example, there is an extensive and growing body of work devoted to the retrieval of specific plant spectral characteristics that may offer additional insights into patterns of plant growth and response. How these multiple plant functional traits and inter-related descriptors can be used to provide even deeper insights into phenotyping (and genotyping) studies represents an exciting area of research. Increasingly, we are seeing the implementation of machine learning approaches driving these insights (Bauer ). Machine learning has already proven to be a versatile tool to assist in unlocking the vast volumes of both remote sensing (Zhu ) and genomic data (Eraslan ). However, its application routinely presents its own challenges. An aspect of this was investigated by the authors via their considered assessment of several learning approaches. The rationale for doing this is sound since, at this stage, it remains unclear under what circumstances and pre-conditions a specific machine learning technique is best suited for use in any particular application; that is, it is unusual to know the optimal approach to apply a priori. Reproducibility represents another aspect requiring careful attention: the need for thorough documentation of parameter selection (and the underlying justification for these), let alone the myriad other tuning elements available to the user, highlights the need for thorough description. Finally, the training and accuracy assessments required in machine learning are critical to understanding their efficacy. Confusion matrices, classification errors, sensitivity, specificity, and false-positive rates all influence the relative worth of any particular approach. A high prediction accuracy can be achieved by many different means, and for many different reasons: it may indicate either a robust, or an overly tuned, highly specific model. Resolving the conflict between model generality and transferability remains a key challenge not limited to the plant sciences (Yosinski ). Just as phenotyping of field-grown plants is being significantly enhanced by remote sensing and machine learning, so genotyping is being significantly enhanced by the power of high-throughput sequencing and the rapid proliferation of high-quality genome sequences. Phenotyping in controlled environments still has an important role, especially to investigate the effects on plants of manipulations to the environment; however, the value of field phenotyping is, of course, that measured traits are more likely to be more directly relevant to field-grown crops, and thus of use to breeders and agronomists. Similarly, on the genotyping side, as gold and platinum standard reference genomes of crops become increasingly available, and whole-genome re-sequencing becomes trivially cheap, so the need to use model species reduces. We are living in exciting times for plant science. UAV imaging and analyses are ideal for field trials of mapping populations, where ultra-high spatial and temporal resolution information that cannot be manually collected in the field is so valuable. However, one aspect that requires increased attention is the use of sensors and Internet-of-Things (IoT) approaches to accurately describe the environment in which the plants are growing (Papoutsoglou ). In a similar vein, we need to remain vigilant to the often strong effects of genotype–environment (G×E) interactions, and the need to repeat experiments across more than one season. An extra layer on top of the work of Han et al. would be to identify where genetic controls of traits were stable across seasons, and where loci had different effects in the face of different temperatures and day-lengths, for example. This could be particularly important for lettuce, where flowering time is well known to be strongly affected by these environmental parameters. It should also be noted that genetic studies could be considered to be ‘simply’ the (admittedly sophisticated) positing of hypotheses, and that such work needs to be built upon with the testing of the effects of candidate genes on plant phenotype. In a diversity panel, for example, this could start with a haplotype analysis of candidate genes, where differences in phenotypes are correlated with different alleles of candidate genes. In a bi-parental population, such as used by Han et al., fine mapping approaches can be used. Ultimately, testing of candidate gene function is usually done using reverse genetic approaches, such as gene editing to knock out gene function.

The path forward

While discoveries and insights are often (waiting) to be found within the overlapping boundaries of aligned disciplines, it is a domain that presents more than just knowledge-related barriers. Language and terminology present as real constraints, and the need for researchers to be conversant in multiple fields is paramount. This is certainly the case with applications of machine learning to various disciplines (e.g. remote sensing or plant sciences), and even more so when multiple disciplines converge, as we see in Han . The availability of these emerging technologies and techniques, and their cross-field potential, is encouraging this dialogue, which can only be of benefit to all fields of investigation. Still, it remains to be seen whether the information revolution that is occurring within the agricultural sector will resolve to the scales necessary to realize change at the level of the farm or the seed company, or whether the researcher to end-user divide will remain. Perhaps more importantly, it is imperative that these technologies do not just produce more data and information, but lead to actual insight and knowledge. Developing use cases and objectively demonstrating the information benefit, as is done in Han , represents an important step towards realizing this goal.
  15 in total

1.  Solutions for a cultivated planet.

Authors:  Jonathan A Foley; Navin Ramankutty; Kate A Brauman; Emily S Cassidy; James S Gerber; Matt Johnston; Nathaniel D Mueller; Christine O'Connell; Deepak K Ray; Paul C West; Christian Balzer; Elena M Bennett; Stephen R Carpenter; Jason Hill; Chad Monfreda; Stephen Polasky; Johan Rockström; John Sheehan; Stefan Siebert; David Tilman; David P M Zaks
Journal:  Nature       Date:  2011-10-12       Impact factor: 49.962

Review 2.  Perspectives for Remote Sensing with Unmanned Aerial Vehicles in Precision Agriculture.

Authors:  Wouter H Maes; Kathy Steppe
Journal:  Trends Plant Sci       Date:  2018-12-15       Impact factor: 18.313

3.  Drone phenotyping and machine learning enable discovery of loci regulating daily floral opening in lettuce.

Authors:  Rongkui Han; Andy J Y Wong; Zhehan Tang; Maria J Truco; Dean O Lavelle; Alexander Kozik; Yufang Jin; Richard W Michelmore
Journal:  J Exp Bot       Date:  2021-04-02       Impact factor: 6.992

Review 4.  Breeding technologies to increase crop production in a changing world.

Authors:  Mark Tester; Peter Langridge
Journal:  Science       Date:  2010-02-12       Impact factor: 47.728

Review 5.  Deep learning: new computational modelling techniques for genomics.

Authors:  Gökcen Eraslan; Žiga Avsec; Julien Gagneur; Fabian J Theis
Journal:  Nat Rev Genet       Date:  2019-07       Impact factor: 53.242

Review 6.  Breeding crops to feed 10 billion.

Authors:  Lee T Hickey; Amber N Hafeez; Hannah Robinson; Scott A Jackson; Soraya C M Leal-Bertioli; Mark Tester; Caixia Gao; Ian D Godwin; Ben J Hayes; Brande B H Wulff
Journal:  Nat Biotechnol       Date:  2019-06-17       Impact factor: 54.908

7.  High-Throughput Phenotyping of Plant Height: Comparing Unmanned Aerial Vehicles and Ground LiDAR Estimates.

Authors:  Simon Madec; Fred Baret; Benoît de Solan; Samuel Thomas; Dan Dutartre; Stéphane Jezequel; Matthieu Hemmerlé; Gallian Colombeau; Alexis Comar
Journal:  Front Plant Sci       Date:  2017-11-27       Impact factor: 5.753

8.  Combining computer vision and deep learning to enable ultra-scale aerial phenotyping and precision agriculture: A case study of lettuce production.

Authors:  Alan Bauer; Aaron George Bostrom; Joshua Ball; Christopher Applegate; Tao Cheng; Stephen Laycock; Sergio Moreno Rojas; Jacob Kirwan; Ji Zhou
Journal:  Hortic Res       Date:  2019-06-01       Impact factor: 6.793

Review 9.  Priorities for science to overcome hurdles thwarting the full promise of the 'digital agriculture' revolution.

Authors:  Mark Shepherd; James A Turner; Bruce Small; David Wheeler
Journal:  J Sci Food Agric       Date:  2018-10-22       Impact factor: 3.638

10.  Enabling reusability of plant phenomic datasets with MIAPPE 1.1.

Authors:  Evangelia A Papoutsoglou; Daniel Faria; Daniel Arend; Elizabeth Arnaud; Ioannis N Athanasiadis; Inês Chaves; Frederik Coppens; Guillaume Cornut; Bruno V Costa; Hanna Ćwiek-Kupczyńska; Bert Droesbeke; Richard Finkers; Kristina Gruden; Astrid Junker; Graham J King; Paweł Krajewski; Matthias Lange; Marie-Angélique Laporte; Célia Michotey; Markus Oppermann; Richard Ostler; Hendrik Poorter; Ricardo Ramı Rez-Gonzalez; Živa Ramšak; Jochen C Reif; Philippe Rocca-Serra; Susanna-Assunta Sansone; Uwe Scholz; François Tardieu; Cristobal Uauy; Björn Usadel; Richard G F Visser; Stephan Weise; Paul J Kersey; Célia M Miguel; Anne-Françoise Adam-Blondon; Cyril Pommier
Journal:  New Phytol       Date:  2020-04-25       Impact factor: 10.323

View more
  1 in total

1.  A Machine-Learning Method to Assess Growth Patterns in Plants of the Family Lemnaceae.

Authors:  Leone Ermes Romano; Maurizio Iovane; Luigi Gennaro Izzo; Giovanna Aronne
Journal:  Plants (Basel)       Date:  2022-07-23
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.