High-throughput phenotyping of multicellular organisms: finding the link between genotype and phenotype.

Rosangela Sozzani, Philip N Benfey

Abstract

High-throughput phenotyping approaches (phenomics) are being combined with genome-wide genetic screens to identify alterations in phenotype that result from gene inactivation. Here we highlight promising technologies for 'phenome-scale' analyses in multicellular organisms.

Year:  2011        PMID: 21457493      PMCID: PMC3129668          DOI: 10.1186/gb-2011-12-3-219

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


Review

The availability of complete genomic sequences of many model organisms has made it possible to perform highly informative genome-wide functional analyses. For multicellular organisms (including the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster, the plants Arabidopsis thaliana and rice, as well as mouse), phenotypic analysis of genetic mutations is still one of the most effective ways to explore the function of a gene. Collections of strains with mutations in nearly every gene are now available, making it possible to analyze the phenotypes of a large number of independent strains. However, conventional analytic approaches, such as high-magnification microscopy at the single-cell level, require manual manipulation of samples and screening by eye, thus limiting throughput and presenting bottlenecks to large-scale genetic studies in multicellular organisms. Therefore, development of high-throughput methods, including automation in phenotyping and screening, is a strategy that is now coming to fruition [1]. Systematic large-scale phenotyping efforts have begun to generate information on a previously unattainable scale. For example, it was recently shown that even a highly dynamic process such as the division of human cells can be studied on a genome-wide scale by live imaging [2]. Cultured cells have also proved amenable to high-throughput phenotyping [2]. Although more challenging, the study of living organisms can provide insights into biological pathways, regulatory networks and/or cellular activity and behavior not obtainable from cultured cells [3-6]. Large-scale acquisition of phenotypic data can then predict important biological outputs, such as the roles of individual genes in development. Thus, high-throughput phenotyping approaches (that is, phenomics) can encompass a broad range of model systems and techniques aimed at understanding the link between genotype and phenotype. 
A good example of the evolution of high-throughput phenotyping is provided by RNA interference (RNAi) screens in the worm C. elegans, where recent advances in robotic sample preparation have facilitated high-throughput screens. However, C. elegans is only one of many systems in which innovative technologies for high-throughput studies are being developed. Indeed, the development and use of robotic platforms has also enabled high-throughput phenotypic analysis of plant growth and development at a larger physical scale. Here, we use C. elegans and Arabidopsis as the primary examples of the exciting new wave of approaches to functional genomics [7-10]. We focus on current advances in high-throughput phenotyping (HTP) for the analysis of C. elegans and Arabidopsis, as lessons learned from these organisms can be broadly applied to other animal and plant species.

RNAi and high-throughput phenotyping in C. elegans

Reverse genetic screening has proved a powerful method to identify gene function [11,12]. RNAi is a well-conserved phenomenon observed in many different organisms [13-23]. It was originally discovered in plants and became one of the first genome-wide techniques used to study loss-of-function phenotypes in several model systems and in mammalian cell culture [24-27]. RNAi screens have become invaluable tools in assessing genotype-phenotype relationships [28,29], and several large-scale RNAi libraries have been generated to identify essential genes and those with novel functions [16,30-32]. For example, an RNAi library of 750 ovary-enriched genes was generated to study the function of genes involved in embryogenesis [31]. Genome-wide RNAi screens in Drosophila have been performed using cell culture [12,15]. The genome-wide collection of transgenic constructs that has been prepared for in vivo screening has underpinned a number of studies, including a screen that led to the identification of the sex-peptide receptor of Drosophila [33,34]. Large-scale mutagenesis and phenotyping projects are also under way in mammalian cells, and are likely to yield similarly important results [23,35]. Over the past few years, increasingly sophisticated image-analysis tools have facilitated RNAi screens. Initially, high-throughput RNAi phenotyping focused on endpoint observations, such as worm morphology and viability, and was thus unable to distinguish between the primary and secondary effects of gene silencing. It is now possible to perform rapid and accurate phenotyping of embryonic lethality in different C. elegans developmental stages by analyzing high-throughput image data [36]. The image-analysis system DevStaR uses a hierarchical approach, in which the output of one step is the input for the next, for automatic classification of the developmental stages of worms from a population of mixed stages (including adult, larval and embryonic stages; Figure 1).
The system consists of several layers: identification of an area of interest; segmentation of pixels within this region; a model-based component that breaks the pixel regions into object parts; and finally, categorization of those objects using a machine-learning approach. This multi-layered object-recognition software offers the computational flexibility for generalized object-recognition problems and, therefore, is not limited to high-throughput worm screens [36].
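
The layered idea described above, in which each step consumes the output of the previous one, can be illustrated with a toy sketch. This is not the DevStaR implementation: the function names, the intensity threshold and the area cut-offs are all hypothetical, and a simple area rule stands in for the trained classifier (DevStaR itself uses a support vector machine).

```python
from collections import deque


def segment(image, threshold):
    """Layer 1: mark pixels at or above the threshold as foreground (1)."""
    return [[1 if px >= threshold else 0 for px in row] for row in image]


def connected_components(mask):
    """Layer 2: group foreground pixels into 4-connected objects."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    objects = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or seen[r][c]:
                continue
            queue, pixels = deque([(r, c)]), []
            seen[r][c] = True
            while queue:  # breadth-first flood fill from the seed pixel
                y, x = queue.popleft()
                pixels.append((y, x))
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and mask[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            objects.append(pixels)
    return objects


def classify(pixels, larva_min_area=3, adult_min_area=6):
    """Layer 3: hypothetical area rule standing in for a trained classifier."""
    area = len(pixels)
    if area >= adult_min_area:
        return "adult"
    if area >= larva_min_area:
        return "larva"
    return "embryo"


def stage_counts(image, threshold=128):
    """Chain the layers: segmentation -> objects -> one label per object."""
    counts = {"adult": 0, "larva": 0, "embryo": 0}
    for obj in connected_components(segment(image, threshold)):
        counts[classify(obj)] += 1
    return counts


# Toy "well": one 6-pixel object, one 3-pixel object, two single pixels.
WELL = [
    [255, 255, 255, 255, 255, 255, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [255, 255, 255, 0, 0, 0, 0, 255],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 255, 0, 0, 0, 0, 0],
]

print(stage_counts(WELL))
```

In a real screen the final layer would be trained on annotated images rather than hard-coded, but the chaining of layers would look much the same.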
Figure 1

Simplified illustration of the DevStaR system. The input images are from 96-well plates containing a population of mixed stages of adult, larva and embryo worms. Each pixel within the wells is first grouped together (contrast measure). Pixels are then grouped into connected components based on a threshold value (pairwise symmetry score). Third, for the object categorization, a support vector machine (SVM) learning method assigns a score to each category. Finally, as a result of the segmentation and labeling, DevStaR distinguishes adult (blue), larva (red) and embryo (green) worms.

New computer-aided visualization methods, which automatically distinguish images of worms grown in agar plates, are also available [37]. In addition, automated phenotyping that applies machine-learning methods to images obtained from movie frames can be used to study embryo development [38]. These systems overcome previous bottlenecks in image analysis by scoring image data in a fully automated manner and providing rapid quantitative output that would not be obtainable at high throughput by manual scoring. Because high-throughput phenotyping generates a large volume of data, which need to be standardized, normalized and analyzed, statistical and bioinformatics approaches are also becoming increasingly available.
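
As one illustration of what normalizing such data can involve, the sketch below computes robust z-scores for the wells of a single plate, using the median and the median absolute deviation (MAD) so that a few strong phenotypes do not distort the scale. This is a common screening practice rather than a method taken from the works cited here; the well scores and the cut-off of 3 are made-up example values.

```python
from statistics import median


def robust_z_scores(values):
    """Scale each value by the plate median and MAD.

    The factor 1.4826 makes the MAD comparable to a standard deviation
    for normally distributed data.
    """
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        raise ValueError("MAD is zero; values are too uniform to scale")
    scale = 1.4826 * mad
    return [(v - med) / scale for v in values]


# Hypothetical per-well scores from one plate; the last well is an outlier.
PLATE = [10, 12, 11, 13, 12, 30]
z = robust_z_scores(PLATE)
hits = [i for i, score in enumerate(z) if abs(score) > 3]
print(hits)
```

Scoring each plate against its own median also removes plate-to-plate shifts before results from different plates are compared.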

Automated screening using worm-sorters

Further advances combining RNAi and sample sorters have enabled rapid selection of organisms with phenotypes of interest for a variety of assays, including genetic screens (Figure 2). Small-animal sorters, such as the BIOSORT/COPAS (complex object parametric analysis and sorter) machine, use a flow-through technique and a profiler system that can analyze up to 100 live animals per second and generate fluorescence emission profiles of the C. elegans body. COPAS has recently been used to analyze the expression pattern of 900 predicted C. elegans genes [39]. By analyzing large numbers of animals from a mixed-stage culture, Dupuy and colleagues [39] generated digitized chronograms of the intensity of gene expression throughout post-embryonic development. This machine allows researchers to study gene expression patterns in a large population of adult animals with a quantitative read-out. However, its sensitivity in sorting non-adult animals, such as embryos and larvae, is limited. Therefore, as a complementary approach, fluorescence-activated cell sorting (FACS) can be employed. By using embryo FACS (eFACS), large numbers of living embryos enriched in any desired embryonic stage can now be selected. Given the availability of different fluorescent marker genes, eFACS enables the assay of embryonic stage-specific gene expression in a high-throughput manner. Moreover, the need for a fast and reliable way of identifying phenotypic alterations in larvae, after modulating or eliminating genes, led researchers to develop a method to sort live C. elegans larvae (laFACS) [40]. Modifying a FACS machine enabled the collection of large quantities of live mutant worms from mixed populations, thereby expanding the arsenal of tools for high-throughput 'sample preparation' for genetic screens. Because these flow-cytometry-based systems sort animals only on one-dimensional intensity profiles, microfluidics chips have been developed to obtain single-cell resolution [41,42]. 
Microfluidic chips can be designed to function as small-scale sorters with channels and computer-controlled valves that control the environment surrounding the organism and restrict the worms' movements. This technology, when combined with automated image processing, allows high-throughput, non-biased phenotyping, imaging and screening of multicellular organisms [43].
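
To make concrete what sorting on a one-dimensional intensity profile means, the sketch below treats each animal as a list of fluorescence readings taken along its body axis and gates on two summary features. The feature names, thresholds and profiles are hypothetical and do not reflect the actual COPAS software or its file formats.

```python
def profile_features(profile):
    """Summarize a non-empty 1D fluorescence profile along the body axis."""
    length = len(profile)
    peak = max(profile)
    # Relative position of the first peak reading: 0.0 = one end, 1.0 = the other.
    peak_position = profile.index(peak) / (length - 1) if length > 1 else 0.0
    return {
        "length": length,
        "peak": peak,
        "peak_position": peak_position,
        "total": sum(profile),
    }


def gate(profiles, min_length, min_peak):
    """Return indices of animals that are long enough and bright enough."""
    selected = []
    for index, profile in enumerate(profiles):
        features = profile_features(profile)
        if features["length"] >= min_length and features["peak"] >= min_peak:
            selected.append(index)
    return selected


# Hypothetical profiles: two bright animals, one short, one long but dim.
PROFILES = [
    [1, 2, 9, 2, 1],
    [1, 1, 1],
    [2, 3, 2, 8, 9, 3, 2, 1],
    [1, 2, 2, 2, 1, 1],
]

print(gate(PROFILES, min_length=5, min_peak=5))
```

A profile of this kind records where along the axis the signal sits, but not which cell produced it, which is the limitation that imaging-based microfluidic approaches aim to address.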
Figure 2

Outline of general strategies of phenotyping in C. elegans. The sorting techniques COPAS and laFACS can be used to sort live worms. FACS is used to rapidly sort and collect large quantities of live larvae from a mixed population. After laFACS, pure GFP or mutant worms can be used for either genetic or chemical screens, microarray or biochemical assays.

The resolution at which biological samples can be analyzed has greatly increased in recent years as fluorescence microscopy strategies have been developed to characterize gene expression at the single-cell level in C. elegans [1,44]. Methods to quantitatively measure gene-expression dynamics with cellular resolution are anticipated, and will be advantageous to functional genomic studies. However, the challenge of capturing high-resolution images that represent the entire sample remains formidable. Extensive high-throughput time-lapse fluorescent microscopy will only become a reality with improvements to the automation of microscopy imaging and the processing of large datasets.

High-throughput phenotyping for plant biotechnology

The identification of genes that underlie phenotypic variation for complex agronomic traits such as biomass and drought tolerance will be key to biotechnology-aided crop improvement. Because such traits are often controlled by many genes that are also heavily influenced by the environment, the discovery of their genetic basis often requires large-scale phenotyping strategies. Mutational methods such as chemical or fast neutron mutagenesis can be used in forward genetic screens, whereas insertional mutagenesis via T-DNA lines or transposons is used to generate libraries of loss-of-function mutants for reverse genetic screens. Arabidopsis has led the way in plant phenotypic profiling because insertional mutations of most genes are available [45-51]. Rice, as a leading experimental model for monocotyledonous crops, also has a panel of insertional mutant lines [52]. Insertional mutagenesis has also been applied to other crops, including maize and Medicago truncatula [53,54]. However, advances in phenomics will be essential to fully realize the potential of these powerful genetic resources. The investigation of complex traits such as root morphology, leaf size, plant height, flower shape or seed weight requires analyzing hundreds to thousands of plants, which poses a major challenge. Furthermore, gene response as a function of the environment must be accounted for. For this reason, tools specific for digital phenotyping together with automation of this process in controlled environments are necessary for high-throughput screening of plant phenotypes. Digital phenotyping offers the major advantage that data can be reanalyzed when new traits of interest or new types of measurements emerge. As the demand for digital image-acquisition technologies increases, several efforts have been made to generate software tools capable of producing objective and quantitative analyses of large image sets. 
Automated platforms have been developed for Arabidopsis and for crop plants to allow different aspects of automated visualization and image quantification. For example, the PHENOPSIS platform was used to dissect plant responses to soil water deficit in a collection of natural accessions of Arabidopsis [55]. The PHENODYN platform imposes drought scenarios and has been used to image maize and rice plants [56]. In addition, several efforts to improve aspects of automated visualization and image quantification for high-throughput phenotype scoring (for example, seed germination, hypocotyl growth, leaf-area development and root growth dynamics) have been made for Arabidopsis. Specifically, the high-throughput seed-germination analysis platform GERMINATOR was used to screen for natural variation in a population of 165 recombinant inbred lines, which revealed several quantitative trait loci (QTLs) for salt tolerance [57]. High-resolution measurements of hypocotyl growth and shape have been obtained by automated quantification of time series of electronic images using HYPOTrace [58]. Other examples of fully or partially automated imaging platforms for non-destructive image-based phenotyping are LeafAnalyser, LAMINA and GROWSCREEN 3D [59-61]. These computer-based tools provide quantitative descriptors for leaf shape and size. A shortcoming of most of these tools is that they are designed to address very specific questions. Moreover, most traditional phenotype-scoring systems are based on endpoint analysis, and therefore do not easily capture the dynamic aspects of complex traits. Recent approaches to capture these aspects have incorporated time-course data acquisition so that transient events and subtle temporal changes can be observed. However, the challenge of observing dynamic growth processes and responses to environmental stimuli, through the combination of automated time-lapse imaging with automated image analysis, remains [62].
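
The difference between endpoint and time-course scoring can be shown with a small worked example. The sketch below computes the relative growth rate, RGR = (ln A2 - ln A1) / (t2 - t1), between consecutive images; the two leaf-area series are invented for illustration and do not come from any of the platforms named above.

```python
import math


def relative_growth_rates(times, areas):
    """Return the relative growth rate for each consecutive pair of time points.

    RGR = (ln A2 - ln A1) / (t2 - t1), in units of 1/time.
    """
    if len(times) != len(areas):
        raise ValueError("times and areas must have the same length")
    points = list(zip(times, areas))
    return [
        (math.log(a1) - math.log(a0)) / (t1 - t0)
        for (t0, a0), (t1, a1) in zip(points, points[1:])
    ]


# Two hypothetical plants imaged at 0, 24 and 48 hours (leaf area in cm^2).
HOURS = [0, 24, 48]
STEADY = [1.0, 2.0, 4.0]       # doubles every 24 hours
EARLY_BURST = [1.0, 3.5, 4.0]  # grows fast, then stalls

steady_rates = relative_growth_rates(HOURS, STEADY)
burst_rates = relative_growth_rates(HOURS, EARLY_BURST)
print(steady_rates)
print(burst_rates)
```

Both plants reach the same final area, so an endpoint measurement would score them as identical, whereas the per-interval rates separate steady growth from an early burst followed by a stall.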
Many image-analysis-based software tools have focused on quantifying root growth rates and root structure. Advances in machine vision and computation of automatic trait evaluation have facilitated digital reconstruction of root systems and have potentially increased the levels of throughput for phenotyping in plants. Examples of software that allow higher-throughput phenotyping are RootTrace [63], KineRoot [64], SmartRoot [65], RootLM [66], Phytomorph [67,68], RootFlow [69] and WinRhizo [70]. Many high-throughput methods have been developed for Arabidopsis, aided by its small size. For crop plants, an automatic imaging system has been applied to monitoring rice growth [71]. Moreover, a foundation for high-throughput automatic phenotyping for QTL analysis of root system architecture (RSA) traits of crop plants has been laid recently. To capture the root-system topologies of diverse rice cultivars, inbred lines were grown in a transparent gel substrate and imaged at high resolution. The resulting images were combined in an analysis pipeline that automatically extracted RSA measurements. Using a machine-learning approach, these measurements were able to distinguish between closely related genotypes [72] (Figure 3). Alternative methods exist for the non-destructive capture of images of crop root systems grown in solid substrates, such as X-ray tomography and positron emission tomography (PET), but these are limited by throughput, resolution or cost [73,74].
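
To give a sense of how measurements can be extracted automatically from a root image, the sketch below derives a few simple descriptors from a binary mask in which 1 marks a root pixel. The descriptors are simplified stand-ins for the kinds of features listed in the Figure 3 caption (such as depth and perimeter); the function name, the definitions and the toy mask are assumptions for illustration, not the published pipeline [72].

```python
def root_features(mask):
    """Compute simple descriptors from a binary root mask (1 = root pixel)."""
    rows, cols = len(mask), len(mask[0])
    pixels = [(r, c) for r in range(rows) for c in range(cols) if mask[r][c]]
    if not pixels:
        raise ValueError("mask contains no root pixels")

    def is_background(r, c):
        # Anything outside the image counts as background.
        return not (0 <= r < rows and 0 <= c < cols) or not mask[r][c]

    # A root pixel lies on the boundary if any 4-neighbor is background.
    boundary = sum(
        1
        for r, c in pixels
        if any(
            is_background(r + dr, c + dc)
            for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1))
        )
    )
    ys = [r for r, _ in pixels]
    xs = [c for _, c in pixels]
    return {
        "area": len(pixels),              # number of root pixels
        "depth": max(ys) - min(ys) + 1,   # vertical extent in pixels
        "width": max(xs) - min(xs) + 1,   # horizontal extent in pixels
        "boundary_pixels": boundary,      # crude perimeter estimate
    }


# Toy mask: a primary root with lateral branches.
MASK = [
    [0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0],
    [0, 1, 1, 1, 0],
    [1, 0, 1, 0, 1],
    [0, 0, 1, 0, 0],
]

print(root_features(MASK))
```

Feature vectors of this kind, computed for every image of every plant, are what a downstream machine-learning step would use to compare genotypes.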
Figure 3

The general strategies of phenotyping in plants. Illustration of the root-imaging platform. (1) Rice plants are grown in cylinders in gel-based media (sample preparation). (2) The cylinders are placed in a box containing water on the imaging turntable with backlighting. Computers control cameras attached to a four-post support system, which permits adjustments vertically and horizontally. Images are acquired through 360° (image-acquisition platform and data handling). (3) Cropped images from multiple angles are used for analysis (data processing). (4) Feature maps of root architecture record values for a variety of root features, such as perimeter, depth, bushiness and volume, for each image (image/data analysis).

Low-cost packages for high-throughput phenotyping allow the handling of large-scale experiments, and downstream software pipelines offer flexibility for analysis of numerous lines and treatments. The improved efficiency and absence of subjectivity are great advantages of computer-aided assessment. In the past few years, the generation of phenotypic databases for large numbers of mutants has become a collaborative effort. For example, large-scale phenotypic analysis has been reported in rice using several mutant resources and several phenotype databases are now available (Table 1) [75-79]. Web-accessible collections of visible phenotypes observed for other crop plants, such as barley, maize, tomato and soybean, are also available (Table 1) [80-82].
Table 1

Phenotypic databases for crop plants

| Plant species | Database name | Website | Reference |
|---|---|---|---|
| Rice | Oryza Tag Line (OTL) | http://urgi.versailles.inra.fr/OryzaTagLine | [76] |
| Rice | Rice Mutant Database (RMD) | http://rmd.ncpgr.cn | [78] |
| Rice | Tos17 | http://pc7080.abr.affrc.go.jp/phenotype | [79] |
| Rice | OryGenesDB | http://orygenesdb.cirad.fr/index.html | [77] |
| Barley | SCRI Barley Mutants | http://bioinf.scri.ac.uk/barley/ | [80] |
| Maize | maizeGDB | http://www.maizegdb.org/rescuemu-phenotype.php | [81] |
| Tomato | Tomato Mutant Database | http://zamir.sgn.cornell.edu/mutants/ | [82] |
| Tomato | LycoTILL | http://www.agrobios.it/tilling/index.html | [85] |
| Soybean | Soybean Mutation Database | http://www.soybeantilling.org/psearch.jsp | [86] |

Conclusions

Mutational analysis remains the gold standard for identifying and characterizing gene function and this is being facilitated by high-throughput phenotyping. Given the demand for high-throughput phenotypic analysis in many organisms, we can expect the further development of large-scale phenotyping to unravel complex genotype-phenotype relationships. As an example, automated microscopy provides the opportunity to collect vast amounts of data that need to be standardized, normalized and analyzed. This increases the need for community access to store and search these large datasets. It would be of great benefit if large-scale phenotypic data could be easily compared and shared between labs. However, current limitations to the reuse and sharing of such data include the lack of standardized vocabulary terms, experimental parameters and quantitative benchmarks. Therefore, there is a pressing need for clearly defined standards and terms agreed upon by a given community. To achieve this goal, databases that contain phenotypic information and, especially, integration of phenomic and other genome-wide data are required. Multi-organism phenotype-genotype databases that facilitate cross-species identification of genes associated with orthologous phenotypes are now becoming available (for example, PhenomicDB) [83,84]. In the next few years, the ability to harvest the full benefit of such large datasets can only be obtained by combining the genomic, epigenomic, transcriptomic, proteomic, metabolomic and phenomic data into shared databases. This resource will be invaluable for the investigation and eventual elucidation of molecular mechanisms regulating the biology of multicellular organisms, and will form a comprehensive description of the whole organism, opening new paths into systems biology.