Literature DB >> 28217003

Reverse Genetics and High Throughput Sequencing Methodologies for Plant Functional Genomics.

Anis Ben-Amar1, Samia Daldoul2, Götz M Reustle3, Gabriele Krczal3, Ahmed Mliki2.   

Abstract

In the post-genomic era, increasingly sophisticated genetic tools are being developed with the long-term goal of understanding how the coordinated activity of genes gives rise to a complex organism. With the advent of the next generation sequencing associated with effective computational approaches, wide variety of plant species have been fully sequenced giving a wealth of data sequence information on structure and organization of plant genomes. Since thousands of gene sequences are already known, recently developed functional genomics approaches provide powerful tools to analyze plant gene functions through various gene manipulation technologies. Integration of different omics platforms along with gene annotation and computational analysis may elucidate a complete view in a system biology level. Extensive investigations on reverse genetics methodologies were deployed for assigning biological function to a specific gene or gene product. We provide here an updated overview of these high throughout strategies highlighting recent advances in the knowledge of functional genomics in plants.

Entities:  

Keywords:  Functional genomics; Large-scale sequencing; Omics platforms; Reverse genetics

Year:  2016        PMID: 28217003      PMCID: PMC5282599          DOI: 10.2174/1389202917666160520102827

Source DB:  PubMed          Journal:  Curr Genomics        ISSN: 1389-2029            Impact factor:   2.236


INTRODUCTION

In recent decades, technological advances in gene sequencing and computational tools have generated massive genomic resources, which made possible the annotation and the identification of thousand genes. Although nucleotide sequences of several plant genes were released, the function of their encoding proteins and their involvement in specific networks remain unclear. In the post-genomic era, the current challenge of plant research is focusing increasingly on gene function analysis. To raise this challenge, various reverse genetics approaches including gene silencing strategies, transgene-induced ectopic expression, gene targeting, T-DNA/transposons insertional mutagenesis and target-induced local mutations are available to investigate the biological function of such genes [1-5]. In many cases these approaches have been applied to plant model systems as well as to agronomically important species to gain powerful insights into gene function [6-9]. Conversely to the static aspect of the genomic information (DNA sequence), functional genomics based on the dynamic of biological system, aims to assign a function (s) to genes and determines how genes and their products (RNA, proteins, metabolites) interact together [10]. Gain and loss of function approaches represent the most adapted tools to investigate gene function. Since generation of transgenic plants is not always recommended due to laborious work, time-consuming procedures and variable transformation efficiency among species, genome studies has driven an increasing interest in rapid assay systems as alternatives to stable transgenics for establishing gene function [10-12]. Here we will overview the main reverse genetics systems and we will further discuss the challenges associated with characterizing gene function in the post-genome era.

Advances in sequencing platforms

Next Generation Sequencing

The most obvious technological advance in recent years is undoubtedly the development of the Next Generation Sequencing (NGS) that had significantly reduced the cost of whole genome and transcriptome sequencing [13]. NGS is producing a vast array of genomic information and has led to the identification and annotation of high number of plant genes. However, determining the function of these genes and the networks they are involved in remains a major challenge for modern plant research. Many sequencing platforms were developed [14-16] opening new avenues of research in plant improvement [17]. Following, most relevant advances of high throughput sequencing strategies for gene discovery are presented including (i) the development of sequencing platforms and (ii) their wide range of applications in combination with network integration. This review provides an updated view of how advanced omics-based high throughput methodologies can effectively enhance the discovery of new candidate genes for crop improvement. Publication of the first complete genome sequence of Arabidopsis thaliana in 2000 [18] and first monocotyledonous plant Oryza sativa in 2005 [19] revolutionized research in plant genomics. These genomes used the traditional Sanger sequencing based on clone-by-clone strategy which involved sequencing of overlapped bacterial artificial chromosomes (BAC) clones selected from a physical map. However, Sanger sequencing technology remains costly and low throughput. The demand for rapid and cost-effective sequencing technologies led to the development of several alternative approaches in the use of genomic template libraries, number of reads, read length and genome coverage. Thus the whole genome shotgun (WGS) approach was introduced to produce the draft sequences of many plants like Sorghum bicolor [20] or Vitis vinifera [21]. Beginning in 2005, the traditional Sanger-based approach to DNA sequencing has experienced revolutionary changes and new sequencing platforms have emerged, having the high-throughput and cost-efficient capabilities, called ‘Next Generation Sequencing (NGS)’ [22, 23]. A variety of NGS technologies include the 454 FLX (Roche) [14], the Genome Analyzer/Hiseq (Illumina Solexa) [24] and the SOLiD (Life Technologies), as well as newer platforms such as Heliscope (Helicos) [25], PacBio RS (Pacific Biosciences) [26] for single molecular sequencing, and Ion Torrent (Life Technologies), based on a semi-conductor chip [27], were also available (Table ). NGS is rapidly becoming an established tool in translational research as well. Several plant genomes have recently been sequenced using NGS technology (Table ), as a result, the number of sequenced genomes increased exponentially. More than 100 plant genomes were sequenced [see for review 28, 29] (List of sequenced plant genomes; URL: https://genomevolution.org/wiki/index.php/Sequenced-plant-genomes). Newly sequenced plant systems can be interpreted by a homology search against other plant model systems. Based on the detection of orthologeous genes in other organisms, many genes can be characterized only with reference to their homology. The high amount of data generated displays the enormous challenge for bioinformatics in genome sequences analysis [30]. An increasing number of alignment and de novo assembler tools have been developed specifically for rapid alignment of large sets of reads [31, 32]. The identification of genes affecting agronomically relevant traits and supports molecular breeding (Table ) from economically important plants would be an important application. However, NGS has some limit especially in their short read lengths (~60 and 400bp for Illumina ABI-SOLiD and 454/Roche respectively) comparing to the fist Sanger sequencing (~1Kb)(454. Roche 454 GSFLX.http:// www.454.com/; SOLiD.http://www3.appliedbiosystems.com/AB_Home/applicationstechnologies/ SOLiDSystemSequencing/index.htm.; Illumina.http://www.illumina.com/;Helicos. http://www.helicos bio.com/, leading to sequence errors and difficulty for sequence assemblies [33]. Therefore, the development of the third generation sequencing with long read length and fast sequencing enhanced the genome assemblies [29].

Next Generation-based RNA-seq Technologies

Global analyses have become possible with the development of high throughput genomic technologies which facilitated the identification of putative gene function. In this context, methods have been developed for quantitative data acquisition such as suppressive substractive hybridization libraries and microarrays [58] in case of grapevines. However, the recent advent of high-throughput-based sequencing technologies became revolutionizing the analysis of transcriptomes [59]. In fact, RNA sequencing (RNA-Seq) involves direct sequencing of complementary DNAs (cDNAs) and followed by the mapping of the sequencing reads to the genome. It allows for the precise quantification of exon expression, generating absolute rather than relative gene expression measurements, providing greater insight and accuracy than microarrays [60, 61] since it can detect and measure rare transcripts even those with abundances of 1 to 10 RNA molecules per cell [61]. NGS technologies enable the completion and the characterization of the entire genome of several agronomically important plant species including grapevine [21] where Illumina RNA-seq method have been successfully applied to analyze the total transcriptome during berry development [62]. This transcriptomic approach will open up large application perspectives in terms of plant improvement and development of plants with a better tolerance to environmental stresses. Therefore, understanding the function of genes remains a major challenge of the post-genomic research. It should be noted that a successful NGS project requires expertise in lab work as well as in bioinformatics in order to warrant high quality data and data interpretation.

Genome annotation, Omics and System biology

Network Integration and Gene Function

It is known that metabolism is highly dynamic and cannot be directly derived from the genotype. Consequently, this static genomic information needs to be complemented with genome-wide molecular data to reveal the dynamic genotype–phenotype relationship. Assigning a function requires a phenotype and this is addressed by the phenomics platform [63]. Also, NGS can provide information about the methylation state of the entire genome (methylome), thereby enabling us to determine the epigenetic control of different genes within the genome [64]. Data integration from multiple Omics platforms is of considerable interest to provide a complete view of the biological system as illustrated in (Fig. ). Transcriptomics, proteomics and metabolomics data should be used to predict and understand the function of each gene and also to study the gene network as a whole system [65-67]. Hence, the databases of the metabolic reconstruction of plant species are continuously growing and will provide the basis for comparative plant genomics (http://www.plantcyc.org/). Despite major progress in the development of proteomics and metabolomics tools, these sub-platforms are still costly compared to NGS [64]. The integration of protein and metabolite profiling significantly improves the identification of plant biomarkers under different environmental conditions [67, 68]. For instance, such integrated analysis was applied to wheat and rice in response to anoxia [69] and to A.thaliana for characterization of starch and raffinose metabolisms under different temperatures [70]. There are also a number of reports on the elucidation of gene function by combining the transcriptomics/proteomics approach to study grapevine stress tolerance [67], nitrogen assimilation in maize [71], growth to dormancy transition in white spruce stems [72], phytohormone crosstalk [73] and flour quality in wheat [74]. In the same context, integrated transcriptome-metabolome analysis was lately applied in rice grown under high-temperature conditions [75], and changing metabolic systems in rice mutants [76] and transgenic barley plants [77] growing in field conditions. Furthermore, since quantitative traits are regulated by differential expression of candidate genes, the combined approach of QTL mapping and transcriptome profiling was used in tomato [78], rice [79] and wheat [80]. The exploitation of these Omics tools in plant biotechnology and QTL-based marker-assisted breeding approaches is an obvious development. Network integration has given rise to bioinformatics challenges of modeling the complexity and behavior of biological systems as a ‘whole’ computational means [65, 67, 81]. Recently, the development of bioinformatics tools that will integrate data from multiple Omics platforms will give not only a complete view of the cellular systems and networks [64], but also can predict the system behavior under new unexplored perturbations [82]. In this context, a network for heat transcriptome of three model plants (Arabidopsis, Populus and soybean) has been characterized [83] and a conceptual model that links traditional network analysis to whole plant physiology was later proposed [84]. These approaches have opened up many application perspectives in terms of plant improvement in non-favorable environmental conditions.

Plant Genome Annotations and Tools

NGS technology is revolutionizing genomics and many plant genomes are already available. The increasing in demands for large-scale genome annotation has resulted in placing several genomics databases for annotation. Currently these genomes are being collected and annotated by various resources and databases including Phytozome [85], Plant Metabolic Networks (PMN) [86], Plants [87], Plaza [88], and MetNet Online [89]. The newly web-based system for integrated plant genome annotation MEGANTE (https://megante.dna.affrc.go.jp/.) recently published [90] could target 24 genome plant species from the Brassicaceae, Fabaceae, Musaceae, Poaceae, Salicaceae, Solanaceae, Rosaceae and Vitaceae families. Annotation of plant genes with their correct functions is of considerable interest and can be limited by the “missing annotations,” in which a function is known to exist, but the corresponding gene remains unidentified. PlantSEED (http://plantseed.theseed.org/) appeared as a new resource to support functional identification, annotation, comparative analysis, and modeling of plant genomes allowing downstream data analyses [91].

Forward versus Reverse genetics

Progress in plant genomics have been accelerating when the genome of Arabidopsis thaliana was published in late 2000 [18]. Since then many genome sequencing projects have been undertaken including economically and biologically important species for biological research [28, 29, 92]. After a genome is sequenced and annotated, functional characterization of genes that are important for development, cellular processes, or stress response becomes a major effort in the scientific community [93]. In this way, bioinformatics tries to identify candidate genes with the aim of better understanding the genetic basis of disease, unique adaptations to pathogen and severe conditions, desirable properties in agricultural species, or differences between populations. Classically, gene function is determined using two opposite but complementary approaches: forward and reverse genetics. Forward genetic screens (mutant phenotype -► gene) aim to determine the genetic basis responsible for a phenotype through screening of mutant phenotypes. Typically, there is a wide use of chemical mutagenesis for forward-genetic screens in many organisms including bacteria, C.elegans, Drosophila and plants [94]. The basis is to treat organism with a mutagen, then screen offspring for particular phenotypes of interest and seek to identify genes involved in a biological pathway or process. Using large-scale genetic screens, several genes involved plant stress or other developmental or biochemical processes have been identified [95-98]. Analysis of segregating populations and subsequent locus mapping within the genome reveals genes that are associated with the observed biological process [99]. The goal is to find all of the genes involved in a trait; this approach is known as a “genetic screen”. With the advent of whole genome sequencing, researchers have now access to all of the gene sequence information within a given organism to investigate their function. Instead of going from phenotype to sequence as in forward genetics, reverse genetics works in the opposite direction – from known sequence towards assigned gene function [1, 10]. In reverse genetics (gene -► mutant phenotype), the target gene was subjected to gene manipulation, modification or disruption of its activity followed by analyzing phenotypic effects of up- and down- regulations (Fig. ).

Reverse genetics: Engineering loss and gain of function

Several plant genomes have been already sequenced [29] and in the few next years, we will be in front of many others fully-sequenced genomes (which are now in progress). The functions of a small number of genes were assigned, but those of many of them remain unclear. The current and future challenge is to investigate the biological function of identified and annotated gene sequences. Reverse genetics attempts to connect a given genetic sequence with specific effects on the organism and thus appears as an essential set of approaches for functional studies. A number of techniques have been developed, over the last 15 years, to enable researchers to identify plants with mutations in genes of known sequence. These reverse genetic approaches include silencing strategies (RNA interference: RNAi and virus-induced gene silencing: VIGS), screening of populations mutagenised by insertion (Insertional mutagenesis), deletion (Knock-out) and point mutation (TILLING: Target Induced Local Lesions in Genome), as well as gene targeting and ectopic expression using transgenics. Such tools are listed in (Table ), each with its own strengths and weaknesses, try to explore gene function by analyzing the phenotypic effects of specific engineered gene sequences and seek to find what phenotypes arise as a result of particular genetic sequences. Classically, genes are transcribed into mRNA and translated into proteins determining the phenotypic traits. Each disruption in the genome (DNA sequence) can affect indirectly physical traits via the alteration of protein synthesis or activity. With the advent of recombinant DNA technology, gene engineering by gain- or loss-of-function approaches that mutate or knock out the gene function will switch on or off protein synthesis/activity resulting in new phenotypes. For example, changes in regulatory gene sequences or site-directed mutations targeting the open reading frame of gene-coding protein could identify amino residues for protein function. Reverse genetics experiments usually start with a cDNA, often corresponding to a transcript with an interesting pattern of expression, and then attempt to ascribe a biological function associated with a phenotypic trait to the target gene. The function is generally investigated by using the cDNA sequence to create a mutation of the wild-type allele in a transgenic plant (Ectopic expression using transgenics strategy) [4]. This mutation may be also inserted in a specific site or induced lesion in the genome (Insertional mutagenesis and TILLING). The phenotypic alteration will be further investigated. The target gene can be down-regulated by the expression of an antisense mRNA (Gene silencing) which interferes with the gene transcript causing RNA degradation or inhibiting translation. Similar approach was also used to deactivate transcripts of unknown genes and then check the phenotypic changes [100, 101]. Another way to elucidate the gene function is to exchange wild-type allele with an inactive mutant allele by homologous recombination (Gene targeting). These methods provide a way for linking genotype to phenotype to analyze the function of unknown genes for both fundamental and practically oriented studies.

RNA Silencing Approaches for Functional Gene Analysis in Plants

RNA-mediated Interference

RNA interference (RNAi) is induced by double-stranded RNAs (dsRNAs) leading to specific post-transcriptional gene silencing (PTGS) mechanism. This may be caused commonly by the expression of sequence homologous to an endogenous gene or by the expression of transgene that include a segment of gene sequence in an inverted repeat orientation that will generate dsRNAs. The amplification process of RNAi involves RNA-dependent RNA polymerases (RdRPs) that are required for the RNA silencing pathways [110, 111]. Besides its antiviral functions as part of the host defense response evolved to control plant virus replication, RNAi process fulfils fundamental regulatory roles through the activities of microRNAs and small interfering RNAs [112]. Previous studies on gene silencing have revealed two RNA-mediated epigenetic processes, RNA-directed RNA degradation and RNA-directed DNA methylation, providing new avenues for gene suppression technology in plants [113]. There is increasing evidence that components of the RNAi machinery are associated also with the formation of heterochromatin and that RNA-mediated chromatin modifications play an important role in epigenetic transcriptional gene silencing [114]. Since RNAi can specifically suppress the function of the targeted gene, the technique has been very useful in functional genomics studies [102]. RNAi have been widely described in plants and this is currently one of the hottest areas of biological research in the new era of functional genomics since interfering RNA-mediated silencing has revolutionized the understanding of gene function for basic and applied studies. Transgene-induced RNAi has been effective at silencing one or more genes in a wide range of plants across the plant kingdom. The principle is simple: a fragment of a gene is introduced into a cell as dsRNA or as DNA that will give rise to dsRNA that would trigger silencing of endogenous genes in a homology dependent fashion. The dsRNA activates the DICER/RISC process so that the properties of the affected cell reflect a loss of function in the corresponding gene [115]. In this way, several functional genomics projects attempted to generate lines that are deficient for the activity of a subset of genes, and check their phenotypes to characterize the function of the knocked down gene [101]. A particularly useful property of RNA silencing is that it does not require neither full sequence of the target gene, nor complete sequence identity in the dsRNA and the target RNA. Since RNAi is highly sequence-specific, it is possible to knock down simultaneously multiple closely related genes by targeting their conserved sequences [102, 116]. For that reason, transgene-induced RNA silencing was implemented since years as a versatile reverse genetics tool and is well suited to the systematic analysis of gene function. RNAi technology has recently become a highly effective and powerful tool of functional genomics to explore plant genome [100] and has been applied to elucidate gene function in various crop species like tomato [11], potato [117], maize [116], wheat [118, 119] and rice [120]. Gene transfer-based techniques have also been used to induce RNAi in plants by delivering dsRNA or DNA constructs that encode harpin-RNA into leaf epidermal cells of many crop plants [100]. Fast Agrobacterium-mediated transient gene expression/silencing methods have been applied for commercial plant species for successful transgene-induced PTGS via fruit agro-injection in tomato [11], agro-infiltration in potato [117], or through in planta agro-infiltration in grapevine [12]. Despite of its instability in plants, RNAi-mediated gene suppression approach open new avenues in the development of eco-friendly biotech approaches for crop improvement by knocking-out the specific genes for better stress tolerance and integrating novel traits in various plant species including insect/pest/pathogen resistance [121].

Virus-induced Gene Silencing Strategy

Another alternative to knock-down endogenous plant genes may be achieved via viral vectors through RNA-mediated post-transcriptional gene silencing mechanism. This powerful reverse genetics approach, known as Virus-induced gene silencing (VIGS), is a virus vector technology that exploits RNA-mediated anti-viral defense mechanism [122-124]. Production of dsRNA activates the RNA silencing pathway, resulting in down-regulation of the host gene transcript. VIGS offers an easy way to test the function of several genes in a short time, as it only requires a fragment (typically 300-800 bp) of the target plant gene inserted into a suitable viral vector to form a recombinant virus. Upon infection of a plant host usually via Agrobacterium tumefaciens, this recombinant viral vector induces PTGS targeting both the virus RNA and homologous endogenous plant RNA sequences for degradation [125-128]. Besides the lack of suppressors of gene silencing needed in VIGS strategy, successful RNA virus-based VIGS requires simultaneous infiltration of both viral clones including genomic RNA1 and RNA2. Nowadays, there are infectious clones of several plant viruses that have been used as VIGS vectors, most of which have RNA genomes [100]. Other source of VIGS vectors used recently for silencing is the small subviral RNA satellite along with helper virus-dependent replication [129]. While several plant viruses have been so far developed into VIGS vectors, the Tobacco Rattle Virus (TRV) established by David Baulcombe’s group [130] provides the most robust results in terms of efficiency, ease of application, and absence of disease symptoms [128]. VIGS has been broadly regarded as the tool of choice for transient induction of silencing that occurs for only few weeks with further decrease resulting in plant recovery [131]. However, recent evidences suggest that using some vectors under specific conditions, VIGS can persist for years or even transmitted to progeny [132, 133] behaving like stable transgenic plants. Besides many advantages of VIGS over other approaches including methodological simplicity, switching off genes specifically, rapid monitoring, robustness and speedy results, the system allows the study of genes whose functions are essential to plant viability [103, 127]. VIGS has been widely used in model plants as tool to assess function of candidate genes and to discover new genes required for diverse pathways. VIGS technology has been successfully used to validate and functionally analyze the contribution of candidate genes in many plant species [8, 134-137]. For instance, several stress-responsive-genes were studied using VIGS system in various model plants including tobacco [133], chili pepper [138], soybean [139], rose [140] and wheat [136, 141]. Such strategy can be also very helpful to assess gene function, especially in species recalcitrant to transformation making the VIGS an attractive alternative instrument for high-throughput functional genomics [103, 104].

Transgene-induced Ectopic Expression/Silencing

The development of powerful “omics” technologies has enabled researchers to identify many genes of interest for which comprehensive functional analyses are highly desirable. Large number of plant genes continues to be identified every day with no defined functions. Although powerful in silico techniques can predict the function of many gene-encoding proteins, evidence of gene function have to be constantly verified in vivo using transgenic approach allowing analysis of phenotypic changes occurred by transgene integration and expression. The production of transgenic lines which ectopically express recombinant genes or those in which endogenous genes are knocked down remains a major bottleneck. Gene transfer technology appears to be an essential tool implemented for functional studies through gain- or loss-of-function approaches. Overexpression of a wild-type gene product, however, can cause mutant phenotypes, providing geneticists with an alternative yet powerful tool to identify pathway components that might remain undetected using loss-of-function screens [142]. Gain-of-function is achieved by increasing gene expression level through the activation of endogenous genes by transcriptional enhancers or through ectopic expression of individual transgene via transformation [4]. Heterologous expression has also been exploited to study gene functions across species barriers and transcript abundance is increased by cloning the full open reading frame (ORF) downstream of a strong constitutive promoter such as 35S promoter of cauliflower mosaic virus (CaMV35S), or using chemical- or stress-inducible promoters and transcription factors to control ectopic expression level [10, 143, 144]. Several approaches were used for gene transfer to plants, depending on the fact of how the gene construct is expressed in a stable or in a transient manner. Over the last decades, hundreds of model and crop plant species have been genetically transformed essentially via Agrobacterium- or viral-mediated gene transfer technologies [145] as well as by direct gene delivery using bombardment of DNA-coated particles [146-148] or protoplast electroporation [149]. Methods for effective DNA transfer into regenerable competent cells were extensively developed and optimized for stable transformation and generation of transgenic plants. However, transgenic approaches for reverse genetic studies are not yet practical in several plants in which transformation methodology is so far not efficient or not available. A heterologous expression approach provides a solution for the high-throughput characterization of gene functions in these plant species [4]. To overcome these in vitro regeneration-dependent procedures which are laborious and time-consuming, rapid and versatile transient assays were widely investigated as an alternative to the transgenic approach. Agroinoculation became a reliable method for plant functional gene analysis after being firstly developed as a simple procedure for plant systemic infection with viruses by delivery of viral genomes into plant tissues using Agrobacterium binary vectors [150, 151]. A variety of agroinoculation methods were developed during the last decade in order to examine the effect of transient gene expression, with applications ranging from monitoring of plant-pathogen interactions [152], subcellular localization [153], ectopic up-/down-regulation [12], in vivo analysis of plant promoters and transcription factors [154, 155] to the production of recombinant proteins [156, 157]. Efforts of several laboratories have resulted in the development of various methods for Agrobacterium tumefaciens–mediated gene transfer. Among these, vacuum infiltration [158, 159] and floral dipping [160, 161] are efficient methods widely used for generating transgenic Arabidopsis plants. Similarly, Agrodrench was established as an effective agroinoculation method where soil adjacent to the plant root is drenched with an Agrobacterium suspension carrying virus-derived VIGS vectors [131]. Agrobacterium infiltration into leaf tissues remains a method of choice for transiently expressing foreign genes in many plant species. Localized introduction and expression of T-DNA constructs have been largely implemented over the last years, even in recalcitrant plants, for assigning gene function directly in other target tissues and organs by fruit agroinjection [11] or agroinoculation of floral tissues [162]. Currently, in planta agroinfiltration-based procedures were substantially improved for either inducing transgene expression or gene silencing in rice [163] and grapevine [12]. Infiltrating leaves (using a needle-less syringe or vacuum) with Agrobacterium culture, in which the T-DNA plasmid contains a transgene that encodes an endogenous plant gene sequence, can trigger RNAi against the target endogenous gene and generates dsRNA. For instance, agroinfiltration of transgenic tobacco and grapevine plants expressing GFP resulted in GFP overexpression during the first days after infiltration, while this expression subsides to undetectable levels few days post-infiltration, concomitant with a reduction of endogenous GFP expression and induction of a silencing mechanism, firstly localized and subsequently spreads throughout the plant [12, 100]. In some cases where agroinfiltration is not appropriate, direct DNA uptake is useful for both stable transformation and transient gene expression. For instance, biolistic - as an alternative physical system that avoids the need for a pathogen-host interaction - could be easily performed for transient assays and have long been used for overexpression of foreign genes [146-148] as well as for silencing studies [162]. This particle delivery system allows use of simple multicopy vectors lacking T-DNA or virus sequences and even a minimal gene expression cassette carrying only the gene of interest with its regulatory sequences needed for its expression in plant cells, without the plasmid backbone [146, 161]. The advantages of gain-of-function approaches for the functional gene characterization include the abilities to characterize the function of genes from non-model plants using a heterologous expression system, and to identify genes that confer stress tolerance to plants as a result of ectopic transgene expression [4]. Overexpression of heterologous genes is widely used for the introduction of novel traits into transgenic plants. Overexpression can also be used in combination with down-regulation and controlled expression (e.g. induced, developmentally regulated, or tissue specific) studies as a tool for basic plant research and for functional analysis of native genes in various model plants [6]. One of the major progress in transgenic research relied to the technical advance in cloning technologies making possible the elaboration of genetic constructs. While traditional binary vectors have proven helpful in gene transfer assays, these plasmids rarely permit the cloning and transfer of more than a single target gene. The Gateway cloning system based on site-specific recombination has recently provided a significant improvement; in particular, the system enables assembly of multiple DNA fragments in a predefined order, orientation and frame register in a flexible manner and allows delivery of multiple transgenes to plant cells [6, 164]. Many of the principles described here arise from the numerous overexpression studies that have been reported in model species, where technical advantages facilitate overexpression screens and subsequent analysis, but examples are provided from other organisms that support or expand the concepts and demonstrate their validity in other plant systems.

Insertional Mutagenesis and TILLING

Insertional Mutagenesis

Insertional mutagenesis is a mutation caused by insertion of new genetic material (foreign DNA) into the target gene. Both T-DNA and transposon insertional mutants are being produced recently as an extremely valuable research tools for model plant systems to study gene function. Insertional mutagenesis has been developed within the last decade to generate specific mutations in organisms in which homologous recombination is a low frequency event [165]. The most effective method for insertional mutagenesis is targeted gene disruption. Large populations of T-DNA-tagged lines or mutants with transposon activation having an insertion at unique site in the genome have been generated in algae [165] and model plants [166, 167]. The approach relies on the generation of thousands of transformants followed by PCR-based screenings that allow for identification of lines harboring the introduced mutation within specific genes of interest [107]. Defining the insertion site for each transformant has allowed for the establishment of sequence-indexed libraries of mutant plants. T-DNA or transposon insertion has been exploited to create disruptions in target genes of interest, introduce new genes, or activate endogenous genes in the plant genome [2, 108, 167]. Transposons have several advantages over T-DNA including the ability to produce multiple independent insertion lines from individual starter lines, as well as producing revertants by remobilization [168].

Tilling

Targeting-induced local lesions in genomes (TILLING) is a strategy for the discovery and mapping of induced point mutations that was raised a decade ago as an alternative to insertional mutagenesis. This method was originally developed by the Henikoff laboratory to screen libraries of Ethyl methanesulfonate (EMS)-treated Arabidopsis for desired mutant alleles [169] and was subsequently adapted to others plant species. TILLING, which combines traditional mutagenesis with genome-wide high-throughput screening for point mutations in desired genes, is moving beyond functional genomics into crop improvement [94, 170, 171]. This technique was developed as a high-throughput and low cost reverse genetic method that has been successfully applied to many plant species making the TILLING process broadly applicable [172, 173]. Besides model plants, the feasibility of TILLING has already been demonstrated for a large number of agronomically important crops, including rice, barley, wheat, maize, sorghum, soybean, rapeseed and tomato plants. Furthermore, this method does not require transformation procedures and thus it is suitable for recalcitrant species and recommended as non-GMO technology to avoid controversies [3]. Recently, large-scale TILLING platforms for identifying mutants in plants offer great potential as a tool for functional genomics [3, 105, 106, 171].

Gene Targeting for Genome Engineering in Plants

Homologous Recombination-based Approaches for Genome Modification

Homologous recombination (HR) is one of the most universal events during the course of life evolution since it plays a central role in creating genetic diversity. It is most widely used by cells to accurately repair harmful breaks that occur on both strands of DNA known as double-strand breaks and thus safeguarding genomic integrity. HR was applied to several species to induce directed-site mutations and gene-disruption activity. HR-dependent targeted gene replacement (also called gene targeting) has been used extensively in yeast genome to elucidate the biological function of all predicted ORF. For plant cells, this method was firstly established on tobacco protoplasts [174] and then in moss Physcomitrella which exhibits high frequencies of gene targeting through HR [175, 176]. However, HR-based gene targeting has been for long term inappropriate and far from common practice for higher plants [177-179]. Despite of the low frequency of HR-based approaches that remains the major barrier, successful gene targeting in tobacco [174], Arabidopsis [180], and rice [181] using HR were reported. In general, a targeting DNA construct contains typically part of the gene to be targeted along with a reporter gene or/and a selectable marker. The transformed organism carrying a loss-of-function mutation can then be analyzed for its phenotype. Gene targeting is thus a powerful tool for directed ‘knockout’ of genes [177]. Plant DNA-repair machinery predominantly uses non-homologous end-joining (NHEJ), making the homologous recombination (HR)-based methods, which have proved fruitful for gene targeting in non-plant systems, unsuitable for use in plants [109, 182]. Precise genome modification is an attractive method for understanding gene function. Gene targeting is known as one of the best method currently available to induce specific change of endogenous gene via HR [183]. This strategy can be used to delete (knock-out) or substitute a gene (knock-in), remove exons, and introduce point mutations. Gene targeting can be applied to the whole organism or may be limited to a particular developmental stage and definite plant tissue. It can be used also for any gene, regardless of transcriptional activity or gene size.

Artificial Endonucleases-mediated Genome Editing

Targeted genome engineering (also known as genome editing) has emerged as an option to classical plant breeding and transgenic methods for crop improvement [184] and it is currently the most attractive topic in plant molecular biology and genomics research. A key step in genome editing is the generation of a double-stranded DNA break that is specific to the target gene. This is achieved by engineered endonucleases, which enable site-directed mutagenesis via a NHEJ repair pathway and/or gene targeting via HR to occur efficiently at specific sites in the genome [185, 186]. Some technical advances described targeted mutagenesis and gene targeting by either NHEJ machinery using site-specific induction of double-strand breaks (DSBs), or by activation of a HR pathway through overexpression of a yeast DNA recombination gene in transgenic plants [109]. Although gene targeting efficiency has been restricted to some plant species, its frequency was significantly enhanced by the recent use of programmable nucleases [5, 187, 188]. Since that, many plant genes have been knocked out by this method. Over the past 15 years, tremendous efforts have been made and a variety of technologies have been developed to target mutations to a specific location in the genome using artificial endonucleases. One of the earliest nuclease technologies involved mega-nucleases that were very difficult to engineer and requiring a long and costly process [189, 190]. The DNA-binding domains of Zinc finger transcription factors (ZFNs) were the first to be used as genome editing tools [191] and have proven easier to manipulate and have been used in tobacco [187], Arabidopsis [188] and maize [192]. More recently, transcription activator-like effector nucleases (TALENs) [193], as well as the clustered regularly interspaced short palindromic repeats/Cas (CRISPR/Cas9) system using an RNA-guided bacterial immune system complex were reported [194]. First studies confirmed that CRISPR provides protection against invading viruses when combined with Cas9 genes involving an RNA-mediated DNA targeting [195]. Thereafter, CRISPR/Cas9 system is revolutionizing the field of genomic editing and have proved very useful and functional [9] providing scientists with a powerful tool able to change any gene, in any cell in a highly targeted manner and without introducing foreign DNA. So far, several reports demonstrated the immense versatility of the technology in the field of plant biology by using a range of transformation platforms (PEG-protoplast transfection [196, 197], agroinfiltration [198, 199], virus transfection [200] and generation of stable transgenic plants [185, 201, 202]. This CRISPR/Cas9 allows for specific genome disruption and replacement in a flexible, robust and simple system resulting in high specificity and low cell toxicity, by targeting both endogenous genes and transgenes and by exploiting NHEJ and HR to generate small deletions, targeted insertions and multiplex genome modifications [9, 185]. This method has just been applied to a number of species including Arabidopsis [184, 201], tobacco [196, 198], tomato [201], sweet orange [199], wheat [197] and rice [203, 204]. For all instances, genome editing was shown to work in both model and crop plants, as well as in a variety of other organisms using engineered nucleases as powerful tools to target specific DNA sequences to edit genes precisely in the plant genomes.

Challenges and future outlooks

With the advent of high-throughput sequencing technologies, functional genomics have become a key platform in determining the biological function of individual gene. Today, many approaches are available for reverse genetics studies in plants that offer powerful tools to identify target genes and explore their functions in different manner. Functional analysis of the plant genome relies through loss or gain of function approaches and could be carried out using stable transgenics or alternative tools of transient expression assays depending on the target plant species and the gene studied. It is not clear however how the availability in the near future of many hundreds of fully sequenced plant genomes would be integrated; and new levels of data organization may need to be developed. The complexity of plant genome and gene expression, regulation, function and interaction in a single cell as well as within a whole organism should be deeply investigated in the future via multiple omics platforms. In addition, new platforms for analyzing others biomolecules require to be implemented including hormones (hormonome), signaling and transduction pathways (signalome), as well as methodological advance in cell imaging, cell micro-manipulation and other tools for studying the dynamic at cytoplasm, organelle and membrane levels. It seems also that the largest part of research studies focus on the genomic information and subsequent putative function of target genes, while it is already known that cytoplasmic environment is the crucial factor that coordinate gene expression and cell differentiation during developmental stages or under adverse conditions. The direction of how the plant genome will react to any stimuli requires specific environment with proper cytoplasmic components that could be investigated in the near future. Integration of such tools will provide new insights for functional genomics/system biology in plant science fundamental and applied research towards food safety and crop improvement.
Table 1

Evolution of sequencing platforms.

Platforms Companies Detection Methods References Advantages + / Disadvantages -
Conventional sequencingSangerSanger biochemistryCapillary electrophoresis/ clone by clone strategy[34]Low throughput-,In vivo cloning,-Long reads+/ High costs-
Next Generation SequencingGS FLXRoche 454 Life Sciences,Pyrosequencingdetection of pyrophosphate[14]No in vivo cloning+Better sequence quality+,High throughput+,Low cost+,wide range of uses+,Very short reads-.Difficulty for genome sequencing assemblies-
George Church’s Laboratory at Harvard UniversityPolony sequencingMultiplex polony sequencing[35]
Genome analyzerSolexa, Illumina IncFluorophore labeled reversible terminator nucleotides[36]
Intelligent Bio-SystemsQiagensequencing by synthesis[37]
SOLiD (Supported oligo ligation detection)Applied BiosystemsFluorophore labeledoligonucleotide probes[38]
HeliScopeHelicos BioscienceSigne molecule detection system, sequencing by synthesis[15]
Ion PGM SystemLife TechnologyPersonal genome machinePost-light Ion semiconductor sequencing[39]
Ion PGM SystemLife TechnologyPersonal genome machineNon-optical DNA sequencing[40]
Third Generation SequencingPacBio RSPacific biosciencesPhospholinked fluorophore labeled nucleotides[16]Sequencing in real time+Long reads length+novo genome assemblies+
PacBio RSPacific biosciencesspecific single-molecule sequencing of 5-hydroxy methylcytosine[41]
Table 2

Major agronomically important plant genomes sequenced over the last years. ( (For entire list 2015, see [29]).

Plant Species Genome Size Number of Predicted Genes Sequencing and Assembly Status References
Zea mays(maize)2.30 Gb39 656Contig N50/Scaffold N50/ Sanger[42]
Sorghum bicolor(sorghum)0.73 Gb34 496Contig N50/Scaffold N50/ Sanger/ whole-genome shotgun[20]
Triticum aestivum(bread wheat)3.92 GbRoche 454/ Illumina whole-genome shotgun[43, 44]
Hordeum vulgare(barley)5.1 Gbwhole-genome shotgun /RNA seq[45]
Glycine max(soybean)1.115 Gb46 430Contig N50/Scaffold N50/ Sanger/ whole-genome shotgun[46]
Solanum tuberosum (potato)0.856 Gb39 031Sanger/Roche 454/ Illumina/ Solid[47]
Solanum lycopersicum (tomato)0.9 Gb34 727Sanger/Roche 454/ Illumina/ Solid[48]
Capsicum annuum(pepper)3.48 Gb34 476whole-genome shotgun / Illumina-sequencing[49]
Beta vulgaris(sugarbeet)0.758 Gb27 421Roche 454, Illumina and Sanger sequencing platforms[50]
Cicer arietinum(chickpea)0.738 Gb28 269Illumina sequencing[51]
Phaseolus vulgaris (common bean)0.587 Gb43 627whole-genome shotgun/ Sanger /Roche454/Illumina-sequencing/RNA seq[52]
Vitis vinifera(grapevine)0.5 Gb54 216whole-genome shotgun[21]
Citrus sinensis(sweet orange)0.319 Gb25 376Sanger/454sequencing technology[53]
Prunus persica(peach)0.265 Gb27 852Sanger whole-genome shotgun methods/sanger sequencing[54]
Musa acuminata (banana)0.523 Gb36 542Contig N50/Scaffold N50[55]
Solanum melongena (egg plants)1.126 Gb85 446HisSeq2000/454 GS FLX[56]
Brassica oleracea (cabbage)0.630 Gb45 758Illumina, Roche 454 and Sanger sequencing[57]
Table 3

Reverse genetics methods used for functional genomics in plants.

Method Advantages Disadvantages References
RNA interference▪ High-throughput vectors▪ Heritable,▪ Ability to silence multiple target genes at once▪ Single copy of the target transgene is frequently sufficient to induce silencing▪ Variability in silencing efficiency▪ Require efficient transformation▪ Not developed for all species▪ Effect on non-target genes[100-102]
Virus-induced gene silencing (VIGS)▪ No limitation imposed by transformation efficiency▪ Rapid, easy to use▪ Can be adapted for high-throughput screens▪ Generate partial loss-of-function▪ Host range limitations▪ Not established on all plant species▪ Allows only transient expression▪ Silencing level is variable[8, 100, 103, 104]
Ectopic expression▪ Can be adapted for high-throughput screens▪ Suitable for transgene expression and gain-of-function analysis▪ Limited to transformable plant species▪ Possibility of generating misleading neomorphs▪ Expression level can be changed by exogenous regulatory sequences[1, 4, 11, 12]
Target Induced Local Lesions in Genome (TILLING)▪ Stable mutations▪ Suitable for non-transformable species▪ Allows identification of allelic series of mutants with a range of modified functions for a particular gene▪ Low/medium-throughput▪ Desired mutation might never be found▪ Need a large mutant population▪ Relatively expensive[1-3, 105, 106]
Insertional mutagenesis▪ High-throughput▪ Results in stable mutations▪ Suitable for non-transformable species▪ Can be adapted for both loss-of-function and gain-of-function studies▪ Desired mutation might never be found▪ Variable effects depending on site of integration[1, 2, 107, 108]
Gene targeting and Genome editing▪ Highly specific▪ Most revolutionary method that can be used for gene replacement▪ Can be adapted for both loss-of-function and gain-of-function studies▪ Very low efficiency of homologous recombination[1, 5, 9, 109]
  184 in total

Review 1.  Exploring plant genomes by RNA-induced gene silencing.

Authors:  Peter M Waterhouse; Christopher A Helliwell
Journal:  Nat Rev Genet       Date:  2003-01       Impact factor: 53.242

2.  From systems biology to photosynthesis and whole-plant physiology: a conceptual model for integrating multi-scale networks.

Authors:  David J Weston; Paul J Hanson; Richard J Norby; Gerald A Tuskan; Stan D Wullschleger
Journal:  Plant Signal Behav       Date:  2012-02-01

3.  Agroinoculation and agroinfiltration: simple tools for complex gene function analyses.

Authors:  Zarir Vaghchhipawala; Clemencia M Rojas; Muthappa Senthil-Kumar; Kirankumar S Mysore
Journal:  Methods Mol Biol       Date:  2011

Review 4.  Integrated omics approaches in plant systems biology.

Authors:  Atsushi Fukushima; Miyako Kusano; Henning Redestig; Masanori Arita; Kazuki Saito
Journal:  Curr Opin Chem Biol       Date:  2009-12       Impact factor: 8.822

5.  Efficient gene editing in tomato in the first generation using the clustered regularly interspaced short palindromic repeats/CRISPR-associated9 system.

Authors:  Christopher Brooks; Vladimir Nekrasov; Zachary B Lippman; Joyce Van Eck
Journal:  Plant Physiol       Date:  2014-09-15       Impact factor: 8.340

6.  High-throughput comparison, functional annotation, and metabolic modeling of plant genomes using the PlantSEED resource.

Authors:  Samuel M D Seaver; Svetlana Gerdes; Océane Frelin; Claudia Lerma-Ortiz; Louis M T Bradbury; Rémi Zallot; Ghulam Hasnain; Thomas D Niehaus; Basma El Yacoubi; Shiran Pasternak; Robert Olson; Gordon Pusch; Ross Overbeek; Rick Stevens; Valérie de Crécy-Lagard; Doreen Ware; Andrew D Hanson; Christopher S Henry
Journal:  Proc Natl Acad Sci U S A       Date:  2014-06-09       Impact factor: 11.205

7.  A physical, genetic and functional sequence assembly of the barley genome.

Authors:  Klaus F X Mayer; Robbie Waugh; John W S Brown; Alan Schulman; Peter Langridge; Matthias Platzer; Geoffrey B Fincher; Gary J Muehlbauer; Kazuhiro Sato; Timothy J Close; Roger P Wise; Nils Stein
Journal:  Nature       Date:  2012-10-17       Impact factor: 49.962

Review 8.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

9.  A large-scale genetic screen for mutants with altered salicylic acid accumulation in Arabidopsis.

Authors:  Yezhang Ding; Danjela Shaholli; Zhonglin Mou
Journal:  Front Plant Sci       Date:  2015-01-07       Impact factor: 5.753

10.  High-frequency modification of plant genes using engineered zinc-finger nucleases.

Authors:  Jeffrey A Townsend; David A Wright; Ronnie J Winfrey; Fengli Fu; Morgan L Maeder; J Keith Joung; Daniel F Voytas
Journal:  Nature       Date:  2009-04-29       Impact factor: 49.962

View more
  8 in total

1.  Double-Strand RNA (dsRNA) Delivery Methods in Insects: Diaphorina citri.

Authors:  Yulica Santos-Ortega; Alex Flynt
Journal:  Methods Mol Biol       Date:  2022

2.  Methods for Delivery of dsRNAs for Agricultural Pest Control: The Case of Lepidopteran Pests.

Authors:  Bruna Garbatti Factor; Flavia de Moura Manoel Bento; Antonio Figueira
Journal:  Methods Mol Biol       Date:  2022

3.  Selection of suitable candidate genes for mRNA expression normalization in bulbil development of Pinellia ternata.

Authors:  Haoyu Fan; Qiuling He; Yiheng Dong; Wenxin Xu; Yanlin Lou; Xuejun Hua; Tao Xu
Journal:  Sci Rep       Date:  2022-05-25       Impact factor: 4.996

Review 4.  Advances and perspectives in discovery and functional analysis of small secreted proteins in plants.

Authors:  Xiao-Li Hu; Haiwei Lu; Md Mahmudul Hassan; Jin Zhang; Guoliang Yuan; Paul E Abraham; Him K Shrestha; Manuel I Villalobos Solis; Jin-Gui Chen; Timothy J Tschaplinski; Mitchel J Doktycz; Gerald A Tuskan; Zong-Ming Max Cheng; Xiaohan Yang
Journal:  Hortic Res       Date:  2021-06-01       Impact factor: 6.793

5.  Long non-coding RNA ZFAS1 sponges miR-486 to promote osteosarcoma cells progression and metastasis in vitro and vivo.

Authors:  Nan Li; Zhen-Hui Sun; Min Fang; Jing-Yi Xin; Chun-You Wan
Journal:  Oncotarget       Date:  2017-10-24

6.  Editorial: Genomics and Functional Genomics of Stress-mediated Signaling in Plants: Volume I.

Authors:  Manisha Sharma; Girdhar K Pandey
Journal:  Curr Genomics       Date:  2017-12       Impact factor: 2.236

7.  Comprehensive transcriptomic analysis of two RIL parents with contrasting salt responsiveness identifies polyadenylated and non-polyadenylated flower lncRNAs in chickpea.

Authors:  Mayank Kaashyap; Sukhjiwan Kaur; Rebecca Ford; David Edwards; Kadambot H M Siddique; Rajeev K Varshney; Nitin Mantri
Journal:  Plant Biotechnol J       Date:  2022-05-13       Impact factor: 13.263

8.  Functional Role of AsAP in the Reproduction of Adelphocoris suturalis (Hemiptera: Miridae).

Authors:  Shidong Qin; Bangqin Zhu; Xingxing Huang; J Joe Hull; Lizhen Chen; Jing Luo
Journal:  Insects       Date:  2022-08-22       Impact factor: 3.139

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.