Literature DB >> 26050794

A knowledge base for Vitis vinifera functional analysis.

Alfredo Pulvirenti, Rosalba Giugno, Rosario Distefano, Giuseppe Pigola, Misael Mongiovi, Girolamo Giudice, Vera Vendramin, Alessandro Lombardo, Federica Cattonaro, Alfredo Ferro.   

Abstract

BACKGROUND: Vitis vinifera (Grapevine) is the most important fruit species in the modern world. Wine and table grapes sales contribute significantly to the economy of major wine producing countries. The most relevant goals in wine production concern quality and safety. In order to significantly improve the achievement of these objectives and to gain biological knowledge about cultivars, a genomic approach is the most reliable strategy. The recent grapevine genome sequencing offers the opportunity to study the potential roles of genes and microRNAs in fruit maturation and other physiological and pathological processes. Although several systems allowing the analysis of plant genomes have been reported, none of them has been designed specifically for the functional analysis of grapevine genomes of cultivars under environmental stress in connection with microRNA data. DESCRIPTION: Here we introduce a novel knowledge base, called BIOWINE, designed for the functional analysis of Vitis vinifera genomes of cultivars present in Sicily. The system allows the analysis of RNA-seq experiments of two different cultivars, namely Nero d'Avola and Nerello Mascalese. Samples were taken under different climatic conditions of phenological phases, diseases, and geographic locations. The BIOWINE web interface is equipped with data analysis modules for grapevine genomes. In particular users may analyze the current genome assembly together with the RNA-seq data through a customized version of GBrowse. The web interface allows users to perform gene set enrichment by exploiting third-party databases.
CONCLUSIONS: BIOWINE is a knowledge base implementing a set of bioinformatics tools for the analysis of grapevine genomes. The system aims to increase our understanding of the grapevine varieties and species of Sicilian products focusing on adaptability to different climatic conditions, phenological phases, diseases, and geographic locations.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26050794      PMCID: PMC4464603          DOI: 10.1186/1752-0509-9-S3-S5

Source DB:  PubMed          Journal:  BMC Syst Biol        ISSN: 1752-0509


Background

Vitis vinifera [1] is one of the most important plant species in the modern world. Its economic and cultural impact is undeniable. With about 8 million hectares of vineyards across the world [2] and more than 5000 estimated cultivars [3], Vitis vinifera contributes significantly to the economy of wine-producing countries. Improving the quality of wine and increasing the resistance of grapevine to pathogens and environmental stress are crucial tasks for the development of wine industry [4]. A deeper understanding of the underlying mechanisms can aid plantation development and effective pest and pathogen treatment design. This is particularly helpful when environmental conditions are not optimal (e.g. scarcity of water, climate instability). Finally, it drives the development of superior germplasm and the establishment of more robust and high-quality cultivars. Recently, several projects have been devoted to acquire more knowledge about grapevine at a genomic and biomolecular level. A recent review of the cutting-edge of grape and wine biotechnologies can be found in [5]. In 2007, the French-Italian Public Consortium for Grapevine Genome Characterization presented the first sequenced genome of Vitis vinifera [6]. This was obtained from a highly homozygous genotype, by a total of 6.2 million end-reads, representing an 8.4-fold coverage of the grape genome. At a later stage, a higher quality sequenced genome (12-fold coverage), from highly heterozygous cultivar, was presented [7] and deposited in PlantGDB (http://www.plantgdb.org/VvGDB/). The availability of the whole grapevine genome, and its comparison with other available plant genomes (Arabidopsis, Oryza sativa, Populus, Medicago and Solanum lycopersicum), has led to the discovery of ancestral traits and genetic organization of flowering plants. Furthermore, the availability of the grape genome has driven a multitude of studies on the genetics of grapes, including genome annotation and the development of genomic resources such as bacterial artificial chromosomes (BAC) and cDNA libraries [8-10]. Vitis vinifera genes are associated with many existing pathways [11], concerning Metabolism, Genetic and Environmental Information Processing, Cellular Processes, Transport, and Transcription Factors. On the other hand, for several genes the functional annotation is still missing. Among the 29,971 known Vitis vinifera genes, more than 30% are without any annotation. Sequencing of grape genomes enables new studies on grapevine variety, aimed at the identification of the genetic origin of phenotypic traits, resistance and wine quality. Among the others, two genes have been associated with grape berry coloring [12], giving us evidence that white grapes were probably caused by the simultaneous mutation of these two adjacent regulatory genes. A comprehensive study of this hypothesis has been performed by Velasco et al [13]. They assembled de novo the genome of the heterozygous Vitis vinifera cultivar Pinot Noir and identified several genes involved in pathogen resistance and polyphenol metabolism (influencing vine quality). They also discovered about 2 million SNPs and mapped most of them to chromosomes. Later on, Myles et al [14] performed a large-scale polymorphism detection on 10 cultivars (including Pinot Noir, White Riesling and Malvasia, among others) and 7 wild Vitis species. They identified hundreds of thousands of SNPs and produced a 9K genotyping array, able to discriminate different cultivars. Laucou et al [15] performed a large study of genetic diversity involving 4,370 accessions of Vitis originating from different geographic areas. They used 20 simple sequence repeat markers leading to the identification of 524 alleles and 2,836 different profiles. Considerable attention has been dedicated to the analysis of the grapevine transcriptome. Studies on gene expressions and trascriptional profiling have shed light on several biological processes of Vitis vinifera involving ripening and maturation [16-19], dormancy transitioning [20], and resistance to pathogens and environmental conditions [21-23]. High throughput sequencing approaches has led to characterizing various transcripts of Vitis, in particular microRNAs, small RNA molecules that have been shown to play a central role in gene regulation within several important biological processes. Mica et al. [24] used oligonucleotide arrays to measure the expression of previously identified putative microRNAs in different tissues and during fruit maturation. They found several patterns of differentially expressed microRNAs and identified candidate splicing events. Wang et al [25] built a library of small RNAs of Vitis amurensis, a resistant wild type of Vitis genus, and identified several conserved and non-conserved microRNAs. Currently, Vitis vinifera's genomic data are stored by multiple sources, each of them maintaining a specific kind of data (genomic [26-28], miRNA [24], SNPs [13] and pathways [11]). VitisNet [11] stores manually annotated biological networks of Vitis Vinifera and allows the understanding of dynamic processes in systems biology experiments. All VitisNet files together with the manual annotation of the grape genome encompassing pathway names, individual genes, their genome identifier, and chromosome location can be accessed and downloaded from the VitisNet portal. Recently, in [29] authors proposed VTCdb, a grapevine gene co-expression database equipped with basic functional enrichment and visualization capabilities. This computational tool allows querying RNA-Seq data for grapevine berry development [30]. Here we present BIOWINE, a novel platform to manage and analyze genomic data of Vitis vinifera. BIOWINE is a free web-based resource allowing users to browse and query Vitis vinifera RNA-seq data properly integrated with knowledge coming from different external sources including genes, transcripts, proteins, microRNAs, pathways and Gene Ontology (GO) associations. BIOWINE is complementary to the Vitis data sources, such as VitisNet and VTCdb, and allows a more comprehensive genomic analysis of Vitis Vinifera cultivars. It integrates data from multiple sources and enables for comprehensive genomic analysis of grapevine cultivars at different phenological and pathological states. It allows the analysis of novel data obtained with state-of-the-art RNA-seq. BIOWINE computes the abundance of transcripts from RNA-seq data, obtaining highly reliable expression data for both mRNAs and short non-coding RNAs such as microRNAs. Data and results can be properly visualized and downloaded through an appropriate web-interface available at the following URL: http://alpha.dmi.unict.it/biowine/.

Construction and content

Data source

Vitis Vinifera samples. BIOWINE is a knowledge base storing RNA-seq data of Vitis vinifera native Sicilian grapevines in different climatic and phenological conditions. The samples have been selected both healthy and with diseases such as water stress, leaf curling, iron chlorosis and yellow mosaic. The presence of water stress was inferred by carrying out a recognition the year before sampling. The plants to be sampled affected by water stress have been pointed by the technical director of the company on the basis of the ground with excessive skeleton (see. Figure 1 (A)) and water stress presented each year. Yellow mosaic is caused by chromogenic grapevine fan-leaf nepovirus (GFLV) strains. Affected vines show chrome-yellow discolorations that develop early in the spring and may affect all vegetative parts of the vines (leaves, herbaceous shoot axes, tendrils and inflorescences, see Figure 1 (B)). Chromatic alterations of the leaves vary from a few scattered yellow spots, sometimes appearing as rings and lines, to variously extended mottling of the veinal and/or interveinal areas, to total yellowing. In spring, affected plants in a vineyard can readily be spotted from a distance. Malformations of leaves and canes are usually not prominent, but clusters may be smaller than normal and may have shot berries. In hot climates the newly produced summer vegetation has a normal green colour, while the yellowing of the old growth turns whitish and tends to fade away. Our inspections identified the following plant material:
Figure 1

(A) A plant presenting a water stress condition. (B) A plant presenting yellow Mosaic disease. (C) Woody tissues used for sampling the water stressed grapevine.

(A) A plant presenting a water stress condition. (B) A plant presenting yellow Mosaic disease. (C) Woody tissues used for sampling the water stressed grapevine. • Grapevine Nerello Mascalese located in the territory of Randazzo (CT), 672,8 m above sea level, planted in 2006. Inside the vineyard the following plants have been selected: 3 apparently healthy; 3 showing symptoms of leaf curling; 3 with clear water stress; 1 with Mosaic disease. • Grapevine Nerello Mascalese located in the territory of Linguaglossa (CT), 614 m above sea level, planted in 2007. Inside the vineyard 3 apparently healthy plants and 1 with Mosaic disease have been selected. • Grapevine Nero d'Avola located in the territory of Caltanissetta (CL), 379 m above sea level, planted in 2001. Inside the vineyard the following plants have been selected: 3 apparently healthy; 3 with obvious iron chlorosis. • Grapevine Nero d'Avola located in the territory of Marzamemi (SR), 12 m above sea level, planted in 1989. Inside the vineyard 3 apparently healthy plants have been selected. • Grapevine Nero d'Avola located in the territory of Noto (SR), 25 m above sea level, planted in 1996. Inside the vineyard 3 apparently healthy plants have been selected. Within the identified plants it was decided to carry out the following sampling during different phenological stages: • Vegetative sleeping: Sampling of woody shoots. • Germination: Sampling of leaves. • Bloom: Sampling of leaves. • End of flowering: Sampling of berries. • Bunch closure: Sampling of berries. • Veraison: Sampling at ripening. • Fruit: Sampling of berries. For sampling at ripening, each biological replicate comprised a pool of berries from 3 to 5 grapes collected on the same plant. The collection of samples for the water-stressed vine during vegetative sleeping were taken from the woody tissues from vines pruned previously. These were pooled and subsequently subjected to RNA extraction (see. Figure 1 (C)). See Table 1 for the complete list of all the sequenced samples.
Table 1

The list of sequenced samples.

Sample ID Growing location Grapevine Phenotype Phenological Phase Age
CL-1CaltanissettaNero D'AvolaNormalVegetative sleeping12 years

CL-2CaltanissettaNero D'AvolaIron chlorosisVegetative sleeping12 years

MA-1MarzamemiNero D'AvolaNormalVegetative sleeping24 years

NO-1NotoNero D'AvolaNormalVegetative sleeping17 years

LI-1LinguaglossaNerello MascaleseNormalVegetative sleeping5 years

RA-1RandazzoNerello MascaleseWater stressVegetative sleeping7 years

RA-2RandazzoNerello MascaleseLeaf curlingVegetative sleeping7 years

RA-3RandazzoNerello MascaleseNormalVegetative sleeping7 years

MA-2MarzamemiNero D'AvolaNormalGermination24 years

NO-2NotoNero D'AvolaNormalGermination17 years

CL-3CaltanissettaNero D'AvolaNormalGermination12 years

CL-4CaltanissettaNero D'AvolaIron chlorosisGermination12 years

RA-4RandazzoNerello MascaleseWater stressGermination7 years

RA-5RandazzoNerello MascaleseLeaf curlingGermination7 years

RA-6RandazzoNerello MascaleseNormalGermination7 years

LI-2LinguaglossaNerello MascaleseNormalGermination5 years

MA-3MarzamemiNero D'AvolaNormalEnd of flowering24 years

NO-3NotoNero D'AvolaNormalEnd of flowering17 years

CL-5CaltanissettaNero D'AvolaNormalEnd of flowering12 years

CL-6CaltanissettaNero D'AvolaIron chlorosisEnd of flowering12 years

LI-3LinguaglossaNerello MascaleseNormalEnd of flowering5 years

LI-MS1LinguaglossaNerello MascaleseMosaicEnd of flowering5 years

RA-7RandazzoNerello MascaleseWater stressEnd of flowering7 years

RA-8RandazzoNerello MascaleseLeaf curlingEnd of flowering7 years

RA-9RandazzoNerello MascaleseNormalEnd of flowering7 years

MA-4MarzamemiNero D'AvolaNormalFruit24 years

NO-4NotoNero D'AvolaNormalFruit17 years

CL-7CaltanissettaNero D'AvolaNormalFruit12 years

CL-8CaltanissettaNero D'AvolaIron chlorosisFruit12 years

LI-4LinguaglossaNerello MascaleseNormalFruit5 years

LI-MS2LinguaglossaNerello MascaleseMosaicFruit5 years

RA-10RandazzoNerello MascaleseWater stressFruit7 years

RA-11RandazzoNerello MascaleseLeaf curlingFruit7 years

RA-12RandazzoNerello MascaleseNormalFruit7 years

RA-MS2RandazzoNerello MascaleseMosaicFruit7 years

MA-5MarzamemiNero D'AvolaNormalBunch closure24 years

NO-5NotoNero D'AvolaNormalBunch closure17 years

LI-5LinguaglossaNerello MascaleseWater stressBunch closure5 years

LI-MS3LinguaglossaNerello MascaleseMosaicBunch closure5 years

RA-13RandazzoNerello MascaleseWater stressBunch closure7 years

RA-14RandazzoNerello MascaleseLeaf curlingBunch closure7 years

RA-15RandazzoNerello MascaleseNormalBunch closure7 years

RA-MS3RandazzoNerello MascaleseMosaicBunch closure7 years

CL-9CaltanissettaNero D'AvolaNormalBunch closure12 years

CL-10CaltanissettaNero D'AvolaIron chlorosisBunch closure12 years

CL-11CaltanissettaNero D'AvolaNormalVeraison12 years

LI-6LinguaglossaNerello MascaleseNormalVeraison5 years

LI-MS4LinguaglossaNerello MascaleseMosaicVeraison5 years

RA-16RandazzoNerello MascaleseWater stressVeraison7 years

RA-17RandazzoNerello MascaleseLeaf curlingVeraison7 years

RA-18RandazzoNerello MascaleseNormalVeraison7 years
The list of sequenced samples. RNA isolation. Total RNAs for mRNA-seq were extracted using Spectrum Plant Total RNA Kit (Sigma-Aldrich) following the manufacturer's instructions. MiRNA/small RNAs were purified with MirPremier microRNA Isolation Kit (Sigma-Aldrich) following the manufacturer's instructions. The starting materials were tissues conserved in RNALater Stabilization Solution (Ambion). RNA-sequencing. RNA samples were processed using TruSeq mRNA-seq sample prep kit from Illumina (Illumina, Inc., CA, USA). Briefly, the poly-A containing mRNA molecules were purified using poly-T oligo-attached magnetic beads, fragmented into small pieces using divalent cations under elevated temperature, cDNA was synthesized by reverse transcription and standard blunt-ending plus add 'A' was performed. Then, Illumina TruSeq adapters with indexes were ligated to the ends of the cDNA fragments. After ligation reaction and separation of not ligated adapters, samples were amplified by PCR to selectively enrich those cDNA fragments in the library having adapter molecules at both ends. Each 6-plex pool was sequenced on HiSeq2000 at Institute of Applied Genomics (Udine, Italy), producing about 20 million of single-reads per sample, 50 bp long. miRNA-sequencing. RNA samples were processed by using TruSeq smallRNA-seq Sample Prep kit from Illumina (Illumina, Inc., CA, USA), following the subsequent steps. First the sequential ligation of the RNA 3' and RNA 5' adapters to the samples was performed, then the reverse transcription was followed by PCR to selectively enrich those cDNA fragments in the library having adapter molecules at both ends. Individual libraries with unique indices were pooled in 12plex and the bands corresponding to approximately the adapter-ligated constructs derived from the 22 nt and 30 nt small RNA fragments were recovered from a 6% PAGE gel. Each 12-plex pool was sequenced on HiSeq2000 at Institute of Applied Genomics (Udine, Italy), producing about 10 million of single-reads per sample, 50 bp long. RNA-seq analysis. In order to analyze RNA-seq data, a standard pipeline has been adopted. This is based essentially on the well-known Tophat2/Cufflinks/Cuffdiff pipeline by Trapnell et al [31]. As the first step, raw sequences have been trimmed. Then, reads of each sample have been aligned against the reference genome and finally the alignments have been processed by Cufflinks which is able to compute transcript abundances in Fragments Per Kilobase of exon per Million fragments mapped (FPKM). This allowed for obtainment of gene expression levels for each sample. In order to perform a differential expression analysis between each pair of samples in each phonological phase, the log2 fold change of FPKMs was computed by using Cuffdiff. Notice that Cufflinks has been used only for computing abundance of known transcripts. Future research work includes the possibility of analyzing putative novel transcripts (by running Cufflinks and Cuffmerge packages after Tophat2 and before Cuffdiff). We also indexed all the putative SNPs and INDELs found in the sequenced genomes. The pipeline used for Variant calling has been based on Samtools [32]. miRNA analysis. To remove adapter sequences, we trimmed Raw sequence by a proprietary script. Therefore, we considered only putative small RNA sequences. Next, we produced a list of unique sequences (tags) together with their occurrences. Each unique sequence was then mapped against the corresponding miRBase entry to build a table of counts of each known miRNA in each sample. Finally, the counts table was normalized according to the DESeq [33] normalization algorithm implemented in the DESeq Bioconductor package. As results, we obtained a table of miRNA expression levels. Moreover, in order to discover new putative miRNAs, small RNA sequences have been processed by using the miRDeep-P [34] package which is able to identify miRNA in plant species. Each sample was processed and a set of predicted miRNAs that meets the criteria of plant miRNAs was given in output. These new putative miRNAs will be investigated for a validation in a future work. Integrated knowledge bases. The genomic information have been integrated with third party databases. Genes have been associated with their GO terms through the Ensembl Biomarts [35], miRNAs were annotated with information about their precursor and mature sequences coming from miRBase [36]. Pathways have been obtained from KEGG [37]. We used UNIPROT [38] to index protein sequences. Finally, from psRNATarget [39] we stored information about microRNAs targets.

Database schema and implementation

The BIOWINE knowledge base has been built and maintained up-to-date on top of a MySQL database management system, accessible through the lightweight interface PHP Data Objects running on an Apache server. The front-end web has been developed by using Bootstrap and HTML. The database contains sixteen tables to store genes, samples, SNPs, validated and predicted miRNAs, mRNAs, coded proteins, pathways and GO terms. The system has been implemented by adopting a Model-View-Controller approach. The processing of the results as well as the extraction of data from the database are made by using Python and R. BIOWINE relies on Python scripts for enrichment and on R scripts for generating heatmaps (pheatmap library) and volcano plots (ggplot2 library). Each submitted job is associated with a unique alphanumeric identifier allowing subsequent consultations. Finally, two levels of controls on the input provided by the user have been implemented, a client-level and a server-level, in order to guide the writing and to avoid misspelling input.

Utility and Discussion

The BIOWINE Interface

BIOWINE is free web based resource allowing users to browse and query Vitis vinifera RNA-seq data. BIOWINE implements a custom version of genome browser GBrowse [40] for Vitis vinifera genome 12x available from [26]. All the functional annotation tools can be accessed from the home page of BIOWINE. Browsing. User can access all the data stored within BIOWINE according to a 'subject' which can be genes, pathways or microRNAs (see Figures 2 (A)).
Figure 2

BIOWINE main interface. (A) User lists all genes by chromosome, pathways, and miRNAs. (B) User searches information by (i) a single gene, miRNA, and pathway, or (ii) shared among a list of genes or pathways.

BIOWINE main interface. (A) User lists all genes by chromosome, pathways, and miRNAs. (B) User searches information by (i) a single gene, miRNA, and pathway, or (ii) shared among a list of genes or pathways. Search by single gene or miRNA. A simple search is used to get genomic and quantitative information about a single gene within the reference genome as observed by our RNA-seq experiments (see Figure 2 (B)). Once the user types the query, the system yields a page with all the information associated with that gene. The page contains several sections. The first one shows the chromosome location, the strand, the starting and ending position together with the NCBI and EBI gene IDs if known. A link to GBrowse allows to view the gene within the BIOWINE genomic browser. A second section gives the list of associated GO Terms (in Figure 3 we report the distribution of GO annotation in our knowledge base). The complete mRNA sequence together with the 3' and 5' UTR regions are given. A subsection within the mRNA panel gives an exonview capability of the gene expression for each sample. Through such a panel the user analyzes the expression level of each single exon in each sample.
Figure 3

Distribution of genes GO annotations. Among 29971 genes present in BIOWINE, we have 16967 genes annotated with a biological process; 18664 genes annotated with a molecular function; 9816 genes annotated with a cellular component. More than 30% of genes (9130) are without any annotation.

Distribution of genes GO annotations. Among 29971 genes present in BIOWINE, we have 16967 genes annotated with a biological process; 18664 genes annotated with a molecular function; 9816 genes annotated with a cellular component. More than 30% of genes (9130) are without any annotation. The next section lists all the known microRNAs targeting such a gene. Also in this case a link to GBrowse shows each miRNA within the BIOWINE genomic browser. Then, we report all the known proteins encoded by such a gene. Next we list all the pathways in which the gene enters. Then, BIOWINE gives a section with all the SNPs and INDELs observed in our sequenced samples. For each sample BIOWINE gives the type of event (SNP or INDEL), the nucleotide in the reference genome and the observed one. Information about SNPs and INDELs are given together with a "Quality" value which is basically a measure of how confident Samtools [32] are about that variant. The Genotype Quality (GQ) value encodes the read quality score. The user can limit the visualized SNPs by choosing the proper range of GQ values. The GQ range can be specified through the dedicated combo boxes at the beginning of the section. Next, we report a heatmap with the fold change of the gene expression for each pair of sequences samples. We show only the pairs of samples having a fold change supported by statistical significance. We also give a volcano plot of the results. Finally, through the last section the user can download all data in .csv and .txt formats. Similarly, through the searching by a single miRNA, BIOWINE shows all targeting information (genes and MRE (MiRNA Recognition Elements)) together with its expression over the samples. Search by multiple genes. The search by multiple genes works by pasting a list of genes (see Figure 2 (B)). The search can be done either by gene names (e.g. VvMYBA1) or by external identifiers from NCBI or EBI. BIOWINE performs a functional analysis on them. The system yields a page in which the user can find the genomic information of each gene. The results page is organized in the following sections. The "gene" panel summarizes the basic genomic information of the genes given as input (see Figure 4 (A)). For each gene BIOWINE gives a link to the gene details pointing to a BIOWINE page with the single gene information. Next, a list of enriched GO terms, if any, is given. The enrichment level of a gene set is evaluated through Fisher's exact test with a default cutoff of 0.05. We also apply the Bonferroni test for multiple testing correction.
Figure 4

(A) The basic genomic information of the genes given as input; (B) Sample pair comparisons of gene expressions; (C) A volcano plot is also shown. Notice that the figure partially reports the result page. By scrolling the result page, users can jump to any of the following sections: Pathways, Genes, Go Terms, miRNAs, Gene expression, Gene enrichment Analysis, Downloads. By using the bottom right menu, users can download results in .csv format.

(A) The basic genomic information of the genes given as input; (B) Sample pair comparisons of gene expressions; (C) A volcano plot is also shown. Notice that the figure partially reports the result page. By scrolling the result page, users can jump to any of the following sections: Pathways, Genes, Go Terms, miRNAs, Gene expression, Gene enrichment Analysis, Downloads. By using the bottom right menu, users can download results in .csv format. Then, a section lists the microRNAs targeting those a genes. The section showing the list of enriched pathways follows. Finally we provide a heatmap with the fold change of each pair of samples (see Figure 4 (B)). For each of them we give a table with the genes having a statistically significant fold change. We report also the volcano plot of the fold change values (see Figure 4 (C)). Finally, the download section allows users to store locally all the functional analysis results in a text format. Search by pathways. BIOWINE stores Vitis Vinifera pathway details obtained from KEGG. Users can query the system by giving as input the code or the title of one or more pathways (see Figure 2 (A)), BIOWINE performs a functional enrichment on these. The system yields the list of genes shared by those pathways. Then, a section lists the enriched GO Terms. The next panel gives the microRNAs targeting those genes if any. Then a list with the pathways in which those genes enter is given. Next, the heatmap with the fold change of each pair of samples is given. The download section allow to store locally all the data. Comparative differential expression analysis. Here, user is able to perform comparative analysis on the genes/miRNAs expression of all the RNA-seq data stored in BIOWINE. It is possible to compare one or more subjects among phenotype, phenological phase, grapevine, growing location and sample ID. The result page shows a heatmap with the genes/miRNAs having a statistical significant differential expression in the compared classes together with the list of enriched GO terms and pathways. Network visualization of results. Cytoscape web network visualization (http://www.cytoscape.org/) has been used to show the protein-protein interactions related to the searched genes or the genes contained in the searched pathways and their neighbors together with all miRNAs targeting such genes. When searching by miRNA, BIOWINE shows the interactions among the miRNA targeting genes and their adjacent PPI. Case Study. In what follows we present a BIOWINE investigation inspired by known results in the literature. This is an example of considerations that may be conducted through BIOWINE by analyzing phenotype and genomic data. In [41] Wang et al. studied the VvWRKY proteins which are an important class of transcriptional regulators in higher plants. Through a bioinformatics pipeline authors characterized the large class of WRKY genes in Vitis vinifera. Then, members of such a VvWRKY family genes were grouped into three main classes according to their exon-intron structures and motif compositions. Authors identified two subclasses containing genes exhibiting different patterns of expression in response to different stresses. We selected such a list of genes as reported in Table 2 and ran the functional analysis through BIOWINE. The analysis highlights that VIT 09s0018g00240 is over expressed on CL-2 sample which is subject to Iron chlorosis, and it presents SNPs only on such a sample. VIT 16s0050g02510 is clearly downregulated on all Nerello Mascalese samples. In particular on RA-1, which is subject to water stress, VIT 16s0050g02510 presents a deletion in the transcript. In Table 3 we report the enriched GO terms of the vVwRKY selected proteins. We can notice that genes VIT 04s0008g05750 and VIT 09s0018g00240 are enriched by the term "response to salicylic acid" which alleviates decreases in photosynthesis under heat stress and helps the recovery in grapevine leaves [42].
Table 2

Selected genes for the case study.

GenesEnsable gene IDchr
1VIT 04s0008g057504

2VIT 09s0018g002409

3VIT 02s0025g012802

4VIT 08s0058g013908

5VIT 13s0067g0314013

6VIT 15s0046g0114015

7VIT 16s0050g0251016
Table 3

Case study: Enriched Process and Functions

GO TermDescriptionp-valueGenes
Processes

GO:0006355regulation of transcription, DNA-templated8.56e-091 2 3 4 5 6 7

GO:0050691regulation of defense response to virus by host1.46e-071 2

GO:0031347regulation of defense response9.65e-051 2

GO:0002237response to molecule of bacterial origin0.00011 2

GO:0009751response to salicylic acid0.00041 2

Functions

GO:0043565sequence-specific DNA binding3.82e-131 2 3 4 5 6 7

GO:0001071nucleic acid binding transcription factor activity1.49e-101 2 3 4 5 6 7

GO:0003700sequence-specific DNA binding transcription factor activity1.6e-101 2 3 4 5 6 7

GO:0003677DNA binding3.08e-081 2 3 4 5 6 7

Enriched process and functions.

Selected genes for the case study. Case study: Enriched Process and Functions Enriched process and functions.

Conclusions

BIOWINE is a web resource for the study of Vitis vinifera genome. It stores exome and small RNAs RNA-seq sequencing data of two Sicilian grapevines: Nerello Mascalese e Nero D'avola. Its flexible web interface allows for the functional analysis of gene and microRNA expression, SNPs and exome expression variations of grapevine samples on different phenological phases and diseases status. BIOWINE integrates third party databases concerning GO Terms and pathways and implements a custom version of GBrowse with the Vitis Vinifera genome 12x. BIOWINE will be maintained up-to-date and it will integrate V. Vinifera genome data of other grapevines present in Italy.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

AP, RG and AF conceived the research. AP and RG developed and coordinated the research. RD and GP designed and implemented the system. MM and GG developed the functional enrichment module. VV, AL and FC provided and analyzed the samples. AP, RG, RD, GP, VV, AL, FC and AF contributed to analysis aspects and wrote the paper.
  34 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  White grapes arose through the mutation of two similar and adjacent regulatory genes.

Authors:  Amanda R Walker; Elizabeth Lee; Jochen Bogs; Debra A J McDavid; Mark R Thomas; Simon P Robinson
Journal:  Plant J       Date:  2007-03       Impact factor: 6.417

3.  Differential floral development and gene expression in grapevines during long and short photoperiods suggests a role for floral genes in dormancy transitioning.

Authors:  Lekha Sreekantan; Kathy Mathiason; Jérôme Grimplet; Karen Schlauch; Julie A Dickerson; Anne Y Fennell
Journal:  Plant Mol Biol       Date:  2010-02-12       Impact factor: 4.076

Review 4.  At the cutting-edge of grape and wine biotechnology.

Authors:  Anthony R Borneman; Simon A Schmidt; Isak S Pretorius
Journal:  Trends Genet       Date:  2012-12-04       Impact factor: 11.639

5.  The grapevine expression atlas reveals a deep transcriptome shift driving the entire plant into a maturation program.

Authors:  Marianna Fasoli; Silvia Dal Santo; Sara Zenoni; Giovanni Battista Tornielli; Lorenzo Farina; Anita Zamboni; Andrea Porceddu; Luca Venturini; Manuele Bicego; Vittorio Murino; Alberto Ferrarini; Massimo Delledonne; Mario Pezzotti
Journal:  Plant Cell       Date:  2012-09-04       Impact factor: 11.277

6.  Construction and characterization of BAC libraries from major grapevine cultivars.

Authors:  A-F Adam-Blondon; A Bernole; G Faes; D Lamoureux; S Pateyron; M S Grando; M Caboche; R Velasco; B Chalhoub
Journal:  Theor Appl Genet       Date:  2005-04-16       Impact factor: 5.699

7.  Transcriptional analysis of late ripening stages of grapevine berry.

Authors:  Sabine Guillaumie; Romain Fouquet; Christian Kappel; Céline Camps; Nancy Terrier; Dominique Moncomble; Jake D Dunlevy; Christopher Davies; Paul K Boss; Serge Delrot
Journal:  BMC Plant Biol       Date:  2011-11-18       Impact factor: 4.215

8.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

9.  Berry flesh and skin ripening features in Vitis vinifera as assessed by transcriptional profiling.

Authors:  Diego Lijavetzky; Pablo Carbonell-Bejerano; Jérôme Grimplet; Gema Bravo; Pilar Flores; José Fenoll; Pilar Hellín; Juan Carlos Oliveros; José M Martínez-Zapater
Journal:  PLoS One       Date:  2012-06-29       Impact factor: 3.240

10.  VTCdb: a gene co-expression database for the crop species Vitis vinifera (grapevine).

Authors:  Darren C J Wong; Crystal Sweetman; Damian P Drew; Christopher M Ford
Journal:  BMC Genomics       Date:  2013-12-16       Impact factor: 3.969

View more
  11 in total

Review 1.  Recent advances in biotechnological studies on wild grapevines as valuable resistance sources for smart viticulture.

Authors:  Samia Daldoul; Hatem Boubakri; Mahmoud Gargouri; Ahmed Mliki
Journal:  Mol Biol Rep       Date:  2020-03-04       Impact factor: 2.316

2.  Preface: BITS2014, the annual meeting of the Italian Society of Bioinformatics.

Authors:  Angelo Facchiano; Claudia Angelini; Roberta Bosotti; Alessandro Guffanti; Anna Marabotti; Roberto Marangoni; Stefano Pascarella; Paolo Romano; Andreas Zanzoni; Manuela Helmer-Citterich
Journal:  BMC Bioinformatics       Date:  2015-06-01       Impact factor: 3.169

Review 3.  Field Guide to Plant Model Systems.

Authors:  Caren Chang; John L Bowman; Elliot M Meyerowitz
Journal:  Cell       Date:  2016-10-06       Impact factor: 41.582

4.  VESPUCCI: Exploring Patterns of Gene Expression in Grapevine.

Authors:  Marco Moretto; Paolo Sonego; Stefania Pilati; Giulia Malacarne; Laura Costantini; Lukasz Grzeskowiak; Giorgia Bagagli; Maria Stella Grando; Claudio Moser; Kristof Engelen
Journal:  Front Plant Sci       Date:  2016-05-10       Impact factor: 5.753

5.  Constructing Integrated Networks for Identifying New Secondary Metabolic Pathway Regulators in Grapevine: Recent Applications and Future Opportunities.

Authors:  Darren C J Wong; José Tomás Matus
Journal:  Front Plant Sci       Date:  2017-04-12       Impact factor: 5.753

6.  miRVIT: A Novel miRNA Database and Its Application to Uncover Vitis Responses to Flavescence dorée Infection.

Authors:  Walter Chitarra; Chiara Pagliarani; Simona Abbà; Paolo Boccacci; Giancarlo Birello; Marika Rossi; Sabrina Palmano; Cristina Marzachì; Irene Perrone; Giorgio Gambino
Journal:  Front Plant Sci       Date:  2018-07-17       Impact factor: 5.753

7.  Space and Vine Cultivar Interact to Determine the Arbuscular Mycorrhizal Fungal Community Composition.

Authors:  Álvaro López-García; José A Jurado-Rivera; Josefina Bota; Josep Cifre; Elena Baraza
Journal:  J Fungi (Basel)       Date:  2020-11-27

8.  microRNA profile of Hermetia illucens (black soldier fly) and its implications on mass rearing.

Authors:  Sarah DeRaedt; Anandi Bierman; Peter van Heusden; Cameron Richards; Alan Christoffels
Journal:  PLoS One       Date:  2022-03-17       Impact factor: 3.240

Review 9.  Current Knowledge and Computational Techniques for Grapevine Meta-Omics Analysis.

Authors:  Salvatore Alaimo; Gioacchino P Marceca; Rosalba Giugno; Alfredo Ferro; Alfredo Pulvirenti
Journal:  Front Plant Sci       Date:  2018-01-09       Impact factor: 5.753

Review 10.  Facing Climate Change: Biotechnology of Iconic Mediterranean Woody Crops.

Authors:  Carlos De Ollas; Raphaël Morillón; Vasileios Fotopoulos; Jaime Puértolas; Patrick Ollitrault; Aurelio Gómez-Cadenas; Vicent Arbona
Journal:  Front Plant Sci       Date:  2019-04-16       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.