Alexis Dereeper, Stéphanie Bocs, Mathieu Rouard, Valentin Guignon, Sébastien Ravel, Christine Tranchant-Dubreuil, Valérie Poncet, Olivier Garsmeur, Philippe Lashermes, Gaëtan Droc.
Abstract
The whole genome sequence of Coffea canephora, the perennial diploid species known as Robusta, was recently released. In the context of the C. canephora genome sequencing project and to support post-genomics efforts, we developed the Coffee Genome Hub (http://coffee-genome.org/), an integrative genome information system that allows centralized access to genomics and genetics data and analysis tools to facilitate translational and applied research in coffee. We provide the complete genome sequence of C. canephora along with gene structure, gene product information, metabolism, gene families, transcriptomics, syntenic blocks, genetic markers and genetic maps. The hub relies on generic software (e.g. GMOD tools) for easy querying, visualizing and downloading of research data. It includes a Genome Browser enhanced by a Community Annotation System, enabling the improvement of automatic gene annotation through an annotation editor. In addition, the hub aims to develop interoperability with other existing South Green tools that manage coffee data (phylogenomics resources, SNPs) and/or support data analyses with the Galaxy workflow manager.
Year: 2014 PMID: 25392413 PMCID: PMC4383925 DOI: 10.1093/nar/gku1108
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The Hub content. Number of entries by data types as of September 2014
| Data type | Number of entries |
|---|---|
| Genome | 1 |
| Genes | 25 574 |
| Similarities with proteome of related species ( | 155 283 |
| Similarities with other Uniprot proteins | 7 498 085 |
| Similarities with Coffee ESTs | 524 675 |
| Transposable Elements | 448 845 |
| BAC-end sequences | 68 542 (BstYI) + 68 928 (HindIII) |
| SSRs | 4 949 134 |
| SNPs | 386 560 |
| Anchored genetic markers | 2564 |
| Expression studies (RNASeq samples) | 10 |
| Expression studies (microarray samples) | 18 |
| Gene families | 4543 |
| Metabolic Pathways | 330 |
| Synteny relationships (number of syntenic segments) | 566 (Coffee–Coffee) |
| | 960 (Coffee–Grape) |
| | 1409 (Coffee–Tomato) |
Figure 1. Overview of the Coffee Genome Hub. (A) A gene search is performed with the InterPro family identifier of SAM dependent carboxyl methyltransferase (IPR005299). The result page returns a list of genes with a graphical display of their positions on the chromosomes. (B) The gene report summarizes all the data available for a gene and links to additional resources: (C) Gene family: here, the distribution of the SAM Dependent Carboxyl Methyltransferase gene family (GP000195) of coffee illustrates its abundance in plants; (D) JBrowse centered on the region of the selected gene (±10 kb); and (E) pathway tools (e.g. caffeine biosynthesis).
Figure 2. Transcriptomics data exploration using the Coffee Genome Hub. (A) JBrowse displays alignments of RNA-Seq reads to the genome and shows, for each gene, a bar chart of RPKM expression values. (B) Heatmap representation of expression values for a user-defined list of genes. (C) Differential expression values (log2 ratio, P-value) can be searched by comparing samples/conditions, and the results can then be intersected between studies.
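The RPKM and log2 ratio values mentioned in Figure 2 can be illustrated with a minimal Python sketch. This is not the hub's own code: the function names, the pseudocount and the example numbers are assumptions made for illustration, and only the standard RPKM definition (reads per kilobase of transcript per million mapped reads) is relied on.

```python
import math


def rpkm(read_count, gene_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    if gene_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("gene length and library size must be positive")
    # 1e9 = 1e3 (bases per kilobase) * 1e6 (reads per million)
    return read_count * 1e9 / (gene_length_bp * total_mapped_reads)


def log2_ratio(value_a, value_b, pseudocount=1.0):
    """log2 fold change of condition A over condition B.

    The pseudocount is an illustrative choice that avoids division by
    zero for unexpressed genes; it is not taken from the source.
    """
    return math.log2((value_a + pseudocount) / (value_b + pseudocount))


if __name__ == "__main__":
    # Hypothetical gene: 500 reads, 2 kb long, 10 million mapped reads.
    print(rpkm(500, 2000, 10_000_000))
    # Hypothetical expression values in two conditions.
    print(log2_ratio(7.0, 1.0))
```

A positive log2 ratio indicates higher expression in the first condition, a negative one higher expression in the second.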
Figure 3. Management of SNPs in the Coffee Genome Hub. (A) Users can retrieve polymorphic positions based on a subset of genotypes and a subset of genes. The database outputs SNPs together with annotations, minor allele frequency and genotypic data. (B) Connection with JBrowse allows visualizing and browsing the genomic location of the selected SNP. (C) Resulting SNPs can be sent either to a tool that calculates and displays the distribution of SNP density along the genome, or to a SNP-based distance tree analysis.
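Two of the quantities named in Figure 3, minor allele frequency and SNP density along the genome, can be sketched in a few lines of Python. This is an illustrative sketch only, not the hub's implementation: the genotype encoding (two-letter strings such as "AG"), the fixed window size and the chromosome names are all assumptions.

```python
from collections import Counter


def minor_allele_frequency(genotypes):
    """Frequency of the second most common allele at one SNP.

    `genotypes` is a list of diploid calls such as "AA" or "AG";
    None or non-ACGT characters are treated as missing data.
    """
    counts = Counter()
    for call in genotypes:
        if call is None:
            continue
        for allele in call:
            if allele in "ACGT":
                counts[allele] += 1
    total = sum(counts.values())
    if total == 0 or len(counts) < 2:
        return 0.0  # no data, or monomorphic in this subset
    return counts.most_common(2)[1][1] / total


def snp_density(snps, window_size):
    """Count SNPs per fixed-size window.

    `snps` is an iterable of (chromosome, 1-based position) pairs.
    Returns a dict keyed by (chromosome, window index).
    """
    density = Counter()
    for chrom, pos in snps:
        density[(chrom, (pos - 1) // window_size)] += 1
    return dict(density)


if __name__ == "__main__":
    # Hypothetical genotype calls for four accessions at one SNP.
    print(minor_allele_frequency(["AA", "AA", "AG", "AA"]))
    # Hypothetical SNP positions, counted in 100 kb windows.
    print(snp_density([("chr1", 1), ("chr1", 100001), ("chr2", 5)], 100_000))
```

Restricting the genotype list to a subset of accessions changes the minor allele frequency, which is why the value depends on the genotypes selected in panel A.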