| Literature DB >> 19788737 |
Olga Plechakova1, Christine Tranchant-Dubreuil, Fabrice Benedet, Marie Couderc, Alexandra Tinaut, Véronique Viader, Petra De Block, Perla Hamon, Claudine Campa, Alexandre de Kochko, Serge Hamon, Valérie Poncet.
Abstract
BACKGROUND: In the past few years, functional genomics information has been rapidly accumulating on Rubiaceae species and especially on those belonging to the Coffea genus (coffee trees). An increasing number of expressed sequence tag (EST) data and EST- or genomic-derived microsatellite markers have been generated, together with Conserved Ortholog Set (COS) markers. This considerably facilitates comparative genomics or map-based genetic studies through the common use of orthologous loci across different species. Similar genomic information is available for e.g. tomato or potato, members of the Solanaceae family. Since both Rubiaceae and Solanaceae belong to the Euasterids I (lamiids) integration of information on genetic markers would be possible and lead to more efficient analyses and discovery of key loci involved in important traits such as fruit development, quality, and maturation, or adaptation. Our goal was to develop a comprehensive web data source for integrated information on validated orthologous markers in Rubiaceae. DESCRIPTION: MoccaDB is an online MySQL-PHP driven relational database that houses annotated and/or mapped microsatellite markers in Rubiaceae. In its current release, the database stores 638 markers that have been defined on 259 ESTs and 379 genomic sequences. Marker information was retrieved from 11 published works, and completed with original data on 132 microsatellite markers validated in our laboratory. DNA sequences were derived from three Coffea species/hybrids. Microsatellite markers were checked for similarity, in vitro tested for cross-amplification and diversity/polymorphism status in up to 38 Rubiaceae species belonging to the Cinchonoideae and Rubioideae subfamilies. Functional annotation was provided and some markers associated with described metabolic pathways were also integrated. Users can search the database for marker, sequence, map or diversity information through multi-option query forms. The retrieved data can be browsed and downloaded, along with protocols used, using a standard web browser. MoccaDB also integrates bioinformatics tools (CMap viewer and local BLAST) and hyperlinks to related external data sources (NCBI GenBank and PubMed, SOL Genomic Network database).Entities:
Mesh:
Substances:
Year: 2009 PMID: 19788737 PMCID: PMC2760556 DOI: 10.1186/1471-2229-9-123
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Microsatellite markers, Sequence sources and original data
| CofEST-SSR | EST | 9 | (Bhat et al., 2005) | [ | 11 | |
| ES | EST | 99 | (Poncet et al., 2006) | [ | 7 | |
| 21 Rubiaceae sp. | ||||||
| SSR | EST | 10 | (Geromel et al., 2006) | [ | ||
| DCM/CofEST-SSR | EST | 9 | (Aggarwal et al., 2007) | [ | 11 | |
| Ssr | EST | 132 | (Crouzillat et al., unpublished data) | |||
| 3 | ||||||
| M | Genomic | 10 | (Combes et al., 2000) | [ | ||
| M | Genomic | 17 | (Coulibaly et al., 2003) | [ | 2 | |
| CM | Genomic | 9 | (Baruah et al., 2003) | [ | 11 | |
| CFGA | Genomic | 34 | (Moncada et al., 2004) | [ | ||
| M | Genomic | 77 | (Poncet et al., 2004) | [ | 6 | |
| DL | Genomic | 8 | (Leroy et al., 2005) | [ | ||
| M | Genomic | 213 | (Poncet et al., 2007) | [ | 3 | |
| Genomic | 9 | (Lashermes et al., unpublished data) | ||||
Figure 1Overview of the MoccaDB application. MoccaDB integrates different data types, which are interconnected and linked to external resources and bioinformatic tools (CMap and BLAST).
Figure 2Screenshots of a MoccaDB marker data pages. (A) Marker detail page resulting from searching for the M257 marker. The result page provides general information about the SSR marker (e.g. repeat motif, repeat number) as well as the corresponding genomic sequence with database cross-references to NCBI and SOL (if available) and a hyperlink to the MoccaDB Sequence detail Page (B). The mapping information section provides information about marker locations on the genetic map, which links to the CMap viewer (C) and the CMap feature detail popup (D). The PCR information section may also provide details on marker assay conditions (forward and reverse primers, melting temperatures, predicted size of PCR product). The reference section gives publications related to this marker with links to Pubmed.
Figure 3Screenshots of sequence/putative function MoccaDB result pages. Sequences can be searched by name, putative annotation. The search can be restricted using different criteria such as sequence origin or marker type. The group of screenshots shows an example sequence search using the keyword 'transferase' to find out what sequences have been "putatively" annotated with this term. (A) The result page displays sequences and related data resulting from searching for the annotation term 'transferase'. The tabular text summary lists all the sequences found, each line in the table presenting the sequence name and related information (sequence type and origin, marker name and BLAST annotation). The marker name and sequence name are respectively linked to the marker detail page (C) and to the sequence detail page (B). The user can select sequences and export them in FASTA format. The sequence detail page (B) displays all the associated information for that sequence which includes general information (e.g.: sequence type, DNA bank), annotation information, marker information and publications related to that sequence. Hyperlinks give access to associated data within MOCCAdb such as markers, DNA bank or link to external resources such as SOL or NCBI.
Figure 4Schematic phylogenetic tree adapted from [21]and number of successfully amplified/tested markers (percentage) observed for each species. The information was extracted from MoccaDB database. Names of Coffea species follow [22,23].
Transferability to Rubiaceae species: efficiency of cross amplification of Coffea markers in other Rubiaceae genus (Nb species tested when over 1).
| | 32% | 25 C | |||
| | 94% | Up to 207 C, up to 49 A | | 24% | 25 C |
| | 82% | 9 A | | 16% | 25 C |
| | 68% | 25 C | | 16% | 25 C |
| | 56% | 25 C | | 4% | 25 C |
| | 48% | 25 C | |||
| | 40% | 25 C | | 12% | 25 C |
| | 36% | 25 C | | 10% | 25 C |
| | 36% | 25 C | | 6% | 25 C |
Markers were developed on C. canephora (C) or C. arabica (A) sequences.
Figure 5Screenshots of transferability/diversity MoccaDB result pages. (A) Marker search gives access to a synthetic results table with basic transferability data. Users can select markers and export related data on the amplified species and the corresponding allele size ranges (in bp) as Excel files. Hyperlinks on this result page gives access to the marker detail page (B), where this information can be directly visualized for each specific marker. Using the hyperlinks "more details", users can access additional details (C) on the transferability efficiency (presence or not of amplification products, quality of the amplification) and on the polymorphism status (number and sizes of the alleles, PIC values for each study). These diversity data can also be retrieved through the diversity search page and query filtered according to the species studied.