| Literature DB >> 17142232 |
Nunzio D'Agostino1, Mario Aversano, Luigi Frusciante, Maria Luisa Chiusano.
Abstract
TomatEST is a secondary database integrating expressed sequence tag (EST)/cDNA sequence information from different libraries of multiple tomato species. Redundant EST collections from each species are organized into clusters (gene indices). A cluster consists of one or multiple contigs. Multiple contigs in a cluster represent alternatively transcribed forms of a gene. The set of stand-alone EST sequences (singletons) and contigs, representing all the computationally defined 'Transcript Indices', are annotated according to similarity versus protein and RNA family databases. Sequence function description is integrated with the Gene Ontologies and the Enzyme Commission identifiers for a standard classification of gene products and for the mapping of the expressed sequences onto metabolic pathways. Information on the origin of the ESTs, on their structural features, on clusters and contigs, as well as on functional annotations are accessible via a user-friendly web interface. Specific facilities in the database allow Transcript Indices from a query be automatically classified in Enzyme classes and in metabolic pathways. The 'on the fly' mapping onto the metabolic maps is integrated in the analytical tools. The TomatEST database website is freely available at http://biosrv.cab.unina.it/tomatestdb.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17142232 PMCID: PMC1669777 DOI: 10.1093/nar/gkl921
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Database organization.
Current status of the TomatEST database
| Source | ESTs | Libraries | Gene indices | Transcript indices | Contigs | Singletons |
|---|---|---|---|---|---|---|
| 200 438 | 95 | 42 261 | 43 370 | 16 888 | 26 482 | |
| 8346 | 2 | 4349 | 4355 | 741 | 3614 | |
| 8000 | 2 | 4263 | 4309 | 1088 | 3221 | |
| 1008 | 6 | 746 | 746 | 96 | 650 | |
| All | 217 792 | 105 | 51 619 | 52 780 | 18 813 | 33 967 |
Figure 2Snapshots of the TomatEST web interface. The panels show examples resulting from a general query on S.pennellii. (A) Shows an example of the tree menu listing all the metabolic enzymes annotated for the species. The node corresponding to chalcone isomerase (EC 5.1.1.6) is expanded showing the transcript associated with the enzyme and the metabolic pathway which include the enzyme. (B) Shows an example of the tree menu listing all the metabolic pathways associated with S.pennellii. The node corresponding to ‘Stilbene, coumarine and lignine biosynthesis’ is expanded. (C) Shows the pathway schema where seven enzymes highlighted in red are mapped.
Contig distribution per cluster in the tomato species
| Cluster | Contigs per cluster | Cluster | Contigs per cluster | Cluster | Contigs per cluster | Cluster | Contigs per cluster |
|---|---|---|---|---|---|---|---|
| 96 | 1 | 730 | 1 | 1025 | 1 | 15 133 | 1 |
| — | — | 4 | 2 | 10 | 2 | 515 | 2 |
| — | — | 1 | 3 | 2 | 3 | 66 | 3 |
| — | — | — | — | 1 | 4 | 19 | 4 |
| — | — | — | — | 1 | 6 | 9 | 5 |
| — | — | — | — | 1 | 9 | 5 | 6 |
| — | — | — | — | 1 | 18 | 2 | 7 |
| — | — | — | — | — | — | 1 | 8 |
| — | — | — | — | — | — | 2 | 9 |
| — | — | — | — | — | — | 1 | 10 |
| — | — | — | — | — | — | 2 | 11 |
| — | — | — | — | — | — | 1 | 13 |
| — | — | — | — | — | — | 1 | 19 |
| — | — | — | — | — | — | 1 | 23 |
| — | — | — | — | — | — | 1 | 249 |