| Literature DB >> 18387209 |
Maria Luisa Chiusano1, Nunzio D'Agostino, Alessandra Traini, Concetta Licciardello, Enrico Raimondo, Mario Aversano, Luigi Frusciante, Luigi Monti.
Abstract
BACKGROUND: Present-day '-omics' technologies produce overwhelming amounts of data which include genome sequences, information on gene expression (transcripts and proteins) and on cell metabolic status. These data represent multiple aspects of a biological system and need to be investigated as a whole to shed light on the mechanisms which underpin the system functionality. The gathering and convergence of data generated by high-throughput technologies, the effective integration of different data-sources and the analysis of the information content based on comparative approaches are key methods for meaningful biological interpretations. In the frame of the International Solanaceae Genome Project, we propose here ISOLA, an Italian SOLAnaceae genomics resource.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18387209 PMCID: PMC2323670 DOI: 10.1186/1471-2105-9-S2-S7
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Representation of the multilevel structure in ISOLA. Data sources, tools and methods of the platform are indicated.
The genome and the expression levels' under parts are included in the light and dark grey areas, respectively. Shared data are located in the gridded area. Subsidiary tools lay on the interface of the two levels. Value-added data gathered from the platform are listed and enclosed in the level to which they contribute more. Entry points for proteome and metabolome approaches are indicated.
Figure 2Snapshot of the Web-based application for navigating ISOLA. ISOLA is accessible through two different gateways.
The Genome Browser gateway let the user explore the list of the tomato BAC sequences, grouped by chromosome number, and visualize the tracks that are displayed along each BAC sequence. Each track is cross-linked to other local or external resources. Cross-references to the tomato genome annotation pages at the SOL Genomics Network are part of the ‘genome level’ too.
The Solanaceae EST database gateway let the user investigate Solanaceae transcriptomes as revealed by EST sampling. Each EST database can be queried with keywords to identify functional annotations associated to a single EST or a TC. Cross-links to the UniProt external resource are established. In case an expressed sequence affects an enzyme function, a cross-link to the corresponding KEGG metabolic pathway(s) is provided. These links permit data from proteomics and metabolomics approaches to be integrated in the existing multi-level environment. In addition, the association of the tomato ESTs to the oligo-nucleotide probes from the Affymetrix or from the TED database expands information concerning the ‘expression level’ and provides the opportunity to integrate into the platform data from expression profiling arrays.
Statistics on the EST collections.
| family | genus | sub-genus | species | total EST | nr ESTs | gene indices | TC | sEST | total transcripts | ESTs mapped | TCs mapped |
| Solanaceae | Solanum | Lycopersicum | SOLLC | 250552 | 190763 | 44759 | 17629 | 28005 | 45634 | 14286 | 975 |
| Solanaceae | Solanum | Lycopersicum | SOLPN | 8346 | 6888 | 3863 | 730 | 3140 | 3870 | 488 | 37 |
| Solanaceae | Solanum | Lycopersicum | SOLHA | 8000 | 7868 | 4101 | 907 | 3203 | 4110 | 306 | 30 |
| Solanaceae | Solanum | Lycopersicum | SOLLP | 1008 | 979 | 744 | 94 | 650 | 744 | 3 | 0 |
| Solanaceae | Solanum | SOLTU | 226805 | 206696 | 62752 | 19732 | 44138 | 63870 | 5785 | 584 | |
| Solanaceae | Solanum | SOLCH | 7752 | 7750 | 7192 | 306 | 6886 | 7192 | 139 | 10 | |
| Solanaceae | Nicotiana | TOBAC | 74940 | 67745 | 37845 | 7529 | 30578 | 38107 | 403 | 74 | |
| Solanaceae | Nicotiana | NICBE | 27010 | 24784 | 9420 | 3206 | 6315 | 9521 | 95 | 21 | |
| Solanaceae | Nicotiana | NICLS | 12448 | 11749 | 6785 | 958 | 5840 | 6798 | 40 | 6 | |
| Solanaceae | Nicotiana | NICSY | 8580 | 8425 | 7534 | 512 | 7023 | 7535 | 68 | 4 | |
| Solanaceae | Nicotiana | NICAT | 329 | 324 | 312 | 11 | 301 | 312 | 3 | 0 | |
| Solanaceae | Capsicum | CAPAN | 31089 | 28664 | 15703 | 3474 | 12262 | 15736 | 373 | 67 | |
| Solanaceae | Capsicum | CAPCH | 372 | 372 | 343 | 11 | 332 | 343 | 4 | 0 | |
| Solanaceae | Petunia | PETHY | 10670 | 10336 | 7004 | 1166 | 5842 | 7008 | 31 | 4 | |
| Rubiaceae | Coffea | COFCA | 46907 | 38308 | 16121 | 4494 | 11713 | 16207 | 20 | 10 | |
| Rubiaceae | Coffea | COFAR | 1071 | 1059 | 1007 | 42 | 965 | 1007 | 3 | 1 |
species: SOLLC: Solanum lycopersicum; SOLPN: Solanum pennellii; SOLHA: Solanum habrochaites; SOLLP: S. lycopersicum X S. pimpinellifolium; SOLTU: Solanum tuberosum; SOLCH: Solanum chacoense; TOBAC: Nicotiana tabacum; NICBE: Nicotiana benthamiana; NICLS: Nicotiana langsdorffii x Nicotiana sanderae; NICSY: Nicotiana sylvestris; NICAT: Nicotiana attenuata; PETHY: Petunia x hybrida; CAPAN: Capsicum annuum; CAPCH: Capsicum chinense; COFCA: Coffea canephora; COFAR: Coffea arabica.
nr EST: non-redundant ESTs. It is the number of EST sequences obtained after the removal of over-represented ESTs from each collection. Gene indices are created by grouping overlapping EST sequences into clusters. Each cluster corresponds to a unique gene. TCs: tentative consensus; TCs are generated from multiple sequence alignments of ESTs (assembling process). sESTs: singleton ESTs. The total transcripts number is obtained adding the TCs to the sESTs. The number of ESTs and TCs which are mapped onto the S. lycopersicum genome is included too.
Occurrence and distribution of non-coding RNA families in the tomato genome draft sequences.
| CHR number | BAC accession number | Matching region | Rfam accession number | RNA family | number of matches |
| 1 | AC171728.2 | 17763::18065 | RF00017 | SRP_euk_srch | 1 |
| 135747::135317 | RF00005 | tRNA | 1 | ||
| 1 | AC193777.1 | 99831::99901 | RF00005 | tRNA | 1 |
| 4 | AC193778.1 | 148954::149035 | RF00005 | tRNA | 1 |
| 4 | CT990489.3 | 73520:73639 | RF00086 | U27 | 1 |
| 73740:73863 | RF00016 | U14 | 1 | ||
| 4 | CU062498.5 | 60943::B1015 | RF00005 | tRNA | 2 |
| 153111::1S3181 | RF00005 | tRNA | 1 | ||
| 4 | CU074307.7 | 632:702 | RF00005 | tRNA | 1 |
| 153112: 153182 | RF00005 | tRNA | 1 | ||
| 4 | CU074307.9 | 632:702 | RF00005 | tRNA | 1 |
| 4 | CU074337.8 | 88091::88161 | RF00005 | tRNA | 1 |
| 4 | CU104691.12 | 51151:51223 | RF00005 | tRNA | 2 |
| 4 | CU179634.6 | 129509::129931 | RF00177 | SSU_rRNA_5 | 44 |
| 4 | CU222538.4 | 305::3S6 | RF00005 | tRNA | 1 |
| 4 | CU222540.3 | 26296::26383 | RF00028 | Intron_qpl | 1 |
| 4 | CU2BD045.2 | 29885::29958 | RF00005 | tRNA | 7 |
| 4 | CU302231.4 | 25551::25623 | RF00005 | tRNA | 1 |
| 4 | CU313315.3 | 39165::39244 | RF00028 | Intron_gpl | 1 |
| 4 | CU326362.1 | 32386::32465 | RF00028 | Intron_gpl | 1 |
| 85446::85520 | RF00005 | tRNA | 2 | ||
| 5 | AC194694.1 | 82814:82839 | RF00451 | mir-395 | 1 |
| 82977:83070 | RF00451 | mir-395 | 2 | ||
| 94123::94339 | RF00451 | mir-395 | 1 | ||
| 7 | AC 187539 1 | 9990::10061 | RF00005 | tRNA | 1 |
| 7 | AC187540 1 | 88984::89055 | RF00005 | tRNA | 1 |
| 8 | AP009261.1 | 23923::23995 | RF00005 | tRNA | 1 |
| 8 | AP009264.1 | 119184:119256 | RF00029 | Intron_gpll | 2 |
| 123055::123128 | RF00029 | Intron_gpll | 1 | ||
| 123711-123782 | RF00029 | Intron_gpll | 1 | ||
| 8 | AP009267.1 | 105178::105257 | RF00028 | Intron_gpl | 1 |
| 8 | AP009274.1 | 90899::90976 | RF00005 | tRNA | 1 |
| 8 | AP009279.1 | 27381::27454 | RF00005 | tRNA | 1 |
| 8 | AP008280.1 | 44680::44752 | RF00005 | tRNA | 1 |
| 8 | AP009281 1 | 43139:43211 | RF00005 | tRNA | 2 |
| 8 | AP009284.1 | 1962:2157 | RF00004 | U2 | 1 |
| 121585::121656 | RF00005 | tRNA | 1 | ||
| 8 | AP0D9287.1 | 17284::17354 | RF00005 | tRNA | 1 |
| 8 | AP009327.1 | 44492::44572 | RF00005 | tRNA | 1 |
| 8 | AP009330 1 | 141496::141567 | RF00029 | Intron_gpll | 1 |
| 142150::142223 | RF00029 | Intron_gpll | 1 | ||
| 146022:146094 | RF00029 | Intron_gpll | 2 | ||
| 8 | AP009357.1 | 44492::44572 | RF00005 | tRNA | 1 |
| 8 | AP009360.1 | 141496::141567 | RF00029 | Intron_gpll | 1 |
| 142150::142223 | RF00029 | Intron_qpll | 1 | ||
| 146022::146094 | RF00029 | Intron_qpll | 2 | ||
| 10 | AC171731.1 | 9631::9728 | RF0G003 | U1 | 1 |
| 10 | AC193781.1 | 49869::49939 | RF00005 | tRNA | 1 |
In a row are reported: the chromosome number; the BAC GenBank accession number; the start/end positions for the BAC regions covered by the RNAs; the Rfam accession number; a brief description of the RNA family; the total number of Rfam sequences per BAC region.
Figure 3The figure enumerates statistics on the functional annotation versus the UniProt, the Rfam, the Enzyme and the Gene Ontology databases. The number of enzymes involved into known metabolic pathways is reported too.