| Literature DB >> 16351742 |
Barbara Lazzari1, Andrea Caprera, Alberto Vecchietti, Alessandra Stella, Luciano Milanesi, Carlo Pozzi.
Abstract
BACKGROUND: The ESTree db http://www.itb.cnr.it/estree/ represents a collection of Prunus persica expressed sequenced tags (ESTs) and is intended as a resource for peach functional genomics. A total of 6,155 successful EST sequences were obtained from four in-house prepared cDNA libraries from Prunus persica mesocarps at different developmental stages. Another 12,475 peach EST sequences were downloaded from public databases and added to the ESTree db. An automated pipeline was prepared to process EST sequences using public software integrated by in-house developed Perl scripts and data were collected in a MySQL database. A php-based web interface was developed to query the database.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16351742 PMCID: PMC1866392 DOI: 10.1186/1471-2105-6-S4-S16
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The ESTree db pipeline. Data flow in the ESTree db pipeline. Accessory in-house developed Perl programs are not shown in the scheme.
Statistics on sequence analysis. Data are derived from the outputs of the programs invoked by the pipeline. (1)Each SNP report contains data on one or more putative SNP sites. (2)The unigene dataset encompasses all the singlets plus the longest sequence of each contig. (3)Annotation threshold: E-value < 1e-10.
| Sequence Number | Sequence % | |
| Total number of sequences | 18,630 | |
| Average sequence base count | 544.52 | |
| Number of singletons | 6,891 | 36.99 |
| Number of contigs | 2,328 | |
| Number of sequences in contigs | 11,739 | 63.01 |
| Average number of sequences in each contig | 5.04 | |
| Number of SNP reports (1) | 166 | |
| Number of putative unigenes (2) | 9,219 | 49.48 |
| Annotated sequences (NCBI blast) (3) | 13,114 | 70.39 |
| Annotated sequences (GO blast) (3) | 9,056 | 48.61 |
| Number of enzyme sequences | 661 | 3.55 |
| Number of sequences linked to KEGG metabolic pathways | 282 | 1.51 |
Figure 2The ESTree db database structure. Main tables of the ESTree MySQL db. The database structure is subject to frequent changes, due to the implementation of the database features.
Figure 3The ESTree db Contig display page. An example of the ESTree db contig graphical display. The bar colours reflect the developmental stage of the sequence clone library of origin. The same colours recur in the AutoSNP output and in the library details. The graphical display is dynamically created by the php web interface.