Dae-Won Kim, Tae-Sung Jung, Seong-Hyeuk Nam, Hyuk-Ryul Kwon, Aeri Kim, Sung-Hwa Chae, Sang-Haeng Choi, Dong-Wook Kim, Ryong Nam Kim, Hong-Seog Park.
Abstract
BACKGROUND: Allium sativum, commonly known as garlic, is a species in the onion genus (Allium), a large and diverse genus containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary and medicinal purposes and for its health benefits. Interest in garlic is currently increasing because of its nutritional and pharmaceutical value, including its reported effects on high blood pressure, cholesterol, atherosclerosis and cancer. Nevertheless, no comprehensive database of garlic Expressed Sequence Tags (ESTs) is available for gene discovery and future genome annotation efforts. We therefore developed a new garlic database and applications to enable comprehensive analysis of garlic gene expression.
DESCRIPTION: GarlicESTdb is an integrated database and mining tool for large-scale garlic (Allium sativum) EST sequencing. A total of 21,595 ESTs collected from an in-house cDNA library were used to construct the database. The analysis pipeline is an automated system written in Java and consists of the following components: automatic preprocessing of EST reads, assembly of raw sequences, annotation of the assembled sequences, storage of the analyzed information in MySQL databases, and graphic display of all processed data. A web application for browsing and querying the database was implemented with the latest J2EE (Java 2 Platform Enterprise Edition) software technology (JSP/EJB/JavaServlet). The AJAX framework was also used in part, for creating dynamic web pages on the client side and for mapping annotated enzymes to KEGG pathways. The online resources, such as putative annotations and single nucleotide polymorphism (SNP) and tandem repeat data sets, can be searched by text, explored on the website, searched using BLAST, and downloaded.
To achieve more significant BLAST results, a curation system was introduced with which biologists can easily edit best-hit annotation information for others to view. The GarlicESTdb web application is freely available at http://garlicdb.kribb.re.kr.
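The component order described above (preprocessing, assembly, storage) can be sketched as a small chain of functions. This is only an illustrative sketch in Python, not the authors' Java pipeline: the function names, the `MIN_LENGTH` cutoff and the `unigene` table are hypothetical, grouping identical reads is a toy stand-in for a real EST assembler, `sqlite3` stands in for MySQL, and the annotation step is omitted because it depends on external BLAST searches.

```python
import sqlite3

MIN_LENGTH = 10  # hypothetical cutoff; the source does not state one


def preprocess(reads):
    """Strip a trailing poly-A tail and drop reads shorter than MIN_LENGTH."""
    cleaned = {}
    for read_id, seq in reads.items():
        seq = seq.upper().rstrip("A")
        if len(seq) >= MIN_LENGTH:
            cleaned[read_id] = seq
    return cleaned


def cluster(cleaned):
    """Toy stand-in for assembly: group reads that share an identical sequence."""
    groups = {}
    for read_id, seq in cleaned.items():
        groups.setdefault(seq, []).append(read_id)
    return groups


def store(groups, conn):
    """Write one row per group; sqlite3 stands in for MySQL in this sketch."""
    conn.execute(
        "CREATE TABLE unigene (id INTEGER PRIMARY KEY, seq TEXT, n_reads INTEGER)"
    )
    rows = [(seq, len(ids)) for seq, ids in sorted(groups.items())]
    conn.executemany("INSERT INTO unigene (seq, n_reads) VALUES (?, ?)", rows)
    conn.commit()


def run_pipeline(reads):
    """Run the stages in order and return the database connection."""
    conn = sqlite3.connect(":memory:")
    store(cluster(preprocess(reads)), conn)
    return conn
```

Calling `run_pipeline` with a dictionary that maps read identifiers to sequences returns a connection whose `unigene` table can then be queried, which loosely mirrors how a web front end would read the stored results.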
Year: 2009 PMID: 19445732 PMCID: PMC2689220 DOI: 10.1186/1471-2229-9-61
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Figure 1. Workflow schema of the pipeline. The pipeline consists of five steps within the red-dotted rectangle: configuration, cleansing, clustering and assembling, annotation, and data mining & visualization.
Figure 2. Snapshots of the GarlicESTdb web application. The panels show examples of the general functions. (a) An example of a pre-processing report showing sequence cleaning, assembly status, contig view and trace view. (b) An example of a functional annotation report showing annotation statistics, the annotation report, each annotated report (BLASTN, BLASTX, TBLASTX, etc.), detailed information on the consensus sequence, results of a BLAST search, and data download as AB1 (contigs, singletons) and Excel files. (c) An example of secondary mining information showing electronic northern expression information for all garlic libraries and pathway information with enzyme annotations mapped from KEGG pathways. (d) An example of the personalized curation service. After free registration, users can modify the annotated results themselves. For details, please refer to the user's manual on the GarlicESTdb webpage.
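An electronic northern, as mentioned for panel (c), generally estimates expression by counting how many ESTs from each library fall into each contig. The sketch below illustrates that general idea in Python; it is not taken from GarlicESTdb. The function name, the input mappings and the normalization by library size are assumptions made for illustration, and the source does not describe how its values are calculated.

```python
from collections import Counter, defaultdict


def electronic_northern(est_to_contig, est_to_library):
    """Return, per contig, the fraction of each library's ESTs assigned to it.

    est_to_contig: maps an EST id to the contig it was assembled into.
    est_to_library: maps an EST id to the cDNA library it came from.
    """
    # Library size = number of ESTs sequenced from that library.
    library_sizes = Counter(est_to_library.values())

    # Raw counts of ESTs per (contig, library) pair.
    counts = defaultdict(Counter)
    for est_id, contig in est_to_contig.items():
        counts[contig][est_to_library[est_id]] += 1

    # Normalize by library size so that libraries of different depth are comparable.
    return {
        contig: {lib: n / library_sizes[lib] for lib, n in per_lib.items()}
        for contig, per_lib in counts.items()
    }
```

Dividing by library size is one common design choice; raw counts or counts per fixed number of ESTs would serve equally well for a simple comparison between libraries.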