Paolo Romano, Assunta Manniello, Ottavia Aresu, Massimiliano Armento, Michela Cesaro, Barbara Parodi.
Abstract
The Cell Line Data Base (CLDB) is a well-known reference information source on human and animal cell lines, including information on more than 6000 cell lines. Main biological features are coded according to controlled vocabularies derived from international lists and taxonomies. HyperCLDB (http://bioinformatics.istge.it/hypercldb/) is a hypertext version of CLDB that improves data accessibility by also allowing information retrieval through web spiders. Access to HyperCLDB is provided through indexes of biological characteristics, and navigation within the hypertext is supported by many internal links. HyperCLDB also includes links to external resources. Recently, interest has arisen in a reference nomenclature for cell lines, and CLDB has been seen as an authoritative system. Furthermore, to overcome the cell line misidentification problem, molecular authentication methods, such as fingerprinting, single-locus short tandem repeat (STR) profiles and single nucleotide polymorphism validation, have been proposed. Since these data are distributed, a reference portal on authentication of human cell lines is needed. We present here the architecture and contents of CLDB, its recent enhancements and perspectives. We also present a new related database, the Cell Line Integrated Molecular Authentication (CLIMA) database (http://bioinformatics.istge.it/clima/), which allows authentication data to be linked to actual cell lines.
Year: 2008 PMID: 18927105 PMCID: PMC2686526 DOI: 10.1093/nar/gkn730
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1. CLDB database schema. Cells_tab holds the features that are not encoded, together with textual information. Vocabularies are stored in reference tables that are linked from cells_tab by their unique ids. Join tables link catalogs and bibliographic references to cell lines, since the same line can appear in many catalogs and papers. The same applies to the viruses table, since the same line can be transfected by more than one virus.
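The layout described in the Figure 1 caption can be illustrated with a minimal sketch, which is not the actual CLDB schema. Only the table name cells_tab comes from the caption; the species vocabulary table, the catalogs table, the cells_catalogs join table, all column names and the sample rows are hypothetical placeholders chosen for illustration.

```python
import sqlite3

# Only cells_tab is named in the caption; every other table, column and row
# below is a hypothetical placeholder.
conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Reference (vocabulary) table, linked from cells_tab by its unique id.
CREATE TABLE species (id INTEGER PRIMARY KEY, name TEXT NOT NULL);

-- Main table: free-text features plus ids pointing at vocabularies.
CREATE TABLE cells_tab (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL,
    notes TEXT,
    species_id INTEGER REFERENCES species(id)
);

CREATE TABLE catalogs (id INTEGER PRIMARY KEY, name TEXT NOT NULL);

-- Join table: one cell line can be listed in many catalogs.
CREATE TABLE cells_catalogs (
    cell_id INTEGER REFERENCES cells_tab(id),
    catalog_id INTEGER REFERENCES catalogs(id),
    PRIMARY KEY (cell_id, catalog_id)
);
""")

conn.execute("INSERT INTO species VALUES (1, 'Homo sapiens')")
conn.execute("INSERT INTO cells_tab VALUES (1, 'example line', 'free text', 1)")
conn.executemany("INSERT INTO catalogs VALUES (?, ?)",
                 [(1, "Catalog A"), (2, "Catalog B")])
conn.executemany("INSERT INTO cells_catalogs VALUES (?, ?)", [(1, 1), (1, 2)])


def catalogs_for(conn, cell_name):
    """Return the names of all catalogs that list the given cell line."""
    rows = conn.execute(
        """
        SELECT cat.name
        FROM cells_tab AS c
        JOIN cells_catalogs AS cc ON cc.cell_id = c.id
        JOIN catalogs AS cat ON cat.id = cc.catalog_id
        WHERE c.name = ?
        ORDER BY cat.name
        """,
        (cell_name,),
    ).fetchall()
    return [row[0] for row in rows]
```

Calling catalogs_for(conn, "example line") walks the join table and returns both placeholder catalogs. A join table for bibliographic references or viruses would presumably follow the same pattern.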
Figure 2. CLIMA database schema. Actual data tables (on the left) include the clima_cell_lines table, which stores the names of cell lines for which a molecular characterization exists, and one table for each dataset that includes STR profiles. Metadata tables (on the right) include tables for the description of datasets, kits and related loci, and bibliographic information.
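The split between data tables and metadata tables in the Figure 2 caption can be sketched in the same way. This is an illustrative assumption rather than the real CLIMA schema: only clima_cell_lines is named in the caption, while the datasets table, its profile_table column, the str_dataset_1 table, the locus labels and the allele values are hypothetical placeholders.

```python
import sqlite3

# Only clima_cell_lines is named in the caption; all other names and all
# sample values are hypothetical placeholders.
conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Data table: names of cell lines that have a molecular characterization.
CREATE TABLE clima_cell_lines (id INTEGER PRIMARY KEY, name TEXT NOT NULL);

-- Metadata table: one row per dataset, naming the table with its profiles.
CREATE TABLE datasets (
    id INTEGER PRIMARY KEY,
    description TEXT,
    profile_table TEXT NOT NULL
);

-- Data table: one such table per dataset, one row per (cell line, locus).
CREATE TABLE str_dataset_1 (
    cell_line_id INTEGER REFERENCES clima_cell_lines(id),
    locus TEXT NOT NULL,
    alleles TEXT NOT NULL
);
""")

conn.execute("INSERT INTO clima_cell_lines VALUES (1, 'example line')")
conn.execute("INSERT INTO datasets VALUES (1, 'placeholder dataset', 'str_dataset_1')")
conn.executemany("INSERT INTO str_dataset_1 VALUES (?, ?, ?)",
                 [(1, "locus_1", "7,9"), (1, "locus_2", "11")])


def profile_for(conn, dataset_id, cell_name):
    """Return {locus: alleles} for a cell line in the given dataset."""
    row = conn.execute(
        "SELECT profile_table FROM datasets WHERE id = ?", (dataset_id,)
    ).fetchone()
    if row is None:
        return {}
    table = row[0]
    # Table names cannot be bound as SQL parameters, so accept plain
    # identifiers only before interpolating the name into the query.
    if not table.isidentifier():
        raise ValueError(f"unexpected table name: {table!r}")
    rows = conn.execute(
        f"""
        SELECT p.locus, p.alleles
        FROM {table} AS p
        JOIN clima_cell_lines AS c ON c.id = p.cell_line_id
        WHERE c.name = ?
        """,
        (cell_name,),
    ).fetchall()
    return dict(rows)
```

Keeping the profile table name in the datasets metadata lets a lookup pick the right per-dataset table at query time, which is one plausible reading of the "one table for each dataset" design in the caption.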