| Literature DB >> 15608249 |
Angelo Boccia1, Mauro Petrillo, Diego di Bernardo, Alessandro Guffanti, Flavio Mignone, Stefano Confalonieri, Lucilla Luzi, Graziano Pesole, Giovanni Paolella, Andrea Ballabio, Sandro Banfi.
Abstract
The identification and study of evolutionarily conserved genomic sequences that surround disease-related genes is a valuable tool to gain insight into the functional role of these genes and to better elucidate the pathogenetic mechanisms of disease. We created the DG-CST (Disease Gene Conserved Sequence Tags) database for the identification and detailed annotation of human-mouse conserved genomic sequences that are localized within or in the vicinity of human disease-related genes. CSTs are defined as sequences that show at least 70% identity between human and mouse over a length of at least 100 bp. The database contains CST data relative to over 1088 genes responsible for monogenetic human genetic diseases or involved in the susceptibility to multifactorial/polygenic diseases. DG-CST is accessible via the internet at http://dgcst.ceinge.unina.it/ and may be searched using both simple and complex queries. A graphic browser allows direct visualization of the CSTs and related annotations within the context of the relative gene and its transcripts.Entities:
Mesh:
Year: 2005 PMID: 15608249 PMCID: PMC539965 DOI: 10.1093/nar/gki011
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Classification of human CSTs present in DG-CST
| CST type | Number | % | Length (bp) | Length (%) |
|---|---|---|---|---|
| Exonic | 21 139 | 31.8 | 5 247 362 | 34.9 |
| Intronic | 18 390 | 27.7 | 3 832 169 | 25.5 |
| Intergenic | 26 966 | 40.5 | 5 962 769 | 39.6 |
| Total | 66 495 | 100 | 15 042 300 | 100 |
Figure 1The DG-CST database: examples of query interfaces. (A) The DG-CST home page. The quick search boxes are highlighted in color: the CST ID box in green; the gene box in black; and the BLAST box in red. (B) The list of all analyzed genes obtained following the link on the home page. (C) The DNA-based feature search page. (D) The advanced CST search page, where all annotated features may be used in combination or alone to query the database. (E) The gene-based CST search page, which allows a more detailed gene search.
Figure 2The DG-CST database: data display. (A) Example of a gene entry (A2M) and the related CST list. (B) Graphical representation of the selected gene, accessible via the map link in (A). On mouse over, details of CST #250083 are displayed as an example. In this representation, CSTs are color-coded based on the number of matches with human ESTs. (C) Example of a CST entry with all annotations and the list of the corresponding CSTs conserved in other species. CST details are accessible either from the CST list of the gene page (A) or by clicking on the interactive graphical browser in (B). (D) Graphical representation of the sequence alignment of the orthologous CSTs shown in (C).