| Literature DB >> 26286194 |
Philip V Toukach1, Ksenia S Egorova2.
Abstract
The Carbohydrate Structure Databases (CSDBs, http://csdb.glycoscience.ru) store structural, bibliographic, taxonomic, NMR spectroscopic, and other data on natural carbohydrates and their derivatives published in the scientific literature. The CSDB project was launched in 2005 for bacterial saccharides (as BCSDB). Currently, it includes two parts, the Bacterial CSDB and the Plant&Fungal CSDB. In March 2015, these databases were merged to the single CSDB. The combined CSDB includes information on bacterial and archaeal glycans and derivatives (the coverage is close to complete), as well as on plant and fungal glycans and glycoconjugates (almost all structures published up to 1998). CSDB is regularly updated via manual expert annotation of original publications. Both newly annotated data and data imported from other databases are manually curated. The CSDB data are exportable in a number of modern formats, such as GlycoRDF. CSDB provides additional services for simulation of (1)H, (13)C and 2D NMR spectra of saccharides, NMR-based structure prediction, glycan-based taxon clustering and other.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26286194 PMCID: PMC4702937 DOI: 10.1093/nar/gkv840
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.CSDB coverage. Number of organisms, publications, structures and NMR spectra assigned to corresponding taxonomic groups. Taxonomy is designated by the color code.
Figure 2.Query routes in CSDB. Six search modes are provided: bibliographical, structural, compositional, taxonomical, using NMR signals and CSDB IDs.
Search modes in CSDB
| Search mode | Query term | Example | Result | Note |
|---|---|---|---|---|
| Record, structure, publication or organism IDs | Record: 1–10,12 | List of records, structures, publications or organisms | Only record IDs are persistent; other IDs may change upon CSDB updates | |
| Primary structure (fragment / complete) | b?Qui?3N(1-?)[PE P(2-6)]?DGalpA | List of compounds + list of publications for each compound | Structure queries may include underdetermined components; user may specify molecule type, compound class, taxonomical domain and presence of NMR data; search of aglycons and glycan sequences by systematic or trivial names is supported | |
| Composition (partial / complete) | 1 HEX + 2 Xyl + 1 Man + … | List of compounds + list of publications for each compound | User may restrict molecule type, compound class and taxonomical domain | |
| Genus, species, strain / serogroup / subspecies, NCBI TaxID | List of organisms + list of compounds for each organism | Taxon indices are available; user may restrict domain; search among host organisms is supported | ||
| Authors, terms from title or abstract, keywords, journal / book, year, volume, pages | List of publications + list of compounds for each publication | User may restrict taxonomical domains and select papers with structure elucidation; author and journal indices are available; search terms may be combined using logical operations and wildcards | ||
| 1H or 13C chemical shifts | 13C: 18.0 49.5 | List of compounds and their NMR spectra + list of publications for each compound | Search for all signals within a single residue is specified by default |
Modes of structure input
| Input mode | Description | Note |
|---|---|---|
| For visual structure building; requires knowledge of general carbohydrate nomenclature | This mode does not support some rarely-occurring queries, which can be processed by the CSDB search engine | |
| Widespread carbohydrate structures can be selected by common names | Structures are visualized in a pseudographic format | |
| Carbohydrate structures are constructed and viewed in a graphic from | GlycanBuilder was developed by Damerell | |
| Structures may be pasted in the GlycoCT condensed format and converted into the CSDB linear encoding | GlycoCT was developed by Herget | |
| A previous structural query may be copied to the search term field and edited manually | Available only if there has already been a structural request within the session | |
| Structures are entered into the search field manually | Requires knowledge of the CSDB linear encoding rules (see Supplementary Figure S2) ( |
Figure 3.Exemplary complex query. See explanations in the text.
Figure 4.1H and 13C NMR spectra for a model saccharide simulated in water solution in the extreme quality mode. Partial output is shown. The red box encloses the structure of interest. The red arrow reflects that clicking on the cell displays the corresponding reference data.