| Literature DB >> 31790229 |
Philip V Toukach1,2, Ksenia S Egorova1.
Abstract
The CSDB Linear notation for carbohydrate sequences utilized in the Carbohydrate Structure Database (CSDB) has been improved to meet modern requirements in glycoinformatics. The new features include: the possibility to combine repeating and nonrepeating moieties in one structure; support of carbon-carbon bonds; and usage of SMILES encodings for unambiguous chemical description of glycan structures, including aglycons and atypical components. The new capabilities of CSDB Linear, together with the older ones, allow efficient detection of errors in CSDB and, at the same time, ensure the absence of informatic problems common for human-readable notations. The CSDB Linear implementation provides translation to other carbohydrate notations and multiple procedures for content error checking.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31790229 DOI: 10.1021/acs.jcim.9b00744
Source DB: PubMed Journal: J Chem Inf Model ISSN: 1549-9596 Impact factor: 4.956