| Literature DB >> 32324859 |
Robel Kahsay1, Jeet Vora1, Rahi Navelkar1, Reza Mousavi1, Brian C Fochtman1, Xavier Holmes1, Nagarajan Pattabiraman1, Rene Ranzinger2, Rupali Mahadik2, Tatiana Williamson2, Sujeet Kulkarni2, Gaurav Agarwal2, Maria Martin3, Preethi Vasudev3, Leyla Garcia4, Nathan Edwards5, Wenjin Zhang5, Darren A Natale5, Karen Ross5, Kiyoko F Aoki-Kinoshita6, Matthew P Campbell7, William S York2, Raja Mazumder1.
Abstract
SUMMARY: Glycoinformatics plays a major role in glycobiology research, and the development of a comprehensive glycoinformatics knowledgebase is critical. This application note describes the GlyGen data model, processing workflow and the data access interfaces featuring programmatic use case example queries based on specific biological questions. The GlyGen project is a data integration, harmonization and dissemination project for carbohydrate and glycoconjugate-related data retrieved from multiple international data sources including UniProtKB, GlyTouCan, UniCarbKB and other key resources.Entities:
Year: 2020 PMID: 32324859 PMCID: PMC7320628 DOI: 10.1093/bioinformatics/btaa238
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.GlyGen data processing workflow showing various steps. Data are retrieved from various resources including UniProtKB, GlyTouCan, UniCarbKB, RefSeq and other key resources, followed by extraction and filtering based on relevance to glycobiology. Extracted data are integrated after harmonization that is based on various standard ontologies. The resulting datasets are then ingested into a MongoDB docstore and Virtuoso triplestore using the GlyGen data model