| Literature DB >> 32935101 |
N Gogate1, D Lyman1, K A Crandall2, R Kahsay1, D A Natale3, S Sen4, R Mazumder1,5.
Abstract
Scientists, medical researchers, and health care workers have mobilized worldwide in response to the outbreak of COVID-19, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2; SCoV2). Preliminary data have captured a wide range of host responses, symptoms, and lingering problems post-recovery within the human population. These variable clinical manifestations suggest differences in influential factors, such as innate and adaptive host immunity, existing or underlying health conditions, co-morbidities, genetics, and other factors. As COVID-19-related data continue to accumulate from disparate groups, the heterogeneous nature of these datasets poses challenges for efficient extrapolation of meaningful observations, hindering translation of information into clinical applications. Attempts to utilize, analyze, or combine biomarker datasets from multiple sources have shown to be inefficient and complicated, without a unifying resource. As such, there is an urgent need within the research community for the rapid development of an integrated and harmonized COVID-19 Biomarker Knowledgebase. By leveraging data collection and integration methods, backed by a robust data model developed to capture cancer biomarker data we have rapidly crowdsourced the collection and harmonization of COVID-19 biomarkers. Our resource currently has 138 unique biomarkers. We found multiple instances of the same biomarker substance being suggested as multiple biomarker types during our extensive cross-validation and manual curation. As a result, our Knowledgebase currently has 265 biomarker type combinations. Every biomarker entry is made comprehensive by bringing in together ancillary data from multiple sources such as biomarker accessions (canonical UniProtKB accession, PubChem Compound ID, Cell Ontology ID, Protein Ontology ID, NCI Thesaurus Code, and Disease Ontology ID), BEST biomarker category, and specimen type (Uberon Anatomy Ontology) unified with ontology standards. Our preliminary observations show distinct trends in the collated biomarkers. Most biomarkers are related to the immune system (SAA,TNF-∝, and IP-10) or coagulopathies (D-dimer, antithrombin, and VWF) and a few have already been established as cancer biomarkers (ACE2, IL-6, IL-4 and IL-2). These trends align with proposed hypotheses of clinical manifestations compounding the complexity of COVID-19 pathobiology. We explore these trends as we put forth a COVID-19 biomarker resource that will help researchers and diagnosticians alike. All biomarker data are freely available from https://data.oncomx.org/covid19 .Entities:
Year: 2020 PMID: 32935101 PMCID: PMC7491515 DOI: 10.1101/2020.09.09.196220
Source DB: PubMed Journal: bioRxiv
Figure 1.Model of BEST biomarker subtypes. Each category of Best Biomarker, indicated by red arrows, fulfils a distinct role “as an indicator of normal biological processes, pathogenic processes, or responses to an exposure or intervention.” Tests for specific instances of potential COVID-19 biomarkers (in brackets) provide measurable evidence data of existing or potential health status.
Figure 2.Steps for collection, organization, standardization and integration of the data elements in the COVID-19 biomarker resource data model.
Biomarker table header descriptions and column content.
| Header | Column Content |
|---|---|
| literature_evidence | PMID (PubMed ID) or DOI (DOI number) |
| biomarker_substance_accession | UPKB (UniProtKB/Swiss-Prot ac) or PCCID (PubChem Compound ID) or CO (Cell Ontology ID) or PRO (Protein Ontology ID) or DO (Disease Ontology ID) |
| biomarker_name | Common name or UniProtKB protein name and gene symbol or short name in parenthesis |
| measured_biomarker | increased/decreased level or expression or counts or ratio |
| specimen_type | Uberon name (Uberon ID) |
| BEST_biomarker_type | monitoring; diagnostic; prognostic; predictive; risk/susceptibility; safety; pharmacodynamic/response |
| drug | Drug name (DrugBank ID) |
| biomarker_description | Free text from paper (preferably unedited) |
| curator name | First and Last name |
| curator_ORCID | ORCID |
| curator_notes | Free text from curator |
| reviewer_initial | Name/Initial: Free text from reviewer |
Figure 3.COVID-19 Biomarkers highlights. Top biomarkers are depicted as increased or decreased levels. The size of the circle is indicative of the number of articles supporting the biomarker. The color of the circle corresponds to the BEST biomarker type (Blue – monitoring, Purple – prognostic, Green – diagnostic).