Wei Yu, Emily Drzymalla, Marta Gwinn, Muin J Khoury.
Abstract
The scientific response to the COVID-19 pandemic has produced an abundance of publications, including peer-reviewed articles and preprints, across a wide array of disciplines, from microbiology to medicine and social sciences. Genomics and precision health (GPH) technologies have had a particularly prominent role in medical and public health investigations and response; however, these domains are not simply defined and it is difficult to search for relevant information using traditional strategies. To quantify and track the ongoing contributions of GPH to the COVID-19 response, the Office of Genomics and Precision Public Health at the Centers for Disease Control and Prevention created the COVID-19 Genomics and Precision Health database (COVID-19 GPH), an open access knowledge management system and publications database that is continuously updated through machine learning and manual curation. As of February 11, 2022, COVID-GPH contained 31,597 articles, mostly on pathogen and human genomics (72%). The database also includes articles describing applications of machine learning and artificial intelligence to the investigation and control of COVID-19 (28%). COVID-GPH represents about 10% (22983/221241) of the literature on COVID-19 on PubMed. This unique knowledge management database makes it easier to explore, describe, and track how the pandemic response is accelerating the applications of genomics and precision health technologies. COVID-19 GPH can be freely accessed via https://phgkb.cdc.gov/PHGKB/coVInfoStartPage.action.
Keywords: COVID-19; Database; Genomics; Precision Public Health; SARS-CoV-2
Year: 2022 PMID: 35468755 PMCID: PMC9035978 DOI: 10.1186/s12879-022-07219-3
Source DB: PubMed Journal: BMC Infect Dis ISSN: 1471-2334 Impact factor: 3.667
Fig. 1 COVID-19 GPH data retrieval and curation processes
Performance evaluation of the automatic curation process (ACP)
| ACP | Experts: Positive | Experts: Negative | Sum |
|---|---|---|---|
| Positive | 41 | 14 | 55 |
| Negative | 9 | 435 | 444 |
| Sum | 50 | 449 | 499 |
Specificity = 0.97, Sensitivity = 0.82
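The reported specificity and sensitivity follow directly from the four cells of the ACP evaluation table, taking the expert labels as the reference standard. A minimal sketch of that arithmetic is shown here; the variable names are illustrative and do not come from the source.

```python
# Cell counts from the ACP evaluation table; expert labels are the reference.
tp = 41   # ACP positive, experts positive
fp = 14   # ACP positive, experts negative
fn = 9    # ACP negative, experts positive
tn = 435  # ACP negative, experts negative

# Sensitivity: share of expert-positive articles that the ACP also flagged.
sensitivity = tp / (tp + fn)

# Specificity: share of expert-negative articles that the ACP also rejected.
specificity = tn / (tn + fp)

total = tp + fp + fn + tn

print(f"n = {total}")
print(f"sensitivity = {sensitivity:.2f}")
print(f"specificity = {specificity:.2f}")
```

Rounded to two decimals, these values match the figures reported with the table.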
Fig. 2 Screenshot of the COVID-19 GPH landing page
Number of articles in COVID-19 GPH from each source
| Publication type | Number^a |
|---|---|
| PubMed | 22,983 |
| Online news/reports/publications | 431 |
| Preprint | 8,172 |
^a Number as of February 11, 2022
Publication category definitions
| Publication category | Description |
|---|---|
| Categories annotated by LitCovid from NCBI NIH | |
| Mechanism | Underlying cause(s) of COVID-19 infections and transmission and possible drug mechanism of action |
| Transmission | Characteristics and modes of COVID-19 transmission, such as human-to-human |
| Diagnosis | Disease assessment through symptoms, test results, and radiological features |
| Prevention | Prevention, control, response, and management strategies |
| Forecasting | Modelling and estimating the trend of COVID-19 spread |
| Categories annotated by the text mining tool from CDC PHGKB | |
| Health Equity | Relevant to health equity. Search terms are derived from a list provided by the Association for Territorial Health Officials, which includes terms such as diversity and health disparities |
| Vaccine | Relevant to vaccine development, evaluation, implementation, and impact |
| Variant | Relevant to SARS-CoV-2 variants and their impact on public health |
| Surveillance | Relevant to SARS-CoV-2 public health surveillance and tracking |
Fig. 3 Number of articles in each publication category, as of February 11, 2022. The publication categories are defined as follows. Mechanism: underlying cause(s) of COVID-19 infections and transmission and possible drug mechanism of action; transmission: characteristics and modes of COVID-19 transmission, such as human-to-human; diagnosis: disease assessment through symptoms, test results, and radiological features; prevention: prevention, control, response, and management strategies; forecasting: modelling and estimating the trend of COVID-19 spread; health equity: relevant to health equity, with search terms derived from a list provided by the Association for Territorial Health Officials, which includes terms such as diversity and health disparities; vaccine: relevant to vaccine development, evaluation, implementation, and impact; variant: relevant to SARS-CoV-2 variants and their impact on public health; surveillance: relevant to SARS-CoV-2 public health surveillance and tracking
Selected topics in the COVID-19 GPH database, with examples
| Section | Topic | Article name | Summary |
|---|---|---|---|
| GPH | Variants of SARS-CoV-2 | Genomic Analysis and Lineage Identification of SARS-CoV-2 Strains in Migrants Accessing Europe Through the Libyan Route | This study sequenced SARS-CoV-2 strains to determine the genetic variation and lineage of the virus in migrants |
| | | Rapid Emergence and Epidemiologic Characteristics of the SARS-CoV-2 B.1.526 Variant | This MMWR discusses the epidemiology of the B.1.526 variant, which has an E484K mutation, and compares the severity of disease resulting from B.1.526 to other SARS-CoV-2 strains |
| | Role of human genetic variation in COVID-19 susceptibility and severity | A cluster of differentiation 14 (CD14) polymorphism (C-159T rs2569190) is associated with SARS-CoV-2 infection and mortality in the European population | This study examined the association between the COVID-19 infection rate in each country and the T allele frequency for rs2569190 in that country's population |
| | | The influence of IFITM3 polymorphisms on susceptibility to SARS-CoV-2 infection and severity of COVID-19 | This study compared positive and negative SARS-CoV-2 cases in order to determine whether there are genotypic differences between the cases for SNPs within |
| | Efficacy of SARS-CoV-2 vaccines^a | Efficacy of NVX-CoV2373 Covid-19 Vaccine against the B.1.351 Variant | Different strains of SARS-CoV-2 have varying genotypes. As a result, the efficacy of a vaccine may vary for different strains. This study provides evidence for the efficacy of the NVX-CoV2373 vaccine against a specific strain, B.1.351 |
| | | Antibody Response to 2-Dose SARS-CoV-2 mRNA Vaccine Series in Solid Organ Transplant Recipients | mRNA vaccines use viral mRNA to generate immunity. This study examines the antibody response in solid organ transplant recipients after they received up to 2 doses of an mRNA vaccine for COVID-19 |
| | Impact of COVID-19 on people with genetic diseases | BRCA testing in a genomic diagnostics referral center during the COVID-19 pandemic | Mutations in |
| | | SARS-CoV-2 infection associated with hepatitis in an infant with X-linked severe combined immunodeficiency | A case report of an 11-week-old infant with X-linked severe combined immunodeficiency, a genetic disease resulting from mutations in |
| Non-GPH | Epidemic modeling of COVID-19 | Estimated transmissibility and impact of SARS-CoV-2 lineage B.1.1.7 in England | This study developed a mathematical model to predict the reproduction number, resulting hospitalizations, and resulting deaths for the B.1.1.7 COVID-19 variant |
| | | Modeling COVID-19 Pandemic with Hierarchical Quarantine and Time Delay | This study predicts the spread of COVID-19 using a SIDQR model and the effect of hierarchical quarantine on the spread of the virus |
| | Machine learning applied to COVID-19 data | COVID-Classifier: an automated machine learning model to assist in the diagnosis of COVID-19 infection in chest X-ray images | This study tests the accuracy of a machine learning model in distinguishing between COVID-19 cases, pneumonia cases, and normal cases using chest X-ray images |
| | | Application of Artificial Intelligence in COVID-19 Pandemic: Bibliometric Analysis | This review determines the research trends and publication patterns for using artificial intelligence for COVID-19 |
| | Big data analysis | Public Perception of COVID-19 Vaccines through Analysis of Twitter Content and Users | This study analyzed about 2.4 million tweets from about 1 million users in order to determine the general attitude of the public toward COVID-19 vaccination |
| | | IgM anti-SARS-CoV-2-specific determination: useful or confusing? Big Data analysis of a real-life scenario | This study analyzed a laboratory database, about 209,408 samples and tests, to determine the clinical utility of IgM detection in slowing the spread of COVID-19 |
^a mRNA vaccines or vaccines concerning SARS-CoV-2 variants
Fig. 4 Number of articles per month for all articles, GPH articles only, and non-GPH articles only
Fig. 5 Number of articles per month by publication category
Fig. 6 Percent of articles per publication category with 100 highest Altmetric scores
Comparison of major open-access COVID-19 scientific publication databases
| Category | All | GPH | Non-GPH |
|---|---|---|---|
| Rare Disease | 6811 | 5994 | 817 |
| Heart, Lung, Blood, and Sleep Disorders | 3689 | 2443 | 1246 |
| Environmental Health | 1839 | 1195 | 644 |
| Pharmacogenomics | 1295 | 1165 | 130 |
| Health Equity | 802 | 448 | 354 |
| Cancer | 690 | 586 | 104 |
| Neurological Disorders | 623 | 483 | 140 |
| Diabetes | 514 | 359 | 155 |
| Reproductive and Child Health | 448 | 382 | 66 |
| Primary Immune Deficiency | 362 | 358 | 4 |
| Family Health History | 10 | 8 | 2 |
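As a quick consistency check on the table above, each "All" count should equal the sum of its GPH and Non-GPH counts, and the GPH share per category can be derived from the same rows. The sketch below transcribes the table values; the dictionary layout and the names `rows`, `consistent`, and `gph_share` are illustrative assumptions, not part of the database or its tooling.

```python
# (All, GPH, Non-GPH) counts transcribed from the table above.
rows = {
    "Rare Disease": (6811, 5994, 817),
    "Heart, Lung, Blood, and Sleep Disorders": (3689, 2443, 1246),
    "Environmental Health": (1839, 1195, 644),
    "Pharmacogenomics": (1295, 1165, 130),
    "Health Equity": (802, 448, 354),
    "Cancer": (690, 586, 104),
    "Neurological Disorders": (623, 483, 140),
    "Diabetes": (514, 359, 155),
    "Reproductive and Child Health": (448, 382, 66),
    "Primary Immune Deficiency": (362, 358, 4),
    "Family Health History": (10, 8, 2),
}

# True for a category when GPH + Non-GPH adds up to the "All" column.
consistent = {
    name: gph + non_gph == total
    for name, (total, gph, non_gph) in rows.items()
}

# Fraction of each category's articles that are classified as GPH.
gph_share = {name: gph / total for name, (total, gph, _) in rows.items()}

for name in rows:
    flag = "ok" if consistent[name] else "MISMATCH"
    print(f"{name}: GPH share {gph_share[name]:.1%} ({flag})")
```

Every row passes the check, so the three columns are internally consistent as transcribed.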