| Literature DB >> 33870064 |
Abstract
As scientists worldwide search for answers to the overwhelmingly unknown behind the deadly pandemic, the literature concerning COVID-19 has been growing exponentially. Keeping abreast of the body of literature at such a rapidly advancing pace poses significant challenges not only to active researchers but also to society as a whole. Although numerous data resources have been made openly available, the analytic and synthetic process that is essential in effectively navigating through the vast amount of information with heightened levels of uncertainty remains a significant bottleneck. We introduce a generic method that facilitates the data collection and sense-making process when dealing with a rapidly growing landscape of a research domain such as COVID-19 at multiple levels of granularity. The method integrates the analysis of structural and temporal patterns in scholarly publications with the delineation of thematic concentrations and the types of uncertainties that may offer additional insights into the complexity of the unknown. We demonstrate the application of the method in a study of the COVID-19 literature.Entities:
Keywords: COVID-19; CiteSpace; Microsoft Academic Services; citation context analysis; epistemic uncertainty; scientometrics; visual analytics
Year: 2020 PMID: 33870064 PMCID: PMC8025977 DOI: 10.3389/frma.2020.607286
Source DB: PubMed Journal: Front Res Metr Anal ISSN: 2504-0537
A self-constructed dataset of the COVID-19 literature on Microsoft Academic Graph as of September 5, 2020.
| Time of publication | Unique citing articles | Unique references | Citation contexts | Unique contexts |
|---|---|---|---|---|
| 2014-SEP | 1 | 14 | 38 | 23 |
| 2018-FEB | 1 | 57 | 92 | 59 |
| 2018-NOV | 1 | 18 | 32 | 26 |
| 2019-JAN | 2 | 18 | 22 | 14 |
| 2019-JUN | 1 | 5 | 15 | 15 |
| 2019-MAR | 1 | 81 | 172 | 73 |
| 2019-MAY | 1 | 246 | 443 | 221 |
| 2019-SEP | 1 | 10 | 17 | 17 |
| 2019-DEC | 4 | 102 | 157 | 96 |
| 2020-JAN | 241 | 1,939 | 3,476 | 2,529 |
| 2020-FEB | 241 | 1,601 | 3,287 | 2,251 |
| 2020-MAR | 997 | 5,704 | 13,206 | 9,596 |
| 2020-APR | 2,765 | 15,478 | 36,599 | 26,507 |
| 2020-MAY | 1,657 | 11,227 | 23,131 | 17,089 |
| 2020-JUN | 756 | 6,445 | 12,144 | 9,260 |
| 2020-JUL | 687 | 6,218 | 11,837 | 8,640 |
| 2020-AUG | 300 | 3,462 | 5,845 | 4,304 |
| 2020-SEP | 18 | 230 | 371 | 292 |
| 2020-OCT | 8 | 193 | 287 | 231 |
| 2020-NOV | 5 | 76 | 138 | 107 |
| 2020-DEC | 3 | 17 | 24 | 22 |
| 2021-JAN | 2 | 20 | 27 | 23 |
A simple comparison between different data sources on the search query used for this study as of September 5, 2020.
| Data source | Articles | % Of CORD-19 | Search strategy |
|---|---|---|---|
| CORD-19 | 130,000 | 100.00 | Combined |
| Dimensions | 124,131 | 95.49 | Full text |
| Lens | 100,759 | 77.51 | Full text |
| Lens | 98,839 | 76.03 | Title, abstract, keyword |
| Dimensions | 90,747 | 69.81 | Title, abstract, keyword |
| Lens | 83,048 | 63.88 | Title |
| MAG | 80,676 | 62.06 | Fields of study |
| Dimensions | 75,435 | 58.03 | Title |
| Google scholar | 73,700 | 56.69 | Title, abstract, full text |
| Web of science | 29,858 | 22.97 | Topic search |
https://www.semanticscholar.org/cord19
https://app.dimensions.ai/discover/publication
https://www.lens.org/
https://docs.microsoft.com/en-us/academic-services/
https://scholar.google.com/schhp?hl=en
https://clarivate.com/webofsciencegroup/solutions/web-of-science/
FIGURE 1An overview of 1,330 top-cited articles in the COVID-19 literature of 77,897 articles. The size of a node represents the number of times the corresponding article has been cited in the dataset. The prominent theme of each cluster of cited articles is algorithmically labeled.
FIGURE 2Making sense of a cluster (#5 pregnant women). The list of citation contexts shown in the left window corresponds to the current mouse-over event on the concept of vertical transmission.
FIGURE 3Making sense of major themes of citations to a specific reference Li Q. (2020).
Epistemic uncertainties of citation contexts containing specific rhetorical words on conclusions.
| Citing Article→Cited reference | Epistemic uncertainty | Citation context Uncertainty: Uncertain/conflicting/contradict/inconsistent rhetorical: Conclusion/conclude |
|---|---|---|
| 3037877512→3018691224 | 0.0314 | Rly complex and counter-intuitive due to the |
| 3079224143→3013360115 | 0.0314 | [35] |
| 3020670761→3010344953 | 0.0314 | (14) Finally, a recent systematic review of the literature |
| 3023144169→2969352266 | 0.0237 | It can be |
| 3009935283→2811210701 | 0.0205 | 21 The |
| 3021685303→2802058961 | 0.01 | Moreover, the |
| 3007114958→3002108456 | 0.0014 | Compared with the results of the two studies on Wuhan cases by Chen et al. 18 and Huang et al. 19, we found that the gender proportion was equal in the 80 patients we included, |
Uncertainty cue words are in bold, whereas rhetorical words are in italic.
FIGURE 4Uncertainties of citation contexts of Li et al. (2020).
FIGURE 5Concept tree of a phrase: vaccine.
FIGURE 6An overview of a smaller network to illustrate SVA. The size of a disc in red depicts epistemic uncertainty (E). The largest three discs are labeled with a black background.
FIGURE 7The distribution of citation contexts with uncertainties is uneven. The most uncertainties are in clusters 0, 2, and 5.
FIGURE 8References associated with the strongest sentiment of uncertainty are from Cluster #0 spike protein.
FIGURE 9An article identified by SVA with a high transformative potential according to centrality divergence.
Citation contexts of the references cited by Fang and Meng (2020).
| Cluster | References | Citation context | Section heading |
|---|---|---|---|
| #4 laboratory diagnosis |
| Infection of SARS-CoV-2 triggers the host humoral response, leading to the generation of antibodies including IgA, IgM, and IgG against SARS-CoV-2 [ | #5 Serology testing |
| Another study with N protein-based ELISA on 208 plasma samples from 82 confirmed and 58 probable cases revealed that the median time for detection of IgM and IgA was 5 days while for IgG it was 14 days after symptom onset [ | #5 Serology testing | ||
| #8 neurological manifestation |
| A PCR screening test on 78 residents in a long-term care nursing home in Washington State resulted in the detection of 10 symptomatic, 10 pre-symptomatic, and 3 asymptomatic cases [ | #4 Large scale screening to detect asymptomatic or pre-symptomatic cases |
| #3 clinical trial |
| Preliminary results showed that about 68% of patients with severe COVID-19 treated with compassionate-use of remdesivir had clinical improvement [ | #1 Introduction |
| #7 inflammatory syndrome |
| Possible roles of cytokine storm syndrome that leads to critical disease and death of COVID-19 patients have been discussed [ | #3 Clinical characteristics of COVID-19 |
| In fact, compared with non-intensive care unit (ICU) patients, ICU patients had higher plasma levels of IL-2, IL-6, IL-7, granulocyte-colony stimulating factor, interferon-γ inducible protein 10, monocyte chemo-attractant protein 1, macrophage inflammatory protein 1-α, and TNFα [7,48,49, | #3 Clinical characteristics of COVID-19 | ||
| In critically ill patients, cytokines and other biomarkers are significantly changed and measurement of these biochemical markers can be used to determine the severity and mortality of the disease [7,48,50, | #3 Clinical characteristics of COVID-19 |
Some of the articles with the strongest transformative potentials in terms of M for modularity change. C-L is for cluster linkage. C-D is for centrality divergence. The Harmonic column shows the harmonic mean of M, C-L, and C-D. The NR column is the number of references cited by the corresponding article.
| MAG ID | M | C-L | C-D | Harmonic | Citations | NR | Title | References |
|---|---|---|---|---|---|---|---|---|
| 3006967091 | 9.082 | 0.242 | 0.134 | 0.257 | 60 | 32 | 2019 Novel coronavirus of pneumonia in wuhan china emerging attack and management strategies |
|
| 3033364035 | 5.748 | 0.020 | 0.260 | 0.055 | 19 | 105 | Covid 19 from epidemiology to treatment |
|
| 3036326077 | 4.594 | 0.016 | 0.303 | 0.047 | 4 | 116 | The laboratory's role in combating covid 19 |
|
| 3006282354 | 3.531 | 0.105 | 0.209 | 0.206 | 11 | 57 | Structural modeling of 2019 novel coronavirus ncov spike protein reveals a proteolytically sensitive activation loop as a distinguishing feature compared to sars cov and related sars like coronaviruses |
|
| 3035754070 | 2.984 | 0.010 | 0.138 | 0.029 | 0 | 80 | Covid 19 and the cardiovascular system a review of current data summary of best practices outline of controversies and illustrative case reports |
|
| 3028751559 | 2.277 | 0.008 | 0.239 | 0.024 | 1 | 123 | The epidemiology and therapeutic options for the covid 19 |
|
| 3037777580 | 1.796 | 0.006 | 0.174 | 0.018 | 1 | 56 | Rheumatology practice amid the covid 19 pandemic a pragmatic view |
|
| 3032928719 | 1.787 | 0.007 | 0.170 | 0.020 | 0 | 104 | Covid 19 breakthroughs separating fact from fiction |
|
| 3036140173 | 1.684 | 0.006 | 0.170 | 0.018 | 0 | 134 | Coronavirus disease 2019 covid 19 a short review on hematological manifestations |
|
| 3033243614 | 1.338 | 0.005 | 0.176 | 0.014 | 0 | 51 | Extrapulmonary and atypical clinical presentations of covid 19 |
|
| 3036886562 | 1.194 | 0.004 | 0.170 | 0.012 | 0 | 29 | Covid 19 and advanced practice registered nurses frontline update |
|
| 3039476797 | 1.039 | 0.003 | 0.150 | 0.010 | 0 | 157 | Covid 19 progress in diagnostics therapy and vaccination |
|
| 3035971111 | 1.012 | 0.003 | 0.148 | 0.010 | 0 | 42 | Gastrointestinal manifestations of covid 19 |
|
| 3044931581 | 0.789 | 0.003 | 0.137 | 0.008 | 0 | 45 | Sars cov 2 infection in children |
|
| 3036299138 | 0.579 | 0.002 | 0.128 | 0.006 | 1 | 44 | Targeting cytokine storm to manage patients with covid 19 a mini review |
|
| 3036482760 | 0.579 | 0.002 | 0.147 | 0.006 | 6 | 102 | Severe covid 19 and aging are monocytes the key |
|
| 3037103256 | 0.577 | 0.002 | 0.180 | 0.006 | 0 | 14 | Pediatric case of severe covid 19 with shock and multisystem inflammation |
|
| 3033462763 | 0.357 | 0.001 | 0.132 | 0.004 | 0 | 46 | Covid 19 pandemic and pediatric population with special References to congenital heart disease |
|
| 3036315910 | 0.213 | 0.001 | 0.140 | 0.002 | 0 | 13 | Strategies for successful catheterization laboratory recovery from the covid 19 pandemic |
|
| 3036932243 | 0.200 | 0.001 | 0.147 | 0.002 | 0 | 219 | Sars cov 2 an update on potential antivirals in light of sars cov antiviral drug discoveries |
|