| Literature DB >> 35382104 |
Tao Hu1, Weihe Wendy Guan1, Xinyan Zhu2,3, Yuanzheng Shao2,3, Lingbo Liu4, Jing Du2, Hongqiang Liu5, Huan Zhou6, Jialei Wang6, Bing She7, Luyao Zhang8, Zhibin Li9, Peixiao Wang9,10, Yicheng Tang11, Ruizhi Hou12, Yun Li13, Dexuan Sha13, Yifan Yang14, Ben Lewis1, Devika Kakkar1, Shuming Bao15.
Abstract
The COVID-19 outbreak is a global pandemic declared by the World Health Organization, with rapidly increasing cases in most countries. A wide range of research is urgently needed for understanding the COVID-19 pandemic, such as transmissibility, geographic spreading, risk factors for infections, and economic impacts. Reliable data archive and sharing are essential to jump-start innovative research to combat COVID-19. This research is a collaborative and innovative effort in building such an archive, including the collection of various data resources relevant to COVID-19 research, such as daily cases, social media, population mobility, health facilities, climate, socioeconomic data, research articles, policy and regulation, and global news. Due to the heterogeneity between data sources, our effort also includes processing and integrating different datasets based on GIS (Geographic Information System) base maps to make them relatable and comparable. To keep the data files permanent, we published all open data to the Harvard Dataverse (https://dataverse.harvard.edu/dataverse/2019ncov), an online data management and sharing platform with a permanent Digital Object Identifier number for each dataset. Finally, preliminary studies are conducted based on the shared COVID-19 datasets and revealed different spatial transmission patterns among mainland China, Italy, and the United States.Entities:
Keywords: COVID-19; GIS; data repository; open data; spatial data
Year: 2022 PMID: 35382104 PMCID: PMC8969556 DOI: 10.2478/dim-2020-0012
Source DB: PubMed Journal: Data Inf Manag ISSN: 2543-9251
Open Resources Related to COVID-19 Study
| 1 | Cases | COVID-19 Cases | JHU CSSE | Global | Hours |
| 2 | Geographic Distribution of COVID-19 Cases Worldwide | European Centre for Disease Prevention and Control | Global | Daily | |
| 3 | COVID-19/2019-nCoV Time Series Infection Data Warehouse | Isaac Lin | China | Hours | |
| 4 | nCov2019: An R Package with Real-time Data, Historical Data, and Shiny App | Guangchuang Yu, Xijin Ge, et al. | China, South Korea, USA, Japan, Iran, Italy, Germany, and UK | Hours | |
| 5 | covid-19-data | USA | Daily | ||
| 6 | Dati COVID-19 Italia | Presidency of the Council of Ministers—Department of Civil Protection | Italy | Days | |
| 7 | OpenCOVID19-fr | Antoine Augusti, Colin Maudry, et al. | France | Hours | |
| 8 | Policy | OXFORD COVID-19 Government Response Tracker | University of Oxford | Global | Daily |
| 9 | State Actions to Mitigate the Spread of COVID-19 | Kaiser Family Foundation | USA | Daily | |
| 10 | Mobility | Baidu Mobility Data | Baidu, Inc. | China | Daily |
| 11 | Mobility Changes in Response to COVID-19 | Descartes Labs | USA | Daily | |
| 12 | Social media | #COVID-19: The First Public Coronavirus Twitter Dataset | University of Southern California | Global | Daily |
| 13 | Publications | Database of Publications on Coronavirus Disease (COVID-19) | WHO | Global | Daily |
| 14 | COVID-19 Open Research Dataset (CORD-19) | Allen Institute for AI | Global | Weekly |
Figure 1The flow chart of data collection and deployment.
A Sample of Outflow Matrix between 10 Cities on March 28, 2020 (%)
| Beijing | 8.78 | 2.33 | 7 | 3.5 | 4.82 | 3.04 | 16.07 | 18.41 | |
| Tianjin | 8.74 | 2.4 | 31.21 | 6.97 | 7.04 | 2.15 | 4.91 | 4.79 | |
| Shijiazhuang | 2.05 | 2.19 | 3.02 | 2.76 | 9.67 | 29.94 | 20.35 | 5.57 | |
| Tangshan | 1.8 | 17.03 | 1.96 | 37.06 | 1.04 | 0.98 | 1.95 | 3.71 | |
| Qinhuangdao | 0.47 | 1.35 | 0.64 | 15.19 | 0.16 | 0.2 | 0.46 | 0.66 | |
| Handan | 1.35 | 3.07 | 7.23 | 0.78 | 0.51 | 19.34 | 1.96 | 0.71 | |
| Xingtai | 0.77 | 1.16 | 21.77 | 0.72 | 0.51 | 19.44 | 2.26 | 0.78 | |
| Baoding | 10.97 | 3.83 | 18.67 | 2.49 | 1.83 | 2.62 | 2.6 | 7.97 | |
| Zhangjiakou | 2.77 | 0.85 | 1.38 | 1.33 | 0.58 | 0.34 | 0.24 | 2.17 |
Figure 2Daily count on global geo-tweets related to COVID-19 topics.
Figure 3Daily counts of global news in English related to COVID-19 topics.
Metadata Table for Air Quality Data
| CO | Content of carbon monoxide in the air | mg/m3 |
| SO2 | Content of sulfur dioxide in the air | mg/m3 |
| NO2 | Content of nitrogen dioxide in the air | mg/m3 |
| O3 | Content of ozone in the air | mg/m3 |
| PM2.5 | Suspended particulates <2.5 mm | mg/m3 |
| PM10 | Suspended particulates <10 mm | mg/m3 |
Top 20 Subject Categories and Key Words
| 1 | General and internal medicine | 440 | eCOVID-19 | 221 |
| 2 | Infectious diseases | 100 | SARS-CoV-2 | 95 |
| 3 | Public, environmental, and occupational health | 67 | coronavirus | 69 |
| 4 | Virology | 53 | 2019-nCoV | 60 |
| 5 | Research and experimental medicine | 51 | Coronavirus | 60 |
| 6 | Radiology, nuclear medicine, and medical imaging | 50 | China | 21 |
| 7 | Science and technology—other topics | 50 | Pneumonia | 20 |
| 8 | Biochemistry and molecular biology | 49 | Wuhan | 19 |
| 9 | Microbiology | 49 | pandemic | 17 |
| 10 | Immunology | 43 | pneumonia | 17 |
| 11 | Oncology | 38 | SARS | 16 |
| 12 | Cell biology | 37 | novel coronavirus | 15 |
| 13 | Pediatrics | 28 | epidemiology | 14 |
| 14 | Pharmacology and pharmacy | 28 | Novel coronavirus | 12 |
| 15 | Life sciences and biomedicine—other topics | 26 | Epidemiology | 11 |
| 16 | Respiratory system | 23 | SARS-CoV | 11 |
| 17 | Anesthesiology | 21 | outbreak | 11 |
| 18 | Emergency medicine | 21 | epidemic | 9 |
| 19 | Psychiatry | 19 | public health | 9 |
| 20 | Health care sciences and services | 18 | 2019 novel coronavirus | 8 |
Figure 4The integration of data from different sources with base maps.
Figure 5Place names mapping between data files and base maps.
List of COVID-19 Data Collections
| 1 | China daily cases with base map | China | January 14, 2020 | Weekly | DXY.com | |
| 2 | US daily cases with base map | USA | January 22, 2020 | Weekly | New York Times | |
| 3 | Global daily cases with base map | Global | January 22, 2020 | Weekly | John Hopkins University | |
| 4 | Baidu mobility data | China | January 01, 2020 | Weekly | Baidu.com | |
| 5 | Health facilities | China, USA | AutoNavi/US Department of Homeland Security | |||
| 6 | Social economics | USA | US Census Bureau | |||
| 7 | Climate | China | January 01, 2020 | Weekly | China Meteorological Administration | |
| 8 | Policy and regulation | China, USA, and global | January 01, 2020 | Weekly | BBC, CNN, China Daily, Tencent, etc. | |
| 9 | Scholarly articles | Global | January 01, 2020 | Weekly | Web of Science |
Figure 6Accumulated downloads of “COVID-19 Resources” on Harvard Dataverse.
Figure 7Global access to “COVID-19 Resources” by country by May 26, 2020.
Figure 8The global distributions of visitors to “COVID-19 Resources” on Harvard Dataverse.
Figure 9Accumulated confirmed cases (log scale) of COVID-19 by top 20 countries as of May 23, 2020.
Figure 10Spatiotemporal COVID-19 cases correlations in mainland China.
Figure 11Spatiotemporal COVID-19 cases correlations in Italy.
Figure 12Spatiotemporal COVID-19 cases correlations in the United States.