Daqing He1, Zhendong Wang2, Khushboo Thaker2, Ning Zou2.
Abstract
Academic collections, such as the COVID-19 Open Research Dataset (CORD-19), contain a large number of scholarly articles about COVID-19 and related viruses. These articles represent the latest developments in combating the COVID-19 pandemic across various disciplines. However, it is difficult for laypeople to access these articles because of the term mismatch problem caused by their limited medical knowledge. In this article, we present an effort to help laypeople access the CORD-19 collection by translating and expanding laypeople's keywords to their corresponding medical terminology using the National Library of Medicine's Consumer Health Vocabulary. We then developed a retrieval system called Search engine for Laypeople to access the COVID-19 literature (SLAC) using open-source software. Using the Centers for Disease Control and Prevention's FAQ questions as the basis for common questions that laypeople could be interested in, we performed a set of experiments to test the SLAC system and the translation and expansion (T&E) process. Our experimental results show that the T&E process helped to overcome the term mismatch problem by mapping laypeople's terms to the medical terms used in the academic articles. However, we also found that not all of laypeople's search topics can be meaningfully searched in the CORD-19 collection. This indicates the scope and the limitations of enabling laypeople to search academic article collections for high-quality information.
Keywords: COVID-19; consumer health vocabulary; information retrieval; laypeople; translation and expansion process
Year: 2022 PMID: 35382101 PMCID: PMC8969476 DOI: 10.2478/dim-2020-0011
Source DB: PubMed Journal: Data Inf Manag ISSN: 2543-9251
Examples of Laypeople Terms and Corresponding Medical Concepts Identified Via the T&E Process
| SARS | SARS virus | SARS coronavirus, SARS-Cov, Severe Acute Respiratory Syndrome |
| MERS | MERS Virus | middle east respiratory syndrome, MERS |
| hcq | Hcq | Hydroxychloroquine, chloroquine |
| Rat | Rat, brown rat | rattus norvegicus |
| Bat | bat | chiroptera |
| Pneumonia | Pneumonia, lung disease | pulmonary inflammation, pneumonitis, Lung inflammation |
| malaria | malaria | paludism |
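The mapping in the table above can be illustrated with a small lookup. This is a minimal sketch, not the SLAC implementation: the dictionary `CHV_SAMPLE` and the function `translate_and_expand` are hypothetical names, the entries are copied from a few table rows, and the real process draws on the full Consumer Health Vocabulary rather than a hand-written dictionary.

```python
# Hypothetical sample built from a few rows of the table above. The real T&E
# process uses the Consumer Health Vocabulary; its access method is not shown.
CHV_SAMPLE = {
    "hcq": ["hydroxychloroquine", "chloroquine"],
    "rat": ["brown rat", "rattus norvegicus"],
    "bat": ["chiroptera"],
    "malaria": ["paludism"],
}


def translate_and_expand(keyword: str) -> list[str]:
    """Return the keyword plus any mapped terms, lowercased and deduplicated."""
    key = keyword.strip().lower()
    terms = [key]
    for term in CHV_SAMPLE.get(key, []):
        if term not in terms:
            terms.append(term)
    return terms


print(translate_and_expand("HCQ"))    # ['hcq', 'hydroxychloroquine', 'chloroquine']
print(translate_and_expand("fever"))  # ['fever'] (no mapping in the sample)
```

A keyword with no entry is passed through unchanged, so the sketch never produces an empty term list.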
Figure 1. The Architecture and Data Flow of the SLAC System
Figure 2. The Screenshot of the SLAC's Frontend Interface
33 Search Questions Converted from CDC COVID-19 FAQs
| COVID-19 Basics | 0 | What is a novel coronavirus? |
| | 2 | Why might someone blame or avoid individuals and groups create stigma because of COVID-19? |
| | 4 | Why do some state's COVID-19 case number sometimes differ from what is posted on CDC's website? |
| How COVID-19 Spreads | 8 | How does the virus spread? |
| | 10 | Can someone who has had COVID-19 spread the illness to others? |
| | 15 | What is community spread? |
| How to Protect Yourself | 18 | Am I at risk for COVID-19 in the United States? |
| | 20 | What should I do if I have had close contact with someone who has COVID-19? |
| | 23 | Is it okay for me to donate blood? |
| COVID-19 and Children | 24 | What is the risk of my child becoming sick with COVID-19? |
| | 27 | Should children wear masks? |
| | 29 | What steps should parents take to protect children during a community outbreak? |
| School Dismissals and Children | 30 | While school's out can my child hang out with their friends? |
| | 32 | While school's out will kids have access to meals? |
| | 34 | While school's out limit time with older adults including relatives and people with chronic medical conditions? |
| Preparing Your Home and Family for COVID-19 | 35 | How can my family and I prepare for COVID-19? |
| | 37 | What should I do if someone in my house gets sick with COVID-19? |
| | 41 | Should I make my own hand sanitizer if I can’t find it in the stores? |
| In Case of an Outbreak in Your Community | 42 | What should I do if there is an outbreak in my community? |
| | 43 | Will schools be dismissed if there is an outbreak in my community? |
| | 44 | Should I go to work if there is an outbreak in my community? |
| Symptoms and Testing | 46 | What are the symptoms and complications that COVID-19 can cause? |
| | 47 | Should I be tested for COVID-19? |
| | 48 | Where can I get tested for COVID-19? |
| Higher Risk | 50 | Who is at higher risk for serious illness from COVID-19? |
| | 52 | How were the underlying conditions for people considered higher risk of serious illness with COVID-19 selected? |
| | 56 | Are people with disabilities at higher risk? |
| COVID-19 and Funerals | 57 | Am I at risk if I go to a funeral or visitation service for someone who died of COVID-19? |
| | 60 | What should I do if my family member died from COVID-19 while overseas? |
| | 61 | My family member died from COVID-19 while overseas? What are the requirements for returning the body to the United States? |
| COVID-19 and Animals | 62 | Can I get COVID-19 from my pets or other animals? |
| | 65 | Should I avoid contact with pets or other animals if I am sick with COVID-19? |
| | 69 | What precautions should be taken for animals that have recently been imported from outside the United States for example by shelters rescues or as personal pets? |
Search Question 8, Which is Converted from a Question at CDC COVID-19 FAQs.
| [CDC Topic]: | How COVID-19 Spreads |
| [Question]: | How does the virus spread? |
| [Answer]: | The virus that causes COVID-19 is thought to spread mainly from person to person, mainly through respiratory droplets produced when an infected person coughs or sneezes. These droplets can land in the mouths or noses of people who are nearby or possibly be inhaled into the lungs. Spread is more likely when people are in close contact with one another (within about 6 feet). COVID-19 seems to be spreading easily and sustainably in the community (“community spread”) in many affected geographic areas. Community spread means people have been infected with the virus in an area, including some who are not sure how or where they became infected. |
Experiment Results
| Avg Num Query Terms per Query | 5.76 | 23.70 | 10.00 | 54.03 |
| Num Queries with > 0 Returned Docs | 19 | 26 | 12 | 29 |
| Avg Relevance Score | 1.97 | 2.09 | 1.46 | 2.28 |
| Avg nDCG@10 | 0.543 | 0.728 | 0.350 | 0.806 |
| Avg nDCG@10 on non-0 queries | 0.944 | 0.924 | 0.964 | 0.917 |
| Precision@5 | 0.382 | 0.436 | 0.297 | 0.418 |
| Precision@10 | 0.348 | 0.373 | 0.267 | 0.373 |
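The two nDCG@10 rows above appear to be linked through the number of queries that returned documents. The sketch below checks that reading under one stated assumption, which the table itself does not confirm: a query with no returned documents contributes an nDCG@10 of 0 to the overall average across all 33 search questions. The names `columns` and `overall_average` are illustrative only.

```python
TOTAL_QUERIES = 33  # number of search questions in the study

# One tuple per results column, in the order the table lists them:
# (queries with > 0 returned docs, avg nDCG@10 on those queries, reported avg nDCG@10)
columns = [
    (19, 0.944, 0.543),
    (26, 0.924, 0.728),
    (12, 0.964, 0.350),
    (29, 0.917, 0.806),
]


def overall_average(non_zero_count, non_zero_avg, total=TOTAL_QUERIES):
    """Average over all queries, counting empty result lists as nDCG = 0 (assumption)."""
    return non_zero_count * non_zero_avg / total


for count, non_zero_avg, reported in columns:
    estimate = overall_average(count, non_zero_avg)
    print(f"estimated {estimate:.3f} vs reported {reported:.3f}")
```

Under this assumption each estimate agrees with the reported overall average to within rounding, which suggests that the gap between the two rows is driven mainly by how many queries return any documents at all.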
Queries for Different Runs for Search Question 8.
| “spreads” AND “virus” AND “spread” AND “covid-19” | |
| (“smear-instruction imperative” OR “spreads” OR “spread”) AND (“viridae” OR “viruses” OR “virus”) AND (“smear-instruction imperative” OR “spread”) AND (“corona infection virus” OR “corona infections virus” OR “coronavirus” OR “corona virus” OR “genus: coronavirus” OR “covid-19” OR “coronavirus infections” OR “coronaviruses” OR “coronavirus infection”) | |
| “spread” AND “covid-19” AND (person to person OR “close contact” OR “six feet”) | |
| (“smear-instruction imperative” OR “spread”) AND (“corona virus” OR “corona infections virus” OR “coronavirus” OR “coronaviruses” OR “coronavirus infection” OR “covid-19” OR “coronavirus infections” OR “genus: coronavirus” OR “corona infection virus”) AND (person to person OR (“contact” OR “close contact” OR “close” OR “closed” OR “contact with” OR “closing” OR “contacting”) OR (“six feet” OR “feet, unit of measurement” OR “ft” OR “six” OR “feet”)) AND ((“respiratory droplets” OR “respiratory”) OR (“cough, ctcae” OR “coughs” OR “cough”) OR (“sneezing” OR “sneezes” OR “sneeze”) OR (“inhal” OR “inhalation” OR “lung structure” OR “in breathing” OR “inspiration” OR “inspirations” OR “inhaled” OR “pulmonary” OR “inhalations” OR “lung” OR “lung structures” OR “inspired” OR “inhaling” OR “inspir” OR “lungs” OR “breathing inspiration” OR “inspiration function” OR “breathing” OR “inspiratory” OR “breathing in” OR “inhaled into lungs” OR “respiratory aspiration”)) | |
| (“smear-instruction imperative” OR “spreads” OR “spread”) AND (“viridae” OR “viruses” OR “virus”) AND (“smear-instruction imperative” OR “spread”) AND “covid-19” |
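The queries above share one shape: each concept becomes a group of quoted alternatives joined by OR, and the groups are joined by AND. A minimal sketch of that construction follows. The helper names `or_group` and `build_query` are hypothetical, straight quotes replace the typographic quotes shown in the table, and the exact query syntax accepted by the SLAC backend is not specified in the source.

```python
def or_group(terms):
    """Quote each term and join with OR; parenthesize only when there are several."""
    quoted = [f'"{term}"' for term in terms]
    if len(quoted) == 1:
        return quoted[0]
    return "(" + " OR ".join(quoted) + ")"


def build_query(concepts):
    """Join one OR-group per concept with AND."""
    return " AND ".join(or_group(terms) for terms in concepts)


# Unexpanded query: one term per concept.
plain = build_query([["spreads"], ["virus"], ["spread"], ["covid-19"]])
print(plain)

# Expanded query: each concept carries its translated and expanded terms.
expanded = build_query([
    ["smear-instruction imperative", "spreads", "spread"],
    ["viridae", "viruses", "virus"],
])
print(expanded)
```

Keeping each concept as its own list makes it easy to swap a single keyword for its expanded term set without touching the rest of the query.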
Top 10 Returned Documents for the Runs of Search Question 8.
| 1 | qz9tgl83 (5) | b518n9dx (3) | qz9tgl83 (5) | ||
| 2 | 8ozauxlk (5) | djuomhww (5) | yg5posts (5) | ||
| 3 | yg5posts (5) | 0lyxvex0 (4) | oee19duz (5) | ||
| 4 | xfjexm5b (4) | zpaqd5vd (4) | fu8ndhdo (4) | iv753tly (1) | 8ozauxlk (5) |
| 5 | 9em5tjya (4) | pidar1gz (3) | 52zjm9jt (2) | k4lzwfge (3) | 52zjm9jt (5) |
| 6 | oee19duz (5) | nn15iyqd (4) | bpukqctg (5) | c9ts2g7w (3) | xcacty89 (4) |
| 7 | xcacty89 (4) | 8lku99jc (5) | sl6gsjz4 (2) | bbmcenpy (3) | |
| 8 | dxtbp4kd (5) | s155i4e9 (4) | 0hrmk77p (5) | 39tg92sa (2) | xfjexm5b (4) |
| 9 | ztl54g6q (5) | ztcyvsoi (4) | ycrrsr5c (3) | ztl54g6q (5) | |
| 10 | lioj0tkn (3) | smmrl5i6 (1) | k2ixwz9w (4) | hmy8fs3g (5) | 9em5tjya (4) |
Returned Documents are Marked as “DOCID (RelScore).”
Documents Whose IDs are Bold are the Ones Shared by Multiple Runs.
Documents Marked with “*” are Shared between PQ and EQ’.
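Entries in the “DOCID (RelScore)” format can be turned into ranking metrics with a few lines of code. The sketch below is illustrative only and rests on several assumptions not confirmed by the source: DCG uses a log2 rank discount with the raw relevance score as gain, the ideal ranking is taken from the returned list itself, and a document counts as relevant for precision when its score is at least 4. The sample list takes the first entry of each table row, which may not correspond to a single run because the column alignment was lost.

```python
import math
import re


def parse_entry(entry):
    """Split 'DOCID (RelScore)' into (doc_id, score)."""
    match = re.fullmatch(r"\s*(\S+)\s*\((\d+)\)\s*", entry)
    if match is None:
        raise ValueError(f"unrecognized entry: {entry!r}")
    return match.group(1), int(match.group(2))


def dcg(scores, k=10):
    """Discounted cumulative gain with a log2 rank discount (assumed variant)."""
    return sum(s / math.log2(rank + 1) for rank, s in enumerate(scores[:k], start=1))


def ndcg(scores, k=10):
    """DCG divided by the DCG of the same scores sorted best-first."""
    ideal = dcg(sorted(scores, reverse=True), k)
    return dcg(scores, k) / ideal if ideal > 0 else 0.0


def precision_at_k(scores, k, threshold=4):
    """Share of the top k whose score meets the (assumed) relevance threshold."""
    return sum(1 for s in scores[:k] if s >= threshold) / k


# Illustrative input: first entry of each row in the table above.
ranked = [
    "qz9tgl83 (5)", "8ozauxlk (5)", "yg5posts (5)", "xfjexm5b (4)", "9em5tjya (4)",
    "oee19duz (5)", "xcacty89 (4)", "dxtbp4kd (5)", "ztl54g6q (5)", "lioj0tkn (3)",
]
scores = [parse_entry(entry)[1] for entry in ranked]
print(ndcg(scores, k=10), precision_at_k(scores, k=5))
```

Because the ideal ranking here is built only from the returned documents, an empty result list yields 0 and any list already sorted by relevance yields 1.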
Average nDCG@10 Values for Different CDC Topics
| COVID-19 Basics | 0.325 | 0.297 | 0.301 | 0.305 |
| COVID-19 Spreads | 0.952 | 0.930 | 0.965 | 0.947 |
| How to Protect Yourself | 0.933 | 0.905 | 0.667 | 0.923 |
| COVID-19 and Children | 0.643 | 0.905 | 0.295 | 0.879 |
| School Dismissals and Children | 0 | 0 | 0 | 0.611 |
| Preparing Your Home and Family for COVID-19 | 0.315 | 0.924 | 0.318 | 0.912 |
| In Case of an Outbreak in Your Community | 0.878 | 0.930 | 0.314 | 0.931 |
| Symptoms and Testing | 0.981 | 0.959 | 0.332 | 0.589 |
| Higher Risk | 0.948 | 0.937 | 0.328 | 0.860 |
| COVID-19 and Funerals | 0 | 0.620 | 0 | 0.982 |
| COVID-19 and Animals | 0 | 0.601 | 0.333 | 0.927 |
nDCG@10 Scores for Questions in CDC Topic “COVID-19 Basics”
| 0. What is a novel coronavirus? | 0.975 | 0.892 | 0.902 | 0.914 |
| 2. Why might someone blame or avoid individuals and groups create stigma because of COVID-19? | 0 | 0 | 0 | 0 |
| 4. Why do some state's COVID-19 case number sometimes differ from what is posted on CDC's website? | 0 | 0 | 0 | 0 |
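Averaging these per-question scores shows why the “COVID-19 Basics” topic scores low overall: two of its three questions score 0 in every column. A short sketch of the arithmetic, with the hypothetical name `basics` holding the rows above keyed by question number:

```python
# nDCG@10 per question, one value per results column, copied from the table above.
basics = {
    0: [0.975, 0.892, 0.902, 0.914],
    2: [0.0, 0.0, 0.0, 0.0],
    4: [0.0, 0.0, 0.0, 0.0],
}

# Average column by column across the three questions.
topic_avg = [round(sum(col) / len(col), 3) for col in zip(*basics.values())]
print(topic_avg)  # [0.325, 0.297, 0.301, 0.305]
```

These values match the “COVID-19 Basics” row of the per-topic nDCG@10 table.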