Literature DB >> 33846532

COVID-19 information retrieval with deep-learning based semantic search, question answering, and abstractive summarization.

Andre Esteva1, Anuprit Kale2, Romain Paulus2, Kazuma Hashimoto2, Wenpeng Yin2, Dragomir Radev2,3, Richard Socher2.   

Abstract

The COVID-19 global pandemic has resulted in international efforts to understand, track, and mitigate the disease, yielding a significant corpus of COVID-19 and SARS-CoV-2-related publications across scientific disciplines. Throughout 2020, over 400,000 coronavirus-related publications have been collected through the COVID-19 Open Research Dataset. Here, we present CO-Search, a semantic, multi-stage, search engine designed to handle complex queries over the COVID-19 literature, potentially aiding overburdened health workers in finding scientific answers and avoiding misinformation during a time of crisis. CO-Search is built from two sequential parts: a hybrid semantic-keyword retriever, which takes an input query and returns a sorted list of the 1000 most relevant documents, and a re-ranker, which further orders them by relevance. The retriever is composed of a deep learning model (Siamese-BERT) that encodes query-level meaning, along with two keyword-based models (BM25, TF-IDF) that emphasize the most important words of a query. The re-ranker assigns a relevance score to each document, computed from the outputs of (1) a question-answering module which gauges how much each document answers the query, and (2) an abstractive summarization module which determines how well a query matches a generated summary of the document. To account for the relatively limited dataset, we develop a text augmentation technique which splits the documents into pairs of paragraphs and the citations contained in them, creating millions of (citation title, paragraph) tuples for training the retriever. We evaluate our system ( http://einstein.ai/covid ) on the data of the TREC-COVID information retrieval challenge, obtaining strong performance across multiple key information retrieval metrics.

Entities:  

Year:  2021        PMID: 33846532     DOI: 10.1038/s41746-021-00437-0

Source DB:  PubMed          Journal:  NPJ Digit Med        ISSN: 2398-6352


  11 in total

1.  Multi-probe attention neural network for COVID-19 semantic indexing.

Authors:  Jinghang Gu; Rong Xiang; Xing Wang; Jing Li; Wenjie Li; Longhua Qian; Guodong Zhou; Chu-Ren Huang
Journal:  BMC Bioinformatics       Date:  2022-06-29       Impact factor: 3.307

2.  CoQUAD: a COVID-19 question answering dataset system, facilitating research, benchmarking, and practice.

Authors:  Shaina Raza; Brian Schwartz; Laura C Rosella
Journal:  BMC Bioinformatics       Date:  2022-06-02       Impact factor: 3.307

3.  Revealing Opinions for COVID-19 Questions Using a Context Retriever, Opinion Aggregator, and Question-Answering Model: Model Development Study.

Authors:  Zhao-Hua Lu; Jade Xiaoqing Wang; Xintong Li
Journal:  J Med Internet Res       Date:  2021-03-19       Impact factor: 5.428

4.  A term-based and citation network-based search system for COVID-19.

Authors:  Chrysoula Zerva; Samuel Taylor; Axel J Soto; Nhung T H Nguyen; Sophia Ananiadou
Journal:  JAMIA Open       Date:  2021-12-14

Review 5.  Data science approaches to confronting the COVID-19 pandemic: a narrative review.

Authors:  Qingpeng Zhang; Jianxi Gao; Joseph T Wu; Zhidong Cao; Daniel Dajun Zeng
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2021-11-22       Impact factor: 4.226

6.  Heterogeneous deep graph convolutional network with citation relational BERT for COVID-19 inline citation recommendation.

Authors:  Tao Dai; Jie Zhao; Dehong Li; Shun Tian; Xiangmo Zhao; Shirui Pan
Journal:  Expert Syst Appl       Date:  2022-09-17       Impact factor: 8.665

7.  Artificial intelligence for topic modelling in Hindu philosophy: Mapping themes between the Upanishads and the Bhagavad Gita.

Authors:  Rohitash Chandra; Mukul Ranjan
Journal:  PLoS One       Date:  2022-09-01       Impact factor: 3.752

Review 8.  COVID-19-Related Scientific Literature Exploration: Short Survey and Comparative Study.

Authors:  Bahaj Adil; Safae Lhazmir; Mounir Ghogho; Houda Benbrahim
Journal:  Biology (Basel)       Date:  2022-08-16

Review 9.  Searching for scientific evidence in a pandemic: An overview of TREC-COVID.

Authors:  Kirk Roberts; Tasmeer Alam; Steven Bedrick; Dina Demner-Fushman; Kyle Lo; Ian Soboroff; Ellen Voorhees; Lucy Lu Wang; William R Hersh
Journal:  J Biomed Inform       Date:  2021-07-08       Impact factor: 8.000

10.  COBERT: COVID-19 Question Answering System Using BERT.

Authors:  Jafar A Alzubi; Rachna Jain; Anubhav Singh; Pritee Parwekar; Meenu Gupta
Journal:  Arab J Sci Eng       Date:  2021-06-23       Impact factor: 2.334

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.