Literature DB >> 32189081

AI-Driven Tools for Coronavirus Outbreak: Need of Active Learning and Cross-Population Train/Test Models on Multitudinal/Multimodal Data.

K C Santosh1.   

Abstract

The novel coronavirus (COVID-19) outbreak, which was identified in late 2019, requires special attention because of its future epidemics and possible global threats. Beside clinical procedures and treatments, since Artificial Intelligence (AI) promises a new paradigm for healthcare, several different AI tools that are built upon Machine Learning (ML) algorithms are employed for analyzing data and decision-making processes. This means that AI-driven tools help identify COVID-19 outbreaks as well as forecast their nature of spread across the globe. However, unlike other healthcare issues, for COVID-19, to detect COVID-19, AI-driven tools are expected to have active learning-based cross-population train/test models that employs multitudinal and multimodal data, which is the primary purpose of the paper.

Entities:  

Keywords:  Active learning; Artificial intelligence; COVID-19; Cross-population train/test models; Machine learning; Multitudinal and multimodal data

Mesh:

Year:  2020        PMID: 32189081      PMCID: PMC7087612          DOI: 10.1007/s10916-020-01562-1

Source DB:  PubMed          Journal:  J Med Syst        ISSN: 0148-5598            Impact factor:   4.460


Introduction

The novel coronavirus (COVID-19) is a global threat since it was identified in late 2019 [1]. About COVID-19, the Centers for Disease Control and Prevention report [2] has clearly mentioned the following (U.S. Feb. 24, 2020): Person-to-person spread of COVID-19 appears to occur mainly by respiratory transmission. How easily the virus is transmitted between persons is currently unclear. Signs and symptoms of COVID-19 include fever, cough, and shortness of breath [3]. Based on the incubation period of illness for Middle East Respiratory Syndrome (MERS) and Severe Acute Respiratory Syndrome (SARS) coronaviruses, as well as observational data from reports of travel-related COVID-19, CDC estimates that symptoms of COVID-19 occur within 2–14 days after exposure. According to the World Health Organization (WHO) report [4], as of today (March 09, 2020), China confirmed 80,904 cases and 3123 of them were died. Outside China, 28,673 were confirmed and 686 of them were died from 104 countries, where Italy is found to be the most influenced country after china: 7375 confirmed cases (366 deaths). Similarly, there are 7382 confirmed cases (248 deaths) in Republic of Korea, and U.S. is no exception (see Fig. 1). Based on the confirmed cases, fatality rate, as of today, is still less than other respiratory diseases: study of 72,000 COVID-19 patients finds 2.3% death rate [5]. Figure 2 provides confirmed cases of COVID-19 across the world. Considering its global coverage, the WHO has already been declared public health emergency and its possible global threats, including consequences [6]. The devastating case in Wuhan China and future epidemics require special attention [3, 7, 8].
Fig. 1

Known locations of coronavirus cases by county in the US. Circles are sized by the number of people there who have tested positive, which may differ from where they contracted the illness. More than 100 cases have been identified in New York. (source: https://www.nytimes.com/interactive/2020/us/coronavirus-us-cases.html, March 09, 2020)

Fig. 2

Countries, territories or areas with reported confirmed cases of COVID- 19 (source: https://www.who.int/docs/default-source/coronaviruse/situation-reports/ 20,200,309-sitrep-49-covid-19.pdf?sfvrsn = 70dabe61_4, March 09, 2020)

Known locations of coronavirus cases by county in the US. Circles are sized by the number of people there who have tested positive, which may differ from where they contracted the illness. More than 100 cases have been identified in New York. (source: https://www.nytimes.com/interactive/2020/us/coronavirus-us-cases.html, March 09, 2020) Countries, territories or areas with reported confirmed cases of COVID- 19 (source: https://www.who.int/docs/default-source/coronaviruse/situation-reports/ 20,200,309-sitrep-49-covid-19.pdf?sfvrsn = 70dabe61_4, March 09, 2020) At this point, it is important to note that coronavirus was not a surprise case, since several years ago, in 2003, a novel coronavirus was identified in patients with SARS, and it is thought to be caused by an unknown infectious agent [9-12]. Since then, apart from clinical procedures and treatments, Artificial Intelligence (AI) promises a new paradigm for healthcare. Several different AI tools that are built upon Machine Learning (ML) algorithms are employed to analyzing data and decision-making processes [13]. AI-driven tools can be used to identify novel coronavirus outbreaks as well as forecast their nature of spread across the globe. However, the fundamental theory behind AI-driven tools is that they require sufficient training data (of all possible cases). Often, traditional machine learning requires a clean set of annotated data so that classifiers can possibly be well trained, which falls under scope of supervised learning. Over the past 5 decades or more, tremendous progress has been made in resolving many issues of several different projects. However, we failed to reach the point: “to model an accurate classifier, how big the size of training samples should be?” Do we still wait for collecting fairly large amount of data? Deep Learning (DL), as an example, requires a large amount of data to be trained [14, 15]. The primary idea behind the use of DL is not only to avoid feature engineering but also to extract tiny features in radiology data (pixel-level nodule, for example) [16]. Collecting large amount of data is not trivial, and one has to wait for a long. Most of the reported AI-driven tools are limited to proof-of-concept models for coronavirus case. AI experts state the fact that limited data may skew results away from the severity of coronavirus outbreak. The Wall Street Journal [17] reported that coronavirus reveals limits of AI health tools: some diagnostic-app makers are holding off updating their tools, highlighting the shortage of data on the new coronavirus and the limitations of health services billed as AI when faced with novel, fast-spreading illnesses (Parmy Olson, February 29, 2020). In a nutshell, social medias, newspapers, and health reports, we note that conventional AI-driven tools for real-world cases (with less data) may not provide optimal performance. To detect COVID-19, AI-driven tools are expected to have -based that employs . In the following, within the framework of COVID-19, the concept of active learning will be discussed (Section 2). Cross-population train/test models and its usefulness for COVID-19 are discussed in Section 3. The necessity and the use of multitudinal and multimodal data are explained in Section 4. The paper concludes in Section 5.

Active learning (AL)

As compared to passive learning (traditional machine learning classifiers), active learning is used to a learning problem, where the learner has some role in determining on what data it will be trained [13]. When it is an emergency (COVID-19) [6], it requires a special attention so that data analysis and decision-making can be made consistently without waiting several days, months, and years for data collection. Again, exploiting real-time data (on-the-fly) is the must since one cannot wait for years to train machine and learn from them nor manual annotation/analysis is possible. This means that instead of having a conventional set of train, validation, and test set, we need AI-driven tools that can learn over time without having complete knowledge about the data, which we call Active Learning (AL). In other words, AL mechanism helps self-learn i.e., Incremental Learning (IL) over time in the presence of experts (if required) [18]. The ILs aim is to iteratively help learn model to adapt to new data without forgetting its existing limited knowledge. Figure 3 provides a schematic diagram of an AL mechanism, where different data types are used. While learning, the changes in data over time can be assessed with the help of Anomaly Detection (AD) techniques. In AL-based tool, AD helps find/identify rare items, events or observations that bring suspicions by differing significantly from the majority of the data or with respect to a set of normal data for that particular event.
Fig. 3

For time-series data, a schema of Active Learning (AL) model is provided. For better understanding, AL (in dotted red circle) is used with Deep Learning (DL) for all possible data types. In AL, expert’s feedback is used in parallel with the decisions from each data type. Since DL are data dependent, separate DLs are used for different data type. The final decision is made based on multitudinal and multimodal data

For time-series data, a schema of Active Learning (AL) model is provided. For better understanding, AL (in dotted red circle) is used with Deep Learning (DL) for all possible data types. In AL, expert’s feedback is used in parallel with the decisions from each data type. Since DL are data dependent, separate DLs are used for different data type. The final decision is made based on multitudinal and multimodal data

Cross-population train/test AI-driven models

Beside the use of AL in machine learning, cross-population train/test models are the must in such scenarios, since we do not have enough data from the particular regions. This means that there is a need to automatically detect nVirus in Italy from the model trained in Wuhan, China. In other words, for such a respiratory disease, it is essential to have cross-population train/test-based AI-driven models so that automated detection can be possible. In parallel, the collected data can be used for training models over time, which are based on the decisions. Conventionally, in the literature, such a concept does not exist.

Multitudinal and multimodal data

More often, AI-driven tools are limited to one data type. Decisions that are solely based on one data type (regardless of the data size) may be skewed away from the severity of coronavirus influence. In such a case, use of mutitudinal and multimodal data can help support decision-making process with higher confidence. Since coronaviruses are enveloped viruses with a positive-sense single-stranded RNA genome and a nucleocapsid of helical symmetry, the most popularly used data for AI-driven tools mostly employ RNA sequences. Besides, Electronic Health Record (EHRs), Computerized Tomography (CT) scans [19-21], Chest X-rays (CRRs, see Fig. 4), and other data are considered and tested. Alibaba launched a new AI-based system to detect coronavirus infection via CT scans with an accuracy of up to 96% [19]. As mentioned before, AD in image consists in finding portion of the images (a set of pixels) with anomalous and unusual patterns. With small changes in image pixels, spatio-temporal signatures can be deviated from standard threshold(s). Since AD is not just limited to image data; it can be applied for all types of data that are ranging from a vector (1D array/signal/pattern), 2D matrix (image, for instance) to multi-dimensional data. In the recently reported work [20], Chest CT (see Fig. 5, as an example) – which is used to diagnose COVID-19 – can be complemented to the reverse-transcription polymerase chain reaction (RT-PCR) tests. Therefore, instead of using different machine learning models for one data type (and even small size data [22]) and looking for ensemble techniques to combine results, for AI researchers, it is wise to use mutitudinal and multimodal data to check whether different data types can help yield consistent decisions about the coronavirus outbreak over time.
Fig. 4

Chest X-ray: Bilateral focal consolidation, lobar consolidation, and patchy consolidation are clearly observed (check lower lung [1])

Fig. 5

Chest CT: An axial CT image shows ground-glass opacities with a rounded morphology (arrows) in the right middle and lower lobes [21]

Chest X-ray: Bilateral focal consolidation, lobar consolidation, and patchy consolidation are clearly observed (check lower lung [1]) Chest CT: An axial CT image shows ground-glass opacities with a rounded morphology (arrows) in the right middle and lower lobes [21]

Conclusions

Considering the possible future epidemics of the COVID-19, in this paper, the importance of the AI-driven tools and their appropriate train and test models have been introduced and discussed. The primary purpose of the paper is that AI scientists do not always wait for the complete datasets to train, validate, and test the models. Instead, AI-driven tools are required to be implemented from the beginning of data collection, in parallel with the experts in the field, where active learning needs to be employed. To achieve higher confidence during decision-making process, rather than relying on one data type, several data types are expected to be employed. For this, under the scope of active learning, the use of multitudinal and multimodal data have been discussed. Besides, considering the spread rate of COVID-19 (across the globe), AI-driven tools are expected to work as cross-population train/test models.
  12 in total

1.  Identification of a novel coronavirus in patients with severe acute respiratory syndrome.

Authors:  Christian Drosten; Stephan Günther; Wolfgang Preiser; Sylvie van der Werf; Hans-Reinhard Brodt; Stephan Becker; Holger Rabenau; Marcus Panning; Larissa Kolesnikova; Ron A M Fouchier; Annemarie Berger; Ana-Maria Burguière; Jindrich Cinatl; Markus Eickmann; Nicolas Escriou; Klaus Grywna; Stefanie Kramme; Jean-Claude Manuguerra; Stefanie Müller; Volker Rickerts; Martin Stürmer; Simon Vieth; Hans-Dieter Klenk; Albert D M E Osterhaus; Herbert Schmitz; Hans Wilhelm Doerr
Journal:  N Engl J Med       Date:  2003-04-10       Impact factor: 91.245

2.  Deep learning and medical diagnosis.

Authors:  Marc Dewey; Peter Schlattmann
Journal:  Lancet       Date:  2019-11-09       Impact factor: 79.321

3.  Update: Public Health Response to the Coronavirus Disease 2019 Outbreak - United States, February 24, 2020.

Authors:  Daniel B Jernigan
Journal:  MMWR Morb Mortal Wkly Rep       Date:  2020-02-28       Impact factor: 17.586

4.  Chest CT Findings in Coronavirus Disease-19 (COVID-19): Relationship to Duration of Infection.

Authors:  Adam Bernheim; Xueyan Mei; Mingqian Huang; Yang Yang; Zahi A Fayad; Ning Zhang; Kaiyue Diao; Bin Lin; Xiqi Zhu; Kunwei Li; Shaolin Li; Hong Shan; Adam Jacobi; Michael Chung
Journal:  Radiology       Date:  2020-02-20       Impact factor: 11.105

5.  Correlation of Chest CT and RT-PCR Testing for Coronavirus Disease 2019 (COVID-19) in China: A Report of 1014 Cases.

Authors:  Tao Ai; Zhenlu Yang; Hongyan Hou; Chenao Zhan; Chong Chen; Wenzhi Lv; Qian Tao; Ziyong Sun; Liming Xia
Journal:  Radiology       Date:  2020-02-26       Impact factor: 11.105

6.  The Role of Augmented Intelligence (AI) in Detecting and Preventing the Spread of Novel Coronavirus.

Authors:  Justin B Long; Jesse M Ehrenfeld
Journal:  J Med Syst       Date:  2020-02-04       Impact factor: 4.460

7.  Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study.

Authors:  Nanshan Chen; Min Zhou; Xuan Dong; Jieming Qu; Fengyun Gong; Yang Han; Yang Qiu; Jingli Wang; Ying Liu; Yuan Wei; Jia'an Xia; Ting Yu; Xinxin Zhang; Li Zhang
Journal:  Lancet       Date:  2020-01-30       Impact factor: 79.321

Review 8.  From SARS to MERS, Thrusting Coronaviruses into the Spotlight.

Authors:  Zhiqi Song; Yanfeng Xu; Linlin Bao; Ling Zhang; Pin Yu; Yajin Qu; Hua Zhu; Wenjie Zhao; Yunlin Han; Chuan Qin
Journal:  Viruses       Date:  2019-01-14       Impact factor: 5.048

Review 9.  SARS and MERS: recent insights into emerging coronaviruses.

Authors:  Emmie de Wit; Neeltje van Doremalen; Darryl Falzarano; Vincent J Munster
Journal:  Nat Rev Microbiol       Date:  2016-06-27       Impact factor: 60.633

10.  Characteristics of and Important Lessons From the Coronavirus Disease 2019 (COVID-19) Outbreak in China: Summary of a Report of 72 314 Cases From the Chinese Center for Disease Control and Prevention.

Authors:  Zunyou Wu; Jennifer M McGoogan
Journal:  JAMA       Date:  2020-04-07       Impact factor: 56.272

View more
  68 in total

1.  Detecting SARS-CoV-2 From Chest X-Ray Using Artificial Intelligence.

Authors:  Md Manjurul Ahsan; Md Tanvir Ahad; Farzana Akter Soma; Shuva Paul; Ananna Chowdhury; Shahana Akter Luna; Munshi Md Shafwat Yazdan; Akhlaqur Rahman; Zahed Siddique; Pedro Huebner
Journal:  IEEE Access       Date:  2021-02-23       Impact factor: 3.367

2.  A Systematic Review on the Use of AI and ML for Fighting the COVID-19 Pandemic.

Authors:  Muhammad Nazrul Islam; Toki Tahmid Inan; Suzzana Rafi; Syeda Sabrina Akter; Iqbal H Sarker; A K M Najmul Islam
Journal:  IEEE Trans Artif Intell       Date:  2021-03-01

3.  CAD systems for COVID-19 diagnosis and disease stage classification by segmentation of infected regions from CT images.

Authors:  Mohammad H Alshayeji; Silpa ChandraBhasi Sindhu; Sa'ed Abed
Journal:  BMC Bioinformatics       Date:  2022-07-06       Impact factor: 3.307

4.  COVID-19 pulmonary consolidations detection in chest X-ray using progressive resizing and transfer learning techniques.

Authors:  Anant Bhatt; Amit Ganatra; Ketan Kotecha
Journal:  Heliyon       Date:  2021-06-05

5.  Covid-19 Imaging Tools: How Big Data is Big?

Authors:  K C Santosh; Sourodip Ghosh
Journal:  J Med Syst       Date:  2021-06-03       Impact factor: 4.460

6.  A Bi-fold Approach to Detect and Classify COVID-19 X-Ray Images and Symptom Auditor.

Authors:  Ahan Chatterjee; Swagatam Roy; Sunanda Das
Journal:  SN Comput Sci       Date:  2021-05-28

7.  Data-driven test strategy for COVID-19 using machine learning: A study in Lahore, Pakistan.

Authors:  Chuanli Huang; Min Wang; Warda Rafaqat; Salman Shabbir; Liping Lian; Jun Zhang; Siuming Lo; Weiguo Song
Journal:  Socioecon Plann Sci       Date:  2021-06-08       Impact factor: 4.923

8.  Weather indicators and improving air quality in association with COVID-19 pandemic in India.

Authors:  Rabin Chakrabortty; Subodh Chandra Pal; Manoranjan Ghosh; Alireza Arabameri; Asish Saha; Paramita Roy; Biswajeet Pradhan; Ayan Mondal; Phuong Thao Thi Ngo; Indrajit Chowdhuri; Ali P Yunus; Mehebub Sahana; Sadhan Malik; Biswajit Das
Journal:  Soft comput       Date:  2021-07-13       Impact factor: 3.732

Review 9.  Drug repurposing for COVID-19: Approaches, challenges and promising candidates.

Authors:  Yan Ling Ng; Cyrill Kafi Salim; Justin Jang Hann Chu
Journal:  Pharmacol Ther       Date:  2021-06-23       Impact factor: 12.310

Review 10.  Diagnosing COVID-19 in the Emergency Department: A Scoping Review of Clinical Examinations, Laboratory Tests, Imaging Accuracy, and Biases.

Authors:  Christopher R Carpenter; Philip A Mudd; Colin P West; Erin Wilber; Scott T Wilber
Journal:  Acad Emerg Med       Date:  2020-07-26       Impact factor: 5.221

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.