Literature DB >> 34773807

Comprehensive analyses of bioinformatics applications in the fight against COVID-19 pandemic.

Lifei Ma¹, Huiyang Li², Jinping Lan³, Xiuqing Hao⁴, Huiying Liu⁴, Xiaoman Wang⁵, Yong Huang⁶.

Abstract

Novel coronavirus disease 2019 (COVID-19) is a global pandemic caused by severe acute respiratory syndrome coronavirus type 2 (SARS-CoV-2), which can be transmitted from person to person. As of September 21, 2021, over 228 million cases were diagnosed as COVID-19 infection in more than 200 countries and regions worldwide. The death toll is more than 4.69 million and the mortality rate has reached about 2.05% as it has gradually become a global plague, and the numbers are growing. Therefore, it is important to gain a deeper understanding of the genome and protein characteristics, clinical diagnostics, pathogenic mechanisms, and the development of antiviral drugs and vaccines against the novel coronavirus to deal with the COVID-19 pandemic. The traditional biology technologies are limited for COVID-19-related studies to understand the pandemic happening. Bioinformatics is the application of computational methods and analytical tools in the field of biological research which has obvious advantages in predicting the structure, product, function, and evolution of unknown genes and proteins, and in screening drugs and vaccines from a large amount of sequence information. Here, we comprehensively summarized several of the most important methods and applications relating to COVID-19 based on currently available reports of bioinformatics technologies, focusing on future research for overcoming the virus pandemic. Based on the next-generation sequencing (NGS) and third-generation sequencing (TGS) technology, not only virus can be detected, but also high quality SARS-CoV-2 genome could be obtained quickly. The emergence of data of genome sequences, variants, haplotypes of SARS-CoV-2 help us to understand genome and protein structure, variant calling, mutation, and other biological characteristics. After sequencing alignment and phylogenetic analysis, the bat may be the natural host of the novel coronavirus. Single-cell RNA sequencing provide abundant resource for discovering the mechanism of immune response induced by COVID-19. As an entry receptor, angiotensin-converting enzyme 2 (ACE2) can be used as a potential drug target to treat COVID-19. Molecular dynamics simulation, molecular docking and artificial intelligence (AI) technology of bioinformatics methods based on drug databases for SARS-CoV-2 can accelerate the development of drugs. Meanwhile, computational approaches are helpful to identify suitable vaccines to prevent COVID-19 infection through reverse vaccinology, Immunoinformatics and structural vaccinology.

Entities: Chemical

Keywords: Application; Bioinformatics technology; COVID-19; SARS-CoV-2

Mesh：

Substances：
Antiviral Agents

Year: 2021 PMID： 34773807 PMCID： PMC8560182 DOI： 10.1016/j.compbiolchem.2021.107599

Source DB: PubMed Journal: Comput Biol Chem ISSN： 1476-9271 Impact factor: 2.877

Introduction

Since December 2019, several cases of acute respiratory tract infection have been confirmed in Wuhan, Hubei province, China (C. Huang et al., 2020; Zhu et al., 2020). The large-scale pandemic of confirmed cases jumps unpredictably, spreads rapidly, and poses a severe threat to public health (Mahase, 2020; J.T. Wu et al., 2020). The World Health Organization (WHO) has declared the outbreak an "International Public Health Emergency" on March 11, 2020. The newly identified β-coronavirus (β-CoVs) was named severe acute respiratory syndrome coronavirus type 2 (SARS-CoV-2) by the Coronavirus Study Group (CSG) of the International Committee. The disease caused by SARS-CoV-2 was suspected to be an infectious disease and officially designated coronavirus disease 2019 (COVID-19) by WHO (Zhu et al., 2020). According to the current data, SARS-CoV-2 is easily transmissible among people through respiratory droplets and survives in the air for about 2 h, remaining highly infectiousness (Y. Liu et al., 2020; Munster et al., 2020; wei Lu et al., 2020; H. Zhang et al., 2020). The incubation period after infection is usually 4–8 days (Chen et al., 2020a; C. Huang et al., 2020; D. Wang et al., 2020). All ages are susceptible to SARS-CoV-2, however, older patients and those who smoke experience severe illness with comorbidities (Chen et al., 2020b; J. Zhang et al., 2020). Importantly, the likelihood of local transmission tends to increase due to cases, including human-to-human transmission in the asymptomatic infection period (Rothe et al., 2020, Yuan et al., 2006). The two highly pathogenic β-CoVs, which are known to cause severe acute respiratory syndrome (SARS) and Middle East respiratory syndrome (MERS), are zoonotic pathogens that mainly infect the lower respiratory tract and result in fatal pneumonia in humans, and cause a serious threat to human health (Channappanavar and Perlman, 2017, Luk et al., 2019, Ramadan and Shaib, 2019; F. Wu et al., 2020). The overall mortality rate reported by WHO is 9.6% for SARS and 34.4% for MERS (Munster et al., 2020, World Health Organization, 2020). In comparison, SARS-CoV-2 has a mortality rate of 2.05% in the world, and elder patients with chronic diseases and intensive care unit (ICU) patients have reached mortality rates of 17–38% (Chen et al., 2020a; D. Wang et al., 2020). Global Real Time COVID-19 outbreak map from WHO (https://covid19.who.int/) showing the confirmed cases, deaths and vaccine doses have been administered in different countries and regions worldwide. So far, there is no effective treatment for human coronavirus (CoVs) infection and no drugs are recommended. However, a large number of studies have made progress in the characterization of CoVs, and the therapies and vaccines targeting the virus are being studied extensively. Traditional biology technologies of virus research can not obtain such timely and large-scale data, and has other shortcomings. Bioinformatics technology has made great contributions to the rapid and in-depth study of SARS and MERS and will also bring a new dawn to the study of SARS-CoV-2 (Woo et al., 2010). Sequencing diagnosis is still the main method in the diagnosis of the novel coronavirus and plays an important role in virus functional research. In this scientific and technological battle, the analysis of sequencing the virus data by bioinformatics tools provides an important basis for the discovery of SARS-CoV-2. Databases of SARS-CoV-2 provide genome and protein structure quality evaluation, variant calling, and interactive visualization and other biological characteristics. Sequence alignment among different species help us to understand virus origin and evolutionary characteristics. The rapid, feasible, accurate, and sensitive point-of-care testing (POCT) approaches for detecting SARS-CoV-2 in different situation are urgently developed. By bioinformatics analysis of single-cell RNA sequencing of SARS-CoV-2 RNA, the features of epithelial and immune cell types were revealed. In addition, it can provide predictive and screening high-throughput computational support to accelerate pandemic treatment and prevention, such as specific drugs, and the development of vaccines. Small molecule drugs, screened from drug databases for SARS-CoV-2 through high-performance computing approaches, such as molecular dynamics simulation, molecular docking and artificial intelligence technology, can competitively bind to the functional sites of viral proteins, which prevents the viral protein from binding to its true substrate, thereby inhibiting viral activity or the receptors induce the host to generate immune responses and produce neutralizing antibodies against novel coronavirus (Yin et al., 2020). Although we know comparatively little about SARS-CoV-2, it is a highly pathogenic human pathogen, and possibly a zoonotic agent. Challenges will continue to be confronted in several key areas, therefore, in this review, evidence of the virus RNA genome sequence by bioinformatics methods is instrumental in the characteristic, origin, evolution, detection, pathogenesis and function of SARS-CoV-2, and the development of antiviral drugs and vaccines, to provide data for public health decision-making.

The bioinformatics technologies relating to COVID-19

The following sections provide several of the most important methods and applications of bioinformatics technologies in combating the COVID-19 based on currently available reports and the descriptive workflow is shown in Fig. 1.

Fig. 1

The descriptive workflow of the most important methods and applications of bioinformatics technologies on COVID-19.

Genome and protein structure in databases

According to the next-generation sequencing (NGS) and third-generation sequencing (TGS) technology, significantly updated and publicly available databases of SARS-CoV-2 offer genome and protein structure quality evaluation, variant calling, and interactive visualization. According to the Global Initiative on Sharing Avian Influenza Data (GISAID, https://www.gisaid.org) (Shu and McCauley, 2017), National Center for Biotechnology Information (NCBI, https://www.ncbi.nlm.nih.gov) (O’Leary et al., 2016), National Microbiology Data Center (NMDC, https://www.nmdc.cn) (Shi et al., 2019), China National GeneBank (CNGB, https://www.cngb.org) (Xiao et al., 2019), China National Center for Bioinformation (CNCB, https://bigd.big.ac.cn/ncov) and several other sources, SARS-CoV-2 genome sequences resources worldwide and their related metadata have been published ( Table 1) (Song et al., 2020; W.M. Zhao et al., 2020). After obtaining nucleic acid sequence, comprehensive bioinformatics analysis demonstrated that COVID-19 was caused by the novel coronavirus SARS-CoV-2 which belongs to lineage B β-CoVs and is a positive single-stranded RNA virus with a genome length of about 29.9 kb, with non-coding regions at both ends, and non-structural protein-coding regions and structural protein-coding regions in the middle (Chan et al., 2020; Roujian Lu et al., 2020; Zhu et al., 2020). SARS-CoV and MERS-CoV have positive-sense RNA genomes of 27.9 kb and 30.1 kb, respectively (De Wit et al., 2016). The complete RNA genome of SARS-CoV-2 includes 29,870 nucleotides and also contains a variable number of open reading frames (ORFs) (Chen et al., 2020a; Y. Liu et al., 2020; Munster et al., 2020; D. Wang et al., 2020; wei Lu et al., 2020; H. Zhang et al., 2020). Excluding the poly (A) tail (GenBank no. MN908947), the ORFs encode 9860 amino acids with five typical ORFs on a common coding strand and in the order of 5′ UTR-replicase polyprotein (orf1a/b, 7096-aa) – Spike glycoprotein (S, 1273-aa) – Envelope protein (E,75-aa) – Membrane protein (M, 222-aa) – Nucleocapsid protein (N, 419-aa)- 3′ UTR, in which the four essential protein-coding regions S, E, M, and N encode structural proteins ( Fig. 2A, B) (Chan et al., 2020). In addition, there are several accessory genes that influence the host innate immune response, such as ORF3b and ORF8. ORF3b encodes a completely novel short protein without an exact function. The new ORF8 probably encodes a secreted protein formed by an alpha-helix, followed by beta sheets that contain six strands that have no known functional domain or motif similarity (Chan et al., 2020). Two-thirds of viral RNA, mainly located in the coding region of non-structural proteins, including the ORF1a and ORF1b genes, translates two replicase polyproteins (pp1a and pp1ab) and are proteolytically cleaved into 16 non-structural proteins (NSP), namely NSP1–16, which are indispensable for viral transcription and replication (Chan et al., 2020; F. Wu et al., 2020; Zhou et al., 2020).

Table 1

Databases of genome and protein structure.

Type	Data center
Genome sequence	Global Initiative on Sharing Avian Influenza Data (GISAID, https://www.gisaid.org)
Genome sequence	National Center for Biotechnology Information (NCBI, https://www.ncbi.nlm.nih.gov)
Genome sequence	National Microbiology Data Center (NMDC, https://www.nmdc.cn)
Genome sequence	China National GeneBank (CNGB, https://www.cngb.org)
Genome sequence	China National Center for Bioinformation (CNCB, https://bigd.big.ac.cn/ncov)
Protein sequence/function	Universal Protein (UniProt, https://www.uniprot.org)
Protein sequence/structure	Research Collaboratory for Structural Bioinformatics Protein Data Bank Database (RCSB PDB, https://www.rcsb.org)

Fig. 2

Structure of SARS-CoV-2. (A) Schematic representation of the structure of SARS-CoV-2. It has four structural proteins, S (spike), E (envelope), M (membrane), and N (nucleocapsid) proteins; the N protein holds the single-strand, positive-sense RNA genome, and the S, E, and M proteins together create the viral envelope. (B) SARS-CoV-2 genome comprises a 5′ untranslated region (5′ UTR) including 5′ leader sequence, open reading frame (ORF) 1a/b, envelope, membrane and nucleoprotein, accessory proteins such as orf 3, 6,7a, 7b, 8, and 9b, and 3′ untranslated region (3′ UTR) in sequence. (C) SARS-CoV-2 structure is based upon amplification targets of the NTS method. NTS detected 12 fragments including ORF1ab and virulence factor-encoding regions (M. Wang et al., 2020).

Databases of genome and protein structure. Structure of SARS-CoV-2. (A) Schematic representation of the structure of SARS-CoV-2. It has four structural proteins, S (spike), E (envelope), M (membrane), and N (nucleocapsid) proteins; the N protein holds the single-strand, positive-sense RNA genome, and the S, E, and M proteins together create the viral envelope. (B) SARS-CoV-2 genome comprises a 5′ untranslated region (5′ UTR) including 5′ leader sequence, open reading frame (ORF) 1a/b, envelope, membrane and nucleoprotein, accessory proteins such as orf 3, 6,7a, 7b, 8, and 9b, and 3′ untranslated region (3′ UTR) in sequence. (C) SARS-CoV-2 structure is based upon amplification targets of the NTS method. NTS detected 12 fragments including ORF1ab and virulence factor-encoding regions (M. Wang et al., 2020). There are two main methods for predicting the virus structure through high-performance computing and bioinformatics. The first method is to clarify the virus structure through gene sequencing results. To ensure the accuracy of the method, the results require statistical verification. Among them, AlphaFold (Senior et al., 2020) de novo simulation is a deep two-dimensional dilated convolutional residual network, which predicts the protein structure or directs homologous modeling based on similar proteins in the sequence. The higher the sequence similarity, the higher is the reliability of structural simulation, common software include Swiss-Model (https://swissmodel.expasy.org/) (Waterhouse et al., 2018), Robetta (http://robetta.bakerlab.org/) (Kim et al., 2004) I-TASSER suite (Yang et al., 2014), and so on. Novel coronavirus structure can be predicted from the SARS virus structure, so de novo simulations using artificial intelligence programs are seldom needed. The second method to discover the structure of the virus is to reconstruct its three-dimensional structure through imaging techniques. In this process, high-performance computing, mainly using RELION software (Scheres, 2012), can accelerate the reconstruction of the three-dimensional structure. AlphaFold and RELION is recommended to calculate the virus protein structure, but sometimes computing power of AlphaFold may lead to over fitting. The coronavirus molecular weight is large and, therefore, cryo-electron microscopy (Cryo-EM) combined with bioinformatics technologies can accelerate the reconstruction of coronavirus. Currently, SARS-CoV-2 protein sequences and functional information were described in Universal Protein (UniProt, https://www.uniprot.org) and the world’s first three-dimensional structure of the new coronavirus protein is available in the Research Collaboratory for Structural Bioinformatics Protein Data Bank Database (RCSB PDB, https://www.rcsb.org, PDB ID: 6LU7) (Jin et al., 2020). The functional polypeptides are released from the polyproteins (pp1a and pp1ab) by extensive proteolytic processing, predominantly by a 33.8-kDa main protease (Mpro), which is also referred to as the 3 C-like protease. Mpro digests the polyproteins at no less than 11 conserved sites, starting with the autolytic cleavage of this enzyme itself from pp1a and pp1ab (Hegyi and Ziebuhr, 2002). The important role of Mpro in the viral life cycle, together with few closely related homologs in humans, identifies it as a key drug target against novel coronavirus (Pillaiyar et al., 2016). Through a high-resolution complex structure, academician Chen's team found that the antibody 4A8 has strong virus neutralization ability and can significantly inhibit the activity of the virus (Chi et al., 2020). Bioinformatics technology has become a powerful tool for identifying, analyzing, and monitoring the novel coronavirus, and has laid an important foundation for controlling the spread of the virus. The rapid acquirement of the genome sequence and protein structure of SARS-CoV-2 by bioinformatics technology has opened a fast channel for targeted drug screening and vaccine development.

Origin, similarity, and evolution

The origin and evolution of SARS-CoV-2 remain largely unclear. Several studies have suggested that bats are known to be hosts for more than 30 CoVs (Wong et al., 2019), and could be the potential natural reservoir of SARS-CoV-2 (Giovanetti et al., 2020, Paraskevis et al., 2020, Wong et al., 2019). On May 1, 2020, WHO stated that many scientists had studied the gene sequence of COVID-19 and were convinced that it originated from nature. However, there is no definitive evidence of the origin or intermediate host of the novel virus (Li et al., 2020, Phan et al., 2020). The virus genome sequencing results, sequencing alignment and phylogenetic analysis with bioinformatics tool MUSCLE (3.8.31) (Edgar, 2004) demonstrated that SARS-CoV-2 had a 96.3% sequence identity to BatCoV RaTG13, which originated from Yunnan, China in 2013. This was the closest relative to two bat-derived SARS-like CoVs, bat-SL-CoVZC45 and bat-SL-CoVZXC21, sharing 88% nucleotide similarity, originating from Zhoushan, China, in 2018, whereas it shared 79.5% identity to SARS-CoV and 50% identity to MERS-CoV ( Fig. 3) (Roujian Lu et al., 2020; Paraskevis et al., 2020; Zhou et al., 2020). The results suggest that bat CoV and human SARS-CoV-2 might come from a common ancestor, although bats are not sold in the seafood market, in Wuhan, China (F. Wu et al., 2020). Therefore, the natural host of novel coronavirus SARS-CoV-2 may be bats, but the origin and intermediate host is not yet conclusive. Protein sequences alignment and phylogenetic studies (Y. Zhao et al., 2020) observed that the residues of the receptor were similar in many species, which provides more possibilities of alternative intermediate hosts, such as turtles, pangolins, minks and snakes ( Table 2) (Ghorbani et al., 2021, Ji et al., 2020, Li et al., 2020; Z. Liu et al., 2020; Xiao et al., 2020). A recent study suggests that pangolin has the probability to be an intermediate host of SARS-CoV-2 but may not be the only one. Through the genomic sequence similarity analysis of SARS-CoV-2 obtained from different patients, its homology is more than 99.9%. Therefore, it is reasonable to consider that the SARS-CoV-2 strain originates from one source and can be detected relatively quickly (Roujian Lu et al., 2020; Zhou et al., 2020).

Fig. 3

Table 2

Comprehensive analysis of the host of SARS-CoV-2.

Host	Description
Bat	Primary host maybe bat. SARS-CoV-2 had a 96.3% sequence identity to BatCoV RaTG13, which originated from Yunnan, China in 2013. SARS-CoV-2 had a 88% identity to bat-SL-CoVZC45 and bat-SL-CoVZXC21 originating from Zhoushan, China, in 2018, whereas it shared 79.5% identity to SARS-CoV and 50% identity to MERS-CoV.
Turtle	Turtle maybe Intermediate host. The interaction between the key amino acids of S protein RBD and ACE2 indicated that, turtles (Chrysemys picta bellii, Chelonia mydas, and Pelodiscus sinensis) may act as the potential intermediate hosts.
Pangolin	Pangolin maybe Intermediate host. The isolation of a coronavirus from pangolins that is closely related to SARS-CoV-2.
Mink	Mink maybe Intermediate host. A high rate of variation within SARS-CoV-2 mink isolates implies that mink populations were infected before human populations.
Snake	Snake maybe Intermediate host. SARS-CoV-2 has most similar codon usage bias with snake.

Phylogenetic analysis of full-length genomes of SARS-CoV-2 and representative viruses of the genus Betacoronavirus. 2019-nCoV=SARS-CoV-2 = severe acute respiratory syndrome coronavirus type 2. MERS-CoV = Middle East respiratory syndrome coronavirus. SARS-CoV = severe acute respiratory syndrome coronavirus (Roujian Lu et al., 2020). Comprehensive analysis of the host of SARS-CoV-2. The S gene of the SARS-CoV-2 has less than 75% sequence identity to those of the bat SARS-like CoVs (SL-CoVZXC21 and ZC45) and human SARS-CoV (Zhou et al., 2020). Similarly, the spike glycoprotein encoded by the S genes of the SARS-CoV-2 was longer than that of the SARS-CoV (Zhou et al., 2020). The spike protein, composed of the S1 and S2 domain, is crucial in determining host tropism and transmission capacity through the mediation of receptor-binding and membrane fusion (Roujian Lu et al., 2020). Among these, the S2 subunit of the SARS-CoV-2 is highly conserved and has a 99% identity with that of SARS-CoV (Roujian Lu et al., 2020). The receptor-binding domain is usually located in the C-terminal domain of S1 to directly associate with the human receptor. Although the S1 domain of the SARS-CoV-2 only has approximately 70% identity with SARS-CoV, homology modeling revealed that the SARS-CoV-2 had a similar receptor-binding domain structure to that of SARS-CoV (Roujian Lu et al., 2020). Notably, Zhou et al. found that SARS-CoV-2, just like SARS-CoV, may use angiotensin-Converting Enzyme 2 (ACE2) as an entry receptor in ACE2-expressing cells, the majority of which are type II alveolar cells (AT2) in human lung (Edgar, 2004; Y. Zhao et al., 2020; Ziegler et al., 2020). The similarities and differences of the S protein of β coronavirus are shown in Table 3. Therefore, high-throughput bioinformatics technology is needed for large-scale screening to monitor whether ACE2 targeted drugs are effective in the treatment of SARS-CoV-2, and individual mutations need to be constantly observed to determine whether they have an effect on drug activity.

Table 3

Similarities and differences in the S protein of β coronavirus.

Type	Different sequence structures	Receptors
SARS-CoV-2	There are 12 short insertion sequences in the N-terminal domain of S protein, including (PRRA)CCT CGG CGG GCA	ACE2
SARS CoV	S1 domain contains 14 unique amino acids	ACE2
BAT-CoV- RaTG13	S protein contains short insertion sequences in the N-terminal domain	ACE2
MERS-CoV	The S1 domain RBD is different from SARS-CoV, which contains a core structure and a subsidiary subregion that functions as RBM	DPP4(CD26)
HCoV-HKU	Three identical S1 domains form an interwoven cap at the top of the S2 stem	unknown

Similarities and differences in the S protein of β coronavirus.

SARS-CoV-2 detection

At this time, diagnostic testing for SARS-CoV-2 can be conducted only at Centers for Disease Control and Prevention (CDC, https://www.cdc.gov/coronavirus/2019-nCoV/guidelines-clinical-specimens.html). Due to the unpredictable emergence of the SARS-CoV-2 virus, the range of diagnostic tools that can be performed in clinical practice is quite limited. To date, SARS-CoV-2 has been detected in human clinical specimens by real-time reverse transcription-polymerase chain reaction (qRT-PCR), next-generation sequencing, third-generation sequencing, RT-loop-mediated isothermal amplification (RT-LAMP), LAMP-Seq, antibody test, pulmonary CT, etc. (W.E. Huang et al., 2020; Lan et al., 2020; Renfei Lu et al., 2020; She et al., 2020). More rapid, feasible, accurate, and sensitive point-of-care testing (POCT) approaches for the diagnosis of COVID-19 in patients infected by SARS-CoV-2 are urgently needed. The comparison of detection methods of SARS-CoV-2 are shown below and in Table 4.

Table 4

Comparison of detection methods of SARS-CoV-2.

Methods	Advantages	Disadvantages
qRT-PCR	Gold standard, high specific, high rapid, economic, the most robust, and widely applied technology	Low sensitivity, speed is limited, require sophisticated equipment and exhibit false-negative rates
NGS	High accurate, high throughout, sequencing for high-quality genome	Equipment is expensive, operator expertise, and a turnaround time of > 24 h
NTS	Target amplification and long-read, High rapid, high accurate, higher sensitivity than standard qRT-PCR and sequencing for high-quality genome	Equipment is expensive, and more expensive than qRT-PCR
RT-LAMP	High sensitivity, simple operation, fast reaction speed (within 50 min), less dependent on complex equipment, and can detect of viruses, various pathogens, and even viral variants	The limitation of the RT-LAMP assay was calculated to be 118.6 copies per reaction.
LAMP-Seq	Low cost, high rapid (within 40 min) and highly sensitive protocol, population-scale testing	Without clinical certification
Antibody	Traditional mature technology	Lower sensitivity, and antibody production takes a long time
CT	Easy to operate and helpful in diagnosis	Asymptomatic infection cannot be diagnosed

Comparison of detection methods of SARS-CoV-2. Nucleic acid detection is the golden standard for COVID-19 diagnosis. From the testing methods conducted to detect the causative virus, qRT-PCR is specific, rapid, economic, the most robust, and widely applied technology for qualitative and quantitative diagnosis (Corman et al., 2020, Li et al., 2019a, Wan et al., 2016). However, the qRT-PCR requires sophisticated equipment and exhibits high false-negative rates, low sensitivity, with only 30–50% positive detection ratio, and cannot identify suspected infection, or close contact with confirmed cases, thereby resulting in patients mistakenly considered uninfected or cured and impeding pandemic containment (Chu et al., 2020, Lan et al., 2020, Lo and Chiu, 2020). LAMP is a high sensitivity technology with a simple operation, fast reaction speed (within 50 min), is less dependent on complex equipment, and has been widely used for the development of POCT techniques for the detection of viruses, various pathogens, and even viral variants (Renfei Lu et al., 2020; Notomi et al., 2000; Thai et al., 2004). Later, the mismatch-tolerant version of the LAMP method has higher sensitivity and faster reaction speed than the traditional method (Li et al., 2019b, Zhou et al., 2019). The limitation of the RT-LAMP assay was calculated to be 118.6 copies per reaction. Next-generation and third-generation sequencing are widely applied for mutation identification, pathogen identification and monitoring virus evolution (Metsky et al., 2017, Wilson et al., 2014) including SARS-CoV, but require expensive equipment, operator expertise, and a turnaround time of > 24 h, making them unsuitable for the current crisis. To effectively interrupt the transmission of SARS-CoV-2, promising POCT detection methods are needed. And to break the cycle of isolation and spread, LAMP-Seq and nanopore target sequencing (NTS) are capable of population-scale processing. Because traditional PCR detection needs a professional laboratory, its speed is greatly limited. LAMP-Seq, a low cost, rapid (within 40 min) and highly sensitive protocol, allows for population-scale testing using massively parallel RT-LAMP by using sequence-specific barcodes. It can pool processing, parallelize deep sequencing, and perform standard computing analysis to identify positive individuals (Schmid-Burgk et al., 2020; Y. Zhang et al., 2020). Most importantly, this method, in which the false negative and false positive rates are lower than 0.2%, is expected to scale hundreds of thousands of samples per day per sequencing facility (Schmid-Burgk et al., 2020). Although LAMP, DNA barcode and high-throughput sequencing technology are not new, their combination provides a new large-scale diagnostic idea for potentially larger outbreaks. On January 31, 2020, The New England Journal of Medicine reported the first patient diagnosed with COVID-19 in the United States and indicated nanopore sequencing as part of the sequencing toolkit. Subsequently, Belgium, China, Canada, and more than 16 countries used the ARTIC protocol and nanopore sequencing to complete the sequencing of the novel coronavirus in about 7 h (Fauver et al., 2020; J. Lu et al., 2020). Combining the advantage of target amplification and long-read real-time nanopore sequencing, the NTS to detect SARS-CoV-2 and other respiratory viruses was established simultaneously within 6–10 h, with higher sensitivity than standard qPCR (Taiaroa et al., 2020). For the virulence regions, NTS designed 11 fragments of 600–950 bp as targets, completely covering the 9115 bp region, and 26 primers were used to develop the SARS-CoV-2 primer panel (Fig. 2C) (M. Wang et al., 2020). For more testing samples, NTS could rapidly detect many positive samples within 10 min (for quick detection) and 4 h (for final evaluation), showing good concordance with qPCR results. Parallel testing with approved qPCR of SARS-CoV-2 and NTS using a variety of nucleic acid samples from suspected COVID-19 cases confirmed that NTS identified more infected patients as positive, and could also monitor mutations in the nucleic acid sequence or in other respiratory viruses. NTS is a rapid, accurate, and comprehensive detection method, which makes it suitable for the diagnosis of contemporary COVID-19 to reduce mortality. In addition, this approach has the potential to be further extended to diagnose more viruses or pathogens (Fauver et al., 2020; M. Wang et al., 2020). Oxford Nanopore MinION and GridION were used for NTS. MinION, the smallest Oxford Nanopore sequencer, is smaller than a cell phone and can be used in various environments with low equipment cost. Base-calling and the quality assessment of MinION sequencing data use Guppy (v. 3.1.5) software, and GridION uses MinKNOW (v. 3.6.5). The reads of each sample are mapped against the virus genome reference database using BLASTn (v. 2.9.0+) (Altschul et al., 1990). Sequence correction of nanopore sequencing data was performed using medaka (v. 0.10.5) (Oxford Nanopore Technologies, 2018). The ARTIC bioinformatics workflow is used to obtain consistent sequence, SNPs, and mutation data through published SARS-CoV-2 sequence information. Variants were identified in infected patients by single nucleotide mutations at seven sites by NTS and qPCR. Silent variants exist that have not been characterized by existing whole-genome sequencing methods (M. Wang et al., 2020). Finally, for data analysis, bioinformatics analysis may also be introduced for high-quality genome, variants, and high-throughput detection (Charalampous et al., 2019, Greninger et al., 2015). At the time of writing, qRT-PCR has become the gold standard method for diagnosis and clinical decision-making of antiviral therapy. While, next-generation and third-generation sequencing based on bioinformatics technology were established to conduct virus detection and genome assembly and large-scale sequencing analysis of COVID-19 in patient samples. Combining the detection of pathogenic organisms in respiratory specimens in clinical virology laboratories with bioinformatics analysis of the downstream data of viruses will help to gain insight into the mutation, spread, evolution, pathogenesis of the virus, and drug and vaccine development.

A single-cell atlas revealed COVID-19 immune features

Single-cell RNA sequencing (scRNA-seq) is another important technology which provides a comprehensive cell atlas comprehensive to identify that different peripheral immune subtype changes are associated with distinct clinical features of COVID-19. By bioinformatics analysis, SARS-CoV-2 RNA was found in a variety of epithelial and immune cell types, accompanied by significant changes in the transcriptome of virus positive cells. Monocytes and lymphocytes do not express large amounts of pro-inflammatory cytokines in peripheral blood (Wilk et al., 2020). While, upregulation of S100A8/A9 of peripheral megakaryocytes and monocytes mainly causes cytokine storm in severe patients (Ren et al., 2021). Collectively, single-cell RNA sequencing results represent a rich resource for understanding the complex multicellular immune response induced by COVID-19 and developing effective therapeutic directions for COVID-19.

Medicine screening for COVID-19

At present, bioinformatics based on high-performance computing is the standard of scientific research tools and will certainly be used to accelerate the development of specific drugs. In general, it is necessary for modern pharmaceuticals to understand the structure of the virus, and then evaluate the small protein molecules that can bind to and inactivate the virus. ACE2 as an entry receptor, potential drug target, plays an important role in the discovery and optimization of inhibitors blocking the entry of cells to treat disease. Small molecule drugs can competitively bind to the functional sites of viral proteins, which prevents the viral protein from binding to its true substrate, thereby inhibiting viral activity (Abd El-Aziz and Stockand, 2020). Drug databases for SARS-CoV-2 (D3Targets-2019-nCoV, CoViLigands, CORDITE, DockCov2, DrugBank, TCMD), accelerating molecular dynamics simulation (vsREMD), molecular docking (AutoDock, D3Docking, D3Similarity, CDOCKER), and artificial intelligence (AI) technology (DNN, MolAICal) will assist in drug design and screening through high-throughput computing with bioinformatics analysis. Drug databases for SARS-CoV-2 ( Table 5). D3Targets-2019-nCoV, a molecular docking-based webserver, is accessible freely to public (https://www.d3pharma.com/D3Targets-2019-nCoV/index.php) (Shi et al., 2020), which currently contains 42 viral proteins with 69 conformations for predicting potential target protein for active compounds and for drugs screening against targets that may have potential inhibitory effects on SARS-CoV-2 viral RNA polymerase. CoViLigands is an anti-coronavirus active compounds database which collected from previous publications at https://www.d3pharma.com/D3Targets-2019-nCoV/CoViLigands/2019-nCoV.php, which is updated regularly and available free of charge (Yang et al., 2021). CORDITE (CORona Drug InTEractions database, https://cordite.mathematik.uni-marburg.de) is accessible web server and integrates all available publications about potential drugs for SARS-CoV-2 (Martin et al., 2020). DockCoV2 a predicting the binding affinity of drugs with the SARS-CoV-2 virus encodes proteins database which contains 3109 drugs. DockCoV2 is available at https://covirus.cc/drugs/. Seven target structural proteins involved in viral entry and replication process, including 3 CLpro (PDB ID: 6LU7), PLpro (PDB ID: 6WX4), RdRp (PDB ID: 7BV2), spike receptor-binding domain (RBD) (PDB ID: 6M0J), N protein (PDB ID: 6M3M) and ACE2 (PDB ID: 1R42) were retrieved from Protein Data Bank (Chen et al., 2021). Drugbank database (https://www.drugbank.ca) contains comprehensive bioinformatics and chemoinformatics resources, and provides detailed drug, drug target information and its mechanism, including pharmacochemistry, pharmacology, pharmacokinetics, and drug-drug interaction information. Traditional Chinese Medicines Database (TCMD, 2009) provides traditional knowledge and modern science to understand traditional Chinese medicines (Traditional Chinese Medicine Database (TCMD), 2009).

Table 5

Drug databases for SARS-CoV-2.

Databases	Websites/Resources
D3Targets-2019-nCoV	https://www.d3pharma.com/D3Targets-2019-nCoV/index.php
CoViLigands	https://www.d3pharma.com/D3Targets-2019-nCoV/CoViLigands/2019-nCoV.php
CORDITE	https://cordite.mathematik.uni-marburg.de
DockCov2	https://covirus.cc/drugs/
DrugBank	https://www.drugbank.ca
TCMD	Software

Drug databases for SARS-CoV-2. Molecular dynamics (MD) simulation is a method that simulates experimental conditions and can display the microscopic evolution of the system from the atomic level (J. Wang et al., 2020). After the vsREMD simulation, the homology model of the novel coronavirus Mpro was constructed and N3 could fit inside the substrate-binding pocket, which could target the current novel coronavirus Mpro. Kinetic studies by balancing the binding constant Ki (designated as K2 / K1) and the inactivation rate constant K3 formed by covalent bonds also showed that N3 has a strong inhibitory effect on the novel coronavirus Mpro (Jin et al., 2020). Moreover, Joshi et al. conducted phylogenetic and sequence similarity network analysis on the main drug target, Mpro, and identified molecules that were strongly active against SARS-CoV-2 Mpro, such as δ-viniferin, myricitrin, Taiwanhomoflavone A, lactucopicrin 15-oxalate, nympholide A, afzelin, biorobin, hesperidin, and phyllaemblicin B, which also proved that these molecules have a powerful combination to other potential targets of SARS-CoV-2 infection, such as the viral receptor human angiotensin-converting enzyme 2 (HACE-2) and RNA-dependent RNA polymerase (RdRp) (Joshi et al., 2020). In addition, Chen et al. prepared a three-dimensional model of COVID-19 virus 3 C-like protease (3 CLpro, Mpro) and used the three-dimensional model computer to virtually screen 16 candidate drugs. Among them, the antiviral drugs ledipasvir and velpatasvir are effective as therapeutic drugs against novel coronaviruses with minimal side effects, such as fatigue and headache (Jo et al., 2020, Kang et al., 2020; Z. Wang et al., 2020). Molecular docking calculations are based on receptor active site regions to search for whether ligands interact with receptors and the optimal binding mode between them. The targets of SARS-CoV-2 were chosen Mpro, 3 CLpro, and RdRp and ligands were obtained from the DrugBank and D3Targets-2019-nCoV database. AutoDock (MGLTools, version 1.5.6) simulates and calculates each small molecule in a small molecule library, and observes the energy released through binding with proteins from various angles. The more stable the binding, the more likely this small molecule will be developed into a drug (Morris et al., 2009). D3Docking is a structure-based module embedded in the D3Targets-2019-nCoV web server, which utilized molecular docking to explore drug targets for drugs or active compounds observed from previous clinical studies and identify compounds against potential drug targets via protein structure based virtual screening (Shi et al., 2020). D3Similarity is a complementary approach to docking-based methods for the SARS-CoV-2 target prediction, virtual screening and target identification of potential coronavirus antivirals according to two-dimensional (2D)/three-dimensional (3D) molecular similarity, which lays the foundation for the screening of new COVID-19 drugs (Jin et al., 2020). For example, Remdisvir is an RdRp inhibitor, which is reported to effectively inhibit SARS-CoV and SARS-CoV-2 in D3Docking and clinical practice. Recent research reported that a batch of old and traditional Chinese medicines that could be effective in the therapy of pneumonia were discovered by using high-performance computing and bioinformatics methods (Ma et al., 2020; Z. Wang et al., 2020). A strategy of combining virtual screening and enzymatic testing can be used. It focuses on drug screening for marketed drugs, as well as self-establishing a database of highly mature medicinal compounds and ingredients of medicinal plant-derived compounds, which are rapidly distributed. Ma et al. (2020) established an optimized docking model and made a high-throughput virtual screening based on molecular docking technology. They obtained the crystal structures of Mpro and papain-like protein (PLP) from the PDB database and homology model and used CDOCKER program to conduct virtual screening of the TCMD, 11294 potential active ingredients were obtained by Mpro and PLP inhibitor screening. Rhubarb screened by Mpro and pollen screened by PLP both contain Maxing Shigan Decoction and Xuanbai Chengqi Decoction, which can be used for pandemic virus obstruction of the lung. The old drugs that are active natural products in traditional Chinese medicines and may have therapeutic effects on SARS-CoV-2 have been discovered, which greatly shortens the development time (Beck et al., 2020, Jin et al., 2020).

Artificial intelligence (AI)

Deep Neural Network (DNN) algorithm of AI technology is an efficient way to screen the potential drugs for the treatment of COVID-19 by using two different learning databases in which one is composed of compounds which have been reported or confirmed to have activity against SARS-CoV, SARS-CoV-2, human immunodeficiency virus, influenza virus, and the other contains the known 3C-like protease inhibitors (Ke et al., 2020). After selection, 13 drugs (bedaquiline, brequinar, celecoxib, clofazimine, conivaptan, gemcitabine, tolcapone, vismodegib, boceprevir, chloroquine, homoharringtonine, tilorone, and salinomycin) may prevent virus replication and indicate the potential to develop antiviral drugs (Ke et al., 2020). Deep learning is another approach of AI that applied into medicine and 2D/3D ligand design. MolAICal software supplies for 3D drug design of protein targets by artificial intelligence and classical algorithm (Bai et al., 2020). At present, the SARS-CoV-2 infectious disease outbreak is tending to expand. Screening and developing of therapeutic drugs for viral diseases have become the focus and difficulty of world scientific research. Old drugs and traditional Chinese medicines of anti-SARS-CoV-2 are screened by many kinds of molecular dynamics simulation, molecular docking and artificial intelligence technology with bioinformatics which are considered to be more effective and to short the development time. After screening, ledipasvir, velpatasvir and remdisvir are effective as therapeutic drugs against SARS-CoV-2. As well, these molecules, δ-viniferin, myricitrin, Taiwanhomoflavone A, lactucopicrin 15-oxalate, nympholide A, afzelin, biorobin, hesperidin, and phyllaemblicin B, have a powerful combination to other potential targets of SARS-CoV-2 infection. In addition, the bedaquiline, brequinar, celecoxib, clofazimine, conivaptan, gemcitabine, tolcapone, vismodegib, boceprevir, chloroquine, homoharringtonine, tilorone, and salinomycin are the potential antiviral drugs. And, one of the traditional Chinese medicines named Rhubarb, can be used for pandemic virus obstruction of the lung. However, new drugs also need to be designed and still need a large number of clinical trials.

Vaccine for COVID-19

Vaccination is the most effective means of preventing and controlling infectious diseases. Since the COVID-19 pandemic, researchers around the world have been accelerating the development of an effective candidate vaccine to combat this viral disease. At present, several applications of bioinformatics to vaccine research can develop the effectiveness and safety of the vaccines through reverse vaccinology, Immunoinformatics and structural vaccinology (Chukwudozie et al., 2021a). Reverse vaccinology (RV) is a promising approach that aims to find novel vaccine candidates through genome or proteome of pathogens. Systematic sequencing revealed that many isolates have deletions in the core genomes and may have a significant impact on viral structure or function. RV relies on bioinformatics methods to identify target antigens according to genomic information. The open reading frame (ORF) of organism genome was identified by VaxiJen server to determine various antigens and physicochemical properties related to antigen epitopes, reducing the time on the antigen selection process. Using the existing SARS-COV-2 genome sequence information, few researchers have used RV in the design of novel COVID-19 vaccine. In addition, it is also applied to develop a multi-epitope chimeric vaccine against SARS-COV-2 (Enayatkhani et al., 2021, Ong et al., 2020). Immunoinformatics approach involves the application of computational methods to epitope-based peptide vaccine design. After getting the SARS-COV-2 genome and 3D protein structure, the most antigenic polyprotein and inhibitor-binding sites were predicted using various bioinformatics tools, which can generate a specific immune response. In order to predict the immunogenic targets from the viral genome sequences, computational tools such as TEpredict, CTLPred, NetMHC, and Epitopemap were applied. Immunoinformatics tools can help researchers develop potentially suitable vaccine candidates by satisfactorily understanding the human immune response to organisms in a short time (Delany et al., 2013, María et al., 2017). Structural vaccinology is a reasonable method to develop effective vaccines by joining epitopes and immunogenic domains of the SARS-COV-2 proteins. The conformational characteristics of viral epitopes can make them become good candidate antigens (María et al., 2017). Structural properties (structural stability of peptides, solvent exposure, hydrophobicity and codon optimization) are used to map antigen epitopes to detect conformational features that may affect immunogenicity. In a word, structural vaccine technologies have been used to predict candidate vaccines for SARS-COV-2 via molecular docking, dynamics simulations, and homology modeling. These techniques are helpful to identify effective vaccination for the prevention of COVID-19 infection (Chukwudozie et al., 2021b). Although many bioinformatics tools and algorithms have been used to develop safe candidate vaccines for COVID-19, there are still some limitations in fully integrating this prediction method in the design and development of traditional vaccines. Compared with most other DNA viruses with stable genomes such as smallpox virus, novel coronavirus has a distinctive feature and is more prone to gene mutations because it is an RNA virus, which is likely to change the nature of the virus itself and also causes a great reduction in the effectiveness of the vaccine. One of the main deficiencies of bioinformatics in finding effective vaccines against COVID-19 is that it can not solve the long-standing problem of selecting a suitable animal model to test vaccine candidates. In addition, reverse vaccinology can only target proteins and identify linear and discontinuous epitopes, which may lead to some predicted epitopes hidden in viral proteins and not easy to detect in vivo (Silva-Arrieta et al., 2020). Also, for the immunoinformatics screening, the pleomorphism of MHC class I molecules may prevent the precision of vaccines designing (Greenbaum et al., 2011). These may be the reasons for the failure of candidate vaccine screening. Ultimately, it is required to solve these limitations to improve the safety and effectiveness of vaccine candidates against SARS-COV-2 or other novel viruses. Modern vaccines research is inseparable from bioinformatics technology. As of September 22, 2021, there are 13 different vaccines have been administered by researchers around the world and more than 5.87 million people have been vaccinated. For instance, the Pfizer/BioNtech Comirnaty vaccine was listed for WHO Emergency Use Listing (EUL) on December 31, 2020. The COVID-19 vaccine of Sinopharm was successfully launched in EUL on May 7, 2021 which is produced by Beijing Bio-Institute of Biological Products Co Ltd, subsidiary of China National Biotec Group (CNBG). Although the efficacy of several COVID-19 vaccines is amazingly high, it has not been reached 100% yet. The SARS-COV-2 continues to mutate, but the current viral mutations have not caused a great impact on vaccine development and disease diagnosis and treatment. The protective effectiveness of the two doses BNT162b2 and ChAdOx1 vaccines against B.1.617.2 (delta) variant of SARS-COV-2 were 88% and 67%, respectively (Lopez Bernal et al., 2021). No matter how the virus mutates, the more we know about its variants, the better the vaccines that will be developed by combining bioinformatics methods and biological technologies. The pandemic is not over yet, and whether it will return in the future remains unknown. Even if the pandemic ends suddenly, we should continue to develop the most promising vaccine candidates.

Conclusion

We reviewed and systematically analyzed the current studies on the characteristics of SARS-CoV-2 based on bioinformatics methods as follows. Firstly, a large number of computer resources have been used to sequence the genome and variants of the novel coronavirus, and the sequencing results have been compared to find the source of the virus and the maximum probability of host transmission combining bioinformatics methods and other technologies. At present, the sequence resources on SARS-CoV-2 are evolving very rapidly. The simple sequence alignment of viruses consumes little resources and can be done using online databases. However, to accelerate complex and detailed research, high-performance computing environments are necessary. High quality SARS-CoV-2 genome was assembled by the NGS and TGS technology. After sequencing alignment and phylogenetic analysis, evolutionary characteristics indicated that the bat may be the natural host of the novel coronavirus. However, the origin and intermediate host of the virus has not been fully determined, so the sequence alignment and mutations of individuals need to be constantly continued. By comparing different virus detection methods, they have different advantages in different aspects. qRT-PCR is the gold standard approach for virus detection and auxiliary clinical diagnosis. Other SARS-CoV2 detection methods have different advantages and provide comprehensive support. Secondly, so far, the protein three-dimensional structure and RNA sequence characteristics of SARS-CoV-2 will help us to understand the structural basis and functional characteristics of coronavirus entering the target cells. ACE2, as an entry receptor, provides the entry point for SARS-CoV2 to infect human cells, acts as a drug target to treat COVID-19 patients. Bioinformatics analysis of single-cell RNA sequencing provide abundant resource for discovering the mechanism of immune response induced by COVID-19. The up-regulation of S100A8/A9 on megakaryocytes and monocytes in severe patients mainly causes cytokine storm. A large number of drug databases provide the basis and possibility for drug research of SARS-CoV-2. Molecular dynamics simulation, molecular docking and artificial intelligence technology of high-throughput computing and bioinformatics methods are applied to accelerate and support the design and screening of auxiliary drugs. Several effective drugs for the treatment of SARS-CoV-2 were screened, such as ledipasvir, velpatasvir and remdisvir and so on. In addition, computational approaches can screen effective vaccines against SARS-COV-2 in a short time through reverse vaccinology, Immunoinformatics and structural vaccinology. However, whether these antiviral drugs have a definite effect on the treatment of patients with COVID-19 and whether vaccines can protect against disease, still requires solid data from clinical trials. Bioinformatics tools will continue to play a key and complementary role in all stages of the COVID-19 pandemic.

Ethics approval and consent to participate

Not applicable.

CRediT authorship contribution statement

Lifei Ma: Conceptualization, Investigation, Methodology, Data curation, Resources, Software, Writing – review & editing, Funding acquisition, Supervision, Visualization. Huiyang Li: Conceptualization, Investigation, Methodology, Data curation, Resources, Software, Writing – review & editing, Visualization. Jinping Lan: Resources, Writing – review & editing, Visualization. Xiuqing Hao: Resources, Writing – review & editing. Huiying Liu: Writing – review & editing. Xiaoman Wang: Conceptualization, Resources, Methodology, Writing -– review & editing, Supervision. Yong Huang: Conceptualization, Methodology, Resources, Writing – review & editing, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

10 in total

Review 1. Nanoscale Technologies in the Fight against COVID-19: From Innovative Nanomaterials to Computer-Aided Discovery of Potential Antiviral Plant-Derived Drugs.

Authors: Nunzio Iraci; Carmelo Corsaro; Salvatore V Giofrè; Giulia Neri; Angela Maria Mezzasalma; Martina Vacalebre; Antonio Speciale; Antonina Saija; Francesco Cimino; Enza Fazio
Journal: Biomolecules Date: 2022-07-30

2. Bioinformaticians: the hidden heroes of the COVID-19 pandemic.

Authors: Ebony Torrington
Journal: Biotechniques Date: 2022-04-05 Impact factor: 2.746

3. An Implementation Strategy to Develop Sustainable Surveillance Activities Through Adoption of a Target Operating Model.

Authors: Natalie K Lee; Miles A Stewart; Jessica S Dymond; Sheri L Lewis
Journal: Front Public Health Date: 2022-04-06

4. Insights on the evolution of Coronavirinae in general, and SARS-CoV-2 in particular, through innovative biocomputational resources.

Authors: Daniel Andrés Dos Santos; María Celina Reynaga; Juan Cruz González; Gabriela Fontanarrosa; María de Lourdes Gultemirian; Agustina Novillo; Virginia Abdala
Journal: PeerJ Date: 2022-07-25 Impact factor: 3.061

Review 5. Modern drug discovery applications for the identification of novel candidates for COVID-19 infections.

Authors: Isha Rani; Avjit Kalsi; Gagandeep Kaur; Pankaj Sharma; Sumeet Gupta; Rupesh K Gautam; Hitesh Chopra; Shabana Bibi; Syed Umair Ahmad; Inderbir Singh; Manish Dhawan; Talha Bin Emran
Journal: Ann Med Surg (Lond) Date: 2022-07-12

6. Discovering common pathogenetic processes between COVID-19 and sepsis by bioinformatics and system biology approach.

Authors: Lu Lu; Le-Ping Liu; Rong Gui; Hang Dong; Yan-Rong Su; Xiong-Hui Zhou; Feng-Xia Liu
Journal: Front Immunol Date: 2022-08-31 Impact factor: 8.786

7. Single-cell RNA sequencing to decipher the immunogenicity of ChAdOx1 nCoV-19/AZD1222 and mRNA-1273 vaccines in patients with autoimmune rheumatic diseases.

Authors: Yen-Ju Chen; Po-Liang Cheng; Wen-Nan Huang; Hsin-Hua Chen; Hong-Wei Chen; Jun-Peng Chen; Ching-Tsai Lin; Kuo-Tung Tang; Wei-Ting Hung; Tsu-Yi Hsieh; Yi-Hsing Chen; Yi-Ming Chen; Tzu-Hung Hsiao
Journal: Front Immunol Date: 2022-08-01 Impact factor: 8.786

8. Deciphering the Potential of Pre and Pro-Vitamin D of Mushrooms against Mpro and PLpro Proteases of COVID-19: An In Silico Approach.

Authors: Abhay Tiwari; Garima Singh; Gourav Choudhir; Mohit Motiwale; Nidhi Joshi; Vasudha Sharma; Rupesh K Srivastava; Satyawati Sharma; Marco Tutone; Pradeep Kumar Singour
Journal: Molecules Date: 2022-08-31 Impact factor: 4.927

9. High throughput SARS-CoV-2 variant analysis using molecular barcodes coupled with next generation sequencing.

Authors: Lyora A Cohen-Aharonov; Annie Rebibo-Sabbah; Adar Yaacov; Roy Z Granit; Merav Strauss; Raul Colodner; Ori Cheshin; Shai Rosenberg; Ronen Eavri
Journal: PLoS One Date: 2022-06-21 Impact factor: 3.752

10. Network-Based Data Analysis Reveals Ion Channel-Related Gene Features in COVID-19: A Bioinformatic Approach.

Authors: Hao Zhang; Ting Feng
Journal: Biochem Genet Date: 2022-09-14 Impact factor: 2.220

10 in total