Nothing in SARS-CoV-2 makes sense except in the light of RNA modification?

Mingmei Zhao, Chunxiao Li, Yu Dong, Xuekun Wang, Wenqing Jiang, Yaogang Chen

Abstract

The expression pattern of RNA deaminases determines the mutation and evolution of SARS-CoV-2.
© 2022 Future Medicine Ltd.

Keywords:  ADAR; APOBEC; RNA deamination; SARS-CoV-2; expression pattern

Year:  2022        PMID: 35873408      PMCID: PMC9302237          DOI: 10.2217/fvl-2022-0043

Source DB:  PubMed          Journal:  Future Virol        ISSN: 1746-0794            Impact factor:   3.015


COVID-19, caused by SARS-CoV-2, is still causing damage worldwide. One of the most challenging issues is the continuous mutation and evolution of the SARS-CoV-2 sequence. Viral mutations may alter the pathogenesis, virulence and vaccine resistance of the virus, complicating the prevention and diagnosis of the disease. Understanding the molecular mechanism underlying the evolution of the virus is therefore necessary. In this article, we propose that anyone studying the mutation and evolution of SARS-CoV-2 will inevitably arrive at the topic of RNA deamination systems in hosts. Without considering RNA deamination, a study on SARS-CoV-2 would hardly make sense [1]. In homage to the evolutionary biologist Theodosius Dobzhansky [2], we claim that virtually nothing in SARS-CoV-2 makes sense except in the light of RNA deamination.

Evidence of virus evolution driven by deamination systems of hosts

Although the mutation of SARS-CoV-2 has been one of the most worrying developments of the pandemic, we should keep in mind that the mutation of this positive-sense ssRNA virus must take place in cellular systems. One mutation mechanism is replication error, under which mutations should (i) occur at a rate of 1E–8 and (ii) exhibit a symmetric distribution across all types of nucleotide substitution [3,4]. In reality, however, the mutation profile of SARS-CoV-2 (at both fixed and polymorphic sites) is clearly skewed toward the C-to-U(T) substitution [5-7], and the occurrence of mutations is apparently higher than 1E–8 [8]. These facts indicate that (i) there are mutation sources other than replication errors and (ii) traditional evolutionary algorithms and formulas should not be applied directly to SARS-CoV-2 evolution, because the mutation sources differ in nature [9,10]. Fortunately, researchers have found that C-to-U deamination by APOBECs [5,11] and A-to-I deamination by adenosine deaminases acting on RNA (ADARs) [3] could nicely explain the most prevalent C-to-U substitution and the second-most prevalent A-to-G substitution (inosine is read as guanosine) in the SARS-CoV-2 sequence. Deamination events occur at a rate much higher than 1E–8 and produce an asymmetric mutation profile [8]. Therefore, any study dealing with the mutation and evolution of SARS-CoV-2, whatever its original purpose, is actually dealing with the evolutionary patterns of RNA deamination sites. This is why nothing in SARS-CoV-2 makes sense except in the light of RNA deamination.
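
The contrast between a symmetric replication-error spectrum and the observed skew can be illustrated with a minimal sketch. The Python below tallies the 12 possible substitution types between a reference and a set of aligned variant sequences, then reports the share of each type; under a perfectly symmetric model every type would hold a share of 1/12. The function names and the toy sequences are illustrative assumptions, not data or methods from the cited studies.

```python
from collections import Counter

BASES = "ACGU"

def substitution_spectrum(reference, variants):
    """Count each substitution type (ref>alt) across aligned variant sequences."""
    counts = Counter()
    for seq in variants:
        if len(seq) != len(reference):
            raise ValueError("sequences must be aligned to the same length")
        for ref_base, alt_base in zip(reference, seq):
            # Ignore matches and any non-ACGU symbol (gaps, ambiguity codes).
            if ref_base != alt_base and ref_base in BASES and alt_base in BASES:
                counts[f"{ref_base}>{alt_base}"] += 1
    return counts

def spectrum_shares(counts):
    """Convert raw counts into the fraction of all substitutions per type."""
    total = sum(counts.values())
    return {kind: n / total for kind, n in counts.items()} if total else {}

# Toy data (hypothetical): three C>U changes and one A>G change.
reference = "ACGUCCAGCU"
variants = [
    "AUGUCCAGCU",
    "ACGUUCAGCU",
    "ACGUCCGGCU",
    "ACGUCUAGCU",
]

counts = substitution_spectrum(reference, variants)
shares = spectrum_shares(counts)
symmetric_share = 1 / 12  # expectation if all 12 types were equally likely

for kind, share in sorted(shares.items(), key=lambda kv: -kv[1]):
    print(f"{kind}: {share:.2f} (symmetric expectation {symmetric_share:.2f})")
```

With real data one would also normalize by the base composition of the reference, since a genome rich in one base offers more opportunities for substitutions starting from that base.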

Expression pattern of RNA deaminases explains the mutation profile of SARS-CoV-2

To comprehend the evolutionary process behind the mutation profile of SARS-CoV-2, one should first understand how the RNA deaminases (ADARs and APOBECs) work. (1) Tissue specificity: the ADAR family has three members in humans (ADAR1–3), all of which are considerably expressed in the lungs and nervous system. APOBECs are encoded by multiple genes, the expression of which is also enriched in the lungs and nerves [12]. This suggests that when SARS-CoV-2 infects human lungs, it will be highly ‘mutable’ there owing to the tissue-specific expression of RNA deaminases. (2) Subcellular localization: ADAR2 and ADAR3 are located in the nucleus, while only one isoform of ADAR1 (p150) is present in the cytosol [13]. In contrast, APOBECs have a much wider distribution, ranging from the nucleus to the cytosol as well as other cellular compartments. Since SARS-CoV-2 mainly occupies the cytosol, the localization of the deaminases explains why C-to-U sites are more prevalent than A-to-I sites in SARS-CoV-2 (although both deamination types are already much more prevalent than other substitutions). (3) Sequence preference: ADAR3 is inactive in mammals, ADAR2 preferentially deaminates coding sequences and ADAR1 mainly deaminates noncoding regions [14]. Since most of the SARS-CoV-2 ‘genome’ is coding region, it is intuitive to think that ADAR2 should be the ‘chief editor’ of A-to-I events in SARS-CoV-2. Unfortunately, the nucleus-located ADAR2 generally has no access to SARS-CoV-2 RNA. In contrast, the widely expressed APOBECs have no preference for coding or noncoding regions, so C-to-U sites are catalyzed in an unbiased manner. This dilemma for ADARs again explains why A-to-I substitutions are much fewer than C-to-U substitutions in SARS-CoV-2. Altogether, the expression patterns of RNA deaminases explain the mutation profile of SARS-CoV-2, and there is no reason to omit this information when studying the sequence evolution of SARS-CoV-2.

SARS-CoV-2 has the potential to infect the human nervous system, a tissue with abundant RNA deaminases

We have noted that the expression of ADARs and APOBECs is high in the lungs and nervous system. Interestingly, apart from the well-known infection of the lungs, many cases suggest that SARS-CoV-2 can also infect the nervous system, leading to neurological symptoms such as encephalopathy and delirium [15,16]. This scenario gives the impression that SARS-CoV-2 selectively (or ‘deliberately’) infects the tissues with high expression of RNA deaminases, accelerating its own mutation and evolution. However, we emphasize that no intent on the part of the virus is required. The observation of SARS-CoV-2 infecting the nervous system might simply be the result of natural selection. One possible evolutionary trajectory is that, among all tissues and cell types, only the lungs and nervous system force the viral sequence to change rapidly, providing more options for the virus to increase its fitness in hosts. Consequently, only the SARS-CoV-2 strains carrying adaptive mutations (which are acquired randomly) survive, and these selectively maintained strains inherit the ability to infect the nervous system. Without considering the RNA deaminases, one could hardly understand why SARS-CoV-2 infects the lungs and nervous system. Even with the structural evidence that only particular receptors on the cell membrane can interact with SARS-CoV-2, this antigen–receptor relationship is highly susceptible to mutations. Natural selection is the only force that can create a new host–virus relationship (by positive selection) or maintain an existing one (by purifying selection).

Parameters like linkage disequilibrium are only meaningful at the single-molecule level, not at the individual-host level

In the extensive literature on SARS-CoV-2 evolution, one commonly sees traditional analyses of linkage disequilibrium (LD), recombination, Theta-Pi or Tajima’s D [17]. Here, in the light of RNA deaminases, we point out the flaws and paradoxes behind these previous studies. Each human individual is a diploid, so LD analysis can be performed across a human population. But for SARS-CoV-2, if one intends to compute LD, how should ‘a virus individual’ or a ‘haplotype’ be defined? The millions of viral sequences isolated from a single host (namely, one sequencing library) would be highly polymorphic owing to promiscuous deamination by ADARs and APOBECs (a phenomenon termed ‘intra-host polymorphism’). In theory, each RNA molecule should be regarded as ‘a virus individual’ and the unit of LD should be each ‘single sequencing read’. Unfortunately, in most SARS-CoV-2 literature dealing with ‘strains’ [18,19], one sequencing library from one host is regarded as ‘one strain’ for LD analysis [17], regardless of the intra-host polymorphisms caused by deamination. Moreover, misdefining ‘a virus individual’ also affects the definition of population size N. It remains debatable whether the N of SARS-CoV-2 refers to the number of infected humans, the number of infected cells or the number of SARS-CoV-2 RNA molecules. When RNA deamination and intra-host polymorphisms are considered, it becomes evident that each SARS-CoV-2 molecule should be treated as a virus individual. Likewise, evolutionary analyses of recombination, nucleotide diversity (Theta-Pi), selection strength (Tajima’s D) and allele frequency should also take deamination events into account [20].
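
The difference between the two definitions of ‘a virus individual’ can be sketched in a few lines of Python. The example below computes the standard LD coefficient D and r² between two sites, first treating each full-length read as one haplotype and then treating the per-host consensus as the only ‘strain’. The helper names (`linkage_disequilibrium`, `consensus`) and the five toy reads are hypothetical, and the sketch assumes reads long enough to span both sites.

```python
import math
from collections import Counter

def linkage_disequilibrium(haplotypes, site_i, site_j, allele_i, allele_j):
    """Return (D, r2) for two sites, treating each haplotype as one individual."""
    n = len(haplotypes)
    p_i = sum(h[site_i] == allele_i for h in haplotypes) / n
    p_j = sum(h[site_j] == allele_j for h in haplotypes) / n
    p_ij = sum(h[site_i] == allele_i and h[site_j] == allele_j
               for h in haplotypes) / n
    d = p_ij - p_i * p_j
    denom = p_i * (1 - p_i) * p_j * (1 - p_j)
    # r2 is undefined when either site is monomorphic in the sample.
    r2 = d * d / denom if denom > 0 else math.nan
    return d, r2

def consensus(reads):
    """Collapse one host's reads into a single majority-rule sequence."""
    return "".join(Counter(column).most_common(1)[0][0] for column in zip(*reads))

# Toy intra-host library (hypothetical): two sites, each read spans both.
reads = ["UG", "UG", "UG", "CA", "CA"]

# Single-molecule view: every read is an individual.
d_reads, r2_reads = linkage_disequilibrium(reads, 0, 1, "U", "G")

# 'One library = one strain' view: only the consensus survives.
strain = consensus(reads)
d_strain, r2_strain = linkage_disequilibrium([strain], 0, 1, "U", "G")

print(f"per-read:   D={d_reads:.2f}, r2={r2_reads:.2f}")
print(f"per-strain: consensus={strain}, D={d_strain:.2f}, r2={r2_strain}")
```

In the per-read view the two alleles are fully linked, whereas collapsing the library to its consensus erases the minority haplotype, so within that host no linkage can be estimated at all.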

Future perspective

In this article, we have argued that the expression patterns of human RNA deaminases determine the tissue specificity of SARS-CoV-2 infection, the abundance of mutations and the fast evolution of the virus. We are also concerned that the previous definition of ‘a SARS-CoV-2 strain/haplotype’ should be updated because of the high level of intra-host polymorphism. We urge that next-generation sequencing of SARS-CoV-2 isolates use longer reads that cover the whole RNA, in order to determine the linkage of distantly located mutation sites. Allele frequency and diversity analyses should be performed at the single-molecule level. Moreover, unlike in human population genetics, the prevalence of mutations in SARS-CoV-2 is dictated by an additional factor: the activity of deaminases in the host. Therefore, researchers should seriously consider what the mutation spectrum tells us before drawing conclusions from incomplete observations.
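
As a rough illustration of what single-molecule allele frequency and diversity analyses involve, the Python sketch below treats each read as one individual and computes an allele frequency at one site together with nucleotide diversity (Theta-Pi, the mean number of pairwise differences per site). The function names and the four toy reads are assumptions made for this example; real reads would first need alignment and error filtering.

```python
from itertools import combinations

def allele_frequency(reads, site, allele):
    """Fraction of reads (molecules) carrying `allele` at `site`."""
    return sum(read[site] == allele for read in reads) / len(reads)

def nucleotide_diversity(reads):
    """Mean pairwise differences per site over all pairs of reads."""
    pairs = list(combinations(reads, 2))
    if not pairs:
        return 0.0
    length = len(reads[0])
    total_diffs = sum(
        sum(a != b for a, b in zip(x, y)) for x, y in pairs
    )
    return total_diffs / (len(pairs) * length)

# Toy intra-host reads (hypothetical): half carry a C-to-U change at site 1.
reads = ["ACGU", "AUGU", "AUGU", "ACGU"]

freq_u = allele_frequency(reads, 1, "U")
pi = nucleotide_diversity(reads)

print(f"frequency of U at site 1: {freq_u:.2f}")
print(f"nucleotide diversity: {pi:.4f}")
```

If the same library were reduced to one consensus sequence, both quantities would be computed across hosts only, and the within-host variation shown here would not enter the estimate.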
References (10 of 16 listed)

1.  APOBECs and virus restriction. (Review)

Authors:  Reuben S Harris; Jaquelin P Dudley
Journal:  Virology       Date:  2015-03-26       Impact factor: 3.616

2.  Mutation profile of over 4500 SARS-CoV-2 isolations reveals prevalent cytosine-to-uridine deamination on viral RNAs.

Authors:  Yue Li; Xinai Yang; Na Wang; Haiyan Wang; Bin Yin; Xiaoping Yang; Wenqing Jiang
Journal:  Future Microbiol       Date:  2020-09       Impact factor: 3.165

3.  Evidence for selection on SARS-CoV-2 RNA translation revealed by the evolutionary dynamics of mutations in UTRs and CDSs.

Authors:  Lin Zhu; Qi Wang; Weiyu Zhang; Hao Hu; Kexin Xu
Journal:  RNA Biol       Date:  2022-01       Impact factor: 4.766

4.  Fast evolution of SARS-CoV-2 driven by deamination systems in hosts.

Authors:  Yanping Zhang; Wen Jiang; Yan Li; Xiaojie Jin; Xiaoping Yang; Pirun Zhang; Wenqing Jiang; Bin Yin
Journal:  Future Virol       Date:  2021-09-10       Impact factor: 3.015

5.  Rampant C-to-U deamination accounts for the intrinsically high mutation rate in SARS-CoV-2 spike gene.

Authors:  Xueyan Liu; Xuan Liu; Jie Zhou; Yu Dong; Wen Jiang; Wenqing Jiang
Journal:  RNA       Date:  2022-05-04       Impact factor: 5.636

6.  SARS-CoV-2: Possible recombination and emergence of potentially more virulent strains.

Authors:  Dania Haddad; Sumi Elsa John; Anwar Mohammad; Maha M Hammad; Prashantha Hebbar; Arshad Channanath; Rasheeba Nizam; Sarah Al-Qabandi; Ashraf Al Madhoun; Abdullah Alshukry; Hamad Ali; Thangavel Alphonse Thanaraj; Fahd Al-Mulla
Journal:  PLoS One       Date:  2021-05-25       Impact factor: 3.240

7.  Reconciling the debate on deamination on viral RNA.

Authors:  Lai Wei
Journal:  J Appl Genet       Date:  2022-05-04       Impact factor: 2.653

8.  Evidence for host-dependent RNA editing in the transcriptome of SARS-CoV-2.

Authors:  Salvatore Di Giorgio; Filippo Martignano; Maria Gabriella Torcia; Giorgio Mattiuz; Silvestro G Conticello
Journal:  Sci Adv       Date:  2020-06-17       Impact factor: 14.957

9.  Frequent neurologic manifestations and encephalopathy-associated morbidity in Covid-19 patients.

Authors:  Eric M Liotta; Ayush Batra; Jeffrey R Clark; Nathan A Shlobin; Steven C Hoffman; Zachary S Orban; Igor J Koralnik
Journal:  Ann Clin Transl Neurol       Date:  2020-10-05       Impact factor: 4.511

10.  GC usage of SARS-CoV-2 genes might adapt to the environment of human lung expressed genes.

Authors:  Yue Li; Xinai Yang; Na Wang; Haiyan Wang; Bin Yin; Xiaoping Yang; Wenqing Jiang
Journal:  Mol Genet Genomics       Date:  2020-09-04       Impact factor: 3.291
