Aim: Inferences about coronavirus evolution rely largely on mutations observed in the SARS-CoV-2 genome. Misinterpreting these mutations would give a misleading picture of SARS-CoV-2 evolution. Materials & methods: From 4521 SARS-CoV-2 lines, we obtained 3169 unique point mutation sites. We counted the mutations of each type and calculated the minor allele frequency (MAF) for each mutation type. Results: Nearly half of the point mutations are C-T mismatches and 20% are A-G mismatches. The MAF of C-T and A-G mismatches is significantly higher than that of other mutation types. Conclusion: The excess of C-T mismatches does not resemble a random mutation profile. These mismatches are likely caused by the cytosine-to-uridine deamination system in hosts.
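The counting step described in the Materials & methods can be illustrated with a minimal sketch. The abstract does not say how variants were called or exactly how MAF was defined, so everything below is an assumption made for illustration: the input is taken to be a set of pre-aligned, equal-length sequences, a mismatch type ignores direction (C-to-T and T-to-C both count as "C-T"), and the MAF of a site is the frequency of its second most common base. The function names `variable_sites` and `summarize` are hypothetical, and the significance comparison between mutation types is not reproduced.

```python
from collections import Counter, defaultdict

VALID_BASES = set("ACGT")  # ambiguous calls and gaps are ignored (assumption)


def variable_sites(sequences):
    """Return (position, mismatch_type, maf) for every variable site.

    Assumes `sequences` is a list of aligned strings of equal length.
    """
    sites = []
    for pos in range(len(sequences[0])):
        counts = Counter(s[pos] for s in sequences if s[pos] in VALID_BASES)
        if len(counts) < 2:
            continue  # invariant site, or no callable bases at this position
        # Two most common bases at the site: major allele, then minor allele.
        (major, _), (minor, minor_n) = counts.most_common(2)
        mismatch = "-".join(sorted((major, minor)))  # e.g. "C-T", direction ignored
        maf = minor_n / sum(counts.values())
        sites.append((pos, mismatch, maf))
    return sites


def summarize(sites):
    """Map each mismatch type to (number of sites, mean MAF)."""
    by_type = defaultdict(list)
    for _, mismatch, maf in sites:
        by_type[mismatch].append(maf)
    return {t: (len(mafs), sum(mafs) / len(mafs)) for t, mafs in by_type.items()}


# Toy alignment, invented for illustration only (not data from the study).
toy_alignment = ["ACGTA", "ATGTA", "ATGTG", "ACGTA", "ACGTA"]
toy_summary = summarize(variable_sites(toy_alignment))
print(toy_summary)
```

On real data the same tally would show whether one mismatch type dominates both in site count and in mean MAF, which is the pattern the Results report for C-T and A-G.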
Authors: Francisco Barona-Gómez; Luis Delaye; Erik Díaz-Valenzuela; Fabien Plisson; Arely Cruz-Pérez; Mauricio Díaz-Sánchez; Christian A García-Sepúlveda; Alejandro Sanchez-Flores; Rafael Pérez-Abreu; Francisco J Valencia-Valdespino; Natali Vega-Magaña; José Francisco Muñoz-Valle; Octavio Patricio García-González; Sofía Bernal-Silva; Andreu Comas-García; Angélica Cibrián-Jaramillo
Journal: Microb Genom
Date: 2021-11