Vladimir Makarenkov1, Bogdan Mazoure2, Guillaume Rabusseau2,3, Pierre Legendre4. 1. Département d'informatique, Université du Québec à Montréal, Montreal, QC, Canada. makarenkov.vladimir@uqam.ca. 2. Montreal Institute for Learning Algorithms (Mila), Montreal, QC, Canada. 3. Département d'informatique et de Recherche Opérationnelle, Université de Montréal and Canada CIFAR AI Chair, Montreal, QC, Canada. 4. Département de Sciences Biologiques, Université de Montréal, C. P. 6128, Succursale Centre-Ville, Montreal, QC, H3C 3J7, Canada.
Abstract
BACKGROUND: The SARS-CoV-2 pandemic is one of the greatest global medical and social challenges that have emerged in recent history. Human coronavirus strains discovered during previous SARS outbreaks have been hypothesized to pass from bats to humans using intermediate hosts, e.g. civets for SARS-CoV and camels for MERS-CoV. The discovery of an intermediate host of SARS-CoV-2 and the identification of specific mechanism of its emergence in humans are topics of primary evolutionary importance. In this study we investigate the evolutionary patterns of 11 main genes of SARS-CoV-2. Previous studies suggested that the genome of SARS-CoV-2 is highly similar to the horseshoe bat coronavirus RaTG13 for most of the genes and to some Malayan pangolin coronavirus (CoV) strains for the receptor binding (RB) domain of the spike protein. RESULTS: We provide a detailed list of statistically significant horizontal gene transfer and recombination events (both intergenic and intragenic) inferred for each of 11 main genes of the SARS-CoV-2 genome. Our analysis reveals that two continuous regions of genes S and N of SARS-CoV-2 may result from intragenic recombination between RaTG13 and Guangdong (GD) Pangolin CoVs. Statistically significant gene transfer-recombination events between RaTG13 and GD Pangolin CoV have been identified in region [1215-1425] of gene S and region [534-727] of gene N. Moreover, some statistically significant recombination events between the ancestors of SARS-CoV-2, RaTG13, GD Pangolin CoV and bat CoV ZC45-ZXC21 coronaviruses have been identified in genes ORF1ab, S, ORF3a, ORF7a, ORF8 and N. Furthermore, topology-based clustering of gene trees inferred for 25 CoV organisms revealed a three-way evolution of coronavirus genes, with gene phylogenies of ORF1ab, S and N forming the first cluster, gene phylogenies of ORF3a, E, M, ORF6, ORF7a, ORF7b and ORF8 forming the second cluster, and phylogeny of gene ORF10 forming the third cluster. CONCLUSIONS: The results of our horizontal gene transfer and recombination analysis suggest that SARS-CoV-2 could not only be a chimera virus resulting from recombination of the bat RaTG13 and Guangdong pangolin coronaviruses but also a close relative of the bat CoV ZC45 and ZXC21 strains. They also indicate that a GD pangolin may be an intermediate host of this dangerous virus.
BACKGROUND: The SARS-CoV-2 pandemic is one of the greatest global medical and social challenges that have emerged in recent history. Human coronavirus strains discovered during previous SARS outbreaks have been hypothesized to pass from bats to humans using intermediate hosts, e.g. civets for SARS-CoV and camels for MERS-CoV. The discovery of an intermediate host of SARS-CoV-2 and the identification of specific mechanism of its emergence in humans are topics of primary evolutionary importance. In this study we investigate the evolutionary patterns of 11 main genes of SARS-CoV-2. Previous studies suggested that the genome of SARS-CoV-2 is highly similar to the horseshoebat coronavirus RaTG13 for most of the genes and to some Malayan pangolincoronavirus (CoV) strains for the receptor binding (RB) domain of the spike protein. RESULTS: We provide a detailed list of statistically significant horizontal gene transfer and recombination events (both intergenic and intragenic) inferred for each of 11 main genes of the SARS-CoV-2 genome. Our analysis reveals that two continuous regions of genes S and N of SARS-CoV-2 may result from intragenic recombination between RaTG13 and Guangdong (GD) Pangolin CoVs. Statistically significant gene transfer-recombination events between RaTG13 and GD PangolinCoV have been identified in region [1215-1425] of gene S and region [534-727] of gene N. Moreover, some statistically significant recombination events between the ancestors of SARS-CoV-2, RaTG13, GD PangolinCoV and batCoV ZC45-ZXC21 coronaviruses have been identified in genes ORF1ab, S, ORF3a, ORF7a, ORF8 and N. Furthermore, topology-based clustering of gene trees inferred for 25 CoV organisms revealed a three-way evolution of coronavirus genes, with gene phylogenies of ORF1ab, S and N forming the first cluster, gene phylogenies of ORF3a, E, M, ORF6, ORF7a, ORF7b and ORF8 forming the second cluster, and phylogeny of gene ORF10 forming the third cluster. CONCLUSIONS: The results of our horizontal gene transfer and recombination analysis suggest that SARS-CoV-2 could not only be a chimera virus resulting from recombination of the bat RaTG13 and Guangdong pangolin coronaviruses but also a close relative of the batCoV ZC45 and ZXC21 strains. They also indicate that a GD pangolin may be an intermediate host of this dangerous virus.
Authors: Eric Bapteste; Leo van Iersel; Axel Janke; Scot Kelchner; Steven Kelk; James O McInerney; David A Morrison; Luay Nakhleh; Mike Steel; Leen Stougie; James Whitfield Journal: Trends Genet Date: 2013-06-11 Impact factor: 11.639
Authors: Dennis A Benson; Ilene Karsch-Mizrachi; David J Lipman; James Ostell; David L Wheeler Journal: Nucleic Acids Res Date: 2007-12-11 Impact factor: 16.971
Authors: Stacey B Whitbourne; Xuan-Mai T Nguyen; Rebecca J Song; Emily Lord; Michelle Lyden; Kelly M Harrington; Rachel Ward; Yanping Li; Jessica V V Brewer; Kelly M Cho; Luc Djousse; Sumitra Muralidhar; Philip S Tsao; J Michael Gaziano; Juan P Casas Journal: PLoS One Date: 2022-04-25 Impact factor: 3.752
Authors: M Alejandra Tortorici; Nadine Czudnochowski; Tyler N Starr; Roberta Marzi; Alexandra C Walls; Fabrizia Zatta; John E Bowen; Stefano Jaconi; Julia di Iulio; Zhaoqian Wang; Anna De Marco; Samantha K Zepeda; Dora Pinto; Zhuoming Liu; Martina Beltramello; Istvan Bartha; Michael P Housley; Florian A Lempp; Laura E Rosen; Exequiel Dellota; Hannah Kaiser; Martin Montiel-Ruiz; Jiayi Zhou; Amin Addetia; Barbara Guarino; Katja Culap; Nicole Sprugasci; Christian Saliba; Eneida Vetti; Isabella Giacchetto-Sasselli; Chiara Silacci Fregni; Rana Abdelnabi; Shi-Yan Caroline Foo; Colin Havenar-Daughton; Michael A Schmid; Fabio Benigni; Elisabetta Cameroni; Johan Neyts; Amalio Telenti; Gyorgy Snell; Herbert W Virgin; Sean P J Whelan; Jesse D Bloom; Davide Corti; David Veesler; Matteo Samuele Pizzuto Journal: bioRxiv Date: 2021-04-08
Authors: M Alejandra Tortorici; Nadine Czudnochowski; Tyler N Starr; Roberta Marzi; Alexandra C Walls; Fabrizia Zatta; John E Bowen; Stefano Jaconi; Julia Di Iulio; Zhaoqian Wang; Anna De Marco; Samantha K Zepeda; Dora Pinto; Zhuoming Liu; Martina Beltramello; Istvan Bartha; Michael P Housley; Florian A Lempp; Laura E Rosen; Exequiel Dellota; Hannah Kaiser; Martin Montiel-Ruiz; Jiayi Zhou; Amin Addetia; Barbara Guarino; Katja Culap; Nicole Sprugasci; Christian Saliba; Eneida Vetti; Isabella Giacchetto-Sasselli; Chiara Silacci Fregni; Rana Abdelnabi; Shi-Yan Caroline Foo; Colin Havenar-Daughton; Michael A Schmid; Fabio Benigni; Elisabetta Cameroni; Johan Neyts; Amalio Telenti; Herbert W Virgin; Sean P J Whelan; Gyorgy Snell; Jesse D Bloom; Davide Corti; David Veesler; Matteo Samuele Pizzuto Journal: Nature Date: 2021-07-19 Impact factor: 69.504
Authors: Kyent-Yon Yie; Tsair-Wei Chien; Yu-Tsen Yeh; Willy Chou; Shih-Bin Su Journal: Int J Environ Res Public Health Date: 2021-03-03 Impact factor: 3.390