Literature DB >> 32296544

An emergent clade of SARS-CoV-2 linked to returned travellers from Iran.

John-Sebastian Eden1,2, Rebecca Rockett1,3,4, Ian Carter3, Hossinur Rahman3, Joep de Ligt5, James Hadfield6, Matthew Storey5, Xiaoyun Ren5, Rachel Tulloch1,2, Kerri Basile3, Jessica Wells3, Roy Byun7, Nicky Gilroy3, Matthew V O'Sullivan3,4, Vitali Sintchenko1,3,4, Sharon C Chen1,3,4, Susan Maddocks3, Tania C Sorrell1,2,3, Edward C Holmes1, Dominic E Dwyer1,3,4, Jen Kok3,4.   

Abstract

The SARS-CoV-2 epidemic has rapidly spread outside China with major outbreaks occurring in Italy, South Korea, and Iran. Phylogenetic analyses of whole-genome sequencing data identified a distinct SARS-CoV-2 clade linked to travellers returning from Iran to Australia and New Zealand. This study highlights potential viral diversity driving the epidemic in Iran, and underscores the power of rapid genome sequencing and public data sharing to improve the detection and management of emerging infectious diseases.
© The Author(s) 2020. Published by Oxford University Press.

Entities:  

Keywords:  COVID-19; SARS-CoV-2; genome sequencing; phylogenetics

Year:  2020        PMID: 32296544      PMCID: PMC7147362          DOI: 10.1093/ve/veaa027

Source DB:  PubMed          Journal:  Virus Evol        ISSN: 2057-1577


1. Introduction

From a public health perspective, the real-time whole-genome sequencing (WGS) of emerging viruses enables the informed development and design of molecular diagnostic assays, and tracing patterns of spread across multiple epidemiological scales (i.e. genomic epidemiology). However, WGS capacities and data sharing policies vary in different countries and jurisdictions, leading to potential sampling bias due to delayed or underrepresented sequencing data from some areas with substantial SARS-CoV-2 activity. Herein, we show that the genomic analyses of SARS-CoV-2 strains from Australian returned travellers with COVID-19 disease may provide important insights into viral diversity present in regions currently lacking genomic data.

2. SARS-CoV-2 emergence and dissemination

In late December 2019, a cluster of cases of pneumonia of unknown aetiology in Wuhan city, Hubei province, China was reported by health authorities (Wuhan Municipal Health Commission 2019). A novel betacoronavirus, designated SARS-CoV-2, was identified as the causative agent (Wu et al. 2020) of the disease now known as COVID-19, with substantial human-to-human transmission (Lu et al. 2020). To contain a growing epidemic, Chinese authorities implemented strict quarantine measures in Wuhan and surrounding areas in Hubei province. Significant delays in the global spread of the virus were achieved, but despite these measures, cases were exported to other countries. As of 9 March 2020, these numbered more than 100 countries, on all continents except Antarctica; the total number of confirmed infections exceeded 110,000 and there were nearly 4,000 deaths (Dong, Du, and Gardner 2020). Although the majority of cases have occurred in China, major outbreaks have also been reported in Italy, South Korea, and Iran (World Health Organisation 2020a). Importantly, there is widespread local transmission in multiple countries outside China following independent importations of infection from visitors and returned travellers.

3. WGS of SARS-CoV-2 cases in Australia and New Zealand

Viral extracts were prepared from respiratory tract samples where SARS-CoV-2 was detected by reverse-transcription polymerase chain reaction (RT-PCR) using World Health Organisation (2020b) recommended primers and probes targeting the E and RdRp genes. In New South Wales (NSW), Australia, WGS for SARS-CoV-2 was developed based on an existing amplicon-based Illumina sequencing approach (Di Giallonardo et al. 2018). Viral extracts were reverse transcribed with SSIV VILO cDNA master mix and then used as input for multiple overlapping PCR reactions (∼2.5 kb each) spanning the viral genome using Platinum SuperFi master mix (primers provided in Supplementary Table S1). Amplicons were pooled equally, purified, and quantified. Nextera XT libraries were prepared and sequencing was performed with multiplexing on an Illumina iSeq (300 cycle flow cell). In New Zealand, the ARTIC network protocol was used for WGS (Quick 2020). In short, 400-bp tiling amplicons designed with Primal Scheme (Grubaugh et al. 2019a) were used to amplify viral cDNA prepared with SuperScript III. A sequence library was then constructed using the Oxford NanoPore ligation sequencing kit and sequenced on a R9.4.1 MinION flow cell. Near-complete viral genomes were then assembled de novo in Geneious Prime 2020.0.5 or through reference mapping with RAMPART V1.0.6 (Hadfield 2019) using the ARTIC network nCoV-2019 novel coronavirus bioinformatics protocol (Loman and Rambaut 2020). In total, 13 SARS-CoV-2 genomes were sequenced from cases in NSW diagnosed between 24 January and 3 March 2020, as well as a single genome from the first patient in Auckland, New Zealand sampled on 27 February 2020 (Table 1). Australian and New Zealand sequences were aligned to global reference strains sourced from GISAID with MAFFT (Katoh 2002) and then compared phylogenetically using a maximum-likelihood approach—PhyML v2.2.4 (Guindon and Gascuel 2003).
Table 1.

SARS-CoV-2 genomes sequenced in this study

GISAID IDVirus nameLocationCollection dateTravel history
EPI_ISL_408976408976/Australia/Sydney-2/2020-01-22Sydney, Australia22 January 2020China
EPI_ISL_407893407893/Australia/NSW01/2020-01-24Sydney, Australia24 January 2020China
EPI_ISL_408977408977/Australia/Sydney-3/2020-01-25Sydney, Australia25 January 2020China
EPI_ISL_413490413490/New_Zealand/01/2020-02-27Auckland, New Zealand27 February 2020Iran
EPI_ISL_412975412975/Australia/NSW05/2020-02-28Sydney, Australia28February 2020Iran
EPI_ISL_413594413594/Australia/NSW08/2020-02-28Sydney, Australia28 February 2020SE Asia
EPI_ISL_413595413595/Australia/NSW09/2020-02-28Sydney, Australia28 February 2020SE Asia
EPI_ISL_413213413213/Australia/NSW06/2020-02-29Sydney, Australia29 February 2020Iran
EPI_ISL_413214413214/Australia/NSW07/2020-02-29Sydney, Australia29 February 2020None
EPI_ISL_413596413596/Australia/NSW10/2020-02-28Sydney, Australia1 March 2020SE Asia
EPI_ISL_413597413597/AUS/NSW11/2020-03-02Sydney, Australia2 March 2020Iran
EPI_ISL_413600413600/AUS/NSW14/2020-03-03Sydney, Australia3 March 2020None
EPI_ISL_413598413598/AUS/NSW12/2020-03-04Sydney, Australia4 March 2020Iran
EPI_ISL_413599413599/AUS/NSW13/2020-03-04Sydney, Australia4 March 2020Iran
SARS-CoV-2 genomes sequenced in this study

4. A distinct clade of SARS-CoV-2 identified in travellers returned from Iran

The Australian strains of SARS-CoV-2 were dispersed across the global SARS-CoV-2 phylogeny (Fig. 1A). The first four cases of COVID-19 disease in NSW occurred between 24 and 26 January 2020, and these were closely related with 1–2 single nucleotide polymorphisms (SNPs) difference to the dominant variant circulating in Wuhan at the time (prototype strain MN908947/SARS-CoV-2/Wuhan-Hu-1). As the four patients identified in this period had recently returned from China, this region was the likely source of infection. From 1 February 2020, travel to Australia from mainland China was restricted to returning Australian residents and their children, who were placed in home quarantine for 14 days. Despite the intensive testing of such returning travellers, no further cases of COVID-19 were detected in NSW until 28 February 2020, when SARS-COV-2 was detected in an individual returning from Iran (NSW05). A close contact of this individual also tested positive (NSW14) providing the first evidence of local transmission within NSW. This was followed by further Iran travel-linked cases in NSW (NSW06, NSW11, NSW12, and NSW13) and New Zealand (NZ01).
Figure 1.

Phylogenetic analysis of SARS-CoV-2 genome sequences highlighting a clade of imported cases from Iran. (A) Global diversity of circulating SARS-CoV-2 strains including Australian sequences (blue circles, n = 19). The prototype strain Wuhan-Hu-1 is shown as a red circle. An emergent clade containing cases imported from Iran is highlighted with grey shading. (B) Sub-tree showing the informative branch containing imported Iranian cases (highlighted with yellow squares) and defined by substitutions at positions G1397A, T28688C, and G29742T. Node support is provided as bootstrap values of 100 replicates. For both (A) and (B), the scales are proportional to the number of substitutions per site.

Phylogenetic analysis of SARS-CoV-2 genome sequences highlighting a clade of imported cases from Iran. (A) Global diversity of circulating SARS-CoV-2 strains including Australian sequences (blue circles, n = 19). The prototype strain Wuhan-Hu-1 is shown as a red circle. An emergent clade containing cases imported from Iran is highlighted with grey shading. (B) Sub-tree showing the informative branch containing imported Iranian cases (highlighted with yellow squares) and defined by substitutions at positions G1397A, T28688C, and G29742T. Node support is provided as bootstrap values of 100 replicates. For both (A) and (B), the scales are proportional to the number of substitutions per site. Of note, the genomes of all patients with a history of travel to Iran were part of a monophyletic group defined by three nucleotide substitutions (G1397A, T28688C, and G29742T) in the SARS-CoV-2 genome relative to the Wuhan prototype strain (Fig. 1B). G1397A and T28688C both occur in coding regions with G1397A producing a non-synonymous change (V378I) in the ORF1ab-encoded non-structural protein 2 region. G29742T occurs in the 3ʹ-UTR. In addition to the Australian and New Zealand strains, this clade also included a traveller who had returned to Canada from Iran (BC_37_0-2), providing further evidence of its likely link to the Iranian epidemic. Indeed, a search of all currently available GISAID sequences and metadata revealed no other complete genome sequences from patients with documented history of travel to or residence in Iran (as of 9 March 2020). A search of partial sequences identified two SARS-CoV-2 sequences which originated in Iran (413553/IRN/Tehran15AW/2020-02-28 and 413554/IRN/Tehran9BE/2020-02-23) spanning a 363 nt region of the viral nucleoprotein (N). Although short in length, these two sequences covered one of the informative SNPs defining this clade—T28688C, and both Iranian strains matched the sequences from patients with travel histories to Iran and grouped by phylogenetic analysis (Supplementary Figs. S1 and S2).

5. Discussion

Technological advancements and the widespread adoption of WGS in pathogen genomics have transformed public health and infectious disease outbreak responses (Popovich and Snitkin 2017). Previously, disease investigations often relied on the targeted sequencing of a small locus to identify genotypes and infer patterns of spread along with epidemiological data (Dudas and Bedford 2019). As seen with the recent West African Ebola (Dudas et al. 2017) and Zika virus epidemics (Grubaugh et al. 2018), rapid WGS significantly increases resolution of diagnosis and surveillance thereby strengthening links between genomic, clinical, and epidemiological data (Grenfell 2004), and potentially uncovering outbreaks in unsampled locations (Grubaugh et al. 2019b). This advance improves our understanding of pathogen origins and spread that ultimately lead to stronger and more timely intervention and control measures (Grubaugh et al. 2019c). Following the first release of the SARS-CoV-2 genome (Wu et al. 2020), public health and research laboratories worldwide have rapidly shared sequences on public data repositories such as GISAID (Shu and McCauley 2017) (n = 236 genomes as of 9 March 2020) that have been used to provide near real-time snapshots of global diversity through public analytic and visualization tools (Hadfield et al. 2018). Although all known cases linked to Iran are contained in this clade, it is important to note the presence of two Chinese strains sampled during mid-January 2020 from Hubei and Shandong provinces. It is expected that further Chinese strains will be identified that fall in this clade, and across the entire phylogenetic diversity of SARS-CoV-2 as this is where the outbreak started, including likely for the outbreak in Iran itself. However, while we cannot completely discount that the cases in Australia and New Zealand came from other sources including China, our phylogenetic analyses, as well as epidemiological (recent travel to Iran) and clinical data (date of symptom onset), provide evidence that this clade of SARS-CoV-2 is directly linked to the Iranian epidemic, from where genomic data are currently lacking. Importantly, the seemingly multiple importations of very closely related viruses from Iran into Australia suggest that this diversity reflects the early stages of SARS-CoV-2 transmission within Iran. Click here for additional data file.
  16 in total

1.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

Authors:  Kazutaka Katoh; Kazuharu Misawa; Kei-ichi Kuma; Takashi Miyata
Journal:  Nucleic Acids Res       Date:  2002-07-15       Impact factor: 16.971

2.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood.

Authors:  Stéphane Guindon; Olivier Gascuel
Journal:  Syst Biol       Date:  2003-10       Impact factor: 15.683

Review 3.  Whole Genome Sequencing-Implications for Infection Prevention and Outbreak Investigations.

Authors:  Kyle J Popovich; Evan S Snitkin
Journal:  Curr Infect Dis Rep       Date:  2017-04       Impact factor: 3.725

4.  Genomic Insights into Zika Virus Emergence and Spread.

Authors:  Nathan D Grubaugh; Nuno R Faria; Kristian G Andersen; Oliver G Pybus
Journal:  Cell       Date:  2018-03-08       Impact factor: 41.582

5.  Travel Surveillance and Genomics Uncover a Hidden Zika Outbreak during the Waning Epidemic.

Authors:  Nathan D Grubaugh; Sharada Saraf; Karthik Gangavarapu; Alexander Watts; Amanda L Tan; Rachel J Oidtman; Jason T Ladner; Glenn Oliveira; Nathaniel L Matteson; Moritz U G Kraemer; Chantal B F Vogels; Aaron Hentoff; Deepit Bhatia; Danielle Stanek; Blake Scott; Vanessa Landis; Ian Stryker; Marshall R Cone; Edgar W Kopp; Andrew C Cannons; Lea Heberlein-Larson; Stephen White; Leah D Gillis; Michael J Ricciardi; Jaclyn Kwal; Paola K Lichtenberger; Diogo M Magnani; David I Watkins; Gustavo Palacios; Davidson H Hamer; Lauren M Gardner; T Alex Perkins; Guy Baele; Kamran Khan; Andrea Morrison; Sharon Isern; Scott F Michael; Kristian G Andersen
Journal:  Cell       Date:  2019-08-22       Impact factor: 41.582

6.  An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar.

Authors:  Nathan D Grubaugh; Karthik Gangavarapu; Joshua Quick; Nathaniel L Matteson; Jaqueline Goes De Jesus; Bradley J Main; Amanda L Tan; Lauren M Paul; Doug E Brackney; Saran Grewal; Nikos Gurfield; Koen K A Van Rompay; Sharon Isern; Scott F Michael; Lark L Coffey; Nicholas J Loman; Kristian G Andersen
Journal:  Genome Biol       Date:  2019-01-08       Impact factor: 13.583

7.  Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding.

Authors:  Roujian Lu; Xiang Zhao; Juan Li; Peihua Niu; Bo Yang; Honglong Wu; Wenling Wang; Hao Song; Baoying Huang; Na Zhu; Yuhai Bi; Xuejun Ma; Faxian Zhan; Liang Wang; Tao Hu; Hong Zhou; Zhenhong Hu; Weimin Zhou; Li Zhao; Jing Chen; Yao Meng; Ji Wang; Yang Lin; Jianying Yuan; Zhihao Xie; Jinmin Ma; William J Liu; Dayan Wang; Wenbo Xu; Edward C Holmes; George F Gao; Guizhen Wu; Weijun Chen; Weifeng Shi; Wenjie Tan
Journal:  Lancet       Date:  2020-01-30       Impact factor: 79.321

8.  A new coronavirus associated with human respiratory disease in China.

Authors:  Fan Wu; Su Zhao; Bin Yu; Yan-Mei Chen; Wen Wang; Zhi-Gang Song; Yi Hu; Zhao-Wu Tao; Jun-Hua Tian; Yuan-Yuan Pei; Ming-Li Yuan; Yu-Ling Zhang; Fa-Hui Dai; Yi Liu; Qi-Min Wang; Jiao-Jiao Zheng; Lin Xu; Edward C Holmes; Yong-Zhen Zhang
Journal:  Nature       Date:  2020-02-03       Impact factor: 49.962

9.  The ability of single genes vs full genomes to resolve time and space in outbreak analysis.

Authors:  Gytis Dudas; Trevor Bedford
Journal:  BMC Evol Biol       Date:  2019-12-26       Impact factor: 3.260

10.  An interactive web-based dashboard to track COVID-19 in real time.

Authors:  Ensheng Dong; Hongru Du; Lauren Gardner
Journal:  Lancet Infect Dis       Date:  2020-02-19       Impact factor: 25.071

View more
  45 in total

1.  Revealing COVID-19 transmission in Australia by SARS-CoV-2 genome sequencing and agent-based modeling.

Authors:  Rebecca J Rockett; Alicia Arnott; Connie Lam; Rosemarie Sadsad; Verlaine Timms; Karen-Ann Gray; John-Sebastian Eden; Sheryl Chang; Mailie Gall; Jenny Draper; Eby M Sim; Nathan L Bachmann; Ian Carter; Kerri Basile; Roy Byun; Matthew V O'Sullivan; Sharon C-A Chen; Susan Maddocks; Tania C Sorrell; Dominic E Dwyer; Edward C Holmes; Jen Kok; Mikhail Prokopenko; Vitali Sintchenko
Journal:  Nat Med       Date:  2020-07-09       Impact factor: 53.440

2.  Evolution and Epidemiology of SARS-CoV-2 Virus.

Authors:  Yu-Nong Gong; Kuo-Ming Lee; Shin-Ru Shih
Journal:  Methods Mol Biol       Date:  2022

3.  InterARTIC: an interactive web application for whole-genome nanopore sequencing analysis of SARS-CoV-2 and other viruses.

Authors:  James M Ferguson; Hasindu Gamaarachchi; Thanh Nguyen; Alyne Gollon; Stephanie Tong; Chiara Aquilina-Reid; Rachel Bowen-James; Ira W Deveson
Journal:  Bioinformatics       Date:  2021-12-15       Impact factor: 6.937

4.  Artificial Intelligence (AI) and Big Data for Coronavirus (COVID-19) Pandemic: A Survey on the State-of-the-Arts.

Authors:  Quoc-Viet Pham; Dinh C Nguyen; Thien Huynh-The; Won-Joo Hwang; Pubudu N Pathirana
Journal:  IEEE Access       Date:  2020-07-15       Impact factor: 3.367

5.  Ecology and Evolution of Betacoronaviruses.

Authors:  Eduardo Rodríguez-Román; Adrian J Gibbs
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 2.622

6.  SARS-CoV-2 outbreak in Iran: The dynamics of the epidemic and evidence on two independent introductions.

Authors:  Zohreh Fattahi; Marzieh Mohseni; Khadijeh Jalalvand; Fatemeh Aghakhani Moghadam; Azam Ghaziasadi; Fatemeh Keshavarzi; Jila Yavarian; Ali Jafarpour; Seyedeh Elham Mortazavi; Fatemeh Ghodratpour; Hanieh Behravan; Mohammad Khazeni; Seyed Amir Momeni; Issa Jahanzad; Abdolvahab Moradi; Alijan Tabarraei; Sadegh Ali Azimi; Ebrahim Kord; Seyed Mohammad Hashemi-Shahri; Azarakhsh Azaran; Farid Yousefi; Zakiye Mokhames; Alireza Soleimani; Shokouh Ghafari; Masood Ziaee; Shahram Habibzadeh; Farhad Jeddi; Azar Hadadi; Alireza Abdollahi; Gholam Abbas Kaydani; Saber Soltani; Talat Mokhtari-Azad; Reza Najafipour; Reza Malekzadeh; Kimia Kahrizi; Seyed Mohammad Jazayeri; Hossein Najmabadi
Journal:  Transbound Emerg Dis       Date:  2021-05-22       Impact factor: 4.521

7.  Accommodating individual travel history, global mobility, and unsampled diversity in phylogeography: a SARS-CoV-2 case study.

Authors:  Philippe Lemey; Samuel Hong; Verity Hill; Guy Baele; Chiara Poletto; Vittoria Colizza; Áine O'Toole; John T McCrone; Kristian G Andersen; Michael Worobey; Martha I Nelson; Andrew Rambaut; Marc A Suchard
Journal:  bioRxiv       Date:  2020-06-23

Review 8.  Culture counts: the diverse effects of culture and society on mental health amidst COVID-19 outbreak in Australia.

Authors:  Y Furlong; T Finnie
Journal:  Ir J Psychol Med       Date:  2020-05-14

9.  Genomic analysis of SARS-CoV-2 strains among Indians returning from Italy, Iran & China, & Italian tourists in India.

Authors:  Varsha Potdar; Sarah S Cherian; Gururaj Rao Deshpande; Padinjaremattathil Thankappan Ullas; Pragya D Yadav; Manohar Lal Choudhary; Rohan Gughe; Veena Vipat; Sheetal Jadhav; Savita Patil; Dimpal Nyayanit; Triparna Majumdar; Atul Walimbe; Shivshankar Gaikwad; Hitesh Dighe; Anita Shete-Aich; Sreelekshmy Mohandas; Deepika Chowdhury; Gajanan Sapkal; Atanu Basu; Nivedita Gupta; Raman R Gangakhedkar; Sidhartha Giri; Lalit Dar; Amita Jain; Bharati Malhotra; Priya Abraham
Journal:  Indian J Med Res       Date:  2020 Feb & Mar       Impact factor: 2.375

10.  Identification of the nucleotide substitutions in 62 SARS-CoV-2 sequences from Turkey.

Authors:  Ayşe Banu Demİr; Domenico Benvenuto; Hakan AbacioĞlu; Silvia Angeletti; Massimo Ciccozzi
Journal:  Turk J Biol       Date:  2020-06-21
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.