| Literature DB >> 33835709 |
Zohreh Fattahi1,2, Marzieh Mohseni1,2,3, Khadijeh Jalalvand1, Fatemeh Aghakhani Moghadam1, Azam Ghaziasadi4,5, Fatemeh Keshavarzi1, Jila Yavarian4, Ali Jafarpour4,5, Seyedeh Elham Mortazavi6, Fatemeh Ghodratpour1, Hanieh Behravan1, Mohammad Khazeni4,7, Seyed Amir Momeni7, Issa Jahanzad8, Abdolvahab Moradi9, Alijan Tabarraei9, Sadegh Ali Azimi9, Ebrahim Kord10, Seyed Mohammad Hashemi-Shahri10, Azarakhsh Azaran11, Farid Yousefi11, Zakiye Mokhames12, Alireza Soleimani12, Shokouh Ghafari13, Masood Ziaee13, Shahram Habibzadeh14, Farhad Jeddi14, Azar Hadadi15, Alireza Abdollahi16, Gholam Abbas Kaydani17, Saber Soltani4,5, Talat Mokhtari-Azad4, Reza Najafipour18, Reza Malekzadeh19, Kimia Kahrizi1, Seyed Mohammad Jazayeri4,5, Hossein Najmabadi1,2.
Abstract
The SARS-CoV-2 virus has been rapidly spreading globally since December 2019, triggering a pandemic, soon after its emergence. While Iran was among the first countries confronted with rapid spread of virus in February 2020, no real-time SARS-CoV-2 whole-genome tracking in early phase of outbreak was performed in the country. To address this issue, we provided 50 whole-genome sequences of viral isolates ascertained from different geographical locations in Iran during March-July 2020. The corresponding analysis on origins, transmission dynamics and genetic diversity of SARS-CoV-2 virus, represented at least two introductions of the virus into the country, constructing two major clusters defined as B.4 and B.1*. The first entry of the virus might have occurred around very late 2019/early 2020, as suggested by the time to the most recent common ancestor, followed by a rapid community transmission that led to dominancy of B.4 lineage in early epidemic till the end of June. Gradually, reduction in dominancy of B.4 occurred possibly as a result of other entries of the virus, followed by surge of B.1* lineages, as of mid-May. Remarkably, variation tracking of the virus indicated the increase in frequency of D614G mutation, along with B.1* lineages, which showed continuity till October 2020. The increase in frequency of D614G mutation and B.1* lineages from mid-May onwards predicts a rapid viral transmission that may push the country into a critical health situation followed by a considerable change in composition of viral lineages circulating in the country.Entities:
Keywords: COVID-19; Iran; SARS-CoV-2; phylogenetic study; whole genome sequencing
Mesh:
Year: 2021 PMID: 33835709 PMCID: PMC8251331 DOI: 10.1111/tbed.14104
Source DB: PubMed Journal: Transbound Emerg Dis ISSN: 1865-1674 Impact factor: 4.521
FIGURE 1(a) Lineages assignment of 58 SARS‐CoV‐2 sequences from the Iranian outbreak. Abundance of SARS‐CoV‐2 lineages over time from March to the end of June 2020 indicates a reduction in dominancy of the B.4 lineage. Trend of circulating lineages assigned by (b) Pangolin v2.0.7, C. CoV‐GLUE and D. NextClade v0.6.0
FIGURE 2Tempo‐spatial phylogenetic tree of SARS‐CoV‐2 emergence in Iran
FIGURE 3Radial phylogenetic tree of SARS‐CoV‐2 genomes from the Iranian outbreak in the context of 216 set of genomes from around the world. The major B.4 cluster in the Iranian SARS‐CoV‐2 outbreak is highlighted in light green. The [B.1.*/20A] cluster is highlighted in light red. The red circles denote the two B.4 samples from China located near the major Iranian cluster. The green circles denote the five B.4 samples from Australia, Canada and New Zealand located within the major Iranian cluster
FIGURE 4The zoomed and collapsed phylogenetic tree of SARS‐CoV‐2 genomes from the Iranian outbreak in the context of 216 set of genomes from around the world. The two main clusters circulating in Iran are zoomed. The major B.4 cluster in SARS‐CoV‐2 Iranian outbreak is highlighted in light green and the lines corresponding to Iranian samples within this cluster are also shown in green while other global samples are shown in red lines. The [B.1.*/20A] cluster is highlighted in light red, and the lines corresponding to Iranian samples within this cluster are also shown in red, while other global samples are shown in blue lines
Common and novel variants observed in SARS‐CoV‐2 genomes of the Iranian outbreak
| No | Genomic change | Type of mutation | Gene/protein | Amino acid change | No. of samples | Sample lineages | Description |
|---|---|---|---|---|---|---|---|
| Common SARS‐CoV−2 variants observed in early phase of Iranian outbreak | |||||||
| 1 | G1397A | Non‐synonymous | nsp2 | V198I | 43 (81%) | B.4, B |
Known coexistence of variants, constituting B.4 lineage, as suggested by Eden et al., |
| 2 | T28688C | Synonymous | N | L139L | 41 (77%) | B.4 | |
| 3 | G29742T | Non‐coding | 3ʹUTR | NA | 43 (81%) | B.4, B | |
| 4 | G11083T | Non‐synonymous | nsp6 | L37F | 39 (74%) | B.4, B | The most common mutation in Asia, during December 2019‐March 2020. |
| 5 | C241T | Non‐coding | 5ʹ‐UTR | NA | 10 (19%) | B.1* |
Known coexistence of variants, constituting B.1 (G) and B.1.* (GH) clades, according to Mercatelli & Giorgi, |
| 6 | C3037T | Synonymous | nsp3 | F106F | 10 (19%) | B.1* | |
| 7 | C14408T | Non‐synonymous | nsp12 (RdRp) | P323L | 10 (19%) | B.1* | |
| 8 | A23403G | Non‐synonymous | S | D614G | 13 (24.5%) | B.1* ( | |
| 9 | G25563T | Non‐synonymous | ORF3a | Q57H | 10 (19%) | B.1*, B.4, B | |
| Unique SARS‐CoV−2 haplotypes observed in early phase of Iranian outbreak | |||||||
| 10 | G20887A | Non‐synonymous | nsp16 | G77R | 9 (17%) | B.4 |
New coexistence of variants observed in same nine samples also carrying B.4 common variants. |
| 11 | C28830T | Non‐synonymous |
| S186F | 9 (17%) | B.4 | |
| 12 | C21627T | Non‐synonymous | S | T22I | 9 (17%) | B.4 | |
| 13 | G8653T | Non‐synonymous | nsp4 | M33I | 6 (11%) | B.4 |
New coexistence of variants observed in same six samples also carrying B.4 common variants. |
| 14 | C884T | Non‐synonymous | nsp2 | R27C | 6 (11%) | B.4 | |
| Unique SARS‐CoV−2 variants observed in early phase of Iranian outbreak | |||||||
| 15 | C28388G | Non‐synonymous | N | Q39E | 1 | B.1.1/20B | Known variants located at the same position: Q39L/ Q39H/ Q39R/ Q39* |
| 16 | G18712A | Non‐synonymous | nsp14 | A225T | 1 | B.4/19A | Known variants located at the same position: A225D / A225S |
| 17 | T3926C | Non‐synonymous | nsp3 | S403P | 1 | B.4/19A | Known variants located at the same position: S403L / S403A |
| 18 | G6461A | Non‐synonymous | nsp3 | V1248M | 1 | B.4/19A | Known variants located at the same position: V1248G / V1248L |
Abbreviations: N, nucleocapsid phosphoprotein; nsp14, 3'‐to‐5' exonuclease; nsp16, 2'‐O‐ribose methyltransferase; nsp2, Non‐Structural protein 2; nsp3, Predicted phosphoesterase, papain‐like proteinase; nsp4, Transmembrane protein; nsp6, Transmembrane protein; ORF, open reading frame; RdRp, RNA‐dependent RNA polymerase; S, Spike glycoprotein.
Variants located in the spike, observed in SARS‐CoV‐2 genomes of the Iranian outbreak
| Genomic change | Type of mutation | Gene/protein | Amino acid change | No. of samples | Sample lineages |
|---|---|---|---|---|---|
| A23403G | Non‐synonymous | S (S1) | D614G | 13 (24.5%) | B.1*/20A, B.1*/20B, B.4/19A |
| C21627T | Non‐synonymous | S (S1) | T22I | 9 (17%) | B.4/19A |
| G22100A | Non‐synonymous | S (S1) | E180K | 2 (4%) | B.4/19A |
| G22592T | Non‐synonymous | S (S1‐RBD Domain) | A344S | 1 (2%) | B.4/19A |
| C23679T | Non‐synonymous | S (S2) | A706V | 1 (2%) | B.4/19A |
| G24348T | Non‐synonymous | S (S2) | S929I | 1 (2%) | B.4/19A |
| G25249T | Non‐synonymous | S (S2) | M1229I | 1 (2%) | B.4/19A |
Abbreviation: S, spike glycoprotein.
FIGURE 5The frequency of D614G mutation during March–October interval in Iranian SARS‐CoV‐2 outbreak