Literature DB >> 35171928

Worldwide SARS-CoV-2 haplotype distribution in early pandemic.

Andrea Cairo1, Marilena V Iorio2, Silvia Spena1, Elda Tagliabue2, Flora Peyvandi1,3.   

Abstract

The world is experiencing one of the most severe viral outbreaks in the last few years, the pandemic infection by SARS-CoV-2, the causative agent of COVID-19 disease. As of December 10th 2021, the virus has spread worldwide, with a total number of more than 267 million of confirmed cases (four times more in the last year), and more than 5 million deaths. A great effort has been undertaken to molecularly characterize the virus, track the spreading of different variants across the globe with the aim to understand the potential effects in terms of transmission capability and different fatality rates. Here we focus on the genomic diversity and distribution of the virus in the early stages of the pandemic, to better characterize the origin of COVID-19 and to define the geographical and temporal evolution of genetic clades. By performing a comparative analysis of 75401 SARS-CoV-2 reported sequences (as of December 2020), using as reference the first viral sequence reported in Wuhan in December 2019, we described the existence of 26538 genetic variants, the most frequent clustering into four major clades characterized by a specific geographical distribution. Notably, we found the most frequent variant, the previously reported missense p.Asp614Gly in the S protein, as a single mutation in only three patients, whereas in the large majority of cases it occurs in concomitance with three other variants, suggesting a high linkage and that this variant alone might not provide a significant selective advantage to the virus. Moreover, we evaluated the presence and the distribution in our dataset of the mutations characterizing the so called "british variant", identified at the beginning of 2021, and observed that 9 out of 17 are present only in few sequences, but never in linkage with each other, suggesting a synergistic effect in this new viral strain. In summary, this is a large-scale analysis of SARS-CoV-2 deposited sequences, with a particular focus on the geographical and temporal evolution of genetic clades in the early phase of COVID-19 pandemic.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35171928      PMCID: PMC8849502          DOI: 10.1371/journal.pone.0263705

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

In nineteen months after the declaration of the SARS-CoV-2 pandemic [1], the scientific community has been struggling to understand the complexity of this novel coronavirus of still debated zoonotic origin [2], the clinical symptoms [3, 4], the risk factors, the potential treatments, in the urgent effort to contain the infection, to predict potentially serious disease outcomes, to find a cure. In 2021, several SARS-CoV-2 vaccines have been generated and approved, and several other are still in clinical trials [5]. The circulation of SARS-CoV-2 before the pandemic declaration has been ascertained, and several efforts have been made to track the worldwide spreading and the genetic changes that originated different viral strains. The first case of COVID-19 was reported in Wuhan, China, in December 2019 and, despite the relatively low mortality (approximately 2% on average worldwide, as of December 10th 2021, ) and the high percentage of asymptomatic or pauci-symptomatic subjects (over 80%), the viral outbreak has literally caused a dramatic collapse of the health care system in the most hit countries, as in Italy, where the mortality rate reached over 14% between May and August 2020. In the urgent race to find efficient drugs and decrease the complications, it has been immediately clear that a deeper understanding of the genomic diversity of this virus was crucial. Indeed, the existence of different strains and their temporal and geographical distribution could provide relevant information on: how the virus spread all over the world, the possible acquisition of selective advantages, the most conserved sequences suitable for a vaccine design. Since the first complete genomic sequence of SARS-CoV-2 release on January 5th 2020 [1, 2], thousands of additional sequences have been deposited. Different virus strains have been reported [6], and the accumulation of recurrent variants [7, 8]. Here, we evaluated the occurrence of different SARS-COV-2 clades and their geographical distribution starting from 75401 sequences deposited in two major available databases, GISAID and NCBI, from the beginning of COVID-19 pandemic to September 2020.

Material and methods

Data acquisition

The VCF file containing all identified variants in 77648 SARS-CoV-2 genomes and relative information about geographical area, date of sampling and genome length were downloaded on September 29th 2020 from http://covseq.baidu.com/. Viral sequences were collected from major available databases: GISAID (74303) and NCBI (3345). Genomes without detailed date of sampling, geographical localization and with a sequence shorter than 29000 nucleotides were excluded. After filtering, 75401 viral genomes were included in the analysis. The list of all analyzed sequences with the collection dates is available in

Sequence and mutational analysis

The reference sequence used for our analysis is NC_045512.2 (NCBI RefSeq). Multiallelic variants were splitted using Bcftools [9]. Variants were annotated using SnpEff v.4.5covid19 [10]. Data manipulation and descriptive analysis were performed using python package PANDAS [11]. Viral genomes and relative variants were organized in a binary matrix. The identification of clades and haplotypes was performed including in the analysis only the variants identified in at least 1000 viral sequences. We created a sub-dataset containing the 53 variants identified as present in more than 1000 viral sequences ( and different haplotypes were identified and counted using groupby PANDA function (). The sequences were classified into clades and deriving haplotypes based on the mutational distribution. Frequency of each variant in all countries was calculated (). and represented by clustering analysis and relative heatmap, which were performed with R 4.0.0s. Only countries with at least 50 viral sequences were included in the clustering analysis.

Results and discussion

The geographical distribution of available sequences is reported in Almost 58% of all available sequences was obtained from European centers, principally United Kingdom, followed by North America (~20%), Oceania (~11%) and Asia (~8%). The samples were collected from December 2019 to August 17th 2020 at different times across different countries, and analyzing their temporal distribution it is worth noting that few sequences were early obtained (January 2020) in United States (January 19th 2020), Australia (January 22nd 2020) and North Europe (January 23rd 2020) (). China represents the country with the highest percentage of unmutated viral genomes (3%) (). Observing the distribution of viral unmutated sequences, it could be noticed how USA, Northern Europe, Australia, South Africa, Brazil, Canada and India have been directly involved in the spread of the pandemic after China (). Analyzing the minimum number of variants per sequence in each country we could indeed track the mutational evolution and the relative geographical distribution, and the circulation timing of the virus. Importantly, this approach allowed us to hypothesize that some countries severely hit by COVID-19, as Italy and France, were characterized by the spread of already mutated forms of the virus (). A total of 26539 variants were identified (): 57% missense, 28% synonymous, 7% insertions/deletions, 2% stop variants, 7% of variants being localized in the untranslated regions, 4% in the 3’UTR and 3% in the 5’UTR ( The number of variants per sequence spans mainly from 0 to 20; the majority of sequences carried from 7 to 9 variants and only few cases (431) had more than 20 variants (). A plausible hypothesis could be that, whereas few variants may have provided such favorable features as an improved infectivity, a higher number of variants did not result in a selective advantage. Variants present in at least 1000 sequences are reported in (N = 53). As expected, nonsense mutations were not found among the most frequent variants.

Most frequent SARS-CoV-2 variants.

Only the variants identified in at least 1000 viral sequences are reported. * Numbering refers to the reference sequence: NC_045512.2. Tracing the mutational evolution of the viral genome has been crucial to evaluate potential functional consequences and to obtain an efficient vaccine, which should preferentially target more evolutionary conserved regions. Our analysis underlines that, despite the high number of variants, the regions coding for the polyprotein ORF1ab, the spike (S) and the membrane (M) proteins are the most conserved (). This is not surprising, since most evolutionary conserved regions usually encode essential proteins. However, we confirmed the missense variant p.Asp614Gly in S protein as the most frequent (found in 57820 sequences, 77% of the total) (, as described in previous studies [7, 8, 12, 13]. This variant has been shown to improve the binding affinity of the S protein to the human ACE2 receptor, reported as main entry site of the virus into human cells [14], through the cleavage of the S1/S2 domain [15]. Indeed, it was previously shown that SARS-CoV infection can be enhanced by exogenous proteases. It has been demonstrated that the p.Asp614Gly variant confers higher infectivity [16], competitive fitness and improved transmission in human primary cells and animal models [17]. However, the variant does not seem to be associated with an increased disease severity [16], despite what was initially inferred from the correlation of mortality rate and prevalence of p.Asp614Gly in different countries [8, 18]. Given its pitoval role in receptor binding, membrane fusion and cell infectivity, the capability to induce neutralizing antibodies and T cell-immune responses and the relative low variability so far reported, the S protein has been considered a valid target for the development of SARS-CoV-2 vaccines [19-21]. It is relevant to notice that, focusing on the 53 most frequent variants, we found the p.Asp614Gly variant as a single mutation in three patients only, suggesting that this variant alone does not provide a significant selective advantage to the virus. Indeed, in the large majority of cases p.Asp614Gly occurs concomitantly with other variants, in particular with three variants, all characterized by similar frequency and located in the ORF1ab gene: the missense p.Pro4715Leu, the synonymous p.Phe924Phe and the c.-25C>T in the 5’ untranslated region. These four variants are indeed in strong linkage and constitute the most common clade (clade 1). We identified 4 different clades and evaluated their temporal occurrence, geographical distribution and potential connections ( The variants characterizing clade 1 have been identified in 55582 sequences, 74% of the total. This clade, first reported in UK in January 2020, has literally spread worldwide and particularly to European countries, where it represents 78% of sequences in (1059 cases), to North and South America (73 and 77%), Africa (77%) and Oceania (73%) ( Interestingly, this core is present in approximately 93% (201 of 215) of Italian cases. Clade 1 has likely generated two subclades, subclade_1A and subclade_ 1B, characterized by a different geographical frequency (), with the presence of subclade_1A mainly in Europe (41%), Oceania (52%) and South America (50%), whereas subclade_1B is present mainly in North America (51%) (). A visual representation of the spreading of different variants and relative clades is clearly inferable from the hierarchical cluster analysis reported in . The plot highlights the co-presence of the four most common variants constituting clade 1 and their worldwide spreading. Additionally, it is possible to observe how the distribution of the three variants characterizing subclade_1A (28881, 28882, 28883) and the two variants peculiar of subclade_1B (25563, 1059) present an almost mutually exclusive geographical localization.

Hierarchical cluster analysis of most frequent SARS-CoV-2 variants and geographical distribution.

Only countries with at least 50 viral sequences were included in the clustering analysis. Clade 2, characterized by two different variants (8782 and 28144), is present in North America, some European countries such as Spain, some regions of Asia (China, Kazakistan and Thailand), Africa (Ghana and Nigeria) and Australia ( and ). Clade 3 (11083, 14805, 26144) and clade 4 (1440) appear to be more common in Europe. Despite the worldwide distribution of clade 1 variants, it is remarkable the peculiar case of South Korea, characterized by an almost exclusive presence of clade 3, and at lower frequency of clade 2 (). In addition, some regions are characterized by a high frequency of different variants not related to the four major clades. 58% of sequences of Singapore presents a specific pattern of 5 variants (11083, 28311, 13730, 23929, 6312); Spain and Kazakistan curiously present the diffusion of similar virus strains characterized by the presence of 4 variants (28863, 9477, 25979, 28657) observed in 28% and 35% of the sequences, respectively. Likewise, 47% of the Australian sequences are characterized by the presence of 6 variants (7540, 23401, 22992, 16647, 18555, 1163) (). The 22992 missense variant, leading to a S477A aminoacidic change, has been associated to a stronger transmission capacity [22] and higher infectivity, perhaps due to an improved binding of the S protein RBD (receptor-binding domain) to the ACE receptor [23]. Other sporadic mutations are related to a specific geographical area, as the variant 313 identified in 37% of Japanese sequences, the variants 18877 and 29829 observed in the 69% of sequences from Saudi Arabia and 90% of the sequences from Netherlands, respectively. These four identified clades constitute the core of 1213 different haplotypes, of which 548 are unique, whereas the remaining occurred in more than one patient. We report the 20 most frequent haplotypes according to the number of cases ().

Most frequent SARS-CoV-2 haplotypes: Frequency and distribution.

In bold the variants charactering Clade 1, in italics Clade 2, underlined Clade 3 and double underlined Clade 4. Geographical distribution of haplotypes 1,2 and 3 are described in the . Haplotype 1 (241, 3037, 14408, 28881, 28882, 28883) is more common in South America and Europe, haplotype 2 (23403, 14408, 3037, 241, 25563, 1059) has mainly spread to North America and haplotype 3 (241, 3037, 14408, 23403) to Africa and Europe. Haplotype 3 is present in 48% of Italian sequences, and haplotype 1 in 36%. At the beginning of 2021, a new viral strain characterized by 23 variants (6 synonymous and 17 non-synonymous) and associated with a high diffusion capacity was reported in the United Kingdom [24]. One of these mutations, the p.Asn501Tyr mutation located at the RBD of the S protein, has been previously reported to be an adaptive mutation in a mouse model subjected to serial passages of a human SARS-CoV-2 by means of intranasal inoculations [25]. This mutation seems to favor virus binding to the human ACE2 receptor, thus leading to a phenotype of increased virulence in mice [23]. However, p.Asn501Tyr variant and other 8 of the 17 variants (3267C>T p.Thr1001Ile, 6954T>C p.Ile2230Thr in the ORF1ab gene; 23063A>T, 23271 C>A p.Ala570Asp, 23604C>A p.Pro681His, 23709C>T p.Thr716Ile in the S gene; 27972C>T p.Gln27*, 28048G>T p.Arg52Ile in the Orf8 gene; 28977C>T p.Ser235Phe in the N gene) were observed only in a few sequences in our dataset, but never in linkage with each other, suggesting a synergistic effect of the variants in this new viral strain (). In particular p.Asn501Tyr was observed in 2 sequences at the beginning of April, one in Brazil and the other in USA. Later, starting from June 2020, the same mutation was identified in 34 Australian sequences. In conclusion, our analysis emphasizes that the most frequent SARS-CoV-2 variants in early phases of COVID-19 pandemic clustered into four major clades, characterized by different geographical localization, likely reflecting a temporal and spatial spread of virus strains. Moreover, we determined that the most frequent variant, the missense variant p.Asp614Gly in the S protein, is in strong linkage with three additional variants, suggesting a potential functional cooperation in providing a selective advantage to the virus. Despite the limitations of this study, as the fact that most sequences in the first year of COVID-19 pandemic were reported by UK, and the unavailability of clinical data to correlate the different variants to disease outcome, our analysis provides a portrait of SARS-CoV-2 temporal and spatial spread during the first phases of the pandemic. During last year several novel virus variants have been sequenced, in particular the World Health Organization (WHO) identified variants of concern (VOCs), which cause an improved virulence and transmissibility or affect the efficacy of diagnosis, treatment or vaccines [26]. The most recently described VOC, the so called Omicron, has been first documented in South Africa in November 2021, where it has been responsible for the forth waive of COVID-19. This variant has raised concerns about its rapid spread and potential escape to vaccines. According to still preliminary data however, Omicron seems to have improved transmissibility but decreased severity [27]. Further studies are needed to verify whether vaccines retain their efficacy also against this variant. Considering the continuous circulation of the virus and the emerging of new SARS-CoV-2 variants, it is extremely relevant to associate the different haplotypes to a specific outcome of the disease and to understand whether or not the acquired mutations have functional consequences in terms of infectiveness, clinical severity and potential responsiveness to specific treatments and to vaccines.

Conclusions

By performing a large scale SARS-CoV-2 analysis of reported sequences in the first year of Covid-19 pandemic, we described the existence of 26538 genetic variants, the most frequent clustering into four major clades characterized by a specific geographical distribution. We also found that the most frequent variant, the missense p.Asp614Gly in the S protein, occurs almost exclusively in concomitance with three other variants, suggesting a high linkage and a potential cooperative effect on the virus fitness.

Worldwide distribution of available genomic sequences.

The number and the percentage of sequences reported in each continent and country are indicated in panel (a) and (b), respectively. Only countries with more than 200 sequences are shown. (PDF) Click here for additional data file.

Geographical distribution of the collection dates of the first sequenced sample in each country.

(PDF) Click here for additional data file.

Mutational spectrum of SARS-CoV-2 genome.

(a) Type of variants are listed with the corresponding number and frequency. (b) Variant number distribution. The bar graph shows the amount of the reported sequences with the same number of variants. (c) Frequency of variants with a direct effect on the virus’ proteins (missense, nonsense and insertion/deletion). Variants are listed with the corresponding number and frequency of reported mutations. (PDF) Click here for additional data file.

Frequency and geographical distribution of clade 1.

(PDF) Click here for additional data file.

Frequency and geographical distribution of clade1-derived haplotypes 1–3.

(PDF) Click here for additional data file.

Frequency and geographical distribution of clade2.

(PDF) Click here for additional data file.

List of all analyzed sequences with collection dates and presence of most frequents variants.

(XLSX) Click here for additional data file.

Haplotype identified starting from the 53 most frequent variants.

(XLSX) Click here for additional data file.

Frequency of each variant in all countries.

(XLSX) Click here for additional data file.

List of all identified variants.

(XLSX) Click here for additional data file.
Table 1

SARS-CoV-2 sequence distribution.

Continent/CountryN° SequencesN° Seq. with 0 variant (%)
EUROPE 4350881(0.1)
Austria4251 (-)
Denimark7421 (-)
Germany3921 (-)
Iceland5815 (0.8)
Italy2150 (-)
Sweden5261 (-)
Switzerland7082 (-)
UK3286769 (0.2)
NORTH AMERICA 1527227 (0.2)
Usa1458026 (0.2)
Canada5531 (-)
SUD AMERICA 13271 (-)
Brazil8381 (-)
AFRICA 9842 (-)
South Africa2311 (-)
ASIA 636378 (1.2)
China109332 (3.0)
India24132 (-)
OCEANIA 79471 (-)
Australia76881 (-)
Table 2

Most frequent SARS-CoV-2 variants.

Only the variants identified in at least 1000 viral sequences are reported.

Genomic Position*Genec.DNAProteinType of variantDate of first observationCountryN°of sequences% of sequencesFunctional data
23403Sc.1841A>Gp.Asp614Glymissense24/01/2020China5782077%Increased infectivity (13;15;16)
14408ORF1abc.14144C>Tp.Pro4715Leumissense24/01/2020China5760976%
3037ORF1abc.2772C>Tp.Phe924Phesynonymous24/01/2020China5756276%
241ORF1abc.-25C>T/5’UTR24/01/2020China5648775%
28881Nc.608G>Ap.Arg203Lysmissense27/01/2020Australia2597434%
28883Nc.610G>Cp.Gly204Argmissense27/01/2020Australia2592734%
28882Nc.609G>Ap.Arg203Argsynonymous27/01/2020Australia2592334%
25563ORF3ac.171G>Tp.Gln57Hismissense16/02/2020Singapore1621422%
29836/c.4453_4519del/3’UTR20/01/2020China1418119%
1059ORF1abc.794C>Tp.Thr265Ilemissense16/02/2020Singapore1258917%
11083ORF1abc.10818G>Tp.Leu3606Phemissense17/01/2020China782110%
14805ORF1abc.14541C>Tp.Tyr4847Tyrsynonymous09/02/2020United Kingdom64018%
28144ORF8c.251T>Cp.Leu84Sermissense26/12/2019China54817%
8782ORF1abc.8517C>Tp.Ser2839Sersynonymous26/12/2019China53477%
26144ORF3ac.752G>Tp.Gly251Valmissense21/01/2020China52287%
1163ORF1abc.898A>Tp.Ile300Phemissense27/01/2020Australia41806%
18555ORF1abc.18291C>Tp.Asp6097Aspsynonymous27/01/2020Australia39405%
16647ORF1abc.16383G>Tp.Thr5461Thrsynonymous27/01/2020Australia38275%
23401Sc.1839G>Ap.Gln613Glnsynonymous27/01/2020Australia38215%
7540ORF1abc.7275T>Cp.Thr2425Thrsynonymous27/01/2020Australia38005%
22992Sc.1430G>Ap.Ser477Asnmissense27/01/2020Australia37705%Increased infectivity (21, 22)
20268ORF1abc.20004A>Gp.Leu6668Leusynonymous25/01/2020Australia34335%
23731Sc.2169C>Tp.Thr723Thrsynonymous02/03/2020Denmark33194%
10097ORF1abc.9832G>Ap.Gly3278Sermissense02/03/2020Denmark33084%
18060ORF1abc.17796C>Tp.Leu5932Leusynonymous19/01/2020USA25963%
17858ORF1abc.17594A>Gp.Tyr5865Cysmissense21/02/2020USA25423%
17747ORF1abc.17483C>Tp.Pro5828Leumissense21/02/2020USA24773%
2558ORF1abc.2293C>Tp.Pro765Sermissense09/02/2020United Kingdom22503%
2480ORF1abc.2215A>Gp.Ile739Valmissense09/02/2020United Kingdom21073%
18877ORF1abc.18613C>Tp.Leu6205Leusynonymous21/02/2020Lebanon20143%
17247ORF1abc.16983T>Cp.Arg5661Argsynonymous25/02/2020United Kingdom16722%
27964ORF8c.71C>Tp.Ser24Leumissense03/03/2020USA16662%
15324ORF1abc.15060C>Tp.Asn5020Asnsynonymous22/01/2020China16562%
1604ORF1abc.1341_1343delTGAp.Asp448delins/del05/02/2020United Kingdom16532%
25429ORF3ac.37G>Tp.Val13Leumissense02/03/2020USA16292%
28854Nc.581C>Tp.Ser194Leumissense16/01/2020China16182%
28580Nc.307G>Tp.Asp103Tyrmissense02/03/2020Chile16042%
28311Nc.38C>Tp.Pro13Leumissense26/02/2020South Korea15482%
29829/c.4446_4519del/3’UTR27/02/2020Netherlands13802%
313ORF1abc.48C>Tp.Leu16Leusynonymous27/02/2020Canada13692%
28657Nc.384C>Tp.Asp128Aspsynonymous25/02/2020France13552%
29864/c.4481_4517del/3’UTR18/01/2020China13292%
1440ORF1abc.1175G>Ap.Gly392Aspmissense23/02/2020United Kingdom13172%
28863Nc.590C>Tp.Ser197Leumissense25/02/2020France13092%
9477ORF1abc.9212T>Ap.Phe3071Tyrmissense01/02/2020China13072%
13730ORF1abc.13466C>Tp.Ala4489Valmissense14/02/2020Singapore13032%
25979ORF3ac.587G>Tp.Gly196Valmissense25/02/2020France12922%
19839ORF1abc.19575T>Cp.Asn6525Asnsynonymous26/02/2020Switzerland12182%
2891ORF1abc.2626G>Ap.Ala876Thrmissense23/02/2020United Kingdom12022%
2416ORF1abc.2151C>Tp.Tyr717Tyrsynonymous29/02/2020France11602%
29851/c.4468_4519del/3’UTR25/01/2020Australia11542%
6312ORF1abc.6047C>Ap.Thr2016Lysmissense04/03/2020Taiwan11261%
23929Sc.2367C>Tp.Tyr789Tyrsynonymous04/03/2020Taiwan10631%

* Numbering refers to the reference sequence: NC_045512.2.

Table 3

Most frequent SARS-CoV-2 clades: Frequency and distribution.

CladeGenomic position of variantsDate of first sample collectionCountry of first detectionAfrica (%)Asia (%)Europe (%)North America (%)Oceania (%)South America (%)All (%)
Clade_1 23403/14408/3037/24125-Jan-2020Australia77.0049.0078.0073.0073.0077.0074.00
Sub_Clade_1_A 23403/14408/3037/241/28881/28882/2888327-Jan-2020Australia19.0023.0041.005.0052.0050.0033.00
Sub_Clade_1_B 23403/14408/3037/241/25563/105916-Feb-2020Singapore4.006.007.0051.008.005.0016.00
Clade_2 28144/878226-Dec-2019China5.0010.003.0017.0010.005.007.00
Clade_3 11083/14805/261449-Feb-2020UK0.901.709.301.603.601.506.20
Clade_4 144023-Feb-2020UK0.100.202.800.100.600.101.70
Table 4

Most frequent SARS-CoV-2 haplotypes: Frequency and distribution.

In bold the variants charactering Clade 1, in italics Clade 2, underlined Clade 3 and double underlined Clade 4.

Haplotype IDGenomic position of variantsDate of first sample collectionCountryN° of sequences% of sequences
123403/14408/3037/241/28881/28882/2888316-Feb-2020UK1033714%
223403/14408/3037/241/25563/105916-Feb-2020Singapore851111%
3 23403/14408/3037/241 3-Feb-2020UK59448%
423403/14408/3037/241/28881/28882/28883/2983627-Feb-2020Spain39445%
523403/14408/3037/241/28881/28882/28883/1163/18555/16647/23401/7540/2299219-Mar-2020Australia29004%
628144/8782/18060/17858/1774721-Feb-2020USA22423%
723403/14408/3037/241/2026825-Jan-2020Australia19523%
823403/14408/3037/241/2983626-Feb-2020UK17812%
911083/14805/26144/2558/24809-Feb-2020UK14192%
10/dic-20China13802%
11 28144/8782 26-Dec-2019China11522%
1223403/14408/3037/241/25563/1887721-Feb-2020Lebanon11151%
1323403/14408/3037/241/25563/29836/105921-Feb-2020France11031%
1411083/14805/26144/1724725-Feb-2020UK10731%
1523403/14408/3037/241/153243-Mar-2020Switzerland10451%
1623403/14408/3037/241/254297-Mar-2020UK10351%
1723403/14408/3037/241/25563/1059/279648-Mar-2020USA10221%
1823403/14408/3037/241/28881/28882/28883/31327-Feb-2020Switzerland10211%
1923403/14408/3037/241/28881/28882/28883/23731/100972-Mar-2020Denmark8931%
201440/289123-Feb-2020UK8911%
  23 in total

1.  A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3.

Authors:  Pablo Cingolani; Adrian Platts; Le Lily Wang; Melissa Coon; Tung Nguyen; Luan Wang; Susan J Land; Xiangyi Lu; Douglas M Ruden
Journal:  Fly (Austin)       Date:  2012 Apr-Jun       Impact factor: 2.160

2.  Characteristics and Outcomes of Hospitalized Patients in South Africa During the COVID-19 Omicron Wave Compared With Previous Waves.

Authors:  Caroline Maslo; Richard Friedland; Mande Toubkin; Anchen Laubscher; Teshlin Akaloo; Boniswa Kama
Journal:  JAMA       Date:  2022-02-08       Impact factor: 157.335

Review 3.  SARS-CoV-2 vaccines in development.

Authors:  Florian Krammer
Journal:  Nature       Date:  2020-09-23       Impact factor: 49.962

4.  Mutations Strengthened SARS-CoV-2 Infectivity.

Authors:  Jiahui Chen; Rui Wang; Menglun Wang; Guo-Wei Wei
Journal:  J Mol Biol       Date:  2020-07-23       Impact factor: 5.469

5.  Emergence of genomic diversity and recurrent mutations in SARS-CoV-2.

Authors:  Lucy van Dorp; Mislav Acman; Damien Richard; Liam P Shaw; Charlotte E Ford; Louise Ormond; Christopher J Owen; Juanita Pang; Cedric C S Tan; Florencia A T Boshier; Arturo Torres Ortiz; François Balloux
Journal:  Infect Genet Evol       Date:  2020-05-05       Impact factor: 3.342

Review 6.  The variants of SARS-CoV-2 and the challenges of vaccines.

Authors:  Xiucui Han; Qing Ye
Journal:  J Med Virol       Date:  2021-12-15       Impact factor: 20.693

7.  Variant analysis of SARS-CoV-2 genomes.

Authors:  Takahiko Koyama; Daniel Platt; Laxmi Parida
Journal:  Bull World Health Organ       Date:  2020-06-02       Impact factor: 9.408

8.  Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus.

Authors:  Bette Korber; Will M Fischer; Sandrasegaram Gnanakaran; Hyejin Yoon; James Theiler; Werner Abfalterer; Nick Hengartner; Elena E Giorgi; Tanmoy Bhattacharya; Brian Foley; Kathryn M Hastie; Matthew D Parker; David G Partridge; Cariad M Evans; Timothy M Freeman; Thushan I de Silva; Charlene McDanal; Lautaro G Perez; Haili Tang; Alex Moon-Walker; Sean P Whelan; Celia C LaBranche; Erica O Saphire; David C Montefiori
Journal:  Cell       Date:  2020-07-03       Impact factor: 66.850

9.  SARS-CoV-2 viral spike G614 mutation exhibits higher case fatality rate.

Authors:  Manuel Becerra-Flores; Timothy Cardozo
Journal:  Int J Clin Pract       Date:  2020-06-03       Impact factor: 3.149

10.  Adaptation of SARS-CoV-2 in BALB/c mice for testing vaccine efficacy.

Authors:  Hongjing Gu; Qi Chen; Guan Yang; Lei He; Hang Fan; Yong-Qiang Deng; Shibo Jiang; Shihui Sun; Cheng-Feng Qin; Yusen Zhou; Yanxiao Wang; Yue Teng; Zhongpeng Zhao; Yujun Cui; Yuchang Li; Xiao-Feng Li; Jiangfan Li; Na-Na Zhang; Xiaolan Yang; Shaolong Chen; Yan Guo; Guangyu Zhao; Xiliang Wang; De-Yan Luo; Hui Wang; Xiao Yang; Yan Li; Gencheng Han; Yuxian He; Xiaojun Zhou; Shusheng Geng; Xiaoli Sheng
Journal:  Science       Date:  2020-07-30       Impact factor: 47.728

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.