Literature DB >> 31997390

Potential of large "first generation" human-to-human transmission of 2019-nCoV.

Xingguang Li1, Junjie Zai2, Xiaomei Wang1, Yi Li1.   

Abstract

To investigate the genetic diversity, time origin, and evolutionary history of the 2019-nCoV outbreak in China and Thailand, a total of 12 genome sequences of the virus with known sampling date (24 December 2019 and 13 January 2020) and geographic location (primarily Wuhan city, Hubei Province, China, but also Bangkok, Thailand) were analyzed. Phylogenetic and likelihood-mapping analyses of these genome sequences were performed. On the basis of our results, the star-like signal and topology of 2019-nCoV may be indicative of potentially large "first generation" human-to-human virus transmission. We estimated that 2019-nCoV likely originated in Wuhan on 9 November 2019 (95% credible interval: 25 September 2019 and 19 December 2019), and that Wuhan is the major hub for the spread of the 2019-nCoV outbreak in China and elsewhere. Our results could be useful for designing effective prevention strategies for 2019-nCoV in China and beyond.
© 2020 Wiley Periodicals, Inc.

Entities:  

Keywords:  2019-nCoV; Bangkok; TMRCA; Wuhan; outbreak

Mesh:

Year:  2020        PMID: 31997390      PMCID: PMC7166825          DOI: 10.1002/jmv.25693

Source DB:  PubMed          Journal:  J Med Virol        ISSN: 0146-6615            Impact factor:   2.327


INTRODUCTION

The current outbreak of the novel coronavirus 2019‐nCoV, which was first reported in the Chinese city of Wuhan on 31 December 2019, can cause serious pneumonia and is now known to spread from person to person. From 24 January 2020, hundreds of millions of people will travel to their hometowns or overseas for the Chinese New Year holidays, which is China's most significant annual holiday. Authorities in China and around the world have mounted an enormous operation to track and screen travelers from Wuhan in central China, where the first cases were documented. As of 26 January 2020, there have been 2744 cases of 2019‐nCoV confirmed in mainland China, including 80 deaths, 461 serious, and 51 discharged, as well as 8 in Hong Kong, 5 in Macao, and 4 in Taiwan. Thirty‐three exported cases to other countries have been confirmed, including seven in Thailand, three in Japan, three in South Korea, three in the United States, two in Vietnam, four in Singapore, three in Malaysia, one in Nepal, three in France, and four in Australia. Of note, the 2019‐nCoV disease has not been found in humans previously; therefore, determining how 2019‐nCoV spreads is the most urgent question surrounding the outbreak. It is also imperative that we discover whether 2019‐nCoV has the potential to cause an outbreak similar to the 2002‐2003 outbreak caused by severe acute respiratory syndrome coronavirus (SARS‐CoV), , , which emerged in southern China and eventually killed 774 people in 37 countries. 2019‐nCoV and SARS‐CoV are members of the large family of coronaviruses, which also includes the viruses responsible for the Middle East respiratory syndrome (MERS). , Field studies have revealed the original source of SARS‐CoV and MERS‐CoV to be the bat, , , , with masked palm civets (a mammal native to Asia and Africa) , , and camels, , respectively, serving as intermediate hosts between bats and humans. To date, 2019‐nCoV is most closely associated with SARS and related viruses that circulate in bats. The hypothesis that 2019‐nCoV jumped from an animal at the Wuhan Huanan Seafood Wholesale Market, which sold processed meats and live consumable animals, is strongly supported by previous study suggesting that this virus came directly or indirectly from bats. Therefore, the 2019‐nCoV outbreak implies that the human consumption of wild animals should be limited to prevent zoonotic disease infection. Professor Zhong Nanshan, a SARS intervention specialist, is the respiratory researcher leading the Chinese government's expert panel on the 2019‐nCoV outbreak. After a visit to Wuhan city on 20 January 2020, Professor Zhong confirmed that 2019‐nCoV is spreading between people, and further confirmed that 14 medical workers had been infected by 1 person, raising concerns that certain people may be “super‐spreaders” of the virus. Determining the genome sequences of 2019‐nCoV strains can offer key information about viral origin and dissemination. Thus far, researchers have played a crucial role in the rapid sequencing, publishing, and sharing of genome sequences obtained from 2019‐nCoV strains found in infected people from China and Thailand. In the present study, we employed state‐of‐the‐art methods to investigate the time origin and potential rapid expansion of this virus in humans based on 12 genome sequences of 2019‐nCoV with known sampling date (24 December 2019 and 13 January 2020) and geographic location (Wuhan, Hubei Province, China and Bangkok, Thailand). Our study should provide insight into the time of origin and evolutionary history of the 2019‐nCoV outbreak in China and Thailand. This study may be useful for designing effective 2019‐nCoV prevention strategies in China and beyond.

MATERIALS AND METHODS

Collation of 2019‐nCoV genome sequence dataset

As of 19 January 2020, 13 genome sequences of 2019‐nCoV have been released on GISAID (http://gisaid.org/) and one strain (virus name: BetaCoV/Wuhan‐Hu‐1/2019; accession ID: EPI_ISL_402125) has been released on GenBank (https://www.ncbi.nlm.nih.gov/nuccore/MN908947) by the Shanghai Public Health Clinical Center & School of Public Health, Fudan University, Shanghai, China. This strain was the first released 2019‐nCoV genome sequence but has since been updated three times. In this study, we used the latest version (MN908947.3), and therefore the genome sequence of strain BetaCoV/Wuhan‐Hu‐1/2019 from GISAID (EPI_ISL_402125) was excluded. Isolate BetaCoV/Wuhan/IVDC‐HB‐04/2020 (EPI_ISL_402120) shows evidence of sequencing artifacts and thus was also excluded in this study. The final dataset included 12 genome sequences of 2019‐nCoV with known sampling date and city (10 from individuals in Wuhan, Hubei Province, China, with samples collected between 24 and 30 December 2019, and two from Chinese individuals in Bangkok, Thailand, who had recently traveled from Wuhan, with samples collected between 8 and 13 January 2020) (Table S1). For this dataset, the 2019‐nCoV genome sequences were aligned using MAFFT v7.222 and then manually curated using BioEdit v7.2.5.

Phylogenetic analyses

To study the amount of evolutionary information contained in the dataset, likelihood‐mapping analysis was performed using TREE‐PUZZLE v5.3, with 30 000 randomly chosen quartets for the entire tree. For each sequence quartet, three unrooted tree topologies are possible. For a random sample of quartets, the likelihoods for the three possible topologies are reported as dots in an equilateral triangle. The distribution of points in different sections of this triangle indicates the tree‐likeness of the data: the three corners represent fully resolved tree topologies, indicating the presence of a tree‐like phylogenetic signal; the center represents the sets of points where all three trees are equally supported, indicating a star‐like phylogenetic signal; and the three areas on the sides indicate support for conflicting tree topologies. To infer phylogeny, we used the maximum‐likelihood approach with the Hasegawa‐Kishino‐Yano nucleotide substitution model with gamma‐distributed rate variation among sites (HKY+G) in PhyML v3.1. Support for the inferred relationships was determined by bootstrap analysis with 1000 replicates. To reconstruct the evolutionary history of 2019‐nCoV, we used Bayesian inference through a Markov chain Monte Carlo (MCMC) framework implemented in BEAST v1.8.4, with the BEAGLE library v2.1.2 used to increase computational performance. As no temporal signal was present in the dataset, we employed HKY+G, as well as a constant size coalescent tree prior and strict molecular clock model assuming an evolutionary rate of 4.59 × 10−4 substitutions per site per year to estimate the time to most recent common ancestor (TMRCA) for this dataset based on previous genetic analysis. We also placed a log‐normal prior (mean = 4.59 × 10−4 substitutions per site per year, 95% Bayesian credible interval: 1.178 × 10−4 to 3 × 10−3 substitutions per site per year) on the evolutionary rate with a strict molecular clock model based on previous studies. , , To ensure adequate mixing of model parameters, MCMC chains were run for 100 million steps with sampling every 10 000 steps from the posterior distribution. Convergence was evaluated by calculating the effective sample sizes of the parameters using Tracer v1.7.1. Trees were summarized as maximum clade credibility trees using TreeAnnotator after discarding the first 10% as burn‐in, and then visualized in FigTree v1.4.4 (http://tree.bio.ed.ac.uk/software/figtree).

Ancestral reconstructions of discrete traits

To perform ancestral reconstruction of the unobserved sampling locations (k = 2), discrete phylogeographic analyses were performed using the empirical tree distributions generated for our 2019‐nCoV dataset. The location exchange process was modeled using asymmetric continuous‐time Markov chains (CTMC) with an approximate CTMC conditional reference prior to the overall rate scaler and a uniform prior distribution. Bayesian analysis was run using BEAST v1.8.4 and BEAGLE library v2.1.2 for an MCMC chain of 100 million iterations, with 10 000 samples of all parameters and 10 000 trees for our dataset.

Identifying pathways of virus spread using graph hierarchies

To identify a subset of well‐supported migration events among 2019‐nCoV strains, we used a Bayesian stochastic search variable selection procedure with a hierarchical prior on location indicators (0‐1), which allow CTMC rates to shrink to zero with some probability, using BEAST v1.8.4. Strongly supported rates of virus movement (Bayes factor > 10) were identified using SpreaD3 v0.9.6.

RESULTS

Demographic characteristics of the dataset

The dataset included 12 genome sequences of 2019‐nCoV strains from Wuhan, China (n = 10), with sampling dates between 24 and 30 December 2019, and from Bangkok, Thailand (n = 2), with sampling date between 8 and 13 January 2020 (Table S1). The samples were primarily from Wuhan (83.33%), which is also the city of the original 2019‐nCoV outbreak.

Likelihood‐mapping and phylogenetic analyses

For this dataset, our likelihood‐mapping analysis revealed a strong star‐like topology signal (Figure 1), indicating very limited genetic diversity for 2019‐nCoV as of now. Furthermore, phylogenetic analysis of the 12 genome sequences of 2019‐nCoV also showed a lack of genetic diversity, in accordance with the likelihood‐mapping analysis (Figure 2). The correlation between sampling date and divergence to the root of midpoint rooted maximum‐likelihood phylogeny indicated no temporal signal (Figure 3). Therefore, we used an assumed evolutionary rate of 4.59 × 10−4 substitutions per site per year from the previous study. The estimated TMRCA for 2019‐nCoV was 9 November 2019 (95% Bayesian credible interval: 25 September 2019 and 19 December 2019). We also used an assumed evolutionary rate range (95% Bayesian credible interval: 1.178 × 10−4 to 3 × 10−3 substitutions per site per year) for this dataset based on previous studies, , , and estimated that the TMRCA for 2019‐nCoV was 3 September 2019 (95% Bayesian credible interval: 25 March 2019 and 20 December 2019).
Figure 1

Likelihood‐mapping analyses of 12 genome sequences of 2019‐nCOV. Likelihoods of three tree topologies for each possible quartet (or for a random sample of quartets) are denoted by a data point in an equilateral triangle. The distribution of points in the seven areas of the triangle reflects tree‐likeness of the data. Specifically, three corners represent fully resolved tree topologies; center represents an unresolved (star) phylogeny, and sides represent support for conflicting tree topologies

Figure 2

Maximum‐likelihood phylogeny of 12 genome sequences of 2019‐nCOV. Points are color‐coded by city of origin. Tree is midpoint rooted

Figure 3

Regression of root‐to‐tip genetic distance against the year of sampling for 2019‐nCOV. Points are color‐coded by the city of origin. Gray indicates the linear regression line

Likelihood‐mapping analyses of 12 genome sequences of 2019‐nCOV. Likelihoods of three tree topologies for each possible quartet (or for a random sample of quartets) are denoted by a data point in an equilateral triangle. The distribution of points in the seven areas of the triangle reflects tree‐likeness of the data. Specifically, three corners represent fully resolved tree topologies; center represents an unresolved (star) phylogeny, and sides represent support for conflicting tree topologies Maximum‐likelihood phylogeny of 12 genome sequences of 2019‐nCOV. Points are color‐coded by city of origin. Tree is midpoint rooted Regression of root‐to‐tip genetic distance against the year of sampling for 2019‐nCOV. Points are color‐coded by the city of origin. Gray indicates the linear regression line

Dynamics analysis of ancestral discrete traits

Our phylogeographic analysis revealed that the most probable root location of the 2019‐nCoV ancestor was in Wuhan, China (posterior probability = 0.98) (Figure 4). Our results also revealed that Wuhan acted as a diffusion epicenter compared to Bangkok. Most viral transmissions between epidemiologically linked cities were from Wuhan to Bangkok (mean estimate 1.96; 95% credible interval: 0.99‐2.42; Bayes factor = 1631).
Figure 4

Maximum clade credibility tree estimated from 12 genome sequences of 2019‐nCOV. The number near node represents the most probable geographic location of descendent branches. Nodes are color‐coded by the city of origin

Maximum clade credibility tree estimated from 12 genome sequences of 2019‐nCOV. The number near node represents the most probable geographic location of descendent branches. Nodes are color‐coded by the city of origin

DISCUSSION

From the 12 genome sequences of 2019‐nCoV, which included the city of origin and date of sampling, our likelihood‐mapping analysis confirmed that this novel virus has spread rapidly, indicating that the scale of transmission may be large given the virus spillover from an animal reservoir to humans and high secondary human‐to‐human transmission. On the basis of the confirmation from Professor Zhong, who is leading the government's expert panel on the outbreak, 14 healthcare workers appear to have been infected by one person who carried the virus. This suggests that epidemic super‐spreaders may have been generated at the very beginning of the outbreak and occurred once this novel coronavirus jumped from animals to humans. In addition, as mass travel for the Chinese New Year holiday could spread this virus farther and faster, it is critical to understand the Wuhan 2019‐nCOV outbreak and predict how easily the virus can transmit between humans and whether the outbreak has the potential to persist. On the basis of our time scale phylogenetic analysis of 12 genome sequences, the observed lack or limited diversity of the 2019‐nCOV strains suggests that the common ancestor of this virus in humans possibly occurred on 9 November 2019. However, we acknowledge that the estimates presented here relate only to a limited number of 2019‐nCOV genome sequences and our results are estimated with some uncertainty. Therefore, our conclusions should be considered preliminary and explained with caution. As the outbreak continues, it is possible that the addition of new sequences to our analyses could change the above results significantly. The generation of additional genomic datasets from Wuhan, as well as other regions of China and beyond, and from humans and animals that have transmitted 2019‐nCOV, will help clarify the extent of human‐to‐human transmission and identify the genetic changes that allowed this virus to jump species. Future studies should include comparative analyses of between‐host and within‐host transmission of 2019‐nCOV, and finer‐scale epidemiology to elucidate the transmission dynamics of different hosts through time and space. Our results show that the 2019‐nCOV outbreak has been shaped by migration and travel and therefore it is important to consider both local, national, and international strategies when designing interventions to end 2019‐nCOV transmission in China and beyond. Taken together, our results emphasize the importance of using likelihood‐mapping and phylogenetic analyses to provide insights into the origin, evolutionary history, and spread of 2019‐nCOV over a short time scale. These efforts, combined with epidemiological investigations, are needed to track changes in the 2019‐nCOV epidemic. Understanding these epidemic dynamics in real‐time is increasingly important for public health in terms of guiding prevention efforts.

CONFLICT OF INTERESTS

The authors declare that there are no conflict of interests.

AUTHOR CONTRIBUTIONS

XL conceived and designed the study, analyzed the data, and drafted the manuscript. XL, JZ, XW, and YL interpreted the data and provided critical comments. All authors reviewed and approved the final manuscript. Supplementary information Click here for additional data file.
  29 in total

1.  Many-core algorithms for statistical phylogenetics.

Authors:  Marc A Suchard; Andrew Rambaut
Journal:  Bioinformatics       Date:  2009-04-15       Impact factor: 6.937

2.  Maximum-likelihood analysis using TREE-PUZZLE.

Authors:  Heiko A Schmidt; Arndt von Haeseler
Journal:  Curr Protoc Bioinformatics       Date:  2007-03

3.  Severe acute respiratory syndrome coronavirus-like virus in Chinese horseshoe bats.

Authors:  Susanna K P Lau; Patrick C Y Woo; Kenneth S M Li; Yi Huang; Hoi-Wah Tsoi; Beatrice H L Wong; Samson S Y Wong; Suet-Yi Leung; Kwok-Hung Chan; Kwok-Yung Yuen
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-16       Impact factor: 11.205

4.  A previously undescribed coronavirus associated with respiratory disease in humans.

Authors:  Ron A M Fouchier; Nico G Hartwig; Theo M Bestebroer; Berend Niemeyer; Jan C de Jong; James H Simon; Albert D M E Osterhaus
Journal:  Proc Natl Acad Sci U S A       Date:  2004-04-08       Impact factor: 11.205

5.  Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia.

Authors:  Ali M Zaki; Sander van Boheemen; Theo M Bestebroer; Albert D M E Osterhaus; Ron A M Fouchier
Journal:  N Engl J Med       Date:  2012-10-17       Impact factor: 91.245

6.  Ecoepidemiology and complete genome comparison of different strains of severe acute respiratory syndrome-related Rhinolophus bat coronavirus in China reveal bats as a reservoir for acute, self-limiting infection that allows recombination events.

Authors:  Susanna K P Lau; Kenneth S M Li; Yi Huang; Chung-Tong Shek; Herman Tse; Ming Wang; Garnet K Y Choi; Huifang Xu; Carol S F Lam; Rongtong Guo; Kwok-Hung Chan; Bo-Jian Zheng; Patrick C Y Woo; Kwok-Yung Yuen
Journal:  J Virol       Date:  2010-01-13       Impact factor: 5.103

7.  SpreaD3: Interactive Visualization of Spatiotemporal History and Trait Evolutionary Processes.

Authors:  Filip Bielejec; Guy Baele; Bram Vrancken; Marc A Suchard; Andrew Rambaut; Philippe Lemey
Journal:  Mol Biol Evol       Date:  2016-04-23       Impact factor: 16.240

8.  Bayesian phylogenetics with BEAUti and the BEAST 1.7.

Authors:  Alexei J Drummond; Marc A Suchard; Dong Xie; Andrew Rambaut
Journal:  Mol Biol Evol       Date:  2012-02-25       Impact factor: 16.240

9.  MERS coronaviruses in dromedary camels, Egypt.

Authors:  Daniel K W Chu; Leo L M Poon; Mokhtar M Gomaa; Mahmoud M Shehata; Ranawaka A P M Perera; Dina Abu Zeid; Amira S El Rifay; Lewis Y Siu; Yi Guan; Richard J Webby; Mohamed A Ali; Malik Peiris; Ghazi Kayali
Journal:  Emerg Infect Dis       Date:  2014-06       Impact factor: 6.883

10.  Potential of large "first generation" human-to-human transmission of 2019-nCoV.

Authors:  Xingguang Li; Junjie Zai; Xiaomei Wang; Yi Li
Journal:  J Med Virol       Date:  2020-02-14       Impact factor: 2.327

View more
  42 in total

1.  [Interval between entry of mainland China and diagnosis in imported COVID-19 cases and factors contributing to delayed diagnosis in Guangdong Province].

Authors:  Lin Cheng; Huilan Xu
Journal:  Nan Fang Yi Ke Da Xue Xue Bao       Date:  2020-05-30

2.  Diagnosis of COVID-19, vitality of emerging technologies and preventive measures.

Authors:  Muhammad Asif; Yun Xu; Fei Xiao; Yimin Sun
Journal:  Chem Eng J       Date:  2021-05-07       Impact factor: 13.273

3.  Sleep Disturbance and Psychological Profiles of Medical Staff and Non-Medical Staff During the Early Outbreak of COVID-19 in Hubei Province, China.

Authors:  Wei Wang; Wenqin Song; Zhongyuan Xia; Yuhong He; Linghua Tang; Jiabao Hou; Shaoqing Lei
Journal:  Front Psychiatry       Date:  2020-07-22       Impact factor: 4.157

4.  Evolutionary history, potential intermediate animal host, and cross-species analyses of SARS-CoV-2.

Authors:  Xingguang Li; Junjie Zai; Qiang Zhao; Qing Nie; Yi Li; Brian T Foley; Antoine Chaillon
Journal:  J Med Virol       Date:  2020-03-11       Impact factor: 2.327

Review 5.  The Proteins of Severe Acute Respiratory Syndrome Coronavirus-2 (SARS CoV-2 or n-COV19), the Cause of COVID-19.

Authors:  Francis K Yoshimoto
Journal:  Protein J       Date:  2020-06       Impact factor: 2.371

6.  COVID-19: The Conjunction of Events Leading to the Coronavirus Pandemic and Lessons to Learn for Future Threats.

Authors:  Roger Frutos; Marc Lopez Roig; Jordi Serra-Cobo; Christian A Devaux
Journal:  Front Med (Lausanne)       Date:  2020-05-12

7.  Genomic Epidemiology of SARS-CoV-2 From Mainland China With Newly Obtained Genomes From Henan Province.

Authors:  Ning Song; Guang-Lin Cui; Qing-Lei Zeng
Journal:  Front Microbiol       Date:  2021-05-20       Impact factor: 5.640

8.  Research on the Influence Mechanism of Epidemic Information Disclosure on Screening Authenticity Information.

Authors:  Xiaohui Huang; Gang Li; Yu Wang; Xiaohui Li
Journal:  Procedia Comput Sci       Date:  2021-06-12

9.  Recombinant SARS-CoV-2 spike S1-Fc fusion protein induced high levels of neutralizing responses in nonhuman primates.

Authors:  Wenlin Ren; Hunter Sun; George F Gao; Jianxin Chen; Sean Sun; Rongqing Zhao; Guang Gao; Yalin Hu; Gan Zhao; Yuxin Chen; Xia Jin; Feng Fang; Jinggong Chen; Qi Wang; Sitao Gong; Wen Gao; Yufei Sun; Junchi Su; Ailiang He; Xin Cheng; Min Li; Chenxi Xia; Maohua Li; Le Sun
Journal:  Vaccine       Date:  2020-06-24       Impact factor: 3.641

10.  Evidence of increasing diversification of emerging Severe Acute Respiratory Syndrome Coronavirus 2 strains.

Authors:  Matías Castells; Fernando Lopez-Tort; Rodney Colina; Juan Cristina
Journal:  J Med Virol       Date:  2020-08-02       Impact factor: 20.693

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.