Literature DB >> 17931731

Positive natural selection in the evolution of human metapneumovirus attachment glycoprotein.

Abinash Padhi1, Bindhu Verghese.   

Abstract

Human metapneumovirus (hMPV), a newly discovered virus of the family Paramyxoviridae, has been associated with upper and lower respiratory tract infections in different age groups in many countries. The putative attachment (G) glycoprotein of this virus was previously reported to have shown more extensive nucleotide and deduced amino acid sequence polymorphism than any other genomic regions of this virus, leading to four sub-lineages. Using a maximum likelihood-based codon substitution model of sequence evolution, here we report that sequences of extracellular domain of 8 amino acid sites in lineage 1a, and 3 amino acid sites each in lineage 1b, 2a, and 2b have a higher rate of nonsynonymous substitutions (d(N)) than the synonymous substitutions (d(S)) with a posterior probability above 0.95, thus suggesting the evidence of adaptive evolution driven by Darwinian selection. Although it is unclear whether these amino acid adaptations are driven by differential immune pressure or some other factors, identification of these positively selected amino acid sites would help in better screening using epitope mapping technology to identify and localize the sites that can be recognized by the immune system. We also observed surprisingly higher nucleotide substitution rates per site, per year for each lineage of hMPV than the rates that were previously reported for the human respiratory syncytial virus, suggesting rapid evolutionary dynamics of hMPV.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17931731      PMCID: PMC7114232          DOI: 10.1016/j.virusres.2007.08.014

Source DB:  PubMed          Journal:  Virus Res        ISSN: 0168-1702            Impact factor:   3.303


Human metapneumovirus (hMPV) of the family Paramyxoviridae and subfamily Pneumoviridae was first discovered in The Netherlands from infants and children suffering from acute respiratory tract disease (van den Hoogen et al., 2001). Since then considerable progress has been made in identification and characterization (Cote et al., 2003, Mackay et al., 2003, Ebihara et al., 2004, Maertzdorf et al., 2004, Skiadopoulos et al., 2004, Hamelin and Boivin, 2005, Leung et al., 2005, Gray et al., 2006a, Gray et al., 2006b, Ulbrand et al., 2006, van den Hoogen, 2007) as well as in understanding its genetic diversity (Bastien et al., 2003, Bastien et al., 2004, Biacchesi et al., 2003, Ishiguro et al., 2004, Peret et al., 2004, Carr et al., 2005, Ludewick et al., 2005, Galiano et al., 2006, Boivin et al., 2007). To date this virus has been identified in many countries from different age groups and reported to cause upper respiratory tract infections, flu-like infections, and has also been associated with lower respiratory tract infections (van den Hoogen et al., 2001, Stockton et al., 2002, Biacchesi et al., 2003, Bastien et al., 2003, Bastien et al., 2004, Ishiguro et al., 2004, Peret et al., 2004, Carr et al., 2005, Ludewick et al., 2005, Fouchier et al., 2005, Regev et al., 2006, Galiano et al., 2006, Kahn, 2006, Gray et al., 2006a, Gray et al., 2006b), a pattern similar to that reported for human respiratory syncytial virus (HRSV). Although comparative genome mapping analyses suggested that this virus has structural and functional similarities with HRSV (Kahn, 2006), recent studies reported that the attachment (G) glycoprotein of these paramimyxoviruses exhibit extensive nucleotide and amino acid variation, with most differences located in the extracellular domain (Peret et al., 2004, Kahn, 2006). Therefore, G-protein has been widely used to infer evolutionary relationships among the isolates from different geographic regions (e.g., Peret et al., 2004, Ishiguro et al., 2004). Although phylogenetic analyses of hMPVs from the complete nucleotide coding sequences revealed the existence of two major lineages of hMPVs (Ishiguro et al., 2004), recent analyses based on G-protein phylogeny revealed the existence of two minor sub-groups within each major lineage (Peret et al., 2004). Despite the knowledge of identification and characterization of hMPVs, the possible mechanism by which hMPV G-proteins have evolved is poorly understood. Earlier studies on the molecular evolution of HRSV G-protein reported that certain amino acid sites that correspond to sites of O-glycosylation, or amino acid sites that were previously described as monoclonal antibody-induced in vitro escape mutants, are under positive selection and thus showed strong association between these positively selected sites and the mapped neutralizing epitopes (Zlateva et al., 2004). Recently, Zhang et al. (2006) also reported that certain amino acid sites in severe acute respiratory syndrome (SARS) coronavirus (CoV) are evolved by positive Darwinian selection. These lines of evidence suggest an interesting evolutionary pattern of the respiratory viruses. At the genomic level, whether a gene, or a particular amino acid within a gene, is under relaxed selection or remains functionally constrained throughout evolution can be detected by comparing the rate of nonsynonymous nucleotide substitutions per nonsynonymous site (dN) with that of synonymous substitutions per synonymous site (dS) (Hughes and Nei, 1989). If dN/dS (hereafter referred as ω) is greater than one, then positive selection is said to be operating. Alternatively, if ω  < 1, the gene is under strong purifying selection and presumed to be functionally constrained. Identifying genes that have evolved by adaptation is central to understanding molecular evolution. However, not all amino acid differences observed among the closely related sequences from ecologically/geographically isolated strains are adaptive (e.g., Zlateva et al., 2004). Therefore, analyzing patterns of amino acid substitutions would provide insight into understanding protein adaptation by identifying candidate codon sites on which positive selection has been operating. Identifying the positively selected amino acid sites would also help in further immunization studies. Maximum likelihood (ML)-based codon substitution models, which account for variable ω ratios among codon sites and detect codon sites that are subjected to positive selection (Yang et al., 2000), have been widely used in detecting positive selection in a number of respiratory viral groups (e.g., Zlateva et al., 2004, Zhang et al., 2006). Here we used Yang et al's (2000) ML codon substitution models to test whether there was evidence at the nucleotide sequence level that a subset of amino acid sites in G-protein of hMPV sequences that represent each subgroup has been under positive selection. In addition, we used a Bayesian MCMC approach implemented in BEAST version 1.4.4 (Drummond and Rambaut, 2006) that utilize the number and temporal distribution of genetic differences among viruses sampled at different times (Drummond et al., 2002, Drummond et al., 2006) to estimate the evolutionary change for each lineage. A total of 144 published unique nucleotide coding sequences of G-protein representing four sub-lineages (1a = 46, 1b = 40, 2a = 38, 2b = 20) were retrieved from GenBank (Table 1 ). Sequences were aligned using Mesquite version 1.2 (Maddison and Maddison, 2006), DAMBE version 4.5.2 (Xia, 2000, Xia and Xie, 2001), and BioEdit version 7.0.5.3 (Hall, 1999) software packages. To infer phylogenetic relationship among these strains of hMPVs, we reconstructed a neighbor joining tree from their predicted amino acid sequence data with p-distance implemented in MEGA version 3.1 (Kumar et al., 2004). Using the same program, nodal supports were estimated with 10,000 nonparametric bootstrap replicates. For selection analyses, we reconstructed unrooted ML trees for each lineage from their respective nucleotide sequence data using the appropriate nucleotide substitution model identified by the hierarchical likelihood ratio test implemented in Modeltest version 3.5 (Posada and Crandall, 1998). PHYML version 2.4.4 (Guindon and Gascuel, 2003) was used to conduct ML analyses.
Table 1

GenBank accession number, strain name, country of origin, and the year of isolation of 144 unique hMPV G-protein sequences used in the study

GenBank No.Strain nameCountry of originYear of IsolationSourceGroup
AF37133700-1The Netherlands van den Hoogen et al. (2001)1a
AY296015FL/4/01The Netherlands2001van den Hoogen et al. (2001)1a
AY296016FL/3/01The Netherlands2001van den Hoogen et al. (2001)1a
AY296017FL/8/01The Netherlands2001van den Hoogen et al. (2001)1a
AY296018FL/10/01The Netherlands2001van den Hoogen et al. (2001)1a
AY296019NL/10/01The Netherlands2001van den Hoogen et al. (2001)1a
AY296020NL/2/02The Netherlands2002van den Hoogen et al. (2001)1a
AY327802201-7182AustraliaGenBank1a
AY327803201-4199AustraliaGenBank1a
AY327804Q01-6410AustraliaGenBank1a
AY327805Q01-7262AustraliaGenBank1a
AY327806Q01-6346AustraliaGenBank1a
AY327807Q01-7292AustraliaGenBank1a
AY327808Q01-7252AAustraliaGenBank1a
AY327809Q01-7292AustraliaGenBank1a
AY327810Q016297AustraliaGenBank1a
AY485232hMPV13-2000Canada2000Peret et al. (2004)1a
AY485235hMP V193-2002Canada2002Peret et al. (2004)1a
AY485236hMPV22-2001Canada2001Peret et al. (2004)1a
AY485238hMPV23-2001Canada2001Peret et al. (2004)1a
AY485251hMPV81-1999Canada1999Peret et al. (2004)1a
AY485254hMPV86316-2002Canada2002Peret et al. (2004)1a
AY485255hMPV88448-2002Canada2002Peret et al. (2004)1a
AY485256hMPV88470-2002Canada2002Peret et al. (2004)1a
AY530092JPS03-180Japan2003Ishiguro et al. (2004)1a
AY574225CAN34-02Canada2002Ishiguro et al. (2004)1a
AY574226CAN40-02Canada2002Ishiguro et al. (2004)1a
AY574228CAN97-02Canada2002Ishiguro et al. (2004)1a
AY574231CAN187-02Canada2002Ishiguro et al. (2004)1a
AY574237CAN216-02Canada2002Ishiguro et al. (2004)1a
AY574243CAN464-02Canada2002Ishiguro et al. (2004)1a
AY574244CAN532-02Canada2002Ishiguro et al. (2004)1a
AY848881RSA/39/01South Africa2001Ludewick et al. (2005)1a
AY848882RSA/1/02South Africa2002Ludewick et al. (2005)1a
AY848885RSA/4/02South Africa2002Ludewick et al. (2005)1a
AY848887RSA/17/02South Africa2002Ludewick et al. (2005)1a
AY848889RSA/31/01South Africa2001Ludewick et al. (2005)1a
AY848890RSA/33/01South Africa2001Ludewick et al. (2005)1a
AY848893RSA/8/02South Africa2002Ludewick et al. (2005)1a
AY848896RSA/3/02South Africa2002Ludewick et al. (2005)1a
AY848897RSA/10/02South Africa2002Ludewick et al. (2005)1a
AY848901RSA/14/02South Africa2002Ludewick et al. (2005)1a
AY848903RSA/34/01South Africa2001Ludewick et al. (2005)1a
DQ312444IA3-2002USA2002Gray et al., 2006a, Gray et al., 2006b1a
DQ362949Arg/1/03Argentina2003Galiano et al. (2006)1a
DQ362950Arg/2/02Argentina2002Galiano et al. (2006)1a
AY296021NL/17/00The Netherlands2000van den Hoogen et al. (2004)1b
AY296022NL/1/81The Netherlands1981van den Hoogen et al. (2004)1b
AY296023NL/1/93The Netherlands1993van den Hoogen et al. (2004)1b
AY296025NL/3/93The Netherlands1993van den Hoogen et al. (2004)1b
AY296026NL/1/95The Netherlands1995van den Hoogen et al. (2004)1b
AY296028NL/13/96The Netherlands1996van den Hoogen et al. (2004)1b
AY296029NL/22/01The Netherlands2001van den Hoogen et al. (2004)1b
AY296030NL/24/01The Netherlands2001van den Hoogen et al. (2004)1b
AY296032NL/29/01The Netherlands2001van den Hoogen et al. (2004)1b
AY296033NL/302The Netherlands2002van den Hoogen et al. (2004)1b
AY485234hMPV17-2000Canada2000Peret et al. (2004)1b
AY485250hMPV80-1999Canada1999Peret et al. (2004)1b
AY530090JPS03-176Japan2003Ishiguro et al. (2004)1b
AY530091JPS03-178Japan2003Ishiguro et al. (2004)1b
AY530093JPS03-187Japan2003Ishiguro et al. (2004)1b
AY530095JPS03-240Japan2003Ishiguro et al. (2004)1b
AY574227CAN58-02Canada2002Bastien et al. (2004)1b
AY574229CAN164-02Canada2002Bastien et al. (2004)1b
AY574230CAN182-02Canada2002Bastien et al. (2004)1b
AY574234CAN197-02Canada2002Bastien et al. (2004)1b
AY574235CAN208-02Canada2002Bastien et al. (2004)1b
AY574236CAN215-02Canada2002Bastien et al. (2004)1b
AY574241CAN348-02Canada2002Bastien et al. (2004)1b
AY848910RSA/27/00South Africa2000Ludewick et al. (2005)1b
AY848911RSA/7/00South Africa2000Ludewick et al. (2005)1b
AY848912RSA/26/00South Africa2000Ludewick et al. (2005)1b
AY848914RSA/7/01South Africa2000Ludewick et al. (2005)1b
AY848915RSA/20/00South Africa2000Ludewick et al. (2005)1b
AY848916RS A/20/01South Africa2001Ludewick et al. (2005)1b
AY848917RSA/49/00South Africa2000Ludewick et al. (2005)1b
AY848919RSA/44/00South Africa2000Ludewick et al. (2005)1b
DQ270215BJ1819China2000GenBank1b
DQ312449IA-8-2003USA2003Gray et al. (2006a)1b
DQ270217BJ1824ChinaGenBank1b
DQ312458IA-17-2003USA2003Gray et al. (2006a)1b
DQ312462IA21-2004USA2004Gray et al. (2006a)1b
DQ312463IA22-2004USA2004Gray et al. (2006a)1b
DQ312464IA23-2004USA2004Gray et al. (2006a)1b
DQ362952Arg/3/00Argentina2000Galiano et al. (2006)1b
NC_004148CAN97-83Canada1997Biacchesi et al. (2003)1b
AY296040NL/1/94The Netherlands1994van den Hoogen et al. (2004)2a
AY296041NL/1/82The Netherlands1982van den Hoogen et al. (2004)2a
AY296042NL/1/96The Netherlands1996van den Hoogen et al. (2004)2a
AY296044NL/9/00The Netherlands2000van den Hoogen et al. (2004)2a
AY296045NL/3/01The Netherlands2001van den Hoogen et al. (2004)2a
AY296046NL/4/01The Netherlands2001van den Hoogen et al. (2004)2a
AY296047UK/5/01UK2001van den Hoogen et al. (2004)2a
AY297748CAN98-75Canada1998Biacchesi et al. (2003)2a
AY485243hMPV73-1998Canada1998Peret et al. (2004)2a
AY485244hMPV74-1998Canada1998Peret et al. (2004)2a
AY485245hMPV75-1998Canada1998Peret et al. (2004)2a
AY485246hMPV76-1998Canada1998Peret et al. (2004)2a
AY485247hMPV77-1998Canada1998Peret et al. (2004)2a
AY485248hMPV78-1998Canada1998Peret et al. (2004)2a
AY485249hMPV79-1998Canada1998Peret et al. (2004)2a
DQ270219BJ1921ChinaGenBank2a
DQ270220BJ2034ChinaGenBank2a
DQ270221BJ4879ChinaGenBank2a
DQ270222BJ4944ChinaGenBank2a
DQ270223BJ5128ChinaGenBank2a
DQ270224BJ5129ChinaGenBank2a
DQ312443IA2-2002USA2002Gray et al. (2006a)2a
DQ312457IA16-2003USA2003Gray et al. (2006a)2a
DQ312460IA19-2003USA2003Gray et al. (2006a)2a
DQ393715Peru1-2002USA2002Gray et al. (2006b)2a
DQ843658BJ1816ChinaGenBank2a
AY848861RSA/4/00South Africa2000Ludewick et al. (2005)2a
AY848862RSA/71/00South Africa2000Ludewick et al. (2005)2a
AY848864RSA/37/00South Africa2000Ludewick et al. (2005)2a
AY848865RSA/16/00South Africa2000Ludewick et al. (2005)2a
AY848866RSA/12/00South Africa2000Ludewick et al. (2005)2a
AY848868RSA/29/00South Africa2000Ludewick et al. (2005)2a
AY848869RSA/58/00South Africa2000Ludewick et al. (2005)2a
AY848875RSA/54/00South Africa2000Ludewick et al. (2005)2a
AY848878RSA/23/00South Africa2000Ludewick et al. (2005)2a
AY848879RSA/90/00South Africa2000Ludewick et al. (2005)2a
AY848880RSA/93/00South Africa2000Ludewick et al. (2005)2a
DQ312453IA12-2003USA2003Gray et al. (2006a)2a
AY296034NL/1/99The Netherlands1999van den Hoogen et al. (2004)2b
AY296035NL/11/00The Netherlands2000van den Hoogen et al. (2004)2b
AY296036NL/12/00The Netherlands2000van den Hoogen et al. (2004)2b
AY296037NL/5/01The Netherlands2001van den Hoogen et al. (2004)2b
AY296038NL/9/01The Netherlands2001van den Hoogen et al. (2004)2b
AY296039NL/21/01The Netherlands2001van den Hoogen et al. (2004)2b
AY485242hMPV33-2001Canada2001Peret et al. (2004)2b
AY485252hMPV82-1997Canada1997Peret et al. (2004)2b
AY530089JPS02-76Japan2002Ishiguro et al. (2004)2b
DQ312445IA4-2002USA2002Gray et al. (2006a)2b
DQ312446IA5-2002USA2002Gray et al. (2006a)2b
DQ312448IA7-2003USA2003Gray et al. (2006a)2b
DQ312454IA13-2003USA2003Gray et al. (2006a)2b
DQ312455IA14-2003USA2003Gray et al. (2006a)2b
DQ312461IA20-2003USA2003Gray et al. (2006a)2b
DQ393716Peru2-2002Peru2002Gray et al. (2006b)2b
DQ393717Peru3-2003Peru2003Gray et al. (2006b)2b
DQ393718Peru4-2003Peru2003Gray et al. (2006b)2b
DQ393719Peru5-2003Peru2003Gray et al. (2006b)2b
AY530094JPS03-194Japan 2003Ishiguro et al. (2004) 2b
GenBank accession number, strain name, country of origin, and the year of isolation of 144 unique hMPV G-protein sequences used in the study Overall substitution rate (nucleotide substitutions per site per year) of each lineage was estimated using the Bayesian skyline model, with both relaxed (variable) molecular clock (with uncorrelated lognormal model) and strict clock implemented in the BEAST version 1.4.4 (Drummond and Rambaut, 2006). This model employs a Bayesian MCMC approach and utilize the number and temporal distribution of genetic differences among viruses sampled at different times (Drummond et al., 2002, Drummond et al., 2006). Bayesian skyline plots with 10 grouped intervals were reconstructed to infer demographic history (Drummond et al., 2005). Phylogenies were evaluated using a chain length of 30 million states under the HKY85 + Γ4 substitution model and with uncertainty in the data reflected in the 95% high-probability density (HPD) intervals. Convergence of trees was checked using Tracer version 1.3 (Rambaut and Drummond, 2006). To determine the synonymous and nonsynonymous sequence divergence distribution pattern across the entire coding region of each lineage (Fig. 1 ), we used a sliding window approach (window size = 6, step = 1) implemented in DNAsp version 4.0 (Rozas et al., 2003).
Fig. 1

NJ tree inferred from 144 amino acid sequences of human metapneumovirus G glycoprotein representing four lineages. Nodal support is mentioned at the base of the node. The sliding window analyses of respective lineages show the synonymous and nonsynonymous divergence.

NJ tree inferred from 144 amino acid sequences of human metapneumovirus G glycoprotein representing four lineages. Nodal support is mentioned at the base of the node. The sliding window analyses of respective lineages show the synonymous and nonsynonymous divergence. To assess whether positive selection is operating in any codon sites, we used the alignment and ML trees of respective lineages as input for the CODEML program of PAML version 3.15 (Yang, 1997). The PAML program incorporates six different codon substitution models that account for variable ω for each codon site. The six codon substitution models are: M0 (one-ratio), M1a (nearly neutral), M2a (positive selection), M7 (β distribution; 0 ≤  ω  ≤ 1), M8 (β  +  ω  > 1: continuous) (Yang et al., 2000), and M8a (β  +  ω  = 1) (Swanson et al., 2003). The M0 model estimates overall ω for the data. The M1a model estimates a single parameter, p 0, with ω 0  = 0, and the remaining sites with frequency p 1 (p 1  = 1 −  p 0) assuming ω 1  = 1. The M2a model adds a class of positively selected sites with frequency p 2 (where p 2  = 1 −  p 1  −  p 0) with ω 2 estimated from the data. In the M7 model, ω follows a beta distribution and is allowed to vary between 0 and 1, and two parameters (p and q) of the beta distribution are estimated from the data. In the M8 model, a proportion, p 0 , of sites have ω drawn from the beta distribution and the remaining sites with proportion p 1 are positively selected (ω 1  > 1). The LRTs between nested models were conducted by comparing twice the difference in log-likelihood values (2lnΔl) against a χ 2-distribution with degrees of freedom equal to the difference in the number of parameters between models (Yang, 1997). Three LRTs were conducted. The first comparison was made between M1a, which allows for two site classes (0 <  ω  < 1, ω  = 1), and M2a, which allows three site classes (0 <  ω  < 1, ω  = 1 or ω  > 1). The second comparison was between M7 and M8, and the last comparison was between M8 and M8a, in which ω for M8a was constrained to 1. In all LRTs good evidence for positive selection is found if the LRT indicates that models that allow for selection (i.e. M2a and M8; alternative models) are significantly better than their respective null models (M1a, M7 and M8a) (Yang, 1997). Posterior probabilities of the inferred positively selected sites were estimated by the Bayes empirical Bayes (BEB) approach that takes sampling errors into account (Yang et al., 2005). Consistent with earlier studies (Peret et al., 2004), G-protein based phylogeny in the present study has also revealed the existence of multiple lineages of this virus (Fig. 1). All four lineages showed some degree of spatial structure; however, few strains in each lineage did not show any spatial structure, indicating extensive viral gene flow across the regions in a given epidemic season. Relatively weak temporal structure across the regions further suggested that either certain strains can remain stable for more than one epidemic season (e.g., HRSV, Zlateva et al., 2004, Zlateva et al., 2005), or mutations might not have occurred in a linear fashion with the preservation of changes in the circulating viral strains. Thus, virus genotypes would frequently appear and disappear along with new mutations in the populations. However, HRSV (Zlateva et al., 2004, Zlateva et al., 2005) showed a strong correlation between the accumulation of genetic divergence and the isolation date of the sequences. Based on the relaxed clock assumption, the evolutionary rate of each major lineage of hMPVs (1 and 2; Table 2 ) are 5.18 × 10−3 and 6.49 × 10−3  substitutions/site/year, respectively. Although these rates are compatible with the substitution rates reported for influenza viruses (Chen and Holmes, 2006), these rates are higher than the estimates of HRSV (HRSV A: 1.83 × 10−3, Zlateva et al., 2004; HRSV B: 1.95 × 10−3, Zlateva et al., 2005; HRSV-BA: 3 × 10−3  substitutions/site/year, Trento et al., 2006; HRSV-A: 2.6 × 10−3, HRSV-B: 3.5 × 10−3, Matheson et al., 2006) and other paramyxoviruses (e.g., measles: Woelk et al., 2002). These discrepancies in the evolutionary rates could be associated with the differential selective pressures targeting different genomic regions. For example, the presence of a greater number of adaptively evolved amino acid sites in the gene can cause an accelerated rate of evolution. As a consequence, the overall evolutionary rate is expected to be higher (Trento et al., 2006). Both major lineages of hMPVs showed interesting population dynamics (Fig. 2 ). The times to the most recent common ancestor for lineage 1 and 2 are 49.452 (29.08–70.8) and 26.091 (21–36.651) years, respectively. While the population size of lineage 1 recently declined, the lineage 2 population size did not show any declining trend. This contrast in the population size could be associated with fitness of the virus.
Table 2

Mean nucleotide substitution rates (95% HPD interval in parenthesis) in hMPV G-gene estimated using Bayesian MCMC approach, with both relaxed and strict clock

LineageRelaxed clock
Strict clock
Substitution rate (×10−3 substitutions/site/year)Likelihood scoreSubstitution rate (×10−3 substitutions/site/year)Likelihood score
1a4.58 (2.400–7.048)−2250.4814.152 (2.235–6.196)−2256.156
1b5.344 (3.995–6.898)−2946.8244.817 (3.809–5.889)−2975.021
2a6.139 (4.318–7.825)−2530.2805.275 (3.733–6.798)−2556.508
2b7.865 (4.060–11.63)−1840.0663.795 (2.687–7.625)−1868.507
1(a + b)5.182 (3.761–6.781)−4689.1614.621 (3.639–5.647)−4702.717
2(a + b)6.494 (4.599–8.438)−3783.3204.770 (3.555–6.012)−3833.563

Estimates with relaxed clock are better fit to the data.

Fig. 2

Skyline plots estimated from Bayesian MCMC analyses of hMPV G-protein sequences belong to lineage 1(a + b) and lineage 2(a + b). Population size (in Y-axis) is expressed in logarithmic scale. The solid line shows the median estimate of population size (Ne × g) throughout the given time period. The grey area gives the 95% HPD interval of these estimates.

Mean nucleotide substitution rates (95% HPD interval in parenthesis) in hMPV G-gene estimated using Bayesian MCMC approach, with both relaxed and strict clock Estimates with relaxed clock are better fit to the data. Skyline plots estimated from Bayesian MCMC analyses of hMPV G-protein sequences belong to lineage 1(a + b) and lineage 2(a + b). Population size (in Y-axis) is expressed in logarithmic scale. The solid line shows the median estimate of population size (Ne × g) throughout the given time period. The grey area gives the 95% HPD interval of these estimates. Despite the weak temporal and spatial structure, viral strains belonging to lineage 1a (Australia, Canada, The Netherlands, South Africa, USA, Argentina, and Japan) and 1b (Canada, The Netherlands, South Africa, USA, Japan, China, and Argentina) have a wider geographic spread than the strains belonging to lineage 2a (Canada, UK, The Netherlands, USA, China, and South Africa) and 2b (The Netherlands, Canada, USA, Peru, and Japan), indicating that fitness of the viral strains might have played a crucial role in the uneven distribution across the wide geographic regions. The extensive polymorphisms of the hMPV G-gene may have resulted from mutations occurring during virus propagation in cell culture; however, Peret et al. (2004) reported identical sequences of the same viral strain after multiple passages, and thus, the observed variation in the G-gene of hMPVs due to multiple passages is more unlikely. However, it is unclear whether the hMPV G-gene experienced differential selection pressures, or all the deduced amino acid sites evolved due to stochastic mutational processes? Sliding window analyses of each lineage revealed that in the majority of regions synonymous divergence exceeds the corresponding nonsynonymous divergence, thus suggesting that the G-gene of hMPV is influenced by purifying selection (Fig. 1). However, a few coding regions in all the lineages showed relatively higher nonsynonymous divergence than synonymous divergence, therefore indicating the pervasive role of positive selection in certain amino acid sites. To identify the codon sites that are positively selected, we performed ML-based codon substitution analyses. Consistent with sliding window results, the M0 model revealed that the average ω for each lineage is less than one (Table 3 ), thus suggesting each lineage experienced purifying selection. However, comparison of the models that assume positive selection (M2a, M8) with the models (M1a, M7, and M8a) that assume no positive selection, detected approximately 6%, 1.3%, 7.3%, and 3% positively selected codons in lineage 1a, 1b, 2a, and 2b, respectively (Table 3). There are eight positively selected sites (site 93, 105, 106, 154, 158, 171, 173, and 188) with posterior probability ≥0.95 within lineage 1a, whereas lineage 1b (site 146, 183, and 196) and lineage 2a (85, 232, and 239) each have three positively selected sites with posterior probability ≥0.95. Lineage 2b has only two positively selected sites (site 100 and 105) with posterior probability ≥0.95. Except site 105, which is positively selected in lineage 1a and 2b, none of the positively selected sites are overlapping among the lineages. It is unclear whether these positively selected sites are associated with the fitness of this virus. Research with monoclonal antibodies has shown that the hMPV F-protein carries neutralizing epitopes (Skiadopoulos et al., 2004, Ulbrand et al., 2006); therefore, antigenic variation due to immune selection in the hMPV F-protein is more likely. Although, the overall excess of synonymous substitutions at the hMPV G-protein indicates that host immune selection might not be the dominant selective force, the findings of several hotspots of nonsynonymous substitutions in this protein suggests that host immune selection might also play a role in maintaining diversity. Recent study has shown that a majority of the neutralizing epitopes in the HRSV G-gene is strongly associated with positively selected sites, and some of the positively selected sites correspond to the sites of O-glycosylation (Zlateva et al., 2004). Like HRSV, although all the positively selected codons of hMPV G-gene are located in the extracellular domain and some of them correspond to sites of O-glycosylation, the putative role is still unclear for these positively selected sites, as is whether some of these positively selected sites are associated with the region of antigenic determinants. We intended to map these positively selected sites with the HRSV G-protein to see whether the same sites were also positively selected in HRSV (Zlateva et al., 2004, Zlateva et al., 2005); however, the predicted G-gene amino acid sequences of the two viruses could not be aligned (van den Hoogen et al., 2002, Kahn, 2006). Although a vast majority of codon sites (>95% in most cases) are shown to have been under purifying selection, significantly higher ω values (>1) at certain codon sites (Table 3) indicate the hMPV G-gene is under positive selection. Identification of these positively selected amino acid sites would help in better screening using epitope mapping technology to identify and localize the sites that can be recognized by the immune system. Knowledge of sites that have adaptively evolved can greatly cut the cost of these screening processes and thereby help in developing better immunization techniques (Mes and van Putten, 2007).
Table 3

Test for variable selection pressures on different codons based on ML-based codon substitution models of Yang et al. (2000)

ModelFree parametersParameter estimatesLikelihood scoresModel comparison (2Δl, d.f., p)Positively selected sitesω ± S.E.
Lineage 1a
 M0: One-ratio1ω = 0.6152−2510.069374None
 M1a: Nearly neutral1ω0 = 0.1, ω1 = 1, (p0 = 0.62, p1 = 0.38)−2473.399503Not allowed



 M2a: Positive selection3ω0 = 0, ω1 = 1, ω2 = 7.31; (p0 = 0.62, p1 = 0.32, p2 = 0.06)−2444.710131(M1a vs. M2a), 57.378744, d.f. = 2, p = 0.000093-H(0.989)7.523 ± 1.614
105-Y (0.987)7.504 ± 1.640
106-F (1.000)7.595 ± 1.464
143-K (0.748)5.832 ± 3.077
154-P (1.000)7.594 ± 1.466
155-R(0.667)5.263 ± 3.239
158-S (0.980)7.456 ± 1.718
171-R(0.958)7.323 ± 1.953
173-T (0.971)7.380 ± 1.805
176-T (0.583)4.674 ± 3.305
188-T (0.973)7.393 ± 1.788



 M7: β2p = 0.1085, q = 0.1183−2474.985643Not allowed



 M8: β + ωs > 14p0 = 0.94, p1 = 0.06, p = 0.36716, q = 0.47347, ω = 6.83−2444.453952(M7 vs. M8), 61.063382, d.f. = 2, p = 0.000093-H (0.993)7.265 ± 1.428
105Y (0.993)7.264 ± 1.426
106-F (1.000)7.307 ± 1.332
143-K (0.814)6.055 ± 2.779
154-P (1.000)7.306 ± 1.332
155-R (0.744)5.575 ± 3.020
156-T (0.649)4.762 ± 2.999
158-S (0.990)7.242 ± 1.468
171-R(0.970)7.119 ± 1.718
173-T (0.989)7.228 ± 1.480
176-T (0.664)5.028 ± 3.191
188-T (0.991)7.239 ± 1.460



 M8a: β + ωs = 13p0 = 0.62, p1 = 0.38, p = 11.37, q = 99, ω = 1−2473.400411(M8 vs. M8a), 57.892918, d.f. = 1, p = 0.0000Not allowed



Lineage 1b
 M0: One-ratio1ω = 0.4649−3088.137934None
 M1a: Nearly neutral1ω0 = 0.166, ω1 = 1, (p0 = 0.72, p1 = 0.28)−3048.588195Not allowed



 M2a: Positive selection3ω0 = 0.179, ω1 = 1, ω2 = 9.729; (p0 = 0.696, p1 = 0.289, p2 = 0.013)−3028.791341(M1a vs. M2a), 39.593708, d.f. = 2, p = 0.0000146-P (1.00)8.326 ± 1.713
183-F (1.00)8.325 ± 1.714
196-L (0.999)8.316 ± 1.732



 M7: β2p = 0.393, q = 0.546−3054.125576Not allowed



 M8: β + ωs > 14p0 = 0.89, p1 = 0.11p = 1.777, q = 4.03, ω = 2.84−3034.377594(M7 vs. M8), 39.495964, d.f. = 2, p = 0.0000146-P (1.000)5.183 ± 1.880
157-F (0.718)3.601 ± 2.137
183-F (1.000)5.183 ± 1.880
196-L (0.999)5.181 ± 1.882
199-S (0.573)2.935 ± 2.110



 M8a: β + ωs = 13p0 = 0.72, p1 = 0.28, p = 19.98, q = 99, ω = 1−3048.6126(M8 vs. M8a), 28.470012, d.f. = 1, p = 0.0000Not allowed



Lineage 2a
 M0: One-ratio1ω = 0.6898−2927.296491None
 M1a: Nearly neutral1ω0 = 0.248, ω1 = 1, (p0 = 0.565, p1 = 0.435)−2913.654666Not allowed



 M2a: Positive selection3ω0 = 0.382, ω2 = 4.487; (p0 = 0.69, p1 = 0.23, p2 = 0.073)−2898.698295(Mla vs. M2a), 29.912742, d.f. = 2, p = 0.000085-L (1.000)5.340 ± 1.570
93-Q (0.888)4.839 ± 2.038
105-L (0.878)4.694 ± 1.966
109-S(0.913)4.898 ± 1.899
113-L (0.732)3.959 ± 2.193
121-P (0.510)2.849 ± 2.078
180-L (0.535)3.024 ± 2.228
202-S (0.508)2.890 ± 2.196
232-Y (0.989)5.295 ± 1.627
239-P (0.975)5.226 ± 1.690



 M7: β2p = 0.606, q = 0.379−2917.092133Not allowed



 M8: β + ωs > 14p0 = 0.89, p1 = 0.11, p = 28.418, q = 31.77, ω = 3.65−2898.947133(M7 vs. M8), 36.29, d.f. = 2, p = 0.000085-L (1.000)5.381 ± 1.324
93-Q (0.900)4.945 ± 1.872
105-L (0.920)4.988 ± 1.751
109-S (0.939)5.093 ± 1.673
113-L (0.777)4.282 ± 2.179
121-P (0.528)3.028 ± 2.275
180-L (0.546)3.173 ± 2.375
202-S (0.519)3.038 ± 2.358
232-Y (0.992)5.351 ± 1.376
239-P (0.983)5.306 ± 1.439



 M8a: β + ωs = 13p0 = 0.57, p1 = 0.43, p = 33.25, q = 99, ω = 1−2913.719868(M8 vs. M8a), 29.54547 d.f. = 1, p = 0.0000Not allowed



Lineage 2b
 M0: One-ratio1ω = 0.7065−1855.459267None
 M1a: Nearly neutral1ω0 = 0, ω1 = 1, (p0 = 0.49, p1 = 0.51)840.547215Not allowed



 M2a: Positive selection3ω0 = 0, ω1 = 1, ω2 = 10.0195; (p0 = 0.45, p1 = 0.52, p2 = 0.03)−1829.903873(M1a vs. M2a), 21.286684, d.f. = 2, p = 0.00002100-E (0.999)7.607 ± 2.041
105-P (0.971)7.432 ± 2.288
109-P (0.911)6.994 ± 2.703
213-R (0.682)5.477 ± 3.516



 M7: β2p = 0.00517, q = 0.005−1840.570751Not allowed



 M8: β + ωs > 14p0 = 0.97, p1 = 0.03, p = 0.005, q = 0.005, ω = 9.6830.002479(M7 vs. M8), 21.136554, d.f. = 2, p = 0.00003100-E (1.000)6.823 ± 2.093
105-P (0.985)6.745 ± 2.196
109-P (0.958)6.561 ± 2.375
114-Y (0.515)3.679 ± 3.226
116-G (0.572)4.071 ± 3.293
162-E (0.606)4.080 ± 3.052
201-T (0.500)3.579 ± 3.206
213-R (0.770)5.424 ± 3.159
220-P (0.629)4.385 ± 3.236



 M8a: β + ωs = 13p0 = 0.49, p1 = 0.51, p = 0.005, q = 2.785, ω = 1840.547213(M8 vs. M8a), 21.089468, d.f. = 1, p = 0.0000Not allowed

Null models (M1a, M7, and M8a) are compared with their respective alternative models (M2a, M8) that allow ω > 1. Proportion of positively selected sites and their corresponding ω-values in M2a and M8 models are in bold. The posterior probability of each positively selected amino acid site is in parenthesis. Posterior probabilities are estimated based on Bayes Empirical bayes analyses (Yang et al., 2005).

Test for variable selection pressures on different codons based on ML-based codon substitution models of Yang et al. (2000) Null models (M1a, M7, and M8a) are compared with their respective alternative models (M2a, M8) that allow ω > 1. Proportion of positively selected sites and their corresponding ω-values in M2a and M8 models are in bold. The posterior probability of each positively selected amino acid site is in parenthesis. Posterior probabilities are estimated based on Bayes Empirical bayes analyses (Yang et al., 2005).
  48 in total

1.  Real-time reverse transcriptase PCR assay for detection of human metapneumoviruses from all known genetic lineages.

Authors:  Jeroen Maertzdorf; Chiaoyin K Wang; Jennifer B Brown; Joseph D Quinto; Marla Chu; Miranda de Graaf; Bernadette G van den Hoogen; Richard Spaete; Albert D M E Osterhaus; Ron A M Fouchier
Journal:  J Clin Microbiol       Date:  2004-03       Impact factor: 5.948

2.  Avian influenza virus exhibits rapid evolutionary dynamics.

Authors:  Rubing Chen; Edward C Holmes
Journal:  Mol Biol Evol       Date:  2006-08-31       Impact factor: 16.240

3.  Bayes empirical bayes inference of amino acid sites under positive selection.

Authors:  Ziheng Yang; Wendy S W Wong; Rasmus Nielsen
Journal:  Mol Biol Evol       Date:  2005-02-02       Impact factor: 16.240

4.  Respiratory tract infection due to human metapneumovirus among elderly patients.

Authors:  Bernadette G van den Hoogen
Journal:  Clin Infect Dis       Date:  2007-03-28       Impact factor: 9.079

5.  Increased positive selection pressure in persistent (SSPE) versus acute measles virus infections.

Authors:  Christopher H Woelk; Oliver G Pybus; Li Jin; David W G Brown; Edward C Holmes
Journal:  J Gen Virol       Date:  2002-06       Impact factor: 3.891

Review 6.  Epidemiology of human metapneumovirus.

Authors:  Jeffrey S Kahn
Journal:  Clin Microbiol Rev       Date:  2006-07       Impact factor: 26.132

7.  Development and validation of an enzyme-linked immunosorbent assay for human metapneumovirus serology based on a recombinant viral protein.

Authors:  Marie-Eve Hamelin; Guy Boivin
Journal:  Clin Diagn Lab Immunol       Date:  2005-02

8.  The two major human metapneumovirus genetic lineages are highly related antigenically, and the fusion (F) protein is a major contributor to this antigenic relatedness.

Authors:  Mario H Skiadopoulos; Stéphane Biacchesi; Ursula J Buchholz; Jeffrey M Riggs; Sonja R Surman; Emerito Amaro-Carambot; Josephine M McAuliffe; William R Elkins; Marisa St Claire; Peter L Collins; Brian R Murphy
Journal:  J Virol       Date:  2004-07       Impact factor: 5.103

9.  Human metapneumovirus infection in the Canadian population.

Authors:  Nathalie Bastien; Diane Ward; Paul Van Caeseele; Ken Brandt; Spencer H S Lee; Gail McNabb; Brian Klisko; Edward Chan; Yan Li
Journal:  J Clin Microbiol       Date:  2003-10       Impact factor: 5.948

10.  Human metapneumovirus genetic variability, South Africa.

Authors:  Herbert P Ludewick; Yacine Abed; Nadia van Niekerk; Guy Boivin; Keith P Klugman; Shabir A Madhi
Journal:  Emerg Infect Dis       Date:  2005-07       Impact factor: 6.883

View more
  10 in total

1.  Population dynamics and rates of molecular evolution of a recently emerged paramyxovirus, avian metapneumovirus subtype C.

Authors:  Abinash Padhi; Mary Poss
Journal:  J Virol       Date:  2008-12-03       Impact factor: 5.103

2.  Human metapneumovirus G protein is highly conserved within but not between genetic lineages.

Authors:  Chin-Fen Yang; Chiaoyin K Wang; Sharon J Tollefson; Linda D Lintao; Alexis Liem; Marla Chu; John V Williams
Journal:  Arch Virol       Date:  2013-02-06       Impact factor: 2.574

3.  Molecular epidemiology and evolution of human respiratory syncytial virus and human metapneumovirus.

Authors:  Eleanor R Gaunt; Rogier R Jansen; Yong Poovorawan; Kate E Templeton; Geoffrey L Toms; Peter Simmonds
Journal:  PLoS One       Date:  2011-03-01       Impact factor: 3.240

4.  Evolutionary dynamics analysis of human metapneumovirus subtype A2: genetic evidence for its dominant epidemic.

Authors:  Jianguo Li; Lili Ren; Li Guo; Zichun Xiang; Gláucia Paranhos-Baccalà; Guy Vernet; Jianwei Wang
Journal:  PLoS One       Date:  2012-03-30       Impact factor: 3.240

5.  Cell tropism predicts long-term nucleotide substitution rates of mammalian RNA viruses.

Authors:  Allison L Hicks; Siobain Duffy
Journal:  PLoS Pathog       Date:  2014-01-09       Impact factor: 6.823

6.  Experiments Investigating the Competitive Growth Advantage of Two Different Genotypes of Human Metapneumovirus: Implications for the Alternation of Genotype Prevalence.

Authors:  Zhen Zhou; Pan Zhang; Yuxia Cui; Yongbo Zhang; Xian Qin; Rongpei Li; Ping Liu; Ying Dou; Lijia Wang; Yao Zhao
Journal:  Sci Rep       Date:  2020-02-18       Impact factor: 4.379

7.  Role of human metapneumovirus glycoprotein G in modulation of immune responses.

Authors:  Thangam Sudha Velayutham; Teodora Ivanciuc; Roberto P Garofalo; Antonella Casola
Journal:  Front Immunol       Date:  2022-07-25       Impact factor: 8.786

8.  Genetic diversity and evolution of human metapneumovirus fusion protein over twenty years.

Authors:  Chin-Fen Yang; Chiaoyin K Wang; Sharon J Tollefson; Rohith Piyaratna; Linda D Lintao; Marla Chu; Alexis Liem; Mary Mark; Richard R Spaete; James E Crowe; John V Williams
Journal:  Virol J       Date:  2009-09-09       Impact factor: 4.099

9.  Evaluating the adaptive potential of the European eel: is the immunogenetic status recovering?

Authors:  Miguel Baltazar-Soares; Seraina E Bracamonte; Till Bayer; Frédéric J J Chain; Reinhold Hanel; Chris Harrod; Christophe Eizaguirre
Journal:  PeerJ       Date:  2016-04-11       Impact factor: 2.984

Review 10.  Constraints on the Genetic and Antigenic Variability of Measles Virus.

Authors:  Shannon M Beaty; Benhur Lee
Journal:  Viruses       Date:  2016-04-21       Impact factor: 5.048

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.