| Literature DB >> 26130388 |
Hirokazu Kimura1,2, Mika Saitoh3, Miho Kobayashi3, Haruyuki Ishii4, Takeshi Saraya4, Daisuke Kurai4, Hiroyuki Tsukagoshi3, Komei Shirabe5, Atsuyoshi Nishina6, Kunihisa Kozawa3, Makoto Kuroda7, Fumihiko Takeuchi7, Tsuyoshi Sekizuka7, Hisanori Minakami8, Akihide Ryo2, Makoto Takeda9.
Abstract
We studied the molecular evolution of the haemagglutinin (H) gene (full length) in all genotypes (24 genotypes, 297 strains) of measles virus (MeV). The gene's evolutionary timescale was estimated by the Bayesian Markov chain Monte Carlo (MCMC) method. We also analysed positive selection sites. The MCMC tree indicated that the MeV H gene diverged from the rinderpest virus (same genus) about 250 years ago and that 24 MeV genotypes formed 3 lineages dating back to a 1915 ancestor (95% highest posterior density [HPD] 1882-1941) with relatively rapid evolution (mean rate: 9.02 × 10(-4) substitutions/site/year). The 3 lineages diverged in 1915 (lineage 1, 95% HPD 1882-1941), 1954 (lineage 2, 95% HPD 1937-1969), and 1940 (lineage 3, 95% HPD 1927-1952). These 24 genotypes may have diverged and emerged between the 1940s and 1990 s. Selective pressure analysis identified many negative selection sites on the H protein but only a few positive selection sites, suggesting strongly operated structural and/or functional constraint of changes on the H protein. Based on the molecular evolution of H gene, an ancestor MeV of the 24 genotypes emerged about 100 years ago and the structure of H protein has been well conserved.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26130388 PMCID: PMC4486977 DOI: 10.1038/srep11648
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Phylogenetic tree of the haemagglutinin (H) gene constructed by the Bayesian Markov chain Monte Carlo method.
The Markov chain Monte Carlo tree was based on the full nucleotide sequence of the H gene (1854 nt) visualised in FigTree. Grey bars indicate 95% highest posterior density for the estimated year. The tree was estimated using an uncorrelated exponential relaxed clock model under an exponential growth model. The scale bar represents the unit of time (year). The reference strains of each genotype are shown in the tree. Common positively selected sites (aa 575) are indicated by a bold line.
Year of divergence of each genotype.
| Genotypes | Year of divergence (95% HPD) | |
|---|---|---|
| 1 | A, B1, B2, B3, C1, C2, E, F | 1915 (1882–1941) |
| 2 | G1, G2, G3, H1, H2 | 1954 (1937–1969) |
| 3 | D1, D2, D3, D4, D5, D6, D7, D8, D9, D10, D11 | 1940 (1927–1952) |
| A | 1948 (1941–1952) | |
| B1 | 1974 (1966–1980) | |
| B2 | 1980 (1976–1982) | |
| B3 | 1979 (1972–1984) | |
| C1 | 1942 (1930–1950) | |
| C2 | 1969 (1959–1975) | |
| D1 | 1951 (1943–1957) | |
| D2 | 1979 (1971–1985) | |
| D3 | 1977 (1972–1982) | |
| D4 | 1974 (1968–1980) | |
| D5 | 1978 (1971–1985) | |
| D6 | 1973 (1957–1984) | |
| D7 | 1966 (1960–1971) | |
| D8 | 1983 (1975–1989) | |
| D9 | 1988 (1979–1994) | |
| D10 | 1992 (1983–1997) | |
| D11 | 1976 (1965–1986) | |
| E | 1955 (1937–1965) | |
| F | 1943 (1932–1950) | |
| G1 | 1965 (1951–1976) | |
| G2 | 1994 (1991–1995) | |
| G3 | 1993 (1990–1996) | |
| H1 | 1978 (1971–1984) | |
| H2 | 1986 (1977–1991) | |
HPD, highest posterior density.
Figure 2Phylogenetic tree of H gene constructed by the maximum likelihood method.
Labels at the branch nodes show at least 70% bootstrap support. The scale bar shows nucleotide substitutions per site.
Positive selection sites in the H gene.
Negative selection sites in the H gene.
| Method | Negative selection sites |
|---|---|
| SLAC | 181 sites |
| FEL | 242 sites |
| IFEL | 157 sites |
Cut-off p-value <0.05
Figure 3Mapping of the positive selection sites on the H protein structure.
An H protein homotetramer determined by Hashiguchi et al.11 (PDB3ALZ) is shown. (Left) Top view; (Right) bottom view. The individual H proteins in the tetramer are shown in grey, light grey, purple, and light purple, and SLAMs are shown in cyan. Amino acid residues known to constitute a portion of an epitope are shown in blue, green, light green, yellow, orange, and red. The glutamine residues at position 575 (Q575) are shown in magenta.
Figure 4Bayesian skyline plot analysis of the measles virus H gene.
The Y-axis shows the effective population size and the X-axis shows generation time (year). Mean effective population size of major genotypes including B3, D4, D8, D9, and H1 are indicated by a solid line. Mean effective population size of minor genotypes such as B1, C1, E, and F are indicated by a dotted line.