| Literature DB >> 26338545 |
Miho Kobayashi1, Shima Yoshizumi2, Sayaka Kogawa3, Tomoko Takahashi4, Yo Ueki5, Michiyo Shinohara6, Fuminori Mizukoshi7, Hiroyuki Tsukagoshi1, Yoshiko Sasaki1, Rieko Suzuki8, Hideaki Shimizu9, Akira Iwakiri10, Nobuhiko Okabe9, Komei Shirabe11, Hiroto Shinomiya12, Kunihisa Kozawa1, Hideki Kusunoki13, Akihide Ryo14, Makoto Kuroda15, Kazuhiko Katayama16,6, Hirokazu Kimura14,17.
Abstract
We studied the molecular evolution of the capsid gene in all genotypes (genotypes 1-9) of human norovirus (NoV) genogroup I. The evolutionary time scale and rate were estimated by the Bayesian Markov chain Monte Carlo (MCMC) method. We also performed selective pressure analysis and B-cell linear epitope prediction in the deduced NoV GI capsid protein. Furthermore, we analysed the effective population size of the virus using Bayesian skyline plot (BSP) analysis. A phylogenetic tree by MCMC showed that NoV GI diverged from the common ancestor of NoV GII, GIII, and GIV approximately 2,800 years ago with rapid evolution (about 10(-3) substitutions/site/year). Some positive selection sites and over 400 negative selection sites were estimated in the deduced capsid protein. Many epitopes were estimated in the deduced virus capsid proteins. An epitope of GI.1 may be associated with histo-blood group antigen binding sites (Ser377, Pro378, and Ser380). Moreover, BSP suggested that the adaptation of NoV GI strains to humans was affected by natural selection. The results suggested that NoV GI strains evolved rapidly and date back to many years ago. Additionally, the virus may have undergone locally affected natural selection in the host resulting in its adaptation to humans.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26338545 PMCID: PMC4559769 DOI: 10.1038/srep13806
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Phylogenetic tree of ORF2 constructed by the Bayesian Markov Chain Monte Carlo method.
The phylogenetic tree was based on the whole nucleotide sequence of ORF2 (1593 nt corresponding to GI.1/Norwalk/1968/US). We analysed 65 strains of GI, 6 strains of GII, 1 strain of GIII, and 3 strains of GIV. Each node represents mean root height. The scale bar represents the unit of time (years). The grey bars indicate the 95% HPDs for the estimated year. The reference strains of each genotype are indicated by solid circles.
Positive selection sites in ORF2 of NoV GI.
| aa change | SLAC | FEL | IFEL | MEME |
|---|---|---|---|---|
| Met2Lys | ○ | |||
| Asp7Gly | ○ | |||
| Pro10Thr, Gln, Thr, SerThr10SerSer10Pro, Thr | ○ | ○ | ○ | |
| Ser16Thr, Asn | ○ | |||
| Ala18Arg | ○ | |||
| Gly19Arg | ○ | |||
| Ile354Val | ○ | |||
| Asn358Asp, LeuAsp358Ser, AsnSer358Asp | ○ | |||
| Val359Ile, MetIle359Val | ○ | |||
| Ala367Val | ○ | |||
| Ile390Thr, Leu, Val, Met | ○ | |||
| Lei393Ile | ○ | |||
| Ser397Cys, Leu | ○ | |||
| Ser400Thr, ArgThr400Ser, Ala, Glu | ○ | |||
| Asp401Asn, Ser, Gly, IleAsn401Ser | ○ | |||
| Ala402Pro, Ser, Val,Gly, AspSer402Asn | ○ | |||
| Val409Asp, Ile, AlaAsp409Asn | ○ | |||
| Ala545Thr, Arg | ○ | |||
| Ser557LeuLeu557IleVal557Ile | ○ | ○ | ○ |
Cut-off value, p < 0.05.
Negative selection sites in ORF2 of NoV GI.
| SLAC | FEL | IFEL | |
|---|---|---|---|
| No. of negative selection sites | 437 | 469 | 428 |
Cut-off value, p < 0.05.
Predicted epitopes of the reference strains for each genotype.
| Genotype | Strain (GenBank accession No.) | Position | Predicted epitopes |
|---|---|---|---|
| GI.1 | Norwalk/1968/US (M87661) | 32–43 | AMDPVAGSSTAV |
| 316–325 | |||
| 377–388 | |||
| 480–496 | FLTCVPNGASSGPQQLP | ||
| GI.2 | Southampton/1991/UK (L07418) | 33–45 | MEPVAGPTTAVAT |
| 413–423 | |||
| 437–448 | PGPNNRSAPNDV | ||
| 501–511 | NGVGAGPQQLP | ||
| GI.3 | Desert Shield395/1990/UK (U04469) | 314–326 | |
| GI.4 | Chiba 407/1987/JP (AB022679) | 29–43 | DPIPIDPVAGSSTAL |
| 157–167 | PVEVPLEDVRN | ||
| 317–327 | |||
| 386–397 | |||
| 435–445 | IPGPNQSGSPN | ||
| 501–510 | SSSTGPQQLP | ||
| 529–539 | PVGTAGPARGR | ||
| GI.5 | Musgrove/1989/UK (AJ277614) | 9–19 | TPSADGANGAG |
| 29–41 | EPLPLDPVAGAST | ||
| 319–329 | |||
| 436–447 | IPGPNTAHKPNL | ||
| GI.6 | BS5/1997/DE (AF093797) | 312–325 | |
| GI.7 | Winchester/1994/UK (AJ277609) | 28–41 | AEPLPLEPVVGAAT |
| 189–200 | LRAGGASSGTDP | ||
| 314–326 | |||
| 392–402 | |||
| 495–506 | PNTGGGPQNLPT | ||
| GI.8 | Boxer/2001/US (AF538679) | 189–200 | LRSGGASSGTDP |
| 342–353 | |||
| 441–453 | TVSNPKVPCTLPQ | ||
| 499–509 | PNAGGGPQTLP | ||
| GI.9 | Vancouver730/2004/CAN (HQ637267) | 20–33 | QLVPENNNTSEPIN |
| 318–334 | |||
| 351–366 | |||
| 374–385 |
The predicted epitopes in the P2 domain are indicated by bold type and underlined.
Common epitopes are indicated by grey shading.
Figure 2Bayesian skyline plot of ORF2 in NoV GI.
The Bayesian skyline plot was estimated under the GTR-Γ4 model. The MCMC chains were run for 65,000,000 steps. The Y-axis represents the effective population size and the X-axis represents generation time (year). The solid black line represents the mean value over time. The 95% HPD intervals are shown in dotted lines.
Figure 3Distributions of the pairwise distance values of ORF2.
The distributions of the p-distance values based on the nucleotide sequences of NoV GI. A total of 65 strains were analysed.