| Literature DB >> 27384324 |
Miho Kobayashi1, Yuki Matsushima2, Takumi Motoya3, Naomi Sakon4, Naoki Shigemoto5, Reiko Okamoto-Nakagawa6, Koichi Nishimura7, Yasutaka Yamashita8, Makoto Kuroda9, Nobuhiro Saruki1, Akihide Ryo10, Takeshi Saraya11, Yukio Morita12, Komei Shirabe6, Mariko Ishikawa2, Tomoko Takahashi13, Hiroto Shinomiya8, Nobuhiko Okabe2, Koo Nagasawa14, Yoshiyuki Suzuki15, Kazuhiko Katayama16, Hirokazu Kimura10,14.
Abstract
Capsid protein of norovirus genogroup II (GII) plays crucial roles in host infection. Although studies on capsid gene evolution have been conducted for a few genotypes of norovirus, the molecular evolution of norovirus GII is not well understood. Here we report the molecular evolution of all GII genotypes, using various bioinformatics techniques. The time-scaled phylogenetic tree showed that the present GII strains diverged from GIV around 1630CE at a high evolutionary rate (around 10(-3) substitutions/site/year), resulting in three lineages. The GII capsid gene had large pairwise distances (maximum > 0.39). The effective population sizes of the present GII strains were large (>10(2)) for about 400 years. Positive (20) and negative (over 450) selection sites were estimated. Moreover, some linear and conformational B-cell epitopes were found in the deduced GII capsid protein. These results suggested that norovirus GII strains rapidly evolved with high divergence and adaptation to humans.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27384324 PMCID: PMC4935990 DOI: 10.1038/srep29400
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Phylogenetic tree of the capsid gene on NoV constructed by the Bayesian MCMC method.
203 strains of human GII, three strains of swine GII, nine strains of GI, one strain of GIII, and three strains of GIV were included in this tree. Grey bars show 95% HPDs. The scale bar represents actual time (year). The time of the most recent common ancestor of this tree was around 854 CE. GII strains were divided from GIV around 1630 CE. NoV GII was formed three lineages.
Figure 2Bayesian skyline plots of all NoV GII (a) GII.2 (b) GII.3 (c) GII.4 (d) and GII.6 (e). The x-axis represents actual time (years) and starts at mean tree model root height. The y-axis represents the effective population size. Mean effective population size is shown as a black line. HPDs of 95% are shown as grey lines.
Positive selection sites on capsid gene in human NoV GII.
| Amino acid change | SLAC | FEL | IFEL | MEME |
|---|---|---|---|---|
| Ser6AsnAsn6Ser, Lys, Ile | ○ | ○ | ○ | ○ |
| Asn9Thr, SerThr9AlaAla9ThrSer9AsnAsn9Thr,Lys,SerAla9Val, Thr | ○ | ○ | ||
| Thr16Ala, SerAla16Ser, Thr | ○ | |||
| Val23Ile, AlaIle23Val, Ser, AlaAla23Gly | ○ | |||
| Asn25Ser, Thr, His, Gln,MixSer25Asn | ○ | |||
| Glu64Mix, His | ○ | |||
| Cys268Ser, Ala, ValVal268Cys, AlaSer268Thr | ○ | |||
| Asp297His, Asn, Ser, Gly, Val, GluHis297Pro, Gln, AspPro297SerGln297HisAsn297Ile, SerGly297Ser, Pro, Arg, AlaSer297Asn, Ala | ○ | |||
| Gly298Asp, Arg, Ala, Ile, Gln, Asn, LysAsp298Gly, Asn, GluArg298Ser | ○ | |||
| Ala303Val, Ile, ThrThr303Val | ○ | |||
| Asp359AlaThr359SerAla359Ser, ValSer359Asn, GlyPro359ThrSer359Asn | ○ | ○ | ||
| Ala360Thr, Ser | ○ | |||
| Gly370Ala, Ser, MixAla370Ser, Gly | ○ | |||
| Ser379Thr, Asp, Ala, Gly, Asn, ProAsp379AsnGly379Ser, AspAla379SerAsn379AspThr379Ser, Ala | ○ | |||
| Asn397Ser, Asp, Glu, Gly, Thr, GlnSer397Arg, AspGly397Ser, Asp,Asp397Glu, AsnHis397ArgThr397ProGln397Asp | ○ | |||
| Gly416Asp, Ala, SerAsp416Gly, Ser, Asn, GluGlu416AspHis416Asn, Gln, ArgAsn416Arg, Thr, AspThr416Pro, AlaSer416Thr | ○ | |||
| Asp416Asn, Gln, Gly, Ser, Glu, AlaAsn416Asp, Ser, GlySer416AlaGly416Ser | ○ | |||
| Ala419ThrThr419Asn, AlaAsn419Asp, AlaAsp419GlyThr419AlaAsp419Pro | ○ | ○ | ○ | ○ |
| Arg435Thr, HisThr435Pro, ValPro435His, SerHis435Ala, Arg, GlnAla435Arg, Ser, His, ValGln435Pro | ○ | |||
| Trp485Phe | ○ |
mean dN/dS = 0.106 (95% CI = 0.103–0.109). Cut off p-value < 0.05.
Figure 3Location of positive selection sites on predicted structure of capsid protein in GII.4/Bristol/1993/UK.
To construct the model, we used five suitable templates of NoV capsid sequences (PDB ID: 1IHM, 3ONU, 4RLZ, 3PUM, and 4X07). Twenty positively selected sites on chains A and B are colored purple and orange, respectively. The HBGA binding sites45 are colored blue and pink. These sites were located within the surface of the protein.
Predicted linear B-cell epitopes of standard strains for each genotype
| Genotype | Strain (Accession No.) | Position | Predicted epitopes |
|---|---|---|---|
| GII.1 | Hawaii virus/1971/US (U07611) | 305–326 | VTNTNGTPF |
| 357–366 | PKFTPKLGSV | ||
| GII.2 | Melksham/1994/UK (X81879) | 4–15 | AS |
| 313–326 | FD | ||
| 359–373 | VPTY | ||
| 531–541 | PMGTGNGRRRV | ||
| GII.3 | Toronto24/1991/CA (U02030) | 59–68 | APGG |
| 294–307 | T | ||
| 325–338 | Y | ||
| 387–400 | FDPNQPTKFTPVGV | ||
| GII.4 | Bristol/1993/UK (X76716) | 64–74 | FTVSPRNAPGE |
| 125–135 | PPNFPTEGLSP | ||
| 251–263 | TGPSSAFVVQPQN | ||
| 309–326 | SNY | ||
| 436–445 | TMPGCSGYPN | ||
| GII.5 | Hillingdon/1990/UK (AJ277607) | 64–73 | FTVSPKNSPG |
| 213–222 | TYLVPPTVES | ||
| 313–327 | F | ||
| 337–351 | SQRNRGESNPANR | ||
| 374–385 | WNTNDVENQPTK | ||
| 439–448 | PLKGGFGNPA | ||
| GII.7 | Leeds/1990/UK (AJ277608) | 306–328 | ITNTDGTPI |
| 338–349 | SQRNKNEQNPAT | ||
| 358–368 | TGGDQYAPKLA | ||
| 390–401 | VGVAGD | ||
| GII.8 | Amsterdam/98-18/1998/NET (AF195848) | 59–72 | APAG |
| 308–327 | NLDGSPV | ||
| 369–383 | FKSPSTDFSDNEPIK | ||
| GII.9 | VA97207/1997/USA (AY038599) | 4–15 | AS |
| 59–72 | APAG | ||
| 308–326 | LDGSPI | ||
| 336–380 | ASQRGPGDATR | ||
| GII.10 | Erfurt/546/2000/DE (AF427118) | 4–15 | AS |
| 204–223 | TRPTPDFDFTYLVPPTVESK | ||
| 295–304 | QDEHRG | ||
| 310–329 | LNGTPF | ||
| 340–353 | QRNTNT | ||
| 384–396 | |||
| GII.13 | Fayetteville/1998/US (AY113106) | 217–230 | PPSVESKTKPFTLP |
| 250–263 | YTAPNETNVVQCQN | ||
| 308–325 | PNGASY | ||
| GII.14 | M7/1999/US (AY130761) | 307–325 | LDGSPI |
| 363-385 | IGQVRFKSSS | ||
| 455–466 | EHFYQEAAPSQS | ||
| GII.15 | J23/1999/US (AY130762) | 20–30 | VPE |
| 316–336 | EPDGEEF | ||
| 349–359 | NTGGAGQNSNR | ||
| 427–440 | AGKLAPPVAPNYPG | ||
| GII.16 | Tiffin/1999/USA (AY502010) | 310–325 | GTPF |
| 338–349 | QRDTGTNPANR | ||
| 359–378 | AKYTPKLGSVQIGTWD | ||
| 380–389 | ERQPVKFTPV | ||
| 434–447 | FRSYIPLKGGHGDP | ||
| GII.17 | CS-E1/2002/USA (AY502009) | 7–17 | DA |
| 314–328 | F | ||
| 341–351 | NVGSNPNTTR | ||
| 365–379 | PKLGSVNFGSTSTDF | ||
| 420–432 | PPIAPNFPGEQLL | ||
| GII.20 | Luckenwalde591/2002/DE (EU373815) | 59-69 | APGG |
| 125–135 | PPNFPPENLSP | ||
| 308–323 | NGSAY | ||
| 337–346 | QRSPNNSTR | ||
| 350–361 | TLNTGSPRYTPK | ||
| GII.21 | IF1998/2003/IR (AY675554) | 2–12 | AS |
| 211–222 | TYLVPPSVESKT | ||
| 248–261 | YTSPNADVVVQPQN | ||
| 310–323 | TY | ||
| 335–348 | TQNPRASGDEAAN | ||
| 374–384 | GHHSQHQQSKF | ||
| 457–468 | HFYQESAPSQSD | ||
| GII.22 | YURI2002/JP (AB083780) | 159–172 | PDVRNQFFHYNQVN |
| 217–226 | PPTVESRTKP | ||
| 315–328 | |||
| 341–369 | NDYN | ||
| 441–450 | LKGGHGNPAI |
Linear epitopes of GII.6 and GII.12 could not be predicted. The positions of the amino acids correspond to each strain. Common epitopes sequences are shown in the bold letters. Positive selection sites are shown in underlined text.
Figure 4Predicted linear B-cell epitopes mapping on the capsid protein of GII. 4.
The predicted structure of capsid protein is the same as in Fig. 3. Linear B-cell epitopes on chain A and B are shown in green and blue, respectively. Common locations among all genotypes are represented by deeper tones. These sites consist of 11 amino acids (DPTXXXPAPXG or similar sequence to this).
Figure 5Predicted conformational B-cell epitopes mapping on the capsid protein of GII. 4.
The predicted structure of capsid protein is the same as in Fig. 3. These sites on chain A and B are shown in green and blue, respectively. Most of conformational epitopes were located in the P1 and P2 domains.