| Literature DB >> 33167568 |
Joshua O Amimo1,2,3, Eunice M Machuka3, Edward O Abworo3, Anastasia N Vlasova2, Roger Pelle3.
Abstract
Astroviruses (AstVs) are widely distributed and are associated with gastroenteritis in human and animals. The knowledge of the genetic diversity and epidemiology of AstVs in Africa is limited. This study aimed to characterize astroviruses in asymptomatic smallholder piglets in Kenya and Uganda. Twenty-four samples were randomly selected from a total of 446 piglets aged below 6 months that were initially collected for rotavirus study and sequenced for whole genome analysis. Thirteen (13/24) samples had contigs with high identity to genus Mamastrovirus. Analysis of seven strains with complete (or near complete) AstV genome revealed variable nucleotide and amino acid sequence identities with known porcine astrovirus (PoAstV) strains. The U083 and K321 strains had nucleotide sequence identities ranging from 66.4 to 75.4% with the known PoAstV2 strains; U460 strain had nucleotide sequence identities of 57.0 to 65.1% regarding the known PoAstV3; and K062, K366, K451, and K456 strains had nucleotide sequence identities of 63.5 to 80% with the known PoAstV4 strains. The low sequence identities (<90%) indicate that novel genotypes of PoAstVs are circulating in the study area. Recombination analysis using whole genomes revealed evidence of multiple recombination events in PoAstV4, suggesting that recombination might have contributed to the observed genetic diversity. Linear antigen epitope prediction and a comparative analysis of capsid protein of our field strains identified potential candidate epitopes that could help in the design of immuno-diagnostic tools and a subunit vaccine. These findings provide new insights into the molecular epidemiology of porcine astroviruses in East Africa.Entities:
Keywords: East Africa; glycosylation; linear antigenic epitopes; porcine astroviruses; recombination; whole genome sequences
Year: 2020 PMID: 33167568 PMCID: PMC7694451 DOI: 10.3390/v12111262
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Comparison of the genomic organization of porcine astroviruses circulating in smallholder swine farms in East Africa.
| Genotype | Strains | Accession No. | 5′UTR | ORF1a | ORF1ab | ORF2 | 3′UTR | Total Length | Source |
|---|---|---|---|---|---|---|---|---|---|
| PoAstV2 | U083 | KY940077 | 18 | 2475 | 4053 | 2325 | 162 | 6434 | Samia, Uganda |
| K321 | KY940076 | 27 | 2475 | 4056 | 2328 | 63 | 6347 | Budalangi, Kenya | |
| PoAstV3 | U460 (Partial ORF1a) | KY933399 | - | 1616 | 3208 | 2148 | - | 5281 | Budama, Uganda |
| PoAstV4 | K456 | KY933398 | 85 | 2550 | 3995 | 2511 | 67 | 6649 | Funyula, Kenya |
| K451 | KY940075 | - | 2602 | 4138 | 2541 | 55 | 6634 | Funyula, Kenya | |
| K366 | MT451917 | 87 | 2550 | 3995 | 2469 | 76 | 6618 | Funyula, Kenya | |
| K062 | MT451918 | 87 | 2550 | 3995 | 2481 | 75 | 6629 | Amukura, Kenya |
(- = region missing).
Figure 1Phylogenetic analysis based on the complete (nearly complete, U460) nucleotide sequences of the full-length genome of East African PoAstVs (bold) and representative AstVs of humans and other species in the GenBank. Multiple sequence alignments were performed using the ClustalO program. The evolutionary history was inferred by using the maximum likelihood method based on the general time reversible model in MEGA X [33]. The scale bar is given in numbers of substitutions per site. Phylogeny was inferred following 1000 bootstrap replications, and the node values show percentage bootstrap support.
Figure 2Phylogenetic tree based on the nucleotide (a) and amino acid (b) sequences of the capsid proteins (ORF2) of the East African astrovirus field strains and the known astroviruses in the GenBank. Multiple sequence alignments were performed using the ClustalO program. The evolutionary history was inferred by using the maximum likelihood method on the GTR model for nucleotide and neighbor-joining method using the p-distance substitution model for amino acid sequences in MEGA X [33]. The scale bar is given in numbers of substitutions per site. Phylogeny was inferred following 1000 bootstrap replications, and the node values show percentage bootstrap support.
Figure 3Recombination analysis of newly discovered full-length astrovirus genomes: (a) Predicted recombination event has likely occurred in the ORF1a–ORF1b junction and covers nearly the entire RdRp region. K062 strain is recombinant. (b) Predicted recombination event has likely occurred in RdRp–ORF2 overlap and covers almost the entire capsid region (ORF2); K366 is the recombinant. (c) Predicted recombination event has occurred in the 3′ end of the capsid region ORF2; K456 strain is recombinant.
Summary of p-values of different recombination methods using the step-down correction for multiple comparisons between our field strains and a p-value cutoff of 0.01 in the Recombination Detection Program (RDP4, Version 4.94).
| Method | Event 1 in K062 | Event 2 in K366 | Event 3 in K456 |
|---|---|---|---|
| RDP | 2.811 × 10−13 | 1.522 × 10−13 | 2.744 × 10−16 |
| GENECONV | 2.747 × 10−17 | 1.099 × 10−2 | 1.816 × 10−3 |
| BootScan | 2.199 × 10−15 | 4.858 × 10−5 | 1.518 × 10−16 |
| MaxChi | 2.377 × 10−21 | 9.780 × 10−13 | 4.161 × 10−5 |
| Chimaera | 2.805 × 10−13 | 2.604 × 10−12 | 5.516 × 10−9 |
| Siscan | 4.626 × 10−2 | 1.434 × 10−7 | 2.183 × 10−13 |
| Phylpro | 4.612 × 10−9 | 1.165 × 10−14 | 2.642 × 10−13 |
| LARD | 1.219 × 10−71 | 1.267 × 10−76 | 2.947 × 10−22 |
| 3Seq | 4.612 × 10−5 | 1.165 × 10−14 | 2.642 × 10−12 |
Predicted antigenic epitopes within capsid proteins (ORF2) of our field strains using three different pieces of software, and antigenicity of the predicted epitopes determined.
| Strains | Amino Acid Position 126–161 | Amino Acid Position 219–241 | Amino Acid Position 332–363 | ||||||
|---|---|---|---|---|---|---|---|---|---|
| U083 | FKMTKCELVLKPLVGDSAVSGTVV | IGKTMSTYQSRAFEGGLFLAELTT | |||||||
| K321 | YKMTRCVVTLKPIVGDSAVAGTVT | CHTFGKTTSTYRNEPFKGGLFLAE | VK | ||||||
| U460 | WRLTNLK | MIEIHGLGKTSSTYKDEPWVGDLF | PFQWLIKG | ||||||
| K456 | WRVQYLD | TLGKTMSTYKSDIFDGPLFLAEVT | QWLIKA | ||||||
| K451 | WRVDNIL | TLGQTMSTYQAKVFTGPLFLCEMT | LFQA | ||||||
| K366 | WRVKNMI | TYGKTVSTYRNDPFTGPLFLAELT | LFKA | ||||||
| K062 | WRARDII | TLGKTLSTYKNEDFTGPLFLAELT | LFKA | ||||||
|
|
|
|
|
| |||||
|
|
|
|
|
|
|
|
| ||
| U083 | PoAstV2 | 127–146 | 1.000 | 129–151 | 1.215 | 136–148 | 0.804 | 126–158 | 0.6357 |
| K321 | 130–149 | 0.777 | 130–153 | 1.200 | 139–151 | 0.810 | 129–161 | 0.7207 | |
| U460 | PoAstV3 | - | - | 135–161 | 1.150 | 142–154 | 0.809 | 132–164 | 1.0511 |
| K456 | PoAstV4 | 127–146 | 1.000 | 125–159 | 1.150 | 135–149 | 0.765 | 127–159 | 1.1882 |
| K451 | 128–147 | 0.994 | 129–154 | 1.150 | 135–149 | 0.765 | 127–159 | 0.7063 | |
| K366 | 128–147 | 0.836 | 134–156 | 1.150 | 135–149 |
| 127–159 | 0.7977 | |
| Ko62 | 128–147 | 0.836 | 130–147 | 1.175 | 135–149 | 0.767 | 127–159 | 0.9112 | |
* Antigenicity of the predicted epitope was analyzed by VaxiJen Software with threshold of 0.4 (values above threshold are probable antigens); a bold (red) amino acid sequence represents a motif conserved in each PoAstV type for positions 126–161 and 332–363, with high antigenicity (>1).
Figure 4Capsid protein structures and conformational B-cell epitopes of PoAstV protein predicted by Ellipro from 3D structure template. Yellow balls are the residues of predicted peptides and white sticks are the non-epitope residues of protein. Each epitope is predicted with residue number and position mentioned.
Glycosylation analysis of the capsid proteins (ORF2) of astrovirus field strains using NetNGlyc software.
| Name | Position | Sequence | Potential † | Jury Agreement | N Glycosylated | Name | Position | Sequence | Potential † | Jury Agreement | N Glycosylated |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 12 | NTTN | 0.734 | (9/9) | ++ |
| 160 | NLTQ | 0.750 | (9/9) | +++ |
| 20 | NGSS | 0.513 | (5/9) | + | 306 | NATT | 0.597 | (6/9) | + | ||
| 55 | NKTV | 0.764 | (9/9) | +++ | 625 | NYTF | 0.646 | (8/9) | + | ||
| 86 | NGSE | 0.691 | (9/9) | ++ |
| 274 | NATP | 0.110 | (9/9) | --- | |
| 154 | N | 0.697 | (9/9) | ++* | 398 | NITQ | 0.681 | (9/9) | ++ | ||
| 297 | NKTI | 0.752 | (9/9) | +++ | 573 | NYTM | 0.731 | (9/9) | ++ | ||
| 439 | NYTT | 0.648 | (9/9) | ++ | 658 | NTTP | 0.110 | (9/9) | --- | ||
| 542 | NGTG | 0.735 | (9/9) | ++ |
| 251 | N | 0.271 | (8/9) | -- | |
| 557 | NRTN | 0.618 | (7/9) | + | 391 | NITG | 0.631 | (9/9) | ++ | ||
| 611 | NNTM | 0.406 | (8/9) | - | 521 | NPTL | 0.643 | (8/9) | +* | ||
|
| 13 | NTTN | 0.746 | (9/9) | ++ | 601 | NGTL | 0.698 | (9/9) | ++ | |
| 21 | NGSS | 0.505 | (6/9) | + | 655 | NLTA | 0.644 | (9/9) | ++ | ||
| 41 | NRTR | 0.749 | (9/9) | ++ |
| 123 | NYSL | 0.738 | (9/9) | ++ | |
| 56 | NQSQ | 0.560 | (6/9) | + | 155 | NLSG | 0.670 | (9/9) | ++ | ||
| 80 | NTTL | 0.625 | (9/9) | ++ | 287 | NGSL | 0.573 | (7/9) | + | ||
| 89 | NESG | 0.557 | (6/9) | + | 345 | NKSQ | 0.677 | (8/9) | + | ||
| 157 | N | 0.714 | (9/9) | ++* | 502 | NYTP | 0.197 | (9/9) | --- | ||
| 412 | NPTR | 0.729 | (9/9) | ++* | 559 | N | 0.580 | (8/9) | +* | ||
| 457 | NGTK | 0.685 | (9/9) | ++ | 565 | NFTQ | 0.584 | (7/9) | + | ||
| 493 | NNTT | 0.547 | (6/9) | + |
| 123 | NYSL | 0.709 | (9/9) | ++ | |
| 494 | NTTA | 0.609 | (7/9) | + | 155 | NLSA | 0.627 | (8/9) | + | ||
| 511 | NESP | 0.129 | (9/9) | --- | 287 | NSSS | 0.497 | (4/9) | - | ||
| 560 | NNSN | 0.377 | (9/9) | -- | 615 | NQTV | 0.607 | (7/9) | + |
† = Any potential crossing the default threshold of 0.5 represents a predicted glycosylated site; + = N glycosylated; - = a negative site; * = proline occurs just after the asparagine residue, so is unlikely to be glycosylated; the jury agreement column indicates how many of the nine networks support the prediction; N = asparagine; S = serine; T = threonine. For picking up N-glycosylation sites with high specificity (asparagine residues very likely to be glycosylated), use only (++) predictions (and better) for asparagines that occur within the Asn–X–Ser/Thr triplet (no Proline at the X position).