| Literature DB >> 31078274 |
Douyue Li1, Hongxi Zhang1, Shan Peng1, Saichao Pan1, Zhongyang Tan2.
Abstract
Microsatellites (SSRs) are ubiquitous in coding and non-coding regions of the Ebolavirus genomes. We synthetically analyzed the microsatellites in whole-genome and terminal regions of 219 Ebolavirus genomes from five species. The Ebolavirus sequences were observed with small intraspecies variations and large interspecific variations, especially in the terminal non-coding regions. Only five conserved microsatellites were detected in the complete genomes, and four of them which well base-paired to help forming conserved stem-loop structures mainly appeared in the terminal non-coding regions. These results suggest that the conserved microsatellites may be evolutionary selected to form conserved secondary structures in 5', 3' terminals of Ebolavirus genomes. It may help to understand the biological significance of microsatellites in Ebolavirus and also other virus genomes.Entities:
Keywords: Conserved microsatellite; Conserved stem-loop structure; Ebolavirus; Evolutionary selection
Mesh:
Substances:
Year: 2019 PMID: 31078274 PMCID: PMC7092875 DOI: 10.1016/j.bbrc.2019.04.192
Source DB: PubMed Journal: Biochem Biophys Res Commun ISSN: 0006-291X Impact factor: 3.575
List of 5 species Ebolavirus in complete genomes and 5′, 3′ terminal sequences.
| Species | Abbreviation | Number of sequences | Genome size (nt) | 5′ terminal | 3′ terminal | ||
|---|---|---|---|---|---|---|---|
| Size (nt) | Length | Size (nt) | Length | ||||
| EBOV | 184 | 18990.04 ± 158.91 | 468.89 ± 0.53 | 47.90 ± 0.53 | 739.66 ± 0.63 | 50.62 ± 0.51 | |
| BDBV | 8 | 18939.63 ± 0.74 | 457.00 ± 0.00 | 48.00 ± 0.00 | 741.25 ± 0.46 | 50.13 ± 0.35 | |
| TAFV | 3 | 18935.00 ± 0.00 | 463.00 ± 0.00 | 48.00 ± 0.00 | 737.00 ± 0.00 | 51.00 ± 0.00 | |
| RESTV | 9 | 18890.44 ± 2.40 | 462.78 ± 0.67 | 48.00 ± 0.00 | 703.22 ± 0.44 | 50.25 ± 0.46 | |
| SUDV | 15 | 18874.73 ± 0.44 | 457.00 ± 0.00 | 48.00 ± 0.00 | 707.73 ± 0.44 | 50.64 ± 0.50 | |
The sizes of 184 sequences are averaged and the standard deviation is calculated to indicate the degree of dispersion.
Length for Prediction: regions including conservative microsatellites is intercepted for secondary structure prediction.
Fig. 1Sequence diversity in 5′, 3′ terminal regions and complete genomes of Ebolavirus. (A)Intraspecies variations of complete genomes. (B) Intraspecies variations of 5′ terminal sequences. (C) Intraspecies variations of 3′ terminal sequences. (D) Interspecific variations of complete genomes. (E) Interspecific variations of 5′ terminal sequences. (F) Interspecific variations of 3′ terminal sequences. Species 1 and Species 2 are the distinctions between different sequences for pairwise alignment, respectively. Species 1 includes the sequences of Z1-Z184, B1—B8, R1-R9, and Species 2 includes the sequences of B1—B8, R1-R9, S1—S15.
Fig. 2High conserved microsatellites loci mainly occurred in 5′, 3′ terminal sequences. (A) Total of microsatellites in the complete genomes and 5′, 3′ terminal sequences. (B) Sequences of five conserved microsatellite loci. (C) Conserved rate of complete genomes and terminal sequences. (D) Relative abundance of complete genomes and terminal sequences. * indicates that they have the same base. Blue is used to represent conserved microsatellites, and different bases on highly conserved microsatellites are represented in red. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)
Statistics of conserved microsatellites loci in Ebolavirus genomes.
| Species | Regions | Total of SSRs | Conserved SSRs (%) | Relative abundance | Motif | Rate of sequences (%) |
|---|---|---|---|---|---|---|
| EBOV | Complete genome | 76.66 ± 1.55 | 6.53 ± 0.13 | 0.26 ± 0.00 | -- | – |
| 5′ terminal | 2.00 ± 0.00 | 4.27 ± 0.01 | CACACA | 100.00 | ||
| TGTGTG | 100.00 | |||||
| 3′ terminal | 7.35 ± 0.53 | 27.33 ± 1.84 | 2.70 ± 0.00 | CACACA | 100.00 | |
| TGTGTG | 100.00 | |||||
| BDBV | Complete genome | 74.63 ± 0.52 | 5.36 ± 0.04 | 0.21 ± 0.00 | – | – |
| 5′ terminal | 3.00 ± 0.00 | 66.67 ± 0.00 | 4.38 ± 0.00 | CACACA | 100.00 | |
| TGTGTG | 100.00 | |||||
| 3′ terminal | 5.00 ± 0.00 | 20.00 ± 0.00 | 1.35 ± 0.00 | 100.00 | ||
| TGTGTG | 100.00 | |||||
| TAFV | Complete genome | 79.00 ± 0.00 | 5.06 ± 0.00 | 0.21 ± 0.00 | – | – |
| 5′ terminal | 3.00 ± 0.00 | 66.67 ± 0.00 | 4.32 ± 0.00 | CACACA | 100.00 | |
| TGTGTG | 100.00 | |||||
| 3′ terminal | 4.00 ± 0.00 | 25.00 ± 0.00 | 1.36 ± 0.00 | 100.00 | ||
| TGTGTG | 100.00 | |||||
| RESTV | Complete genome | 81.78 ± 0.83 | 5.98 ± 0.42 | 0.26 ± 0.02 | – | – |
| 5′ terminal | 4.78 ± 0.44 | 39.44 ± 6.35 | 4.08 ± 0.72 | CACACA | 100.00 | |
| TGTGTG | 88.89 g | |||||
| 3′ terminal | 6.22 ± 0.44 | 32.28 ± 2.10 | 2.84 ± 0.00 | CACACA | 100.00 | |
| TGTGTG | 100.00 | |||||
| SUDV | Complete genome | 61.07 ± 2.31 | 8.20 ± 0.30 | 0.26 ± 0.00 | – | – |
| 5′ terminal | 3.00 ± 0.00 | 66.67 ± 0.00 | 4.38 ± 0.00 | CACACA | 100.00 | |
| TGTGTG | 100.00 | |||||
| 3′ terminal | 7.67 ± 0.98 | 26.46 ± 3.10 | 2.83 ± 0.00 | CACACA | 100.00 | |
| TGTGTG | 100.00 |
Conserved of conserved microsatellites, number of conserved microsatellites/total of microsatellites × 100%.
Relative abundance of conserved microsatellites, number of conserved microsatellites/kb.
Conserved microsatellites of Conserved 1–5.
Conserved rate of conserved microsatellites in the sequences, such as 184 sequences all have this motif, the conservative rate is 100%.
Number of microsatellites in all sequences is averaged and the degree of dispersion is expressed by standard deviation.
The motifs of complete genome contains those in the terminal sequences, and Conserved 3 was not analyzed in this study and is therefore not listed.
Fig. 3Conserved microsatellites were located on conserved stem-loop structures. (A) Stem-loop structures of 5 species of Ebolavirus in the 5′ terminal sequences. (B) Stem-loop structures of 5 species of Ebolavirus in the 3′ terminal sequences. Red is used to represent conserved microsatellites, and different bases on highly conserved microsatellites are represented in blue. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)