Literature DB >> 31078274

Conserved microsatellites may contribute to stem-loop structures in 5', 3' terminals of Ebolavirus genomes.

Douyue Li¹, Hongxi Zhang¹, Shan Peng¹, Saichao Pan¹, Zhongyang Tan².

Abstract

Microsatellites (SSRs) are ubiquitous in coding and non-coding regions of the Ebolavirus genomes. We synthetically analyzed the microsatellites in whole-genome and terminal regions of 219 Ebolavirus genomes from five species. The Ebolavirus sequences were observed with small intraspecies variations and large interspecific variations, especially in the terminal non-coding regions. Only five conserved microsatellites were detected in the complete genomes, and four of them which well base-paired to help forming conserved stem-loop structures mainly appeared in the terminal non-coding regions. These results suggest that the conserved microsatellites may be evolutionary selected to form conserved secondary structures in 5', 3' terminals of Ebolavirus genomes. It may help to understand the biological significance of microsatellites in Ebolavirus and also other virus genomes.

Entities: Chemical Disease Gene Species

Keywords: Conserved microsatellite; Conserved stem-loop structure; Ebolavirus; Evolutionary selection

Mesh：

Substances：
RNA, Viral

Year: 2019 PMID： 31078274 PMCID： PMC7092875 DOI： 10.1016/j.bbrc.2019.04.192

Source DB: PubMed Journal: Biochem Biophys Res Commun ISSN： 0006-291X Impact factor: 3.575

Introduction

Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated tracts of 1–6 nucleotides that are repeated multiple times [[1], [2], [3]]. Microsatellites show wide distributions in eukaryotes, prokaryotes and viruses [4], and have length polymorphism, motif diversity and non-random distribution characteristics [5]. Microsatellites are highly mutable and vary widely among individuals possibly as a result of strand slippage during replication [6,7]. These mutations may be correlated with the genomic structures and functions [8,9]. Thus, the emergence and fixation of microsatellites could contribute to the genomic diversity and evolution [10]. In addition, microsatellites expansion has been proved to cause more than 30 neurological and neuromuscular diseases [11], for instance, epilepsy caused by abnormal expansions of TTTCA and TTTTA repeats [12], Huntington disease cansed by CAG repeats [13], etc. Also it is related to lung cancer [14], gastric cancer [15], etc. The occurrence of microsatellites in non-coding regions are also associated with diseases by regulating gene expression, such as Friedreich Ataxia [16]. Whole-genome-scale analysis of 257 viruses suggested that microsatellites were widely distributed in protein-coding and non-coding regions of viral genomes [4]. Microsatellites might potentially contribute to evolution of virus, through the comprehensive analysis of viral genomes in HIV-1 [17], Potyvirus [18] and HCV [19]. The distribution and composition of microsatellites in seven species of Filoviridae family, including Ebolavirus, also have salient features [20]. Ebolavirus comprises of five species: Zaire ebolavirus, Bundibugyo virus, Reston ebolavirus, Sudan ebolavirus, and Taï Forest ebolavirus [21]. The Ebolavirus genome is a negative and single-stranded RNA approximately 19 kb nucleotides. It is organized into a 3′ leader non-coding regions, followed by seven genes and a 5′ trailer non-coding regions [22]. Microsatellites in non-coding regions account for a large proportion of the Ebolavirus genomes [20]. The 5′ and 3′ terminal non-coding regions of virus likely regulate its replication, transcription, and assembly of new virions [23,24]. In Picornaviridae genomes, the terminal sequences can form stem-loop secondary structures to play a role in the viral life cycle [25]. Similarly, there are the conserved secondary structures in potato virus X RNA [26], hepatitis C viruses [27] and coronavirus [28]. Microsatellites mainly exist in non-coding regions [6,29]. The conservation of microsatellites in non-coding regions may be important to gene regulation in plants [30]. In plant viroids, the distribution of microsatellites probably contributes to the secondary structure [31]. The present study is a systematic analysis for the microsatellites in 219 genomic sequences from five Ebolavirus species. To our knowledge, this may be the first study to investigate the related biological significance of microsatellites in virus.

Materials and methods

Collection of genomic sequences

All available 219 complete genomic sequences of Ebolavirus from five species were downloaded from the National Center for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov/genome/), and all were cDNA sequences. These genomes were considered as the references, which recorded more complete terminal sequence. In these sequences, 184 EBOV sequences were selected as the main reference sequence due to appearing with the most lethal rate and the most attention. The other BDBV, RESTV, SUDV and TAFV were respectively 8 (B1—B8), 9 (R1-R9), 15 (S1—S15) and 3 (T1-T3) (Table 1 ), and the detailed information about these sequences was also listed (Table in Supplementary Table S1).

Table 1

List of 5 species Ebolavirus in complete genomes and 5′, 3′ terminal sequences.

Species	Abbreviation	Number of sequences	Genome size (nt)	5′ terminal		3′ terminal
Species	Abbreviation	Number of sequences	Genome size (nt)	Size (nt)	Lengthb	Size (nt)	Length
Zaire ebolavirus	EBOV	184	18990.04 ± 158.91a	468.89 ± 0.53	47.90 ± 0.53	739.66 ± 0.63	50.62 ± 0.51
Bundibugyo virus	BDBV	8	18939.63 ± 0.74	457.00 ± 0.00	48.00 ± 0.00	741.25 ± 0.46	50.13 ± 0.35
Taï Forest ebolavirus	TAFV	3	18935.00 ± 0.00	463.00 ± 0.00	48.00 ± 0.00	737.00 ± 0.00	51.00 ± 0.00
Reston ebolavirus	RESTV	9	18890.44 ± 2.40	462.78 ± 0.67	48.00 ± 0.00	703.22 ± 0.44	50.25 ± 0.46
Sudan ebolavirus	SUDV	15	18874.73 ± 0.44	457.00 ± 0.00	48.00 ± 0.00	707.73 ± 0.44	50.64 ± 0.50

The sizes of 184 sequences are averaged and the standard deviation is calculated to indicate the degree of dispersion.

Length for Prediction: regions including conservative microsatellites is intercepted for secondary structure prediction.

List of 5 species Ebolavirus in complete genomes and 5′, 3′ terminal sequences. The sizes of 184 sequences are averaged and the standard deviation is calculated to indicate the degree of dispersion. Length for Prediction: regions including conservative microsatellites is intercepted for secondary structure prediction.

Extraction of microsatellites

IMEx software was selected to extract microsatellites from 219 samples [1,32]. The lowest copy numbers for extracting each microsatellite motif, which was based on empirical criterion and referenced to previous microsatellite studies, were 6, 3, 3, 3, 3, 3 for the perfect mononucleotide to hexanucleotide microsatellites separately. Owing to extracting perfect microsatellites alone, the basic mode of IMEx software was chosen and the imperfect rate was 0 here [34]. Other parameters were used as default. In addition, the compound microsatellite and imperfect microsatellites were excluded in this analysis, thus there was no different result for the microsatellite extraction while we using other extracting tools or algorithms for checking, like MfSAT [35], RepeatMasker [36] and MISA [37]. The results of these methods were not significantly different (data not shown).

Alignment generation

In order to analyze the role of microsatellites in Ebola virus genome, sequence alignment was used to compare the distributional variations of microsatellites more intuitively. The sequence alignment was performed in Clustal W (version 2.1) and then detected the specifically conserved regions [38,39]. To weak the impact on gaps between microsatellites, the number of gaps was reduced and the length of gap was shorten, thus, the parameters were resetting: gap open penalty = 30.0, gap extension penalty = 10.0.

Prediction of RNA secondary structure

Three common programs and a freely available web server platform were chosen to predict the RNA secondary structure of conserved regions in Ebola virus terminal microsatellite, including IPKNOT 1.3.1 (McCaskill), RNAfold, MFOLD v2.3 [40]. In order to obtain succinct images of subsequent processing, the predicted results of IPKNOT 1.3.1 were selected as the final result graphs, and the outputs from the other two prediction platforms were used as reference to adjustment and improvement. In addition, we also used the R-scape to perform pairwise covariation analysis to support our predictions (data not shown) [41].

Statistical analysis

Means, standard deviations, and graphs were generated with Microsoft excel. SPSS 22.0 was utilized to analyze statistical differences, and the independent sample t-test was used for comparison between the two groups. P < 0.0001 indicated that the difference was statistically significant.

Results

High sequence diversity of 5′, 3′ terminal regions and low diversity in complete genomes of ebolavirus

To measure sequence variances of five species, we aligned the complete genomes and 5′, 3′ terminal non-coding regions of 219 sequences. Size of all analyzed regions were ranged from (18874.73 ± 0.44) nt (SUDV) to (18990.04 ± 158.91) nt (EBOV) of complete genomes, (457.00 ± 0.00) nt (BDBV and SUDV) to (468.89 ± 0.53) nt (EBOV) of 5′ terminals, (703.22 ± 0.44) nt (RESTV) to (741.25 ± 0.46) nt (BDBV) of 3′ terminals, respectively (Table 1). Intraspecific variations were compared by using the first sequence of each species (Z1, B1, T1, R1, S1) as reference sequences (Supplementary Table S1). Compared with the aligned scores of complete genomes, there were no statistical significance (P > 0.05) in these scores of 5′ and 3′ terminals, which were above 90 (Fig. 1 A–C, Supplementary Table S2). However, in a comparison of same-region genomes, tremendous variances could be found among five species, although they have similar genome sizes. The aligned scores of the complete genomes were only in the range of 60–65, and those of 5′ terminals ranged from 17 to 25. This trait was even more evident in 3′ terminal sequences, in which the lowest aligned score was 5 (EBOV vs. BDBV). And significance analysis showed that interspecies variations of two terminal regions were significantly higher than those of complete genomes (P < 0.0001) (Fig. 1D–F). Results here indicated that the terminal non-coding regions of Ebolavirus may have greater sequence variances than other parts of the genomes.

Fig. 1

Sequence diversity in 5′, 3′ terminal regions and complete genomes of Ebolavirus. (A)Intraspecies variations of complete genomes. (B) Intraspecies variations of 5′ terminal sequences. (C) Intraspecies variations of 3′ terminal sequences. (D) Interspecific variations of complete genomes. (E) Interspecific variations of 5′ terminal sequences. (F) Interspecific variations of 3′ terminal sequences. Species 1 and Species 2 are the distinctions between different sequences for pairwise alignment, respectively. Species 1 includes the sequences of Z1-Z184, B1—B8, R1-R9, and Species 2 includes the sequences of B1—B8, R1-R9, S1—S15.

High conserved microsatellites loci mainly occurred in 5′, 3′ terminal non-coding regions

Microsatellites on all of the analyzed sequences were extracted to explore the relationship between microsatellites and Ebolavirus genomes. All microsatellites were counted for each of these genomes and the average summated microsatellites were ranged from 61.07 ± 2.31 (SUDV) to 81.78 ± 0.83 (RESTV) (Fig. 2 A, Table 2 , Supplementary Table S2). Then the extracted microsatellites were labeled on the sequences to observe different locations had the identical microsatellite in five species, which was called conserved loci in this research. These included positions of 5–10 nt, 39–44 nt, 6918–6924 nt, 18951–18956 nt (Z1 as reference), and the microsatellites were respectively CACACA, UGUGUG, AAAAAAA, UGUGUG that were named as completely conserved microsatellites. In addition, there was a relatively conserved locus at 18912 nt, similar to CACACA of the 3′ terminals, however, it was imperfect microsatellite UACACA in BDBV and TAFV except for the other species. It was also classified as a conserved locus, considering its particularity and there was no such a phenomenon elsewhere in the genome. And the imperfect microsatellites on these conserved loci were defined as highly conserved microsatellites. These five conserved loci were named for Conserved 1–5 (Fig. 2B, Table 2, Supplementary Fig. S1). Moreover, nearly all (100%) sequences had these conserved microsatellites (Table 2).

Fig. 2

Table 2

Statistics of conserved microsatellites loci in Ebolavirus genomes.

Species	Regions	Total of SSRs	Conserved SSRs (%)a	Relative abundanceb	Motifc	Rate of sequences (%)d
EBOV	Complete genome	76.66 ± 1.55e	6.53 ± 0.13	0.26 ± 0.00	--f	–
	5′ terminal	2.00 ± 0.00	100 ± 0.00	4.27 ± 0.01	CACACA	100.00
	5′ terminal	2.00 ± 0.00	100 ± 0.00	4.27 ± 0.01	TGTGTG	100.00
	3′ terminal	7.35 ± 0.53	27.33 ± 1.84	2.70 ± 0.00	CACACA	100.00
	3′ terminal	7.35 ± 0.53	27.33 ± 1.84	2.70 ± 0.00	TGTGTG	100.00
BDBV	Complete genome	74.63 ± 0.52	5.36 ± 0.04	0.21 ± 0.00	–	–
	5′ terminal	3.00 ± 0.00	66.67 ± 0.00	4.38 ± 0.00	CACACA	100.00
	5′ terminal	3.00 ± 0.00	66.67 ± 0.00	4.38 ± 0.00	TGTGTG	100.00
	3′ terminal	5.00 ± 0.00	20.00 ± 0.00	1.35 ± 0.00	TACACA	100.00
	3′ terminal	5.00 ± 0.00	20.00 ± 0.00	1.35 ± 0.00	TGTGTG	100.00
TAFV	Complete genome	79.00 ± 0.00	5.06 ± 0.00	0.21 ± 0.00	–	–
	5′ terminal	3.00 ± 0.00	66.67 ± 0.00	4.32 ± 0.00	CACACA	100.00
	5′ terminal	3.00 ± 0.00	66.67 ± 0.00	4.32 ± 0.00	TGTGTG	100.00
	3′ terminal	4.00 ± 0.00	25.00 ± 0.00	1.36 ± 0.00	TACACA	100.00
	3′ terminal	4.00 ± 0.00	25.00 ± 0.00	1.36 ± 0.00	TGTGTG	100.00
RESTV	Complete genome	81.78 ± 0.83	5.98 ± 0.42	0.26 ± 0.02	–	–
	5′ terminal	4.78 ± 0.44	39.44 ± 6.35	4.08 ± 0.72	CACACA	100.00
	5′ terminal	4.78 ± 0.44	39.44 ± 6.35	4.08 ± 0.72	TGTGTG	88.89 g
	3′ terminal	6.22 ± 0.44	32.28 ± 2.10	2.84 ± 0.00	CACACA	100.00
	3′ terminal	6.22 ± 0.44	32.28 ± 2.10	2.84 ± 0.00	TGTGTG	100.00
SUDV	Complete genome	61.07 ± 2.31	8.20 ± 0.30	0.26 ± 0.00	–	–
	5′ terminal	3.00 ± 0.00	66.67 ± 0.00	4.38 ± 0.00	CACACA	100.00
	5′ terminal	3.00 ± 0.00	66.67 ± 0.00	4.38 ± 0.00	TGTGTG	100.00
	3′ terminal	7.67 ± 0.98	26.46 ± 3.10	2.83 ± 0.00	CACACA	100.00
	3′ terminal	7.67 ± 0.98	26.46 ± 3.10	2.83 ± 0.00	TGTGTG	100.00

Conserved of conserved microsatellites, number of conserved microsatellites/total of microsatellites × 100%.

Relative abundance of conserved microsatellites, number of conserved microsatellites/kb.

Conserved microsatellites of Conserved 1–5.

Conserved rate of conserved microsatellites in the sequences, such as 184 sequences all have this motif, the conservative rate is 100%.

Number of microsatellites in all sequences is averaged and the degree of dispersion is expressed by standard deviation.

The motifs of complete genome contains those in the terminal sequences, and Conserved 3 was not analyzed in this study and is therefore not listed.

High conserved microsatellites loci mainly occurred in 5′, 3′ terminal sequences. (A) Total of microsatellites in the complete genomes and 5′, 3′ terminal sequences. (B) Sequences of five conserved microsatellite loci. (C) Conserved rate of complete genomes and terminal sequences. (D) Relative abundance of complete genomes and terminal sequences. * indicates that they have the same base. Blue is used to represent conserved microsatellites, and different bases on highly conserved microsatellites are represented in red. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.) Statistics of conserved microsatellites loci in Ebolavirus genomes. Conserved of conserved microsatellites, number of conserved microsatellites/total of microsatellites × 100%. Relative abundance of conserved microsatellites, number of conserved microsatellites/kb. Conserved microsatellites of Conserved 1–5. Conserved rate of conserved microsatellites in the sequences, such as 184 sequences all have this motif, the conservative rate is 100%. Number of microsatellites in all sequences is averaged and the degree of dispersion is expressed by standard deviation. The motifs of complete genome contains those in the terminal sequences, and Conserved 3 was not analyzed in this study and is therefore not listed. Although there is no direct correlation of microsatellites with sequence variances, it is generally assumed that the more similar genomes are expected to contain more conserved microsatellites than do the more different ones [4]. On the contrary, conserved microsatellite loci were mainly located on the terminal non-coding regions with the exception of Conserved 3 that were located on GP gene. In view of this particular phenomenon, we mainly analyzed the terminal sequences further, and continue to study the Conserved 3 in the future. In order to verify the extremely high conservatism of conserved microsatellites at the terminal sequences, the conserved rate was analyzed and the counts subsequently compared across complete genomes. The highest conserved rate of the complete genomes was 8.20% ± 0.30, while the highest one in the terminal regions was 100% ± 0.00 (EBOV), which meant that there were only those two conserved microsatellites in all sequences. Conserved rate of the terminal sequences were considerably higher than those of complete genomes (P < 0.0001) (Fig. 2C, Supplementary Table S2). To observe the proportion of conserved microsatellites in the sequences, the relative abundance of the microsatellites was calculated to eliminate the influence of different lengths. Additionally, the perfect microsatellites were mainly analyzed in this step, so only completely conserved microsatellites were considered for further analysis. Relative abundances of conserved microsatellites in 5′, 3′ terminal sequences were significantly higher than those of complete genomes (P < 0.0001) (Fig. 2D, Supplementary Table S2).

Conserved microsatellites were located on conserved stem-loop structures

To identify potential functional consequences of conserved microsatellites, RNA secondary structure prediction was performed for the further analysis on the above conserved microsatellites. All terminal sequences of five species were identified and they all form different secondary structures. However, the intuitive analysis of conserved microsatellite regions was affected by the multiple possibilities resulted from the algorithms in the process of prediction. While base pairing of the two conserved regions could lead to the formation of the base of the putative stem-loop structures, the intervening sequence variations altered the predictions for the rest of the structures [42]. Therefore, a segment of the conserved microsatellite region was intercepted for prediction, where the 5′ terminal was truncated to about 48 nt, and the 3′ terminal was truncated to about 50 nt (Table 1). Then these sequences of conserved microsatellite regions were put into an online secondary structure website for prediction, and accomplished with the comparison of multiple websites to ensure its accuracy. Thermodynamic modeling of Ebola viral RNA predicted the formation of RNA stem-loop structures at the 3′ and 5′ terminal. All 5′ terminal sequences formed similar stem-loop structures. Unlike the other three species, RESTV and SUDV did not appear the corner, however, they generally exhibited similar structures (Fig. 3 A). Consistent with the 5′ terminal, the secondary structures of 3′ terminal were predicted (Fig. 3B). Sequence analysis showed a highly identity among five species in their 3′ and 5′ terminal.

Fig. 3

Conserved microsatellites were located on conserved stem-loop structures. (A) Stem-loop structures of 5 species of Ebolavirus in the 5′ terminal sequences. (B) Stem-loop structures of 5 species of Ebolavirus in the 3′ terminal sequences. Red is used to represent conserved microsatellites, and different bases on highly conserved microsatellites are represented in blue. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.) Furthermore, two conserved microsatellites in 5′ terminal sequences were complementary pairing and formed stems on the stem-loop structure (Fig. 3A). Similarly, two conserved microsatellites on 3′ terminal sequence also complemented each other to form stems on the stem-loop structures. Although the 3′ terminal of BDBV and TAFV were high conserved microsatellites (UACACA) and complete conserved microsatellite (UGUGUG) pairs, they were also highly homologous to the other three species (Fig. 3B). The prediction of RNA secondary structure of 438 terminal sequences revealed all Ebolavirus sequences follow above rules. Summarily, only one sequence of each species was selected to display, and the others were in the supplementary figure (Supplementary Figs. S2–S19).

Discussion

In this study, five conserved microsatellites were found in complete genomes of Ebolavirus and mainly distributed in the terminal regions. The four conserved microsatellites were revealed to help forming conserved stem-loop structures by the RNA secondary structure prediction method, and it is tempting to speculate that the conserved microsatellites may contribute to the formation of conserved RNA secondary structures. The former comprehensive analysis [4,17,19] is little known about the relationship between microsatellites and functions and structures of viral genomes. And this study may help to understand the biological significance of microsatellites in Ebolavirus and also other virus genomes.

Conserved microsatellites may contribute to conserved stem-loop structures in the terminal regions

The selections to the microsatellite appearances in genomes might have some association with the important biological significance [43]. The 5′, 3′ terminal non-coding regions were thought to regulate viral transcription, replication, and assembly of new virions [23]. The 3′ trailer of Ebolavirus were essential for replication by formed a small stem-loop structure, and participated in RNA-Protein interaction [44]. The related results suggested that conserved microsatellites in the terminal regions of Ebolavirus were well base-paired to help forming conserved stem-loop structures (Fig. 3, Supplementary Figs. S2–S19). Sequence analysis showed that the nucleic acid of Ebolavirus (EBOV, SUDV, RESTV, TAFV) between their 3′ terminal (18 nt) and 5′ terminal (20 nt) were highly identical [42], and conserved stem-loop structures were found [42,44,45], which proved the accuracy of the prediction results in this study. Here, we displayed the relationship between such structures and microsatellites to speculate on the role of microsatellites in Ebolavirus genome. Dinucleotide repeats have been demonstrated that can form secondary structure in eukaryotic genome [8]. And Qin et al. found that microsatellite distribution affects their secondary structure in plant viroids [31]. Therefore, the conserved microsatellites analyzed here indicated that they may contribute to the formation of the stem-loop structures in the terminal regions of Ebolavirus.

Conserved microsatellites under greater evolutionary pressure

The distribution of microsatellites on Ebolavirus genomes had regional preference between and among species [20]. Due to the isolation of many viruses still require the differentiation of thousands of years [46], the viral genome often produces more mutations. Li et al. calculated that the common ancestor for five species of Ebolavirus was about 1257 years old [43]. According to the novel filamentous virus classification process, nucleotide level difference of less than 30% can be divided into the same species, and less than 50% can be divided into the same genera [21]. Therefore, Ebolavirus genomes had high similarities in same species, and there are large sequence variances among five species (Fig. 1, Supplementary Table S2). However, conserved microsatellites loci were mainly distributed in the terminal non-coding regions, and the terminal regions had larger sequence variances that might face with more selective pressures in contrast to complete genomes. Microsatellites were previously reported to be more abundant in non-coding regions [29,47]. According to statistics, except for the longest L gene, the microsatellites in the non-coding regions of Ebolavirus are the most [20]. However, in this research, the variances of the terminal sequences were clearly more than other sequences in Ebolavirus genomes. In addition, the microsatellites with CA and UG as motifs are not high in viral genome [4]. These conserved microsatellites can be preserved through the long evolutionary time, which mean that they perhaps occurred in the importantly locus.

Conserved microsatellites might be preserved depending on functional selection

Selective pressure is widely deemed to the causes for the unequal distributions of microsatellites [48,49]. Some microsatellites might be selected by their functions in the process of mutation [50] and could be retained for countless years. For example, some non-coding microsatellites were conserved in plants between species across hundreds of millions of years [30], and human genome have many highly conserved promoter microsatellites [51]. The original Ebolavirus genomes probably experienced powerful pressures and different fates through the series of selection, consequently, the adaptive ones were preserved [50,52]. It is noteworthy that in Conserved 4, the imperfect microsatellite UACACA of BDBV and TAFV was most likely due to single point mutation. According to the stepwise mutation model, the microsatellite will gradually disappear into the process of evolution by means of point mutation [2]. BDBV, which was discovered and named only in 2007, has the highest similarity with TAFV [53]. This conserved locus may serve as another proof that these two species are more homologous than other Ebolavirus species. So the role of Conserved 4 in the Ebola virus deserves further study. In addition, there was a sequence (R3) of RESTV in Conserved 2, and UGUGUG is also mutated to UGUGUA, which might be the same case (Fig. 2B, Supplementary Fig. S1). However, these mutations did not affect the formation of secondary structures (Supplementary Fig. S8). In summary, conserved microsatellites might be a characteristic product of natural variation and deserves more research.

50 in total

Review 1. Microsatellite mutations in the germline: implications for evolutionary inference.

Authors: H Ellegren
Journal: Trends Genet Date: 2000-12 Impact factor: 11.639

2. Multiple sequence alignment with the Clustal series of programs.

Authors: Ramu Chenna; Hideaki Sugawara; Tadashi Koike; Rodrigo Lopez; Toby J Gibson; Desmond G Higgins; Julie D Thompson
Journal: Nucleic Acids Res Date: 2003-07-01 Impact factor: 16.971

3. Using and understanding RepeatMasker.

Authors: Sébastien Tempel
Journal: Methods Mol Biol Date: 2012

4. Elimination of Toxic Microsatellite Repeat Expansion RNA by RNA-Targeting Cas9.

Authors: Ranjan Batra; David A Nelles; Elaine Pirie; Steven M Blue; Ryan J Marina; Harrison Wang; Isaac A Chaim; James D Thomas; Nigel Zhang; Vu Nguyen; Stefan Aigner; Sebastian Markmiller; Guangbin Xia; Kevin D Corbett; Maurice S Swanson; Gene W Yeo
Journal: Cell Date: 2017-08-10 Impact factor: 41.582

5. Microsatellite is an important component of complete hepatitis C virus genomes.

Authors: Ming Chen; Zhongyang Tan; Guangming Zeng
Journal: Infect Genet Evol Date: 2011-06-23 Impact factor: 3.342

Review 6. Promoter microsatellites as modulators of human gene expression.

Authors: Sterling M Sawaya; Andrew T Bagshaw; Emmanuel Buschiazzo; Neil J Gemmell
Journal: Adv Exp Med Biol Date: 2012 Impact factor: 2.622

7. IPknot: fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming.

Authors: Kengo Sato; Yuki Kato; Michiaki Hamada; Tatsuya Akutsu; Kiyoshi Asai
Journal: Bioinformatics Date: 2011-07-01 Impact factor: 6.937

Review 8. Microsatellite instability in gastric cancer: molecular bases, clinical perspectives, and new treatment approaches.

Authors: Margherita Ratti; Andrea Lampis; Jens C Hahne; Rodolfo Passalacqua; Nicola Valeri
Journal: Cell Mol Life Sci Date: 2018-09-01 Impact factor: 9.261

Review 9. 5'- and 3'-noncoding regions in flavivirus RNA.

Authors: Lewis Markoff
Journal: Adv Virus Res Date: 2003 Impact factor: 9.937

10. A small stem-loop structure of the Ebola virus trailer is essential for replication and interacts with heat-shock protein A8.

Authors: Joanna Sztuba-Solinska; Larissa Diaz; Mia R Kumar; Gaëlle Kolb; Michael R Wiley; Lucas Jozwick; Jens H Kuhn; Gustavo Palacios; Sheli R Radoshitzky; Stuart F J Le Grice; Reed F Johnson
Journal: Nucleic Acids Res Date: 2016-09-19 Impact factor: 16.971

1 in total

1. Genome-wide identification and characterization of microsatellite markers within the Avipoxviruses.

Authors: Basanta Pravas Sahu; Prativa Majee; Ravi Raj Singh; Niranjan Sahoo; Debasis Nayak
Journal: 3 Biotech Date: 2022-04-13 Impact factor: 2.893

1 in total