| Literature DB >> 27366148 |
Sebastián Metz1, Juan Manuel Cabrera1, Eva Rueda2, Federico Giri3, Patricia Amavet4.
Abstract
Microsatellites are genomic sequences comprised of tandem repeats of short nucleotide motifs widely used as molecular markers in population genetics. FullSSR is a new bioinformatic tool for microsatellite (SSR) loci detection and primer design using genomic data from NGS assay. The software was tested with 2000 sequences of Oryza sativa shotgun sequencing project from the National Center of Biotechnology Information Trace Archive and with partial genome sequencing with ROCHE 454® from Caiman latirostris, Salvator merianae, Aegla platensis, and Zilchiopsis collastinensis. FullSSR performance was compared against other similar SSR search programs. The results of the use of this kind of approach depend on the parameters set by the user. In addition, results can be affected by the analyzed sequences because of differences among the genomes. FullSSR simplifies the detection of SSRs and primer design on a big data set. The command line interface of FullSSR was intended to be used as part of genomic analysis tools pipeline; however, it can be used as a stand-alone program because the results are easily interpreted for a nonexpert user.Entities:
Year: 2016 PMID: 27366148 PMCID: PMC4913048 DOI: 10.1155/2016/6040124
Source DB: PubMed Journal: Adv Bioinformatics ISSN: 1687-8027
Figure 1Flowchart of FullSSR. Multi-FASTA file is used as input; in the gray box are the different processing modules of the core program. There are three kinds of outputs: easy-to-interpret HTML report, Primer3 native outputs, and text files for each SSR found.
Algorithm comparison: results comparison among FullSSR, MISA, and PHOBOS using O. sativa sequences.
|
| FullSSR | MISA | PHOBOS |
|---|---|---|---|
| 1-mer | 615 | 812 | 1518 |
| 2-mer | 172 | 831 | 832 |
| 3-mer | 101 | 275 | 1435 |
| 4-mer | 17 | 31 | 774 |
| 5-mer | 3 | 7 | 828 |
| Total | 908 | 1956 | 5387 |
SSR classification. SSR types obtained with sequences from different partial sequencing projects. The 1-mer SSRs found are greater than 8 bp long.
|
|
|
|
|
|
|---|---|---|---|---|
| 1-mer | 21 | 175 | 33 | 56 |
| 2-mer | 106 | 644 | 77 | 139 |
| 3-mer | 35 | 146 | 14 | 33 |
| 4-mer | 14 | 30 | 4 | 25 |
| 5-mer | 3 | 5 | 4 | 6 |
| Total | 179 | 1000 | 131 | 5387 |
Results for different data set. bp read: base pairs read; % GC: guanine-cytosine percentage; SSRs and primers: number of SSRs found and number of primers designed.
| Species | bp read | % GC | SSRs | Primers |
|---|---|---|---|---|
|
| 143637 | 41,34 | 179 | 98 |
|
| 907553 | 42,12 | 1000 | 605 |
|
| 308323 | 44,65 | 131 | 92 |
|
| 275101 | 50,23 | 259 | 148 |
Figure 2Screenshot of the HTML report of the program. SSR distribution and primers for each SSR are shown (if any). Additional data is displayed (execution time, number of sequences, etc.).