| Literature DB >> 23861572 |
Xuewen Wang1, Peng Lu, Zhaopeng Luo.
Abstract
UNLABELLED: Simple Sequence Repeats (SSR), also called microsatellite, is very useful for genetic marker development and genome application. The increasing whole sequences of more and more large genomes provide sources for SSR mining in silico. However currently existing SSR mining tools can't process large genomes efficiently and generate no or poor statistics. Genome-wide Microsatellite Analyzing Tool (GMATo) is a novel tool for SSR mining and statistics at genome aspects. It is faster and more accurate than existed tools SSR Locator and MISA. If a DNA sequence was too long, it was chunked to short segments at several Mb followed by motifs generation and searching using Perl powerful pattern match function. Matched loci data from each chunk were then merged to produce final SSR loci information. Only one input file is required which contains raw fasta DNA sequences and output files in tabular format list all SSR loci information and statistical distribution at four classifications. GMATo was programmed in Java and Perl with both graphic and command line interface, either executable alone in platform independent manner with full parameters control. Software GMATo is a powerful tool for complete SSR characterization in genomes at any size. AVAILABILITY: The soft GMATo is freely available at http://sourceforge.net/projects/gmato/files/?source=navbar or on contact.Entities:
Keywords: Genome; Marker development; Microsatellite; SSR; Software
Year: 2013 PMID: 23861572 PMCID: PMC3705631 DOI: 10.6026/97320630009541
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1A flowchart of software GMATo.
Figure 2Image showing input and output of soft GMATo; (A) graphic input interface; (B) SSR loci information produced by GMATo; (C) SSR statistical data produced by GMATo.