| Literature DB >> 18670612 |
Luciano Carlos da Maia1, Dario Abel Palmieri, Velci Queiroz de Souza, Mauricio Marini Kopp, Fernando Irajá Félix de Carvalho, Antonio Costa de Oliveira.
Abstract
Microsatellites or SSRs (simple sequence repeats) are ubiquitous short tandem duplications occurring in eukaryotic organisms. These sequences are among the best marker technologies applied in plant genetics and breeding. The abundant genomic, BAC, and EST sequences available in databases allow the survey regarding presence and location of SSR loci. Additional information concerning primer sequences is also the target of plant geneticists and breeders. In this paper, we describe a utility that integrates SSR searches, frequency of occurrence of motifs and arrangements, primer design, and PCR simulation against other databases. This simulation allows the performance of global alignments and identity and homology searches between different amplified sequences, that is, amplicons. In order to validate the tool functions, SSR discovery searches were performed in a database containing 28 469 nonredundant rice cDNA sequences.Entities:
Year: 2008 PMID: 18670612 PMCID: PMC2486402 DOI: 10.1155/2008/412696
Source DB: PubMed Journal: Int J Plant Genomics ISSN: 1687-5389
Figure 1Flow-chart showing the functional structure of SSR Locator. (A) Perl script to search SSRs; (B) text file where information from detected SSRs is stored; (C) module for the statistical calculations for SSR motif occurrence; (D) module that formats text files into standard Primer3 input files; (E) running of Primer3; (F) module for running Virtual-PCR (using a second sequence file as a template); (G) module performing global alignment between homologous amplicons; (H) identity and alignment score calculations between homologous amplicons; and (I) file containing SSR, primer, homologous amplicons, identity, and score information.
Distribution of SSR/minisatellite motifs according to the number of repeats.
| Repeats |
| (%) |
| (%) |
| (%) |
| (%) |
| (%) |
| (%) |
| (%) |
| (%) |
| (%) |
| (%) |
| (%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3 |
| — |
| — |
| — |
| — |
| — |
| — |
| 95.12 |
| 100 |
| 96 |
| 100 |
| 2.89 |
| 4 |
| — |
| — |
| — |
| — |
| 81.69 |
| 82.82 |
| 4.88 |
| 0 |
| 4 |
| 0 |
| 17.30 |
| 5 |
| — |
| — |
| — |
| 72.11 |
| 16.20 |
| 11.54 |
| 0 |
| 0 |
| 0 |
| 0 |
| 7.55 |
| 6 |
| — |
| — |
| — |
| 16.33 |
| 1.64 |
| 3.33 |
| 0 |
| 0 |
| 0 |
| 0 |
| 1.56 |
| 7 |
| — |
| — |
| 61.31 |
| 3.59 |
| 0 |
| 1.28 |
| 0 |
| 0 |
| 0 |
| 0 |
| 31.58 |
| 8 |
| — |
|
|
| 22.16 |
| 3.59 |
| 0.23 |
| 0.26 |
| 0 |
| 0 |
| 0 |
| 0 |
| 11.57 |
| 9 |
| — |
| — |
| 8.69 |
| 1.59 |
| 0 |
| 0.26 |
| 0 |
| 0 |
| 0 |
| 0 |
| 4.56 |
| 10 |
| — |
| 21.04 |
| 3.42 |
| 0.40 |
| 0 |
| 0.51 |
| 0 |
| 0 |
| 0 |
| 0 |
| 5.02 |
| 11 |
| — |
| 13.80 |
| 1.61 |
| 1.20 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 2.99 |
| 12 |
| — |
| 12.79 |
| 0.90 |
| 0.40 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 2.43 |
| 13 |
| — |
| 11.95 |
| 0.25 |
| 0.40 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 1.97 |
| 14 |
| — |
| 6.57 |
| 0.10 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 1.05 |
| 15 |
| — |
| 7.41 |
| 0.25 |
| 0 |
| 0.23 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 1.28 |
| 16 |
| — |
| 5.05 |
| 0.10 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.82 |
| 17 |
| — |
| 5.56 |
| 0.05 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.87 |
| 18 |
| — |
| 2.53 |
| 0.15 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.46 |
| 19 |
| — |
| 2.86 |
| 0.05 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.49 |
| 20 |
| 15.22 |
| 2.36 |
| 0.10 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.95 |
| 21 |
| 13.77 |
| 1.35 |
| 0.10 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.74 |
| 22 |
| 10.87 |
| 1.01 |
| 0.15 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.61 |
| 23 |
| 5.80 |
| 1.18 |
| 0.15 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.46 |
| 24 |
| 2.17 |
| 0.84 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.20 |
| 25 |
| 6.52 |
| 0.84 |
| 0.05 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.38 |
| 26 |
| 3.62 |
| 0.67 |
| 0.10 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.28 |
| 27 |
| 2.17 |
| 0.17 |
| 0.05 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.13 |
| 28 |
| 0.72 |
| 0.51 |
| 0.15 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.18 |
| 29 |
| 2.90 |
| 0 |
| 0.05 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.13 |
| 30 |
| 1.45 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.05 |
| 31 |
| 6.52 |
| 0.34 |
| 0.05 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.31 |
| 32 |
| 2.17 |
| 0.51 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.15 |
| 33 |
| 2.17 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.08 |
| 34 |
| 0.72 |
| 0.17 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.05 |
| 35 |
| 4.35 |
| 0.17 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.18 |
| 36 |
| 0.72 |
| 0.17 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.05 |
| 37 |
| 0.72 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.03 |
| 38 |
| 2.90 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.10 |
| 39 |
| 0 |
| 0.17 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.03 |
| 40 |
| 0.72 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.03 |
| 41 |
| 0.72 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.03 |
| 42 |
| 1.45 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.05 |
| 43 |
| 1.45 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.05 |
| 44 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.00 |
| ≥ 45 |
| 10.14 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0 |
| 0.36 |
|
| ||||||||||||||||||||||
| Total |
|
|
|
|
|
|
|
|
|
|
| |||||||||||
| (%) |
|
|
|
|
|
|
|
|
|
|
| |||||||||||
Distribution of SSR/minisatellite repeats in the rice cDNA collection.
| Motif | Ocur(1) | (%)(1) | Ocur(2) | (%)(2) | Total | (%) Group | (%) Overall | |
|---|---|---|---|---|---|---|---|---|
| Mono- | A/T |
| 88.80 |
| 11.20 |
| 90.58 | 3.20 |
| C/G |
| 76.92 |
| 23.08 |
| 9.42 | 0.33 | |
|
| ||||||||
| 2-mer | AG/CT |
| 36.06 |
| 63.94 |
| 45.29 | 6.89 |
| GA/TC |
| 61.37 |
| 38.63 |
| 39.23 | 5.96 | |
| CA/TG |
| 35.71 |
| 64.29 |
| 4.71 | 0.72 | |
| AT |
| 100.00 |
| — |
| 4.04 | 0.61 | |
| AC/GT |
| 31.58 |
| 68.42 |
| 3.20 | 0.49 | |
| TA |
| 100.00 |
| — |
| 3.20 | 0.49 | |
| CG |
| 100.00 |
| — |
| 0.34 | 0.05 | |
|
| ||||||||
| 3-mer | CCG/CGG |
| 53.68 |
| 46.32 |
| 18.44 | 9.39 |
| CGC/GCG |
| 61.24 |
| 38.76 |
| 17.89 | 9.11 | |
| GCC/GGC |
| 53.08 |
| 46.92 |
| 10.60 | 5.40 | |
| CTC/GAG |
| 42.69 |
| 57.31 |
| 8.59 | 4.38 | |
| AGG/CCT |
| 30.91 |
| 69.09 |
| 5.53 | 2.82 | |
| GGA/TCC |
| 62.50 |
| 37.50 |
| 4.82 | 2.46 | |
| CAG/CTG |
| 76.32 |
| 23.68 |
| 3.82 | 1.95 | |
| AAG/CTT |
| 50.75 |
| 49.25 |
| 3.37 | 1.71 | |
| CGA/TCG |
| 54.10 |
| 45.90 |
| 3.07 | 1.56 | |
| AGC/GCT |
| 62.07 |
| 37.93 |
| 2.91 | 1.48 | |
| GCA/TGC |
| 83.93 |
| 16.07 |
| 2.81 | 1.43 | |
| AGA/TCT |
| 62.26 |
| 37.74 |
| 2.66 | 1.36 | |
| CCA/TGG |
| 75.00 |
| 25.00 |
| 2.61 | 1.33 | |
| ACC/GGT |
| 48.89 |
| 51.11 |
| 2.26 | 1.15 | |
| GAA/TTC |
| 63.64 |
| 36.36 |
| 2.21 | 1.13 | |
| CAC/GTG |
| 65.12 |
| 34.88 |
| 2.16 | 1.10 | |
| GAC/GTC |
| 54.55 |
| 45.45 |
| 1.66 | 0.84 | |
| ACG/CGT |
| 42.31 |
| 57.69 |
| 1.31 | 0.67 | |
| ATC/GAT |
| 45.45 |
| 54.55 |
| 0.55 | 0.28 | |
| TCA/TGA |
| 50.00 |
| 50.00 |
| 0.50 | 0.26 | |
| CAA/TTG |
| 50.00 |
| 50.00 |
| 0.40 | 0.20 | |
| ACT/AGT |
| 42.86 |
| 57.14 |
| 0.35 | 0.18 | |
| TAA/TTA |
| 14.29 |
| 85.71 |
| 0.35 | 0.18 | |
| CTA/TAG |
| 66.67 |
| 33.33 |
| 0.30 | 0.15 | |
| AAT/ATT |
| 20.00 |
| 80.00 |
| 0.25 | 0.13 | |
| CAT/ATG |
| 100.00 |
| 0 |
| 0.20 | 0.10 | |
| AAC/GTT |
| 75.00 |
| 25.00 |
| 0.20 | 0.10 | |
| ATA/TAT |
| 50.00 |
| 50.00 |
| 0.10 | 0.05 | |
| GTA/TAC |
| 100.00 |
| 0 |
| 0.05 | 0.03 | |
|
| ||||||||
| 4-mer | GATC |
| 100.00 |
| 0 |
| 7.17 | 0.46 |
| ATTA/TAAT |
| 52.94 |
| 47.06 |
| 6.77 | 0.44 | |
| ATCG/CGAT |
| 20.00 |
| 80.00 |
| 5.98 | 0.38 | |
| CATC/GATG |
| 40.00 |
| 60.00 |
| 3.98 | 0.26 | |
| AGAA/TTCT |
| 25.00 |
| 75.00 |
| 3.19 | 0.20 | |
| GCTA/TAGC |
| 75.00 |
| 25.00 |
| 3.19 | 0.20 | |
| GATA/TATC |
| 14.29 |
| 85.71 |
| 2.79 | 0.18 | |
| GCGA/TCGC |
| 42.86 |
| 57.14 |
| 2.79 | 0.18 | |
| GCAC/GTGC |
| 33.33 |
| 66.67 |
| 2.39 | 0.15 | |
| AGGG/CCCT |
| 33.33 |
| 66.67 |
| 2.39 | 0.15 | |
|
| ||||||||
| 5-mer | AGGAG/CTCCT |
| 15.00 |
| 85.00 |
| 4.69 |
|
| CTCTC/GAGAG |
| 89.47 |
| 10.53 |
| 4.46 |
| |
| GAGGA/TCCTC |
| 56.25 |
| 43.75 |
| 3.76 |
| |
| CCTCC/GGAGG |
| 80.00 |
| 20.00 |
| 3.52 |
| |
| AGAGG/CCTCT |
| 26.67 |
| 73.33 |
| 3.52 |
| |
| GGAGA/TCTCC |
| 18.18 |
| 81.82 |
| 2.58 |
| |
| CTCGC/GCGAG |
| 77.78 |
| 22.22 |
| 2.11 |
| |
| AGCTA/TAGCT |
| 44.44 |
| 55.56 |
| 2.11 |
| |
| GAAAA/TTTTC |
| 25.00 |
| 75.00 |
| 1.88 |
| |
| AGGCG/CGCCT |
| 25.00 |
| 75.00 |
| 1.88 |
| |
|
| ||||||||
| 6-mer | CGCCTC/GAGGCG |
| 85.71 |
| 14.29 |
| 3.59 | 0.36 |
| CGGCGA/TCGCCG |
| 28.57 |
| 71.43 |
| 3.59 | 0.36 | |
| CCTCCG/CGGAGG |
| 81.82 |
| 18.18 |
| 2.82 | 0.28 | |
| AGGCGG/CCGCCT |
| 10.00 |
| 90.00 |
| 2.56 | 0.26 | |
| CCGTCG/CGACGG |
| 44.44 |
| 55.56 |
| 2.31 | 0.23 | |
| CGTCGC/GCGACG |
| 77.78 |
| 22.22 |
| 2.31 | 0.23 | |
| ACCGCC/GGCGGT |
| 12.50 |
| 87.50 |
| 2.05 | 0.20 | |
| CCACCG/CGGTGG |
| 85.71 |
| 14.29 |
| 1.79 | 0.18 | |
| GGCGGA/TCCGCC |
| 71.43 |
| 28.57 |
| 1.79 | 0.18 | |
| CTCCAT/ATGGAG |
| 100.00 |
| 0 |
| 1.54 | 0.15 | |
|
| ||||||||
| 7-mer | CCGCCGC/GCGGCGG |
| 66.67 |
| 33.33 |
| 7.32 | 0.15 |
| CTCTCTC/GAGAGAG |
| 80.00 |
| 20.00 |
| 6.10 | 0.13 | |
| CCTCTCT/AGAGAGG |
| 100.00 |
| 0 |
| 4.88 | 0.10 | |
| CTCTCTT/AAGAGAG |
| 100.00 |
| 0 |
| 4.88 | 0.10 | |
| CCCAAAT/ATTTGGG |
| 100.00 |
| 0 |
| 3.66 | 0.08 | |
| GCCGCCG/CGGCGGC |
| 100.00 |
| 0 |
| 3.66 | 0.08 | |
| GCGGCGC/GCGCCGC |
| 100.00 |
| 0 |
| 2.44 | 0.05 | |
| AATAAAA/TTTTATT |
| 100.00 |
| 0 |
| 2.44 | 0.05 | |
| GTGTGCG/CGCACAC |
| 100.00 |
| 0 |
| 2.44 | 0.05 | |
| CGCCGTC/GACGGCG |
| 100.00 |
| 0 |
| 2.44 | 0.05 | |
|
| ||||||||
| 8-mer | TTGGTTTC/GAAACCAA |
| 100.00 |
| 0 |
| 33.33 | 0.05 |
| TGGGCTTG/CAAGCCCA |
| 100.00 |
| 0 |
| 16.67 | 0.03 | |
| GCTTCTTG/CAAGAAGC |
| 100.00 |
| 0 |
| 16.67 | 0.03 | |
| ACGGGCGA/TCGCCCGT |
| 100.00 |
| 0 |
| 16.67 | 0.03 | |
| ATGATGTA/TACATCAT |
| 100.00 |
| 0 |
| 16.67 | 0.03 | |
|
| ||||||||
| 9-mer | TCGGCGGCG/CGCCGCCGA |
| 100.00 |
| 0 |
| 8.00 | 0.05 |
| AGGTGGTGG/CCACCACCT |
| 100.00 |
| 0 |
| 8.00 | 0.05 | |
| CCGGTGCGA/TCGCACCGG |
| 100.00 |
| 0 |
| 4.00 | 0.03 | |
| ACGAGGAGG/CCTCCTCGT |
| 100.00 |
| 0 |
| 4.00 | 0.03 | |
| TCCCTTTTC/GAAAAGGGA |
| 100.00 |
| 0 |
| 4.00 | 0.03 | |
| CGGCATGAA/TTCATGCCG |
| 100.00 |
| 0 |
| 4.00 | 0.03 | |
| CGGCAGCGA/TCGCTGCCG |
| 100.00 |
| 0 |
| 4.00 | 0.03 | |
| ACCATCCCG/CGGGATGGT |
| 100.00 |
| 0 |
| 4.00 | 0.03 | |
| ATGGGCGGC/GCCGCCCAT |
| 100.00 |
| 0 |
| 4.00 | 0.03 | |
| ATGCAGGGT/ACCCTGCAT |
| 100.00 |
| 0 |
| 4.00 | 0.03 | |
|
| ||||||||
| 10-mer | AGCCCCAACG/CGTTGGGGCT |
| 50.00 |
| 50.00 |
| 40.00 | 0.05 |
| TTTTTTTCTT/AAGAAAAAAA |
| 100.00 |
| 0 |
| 20.00 | 0.03 | |
| CCTGCTTTGC/GCAAAGCAGG | 1 | 100 | 0 | 0 | 1 | 20 | 0.03 | |
| ATCTCCGCCG/CGGCGGAGAT | 1 | 100 | 0 | 0 | 1 | 20 | 0.03 | |
Distribution of amplicon alignments for specific and redundant amplicons with varying identity levels.
| Identity | 100 | 99 | 98 | 97 | 96 | 95–90 | 89–80 | 79–70 | 69–60 | ≤59 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Amplicons | 787 | 261 | 151 | 29 | 11 | 8 | 8 | 6 | 5 | 15 | 1281 |
| % | 61.44 | 20.37 | 11.79 | 2.26 | 0.86 | 0.62 | 0.62 | 0.47 | 0.39 | 1.17 | — |