| Literature DB >> 23284085 |
Gaurav Sablok1, Suresh B Mudunuri, Sujan Patnana, Martina Popova, Mario A Fares, Nicola La Porta.
Abstract
Microsatellites or simple sequence repeats (SSRs) are repetitive stretches of nucleotides (A, T, G, C) that are distributed either as single base pair stretches or as a combination of two- to six-nucleotides units that are non-randomly distributed within coding and in non-coding regions of the genome. ChloroMitoSSRDB is a complete curated web-oriented relational database of perfect and imperfect repeats in organelle genomes. The present version of the database contains perfect and imperfect SSRs of 2161 organelle genomes (1982 mitochondrial and 179 chloroplast genomes). We detected a total of 5838 chloroplast perfect SSRs, 37 297 chloroplast imperfect SSRs, 5898 mitochondrial perfect SSRs and 50 355 mitochondrial imperfect SSRs across these genomes. The repeats have been further hyperlinked to the annotated gene regions (coding or non-coding) and a link to the corresponding gene record in National Center for Biotechnology Information(www.ncbi.nlm.nih.gov/) to identify and understand the positional relationship of the repetitive tracts. ChloroMitoSSRDB is connected to a user-friendly web interface that provides useful information associated with the location of the repeats (coding and non-coding), size of repeat, motif and length polymorphism, etc. ChloroMitoSSRDB will serve as a repository for developing functional markers for molecular phylogenetics, estimating molecular variation across species. Database URL: ChloroMitoSSRDB can be accessed as an open source repository at www.mcr.org.in/chloromitossrdb.Entities:
Mesh:
Year: 2013 PMID: 23284085 PMCID: PMC3628443 DOI: 10.1093/dnares/dss038
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1.Schematic illustration showing the flow of the organization of the data in ChloroMitoSSRDB.
Structure of the table ‘chloromitometa’ that stores the meta-information of all the mitochondrial and chloroplast genomes
| Information | Field | Data type | Key | Example |
|---|---|---|---|---|
| Accession number | acc_no | int(11) | 5881414, 110189662 | |
| Sequence ID | seq_id | varchar(11) | PRI | NC_000834, AC_000022 |
| Sequence name | seq_name | varchar(500) | ||
| Sequence type | seq_type | varchar(50) | Complete genome, complete sequence | |
| Sequence length | seq_length | int(11) | 16 613 bp, 7686 bp | |
| Nucleotide composition of A | a_per | Float | 33.06% | |
| Nucleotide composition of T | t_per | Float | 41.87% | |
| Nucleotide composition of G | g_per | Float | 13.58% | |
| Nucleotide composition of C | c_per | Float | 11.49% | |
| Organelle type | organelle | Char(1) | M (for Mitochondrion), C (Chloroplast) | |
| Taxon ID | taxon | Int | 263 995 |
Structure of the tables ‘chloromitoperfectmicrosatellite’ and ‘chloromitoimperfectmicrosatellite’ that store the repeat information of all perfect and imperfect microsatellites of mitochondrial and chloroplast genomes
| Information | Field | Data type | Key | Example |
|---|---|---|---|---|
| Sequence ID | index_no | varchar(11) | PRI | NC_000834, AC_000022 |
| Starting co-ordinate of SSR | start | int(11) | PRI | 172, 12843 |
| Ending co-ordinate of SSR | end | int(11) | PRI | 182, 12885 |
| motif (repeating unit) | motif | varchar(10) | AT, G, CAAC | |
| Number of repetitions | iterations | int(5) | 3, 7 | |
| Length of repeat tract | tract_length | int(11) | 12 bp, 18 bp | |
| Nucleotide composition of A | a_per | Float | 50.00% | |
| Nucleotide composition of T | t_per | Float | 0.00% | |
| Nucleotide composition of G | g_per | Float | 33.33% | |
| Nucleotide composition of C | c_per | Float | 16.67% | |
| Repeat position information | coding_info | varchar(50) | Coding (if repeat is in the coding region) or Null (if outside) | |
| Protein ID (if repeat in coding region) | protein_id | int(11) | 110189664 (if repeat is in the coding region) or 0 (if non-coding) | |
| aImperfection percentage of the tract | imperfection | Float | 9%, 0% | |
| aAlignment line 1 | alignment_line1 | Text | TTAA-TAATTAA | |
| aAlignment line 2 | alignment_line2 | Text | **** ******* | |
| aAlignment line 3 | alignment_line3 | Text | TTAATTAATTAA |
aThe last four columns (imperfection, alignment_line1, alignment_line2 and alignment_line3) are present only in the table that stores imperfect microsatellites (chloromitoimperfectmicrosatellite).
Figure 2.How to browse: schematic browsing of ChloroMitoSSRDB. This figure appears in colour in the online version of DNA Research.