| Literature DB >> 23139588 |
Changsong Zou1, Cairui Lu, Youping Zhang, Guoli Song.
Abstract
Simple sequence repeats (SSRs) can be derived from the complete genome sequence. These markers are important for gene mapping as well as marker-assisted selection (MAS). To develop SSRs for cotton gene mapping, we selected the complete genome sequence of Gossypium raimondii, which consisted of 4447 non-redundant scaffolds. Out of 775.2 Mb sequence examined, a total of 136,345 microsatellites were identified with a density of 5.69 kb per SSR in the G. raimondii genome leading to development of 112,177 primer pairs. The distributions of SSRs in the genome were non-random. Among the different motifs ranging from 1 to 6 bp, penta-nucleotide repeats were most abundant (30.5%), followed by tetra-nucleotide repeats (18.2%) and di-nucleotide repeats (16.9%). Among all identified 457 motif types, the most frequently occurring repeat motifs were poly-AT/TA, which accounted for 79.8% of the total di-nt SSRs, followed by AAAT/TTTA with 51.5% of the total tetra-nucleotede. Further, 18,834 microsatellites were detected from the protein-coding genes, and the frequency of gene containing SSRs was 46.0% in 40,976 genes of G. raimondii. These genome-based SSRs developed in the present study will lay the groundwork for developing large numbers of SSR markers for genetic mapping, gene discovery, genetic diversity analysis, and MAS breeding in cotton.Entities:
Keywords: Gossypium raimondii; Simple Sequence Repeats (SSRs); distribution; molecular marker
Year: 2012 PMID: 23139588 PMCID: PMC3488841 DOI: 10.6026/97320630008801
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Frequency distribution of different repeat types identified in the G. raimondii genome
Figure 2Genome-wide distribution and relative abundance of SSR types by their unit size. Each bar represents the relative abundance of the SSR types in different genome locations. Relative abundance = number of SSR type/region size in mega bases (Mb)