| Literature DB >> 20973953 |
Philippe Castagnone-Sereno1, Etienne G J Danchin, Emeline Deleury, Thomas Guillemaud, Thibaut Malausa, Pierre Abad.
Abstract
BACKGROUND: Microsatellites are the most popular source of molecular markers for studying population genetic variation in eukaryotes. However, few data are currently available about their genomic distribution and abundance across the phylum Nematoda. The recent completion of the genomes of several nematode species, including Meloidogyne incognita, a major agricultural pest worldwide, now opens the way for a comparative survey and analysis of microsatellites in these organisms.Entities:
Mesh:
Year: 2010 PMID: 20973953 PMCID: PMC3091743 DOI: 10.1186/1471-2164-11-598
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Global coverage and density of the microsatellite loci identified in five nematode genomes
| Sequence analyzed (bp) | 86,061,872 | 53,017,507 | 100,269,510 | 169,747,139 | 95,814,443 |
| GC content (in %) | 31.4 | 27.4 | 35.4 | 42.5 | 30.5 |
| Number of microsatellite loci | 4,880 | 2,842 | 11,382 | 20,694 | 61,547 |
| Average density of loci (no./Mbp) | 56.7 | 53.6 | 113.5 | 121.9 | 642.4 |
| Total length of microsatellites (bp) | 80,756 | 45,649 | 206,492 | 390,067 | 1,151,454 |
| Coverage (length in bp/Mbp) | 938.3 | 861.4 | 2,059.4 | 2,297.9 | 12,017.5 |
| Genome content (in %) | 0.09 | 0.09 | 0.21 | 0.23 | 1.20 |
Relative density and coverage of 1-6 bp microsatellites in five nematode genomes
| Motif length (bp) | Nematode species | |||||
|---|---|---|---|---|---|---|
| 1 | No. | 2,635 | 1,385 | 6,479 | 10,933 | 46,466 |
| No./Mbp | 30.6 | 26.1 | 64.6 | 64.4 | 485.0 | |
| %a | 54.0 | 48.7 | 56.9 | 52.8 | 75.6 | |
| Length (bp) | 39,399 | 19,331 | 89,913 | 184,836 | 738,987 | |
| Bp/Mbp | 457.8 | 364.6 | 896.6 | 1088.9 | 7712.7 | |
| 2 | No. | 196 | 99 | 1,982 | 3,971 | 7,058 |
| No./Mbp | 2.3 | 1.9 | 19.8 | 23.4 | 73.7 | |
| % | 4.0 | 3.5 | 17.4 | 19.2 | 11.5 | |
| Length (bp) | 3,906 | 2,092 | 52,382 | 87,108 | 196,902 | |
| Bp/Mbp | 45.4 | 39.5 | 522.3 | 513.2 | 2055.0 | |
| 3 | No. | 1,796 | 1,081 | 2,338 | 4,823 | 6,029 |
| No./Mbp | 20.9 | 20.4 | 23.3 | 28.4 | 62.9 | |
| % | 36.8 | 38.0 | 20.5 | 23.3 | 9.8 | |
| Length (bp) | 28,629 | 16,902 | 39,291 | 88,401 | 137,445 | |
| Bp/Mbp | 328.5 | 318.8 | 391.8 | 520.8 | 1434.5 | |
| 4 | No. | 213 | 265 | 246 | 709 | 1,661 |
| No./Mbp | 2.5 | 5.0 | 2.5 | 4.2 | 17.3 | |
| % | 4.4 | 9.3 | 2.2 | 3.4 | 2.7 | |
| Length (bp) | 7,580 | 6,916 | 6,628 | 18,112 | 63,024 | |
| Bp/Mbp | 88.1 | 130.4 | 66.1 | 106.7 | 657.8 | |
| 5 | No. | 34 | 10 | 38 | 163 | 224 |
| No./Mbp | 0.4 | 0.2 | 0.4 | 1.0 | 2.3 | |
| % | 0.7 | 0.4 | 0.3 | 0.8 | 0.4 | |
| Length (bp) | 870 | 270 | 1,040 | 5,010 | 7,440 | |
| Bp/Mbp | 10.1 | 5.1 | 10.4 | 29.5 | 77.7 | |
| 6 | No. | 6 | 2 | 300 | 95 | 109 |
| No./Mbp | 0.1 | 0.1 | 3.0 | 0.6 | 1.1 | |
| % | 0.1 | 0.1 | 2.6 | 0.5 | 0.2 | |
| Length (bp) | 192 | 138 | 17,256 | 6,600 | 7,656 | |
| Bp/Mbp | 2.2 | 2.6 | 172.1 | 38.9 | 79.9 | |
a% = no./total no. of microsatellites.
Relative frequencya of the most frequent microsatellite motifs found in nematode genomes
| Nematode species | Length of microsatellite motif (bp) | |||||
|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | |
| C (79.5) | AG (49.5) | AAC (52.2) | AAAT (54.0) | AAACG (70.6) | AAAAAT (33.3) | |
| A (20.5) | AT (27.0) | AAT (19.4) | AATT (24.9) | AAATT (8.8) | AACGGT (33.3) | |
| -- | AC (23.5) | AAG (11.7) | AGGT (6.6) | AAGGG (5.9) | AAATTT (16.6) | |
| -- | -- | AGG (7.5) | AGGG (6.1) | AAAAG (2.9) | ACGGGG (16.6) | |
| Total no./Mbb | 30.6 | 2.3 | 20.9 | 2.5 | 0.4 | 0.1 |
| A (75.0) | AT (66.7) | AAT (38.2) | AAAT (66.8) | AAAAT (50.0) | AAAATT (50.0) | |
| C (25.0) | AG (27.2) | AAC (37.5) | AATT (27.5) | CTTTT (20.0) | AAAGGG (50.0) | |
| -- | AC (6.1) | AAG (12.1) | AACT (2.3) | AAATT (2.0) | -- | |
| -- | -- | AGG (4.5) | AAAG (1.1) | AAACG (10.0) | -- | |
| Total no./Mb | 26.1 | 1.9 | 20.4 | 5.0 | 0.2 | 0.1 |
| A (72.8) | AG (36.7) | AAG (29.8) | ACCT (32.5) | AATTT (52.6) | AACCGT (57.0) | |
| C (27.2) | AT (32.6) | AAT (19.1) | AAAT (29.3) | AAACT (23.7) | AACGGT (8.3) | |
| -- | AC (30.6) | ACT (17.0) | ACGG (12.2) | AACGT (7.9) | AAGTTT (6.7) | |
| -- | CG (0.1) | ACG (12.4) | AAGT (9.3) | AAAAT (7.9) | ACCGGT (6.3) | |
| Total no./Mb | 64.6 | 19.8 | 23.3 | 2.5 | 0.4 | 3.0 |
| C (80.6) | AG (77.9) | AAT (26.7) | AAGT (29.1) | AACTT (52.8) | AACCGT (21.0) | |
| A (19.4) | AT (16.9) | AGG (20.5) | AATT (23.1) | AAAGT (11.7) | AAAAGT (18.9) | |
| -- | AC (5.0) | AAC (17.9) | AACT (16.5) | AAACT (9.8) | AAACTT (10.5) | |
| -- | CG (0.2) | AAG (17.4) | AAAT (9.9) | AAGGG (6.1) | AAGGGG (9.5) | |
| Total no./Mb | 64.4 | 23.4 | 28.4 | 4.2 | 1.0 | 0.6 |
| A (99.6) | AT (44.7) | AAT (60.9) | AAGT (35.6) | AAAGT (36.6) | AACCGT (49.5) | |
| C (0.4) | AG (32.4) | ACT (17.2) | AACT (16.4) | AAGGT (13.4) | AACCCT (6.4) | |
| -- | AC (22.8) | AAC (10.8) | AAAT (13.0) | AACTT (13.4) | AAAGGG (6.4) | |
| -- | CG (0.1) | ACG (4.6) | AATT (9.8) | AAAAC (4.0) | ACGTTT (5.5) | |
| Total no./Mb | 485.0 | 73.7 | 62.9 | 17.3 | 2.3 | 1.1 |
aMost frequent motifs (maximum of 4) listed for each nematode species in each nucleotide class (frequency given as a percentage of the total number of microsatellites with a motif of the same length).
bGlobal density of microsatellites for each nucleotide class, including unlisted motifs (if any).
Figure 1Frequency of microsatellite classes in the genome of . Among the 58 classes identified, the 20 most frequent are shown in individual divisions. The remaining 38 microsatellites are considered in one single division defined as others. For the partial standardization method used to define microsatellite classes, see Methods.
The di- to hexanucleotide microsatellites found in the genome of Meloidogyne incognita
| Motif length | Motif | Iteration | Total no. (%) | Longest locus | |||
|---|---|---|---|---|---|---|---|
| ≤10a | 11-20 | 21-30 | >30 | ||||
| 2 | AC | 37 | 7 | 1 | 1 | 46 (2.05) | (AC) |
| AG | 82 | 14 | 1 | - | 97 (4.32) | 2× (AG) | |
| AT | 44 | 7 | 1 | 1 | 53 (2.36) | (AT) | |
| 3 | AAC | 934 | 1 | - | - | 935 (41.65) | (AAC) |
| AAG | 211 | - | - | - | 211 (9.40) | 2× (AAG) | |
| AAT | 346 | 3 | - | - | 349 (15.55) | (AAT) | |
| ACC | 83 | - | - | - | 83 (3.70) | 3x (ACC) | |
| ACT | 67 | - | - | - | 67 (2.98) | 2× (ACT) | |
| AGG | 134 | 1 | - | - | 135 (6.01) | (AGG) | |
| ACG | 14 | - | - | - | 14 (0.62) | 14x (ACG) | |
| CGG | 2 | - | - | - | 2 (0.09) | 2× (CGG) | |
| 4 | AAAC | 6 | - | - | - | 6 (0.27) | (AAAC) |
| AAAT | 110 | 3 | 1 | 1 | 115 (5.12) | (AAAT) | |
| AAAG | 1 | - | - | - | 1 (0.04) | (AAAG) | |
| AATT | 53 | - | - | - | 53 (2.36) | (AATT) | |
| AAGG | 3 | - | - | - | 3 (0.13) | (AAGG) | |
| AGGG | 13 | - | - | - | 13 (0.58) | 2× (AGGG) | |
| AGGT | 4 | 1 | - | 9 | 14 (0.62) | (AGGT) | |
| AGTT | 1 | - | - | - | 1 (0.04) | (AGTT) | |
| ACGT | 1 | - | - | - | 1 (0.04) | (ACGT) | |
| AACT | 1 | - | - | - | 1 (0.04) | (AACT) | |
| AACG | 3 | - | - | - | 3 (0.13) | 3x (AACG) | |
| GGGT | 1 | - | - | - | 1 (0.04) | (GGGT) | |
| ACTT | 1 | - | - | - | 1 (0.04) | (ACTT) | |
| 5 | AAATT | 3 | - | - | - | 3 (0.13) | (AAATT) |
| AAACG | 24 | - | - | - | 24 (1.07) | (AAACG) | |
| AAAAG | 1 | - | - | - | 1 (0.04) | (AAAAG) | |
| CCTTT | 1 | - | - | - | 1 (0.04) | (CCTTT) | |
| AAGGG | 2 | - | - | - | 2 (0.09) | (AAGGG) | |
| AGGGG | 1 | - | - | - | 1 (0.04) | (AGGGG) | |
| AGTTT | 1 | - | - | - | 1 (0.04) | (AGTTT) | |
| CGGTT | 1 | - | - | - | 1 (0.04) | (CGGTT) | |
| 6 | AAAAAT | 2 | - | - | - | 2 (0.09) | 2× (AAAAAT) |
| AAATTT | 1 | - | - | - | 1 (0.04) | (AAATTT) | |
| ACGGGG | 1 | - | - | - | 1 (0.04) | (ACGGGG) | |
| AACGGT | 2 | - | - | - | 2 (0.09) | 2× (AACGGT) | |
| Total | 2192 | 37 | 4 | 12 | 2245 | ||
aFor dinucleotides, n ≥ 8; for tri- to hexanucleotides, n ≥ 5.
Figure 2Microsatellites detected in the coding sequences of . (A) Density of microsatellites (expressed as the number of loci/Mb coding sequence). (B) Frequency of the amino acid repeats encoded by the trinucleotide microsatellites.
Figure 3Gene Ontology (GO) assignment of the . In each of the three GO categories, the percentages of annotated sequences do not add up to 100% because some predicted proteins have more than one GO category assigned to them.