| Literature DB >> 20939885 |
Jean-Francois Martin1, Nicolas Pech, Emese Meglécz, Stéphanie Ferreira, Caroline Costedoat, Vincent Dubut, Thibaut Malausa, André Gilles.
Abstract
BACKGROUND: Microsatellites are markers of choice in population genetics and genomics, as they provide useful insight into patterns and processes as diverse as genome evolutionary dynamics and demographic processes. The acquisition of microsatellites through multiplex-enriched libraries and 454 GS-FLX Titanium pyrosequencing is a promising new tool for the isolation of new markers in unknown genomes. This approach can also be used to evaluate the extent to which microsatellite-enriched libraries are representative of the genome from which they were isolated. In this study, we deciphered potential discrepancies in microsatellite content recovery for two reference genomes (Apis mellifera and Danio rerio), selected on the basis of their extreme heterogeneity in terms of the proportions and distributions of microsatellites on chromosomes.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20939885 PMCID: PMC3091709 DOI: 10.1186/1471-2164-11-560
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summarized data for sequences from multiplex-enriched libraries, for both A. mellifera and D. rerio
| Total no. of sequences | Length > 80 bp | Containing microsatellites | Total number of sequences obtained with primer | Unique sequences | Consensus sequences | |
|---|---|---|---|---|---|---|
| 14789 | 9813 | 6758 | 283 | 238 | 45 | |
| 12789 | 8783 | 7096 | 382 | 346 | 36 | |
| 15833 | 10733 | 9123 | 333 | 301 | 32 | |
| 22940 | 14706 | 11217 | 138 | 103 | 35 | |
| 22189 | 14706 | 10958 | 166 | 139 | 27 | |
| 27392 | 20152 | 16283 | 237 | 176 | 61 | |
| 66.4% | 68.9% | 4.2% | ||||
| 68.7% | 80.8% | 5.4% | ||||
| 67.8% | 85.0% | 3.7% | ||||
| 64.1% | 76.3% | 1.2% | ||||
| 66.3% | 74.5% | 1.5% | ||||
| 73.6% | 80.8% | 1.5% | ||||
Figure 1Comparison of 454 microsatellite sequence data and the reference genome for . For each table, the 95% confidence interval of the expected number of microsatellites based on the published RsaI-digested genome distribution is shown in red whereas the observed number (obtained by 454 pyrosequencing) is shown in black. For each one-way table, the 2- 5- and 8-probe analyses are shown.
Figure 2Comparison of 454 microsatellite sequence data (for the 8 probes) for . "Stat" refers to the overall χ2 statistics for the test of independence between the two variables defining each table. For each cell of the table, a red dot corresponds to an observed number significantly higher than expected under the hypothesis of independence between the two variables defining the table, a blue dot corresponds to an observed number significantly lower than expected under the hypothesis of independence between the two variables defining the table; no dot indicates a non-significant deviation. The size of the dot is inversely proportional to the P-value, with scaling such that comparisons of sizes between tables are meaningful.
Figure 3χ. The size of the dot is inversely proportional to the P-value, with scaling such that comparisons of sizes between tables are meaningful. Boxplots of the χ2 statistics for each of two, five and eight probes are indicated on the X-axis. A logarithmic scale is used on the Y-axis for the χ2 values. The red dotted line represents the significance limit for Apis mellifera (C) and Danio rerio (D). The distribution of the cell probabilities from the chromosome × chromosome region two-way table (figure 3) is plotted for Apis mellifera (E) and Danio rerio (F).
Figure 4χ. The X-axis shows the number of bootstrapped chromosomes (2 to S, S being the chromosome number of the species considered), Y-axis: 95% confidence interval (based on 1,000 replicates) of the χ2 statistic divided by degree of freedom. In green: Apis mellifera, in purple: Danio rerio.