| Literature DB >> 22640415 |
Emese Meglécz1, Nicolas Pech, André Gilles, Jean-François Martin, Michael G Gardner.
Abstract
BACKGROUND: Next generation sequencing (NGS) provides a valuable method to quickly obtain sequence information from non-model organisms at a genomic scale. In principle, if sequencing is not targeted for a genomic region or sequence type (e.g. coding region, microsatellites) NGS reads can be used as a genome snapshot and provide information on the different types of sequences in the genome. However, no study has ascertained if a typical 454 dataset of low coverage (1/4-1/8 of a PicoTiter plate leading to generally less than 0.1x of coverage) represents all parts of genomes equally.Entities:
Mesh:
Year: 2012 PMID: 22640415 PMCID: PMC3444912 DOI: 10.1186/1756-0500-5-259
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Results of correlation and Chi-square tests between 454 and genome variables
| Number of dimers | 0.988 | NA | 2.2e-16 | 9126 | 9 | 2.2e-16 |
| Number of trimers | 0.996 | NA | 2.2e-16 | 19275 | 31 | 2.2e-16 |
| Number of dimers without AA/TT | 0.999 | 7 | 1.8e-10 | 3567 | 8 | 2.2e-16 |
| Number of trimers without AAA/TTT | 0.996 | NA | 2.2e-16 | 10231 | 30 | 2.2e-16 |
| Number of 454 reads per fragment | NA | NA | NA | 635.9 | 225 | 2.2e-16 |
| Number of 454 reads per chromosome | 0.991 | NA | 3.2e-06 | 24.2 | 15 | 0.0624 |
| Number of different microsatellite types | 0.972 | 21 | 9.3e-15 | 187.5 | NA | 0.0005 |
| Number of microsatellites with di-trinucleotide motifs | 0.969 | NA | 2.2e-16 | 77.9 | 13 | 2.713e-11 |
| Number of different microsatellite types | 1 | NA | 0.016 | 8.7 | 4 | 0.0685 |
*NA in df correspond to permutation tests.
Figure 1Correlation between the number different dimers and trimers in 454 and in genome data.
Figure 2Corrected number of 454 reads mapped to each one Mb chromosome fragment. The number of 454 hits mapped to each Mb fragment of the genome (including Ns) is corrected for 1 Mb of informative bases (not Ns). Only the first five chromosomes are coloured.
Figure 3Contribution of each chromosome fragment to the deviation between observed and expected number of mapped 454 reads. Chi-square values of each element with the negative sign for under- and positive for over-representation. Successive chromosomes are marked by different colours.
Figure 4Correlation between the number of hits mapped to each chromosome against the length of the chromosomes. Only informative bases were counted for each chromosome length.
Figure 5Number of different microsatellite types in 454 and genome sequences. Microsatellite count in the genome (continuous lines) and 454 data (dashed lines).