| Literature DB >> 20701770 |
Xiaolei Wu1, Chengwei Ren, Trupti Joshi, Tri Vuong, Dong Xu, Henry T Nguyen.
Abstract
BACKGROUND: With the advance of new massively parallel genotyping technologies, quantitative trait loci (QTL) fine mapping and map-based cloning become more achievable in identifying genes for important and complex traits. Development of high-density genetic markers in the QTL regions of specific mapping populations is essential for fine-mapping and map-based cloning of economically important genes. Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variation existing between any diverse genotypes that are usually used for QTL mapping studies. The massively parallel sequencing technologies (Roche GS/454, Illumina GA/Solexa, and ABI/SOLiD), have been widely applied to identify genome-wide sequence variations. However, it is still remains unclear whether sequence data at a low sequencing depth are enough to detect the variations existing in any QTL regions of interest in a crop genome, and how to prepare sequencing samples for a complex genome such as soybean. Therefore, with the aims of identifying SNP markers in a cost effective way for fine-mapping several QTL regions, and testing the validation rate of the putative SNPs predicted with Solexa short sequence reads at a low sequencing depth, we evaluated a pooled DNA fragment reduced representation library and SNP detection methods applied to short read sequences generated by Solexa high-throughput sequencing technology.Entities:
Mesh:
Year: 2010 PMID: 20701770 PMCID: PMC3091665 DOI: 10.1186/1471-2164-11-469
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of DNA in silico digestion of soybean genome assembly
| Enzyme | # total fragments | # selected fragments (70-200 bp) | total length | complexity reduction % | coverage for one run | sequence density per Mbp | putative SNP # | % selected portion | # repetitive in selected fragments | % selected fragments containing repetitive elements |
|---|---|---|---|---|---|---|---|---|---|---|
| 454,161 | 72,612 | 9,221,724 | 0.84 | 208.66 | 132.02 | 958.48 | 15.98 | 2332 | 0.51 | |
| 1,342,945 | 265,289 | 33,691,703 | 3.06 | 57.11 | 482.34 | 3,501.81 | 19.75 | 12,089 | 0.90 | |
| 1,731,973 | 334,265 | 42,451,655 | 3.86 | 45.33 | 607.75 | 4,412.30 | 19.29 | 26,105 | 1.51 | |
| 2,475,326 | 594,055 | 75,444,985 | 6.86 | 25.51 | 1,080.10 | 7,841.53 | 23.99 | 62,906 | 2.54 | |
| 2,929,186 | 841,548 | 106,876,596 | 9.72 | 18.00 | 1,530.09 | 11,108.43 | 28.72 | 66,175 | 2.26 | |
| 4,267,589 | 1,367,611 | 173,686,597 | 15.79 | 11.08 | 2,486.57 | 18,052.47 | 32.04 | 140,580 | 3.29 | |
| 8,002,903 | 2,470,067 | 313,698,509 | 28.52 | 6.13 | 4,491.03 | 32,604.88 | 30.86 | 189,866 | 2.37 |
Figure 1The proportion of fragments containing repetitive elements in the targeted fragments.
Figure 2DNA digested by different enzymes. H: Hae III; R: Rsa I; C: CviR I
Figure 3repetitive sequence proportion in the sequences with different read coverage.
Figure 4Distribution of putative SNPs on the 20 chromosomes of the reference genome. The x-axis represents SNP read coverage, and the y-axis represents chromosome coordinates of the reference genome.
Figure 5Visualization example of alignments of short read sequences to the soybean reference genome.