| Literature DB >> 22108025 |
Riaz Ahmad1, Dan E Parfitt, Joseph Fass, Ebenezer Ogundiwin, Amit Dhingra, Thomas M Gradziel, Dawei Lin, Nikhil A Joshi, Pedro J Martinez-Garcia, Carlos H Crisosto.
Abstract
BACKGROUND: The application of next generation sequencing technologies and bioinformatic scripts to identify high frequency SNPs distributed throughout the peach genome is described. Three peach genomes were sequenced using Roche 454 and Illumina/Solexa technologies to obtain long contigs for alignment to the draft 'Lovell' peach sequence as well as sufficient depth of coverage for 'in silico' SNP discovery. DESCRIPTION: The sequences were aligned to the 'Lovell' peach genome released April 01, 2010 by the International Peach Genome Initiative (IPGI). 'Dr. Davis', 'F8, 1-42' and 'Georgia Belle' were sequenced to add SNPs segregating in two breeding populations, Pop DF ('Dr. Davis' × 'F8, 1-42') and Pop DG ('Dr. Davis' × 'Georgia Belle'). Roche 454 sequencing produced 980,000 total reads with 236 Mb sequence for 'Dr. Davis' and 735,000 total reads with 172 Mb sequence for 'F8, 1-42'. 84 bp × 84 bp paired end Illumina/Solexa sequences yielded 25.5, 21.4, 25.5 million sequences for 'Dr. Davis', 'F8, 1-42' and 'Georgia Belle', respectively. BWA/SAMtools were used for alignment of raw reads and SNP detection, with custom PERL scripts for SNP filtering. Velvet's Columbus module was used for sequence assembly. Comparison of aligned and overlapping sequences from both Roche 454 and Illumina/Solexa resulted in the selection of 6654 high quality SNPs for 'Dr. Davis' vs. 'F8, 1-42' and 'Georgia Belle', distributed on eight major peach genome scaffolds as defined from the 'Lovell' assembly.Entities:
Mesh:
Year: 2011 PMID: 22108025 PMCID: PMC3253712 DOI: 10.1186/1471-2164-12-569
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Distributions of fragment sizes from Roche 454 sequencing, for DD and F8 combined data, and separate DD and F8 data.
Peach genome scaffold information with the number of SNP's detected on each scaffold.
| Scaffold ID | #SNPs w/quality > 100 | #SNPs w/quality > 200 | Scaffold length (nt) | Scaffold length (nt)/SNP > 100 | Scaffold length (nt)/SNP > 200 |
|---|---|---|---|---|---|
| scaffold 1 | 1267 | 716 | 46,877,626 | 36,999 | 65,472 |
| scaffold 2 | 1205 | 412 | 26,807,724 | 22,247 | 65,067 |
| scaffold 3 | 499 | 241 | 22,025,550 | 44,140 | 91,392 |
| scaffold 4 | 412 | 227 | 30,528,727 | 74,099 | 134,488 |
| scaffold 5 | 698 | 357 | 18,502,877 | 26,508 | 51,829 |
| scaffold 6 | 392 | 217 | 28,902,582 | 73,731 | 133,192 |
| scaffold 7 | 862 | 414 | 22,790,193 | 26,39 | 55,049 |
| scaffold 8 | 1266 | 459 | 21,829,753 | 17,43 | 47,559 |
| scaffold 9 | 9 | 4 | 2,126,789 | 236,310 | 531,697 |
| scaffold 10 | 0 | 0 | 851,981 | na | na |
| scaffold 11 | 1 | 8 | 736,058 | 736,058 | 92,007 |
| scaffold 12 | 27 | 8 | 675,284 | 25,011 | 84,411 |
| scaffold 13 | 0 | 0 | 670,721 | na | na |
| scaffold 14 | 1 | 0 | 575,512 | 575,512 | na |
| scaffold 15 | 14 | 8 | 516,056 | 3861 | 64,507 |
| scaffold 16 | 1 | 1 | 390,024 | 390,024 | 390,024 |
| scaffold 17 | 0 | 0 | 370,749 | na | na |
| scaffold 18 | 0 | 0 | 333,953 | na | na |
| scaffold 22 | 0 | 0 | 167,479 | na | na |
| scaffold 23 | 0 | 0 | 69,963 | na | na |
| Total | 6654 | 3065 | 225,749,601 | ave. = 40,176 | ave. = 80,506 |