| Literature DB >> 19107202 |
Yufeng Shen1, Sumeet Sarin, Ye Liu, Oliver Hobert, Itsik Pe'er.
Abstract
BACKGROUND: Whole-genome sequencing represents a promising approach to pinpoint chemically induced mutations in genetic model organisms, thereby short-cutting time-consuming genetic mapping efforts. PRINCIPALEntities:
Mesh:
Year: 2008 PMID: 19107202 PMCID: PMC2603312 DOI: 10.1371/journal.pone.0004012
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sequencing and mapping statistics.
| Platform | SOLiD by Maq | GA by Maq | SOLiD by corona-lite |
| Read size | 25 bp | 35 bp | 25 bp |
| Total reads (million) | 256 | 125 | 256 |
| Mapped (million reads/Gb) | 146/3.65 | 84.5/2.96 | 109/2.73 |
| Good pairs (million)/percentage | 41.3/57% | 38.6/91% | 35.5/65% |
| Single end mapped (million)/percentage | 37.3/26% | 2.23/2.6% | 35.2/32.6% |
| Avg. depth-coverage | 33× | 28× | 27× |
| Avg. depth-coverage from good reads | 25× | 25× | NA |
| Mapped to different chromosomes (million)/percentage | 21.9/15% | 0.3/0.4% | NA |
Depth-coverage statistics.
| Depth-coverage | SOLiD | GA |
| > = 0 | 99.98% | 99.96% |
| > = 5 | 99.65% | 99.45% |
| > = 10 | 97.71% | 95.83% |
| > = 15 | 91.64% | 86.22% |
| > = 20 | 78.58% | 70.85% |
| > = 25 | 59.19% | 52.96% |
| > = 30 | 38.79% | 36.18% |
| > = 100 | 1.56% | 0.383% |
Figure 1Distribution of depth-coverage.
The distribution of depth-coverage of the entire genome is shown for both SOLiD and GA. Poisson and gamma distributions with comparable average mean values are imposed on the observed distribution. Only reads that are mapped with no more three mismatches and without inconsistent mate-pairs are counted in the depth-coverage calculation.
Variants.
| SOLiD | GA | Common variants (confirmed true/confirmed false/repeats) | |
| Raw | 5798 | 7385 | 1689 (NA/NA/559) |
| Filtered | 901 | 1094 | 685 (NA/NA/0) |
| ChrV 4 Mb region, raw | 180 | 180 | 42 (32 |
| ChrV 4 Mb region, filtered | 24 | 35 | 23 (23 |
Variants listed in Table 4.
Experimentally validated single nucleotide variants.
| Position on chromosome V | GA Variants called in Ref. | Found by SOLiD | Why not found by SOLiD? | Type of variant |
| 6302463 | YES | NO | Repeats | non-exonic |
| 6889636 | YES | NO | Low coverage | exonic, silent |
| 6889637 | YES | NO | Low coverage | exonic, amino-acid changing |
| 6956711 | YES | NO | Low coverage | exonic, amino-acid changing |
| 6956743 | YES | NO | Low coverage | exonic, amino-acid changing |
| 6956744 | YES | NO | Low coverage | exonic, amino-acid changing |
| 7245105 | YES | YES | non-exonic | |
| 7377580 | YES | YES | non-exonic | |
| 7403427 | YES | YES | exonic, silent | |
| 7430567 | YES | YES | exonic, amino-acid changing | |
| 7524635 | YES | YES | non-exonic | |
| 7546600 | YES | YES | exonic, amino-acid changing | |
| 7860248 | YES | NO | Repeats | non-exonic |
| 7953203 | NO | YES | non-exonic | |
| 8101405 | YES | NO | Mapping | non-exonic |
| 8571627 | YES | YES | exonic, amino-acid changing | |
| 8646873 | YES | YES | non-exonic | |
| 8657771 | YES | YES | non-exonic | |
| 8758179 | YES | YES | exonic, amino-acid changing | |
| 9059200 | YES | YES | non-exonic | |
| 9217870 | YES | YES | non-exonic | |
| 9218397 | YES | YES | non-exonic | |
| 9245971 | YES | YES | exonic, amino-acid changing | |
| 9376379 | YES | YES | exonic, amino-acid changing | |
| 9662867 | YES | NO | Not covered | non-exonic |
| 9663159 | YES | YES | non-exonic | |
| 9707449 | YES | YES | non-exonic | |
| 9846725 | YES | YES | exonic, amino-acid changing | |
| 9927293 | YES | YES | non-exonic | |
| 9928614 | YES | YES | exonic, silent | |
| 9986752 | YES | YES | exonic, silent | |
| 10234234 | YES | YES | non-exonic | |
| 10397711 | YES | YES | non-exonic |
The nucleotide change of the variants are shown in [1]. “Non-exonic” is either intergenic or intronic. The one new variant identified by SOLiD is a C to T substitution.
This is the variant that is responsible for the mutant phenotype of lsy-12 animals [1].
Indels.
| # of indels | SOLiD | GA | common |
| normal filtering | 420 | 1280 | 374 |
| liberal filtering | 782 | 1796 | 663 |
Figure 2The relationship between average depth-coverage and variant detection sensitivity.
The x-axis is the average depth-coverage. The y-axis is the theoretical fraction of genome where potential variants can be detected under the assumptions described above. The red dot marks 95% sensitivity at 13× coverage.
Comparing platforms.
| Feature | Preferred Platform |
| Reducing false positives | SOLiD |
| Reducing false negatives | GA |
| Raw accuracy | SOLiD |
| Mapping | GA |
| Ease of library preparation | GA |