| Literature DB >> 23586484 |
Shreepriya Das1, Haris Vikalo.
Abstract
BACKGROUND: Next-generation DNA sequencing platforms are capable of generating millions of reads in a matter of days at rapidly reducing costs. Despite its proliferation and technological improvements, the performance of next-generation sequencing remains adversely affected by the imperfections in the underlying biochemical and signal acquisition procedures. To this end, various techniques, including statistical methods, are used to improve read lengths and accuracy of these systems. Development of high performing base calling algorithms that are computationally efficient and scalable is an ongoing challenge.Entities:
Mesh:
Year: 2013 PMID: 23586484 PMCID: PMC3776450 DOI: 10.1186/1471-2105-14-129
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The 16 state trellis illustration of the transitions between states in theand ()stage of the trellis. The figure shows 8 out of the possible 16 states along with all valid transitions between them.
Comparison of error rates and speed for GAII
| FB | 0.0128 | 400mins |
| SOVA | 0.0129 | 300mins |
| OnlineCall | 0.0137 | 30mins |
| naiveBayesCall | 0.0139 | 1500mins |
| Ibis | 0.0147 | 480mins |
| Bustard | 0.0154 | 40mins |
| Rolexa | 0.0171 | 720mins |
A comparison of error rates and running times (per lane) for different base callers (note that Bustard’s running time is underestimated since it does not account for the parameter estimation step).
Figure 2(a,b,c) - Comparison of the different basecalling strategies under different performance metrics. The figure shows a) Base calling error rates as a function of tile, b) Base calling error rates as a function of cycle number, c) Discrimination ability.
Figure 3(a,b) - Comparison of the different basecalling strategies for HiSeq2000. The figure shows a) Base calling error rates as a function of cycle for the first pair, b) Base calling error rates as a function of cycle for the second pair.
Comparison of error rates for HiSeq
| FB | 0.0029 | 0.0029 |
| SOVA | 0.0029 | 0.0029 |
| Bustard | 0.0033 | 0.0032 |
A comparison of error rates for different base callers for HiSeq.
Comparison of error rates for supervised and unsupervised schemes
| FB (unsupervised) | 0.0125 |
| SOVA (unsupervised) | 0.0127 |
| FB (supervised) | 0.0124 |
| SOVA (supervised) | 0.0126 |
A comparison of error rates for supervised and unsupervised schemes on a single tile for GAII.