| Literature DB >> 20932319 |
Richard W Hyman1, Hui Jiang, Marilyn Fukushima, Ronald W Davis.
Abstract
BACKGROUND: Relatively recently, the software KB™ Basecaller has replaced phred for identifying the bases from raw sequence data in DNA sequencing employing dideoxy chemistry. We have measured quantitatively the consequences of that change.Entities:
Year: 2010 PMID: 20932319 PMCID: PMC3020662 DOI: 10.1186/1756-0500-3-257
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Figure 1High quality read length: . The red solid curve and blue dashed curve give the empirical distribution function of sequence read length produced by phred and the KB™ Basecaller, respectively. Sequence reads with fewer than 100 contiguous high quality bases have been discarded as failed reads.
Mean high quality read length: phred vs. the KB™ Basecaller.
| KB™ Basecaller | ||||
|---|---|---|---|---|
| > 100 | > 500 | > 100 | > 500 | |
| mean | 763 | 804 | 731 | 756 |
| SD* | 149 | 74 | 116 | 71 |
| No. of reads** | 9586 | 8717 | 9572 | 8911 |
*SD, standard deviation. **No., number.
Project 95: Microbes by genus.
| Closest named bacterium | KB™ Basecaller | |||
|---|---|---|---|---|
| Number of reads | % of total reads | Number of reads | % of total reads | |
| 1781 | 98.2 | 1584 | 87.9 | |
| 4 | 0.2 | 4 | 0.2 | |
| 2 | 0.1 | 2 | 0.1 | |
Project 95: Lactobacillus by species.
| Closest named bacterium | KB™ Basecaller | |||
|---|---|---|---|---|
| Number of reads | % of total reads | Number of reads | % of total reads | |
| 693 | 38.2 | 665 | 36.9 | |
| 606 | 33.4 | 448 | 24.8 | |
| 453 | 25.0 | 455 | 25.2 | |
| 26 | 1.4 | 12 | 0.7 | |
| 3 | 0.2 | 4 | 0.2 | |
Project 95: Novel bacteria.
| Closest named bacterium | KB™ Basecaller | |||
|---|---|---|---|---|
| Number of reads | % match | Number of reads | % match | |
| 19 | 78-97 | 74 | 84-97 | |
| uncultured | 7 | N/A | 139 | N/A |
| uncultured | 0 | N/A | 13 | 89-95 |