| Literature DB >> 24479419 |
Ram Krishna Shrestha, Baruch Lubinsky, Vijay B Bansode, Mónica B J Moinz, Grace P McCormack, Simon A Travers1.
Abstract
BACKGROUND: Many high throughput sequencing (HTS) approaches, such as the Roche/454 platform, produce sequences in which the quality of the sequence (as measured by a Phred-like quality scores) decreases linearly across a sequence read. Undertaking quality trimming of this data is essential to enable confidence in the results of subsequent downstream analysis. Here, we have developed a novel, highly sensitive and accurate approach (QTrim) for the quality trimming of sequence reads generated using the Roche/454 sequencing platform (or any platform with long reads that outputs Phred-like quality scores).Entities:
Mesh:
Year: 2014 PMID: 24479419 PMCID: PMC3912918 DOI: 10.1186/1471-2105-15-33
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Flow diagram of the various steps involved in the QTrim algorithm.
Figure 2Box and whisker plots showing an overview of the high quality (A) and low quality (B) data used for comparisons in this study. The x-axis corresponds to the read base positions (sampled every 10 bp for clarity) while the y-axis corresponds to the quality score. The second y-axis (associated with the green line) corresponds to the read coverage at each of the positions.
Figure 3Comparison of QTrim with other quality trimming approaches on the basis of the total number of reads output versus the mean read length output. These comparisons were performed using good quality (A + C) and poor quality (B + D) sequence data at trimming thresholds of 20 and 30.
Figure 4Comparison of the trimming speeds for all of the methods represented as the number of bases trimmed per second. Both Geneious and Newbler are graphical software packages and, thus, trimming speeds were estimated based on running time of the entire graphical process as opposed to solely the trimming algorithm.