| Literature DB >> 11590101 |
Abstract
UNLABELLED: SAGE data are obtained by sequencing short DNA tags. Due to the mistakes in DNA sequencing, SAGE data contain errors. We propose a new approach to identify tags whose abundance is biased by sequencing errors. This approach is based on a concept of neighbourhood: abundant tags can contaminate tags whose sequence is very close. The application of our approach reveals that moderately abundant tags can be generated by sequencing errors uniquely. It also allows for detecting correct rare tags. AVAILABILITY: Software is available only to non-profit entities and for non-commercial purposes upon request.Entities:
Mesh:
Substances:
Year: 2001 PMID: 11590101 DOI: 10.1093/bioinformatics/17.9.840
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937