| Literature DB >> 21169377 |
Shao-Ke Lou1, Bing Ni, Leung-Yau Lo, Stephen Kwok-Wing Tsui, Ting-Fung Chan, Kwong-Sak Leung.
Abstract
UNLABELLED: Sequencing reads generated by RNA-sequencing (RNA-seq) must first be mapped back to the genome through alignment before they can be further analyzed. Current fast and memory-saving short-read mappers could give us a quick view of the transcriptome. However, they are neither designed for reads that span across splice junctions nor for repetitive reads, which can be mapped to multiple locations in the genome (multi-reads). Here, we describe a new software package: ABMapper, which is specifically designed for exploring all putative locations of reads that are mapped to splice junctions or repetitive in nature.Entities:
Mesh:
Year: 2010 PMID: 21169377 PMCID: PMC3031031 DOI: 10.1093/bioinformatics/btq656
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Workflow of ABMapper. (1) ‘A’ and ‘B’ refer to the two seeds from each end of a read. (2) Hits that are found within the genome (black line) matching the two seeds are represented by bars with the corresponding labels (A: a1, a2, a3; similarly for B). Finally, seeds are extended during (3a) exonic alignment; and (3b) spliced alignment.
Comparison between TopHat (TH), SpliceMap (SM) and ABMapper (ABM)
| Total read = 427 786 | sp.fa | sp_e1.fa | sp_e2.fa | ||||||
|---|---|---|---|---|---|---|---|---|---|
| TH | SM | ABM | TH | SM | ABM | TH | SM | ABM | |
| Found | 366 918 | 354 502 | 367 249 | 320 949 | 372 274 | 277 369 | |||
| Perfect hit | 326 510 | 334 241 | 326 422 | 279 146 | 326 033 | 222 907 | |||
| Accuracy (%) | 88.99 | 94.28 | 88.88 | 86.98 | 87.58 | 80.36 | |||
| Recall (%) | 76.33 | 78.13 | 76.30 | 65.25 | 76.21 | 52.11 | |||
sp.fa is the benchmark dataset, with sp_e1.fa and sp_e2.fa having 1 and 2 random errors, respectively. We defined accuracy as %(Perfect hit/Found) and recall as %(Perfect hit/Total).