| Literature DB >> 27540267 |
Rolf Hilker1, Kai Bernd Stadermann2, Oliver Schwengers1, Evgeny Anisiforov1, Sebastian Jaenicke1, Bernd Weisshaar2, Tobias Zimmermann1, Alexander Goesmann1.
Abstract
MOTIVATION: The vast amount of already available and currently generated read mapping data requires comprehensive visualization, and should benefit from bioinformatics tools offering a wide spectrum of analysis functionality from just one source. Appropriate handling of multiple mapped reads during mapping analyses remains an issue that demands improvement.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27540267 PMCID: PMC5167064 DOI: 10.1093/bioinformatics/btw541
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Read mapping classification in ReadXplorer 2
| Read Mapping Classification | Properties |
|---|---|
| Single Perfect Match | Read has only one mapping without mismatches, may have additional Common Match mappings |
| Single Best Match | Read has one mapping with less deviations than all other mappings, may have additional Common Match mappings |
| Perfect Match | Read has multiple mappings without mismatches |
| Best Match | Read has multiple mappings with the same number of deviations from the reference |
| Common Match | Read has better mappings with less deviations to the reference |
Fig. 1.The five read mapping classifications in the coverage plot. Read counts per base of input sequence are shown separated by strand and colored by mapping classification. Reads aligning at the borders of the figure have a distinctive best mapping (Single Perfect and Single Best Match classes in rich green and yellow), while the light green (Perfect Match), light yellow (Best Match) and red (Common Match) mappings display a repetitive region
Fig. 2.The histogram comfortably visualizes the frequency of TPM values observed within the data set at log scale. The actual number of features belonging to one bar is shown in the tooltip. It can be switched to show RPKM values
Fig. 3.Visualization of genome rearrangement events using GASV (Sindi ). ReadXplorer 2 offers an effortless exploration of the data underlying the genome rearrangements detected by GASV (table at the bottom). The region of the deletion selected in the table is centered simultaneously in the reference, track and read pair viewer. In the example, the hypothetical protein PA0343 is deleted in P. aeruginosa strain B420 from the study by Hilker . No reads map to the corresponding region of P. aeruginosa PAO1 and many read pairs are observed in B420 with an enlarged distance of about 1200 bp instead of the expected 300 bp
Fig. 4.The difference of the read coverage between only uniquely mapped reads (middle row) and using the ‘Single Perfect Match’ and ‘Single Best Match’ mapping classes is shown for the genes AT5G46900 and AT4G12510 (upper row). In total numbers 144 and 78 more reads can be included in the analysis respectively when using the extended read mapping classification (Color version of this figure is available at Bioinformatics online.)