| Literature DB >> 26353839 |
Karel Břinda1, Valentina Boeva2, Gregory Kucherov1.
Abstract
MOTIVATION: Read simulators combined with alignment evaluation tools provide the most straightforward way to evaluate and compare mappers. Simulation of reads is accompanied by information about their positions in the source genome. This information is then used to evaluate alignments produced by the mapper. Finally, reports containing statistics of successful read alignments are created.In default of standards for encoding read origins, every evaluation tool has to be made explicitly compatible with the simulator used to generate reads.Entities:
Mesh:
Year: 2015 PMID: 26353839 PMCID: PMC4681991 DOI: 10.1093/bioinformatics/btv524
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Examples of simulated reads (in our definition read tuples) and their corresponding Rnf names, which can be used as read names in the final Fastq file: a single-end read (r001); a paired-end read (r002); a mate-pair read (r003); a spliced RNA-seq read (r004); a chimeric read (r005); and a random contaminating read with unspecified coordinates (r006)
Fig. 2.Example of two graphs produced by LAVEnder as a part of comparison of mapper capabilities of contamination detection. 200.000 single-end reads were simulated from human and mouse genomes (100.000 from HG38, 100.000 from MM10) by DwgSim using MIShmash and mapped to HG38. All LAVEnder graphs have false discovery rate on x-axis and use mapping quality as the varying parameter for plotted curves. This experiment reveals that YARA copes with contamination better than Bowtie2, BWA-MEM and BWA-SW