Literature DB >> 26745826

Resolving Conflicting Predictions from Multimapping Reads.

Stefan Canzar1, Khaled Elbassioni2, Mitchell Jones3, Julián Mestre3.   

Abstract

The first step in the analysis of data produced by ultra-high-throughput next-generation sequencing technology is to map short sequence "reads" to a reference genome, if available. Sequencing errors, repeat regions, and polymorphisms may lead a read to align to multiple locations in the genome reasonably well. While ignoring such multimapping reads, or some of their alignments, will reduce the sensitivity of almost any type of downstream analysis (e.g., detecting structural variants), erroneous mappings will typically yield false positive predictions. Here we propose a framework that aims to identify true predictions among a large set of candidate predictions by selecting for each read a unique mapping that collectively imply conflict-free predictions. We formulate this problem as the maximum facility location problem, for which we propose LP-rounding heuristics. We provide a theoretic guarantee on the quality of the solution and demonstrate the utility of our algorithm in resolving conflicting deletions implied by simulated reads mapping ambiguously to Craig Venter's genome model and Illumina sequencing reads of the well-studied NA12878 individual.

Entities:  

Keywords:  algorithms; combinatorial optimization

Mesh:

Year:  2016        PMID: 26745826     DOI: 10.1089/cmb.2015.0164

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  2 in total

1.  Jointly aligning a group of DNA reads improves accuracy of identifying large deletions.

Authors:  Anish M S Shrestha; Martin C Frith; Kiyoshi Asai; Hugues Richard
Journal:  Nucleic Acids Res       Date:  2018-02-16       Impact factor: 16.971

2.  Identification and Validation of Reference Genes in Clostridium beijerinckii NRRL B-598 for RT-qPCR Using RNA-Seq Data.

Authors:  Katerina Jureckova; Hana Raschmanova; Jan Kolek; Maryna Vasylkivska; Barbora Branska; Petra Patakova; Ivo Provaznik; Karel Sedlar
Journal:  Front Microbiol       Date:  2021-03-18       Impact factor: 5.640

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.