Literature DB >> 24751874

Indexing a sequence for mapping reads with a single mismatch.

Maxime Crochemore1, Alessio Langiu, M Sohel Rahman.   

Abstract

Mapping reads against a genome sequence is an interesting and useful problem in computational molecular biology and bioinformatics. In this paper, we focus on the problem of indexing a sequence for mapping reads with a single mismatch. We first focus on a simpler problem where the length of the pattern is given beforehand during the data structure construction. This version of the problem is interesting in its own right in the context of the next generation sequencing. In the sequel, we show how to solve the more general problem. In both cases, our algorithm can construct an efficient data structure in O(n log(1+ε) n) time and space and can answer subsequent queries in O(m log log n + K) time. Here, n is the length of the sequence, m is the length of the read, 0<ε<1 and is the optimal output size.

Keywords:  algorithms; genome sequence; indexing; mapping reads; mismatch; pattern matching

Year:  2014        PMID: 24751874      PMCID: PMC3996579          DOI: 10.1098/rsta.2013.0167

Source DB:  PubMed          Journal:  Philos Trans A Math Phys Eng Sci        ISSN: 1364-503X            Impact factor:   4.226


  3 in total

1.  Presenilin-1 mutation L271V results in altered exon 8 splicing and Alzheimer's disease with non-cored plaques and no neuritic dystrophy.

Authors:  John B J Kwok; Glenda M Halliday; William S Brooks; Georgia Dolios; Hanna Laudon; Ohoshi Murayama; Marianne Hallupp; Renee F Badenhop; James Vickers; Rong Wang; Jan Naslund; Akihiko Takashima; Samuel E Gandy; Peter R Schofield
Journal:  J Biol Chem       Date:  2002-12-19       Impact factor: 5.157

2.  Disruption of an SF2/ASF-dependent exonic splicing enhancer in SMN2 causes spinal muscular atrophy in the absence of SMN1.

Authors:  Luca Cartegni; Adrian R Krainer
Journal:  Nat Genet       Date:  2002-03-04       Impact factor: 38.330

3.  Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and genome analyzer systems.

Authors:  André E Minoche; Juliane C Dohm; Heinz Himmelbauer
Journal:  Genome Biol       Date:  2011-11-08       Impact factor: 13.583

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.