
CRISPR Detection From Short Reads Using Partial Overlap Graphs.

Ilan Ben-Bassat, Benny Chor.

Abstract

Clustered regularly interspaced short palindromic repeats (CRISPRs) are structured regions in bacterial and archaeal genomes that form part of an adaptive immune system against phages. CRISPRs are important for many microbial studies and play an essential role in current gene-editing techniques. As such, they attract substantial research interest. The exponential growth in the amount of bacterial sequence data in recent years enables the exploration of CRISPR loci in a growing number of species. Most automated tools that detect CRISPR loci rely on fully assembled genomes. However, many assemblers do not handle repetitive regions successfully. The first tool to work directly on raw sequence data is Crass, which requires reads long enough to contain two copies of the same repeat. We present a method to identify CRISPR repeats from raw sequence data of short reads. The algorithm is based on an observation that differentiates CRISPR repeats from other types of repeats, and it involves a series of partial constructions of the overlap graph. This lets us avoid many of the difficulties that assemblers face, since we aim only to identify the repeats that belong to CRISPR loci. A preliminary implementation of the algorithm shows good results and detects CRISPR repeats in cases where other existing tools fail to do so.

Keywords:  CRISPR detection; filtering; k-mer counting; partial overlap graph; sampling
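
The abstract and keywords name three ingredients: k-mer counting, filtering, and a partial overlap graph. The sketch below is only a toy illustration of how such pieces could fit together, not the authors' algorithm; the function names, the count threshold, and the minimum overlap length are all hypothetical choices made for this example.

```python
from collections import Counter
from itertools import permutations


def count_kmers(reads, k):
    """Count every length-k substring across all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def filter_reads(reads, counts, k, min_count):
    """Keep reads that contain at least one k-mer seen min_count times or more.

    Assumption for this sketch: frequent k-mers mark candidate repeat regions.
    """
    kept = []
    for read in reads:
        if any(counts[read[i:i + k]] >= min_count
               for i in range(len(read) - k + 1)):
            kept.append(read)
    return kept


def overlap_length(a, b, min_overlap):
    """Longest suffix of a that equals a prefix of b; 0 if below min_overlap."""
    for length in range(min(len(a), len(b)), min_overlap - 1, -1):
        if a.endswith(b[:length]):
            return length
    return 0


def partial_overlap_graph(reads, min_overlap):
    """Build overlap edges among the supplied (already filtered) reads only.

    The graph is "partial" in the sense that reads discarded by the filter
    never enter it; this is a quadratic-time toy, not a scalable construction.
    """
    edges = {}
    for a, b in permutations(range(len(reads)), 2):
        length = overlap_length(reads[a], reads[b], min_overlap)
        if length:
            edges[(a, b)] = length
    return edges


# Toy input: three reads share the made-up repeat "GATTACA", one does not.
reads = ["CCGATTACAT", "GATTACATGG", "TTGATTACAG", "CTCTAGGCAA"]
counts = count_kmers(reads, k=5)
candidates = filter_reads(reads, counts, k=5, min_count=3)
graph = partial_overlap_graph(candidates, min_overlap=5)
```

Restricting the graph to filtered reads keeps the pairwise overlap step small, which is the general motivation for building only part of the overlap graph; how the published method chooses k-mers, samples reads, and separates CRISPR repeats from other repeats is not specified in this abstract.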


Year:  2016        PMID: 27058690     DOI: 10.1089/cmb.2015.0226

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


