
fjoin: simple and efficient computation of feature overlaps.

Joel E Richardson

Abstract

Sets of biological features with genome coordinates (e.g., genes and promoters) are a particularly common form of data in bioinformatics today. Accordingly, an increasingly important processing step involves comparing coordinates from large sets of features to find overlapping feature pairs. This paper presents fjoin, an efficient, robust, and simple algorithm for finding these pairs, and a downloadable implementation. For typical bioinformatics feature sets, fjoin requires O(n log(n)) time (O(n) if the inputs are sorted) and uses O(1) space. The reference implementation is a stand-alone Python program; it implements the basic algorithm and a number of useful extensions, which are also discussed in this paper.
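The abstract gives only the complexity bounds, so the following is a minimal sketch of one way a sort-then-sweep overlap join can meet them. It is not the fjoin reference implementation. The function name `overlap_pairs`, the representation of features as `(start, end)` tuples with closed coordinates, and the restriction to a single chromosome are all assumptions made for illustration. Each input is sorted by start coordinate, the two streams are merged, and a small window of still-active features is kept per input. A feature is dropped from its window once it ends before the start of the feature currently being read, since no later feature can overlap it.

```python
def overlap_pairs(xs, ys):
    """Yield (x, y) for every x in xs and y in ys that overlap.

    Features are hypothetical (start, end) tuples with closed
    coordinates on one chromosome; this is an illustrative sketch,
    not the published fjoin code.
    """
    xs = sorted(xs)  # O(n log n); skip if the inputs are already sorted
    ys = sorted(ys)
    wx, wy = [], []  # windows of features that may still overlap
    i = j = 0
    while i < len(xs) or j < len(ys):
        # Read next from whichever input has the smaller start coordinate.
        take_x = j >= len(ys) or (i < len(xs) and xs[i][0] <= ys[j][0])
        if take_x:
            f = xs[i]
            i += 1
            # Drop y features that end before f starts; the rest overlap f.
            wy[:] = [g for g in wy if g[1] >= f[0]]
            for g in wy:
                yield (f, g)
            wx.append(f)
        else:
            f = ys[j]
            j += 1
            # Drop x features that end before f starts; the rest overlap f.
            wx[:] = [g for g in wx if g[1] >= f[0]]
            for g in wx:
                yield (g, f)
            wy.append(f)


genes = [(1, 5), (10, 20)]
promoters = [(4, 12), (30, 40)]
pairs = list(overlap_pairs(genes, promoters))
```

In this sketch the windows hold only features that extend past the current sweep position. When features are short relative to the genome, as with genes and promoters, the windows stay small, which is consistent with the constant-space behavior the abstract reports for typical feature sets. A few very long features would keep a window large, so the bound is not a worst-case guarantee.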


Year:  2006        PMID: 17061921     DOI: 10.1089/cmb.2006.13.1457

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  21 in total

1.  A lncRNA regulates alternative splicing via establishment of a splicing-specific chromatin signature.

Authors:  Inma Gonzalez; Roberto Munita; Eneritz Agirre; Travis A Dittmer; Katia Gysling; Tom Misteli; Reini F Luco
Journal:  Nat Struct Mol Biol       Date:  2015-04-06       Impact factor: 15.369

2.  Binary Interval Search: a scalable algorithm for counting interval intersections.

Authors:  Ryan M Layer; Kevin Skadron; Gabriel Robins; Ira M Hall; Aaron R Quinlan
Journal:  Bioinformatics       Date:  2012-11-04       Impact factor: 6.937

3.  Comparative genomics allows the discovery of cis-regulatory elements in mosquitoes.

Authors:  Douglas H Sieglaff; W Augustine Dunn; Xiaohui S Xie; Karyn Megy; Osvaldo Marinotti; Anthony A James
Journal:  Proc Natl Acad Sci U S A       Date:  2009-02-11       Impact factor: 11.205

4.  Role of H3K27 methylation in the regulation of lncRNA expression.

Authors:  Susan C Wu; Eric M Kallin; Yi Zhang
Journal:  Cell Res       Date:  2010-08-03       Impact factor: 25.617

5.  A parallel algorithm for N-way interval set intersection.

Authors:  Ryan M Layer; Aaron R Quinlan
Journal:  Proc IEEE Inst Electr Electron Eng       Date:  2017-03       Impact factor: 10.961

6.  High-resolution genotyping and mapping of recombination and gene conversion in the protozoan Theileria parva using whole genome sequencing.

Authors:  Sonal Henson; Richard P Bishop; Subhash Morzaria; Paul R Spooner; Roger Pelle; Lucy Poveda; Martin Ebeling; Erich Küng; Ulrich Certa; Claudia A Daubenberger; Weihong Qi
Journal:  BMC Genomics       Date:  2012-09-23       Impact factor: 3.969

7.  Direct cloning of double-stranded RNAs from RNase protection analysis reveals processing patterns of C/D box snoRNAs and provides evidence for widespread antisense transcript expression.

Authors:  Manli Shen; Eduardo Eyras; Jie Wu; Amit Khanna; Serene Josiah; Mathieu Rederstorff; Michael Q Zhang; Stefan Stamm
Journal:  Nucleic Acids Res       Date:  2011-08-31       Impact factor: 16.971

8.  Large homogeneous genome regions (isochores) in soybean [glycine max (L.) merr].

Authors:  J L Woody; W Beavis; R C Shoemaker
Journal:  Front Genet       Date:  2012-06-01       Impact factor: 4.599

9.  Augmented Interval List: a novel data structure for efficient genomic interval search.

Authors:  Jianglin Feng; Aakrosh Ratan; Nathan C Sheffield
Journal:  Bioinformatics       Date:  2019-12-01       Impact factor: 6.931

10.  The Mouse Genome Database genotypes::phenotypes.

Authors:  Judith A Blake; Carol J Bult; Janan T Eppig; James A Kadin; Joel E Richardson
Journal:  Nucleic Acids Res       Date:  2008-11-03       Impact factor: 16.971

