Literature DB >> 25560631

Benchmarking database performance for genomic data.

Matloob Khushi1.   

Abstract

Genomic regions represent features such as gene annotations, transcription factor binding sites and epigenetic modifications. Performing various genomic operations such as identifying overlapping/non-overlapping regions or nearest gene annotations are common research needs. The data can be saved in a database system for easy management, however, there is no comprehensive database built-in algorithm at present to identify overlapping regions. Therefore I have developed a novel region-mapping (RegMap) SQL-based algorithm to perform genomic operations and have benchmarked the performance of different databases. Benchmarking identified that PostgreSQL extracts overlapping regions much faster than MySQL. Insertion and data uploads in PostgreSQL were also better, although general searching capability of both databases was almost equivalent. In addition, using the algorithm pair-wise, overlaps of >1000 datasets of transcription factor binding sites and histone marks, collected from previous publications, were reported and it was found that HNF4G significantly co-locates with cohesin subunit STAG1 (SA1).Inc.
© 2015 Wiley Periodicals, Inc.

Entities:  

Keywords:  DATABASE BENCHMARKING; EPIGENETIC MODIFICATIONS; MANAGING GENOMIC LOCATIONS DATA; REGMAP; TRANSCRIPTION FACTOR BINDING SITES

Mesh:

Year:  2015        PMID: 25560631     DOI: 10.1002/jcb.25049

Source DB:  PubMed          Journal:  J Cell Biochem        ISSN: 0730-2312            Impact factor:   4.429


  3 in total

1.  Evaluation of Functional Abilities in 0-6 Year Olds: an Analysis with the eEarlyCare Computer Application.

Authors:  María Consuelo Sáiz-Manzanares; Raúl Marticorena-Sánchez; Álvar Arnaiz-González
Journal:  Int J Environ Res Public Health       Date:  2020-05-09       Impact factor: 3.390

2.  Automated classification and characterization of the mitotic spindle following knockdown of a mitosis-related protein.

Authors:  Matloob Khushi; Imraan M Dean; Erdahl T Teber; Megan Chircop; Jonathan W Arthur; Neftali Flores-Rodriguez
Journal:  BMC Bioinformatics       Date:  2017-12-28       Impact factor: 3.169

3.  MatCol: a tool to measure fluorescence signal colocalisation in biological systems.

Authors:  Matloob Khushi; Christine E Napier; Christine M Smyth; Roger R Reddel; Jonathan W Arthur
Journal:  Sci Rep       Date:  2017-08-21       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.