Literature DB >> 27810480

Evaluation of relational and NoSQL database architectures to manage genomic annotations.

Wade L Schulz1, Brent G Nelson2, Donn K Felker3, Thomas J S Durant4, Richard Torres4.   

Abstract

While the adoption of next generation sequencing has rapidly expanded, the informatics infrastructure used to manage the data generated by this technology has not kept pace. Historically, relational databases have provided much of the framework for data storage and retrieval. Newer technologies based on NoSQL architectures may provide significant advantages in storage and query efficiency, thereby reducing the cost of data management. But their relative advantage when applied to biomedical data sets, such as genetic data, has not been characterized. To this end, we compared the storage, indexing, and query efficiency of a common relational database (MySQL), a document-oriented NoSQL database (MongoDB), and a relational database with NoSQL support (PostgreSQL). When used to store genomic annotations from the dbSNP database, we found the NoSQL architectures to outperform traditional, relational models for speed of data storage, indexing, and query retrieval in nearly every operation. These findings strongly support the use of novel database technologies to improve the efficiency of data management within the biological sciences. Copyright Â
© 2016 Elsevier Inc. All rights reserved.

Keywords:  Genomics; MongoDB; MySQL; NoSQL; PostgreSQL; Relational database

Mesh:

Year:  2016        PMID: 27810480     DOI: 10.1016/j.jbi.2016.10.015

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  11 in total

1.  Facilitating Cohort Discovery by Enhancing Ontology Exploration, Query Management and Query Sharing for Large Clinical Data Repositories.

Authors:  Shiqiang Tao; Licong Cui; Xi Wu; Guo-Qiang Zhang
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

2.  Efficient population-scale variant analysis and prioritization with VAPr.

Authors:  Amanda Birmingham; Adam M Mark; Carlo Mazzaferro; Guorong Xu; Kathleen M Fisch
Journal:  Bioinformatics       Date:  2018-08-15       Impact factor: 6.937

3.  The San Diego Nathan Shock Center: tackling the heterogeneity of aging.

Authors:  Gerald S Shadel; Peter D Adams; W Travis Berggren; Jolene K Diedrich; Kenneth E Diffenderfer; Fred H Gage; Nasun Hah; Malene Hansen; Martin W Hetzer; Anthony J A Molina; Uri Manor; Kurt Marek; David D O'Keefe; Antonio F M Pinto; Alessandra Sacco; Tatyana O Sharpee; Maxim N Shokriev; Stefania Zambetti
Journal:  Geroscience       Date:  2021-08-09       Impact factor: 7.713

4.  The High-Throughput Analyses Era: Are We Ready for the Data Struggle?

Authors:  Valeria D'Argenio
Journal:  High Throughput       Date:  2018-03-02

5.  SNPnexus: a web server for functional annotation of human genome sequence variation (2020 update).

Authors:  Jorge Oscanoa; Lavanya Sivapalan; Emanuela Gadaleta; Abu Z Dayem Ullah; Nicholas R Lemoine; Claude Chelala
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

6.  A Personalized Healthcare Monitoring System for Diabetic Patients by Utilizing BLE-Based Sensors and Real-Time Data Processing.

Authors:  Ganjar Alfian; Muhammad Syafrudin; Muhammad Fazal Ijaz; M Alex Syaekhoni; Norma Latif Fitriyani; Jongtae Rhee
Journal:  Sensors (Basel)       Date:  2018-07-06       Impact factor: 3.576

7.  COVID-19 pandemic: Is it the right time to develop interconnected national biomedical registries?

Authors:  Athanasios S Kotoulas
Journal:  Genomics Inform       Date:  2021-12-31

8.  Benchmarking database systems for Genomic Selection implementation.

Authors:  Yaw Nti-Addae; Dave Matthews; Victor Jun Ulat; Raza Syed; Guilhem Sempéré; Adrien Pétel; Jon Renner; Pierre Larmande; Valentin Guignon; Elizabeth Jones; Kelly Robbins
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

9.  A negative storage model for precise but compact storage of genetic variation data.

Authors:  Guillermo Gonzalez-Calderon; Ruizheng Liu; Rodrigo Carvajal; Jamie K Teer
Journal:  Database (Oxford)       Date:  2020-01-01       Impact factor: 3.451

10.  Iron Hack - A symposium/hackathon focused on porphyrias, Friedreich's ataxia, and other rare iron-related diseases.

Authors:  Gloria C Ferreira; Jenna Oberstaller; Renée Fonseca; Thomas E Keller; Swamy Rakesh Adapa; Justin Gibbons; Chengqi Wang; Xiaoming Liu; Chang Li; Minh Pham; Guy W Dayhoff Ii; Ben Busby; Rays H Y Jiang; Linh M Duong; Luis Tañón Reyes; Luciano Enrique Laratelli; Douglas Franz; Segun Fatumo; Atm Golam Bari; Audrey Freischel; Lindsey Fiedler; Omkar Dokur; Krishna Sharma; Deborah Cragun
Journal:  F1000Res       Date:  2019-07-19
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.