| Literature DB >> 24936509 |
Abstract
Entities:
Year: 2014 PMID: 24936509 PMCID: PMC4049360 DOI: 10.1002/mgg3.78
Source DB: PubMed Journal: Mol Genet Genomic Med ISSN: 2324-9269 Impact factor: 2.183
Figure 1Our genomic studies at the Inova Translational Medicine Institute (the amount of storage is given in the center of the figure) focus on cloud-based storage and analysis – as well as on-premise capabilities – that integrate DNA-based data with other comprehensive phenotypic and other biologic information. We have developed a robust information technology (IT) infrastructure to enhance data storage, movement, and analysis relevant to a diverse group of end users. These types of IT considerations are increasingly important in current large-scale genomic studies, the requirements of which are shifting data handling techniques away from individually maintained datasets that can be combined on an ad hoc basis. miRNA, microRNA; PB, petabytes; RNAseq, RNA sequencing (whole transcriptome).
Figure 2Complementary to the above figure, this figure demonstrates the combination of multiple data types arising from specific different studies. The conditions shown (as well as the approximate proportional size of each study) refer to Inova Translational Medicine Institute research initiatives, with the numbers beneath the largest studies (preterm birth and the longitudinal study) referring to the approximate number of trios. Increased statistical power and more comprehensive reference genomes can be constructed through multiple datasets, especially when detailed and standardized phenotypic annotations are available. EHR, electronic health record; miRNA, microRNA; RNAseq, RNA sequencing (whole transcriptome); WGS, whole-genome sequencing.