Literature DB >> 15469411

Biomedical informatics: development of a comprehensive data warehouse for clinical and genomic breast cancer research.

Hai Hu1, Henry Brzeski, Joe Hutchins, Mohan Ramaraj, Long Qu, Richard Xiong, Surendran Kalathil, Rand Kato, Santhosh Tenkillaya, Jerry Carney, Rosann Redd, Sheshkumar Arkalgudvenkata, Kashif Shahzad, Richard Scott, Hui Cheng, Stephen Meadow, John McMichael, Shwu-Lin Sheu, David Rosendale, Leonid Kvecher, Stephen Ahern, Song Yang, Yonghong Zhang, Rick Jordan, Stella B Somiari, Jeffrey Hooke, Craig D Shriver, Richard I Somiari, Michael N Liebman.   

Abstract

The Windber Research Institute is an integrated high-throughput research center employing clinical, genomic and proteomic platforms to produce terabyte levels of data. We use biomedical informatics technologies to integrate all of these operations. This report includes information on a multi-year, multi-phase hybrid data warehouse project currently under development in the Institute. The purpose of the warehouse is to host the terabyte-level of internal experimentally generated data as well as data from public sources. We have previously reported on the phase I development, which integrated limited internal data sources and selected public databases. Currently, we are completing phase II development, which integrates our internal automated data sources and develops visualization tools to query across these data types. This paper summarizes our clinical and experimental operations, the data warehouse development, and the challenges we have faced. In phase III we plan to federate additional manual internal and public data sources and then to develop and adapt more data analysis and mining tools. We expect that the final implementation of the data warehouse will greatly facilitate biomedical informatics research.

Entities:  

Mesh:

Year:  2004        PMID: 15469411     DOI: 10.1517/14622416.5.7.933

Source DB:  PubMed          Journal:  Pharmacogenomics        ISSN: 1462-2416            Impact factor:   2.533


  3 in total

1.  The VA Hypertension Primary Care Longitudinal Cohort: Electronic medical records in the post-genomic era.

Authors:  Rany M Salem; Braj Pandey; Erin Richard; Maple M Fung; Erin P Garcia; Victoria H Brophy; Nicholas J Schork; Daniel T O'Connor; Vibha Bhatnagar
Journal:  Health Informatics J       Date:  2010-12       Impact factor: 2.681

2.  TRUNCATULIX--a data warehouse for the legume community.

Authors:  Kolja Henckel; Kai J Runte; Thomas Bekel; Michael Dondrup; Tobias Jakobi; Helge Küster; Alexander Goesmann
Journal:  BMC Plant Biol       Date:  2009-02-11       Impact factor: 4.215

Review 3.  Big data and clinicians: a review on the state of the science.

Authors:  Weiqi Wang; Eswar Krishnan
Journal:  JMIR Med Inform       Date:  2014-01-17
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.