| Literature DB >> 19341506 |
Jing Tang1, Chong Yew Tan, Matej Oresic, Antonio Vidal-Puig.
Abstract
Following the publication of the complete human genomic sequence, the post-genomic era is driven by the need to extract useful information from genomic data. Genomics, transcriptomics, proteomics, metabolomics, epidemiological data and microbial data provide different angles to our understanding of gene-environment interactions and the determinants of disease and health. Our goal and our challenge are to integrate these very different types of data and perspectives of disease into a global model suitable for dissecting the mechanisms of disease and for predicting novel therapeutic strategies. This review aims to highlight the need for and problems with complex data integration, and proposes a framework for data integration. While there are many obstacles to overcome, biological models based upon multiple datasets will probably become the basis that drives future biomedical research.Entities:
Year: 2009 PMID: 19341506 PMCID: PMC2664946 DOI: 10.1186/gm35
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Figure 1A causal model based upon Mendelian randomization. The model demonstrates the core assumptions for making a valid causal inference between a phenotype and disease. The three assumptions are: (1) genotype is independent of the confounder; (2) genotype is associated with phenotype; (3) genotype is independent of disease conditioning on phenotype and confounder. If these assumptions are valid, then an observed association between genotype and disease would imply the causality from phenotype to disease.
Box 1