| Literature DB >> 26690135 |
Martin Hofmann-Apitius1,2, Gordon Ball3,4, Stephan Gebel5, Shweta Bagewadi6, Bernard de Bono7,8, Reinhard Schneider9, Matt Page10, Alpha Tom Kodamullil11, Erfan Younesi12, Christian Ebeling13, Jesper Tegnér14,15, Luc Canard16.
Abstract
Since the decoding of the Human Genome, techniques from bioinformatics, statistics, and machine learning have been instrumental in uncovering patterns in increasing amounts and types of different data produced by technical profiling technologies applied to clinical samples, animal models, and cellular systems. Yet, progress on unravelling biological mechanisms, causally driving diseases, has been limited, in part due to the inherent complexity of biological systems. Whereas we have witnessed progress in the areas of cancer, cardiovascular and metabolic diseases, the area of neurodegenerative diseases has proved to be very challenging. This is in part because the aetiology of neurodegenerative diseases such as Alzheimer´s disease or Parkinson´s disease is unknown, rendering it very difficult to discern early causal events. Here we describe a panel of bioinformatics and modeling approaches that have recently been developed to identify candidate mechanisms of neurodegenerative diseases based on publicly available data and knowledge. We identify two complementary strategies-data mining techniques using genetic data as a starting point to be further enriched using other data-types, or alternatively to encode prior knowledge about disease mechanisms in a model based framework supporting reasoning and enrichment analysis. Our review illustrates the challenges entailed in integrating heterogeneous, multiscale and multimodal information in the area of neurology in general and neurodegeneration in particular. We conclude, that progress would be accelerated by increasing efforts on performing systematic collection of multiple data-types over time from each individual suffering from neurodegenerative disease. The work presented here has been driven by project AETIONOMY; a project funded in the course of the Innovative Medicines Initiative (IMI); which is a public-private partnership of the European Federation of Pharmaceutical Industry Associations (EFPIA) and the European Commission (EC).Entities:
Keywords: bioinformatics; data integration; disease models; genetics; graphical models; knowledge-based modeling; mechanism-identification; multiscale; neurodegeneration
Mesh:
Year: 2015 PMID: 26690135 PMCID: PMC4691095 DOI: 10.3390/ijms161226148
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Head diffusion graph surrounding Apolipoprotein E (APOE). A graph representing Single Nucleotide Polymorphisms (SNPs) (purple circles), epigenetic information (chromatin states of genes), Linkage Disequilibrium (LD) blocks (blue circle), studies (orange circles) and cell types (yellow circles) displays relevant information associated with the APOE gene and its biology.
Figure 2Differential analysis of triggering receptor expressed on myeloid cells 2 (TREM2) activities in neuroinflammation and normal state: In this figure dotted lines indicates regulation, arrows represent increase relation and T shaped lines represent inhibits/decrease relation. Red colored nodes and edges indicate the disease state mechanisms whereas green colored nodes and edges show normal state processes. The highlighted words in the uppermost screenshot represent the entity (term) markup in text by the text mining tool used.
Figure 3Workflow for Resource Description Framework (RDF)-mediated data and knowledge integration and the identification of “signals” in a network context. The workflow essentially identifies subgraphs in the feature network that are highly discriminating between healthy and diseased; the term “long list” describes a list of “interesting network features” that provide contextual, mechanistic insight that can be analyzed by experts in the field.
Figure 4Data upload to Parkinson´s disease (PD) map (June 2015 release). (A) Differential gene expression (false-discovery-rate (FDR) <0.05) from a meta-analysis of eight transcriptome data sets, comparing human post-mortem brain samples from PD vs. healthy controls displayed on the full PD map [95]; (B) Section of the mitochondrial electron transport chain; (C) Section of the cAMP responsive element binding protein (CREB) signaling pathway; (D) Age-dependent differentially expressed genes from healthy aging brain [98]. Section of the CREB signaling pathway. Color-code: green = up regulation, red = down regulation.
Figure 5Workflow for the interpretation (Step B) of results from clustering analysis of NDD patient data (Step A); through the application of physiology route calculations and visualization in ApiNATOMY (Step C). bottom right: Triangle represent radiological measurements; Circle represent clinical indices, Diamond represent “omic” indices. The orange color is meant to describe the generic character of the abovementioned dimensions. In the other parts of the graphic, the various colors in the shapes are representing the values that are taken by those dimensions in a specific context (e.g., individuals). The P is an abbreviation for Publication.
Figure 6Workflow for candidate mechanism identification in the AETIONOMY Knowledge Base—the Parkinson Disease example. represent BEL ans SBML models; represents the gene component of the model. represents the protein component.