| Literature DB >> 26870081 |
Louise B Thingholm1, Lars Andersen2, Enes Makalic3, Melissa C Southey4, Mads Thomassen2, Lise Lotte Hansen5.
Abstract
The development and progression of cancer, a collection of diseases with complex genetic architectures, is facilitated by the interplay of multiple etiological factors. This complexity challenges the traditional single-platform study design and calls for an integrated approach to data analysis. However, integration of heterogeneous measurements of biological variation is a non-trivial exercise due to the diversity of the human genome and the variety of output data formats and genome coverage obtained from the commonly used molecular platforms. This review article will provide an introduction to integration strategies used for analyzing genetic risk factors for cancer. We critically examine the ability of these strategies to handle the complexity of the human genome and also accommodate information about the biological and functional interactions between the elements that have been measured-making the assessment of disease risk against a composite genomic factor possible. The focus of this review is to provide an overview and introduction to the main strategies and to discuss where there is a need for further development.Entities:
Keywords: DNA methylation; array data; gene expression; integrated analysis; massive parallel sequencing (MPS)
Year: 2016 PMID: 26870081 PMCID: PMC4740898 DOI: 10.3389/fgene.2016.00002
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Overview of integration strategies based on co-location of variations.
| CNV | Gene expression | Identification of CNV affected genes, followed by identification of a functional effect in the form of a change in gene expression of the co-located gene. Select genes by: | • Difference in mean expression between variant containing and not-containing tumors. | (Addou-Klouche et al., |
| • Intersecting gene lists (Venn diagram). | Kikuchi et al., | |||
| • Correlation analysis of CNV and expression values. | ||||
| DNA methylation and gene expression | Specification of modality-patterns expected for TSG and oncogenes. Integration on the basis of co-location and identification of genes showing specified pattern. | Wrzeszczynski et al., | ||
| Using a linear model for the effect of methylation and CNV on disease specific expression to identify potential drivers. | AMMERETTO 1. step (Gevaert et al., | |||
| Mutation | Gene expression and CNV | Identify a functional effect in the form of correlation with gene expression or CN of co-located gene. | Ding et al., | |
| CNV and mutation | Integration of variation types by co-location and selecting of drivers by theory of frequency. | Leary et al., | ||
Overview of strategies for integration based on expression modules or interactions of non-co-located variations.
| CNV | Gene expression | Identification of drivers as genes regulating expression modules. | CONEXIC (Akavia et al., |
| Gene expression and DNA methylation | Step 1: identification of drivers as genes with significant relationship between genomic/epigenomic event and expression. | AMARETTO (Gevaert et al., | |
| Step 2: Identification of target modules for drivers. | |||
| Mutation | Gene expression | Identification of genes with mutation status correlating with expression of other genes. Describes a workflow incl. SAM that compares all possible pairs of genes with data available. | Masica and Karchin, |
| Identification of drivers as genes regulating expression modules. Note that potential drivers are selected from a database and not included datasets. | CaMoDi (Manolakos et al., | ||
| CNV and mutation | Gene expression | Identification of drivers as genes regulating expression modules. | DriverNet (Bashashati et al., |
| OncoImpact (Bertrand et al., | |||