| Literature DB >> 26442173 |
Johann de Jong1, Lodewyk F A Wessels2, Maarten van Lohuizen3, Jeroen de Ridder4, Waseem Akhtar3.
Abstract
Retroviruses and DNA transposons are an important part of molecular biologists' toolbox. The applications of these elements range from functional genomics to oncogene discovery and gene therapy. However, these elements do not integrate uniformly across the genome, which is an important limitation to their use. A number of genetic and epigenetic factors have been shown to shape the integration preference of these elements. Insight into integration bias can significantly enhance the analysis and interpretation of results obtained using these elements. For three different applications, we outline how bias can affect results, and can potentially be addressed.Entities:
Keywords: chromatin position effect; epigenomics; gene therapy; insertional mutagenesis; integration bias
Year: 2015 PMID: 26442173 PMCID: PMC4588226 DOI: 10.4161/2159256X.2014.992694
Source DB: PubMed Journal: Mob Genet Elements ISSN: 2159-2543
Figure 1.Schematic overview of integration bias with respect to genes for 4 different DNA integrating elements. Adapted from.
Figure 2.Integration bias can give rise to spurious Common Integration Sites (CISs). Hypothetical integration profile of a tumor screen ("Selected" in blue) and corresponding unselected background integrations ("Unselected" in red). In a typical effort for retrieving cancer genes from an IM screen, a genomic region is called as a CIS if the local integration density exceeds a certain threshold ("CIS threshold;" black dotted line). However, some of these regions may also reflect an a priori integration bias.
Figure 3.Integration bias reduces statistical power in TRIP applications. 104 integrations were generated in silico, and distributed across 2 classes (Class 1 and Class 2) in an increasingly uneven fashion. Depending on the assigned class, reporter gene expression was simulated by drawing from a class-specific distribution. Then, (A) the significance of the difference between the 2 classes was determined by Welch's t-test as a function of the size of Class 2 (dashed gray line: 2-sided 5% significance threshold; red solid line: theoretical expected value of the z-normalized t-statistic), and (B) the effect size was determined as the difference in means between the 2 classes as a function of the size of Class 2 (red solid lines: theoretical expected value and standard deviation of the sample distribution of the difference). The x-axis represents the size of Class 2, which indicates how uneven the distribution across the 2 classes is.
Figure 4.Bias for genic regions of 3 DNA integrating elements. (A) For 3 DNA integration elements , integrations were counted within and outside 5kb from genes, and p-values were determined by 2-sided binomial tests. (B) For comparison, the same analysis was done for 3 sets of matched random controls. Refer to for a detailed description of how these controls were generated.