| Literature DB >> 25526783 |
Jesse D Riordan, Luke J Drury, Ryan P Smith, Benjamin T Brett, Laura M Rogers, Todd E Scheetz, Adam J Dupuy1.
Abstract
BACKGROUND: Animal models of cancer are useful to generate complementary datasets for comparison to human tumor data. Insertional mutagenesis screens, such as those utilizing the Sleeping Beauty (SB) transposon system, provide a model that recapitulates the spontaneous development and progression of human disease. This approach has been widely used to model a variety of cancers in mice. Comprehensive mutation profiles are generated for individual tumors through amplification of transposon insertion sites followed by high-throughput sequencing. Subsequent statistical analyses identify common insertion sites (CISs), which are predicted to be functionally involved in tumorigenesis. Current methods utilized for SB insertion site analysis have some significant limitations. For one, they do not account for transposon footprints - a class of mutation generated following transposon remobilization. Existing methods also discard quantitative sequence data due to uncertainty regarding the extent to which it accurately reflects mutation abundance within a heterogeneous tumor. Additionally, computational analyses generally assume that all potential insertion sites have an equal probability of being detected under non-selective conditions, an assumption without sufficient relevant data. The goal of our study was to address these potential confounding factors in order to enhance functional interpretation of insertion site data from tumors.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25526783 PMCID: PMC4378557 DOI: 10.1186/1471-2164-15-1150
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Validation of candidate SB-induced footprint sites identified by deep-sequencing analysis. (A) Schematic diagram depicting the method used to validate putative footprints. Primers were designed to amplify genomic regions predicted to contain SB-induced footprints in DNA from tumor samples, but not in DNA from normal tissue. Products amplified from tumor and normal DNA were purified and subjected to enzymatic digest with HpyCH4III, a restriction endonuclease whose recognition site is generated following SB transposon excision. (B) Digested products were separated by agarose gel electrophoresis. Results from a high confidence (HC) and low confidence (LC) predicted footprint are shown for each of the three tumors analyzed. Undigested products range in size from 473–530 bp (upper bands), while digested products range in size from 235–268 bp (lower bands).
Candidate footprints identified in SB-induced tumors
| Tumor ID | Predicted high conf. | Validated high conf. | Predicted low conf. | Validated low conf. | Total candidates | Within introns | Within exons |
|---|---|---|---|---|---|---|---|
| 473 | 149 | 5/5 (100%) | 69 | 0/5 (0%) | 218 | 82 (37.6%) | 5 (2.3%) |
| 557 | 2 | 2/2 (100%) | 95 | 1/8 (12.5%) | 97 | 41 (42.3%) | 3 (3.1%) |
| 672 | 133 | 5/5 (100%) | 104 | 0/5 (0%) | 237 | 90 (38.0%) | 5 (2.1%) |
|
|
|
|
|
|
|
|
|
Figure 2The use of standards demonstrates the semi-quantitative range of ligation-mediated PCR to detect transposon insertions in a complex sample. (A) A set of 10 plasmid standards were added to eight DNA samples from SB-induced tumors. These tumors were generated using either a high-copy (HC1-4) [5] or low-copy (LC1-4) transposon donor [8]. The standards were split into three groups and spiked into each tumor DNA sample to mimic insertion events present in 1 copy per cell (1X), 0.5 copies per cell (0.5X), or 0.125 copies per cell (0.125X). The maximum read value obtained for each standard was then expressed as a percentage of the most abundant standard identified in each sample. The indicated values represent the average value for each standard obtained from three independent sequence runs for each sample. (B) Insertion sites in each sample were grouped according to the normalized read value (% of maximum signal). The false discovery rate was estimated for each group of insertions (see Methods).
Figure 3Transposon insertion densities on local chromosomes and non-local windows. (A) The transposon insertion site and TA dinucleotide densities are shown for each chromosome that undergoes local hopping. In each case, the height of the peak indicates the proportion of total events present at that location. The strongest single peak on each chromosome corresponds to the mapped location of the indicated transgene (e.g. TG6113). (B) Two genomic regions identified by PRIM analysis show enrichment for transposon insertion: chr11:3,180,001-3,200,000 (top) and chr6:12,510,001-12,570,001 (bottom). In each case, a histogram plot indicates the number of insertion events present at each location (left axis). The density plot indicated by the solid line shows the proportion of total TA dinucleotides (right axis). The density plot represented by the shaded curve represents an estimate indicating the proportion of TA sites in which more than 50% of read lengths can be accurately mapped (see Methods).
Map-corrected, non-local genomic windows with significant enrichment for detected transposon insertions in unselected tissues (liver, lung, skin, spleen)
| Window coordinates | p-value | Identified by initial analysis? | # of tissues with insertions | # of strains with insertions | Gene within window | Other notes |
|---|---|---|---|---|---|---|
| chr2:81,670,001-81,690,000 | 3.39E-07 | No | 3 | 3 | ||
| chr5:28,160,001-28,180,000 | <1.00E-300 | Yes | 4 | 4 |
| Transposon artifact1 |
| chr5:113,680,001-113,700,000 | 7.91E-18 | Yes | 4 | 3 |
| |
| chr6:12,510,001-12,530,000 | <1.00E-300 | Yes | 4 | 4 |
| |
| chr6:12,530,001-12,550,000 | <1.00E-300 | Yes | 4 | 4 |
| |
| chr6:12,550,001-12,570,000 | <1.00E-300 | Yes | 4 | 3 |
| |
| chr8:65,840,001-65,860,000 | 7.21E-11 | Yes | 4 | 3 |
| |
| chr10:36,800,001-36,820,000 | 3.34E-15 | Yes | 4 | 4 |
| |
| chr11:3,180,001-3,200,000 | 5.64E-176 | Yes | 4 | 4 |
| Genome assembly artifact2 |
| chr11:76,780,001-76,800,000 | 3.30E-45 | Yes | 4 | 4 |
| |
| chr11:79,420,001-79,440,000 | 1.88E-09 | Yes | 4 | 3 |
| |
| chr13:15,380,001-15,400,000 | 1.95E-07 | No | 4 | 4 | ||
| chr13:31,620,001-31,640,000 | 1.01E-200 | Yes | 4 | 4 |
| Transposon artifact1 |
| chr16:28,880,001-28,900,000 | 5.71E-09 | Yes | 4 | 3 |
| |
| chr17:83,160,001-83,180,000 | 2.93E-08 | No | 4 | 3 | ||
| chrX:150,970,001-150,990,000 | 2.26E-07 | No | 3 | 4 |
|
1Detected as significant due to the inclusion of En2 and Foxf2 splicing elements within the transposon structure.
2Detected as significant due to presence at a higher copy number than annotated in the reference genome.
Potential false-positive common insertion sites reported in prior publications
| Gene symbol | Window coordinates | # of studies | References |
|---|---|---|---|
| Sfi1 | chr11:3,180,001-3,200,000 | 12 | [ |
| Nf1 | chr11:79,420,001-79,440,000 | 7 | [ |
| Abi1 | chr2:22,990,001-23,010,000 | 5 | [ |
| Son | chr16:91,640,001-91,660,000 | 4 | [ |
| Thsd7a | chr6:12,510,001-12,530,000; | 4 | [ |
| Chl1 | chr6:103,630,001-103,650,000 | 3 | [ |
| Erc1 | chr6:119,570,001-119,590,000 | 3 | [ |
| Fbxl17 | chr17:63,460,001-63,480,000 | 3 | [ |
| Lphn3 | chr5:81,340,001-81,360,000 | 3 | [ |
| March1 | chr8:65,840,001-65,860,000 | 3 | [ |
| Ahcy | chr2:155,050,001-155,070,000 | 2 | [ |
| Diap2 | chrX:130,130,001-130,150,000 | 2 | [ |
| Faf2 | chr13:54,620,001-54,640,000 | 2 | [ |
| Fhit | chr14:9,840,001-9,860,000 | 2 | [ |
| Gart | chr16:91,640,001-91,660,000 | 2 | [ |
| Nav2 | chr7:49,260,001-49,280,000 | 2 | [ |
| Phf21a | chr2:92,270,001-92,290,000 | 2 | [ |
| 2610307P16Rik | chr13:28,800,001-28,820,000 | 1 | [ |
| a | chr2:155,050,001-155,070,000 | 1 | [ |
| Ccdc73 | chr2:104,930,001-104,950,000 | 1 | [ |
| Cpd | chr11:76,780,001-76,800,000 | 1 | [ |
| Dgki | chr6:37,270,001-37,290,000 | 1 | [ |
| En2 | chr5:28,160,001-28,180,000 | 1 | [ |
| Entpd7 | chr19:43,700,001-43,720,000 | 1 | [ |
| Ide | chr19:37,300,001-37,320,000 | 1 | [ |
| Immp1l | chr2:105,910,001-105,930,000 | 1 | [ |
| Oasl2 | chr5:114,900,001-114,920,000 | 1 | [ |
| Pcdh15 | chr10:73,880,001-73,900,000 | 1 | [ |
| Vps8 | chr16:21,500,001-21,520,000 | 1 | [ |