| Literature DB >> 30450130 |
Alexander Y Komkov1,2, Anastasia A Minervina1, Gaiaz A Nugmanov1, Mariia V Saliutina1, Konstantin V Khodosevich3, Yuri B Lebedev1, Ilgar Z Mamedov1,4.
Abstract
BACKGROUND: There is increasing evidence that the transpositional activity of retroelements (REs) is not limited to germ line cells, but often occurs in tumor and normal somatic cells. Somatic transpositions were found in several human tissues and are especially typical for the brain. Several computational and experimental approaches for detection of somatic retroelement insertions was developed in the past few years. These approaches were successfully applied to detect somatic insertions in clonally expanded tumor cells. At the same time, identification of somatic insertions presented in small proportion of cells, such as neurons, remains a considerable challenge.Entities:
Keywords: Genomic normalization; Kamchatka Crab duplex-specific nuclease; Somatic retroelement insertions
Year: 2018 PMID: 30450130 PMCID: PMC6208084 DOI: 10.1186/s13100-018-0136-1
Source DB: PubMed Journal: Mob DNA
Fig. 1Overview of experimental procedure. Green boxes indicate Alu elements, white boxes – ligated adapter. Red arrows indicate genomic restriction sites for AluI, black horizontal arrows – primers and their annealing sites. Blue boxes (BH) – 8-nt molecular identifiers (UMIs)
Distribution of sequencing reads
| Sequencing reads | Unnormalized | Normalization 1 | Normalization 2 |
|---|---|---|---|
| Total | 13,736,244 | 16,991,713 | 16,533,472 |
| Unambiguously mapped to the human genome (hg38) | 8,376,753 | 7,406,323 | 5,765,382 |
| Corresponding to known AluYa5 and AluYa8 | 6,484,320 | 1,180,602 | 198,024 |
| Number of known AluYa5 and AluYa8 | 2134 | 2392 | 2248 |
| Corresponding to other known Alu | 743,310 | 5,617,692 | 4,496,177 |
| Corresponding to somatic Alu | 56 | 609 | 1525 |
Number of sequencing reads and UMIs corresponding to putative somatic insertions and spike-in controls
| Putative somatic Alu insertions | Unnormalized | Normalization 1 | Normalization 2 |
|---|---|---|---|
| Sequencing reads | 56(47)a | 609(218)a | 1,525(461)a |
| Total number | 47 | 171 | 378 |
| Number with UMI count > 1 | 0 | 24 | 38 |
| Spike-in controls | |||
| DR240 (in 5 cellsb) | 0 | 4(3)a | 7(4)a |
| DR389 (in 5 cellsb) | 1(1)a | 1(1)a | 1(1)a |
| DR259 (in 1 cellc) | 0 | 0 | 1(1)a |
| DR418 (in 1 cellc) | 0 | 0 | 0 |
aNumber of UMIs is given in parentheses
bCorresponds to an insertion present in 5 out of 50,000 cells
cCorresponds to an insertion present in 1 out of 50,000 cells
Fig. 2qPCR analysis of selected insertions and spike-in controls. Each dot indicates Ct values for each Alu flanking sequence in “unnormalized”, “normalization 1” and “normalization 2” libraries. Orange dots – fixed insertions (FI) present in each cell, blue dots – somatic insertions (SI) present in more than one cell, green dots (DR) – spike-in controls containing artificial sequences from Danio rerio. The difference in Ct between abundant fixed insertions and rare spike-in insertions changed from 25 cycles for “unnormalized” to 10 cycles for “normalization 2” libraries
Fig. 3Effect of fragment length and GC content on normalization efficiency. The number of sequencing reads of rare (corresponding to AluYb8) and abundant (corresponding to AluYa5) flanks is plotted against fragment length (a) and GC content (b), respectively, in “unnormalized”, “normalization 1” and “normalization 2” datasets. Y axis – number of reads (logarithmic scale). X axis is length of fragments (a) or their GC content (b). Green circles and green crosses indicate Ya5 and Yb8 insertions in “unnormalized” library; orange circles and orange crosses indicate Ya5 and Yb8 insertions in “normalization 1” library; blue circles and blue crosses indicate Ya5 and Yb8 insertions in “normalization 2” library. Trendlines were fit to data using generalized additive models, shaded area indicate confidence interval (CI=0.95) for trendlines
Fig. 4Results of putative somatic Alu locus-specific PCR. Marker – 100 bp DNA ladder (Evrogen); cortex – results of semi-nested PCR with “normalization 2” library; myocardium - results of semi-nested PCR with control library (myocardium from the same donor); N1-N12 somatic insertions with UMI count more than 1. Blue arrows indicate PCR products of expected size. For N4 insertion amplification product of expected size was observed in both “normalization 2” and control myocardium PCR reactions. For N8 insertion amplification product of expected size was not detected neither in “normalization 2” nor in control myocardium PCR reaction. An example (N9 insertion) of capillary sequencing result is shown at the bottom