| Literature DB >> 30911338 |
Miriam Merenciano1, Camillo Iacometti1,2, Josefa González1.
Abstract
Transposable elements (TEs) are not randomly distributed in the genome. A genome-wide analysis of the D. melanogaster genome found that differences in TE density across 50 kb genomic regions was due both to transposition and duplication. At smaller genomic scales, promoter regions of hsp genes and the promoter region of CG18446 have been shown to accumulate TE insertions. In this work, we have further analyzed the promoter region of CG18446. We screened 218 strains collected in 15 natural populations, and we found that the CG18446 promoter region contains 20 independent roo insertions. Based on phylogenetic analysis, we suggest that the presence of multiple roo insertions in this region is likely to be the result of several bursts of transposition. Moreover, we found that the roo insertional cluster in the CG18446 promoter region is unique: no other promoter region in the genome contains a similar number of roo insertions. We found that, similar to hsp gene promoters, chromatin accessibility could be one of the factors explaining the recurrent insertions of roo elements in CG18446 promoter region.Entities:
Keywords: Fecundity; Natural population; Recurrent insertion; Target site duplication; Transposable element; Viability
Year: 2019 PMID: 30911338 PMCID: PMC6415491 DOI: 10.1186/s13100-019-0152-9
Source DB: PubMed Journal: Mob DNA
PCR results and roo insertions identified in the 218 strains analyzed in this work, and in brackets insertions identified in the 39 strains analyzed in Merenciano et al. (2016) (10)
| Population | Strains analyzed | Strains homozygous for the presence of a | Strains heterozygous for the presence of a | |
|---|---|---|---|---|
| Akka, FI | 13 | 3 | 4 |
|
| Stockholm, SE | 23 | 9 | 6 |
|
| Lund, SE | 6 | 3 | 1 |
|
| Karensminde, DK | 12 | 5 | 2 |
|
| Munich, DE | 14 | 6 | 5 |
|
| Market Harborough, UK | 20 | 5 | 7 |
|
| Gotheron, FR | 13 | 3 | 2 |
|
| Bari, IT | (12) | (3) | (4) |
|
| Gimenells, ES | 14 | 3 | 9 |
|
| Tomelloso, ES | 15 | 3 | 10 |
|
| Cortes de Baza, ES | 13 | 0 | 9 |
|
| Guadix, ES | 14 | 0 | 11 |
|
| San Cristóbal de la Laguna, ES | 12 | 6 | 2 |
|
| Raleigh, US | 22 (27) | 17 (19) | 0 |
|
| Siavonga, ZI | 27 | 2 | 10 |
|
| TOTAL | 257 | 87 | 82 | 20 |
Fig. 1Twenty roo solo-LTR insertions are located in the promoter region of CG18446 gene in different natural strains. Schematic representation of the CG18446 promoter region where the 20 roo solo-LTRs are inserted. The black line represents the CG18446 promoter region. Note that although only one insertion was found in any given strain, we have represented them together for simplicity. The white box represents CG18446 5’UTR. Regions depicted with dotted lines are not drawn to scale. Insertions present in 5′-3′ orientation are shown above the black line and insertions present in 3′-5′ orientation are shown below the black line
Fig. 2The majority of the insertions in the CG18446 promoter region are present at very low allelic frequencies. Allelic frequencies of the 20 roo solo-LTRs insertions in populations from Europe, North America, and Africa. The number of strains analyzed in each population is given in parenthesis and the pie chart size is proportional to this number. Blue, yellow, and red lines represent populations with cold, temperate, and arid climates, respectively
Fig. 3The 20 roo solo-LTR insertions found in CG18446 promoter region are the result of several bursts of transposition. Phylogenetic tree including the 20 roo solo-LTR insertions found in CG18446 promoter region and 115 other roo insertions annotated in the reference genome. roo solo-LTRs found in this work are highlighted in red. When several TEs with the same exact sequence were identified, we removed all of them but one. The TEs included in the tree are marked with *. The TEs that were eliminated are: (*) FBti0019017, FBti0019394, FBti0019438, FBti0020009, FBti0020080, roo, roo, roo, roo, roo, roo, and roo, (**) roo, (***) roo, and (****) FBti0019608
de novo roo insertions found in four gene promoter regions
| Promoter region | Number of strains predicted to have an insertion | Genomic coordinates of the insertion sites validated |
|---|---|---|
|
| 4 | 3R: 25,621,076 and 26,521,553 |
|
| 3 | 3R: 6,027,532 and 6,027,608 |
|
| 3 | 3 L: 2,169,152 and 2,169,162 |
|
| 3 | X: 14,114,700 and 14,115,661 |
Fig. 4Egg-to-adult viability and fecundity are not likely to reduce the probability of finding flies with more than one roo element in the CG18446 promoter region. a. Percentage of emerged flies from the four crosses tested. b. Average number of eggs laid per day during the first 48 h for each of the four crosses analyzed. c. Average number of eggs laid per day during 18 days for each of the four crosses analyzed