| Literature DB >> 34021759 |
Vincent Mérel1, Patricia Gibert1, Inessa Buch1, Valentina Rodriguez Rada1, Arnaud Estoup2, Mathieu Gautier2, Marie Fablet1, Matthieu Boulesteix1, Cristina Vieira1.
Abstract
Transposable elements (TEs) are ubiquitous and mobile repeated sequences. They are major determinants of host fitness. Here, we characterized the TE content of the spotted wing fly Drosophila suzukii. Using a recently improved genome assembly, we reconstructed TE sequences de novo and found that TEs occupy 47% of the genome and are mostly located in gene-poor regions. The majority of TE insertions segregate at low frequencies, indicating a recent and probably ongoing TE activity. To explore TE dynamics in the context of biological invasions, we studied the variation of TE abundance in genomic data from 16 invasive and six native populations of D. suzukii. We found a large increase of the TE load in invasive populations correlated with a reduced Watterson estimate of genetic diversity θw^ a proxy of effective population size. We did not find any correlation between TE contents and bioclimatic variables, indicating a minor effect of environmentally induced TE activity. A genome-wide association study revealed that ca. 2,000 genomic regions are associated with TE abundance. We did not find, however, any evidence in such regions of an enrichment for genes known to interact with TE activity (e.g., transcription factor encoding genes or genes of the piRNA pathway). Finally, the study of TE insertion frequencies revealed 15 putatively adaptive TE insertions, six of them being likely associated with the recent invasion history of the species.Entities:
Keywords: zzm321990 Drosophila suzukiizzm321990 ; PoolSeq; adaptation; biological invasion; populations; transposable elements
Mesh:
Substances:
Year: 2021 PMID: 34021759 PMCID: PMC8476158 DOI: 10.1093/molbev/msab155
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Fig. 1.Main features of the TE content in the Drosphilla suzukii reference genome. (A) TE genomic occupancy. Piechart illustrating genomic sequence occupancy of each TE order (in percentages of the assembly). (B) TE copy numbers. Bar plot representing TE copy numbers for the 20 TE superfamilies displaying the highest copy numbers. (C) Distribution of TEs and genes. TE density and gene density are shown for windows of 200 kb. Lighter shades correspond to D. melanogaster and darker shades correspond to D. suzukii. The maximum value of gene density is 54 for D. suzukii and 102 for D. melanogaster. The maximum number of TE fragments is 713 for D. suzukii and 442 for D. melanogaster. Syntenic relationships between D. melanogaster and D. suzukii assemblies are shown inside using light links for regions of low gene density (<7 genes per 200 kb in D. suzukii assembly) and dark links for regions of high gene density (≥7 genes/200 kb). Contigs are surrounded by black strokes. Ticks on D. melanogaster assembly are separated by 1 Mb.
Fig. 2.TE activity in the Drosophila suzukii reference population from Watsonville (USA). (A) Frequency distributions of TE insertions. (B) Population frequencies for each TE family. Pseudofamilies are indicated by a star on the x-axis. Only families/pseudofamilies with more than ten insertions in the reference population are shown. DNA and RCs have been grouped for graphical reasons.
Fig. 3.TE dynamics in native and invasive Drosophila suzukii populations. (A) Geographic location and historical status of the 22 D. suzukii population samples genotyped using a pool-sequencing methodology (Fraimout et al. 2017). (B) TE content in D. suzukii populations, as the numbers of insertions per HG. (C) Correlation between TE content and Watterson’s θ in D. suzukii population samples.
Description of the 15 Putatively Adaptive TE Insertions.
| Insertion | Statistics | Gene Vicinity | Outlier SNP Nearby | A/X | TE Order |
|---|---|---|---|---|---|
| 1 |
| F | A | RC | |
| 2 |
|
| F | A | RC |
| 3 |
| — | T | A | DNA |
| 4 |
| NA | F | X | RC |
| 5 |
|
| F | X | RC |
| 6 |
|
| F | X | DNA |
| 7 | XtX |
| F | A | DNA |
| 8 | XtX |
| F | A | RC |
| 9 | XtX | — | F | A | RC |
| 10 | XtX |
| F | A | Unknown |
| 11 | XtX | — | F | A | RC |
| 12 | XtX |
| F | A | Unknown |
| 13 | XtX |
| F | A | RC |
| 14 | XtX |
| F | A | Unknown |
| 15 | XtX |
| F | A | RC |
Note.—Each insertion is an outlier when considering one or a combination of the global differentiation statistics (XtX) and statistics contrasting allelic frequencies between native populations and populations of the invasive American road () or populations of the invasive European road () or all invasive populations (). The fourth column indicates whether an SNP potentially evolving under positive selection had been detected less than 5 kb away in Olazcuaga et al. (2020) (F = false, T = true). The fifth column indicates whether the insertion is located on an autosomal (A) or X-linked contig (X).
Fig. 4.Frequencies of each of the 15 putatively adaptive insertions in the 22 Drosophila suzukii populations. Insertion number is indicated on the left together with the associated BayPass statistics. XtX corresponds to a global differentiation statistic, C2 to a statistic contrasting allelic frequencies between native populations and populations of the invasive American road () or populations of the invasive European road () or all invasive populations ().