| Literature DB >> 23645600 |
Jiangshan J Shen1, Jonathan Dushoff, Adam J Bewick, Frédéric J J Chain, Ben J Evans.
Abstract
Transposable elements (TEs) are repetitive DNA sequences that can make new copies of themselves that are inserted elsewhere in a host genome. The abundance and distributions of TEs vary considerably among phylogenetically diverse hosts. With the aim of exploring the basis of this variation, we evaluated correlations between several genomic variables and the presence of TEs and non-TE repeats in the complete genome sequence of the Western clawed frog (Silurana tropicalis). This analysis reveals patterns of TE insertion consistent with gene disruption but not with the insertional preference model. Analysis of non-TE repeats recovered unique features of their genome-wide distribution when compared with TE repeats, including no strong correlation with exons and a particularly strong negative correlation with GC content. We also collected polymorphism data from 25 TE insertion sites in 19 wild-caught S. tropicalis individuals. DNA transposon insertions were fixed at eight of nine sites and at a high frequency at one of nine, whereas insertions of long terminal repeat (LTR) and non-LTR retrotransposons were fixed at only 4 of 16 sites and at low frequency at 12 of 16. A maximum likelihood model failed to attribute these differences in insertion frequencies to variation in selection pressure on different classes of TE, opening the possibility that other phenomena such as variation in rates of replication or duration of residence in the genome could play a role. Taken together, these results identify factors that sculpt heterogeneity in TE distribution in S. tropicalis and illustrate that genomic dynamics differ markedly among TE classes and between TE and non-TE repeats.Entities:
Keywords: African clawed frogs; GC content; Xenopus tropicalis; gene expression; genome evolution; natural selection
Mesh:
Substances:
Year: 2013 PMID: 23645600 PMCID: PMC3673623 DOI: 10.1093/gbe/evt065
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Coefficients of Logistic Regression
| Predictor | All TEs | TEs by Type | TEs by Size | Non-TE Repeat | ||
|---|---|---|---|---|---|---|
| DNA transposon | Non-LTR Retrotransposons | Long | Short | |||
| Exon | −0.14421 | −0.10878 | −0.23647 | −0.08214 | −0.14448 | 0.00180 |
| Intron | 0.00330 | 0.02476 | −0.06380 | 0.01551 | 0.00236 | −0.01565 |
| Downstream distance | 0.07326 | 0.06659 | 0.01910 | 0.08695 | 0.06266 | 0.02147 |
| Upstream distance | 0.06494 | 0.06270 | 0.00589 | 0.08392 | 0.05458 | −0.00894 |
| Germline expression | 0.00588 | 0.00908 | −0.03113 | −0.02406 | 0.00633 | −0.01477 |
| Somatic expression | −0.01789 | −0.01542 | −0.00950 | −0.03282 | −0.01617 | 0.00008 |
| Conserved | −0.14227 | −0.19235 | 0.09414 | −0.17893 | −0.13198 | 0.10243 |
| GC | −0.22590 | −0.22798 | −0.14974 | −0.21848 | −0.22172 | −0.34670 |
Note.—GC content is calculated without including TE.
*Individually significant departure from zero (P < 0.05).
FLogistic regression coefficients between genomic variables and the three TE classes and non-TE repeats. GC content was calculated excluding the GC content of TEs. Bars indicate two standard deviations of the correlation coefficients and in most cases are small enough to be hidden by the symbols.
TE Insertion Polymorphism Data for Wild-Caught Silurana tropicalis Individuals
| Ghana | ||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sample ID | UWBM5959 | UWBM5961 | UWBM5958 | UWBM5957 | UWBM5962 | UWBM5960 | UWBM5963 | MVZ 254093 | MVZ 245094 | MVZ 245095 | MVZ 245096 | MVZ 245097 | MVZ 245420 | MVZ 252599 | MVZ 252600 | XEN 231 Nigeria | AMNH 17271 Sierra Leone | XEN 228 Ivory Coast | L 1720 Liberia | Summary |
| DNA transposons | ||||||||||||||||||||
| DNA transposon 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | NA | F/100 |
| DNA transposon 3 | NA | NA | NA | 1 | NA | 1 | 1 | 1 | NA | 1 | NA | NA | 1 | 1 | 1 | 1 | 1 | 1 | NA | F/100 |
| DNA transposon 5 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0/1 | 1 | 1 | 0/1 | 1 | 1 | 1 | 1 | C/95 |
| DNA transposon 6 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | F/100 |
| DNA transposon 10 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | F/100 |
| DNA transposon 11 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | F/100 |
| DNA transposon 12 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | F/100 |
| DNA transposon 14 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | F/100 |
| DNA transposon 15 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | F/100 |
| Non-LTR Retrotransposons | ||||||||||||||||||||
| Non-LTR 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NA | 0 | 0 | 0 | 0 | 0 | 0 | 0/1 | 0 | NA | NA | R/3 |
| Non-LTR 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NA | R/3 |
| Non-LTR 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | R /3 |
| Non-LTR 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | R/3 |
| Non-LTR 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | R/8 |
| Non-LTR 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0/1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | R/10 |
| Non-LTR 9 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | F/100 |
| Non-LTR 10 | 1 | 1 | 1 | 1 | NA | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | NA | F/100 |
| Non-LTR 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0/1 | 0 | 0 | 0 | R/5 |
| Non-LTR 12 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | F/100 |
| Non-LTR 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | NA | R/8 |
| Non-LTR 15 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | F/100 |
| LTR retrotransposons | ||||||||||||||||||||
| LTR 3 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | C/95 |
| LTR 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | R/8 |
| LTR 6 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0/1 | 1 | 1 | 1 | 1 | 1 | 1 | 0/1 | 0/1 | 0 | C/92 |
| LTR 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0/1 | 0/1 | 0 | 0 | R/8 |
Note.—For each locus, individuals are either homozygous for a TE insertion (I), homozygous for no insertion (0), or heterozygous (0/1). Some loci for some individuals did not amplify (NA). A summary of insertion genotype frequencies across all individuals, including the genome sequence (not shown) includes rare (R), intermediate (I), common (C), or fixed (F), followed by the percentage of alleles surveyed that have an insertion. For these calculations, the genome sequence was treated as haploid, so a maximum of 39 alleles were surveyed.