| Literature DB >> 28486636 |
K M Laricchia1, S Zdraljevic1,2, D E Cook1,2, E C Andersen1,3,4,5.
Abstract
Transposons are mobile DNA elements that generate both adaptive and deleterious phenotypic variation thereby driving genome evolution. For these reasons, genomes have mechanisms to regulate transposable element (TE) activity. Approximately 12-16% of the Caenorhabditis elegans genome is composed of TEs, of which the majority are likely inactive. However, most studies of TE activity have been conducted in the laboratory strain N2, which limits our knowledge of the effects of these mobile elements across natural populations. We analyzed the distribution and abundance of TEs in 208 wild C. elegans strains to better understand how transposons contribute to variation in natural populations. We identified 3,397 TEs as compared with the reference strain, of which 2,771 are novel insertions and 241 are TEs that have been excised in at least one wild strain. Likely because of their hypothesized deleterious effects, we find that TEs are found at low allele frequencies throughout the population, and we predict functional effects of TE insertions. The abundances of TEs reflect their activities, and these data allowed us to perform both genome-wide association mappings and rare variant correlations to reveal several candidate genes that impact TE regulation, including small regulatory piwi-interacting RNAs and chromatin factors. Because TE variation in natural populations could underlie phenotypic variation for organismal and behavioral traits, the transposons that we identified and their regulatory mechanisms can be used in future studies to explore the genomics of complex traits and evolutionary changes.Entities:
Keywords: C. elegans; QTL; natural variation; transposable elements; transposons
Mesh:
Substances:
Year: 2017 PMID: 28486636 PMCID: PMC5850821 DOI: 10.1093/molbev/msx155
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
. 1.Distribution of transposon sites along the six C. elegans chromosomes. Counts represent the number of non-redundant transposon positions from all strains. DNA transposons are displayed in dark blue, retrotransposons in red, and transposons of unknown classification in yellow. Each bin represents 0.25 Mb. The x-axis ticks mark every 5 Mb.
Transposable elements sites can be classified by whether they are insertions, excisions of TEs as compared to the reference genome, or TEs sites shared with the reference genome. The 3,396 TEs can also be grouped into likely DNA elements, retroelements, or unknown TE types.
| Total | 3,396 |
|---|---|
| Insertion sites | 2,771 |
| Active reference sites | 241 |
| Monomorphic reference sites | 384 |
| DNA elements | 2,893 |
| Retroelements | 387 |
| Unknown transposon elements | 116 |
. 2.Distributions and frequency of transposon insertions and active reference sites. (A) Histogram of transposon insertions per strain with blue (DNA transposons), red (retrotransposons), and yellow (unknown transposons) bars shown. (B) Histogram of transposon excisions as compared with the reference genome with the same colors and classifications as in (A). (C) Minor allele frequencies are depicted for all positions at which an insertion site or active reference site was detected.
. 3.Active reference vs. insertion sites per strain. Each point represents an individual strain. The identity line is shown in dotted grey, and the line of best fit is shown in solid red.
. 4.Numbers of observed and expected transposon insertions into different genomic regions. For each class of genomic region, the number of expected (gray) vs. observed (black) transposon insertions are shown. The expected number of transposon insertions were calculated based upon the size of the respective genomic classes. Arm and center chromosome estimates were summed from previous chromosome calculations (Rockman and Kruglyak 2008). TE insertions were classified as singletons found in one strain (“Sin.”) or non-singleton found in more than one strain (“Non-Sin.”). Chi-squared test statistics for the comparisons are found in supplementary table S3, Supplementary Material online. Stars denote significant deviations (P < 0.05) from expectations. Comparisons of TE insertions found on the centers (A) and arms (B) are shown. TE insertions into different genic classes (CDS, intergenic, intron, promoter, and UTR) are shown for centers (C) and arms (D).
. 5.Manhattan plots for three TE count traits. These traits were filtered to retain QTL located more than 100 markers from positions with a high density of transposons and had good phenotypic splits. Total depicts the total number of insertion sites of any TE class. The mappings for Tc1 and CER4-I represent the transposon copy numbers of active reference sites. MIRAGE1 and PAL5A are the counts of insertion sites. The x-axis is the genomic position in megabase pairs (Mb) with the ticks marking every 5 Mb. The y-axis is statistical significance (−log10(P)). The significance threshold is indicated by the dotted gray line. SNVs that are above this threshold are shown in red, and SNVs that are below the threshold or do not pass our QTL filters (see Methods section) are shown in black. The genomic region containing 100 markers on either side of the peak marker are represented by the pink boxes.