| Literature DB >> 30102348 |
Zi-Wen Li1, Xing-Hui Hou1,2, Jia-Fu Chen1,2, Yong-Chao Xu1,2, Qiong Wu1, Josefa González3, Ya-Long Guo1,2.
Abstract
Transposable elements (TEs) are mobile genetic elements with very high mutation rates that play important roles in shaping genome architecture and regulating phenotypic variation. However, the extent to which TEs influence the adaptation of organisms in their natural habitats is largely unknown. Here, we scanned 201 representative resequenced genomes from the model plant Arabidopsis thaliana and identified 2,311 polymorphic TEs from noncentromeric regions. We found expansion and contraction of different types of TEs in different A. thaliana populations. More importantly, we identified two TE insertions that are likely candidates to play a role in adaptive evolution. Our results highlight the importance of variations in TEs for the adaptation of plants in general in the context of rapid global climate change.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30102348 PMCID: PMC6117151 DOI: 10.1093/gbe/evy171
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Identification of polymorphic reference and nonreference TE sites. (A) Diagram of polymorphic TE sites. (B) Diagram of the polymorphic TE identification method. (C) Flowchart of the polymorphic TE detection method.
Sensitivity and FDR of Various Identification Methods
| Identification Method | Number of Identified TEs | Sensitivity (%) | FDR (%) |
|---|---|---|---|
| Raw data | 10,910 | 90.59 | 11.26 |
| Filtering with direction information | 9,608 | 89.05 | 1.39 |
| Filtering with direction information in noncentromeric region | 8,883 | 90.45 | 1.04 |
| Filtering with direction information in noncentromeric region and at least three reads | 8,853 | 90.32 | 0.93 |
. 2.—PCA based on SNP sites in 201 accessions. Accessions filtered out of the three populations are marked with accession names. popE, popN, and popY represent accessions mainly from Europe, accessions from Northwestern China and accessions from the Yangtze River basin, respectively.
The Numbers of TE Loci in the Three Populations
| popE | popN | popY | Three Populations | |
|---|---|---|---|---|
| Raw TE loci | 3,930 (2,046/1,884) | 2,891 (1,258/1,633) | 2,932 (1,324/1,608) | 4,305 (2,421/1,884) |
| TE loci in noncentromeric region | 3,504 (1,756/1,748) | 2,556 (1,052/1,504) | 2,556 (1,077/1,479) | 3,856 (2,108/1,748) |
| TE loci in noncentromeric region with at least three reads per accession on average | 2,064 (1,092/972) | 1,434 (704/730) | 1,403 (700/703) | 2,311 (1,339/972) |
| Inserted TEs | 101,085 (16,443/84,642) | 19,684 (7,910/11,774) | 47,869 (20,282/27,587) | 168,638 (44,635/124,003) |
| Inserted TEs per accession (95% confidence intervals) | 944.7±7.7 (153.7±6.9/791.0±12.4) | 820.2±5.5 (329.6±5.0/490.6±4.9) | 811.3±2.9 (343.8±2.3/467.6±3.1) | 887.6±10.3 (234.9±13.8/652.6±23.5) |
| Population-specific TE loci | 618 (450/168) | 81 (81/0) | 69 (69/0) | 768 (600/168) |
The first number in parentheses is the number of polymorphic TEs identified by nonreference TE insertion, and the second is the number of polymorphic TEs identified by reference TE insertion.
. 3.—Frequency and distribution of polymorphic TEs in A. thaliana populations. (A) Frequency of polymorphic TEs in A. thaliana. (B) Distribution of polymorphic TEs in the genome. (C) Frequency of polymorphic TEs in different genomic regions. (D) Composition of polymorphic TEs in different populations.
. 4.—Identification of adaptive TEs. (A) Flowchart of the adaptive TE detection method. (B) The absolute values of integrated haplotype scores (|iHS|) at polymorphic TE sites in popY, popN, and popE, respectively. Each point indicates a polymorphic TE site (MAF>0.05). The green line represents the threshold of top 5% highest |iHS| values in each population. Red points are polymorphic TE sites with significantly high |iHS| values in the permutation analysis, whereas polymorphic TE sites with nonsignificant |iHS| values are marked as black points. The arrows indicate the two adaptive TEs. (C, D) |iHS| values of TE_1 and TE_2 (the red points) and SNP sites (the blue points) in 20 kb flanking regions, respectively. Gene models and the TE locus are corresponding to genome positions used in |iHS| plot. (E, F) Extended haplotype homozygosity (EHH) in adaptive TEs and its flanking regions.
List of Two Adaptive TEs
| Adaptive TE | The Predicted Central Position of TE | The Upstream Gene | The Related TE | The Downstream Gene | TE Insertion Frequency in popE/N/Y | |iHS| | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Gene ID | Annotation | TE ID | Annotation | Gene ID | Annotation | |||||
| TE_1 | Chr5: 14270878 | AT5G36220 | CYP81D1 | AT5TE51000 | LINE element | AT5G36228 | Nucleic acid-binding protein | 76/1/5 | 3.29 | 0.036 |
| TE_2 | Chr5: 21851755 | AT5G53800 | Nucleic acid-binding protein | AT5TE78710 in the intron of AT5G53810 | Copia element; AT5G53810 is an | AT5G53820 | Late embryogenesis abundant protein (LEA) family protein | 101/7/0 | 3.60 | 0 |