| Literature DB >> 15461796 |
Brooke D Peterson-Burch1, Dan Nettleton, Daniel F Voytas.
Abstract
BACKGROUND: Retrotransposons are an abundant component of eukaryotic genomes. The high quality of the Arabidopsis thaliana genome sequence makes it possible to comprehensively characterize retroelement populations and explore factors that contribute to their genomic distribution.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15461796 PMCID: PMC545598 DOI: 10.1186/gb-2004-5-10-r78
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Assembling the retroelement dataset. (a) Flow chart for the generation of the dataset. The shaded region denotes steps coordinated by the RetroMap software. (Eprobe refers to a BLAST query sequence) (b) LTR prediction. The innermost direct repeats identified in sequences flanking the original BLAST hit are assigned as LTRs. The repeats delimit the boundaries of the full-length LTR retrotransposons.
Retroelement species used as BLAST probes
| Element | GenBank accession number | Host organism | Family | Genus | Length (nucleotides) | LTR identity (length in nucleotides) |
| AF296831 | MV | 14,016 | 98.2 (1747) | |||
| Cer1 | U15406 | MV | 8,865 | 100.0 (492) | ||
| AJ133521 | MV | 9,045 | 99.9 (1196) | |||
| AF030881 | MV | 5,645 | 91.0 (610) | |||
| Tf1 | M38526 | MV | 4,941 | 100.0 (358) | ||
| Ty3 | M23367 | MV | 5,428 | 100.0 (340) | ||
| Art1 | Y08010 | PV | 4,793 | 99.8 (439) | ||
| M11240 | PV | 5,416 | 100.0 (276) | |||
| AY016208 | PV | 9,089 | 99.8 (548) | |||
| AF053008 | PV | 10,444 | 100.0 (2149) | |||
| Tca2 | AF050215 | PV | 6,428 | 100.0 (280) | ||
| Tca5 | AF065434 | PV | 5,588 | 100.0 (685) | ||
| M22874 | NL | - | 5154 | - | ||
| L1.2 | M80343 | NL | - | 6,050 | - | |
| R1 | X51968 | NL | - | 5356 | - | |
| R2 | X51967 | NL | - | 3,607 | - | |
| Ta11 | L47193 | NL | - | 7,808 | - |
MV, Metaviridae; PV, Pseudoviridae; NL, non-LTR retrotransposon.
A. thaliana LTR retroelements by chromosome
| Chromosome 1 30,080,809 nucleotides | Chromosome 2 19,643,621 nucleotides | Chromosome 3 23,465,812 nucleotides | Chromosome 4 17,549,528 nucleotides | Chromosome 5 26,689,408 nucleotides | Total 117,429,178 nucleotides | |
| RT only | 21 | 19 | 16 | 10 | 16 | 82 |
| Complete elements | 48 | 42 | 47 | 35 | 38 | 210 |
| Nucleotides | 239,675 | 211,083 | 285,207 | 185,127 | 199,386 | 1,120,478 |
| Percentage of total nucleotides | 0.88% | 1.24% | 1.34% | 1.21% | 0.96% | 1.1% |
| Solo LTRs | 84 | 100 | 125 | 89 | 87 | 485 |
| Nucleotides | 16,516 | 19,275 | 23,906 | 15,500 | 15,248 | 90,445 |
| Percentage of total nucleotides | 0.13% | 0.16% | 0.18% | 0.15% | 0.13% | 0.15% |
| RT only | 16 | 30 | 41 | 23 | 32 | 142 |
| Complete elements | 37 | 34 | 45 | 18 | 32 | 166 |
| Nucleotides | 309,690 | 319,802 | 375,703 | 161,352 | 319,535 | 1,486,082 |
| Percentage of total nucleotides | 1.23% | 2.82% | 2.22% | 1.40% | 1.59% | 1.74% |
| Solo LTRs | 435 | 500 | 928 | 360 | 560 | 2,783 |
| Nucleotides | 228,115 | 257,810 | 326,484 | 179,500 | 262,187 | 1,254,096 |
| Percentage of total nucleotides | 1.15% | 1.74% | 1.71% | 1.42% | 1.24% | 1.42% |
| Complete elements | 7 | 8 | 8 | 4 | 11 | 38 |
| Nucleotides | 72,094 | 90,171 | 93,015 | 37,339 | 119,646 | 412,265 |
| Percentage of total nucleotides | 0.38% | 0.87% | 0.67% | 0.41% | 0.69% | 0.60% |
| Complete elements | 14 | 10 | 8 | 6 | 8 | 46 |
| Nucleotides | 131,154 | 102,534 | 83,327 | 68,754 | 103,112 | 591,944 |
| Percentage of total nucleotides | 0.44% | 0.54% | 0.52% | 0.46% | 0.56% | 0.50% |
| Complete elements | 16 | 16 | 29 | 8 | 13 | 82 |
| Nucleotides | 106,442 | 127,097 | 199,361 | 55,259 | 96,777 | 748,231 |
| Percentage of total nucleotides | 0.42% | 1.03% | 1.03% | 0.52% | 0.33% | 0.64% |
| 49 | 90 | 69 | 32 | 71 | 311 | |
| Complete elements | 85 | 76 | 92 | 53 | 70 | 376 |
| Nucleotides | 634,695 | 798,606 | 836,968 | 457,405 | 679,255 | 3,331,357 |
| Percentage of total nucleotides | 2.11% | 4.07% | 3.57% | 2.61% | 2.55% | 2.84% |
| Solo LTRs | 519 | 600 | 1,053 | 449 | 647 | 3,268 |
| Nucleotides | 386,759 | 373,256 | 444,804 | 275,361 | 364,340 | 1,844,520 |
| Percentage of total nucleotides | 1.29% | 1.90% | 1.90% | 1.57% | 1.37% | 1.57% |
| Both | ||||||
| Nucleotides | 1,021,454 | 1,171,862 | 1,281,772 | 732,766 | 1,043,595 | 5,175,877 |
| Percentage of total nucleotides | 3.40% | 5.97% | 5.46% | 4.18% | 3.91% | 4.41% |
Figure 2Arabidopsis thaliana Metaviridae and Pseudoviridae reverse transcriptase diversity. Phylogenetic trees used in this figure are adapted from [14,18]. Each tree is based on ClustalX [56] alignments of reverse transcriptase domains for elements in a given family. Neighbor-joining trees (10,000 bootstrap repetitions) were generated using MEGA2 [57]. The non-LTR retrotransposon Ta11 served as the root for both trees. The three Metaviridae sublineages are boxed.
Figure 3Physical distribution of full-length A. thaliana retroelements. The five A. thaliana chromosomes are designated as Ath1-5. Triangles indicate the location of a particular retroelement on the chromosome. Non-LTR retrotransposons are in black, Pseudoviridae in gray, and Metaviridae in white. Vertical bars on the chromosome show the precise location of the retroelement. Regions of heterochromatin are represented as follows: telomeres and NORs (on Ath2 and Ath4) by rounded chromosome ends; centromeres by hourglass shapes; heterochromatic knobs (on Ath4 and Ath5) by narrowed stretches on chromosome bars. The relatively short chromosome 5 knob is barely visible to the right of the centromere. The inset more clearly depicts heterochromatic regions that are obscured by element insertions. Chromosomes are drawn to scale.
Figure 4Chromosomal distribution of LTRs for the Metaviridae and Pseudoviridae families in A. thaliana. Chromosomes are displayed as in Figure 3. In addition, solo LTRs are drawn as open triangles. The upper chromosome depicts the distribution of Pseudoviridae, the lower the distribution of Metaviridae. In contrast to Figure 3, shading is not used to distinguish between the families.
Comparison of genome localization by retroelement lineage
| Hypotheses | Test | Group(s) tested | Accept? | |||||
| 1 | 2 | 3 | 4 | 5 | ||||
| All families are randomly distributed according to a uniform distribution | Uniform goodness of fit, 10,000 random permutations | MV(F) | No | |||||
| PV(F) | No | |||||||
| MV(S) | No | |||||||
| PV(S) | No | |||||||
| MV(R) | No | |||||||
| PV(R) | No | |||||||
| NL(R) | No | |||||||
| Retroelement family distributions are organized similarly in the genome | MRPP, 10,000 random permutations | MV(FSR), PV(FSR), NL(R) | No | |||||
| MV(FSR), PV(FSR) | No | |||||||
| MV(FSR), NL(R) | No | |||||||
| PV(FSR), NL(R) | 0.3498 | 0.8326 | 0.1468 | 0.1417 | Yes | |||
| All Metaviridae sublineages have similar distributions | MRPP, 10,000 random permutations | MV | 0.2200 | 0.1365 | 0.5676 | 0.4174 | 0.2788 | Yes |
| MV | 0.1057 | 0.3010 | 0.2657 | 0.4526 | 0.4453 | Yes | ||
| MV | 0.1687 | 0.0970 | 0.7116 | 0.3773 | 0.2781 | Yes | ||
| MV | 0.4903 | 0.1268 | 0.7341 | 0.5753 | 0.2361 | Yes | ||
| Metaviridae subtypes have similar distributions | MRPP, 10,000 random permutations | MV(FSR) | 0.7742 | 0.1247 | 0.7425 | 0.0659 | Yes | |
| MV(FS) | 0.4544 | 0.1357 | 0.4435 | 0.7241 | Yes | |||
| MV(FR) | 0.5184 | 0.9461 | 0.5750 | 0.5480 | 0.2135 | Yes | ||
| MV(SR) | 0.9068 | 0.1339 | 0.8194 | Yes | ||||
| Pseudoviridae subtypes have similar distributions | MRPP, 10,000 random permutations | PV(FSR) | 0.0509 | 0.2039 | 0.2199 | 0.0953 | Yes | |
| PV(FS) | 0.2732 | 0.0853 | 0.2665 | 0.6567 | Yes | |||
| PV(FR) | 0.5055 | 0.1185 | 0.0521 | Yes | ||||
| PV(SR) | 0.0743 | 0.5604 | 0.2513 | 0.3476 | Yes | |||
MV, Metaviridae; PV, Pseudoviridae; NL, non-LTR retrotransposon; R, RT-only; S, solo LTR; F, full-length element. p-values < 0.05 are displayed in bold text.
Association of retroelements with heterochromatin
| Hypotheses | Test | Group(s) tested | Accept? | |
| All families share a similar probability of being in or outside heterochromatin | MRPP, 10,000 random permutations | MV(FSR), PV(FSR), NL(R) | No | |
| MV(FSR), PV(FSR) | No | |||
| MV(FSR), NL(R) | No | |||
| PV(FSR), NL(R) | No | |||
| Metaviridae sublineages have similar heterochromatic distributions | MRPP, 10,000 random permutations | MV | No | |
| MV | No | |||
| MV | 0.5211 | Yes | ||
| MV | No | |||
| Element subtypes have similar heterochromatic distributions | MRPP, 1,000 random permutations | MV(SR), PV(SR), NL(R) | No | |
| MV(FR), PV(FR), NL(R) | 0.3960 | Yes | ||
| MV(FS), PV(FS) | No | |||
| Pseudoviridae subtypes have similar heterochromatic distributions | Pearson's chi-square | PV(FSR) | No | |
| PV(FS) | No | |||
| PV(FR) | No | |||
| PV(SR) | 0.9419 | Yes | ||
| Metaviridae subtypes have similar heterochromatic distributions | Pearson's chi-square | MV(FSR) | No | |
| MV(FS) | No | |||
| MV(FR) | 0.5146 | Yes | ||
| MV(SR) | No |
MV, Metaviridae; PV, Pseudoviridae; NL, non-LTR retrotransposon; R, RT-only; S, solo LTR; F, full-length element. p-values < 0.05 are displayed in bold text.
Figure 5Relative ages of A. thaliana LTR retroelement lineages. (a) Box-plot showing the age distribution of Pseudoviridae full-length elements contrasted with those of the Metaviridae. The position of the median is shown as a gray bar in the box that delimits the boundaries of the lower and upper quartiles. Data points more than 1.5 times the inter-quartile range above the upper quartile or below the lower quartile are indicated by individual horizontal lines. Ages were calculated as described in Materials and methods. (b) Relative-age box-plots of Metaviridae sublineages. Permutation test p-values for the significance of the displayed age distributions are shown below each box-plot.
Comparison of LTR retroelement age distributions
| Hypotheses | Test | Group(s) tested | Chromosome | Accept? |
| Metaviridae and Pseudoviridae have similar age distributions in the genome | MRPP, 10,000 random permutations | MV(F), PV(F) | No | |
| Metaviridae sublineages have similar age distributions | MRPP, 10,000 random permutations | MV | No | |
| MV | No | |||
| MV | No | |||
| MV | 0.4618 | Yes | ||
| Metaviridae age distributions are similar whether the elements are in or out of heterochromatin | MRPP, 10,000 random permutations | MV(F) | No | |
| Metaviridae sublineage age distributions are similar whether they are in or outside heterochromatin | MRPP, 10,000 random permutations | MV | No | |
| MV | 0.5747 | Yes | ||
| MV | No | |||
| Pseudoviridae age distributions are similar whether the elements are in or outside heterochromatin | MRPP, 10,000 random permutations | PV(F) | No |
MV, Metaviridae; PV, Pseudoviridae; NL, non-LTR retrotransposon; R, RT-only; S, solo LTR; F, full-length element. p-values < 0.05 are displayed in bold text.