| Literature DB >> 12372141 |
Eugene M McCarthy1, Jingdong Liu, Gao Lizhi, John F McDonald.
Abstract
BACKGROUND: Long terminal repeat (LTR) retrotransposons constitute a major fraction of the genomes of higher plants. For example, retrotransposons comprise more than 50% of the maize genome and more than 90% of the wheat genome. LTR retrotransposons are believed to have contributed significantly to the evolution of genome structure and function. The genome sequencing of selected experimental and agriculturally important species is providing an unprecedented opportunity to view the patterns of variation existing among the entire complement of retrotransposons in complete genomes.Entities:
Mesh:
Substances:
Year: 2002 PMID: 12372141 PMCID: PMC134482 DOI: 10.1186/gb-2002-3-10-research0053
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Summary of rice LTR retrotransposons characterized in this study
| Family | Pre-existing name(s) | Accession number of exemplar | Location | Chromosome number | LTR length (bp) | Inserted element length | TSR | %LNI (mean for family) | Approximate copy number (haploid genome)‡ |
| AC023240 | 100410-106807 | 10 | 965 | 6,398 | AGTCC | 98.1 | 250 | ||
| AL442110 | 95121-100070 | 4 | 267 | 4,950 | ATATT | 98.5 | <50 | ||
| AF458765 | 51-5250 | ? | 146 | 5,200 | CATTC | 99.3 | 50-100 | ||
| AB026295 | 160208-165872 | 6 | 350 | 5,665 | GTTAC | 98.9 | <50 | ||
| AC021891 | 56044-62135 | X | 477 | 6,092 | TACAG | 96.2 | <50 | ||
| AP001366 | 57569-62773 | 1 | 440 | 5,205 | ACCTG | 99.8 | <50 | ||
| AP002538 | 44996-53915 | 1 | 1608 | 8,920 | AGTTT | 98.8 | <50 | ||
| AC021891 | 65191-74406 | X | 1220 | 9,216 | TAAAT | 97.2 | 1100 | ||
| AP000969 | 25869-28634 | 1 | ND | ND | ND | ND | 50-100 | ||
| AC069324 | 137920-139740 | 10 | ND | ND | ND | ND | 400 | ||
| AP003853 | 96975-98088 | 1 | ND | ND | ND | ND | <50 | ||
| AC073166 | 104289-109024 | 10 | 221 | 4,736 | AGAAG | 99.7 | <50 | ||
| AC073405 | 72924-79364 | 5 | 968 | 6,441 | TATGT | 99.6 | 650 | ||
| AC069324 | 8821-17191 | 10 | 319 | 8,371 | CTCCC | 97.6 | 350 | ||
| AP002867 | 127118-132180 | 1 | 262 | 5,062 | GCTTC | 94.5 | 250 | ||
| AP002845 | 42644-49551 | 1 | 300 | 6,908 | TGCTT | 97.9 | <50 | ||
| AC018727 | 102539-96583 | 10 | 501 | 5,957 | TCATC | 99.6 | 50-100 | ||
| AC068654 | 23423-25036 | X | ND | ND | ND | ND | <50 | ||
| AC069300 | 73013-77731 | 10 | 205 | 4,719 | GGGAC | 99.5 | 50-100 | ||
| AC084406 | 8749-14200 | 3 | 286 | 5,452 | TTATA | 97.9 | 50-100 | ||
| AC087545 | 81711-84269 | 10 | ND | ND | ND | ND | 50-100 | ||
| AC074283 | 24546-19810 | 10 | 191 | 4,647 | GAACC | 97.9 | 50-100 | ||
| AP002843 | 144255-139782 | 1 | 209 | 4,774 | AGGAT | 99.5 | 50-100 | ||
| AC016781 | 25997-30858 | ND | 221 | 4,852 | CCGAG | 98.6 | <50 | ||
| AP001278 | 28729 35569 | 1 | 417 | 6,841 | TCGAG | 98.9 | 500§ | ||
| AP001111 | 59274-70587 | 5 | 440 | 11,314 | GATAT | 97.9 | 500 | ||
| AP000399 | 75139-88038 | 6 | 1087 | 12,900 | AATAT | 99.0 | 900 | ||
| AP002539 | 139654-121650 | 1 | 2195 | 18,005 | GTTAT | 99.0 | <50 | ||
| AP002747 | 78609-87615 | 1 | 656 | 9,007 | GGAAC | 96.0 | 550 | ||
| AC078891 | 52683-65684 | 10 | 1507 | 13,002 | ACTTT | 97.2 | 1500 | ||
| AP003054 | 102778-110180 | 1 | 787 | 7,403 | AAACC | 99.9 | <50 | ||
| AP002820 | 111559-12278 | 1 | ND | ND | ND | ND | 50-100 | ||
| AP002864 | 35539-47557 | 6 | 3009 | 12,009 | CACAC | 99.1 | 550 | ||
| AF111709 | 25889-38685 | 5 | 3292 | 12,797 | AGAAA | 99.4 | 450 | ||
| AC068924 | 94924-100611 | 10 | 423 | 5,688 | CTAAT | 98.3 | <50 | ||
| AP001551 | 59722-64876 | 1 | 319 | 5,155 | GGTCA | 98.4 | <50 | ||
| AC068654 | 2534-6969 | X | 794 | 4,436 | CTTGA | 98.9 | 600 | ||
| AF458766 | 31-5535 | ? | 332 | 5,525 | TGAGG | 96.2 | <50 | ||
| AF458767 | 51-5267 | ? | 368 | 5,217 | CAAAG | 97.6 | <50 | ||
| AC020666 | 65731-77151 | 10 | 564 | 11,421 | ACATG | 98.3 | 600 | ||
| AP003631 | 27347-43001 | 1 | 518 | 15,655 | GGTTC | 97.7 | 300 | ||
| AF458768 | 51-5655 | ? | 358 | 5,605 | ATGTC | 99.9 | <50 | ||
| AP000815 | 77117-78910 | 1 | 291 | 1,794 | CTGAT | 98.6 | <50 | ||
| AP000364 | 41541-42747 | 8 | 148 | 1,207 | AACAA | 99.9 | <50 |
*Location given is for an example RT in the GBRD (no full-length element was identified for this family). †As a full-length element is known in the MRD, the TSR and lengths of the LTR and element (columns 5-7) are taken from an element in the MRD while the location (if given) in columns 2-4 refers to an RT in the GBRD. ‡Percentages based on number of hits using a sample LTR from each family as query to search the MRD. §N. Jiang and S.R. Wessler (unpublished work) suggest that if pericentric DNA (which is largely heterochromatic) is taken into account, Osr25 elementsexist at a higher copy number (~1,000 copies in the entire genome) than our survey, based largely on euchromatic sequences, would suggest. ND, not determined.
Figure 1RT-based neighbor-joining tree for copia-like retrotransposons. Distances (uncorrected p) appear next to each branch. RT sequences from plant species other than rice are included for comparison.
Non-rice RTs used in phylogenies
| Name of retrotransposon | Accession number | Host organism |
| T04112 | ||
| T02087 | ||
| AAK73108 | ||
| T17429 | ||
| S27768 | ||
| AAG52949 | ||
| S23315 | ||
| NP_175303 | ||
| T01860 | ||
| NP_178752 | ||
| NP_174802.1 | ||
| AAF13073.1 | ||
| NP_179047 | ||
| AAD19359 | ||
| AAD22153 | ||
| AF326781† | ||
| CAA73042 |
*Previously unnamed RT found by BLAST searches of the GBRD, using rice RTs found in our study as queries. Acr, Ananas comosus retrotransposon; Atr, A. thaliana retrotransposon; Zmr, Z. mays retrotransposon.
Previously named low-copy families for which a full-length exemplar has not been presented in this paper
| Family | Pre-existing family name | Accession number (or source) of sequence |
| T03709 | ||
| T03704 | ||
| T03705 | ||
| T03706 | ||
| T03707 | ||
| Hirochika | ||
| T03711 | ||
| T03712 | ||
| T03716 | ||
| T03721 | ||
| T03723 | ||
| Kumekawa | ||
| T03666 | ||
| T03669 | ||
| T03671 |
Figure 2RT-based neighbor-joining tree for gypsy-like retrotransposons. Distances (uncorrected p) appear next to each branch. RT sequences from plant species other than rice are included for comparison.