| Literature DB >> 25725060 |
Bi Ma1, Tian Li1, Zhonghuai Xiang1, Ningjia He2.
Abstract
Mulberry has been used as an economically important food crop for the domesticated silkworm for thousands of years, resulting in one of the oldest and well-known plant-herbivore interactions. The genome of Morus notabilis has now been sequenced and there is an opportunity to mine the transposable element (TE) data. To better understand the roles of TEs in structural, functional and evolutionary dynamics of the mulberry genome, a specific, comprehensive and user-friendly web-based database, MnTEdb, was constructed. It was built based on a detailed and accurate identification of all TEs in mulberry. A total of 5925 TEs belonging to 13 superfamilies and 1062 families were deposited in this database. MnTEdb enables users to search, browse and download the mulberry TE sequences. Meanwhile, data mining tools, including BLAST, GetORF, HMMER, Sequence Extractor and JBrowse were also integrated into MnTEdb. MnTEdb will assist researchers to efficiently take advantage of our newly annotated TEs, which facilitate their studies in the origin, amplification and evolution of TEs, as well as the comparative analysis among the different species. Database URL: http://morus.swu.edu.cn/mntedb/Entities:
Mesh:
Substances:
Year: 2015 PMID: 25725060 PMCID: PMC4343074 DOI: 10.1093/database/bav004
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Summary of identified TEs in mulberry WGS assembly
| Class | Order | Superfamily | Members | Families |
|---|---|---|---|---|
| Retrotransposons | LTR | 1557 | 226 | |
| 1415 | 145 | |||
| 722 | 312 | |||
| 254 | 119 | |||
| LINE | 19 | 19 | ||
| 30 | 30 | |||
| DNA transposons | TIR | 286 | 31 | |
| 1085 | 44 | |||
| 249 | 38 | |||
| 136 | 39 | |||
| 1 | 1 | |||
| MITE | 136 | 26 | ||
| Helitron | 35 | 32 | ||
| Total | 5925 | 1062 |
Annotation of TE superfamilies in the mulberry WGS assembly
| Class | Order | Superfamily | Masked (bp) | Percentage of Masked (%) | Percentage of genome (%) |
|---|---|---|---|---|---|
| Retrotransposons | LTR | 34 541 580 | 24.13 | 10.44 | |
| 30 419 960 | 21.25 | 9.20 | |||
| 28 414 859 | 19.85 | 8.59 | |||
| 2 005 679 | 1.40 | 0.61 | |||
| unclassified | 46 818 | 0.03 | 0.01 | ||
| LINE | 388 544 | 0.27 | 0.12 | ||
| 974 028 | 0.68 | 0.29 | |||
| SINE | 680 | 0.00 | 0.00 | ||
| DNA transposons | TIR | 6 270 533 | 4.38 | 1.90 | |
| 9 525 810 | 6.65 | 2.88 | |||
| 7 834 412 | 5.47 | 2.37 | |||
| 1 273 395 | 0.89 | 0.38 | |||
| 256 524 | 0.18 | 0.08 | |||
| 17 917 995 | 12.52 | 5.42 | |||
| unclassified | 42 381 | 0.03 | 0.01 | ||
| Helitron | 3 258 215 | 2.28 | 0.98 | ||
| Total | 143 171 413 | 100.00 | 43.28 |
Comparison of the MITE in mulberry with other Rosaceae species
| TEs | Mulberry | Apple | Pear | Strawberry | Peach | |
|---|---|---|---|---|---|---|
| DTM | Element no | 5378 | 158 680 | 33 701 | 13 789 | 16 178 |
| Total length (bp) | 740 060 | 26 867 238 | 5 976 797 | 3 347 380 | 2 988 107 | |
| DTC | Element no | 1874 | 140 | |||
| Total length (bp) | 496 676 | 21 797 | ||||
| DTH | Element no | 85 083 | 42 823 | 35 745 | 5197 | 12 371 |
| Total length (bp) | 12 374 275 | 9 747 607 | 7 525 899 | 1 370 066 | 3 151 797 | |
| DTA | Element no | 8532 | 32 324 | 17 297 | 7223 | 7315 |
| Total length (bp) | 2 432 239 | 7 292 996 | 3 243 583 | 2 134 167 | 1 893 397 | |
| DTT | Element no | 15 638 | 69 677 | |||
| Total length (bp) | 2 371 421 | 12 174 360 | ||||
| DTx | Element no | 1999 | 21563 | 8531 | 3246 | |
| Total length (bp) | 254 302 | 2525527 | 2 094 222 | 804 373 | ||
| Total | Element no | 114 631 | 237 700 | 177983 | 34 880 | 39 110 |
| Total length (bp) | 17 917 995 | 44 658 819 | 31446166 | 8 967 632 | 8 837 674 | |
| Genome size (MB) | 330.79 | 881.28 | 508.55 | 206.89 | 227.25 | |
| Percentage of genome (%) | 5.42% | 5.07% | 6.18% | 4.33% | 3.89% | |
The MITE information of apple, strawberry and peach in this table were retrieved from plant MITE database (P-MITE, http://pmite.hzau.edu.cn/django/mite/) (47). The MITE information of pear was generated by using MITE-Hunter with default parameters. The consensus sequences generated by MITE-Hunter were manually checked using MSA files. Superfamilies are represented using different letters: DTT for Tc1/Mar, DTM for MuLE, DTA for hAT, DTC for CMC, DTH for PIF-Harbinger and DTx for unclassified superfamily.
Figure 1.MnTEdb organization and the description of some functional sections in the database. (A) The top menu of MnTEdb. (B) The side menu of MnTEdb. (C) The user interface of browsing in MnTEdb. User can browse the detailed information of each superfamily and family by clicking the hyperlinks provided in this page. (D) The searching interface of MnTEdb. Two search approaches are provided for user in MnTEdb, including search using ‘ID’ and ‘Family’. All the search results can be shown under the search page. (E) Multiple approaches for TE sequences downloading have been provided. Data can be downloaded by clicking the marked region (green arrow).
Figure 2.Snapshots of analysis tools provided in MnTEdb. (A) The BLAST interface (left) and a sample of BLASTn results (right). (B) The GetORF interface and the snapshots of the output results. (C) HMMER interface of a test protein sequence in MnTEdb. (D) An example of the input and output interface of the Sequence extractor. (E) Genome sequence view in JBrowse of a region in scaffold1. The gene models from the v1.0 genome version of M. notabilis were embedded in the Gene track.