| Literature DB >> 31114637 |
Andrei S Guliaev1, Seraphima K Semyenova1.
Abstract
BACKGROUND: Genomes of eukaryotes are inhabited by myriads of mobile genetic elements (MGEs) - transposons and retrotransposons - which play a great role in genome plasticity and evolution. A lot of computational tools were developed to annotate them either in genomic assemblies or raw reads using de novo or homology-based approaches. But there has been no pipeline enabling users to get coding and flanking sequences of MGEs suitable for a downstream analysis from genome assemblies.Entities:
Keywords: Flatworms; Genome analysis; Mobile genetic elements; PLE; Penelope; Retrotransposons; Schistosoma
Year: 2019 PMID: 31114637 PMCID: PMC6515669 DOI: 10.1186/s13100-019-0163-6
Source DB: PubMed Journal: Mob DNA
Fig. 1MGERT workflow chart. The genome assembly used as input for RepeatModeler for de novo search and classification of MGEs; their families that have been chosen by a user are pipelined to RepeatMasker for similarity search; these regions of similarity then being excised by BEDTools; conserved domain homology search by RPS-Blast followed by ORF extraction by ORFinder; adding flanking regions of specified length. Black outline of the input squares designates mandatory files; a black outline of output squares designates the output is used in the following step of the pipeline
Fig. 2Number, median length and maximum length of PLEs and L1 hits retrieved by MGERT with different merge parameter
Fig. 3Phylogenetic tree based on amino acid sequences of PLEs found in the genome of S.mansoni by MGERT and PLE representatives from RepBase. TERT sequences were used as the outgroup. Blacktip labels represent PLE found by MGERT, other colours - PLE obtained from RepBase. Black circles designate previously known PLEs from genus Schistosoma. Tree branch width corresponds to bootstrap support value (less or greater than 50)