| Literature DB >> 35197094 |
Blair G Paul1, A Murat Eren2,3.
Abstract
Since the first discovery of reverse transcriptase in bacteria, and later in archaea, bacterial and archaeal retroelements have been defined by their common enzyme that coordinates diverse functions. Yet, evolutionary refinement has produced distinct retroelements across the tree of microbial life that are perhaps best described in terms of their programmed RNA-a compact sequence that preserves core information for a sophisticated mechanism. From this perspective, reverse transcriptase has been selected as the modular tool for carrying out nature's instructions in various RNA templates. Beneficial retroelements-those that can provide a fitness advantage to their host-evolved to their extant forms in a wide array of microorganisms and their viruses, spanning nearly all habitats. Within each specialized retroelement class, several universal features seem to be shared across diverse taxa, while specific functional and mechanistic insights are based on only a few model retroelement systems from clinical isolates. Currently, little is known about the diversity of cellular functions and ecological significance of retroelements across different biomes. With increasing availability of isolate, metagenome-assembled, and single-amplified genomes, the taxonomic and functional breadth of prokaryotic retroelements is coming into clearer view. This review explores the recently characterized classes of beneficial, yet accessory retroelements of bacteria and archaea. We describe how these specialized mechanisms exploit a form of fixed mobility, whereby the retroelements do not appear to proliferate selfishly throughout the genome. Moreover, we discuss computational approaches for systematic identification of retroelements from vast sequence repositories and highlight recent discoveries in terms of their apparent distribution and ecological significance in nature. Lastly, we present a new perspective on the eco-evolutionary significance of these genetic elements in marine bacteria and demonstrate approaches that enable the characterization of their environmental diversity through metagenomics.Entities:
Year: 2022 PMID: 35197094 PMCID: PMC8867640 DOI: 10.1186/s13100-022-00262-6
Source DB: PubMed Journal: Mob DNA
Fig. 1Overview of the molecular mechanisms of DGRs and retrons. DGRs and retrons are highlighted among other modern retroelements, with group-II introns shown as the ancestral progenitor. A prototypical DGR is depicted with the stages of adenine mutagenesis and retrohoming that involves template (TR) ncRNA (non-coding RNA) expression, cDNA synthesis by the RT-Avd complex, and site-specific integration into the variable repeat (VR) of a target gene. A prototypical retron is shown with the initiation of cDNA synthesis at a branched guanosine in retron msr RNA. The resulting msr-msd molecule may interact with products of one or more effector genes to make up multipartite retron defense systems
Fig. 2Summary of the functional roles of domesticated retroelements in prokaryotes. The functional significance of DGR hypermutation is depicted in phage that diversify tail tip proteins in order to attach to variable host receptors. Additional cellular DGR targets in Legionella pneumophila and Treponema denticola encode outer membrane proteins whose specific functions remain unknown. Retrons in E. coli have been shown to activate effector proteins that trigger abortive infection and cell arrest. In the case of other bacterial retrons, msDNA may directly inhibit effector proteins, serving as a toxin/antitoxin system. Figure created with BioRender.com
Fig. 3A bioinformatics workflow to survey DGR activity and ecology in metagenomes. A Genome map of Trichodesmium erythraeum strain IMS101, indicating the positions of two DGRs, one retron, and several DGR target genes (purple triangles). B An anvi’o [68] visualization of the coverage of a section of the T. erythraeum genome with short reads recruited from the Tara Oceans Project metagenome N000000184, and the distribution of single-nucleotide variants (SNVs). C A detailed representation of the decrease in coverage and increase in the SNV density in Target Gene 1, which indicates the presence of DGR activity in environmental populations. D Distribution of single-amino acid variants (SAAVs) in the same region of the gene. SAAVs are calculated based on the frequency of codons found in metagenomic short reads that fully map to a given codon position in the gene [61], thus, they are much more effective to quantify the extent of non-synonymous variation environmental populations of Trichodesmium have accumulated. E Oligotyping analysis of all metagenomic short reads that fully match to the primer sequence, location of which is indicated by the horizontal bar in panels C and D. The sequence pattern takes into consideration conserved and variable nucleotide positions as revealed by SNVs in read recruitment results. Metagenomic short reads that match the pattern from each sample were subjected to an oligotyping analysis [69], during which high-entropy nucleotide positions divide sequences into distinct, ecologically relevant groups. While the bar plots visualize the oligotype profiles per sample and their diversity, the dendrogram on the left shows how samples relate to each other based on Jaccard Similarity. Finally, arrows point to the sampling sites near Hawaii and Madagascar that correspond to metagenomes that were used in this analysis on a world map