| Literature DB >> 32977771 |
Alec Vallota-Eastman1, Eleanor C Arrington1, Siobhan Meeken2, Simon Roux3, Krishna Dasari4, Sydney Rosen4, Jeff F Miller5,6, David L Valentine7,8, Blair G Paul9.
Abstract
BACKGROUND: Cyanobacteria maintain extensive repertoires of regulatory genes that are vital for adaptation to environmental stress. Some cyanobacterial genomes have been noted to encode diversity-generating retroelements (DGRs), which promote protein hypervariation through localized retrohoming and codon rewriting in target genes. Past research has shown DGRs to mainly diversify proteins involved in cell-cell attachment or viral-host attachment within viral, bacterial, and archaeal lineages. However, these elements may be critical in driving variation for proteins involved in other core cellular processes.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32977771 PMCID: PMC7517822 DOI: 10.1186/s12864-020-07052-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Schematic overview of a DGR and their prevalence in cyanobacterial genomes. a Three primary steps in the process of mutagenic homing are shown: 1) conserved template region (TR) in the DGR cassette is transcribed into intermediate, non-coding RNA, which is the substrate for DGR reverse transcriptase (DGR-RT). 2) Template-primed reverse transcription of TR-RNA is highly error-prone at adenines, which thus incorporates random nucleotides at specific positions in the resulting cDNA. 3) The new cDNA molecule is integrated into the variable region (VR) at a fixed locus, resulting in the replacement of a portion of the target gene (~ 100 – 200 bp). Genomic surveys suggest that VRs occur almost exclusively near the 3′ terminus of a target gene. Additional “remote” VP genes (i.e. paralogs) may be found in non-DGR loci throughout the genome, which have detectable TR vs VR homology. b Summary of 52 cyanobacteria genomes known to have DGR components (in Fig. 1a) spanning 31 genera. Genera with ≥2 DGR-containing genomes annotated. c DGR feature occurrence normalized to genome number
Fig. 2Broad RT phylogeny compared amongst all known DGR-containing lineages. RT phylogeny across all known DGR-containing lineages with Group-II Introns as the outgroup (highlighted cyanobacterial clade expanded in Fig. 3)
Fig. 3Phylogenetic reconstruction for DGR-containing cyanobacteria and DGR-RT phylogeny. a Phylogeny for concatenated ribosomal protein alignments, including all DGR-containing species. Filled boxes (left) indicate DGR-RT containing species and the corresponding RT clade. RT clades 1 to 3 were defined based on basal branch support. b DGR-RT phylogeny with cyanobacterial physiological subsections highlighted in color. Circles indicate branch support values (hollow > 50%; filled > 70%).
Fig. 4Distribution of CLec variable protein paralogs in cyanobacterial genomes. DGR-internal variable proteins and remote variable proteins are indicated by triangles and asterisks over corresponding representatives. The CLec paralogs of VPs and RVPs are also shown for each genome. The CLec domain found in all paralogs is indicated by either a red square (CLec only) or two grafted domains shown as adjacent rectangles (C-terminal CLec in red; N-terminal domains in various colors). For clarity, paralogs of the same domain architecture in Trichodesmium erythraeum IMS101 are indicated below the representative protein (e.g. × 2). Note: only a representative subset of DGR-containing genomes is shown (15 of 52 genomes)
Fig. 5Representative DGR-internal variable protein (VP) domain architectures. Representative domain architectures for DGR-internal variable proteins. Protein domains are colored according to pHMMR domain assignment. Variable regions are shown in deep purple. Additional features, including predicted signal peptides and transmembrane helices, are also indicated. An example species is shown to left of each VP architecture