| Literature DB >> 36060303 |
Xin Tao1,2, Ziwen Huang1, Fan Chen1, Xinli Wang1, Tingting Zheng3, Shaochun Yuan1,4, Anlong Xu1,5.
Abstract
Entities:
Year: 2022 PMID: 36060303 PMCID: PMC9435367 DOI: 10.1093/nsr/nwac073
Source DB: PubMed Journal: Natl Sci Rev ISSN: 2053-714X Impact factor: 23.178
Figure 1.Evolution of the RAG1 and Transib homologs. (a) Schematic diagram showing the genomic organization of RAG-like (RAGL) homologs in Aureococcus anophagefferens (pelagophyte), Fungia costulata (stony coral), Branchiostoma belcheri (lancelet) and Mus musculus (mouse). The diagram is not scaled to sequence length. Species names are abbreviated using three letters as shown, which was also applied in the following context. 5’-/3’-untranslated region (UTR), transcription start site (TSS), exons, Poly(A) sites, 5’-/3’-terminal inverted repeat (TIR) and target site duplication (TSD) are shown as indicated. The 5’-/3’-UTRs and coding sequences of FcoRAGL were predicted using FGENESH. (b) The consensus sequences of mouse RSSs and RAGL TIRs are shown in a weblogo diagram through an unspaced alignment. The spaced sequence alignment of TIRs from RAGL transposons and mouse RSSs show a common bipartite conserved structure among them. The conserved elements in TIRs are shaded using different colors. The highly conserved nucleotides are colored yellow. Pmi, Patiria miniata; Pfl, Ptychodera flava; Nge, Notospermus geniculatus. (c) Detection of 5’-terminal and 3’-terminal flanking sequences of AanRAGL in the A. anophagefferens using agarose gel electrophoresis. The flanking sequences were cloned using the splinkerette-PCR method (primers are listed in Supplementary Table S2) and the original genomic DNA was extracted from the CCMP 1984 strain of A. anophagefferens (NCMA cultures). (d) Schematic diagram of two AanRAGL loci identified from the splinkerette-PCR assays. The identical regions between the loci and the reference genome are shown by the dash lines. (e) Detection of the AanRAGL insertion in Scaffold 5 from two strains of A. anophagefferens. g1: CCMP1984 strain from National Center for Marine Algae and Microbiota (NCMA); g2: A. anophagefferens from Institute of Oceanology Chinese Academy of Sciences (IOCAS). (f) Phylogenetic analysis of RAG1 and Transib homologs. The maximum-likelihood phylogenetic tree was constructed using IQ-TREE based on the core region of RAG1 and Transib homologs. The optimum LG+F+R6 model was tested and selected, and the ultrafast bootstrap (%) support is shown near the branches. The newly identified Transib proteins are marked using a prefix composed of a three-letter abbreviation of the species name, like those of RAG1 homologs. The primary diversification of protein domains is shown near the branches of the protein clans. The Transib family was subdivided into two subgroups according to phylogenetic relationships of Transib proteins. Osp, Ophiothrix spiculata; Rro, Rhizophlyctis rosea; Mle, Mnemiopsis leidyi; Pba, Pleurobrachia bachei; Jsi, Juglans sigillata; Cch, Capsicum chinense; Cli, Corrigiola litoralis; Hov, Hordeum vulgare; Hvu, Hydra vulgaris; Sla, Silene latifolia; Aps, Austropuccinia psidii; Epu, Erysiphe pulchra; Mpl, Massospora platypediae; Cth, Candidatus thioglobus. The complete phylogenetic tree is shown in Supplementary Fig. S3b. The complete list of species abbreviations is shown in Supplementary Table S3. (g) Sequence alignment of the spaced TIRs from Subgroup I and Subgroup II Transibs. The conserved elements in TIRs are shaded using different colors. The highly conserved nucleotides are colored yellow. Species abbreviation as shown in Fig. 1f. (h) Multiple sequence alignment of representative RAG1 and Transib homologs showing the domain diversification. The Transib subgroups are as defined in Fig. 1g and the conserved amino acids in three types of CTT domains are shaded in color. Mmu, MmuRAG1; Hsa, HsaRAG1; Bbe, BbeRAG1L; Nge, NgeRAG1L_3820; Fco, FcoRAG1L; Rro, RroRAG1L_0490; Aan, AanRAG1L; Mle, MleTransib_3931; Hm9, HvuTransib-9_HMp; Pba, PbaTransib_8175; Cth, CthTransib_0091; Hze, HzeTransib; Hm7, HvuTransib-7_HMp; Cch, CchTransib_3309; Epu, EpuTransib_3173; Eba, EbaTransib_0044. CTT, C-terminal tail; CTT1, type I C-terminal tail; CTT2, type II C-terminal tail. The complete sequence alignment is shown in Supplementary Fig. S5. (i) Model on the origin and evolution of RAG and Transib. The RAG1 and Transib homologs were proposed to have descended from a common bacterial Transib ancestor. After descending from the bacterial Transib ancestor, descendants in Subgroup II experienced complex domain loss and acquisition and spread broadly in different hosts through HGT (horizontal gene transfer) (indicated using red dashed lines). However, descendants in Subgroup I inherited many ancestral characteristics, such as the α11–α12 region, the CTT1 domain and the asymmetric TIRs. At the early times of eukaryotes, one member of the Subgroup I Transibs acquired RAG2L to generate the ancient RAGL transposon. This ancient RAGL then underwent host domestication events in a vertical manner, such as gene duplication [3], transposon fossilization, loss of CTT1, selection of R848 in RAG1 and gain of hinge region in RAG2 [9]. The vertebrate type of RAGLs were generated after duplication and divergence of the primitive RAGLs, which are distinguished using two different blue colors. The fossilized RAGL transposons in Echinoderms are indicated by gray TIRs. The evolution of RAG and Transib after splitting from the common bacterial ancestor are independently shown in the upper and lower parts, respectively, but this does not mean that their host species were separately evolved.