| Literature DB >> 22204608 |
Nicolas J Tourasse1, Fredrik B Stabell, Anne-Brit Kolstø.
Abstract
BACKGROUND: Group II introns are widespread genetic elements endowed with a dual functionality. They are catalytic RNAs (ribozymes) that are able of self-splicing and they are also mobile retroelements that can invade genomic DNA. The group II intron RNA secondary structure is typically made up of six domains. However, a number of unusual group II introns carrying a unique extension of 53-56 nucleotides at the 3' end have been identified previously in bacteria of the Bacillus cereus group.Entities:
Year: 2011 PMID: 22204608 PMCID: PMC3261151 DOI: 10.1186/1756-0500-4-564
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Currently identified group II introns carrying a domain VII
| Intron#,& | Intron copy, Strain, Genbank accession number, Genomic coordinates | Intron's host gene predicted product | Phylogenetic subgroup within the B class | Reference |
|---|---|---|---|---|
| a, | pXO1-70; hypothetical protein with DNA primase domain | α | [ | |
| a, | pXO1-08; hypothetical protein with two helicase domains | β | [ | |
| a, | pXO1-42; annotated as a protein belonging to the TraG/TraD family of plasmid proteins involved in bacterial conjugation, however, shows higher though weak homology to proteins of type IV secretion systems of the VirB/VirD family (see [ | β | [ | |
| a, | same as | β | [ | |
| a, | hypothetical protein ( | α | [ | |
| B.my.I1 | not known due to missing sequence | β | [ | |
| intron inserted in non-coding region | β | [ | ||
| DNA primase | β | This study | ||
| hypothetical protein | α | This study | ||
The entire nucleotide sequences (IEP-encoding ORF included) of B.c.I4a, b, c, and d are identical and inserted in the corresponding host gene. B.th.I5a and b differ at three nucleotide positions only. B.th.I6b, c, and d are identical to each other, while B.th.I6a and e are 98.4% and 99% identical to the former introns, respectively, and B.th.I6f differs from B.th.I6b, c, and d at a single nucleotide position. B.c.I16a and b/c are ~90% identical to the various B.th.I6 copies. All B.th.I6 and B.c.I16 copies are located in the same host gene. Due to missing sequence data, it could not be confirmed whether the B.c.I16 intron is present in one or two copies in the B. cereus F65185 strain. The entire nucleotide sequences of B.th.I7a and c are identical. Part of the IEP-encoding ORF sequence of B.th.I7b is missing from the genomic data, whereas the ribozyme sequence is complete and identical to that of B.th.I7a and c. The entire nucleotide sequences of B.my.I1 and B.ps.I1 are 96.5% identical
B.th.I6-like intron fragments were recently identified in two B. thuringiensis isolates from Mexico (Genbank accession numbers JF800177 and JF800178)
Information in italics is based on sequence data from the Microgen website http://www.micro-gen.ouhsc.edu/b_thuring/b_thuringiensis_home.htm. Genome assembly from May 3, 2011
Figure 1Phylogenetic tree of pXO1-42 sequences from . Specific groups of strains have been colored to emphasize the incongruence between the two trees. In A), the tree was reconstructed from the nucleotide sequences of pXO1-42 using the Neighbor-Joining method applied to a pairwise distance matrix computed following Tamura's 3-parameter model. For strains with known plasmids, the plasmid name is given in parentheses. Strains whose pXO1-42 sequence contains an intron are shown in bold (intron name given in square brackets and yellow background; due to incomplete sequence data, it could not be confirmed whether strain F65185 carries one or two copies of B.c.I16). Numbered curly brackets indicate four pXO1-42 groups (strains labeled in different colors) sharing identical sequences around the intron's homing site, while remaining strains (in black) have variable sequences not belonging to these groups (sequences shown in Figure 2A). In B), the tree was extracted from a supertree of 1403 isolates based on multiple locus sequence typing (MLST) data of chromosomal housekeeping genes available in the SuperCAT database http://mlstoslo.uio.no/. No MLST data are available for strain BGSC 4I5. Strains whose genome has been completely sequenced are marked with asterisks. In A) and B) numbers next to branch nodes indicate statistical support values when > 50%. Scale bars are in average numbers of nucleotide substitutions per site. Origin and information about the strains can be found at the University of Oslo's typing website, http://mlstoslo.uio.no/. pXO1-42 was identified in four additional strains (AH1271, AH1272, AH1273, and AH717), however they were not included here because their genotyping data are conflicting (see [29])
Figure 2A) Multiple alignment of . The sequences shown span positions -25 to +15 around the intron insertion site (indicated by square brackets). Sequences numbered 1-4 correspond to the four groups shown in Figure 1A. Strains with different sequences not belonging to these groups (shown in black in Figure 1A) are individually listed by name (strain F65185 is not included due to incomplete sequence data). Nucleotide positions that differ relative to the top sequence are displayed in a black background. B). Insertion sites of the B.th.I7a and b intron copies in B. thuringiensis kurstaki BGSC 4D1/HD1. Sites that exhibit identical nucleotides in both sequences are indicated by asterisks. B.th.I7a and B.th.I7b have identical sequences and are inserted in plasmidic and chromosomal loci, respectively. Intron boundaries are delimited by brackets. The intron binding sites (IBS1, IBS2, and IBS3) in the exons are boxed and their complementary exon binding sites (EBS1, EBS2, and EBS3) in the intron are indicated underneath. The similarity between the insertion sites is weak overall, and is limited to the IBS2 motif, suggesting retrotransposition of B.th.I7 into ectopic sites
Figure 3Predicted secondary structure of the . Roman numerals (I to VI) indicate the six typical functional RNA domains. The extra 70-nt 3' segment is boxed in gray. ORF, intron-encoded multifunctional open reading frame. Numbering of residues does not include the ORF. The inset shows a comparison of the secondary structure of the 70-nt 3' extension of B.psf.I1 and the consensus structure of the 53/56-nt domain VII from the 20 other group II introns known to carry a 3' extension (see Table 1; consensus drawn as in ref. [18]). Sites in the B.psf.I1 extension that are identical to the consensus are drawn in a black background. Note the extended S2 stem in B.psf.I1
Figure 4In vitro self-splicing of . (A) B.c.I4 deleted of the entire 3' extension (B.c.I4_dS1S2); (B) B.c.I4 WT; (C) B.c.I4 deleted of the entire 3' extension and the branchsite adenosine (B.c.I4_dA_dS1S2); (D) B.c.I4 deleted of the branchsite adenosine only (B.c.I4_dA); (E) B.th.I6a WT; and (F) B.th.I6a deleted of the entire 3' extension (B.th.I6a_dS1S2). Splicing was performed in 40 mM MOPS (pH 7.5), 500 mM KCl, and 100 mM MgCl2 at 47°C. Samples were separated on a 7 M urea 4% polyacrylamide gel. The various splicing products are labeled on the sides. The weak bands corresponding to the linear forms of B.th.I6a (panels E and F) are marked by arrowheads and were identified by size. "dS1S2" and "dA" refer to deletion of the entire 3' extension or the branchsite adenosine, respectively
Figure 5Time-course analysis of in vitro self-splicing of . (A) B.c.I4 WT and B.c.I4 deleted of the entire 3' extension (B.c.I4_dS1S2) spliced in (NH4)2SO4 buffer; (B) same constructs as in (A) spliced in KCl buffer; (C) B.c.I4 deleted of the branchsite adenosine only (B.c.I4_dA) or deleted of the branchsite adenosine and the entire 3' extension (B.c.I4_dA_dS1S2) spliced in KCl buffer; and (D) B.th.I6a WT and B.th.I6a deleted of the entire 3' extension (B.th.I6a_dS1S2) spliced in KCl buffer. Splicing was performed in 40 mM MOPS (pH 7.5), 100 mM MgCl2 and either 500 mM (NH4)2SO4 (panel A) or 500 mM KCl (panels B, C, and D) at 47°C. The relative fractions of released lariat intron were computed from the intensities of the radioactive bands using a phosphorimager. The values shown represent averages with standard deviations of one replicate from two different RNA preparations