| Literature DB >> 29867913 |
Sergio M Morgado1, Ana C P Vicente1.
Abstract
tRNA array unit, a genomic region presenting an intriguing high tRNA gene number and density, was supposed to occur only in few bacteria phyla, particularly Firmicutes. Here, we identified and characterized an abundance and diversity of tRNA array units in Mycobacterium associated genomes. These genomes comprised chromosome, bacteriophages and plasmids from mycobacteria. Firstly, we had identified 32 tRNA genes organized in an array unit within a mycobacteria plasmid genome and therefore, we hypothesized the presence of such structures in Mycobacterium genus. However, at the time, bioinformatics tools only predict tRNA genes, not characterizing their arrangement as arrays. In order to test our hypothesis, we developed and applied an in-house Perl script that identified tRNA genes organization as an array unit. This survey included a total of 7,670 complete and drafts genomes of Mycobacterium genus, 4312 mycobacteriophage genomes and 40 mycobacteria plasmids. We showed that tRNA array units are abundant in genomes associated to the Mycobacterium genus, mainly in Mycobacterium abscessus complex species, being spread in chromosome, prophage, and plasmid genomes. Moreover, other non-coding RNA species (tmRNA and structured RNA) were also identified in these regions. Our results revealed that tRNA array units are not restrict, as previously assumed, to few bacteria phyla and genomes being present in one of the most diverse bacteria genus. We also provide a bioinformatics tool that allows further exploration of this issue in huge genomic databases. The presence of tRNA array units in plasmids and bacteriophages, associated with horizontal gene transfer, and in a bacteria genus that explores diverse niches, are indicatives that tRNA array units have impact in the bacteria biology.Entities:
Keywords: HNH endonuclease; bioinformatics; code:Perl; mycobacteria; mycobacteria plasmids; mycobacteriophages; tRNA array units; tmRNA and structured RNA
Year: 2018 PMID: 29867913 PMCID: PMC5966550 DOI: 10.3389/fmicb.2018.01042
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1Maximum likelihood tree based on concatenated tRNA gene nucleotide sequences. In the right side, the tRNA isotype organization (using the single-letter amino acid code) is related with each tree branch. The gaps (– symbol) may not represent the actual distance between two adjacent tRNA genes, but the distance from the reference array. Each array group is depicted by the branch color. The green circles in the branches indicate bootstrap values ≥70.
Figure 2Maximum likelihood tree based on seven concatenated housekeeping genes. The green circles in the branches indicate bootstrap values ≥70. Each colored block at the right indicates the geographic location of the strain isolation. Dotted branches represent the slow-growing species. The array group is indicated by the species name label: G1, red; G2, black; G3, blue; Singleton, lilac; both G1 and G2, green; both G2 and G3, orange. The reference species are in bold.