RNA polymerase III (Pol III) transcription of tRNA genes is essential for generating the tRNA adaptor molecules that link genetic sequence and protein translation. By mapping Pol III occupancy genome-wide in mouse, rat, human, macaque, dog and opossum livers, we found that Pol III binding to individual tRNA genes varies substantially in strength and location. However, when we took into account tRNA redundancies by grouping Pol III occupancy into 46 anticodon isoacceptor families or 21 amino acid-based isotype classes, we discovered strong conservation. Similarly, Pol III occupancy of amino acid isotypes is almost invariant among transcriptionally and evolutionarily diverse tissues in mouse. Thus, synthesis of functional tRNA isotypes has been highly constrained, although the usage of individual tRNA genes has evolved rapidly.
RNA polymerase III (Pol III) transcription of tRNA genes is essential for generating the tRNA adaptor molecules that link genetic sequence and protein translation. By mapping Pol III occupancy genome-wide in mouse, rat, human, macaque, dog and opossum livers, we found that Pol III binding to individual tRNA genes varies substantially in strength and location. However, when we took into account tRNA redundancies by grouping Pol III occupancy into 46 anticodon isoacceptor families or 21 amino acid-based isotype classes, we discovered strong conservation. Similarly, Pol III occupancy of amino acid isotypes is almost invariant among transcriptionally and evolutionarily diverse tissues in mouse. Thus, synthesis of functional tRNA isotypes has been highly constrained, although the usage of individual tRNA genes has evolved rapidly.
Tissue-specific gene expression patterns can be highly conserved among vertebrates, and organismal divergences up to 500 million years can leave a core set of highly transcribed tissue-specific genes intact from chicken to fish to human[1]. Furthermore, non-coding RNAs, such as lincRNAs, may also show elevated purifying selection in location and expression between human and mouse cells[2,3]. Perhaps surprisingly, the proteins that bind DNA to direct these tissue-specific gene expression programs in mammals can diverge rapidly and dramatically in their genome-wide binding, even between evolutionarily close species[4-7]. The ability of divergent protein-DNA contacts to direct highly conserved gene expression is not understood, both because mammalian transcription factors (TF) operate in cross-regulatory and functionally redundant modules, and because of the difficulty of disentangling the specific regulatory contributions of the thousands of TF binding events in a mammalian cell.The synthesis of transfer RNA (tRNA) by RNA polymerase III (pol III) is an ideal system to explore the evolution and function of mammalian transcriptional regulation, because pol III binds to only a few hundred tRNA genes in mammals[8-12], and because translation of mRNA into polypeptides is highly conserved[13]. Each tRNA molecule can be attached to a single amino acid that it couples to a growing polypeptide chain by selectively base-pairing its three-base anticodon to a complementary three-base codon sequence within a messenger RNA (mRNA).tRNA genes that encode the same anticodon are referred to as isoacceptor classes; there are about 48 in mammals, depending on the species[14,15]. Isoacceptor families that translate to the same amino acid are known as amino acid isotypes, of which there are 21. In addition to the redundancies arising from multiple genes encoding the same anticodon, and multiple anticodons translating to the same amino acid, tRNAs can use the third base of the anticodon in a wobble pair with a closely-related codon sequence, thereby allowing one tRNA anticodon to pair with multiple codon triplets. These redundancies have been well characterized biochemically[16]; grouping tRNA genes into isoacceptor families and isotypes can simplify analysis, bypassing redundancy complications[15].Although tRNA biology has been extensively studied, much of the work was based on biochemical assays, and was performed before genome sequences were available for any mammal. More recent work has been mainly computational, using predictive methods to identify possible tRNA genes[14,17]. The experimental analyses that have been carried out so far have focussed only on single species[8-12].In this study, we have identified the location of, and analyzed the evolutionary stability of, the binding of pol III to tRNA genes in six species from four mammalian orders: mouse and rat (rodents), human and macaque (primates), dog (carnivores), and opossum (marsupials) as a non-eutherianmammalian outgroup. Our results demonstrate how the binding of the basal transcriptional machinery to individual tRNA genes can rapidly diverge while still maintaining highly constrained expression of functional amino acid isotypes.
Results
Pol III binds hundreds of expressed tRNA genes in mice
Our analysis focused on actively transcribed nuclear tRNA genes, which we collapsed into anticodon isoacceptors that translate synonymous codons, and further into amino acid isotypes based on their aminoacylation identity (Figure 1A). We first identified the tRNA genes occupied by pol III in mouse liver by performing chromatin immunoprecipitation followed by DNA sequencing (ChIP-seq) and validated their expression by sequencing of cDNA from total RNA (RNA-seq) (Figure 1B) (Methods). We identified candidate loci using tRNAscan-SE, mapped ChIP-seq reads to the appropriate reference genome, and counted reads that align to predicted tRNA loci or within 100 bp upstream or downstream (Methods). In tissues from mouse (as well as five other mammalian species) (Figure 1C), different biological replicates for pol III occupancy were found to be highly similar (Figure 2). As expected from prior studies in cultured cells[8-12], pol III binding was found primarily at known pol III targets[18] such as transposons, other repeat elements, and tRNA genes (Figure S1).
Figure 1
Pol III occupies and transcribes tRNA genes in mouse liver
(A) tRNA genes are present in hundreds of copies in a mammalian genome, but collapse into 62 isoacceptor families (48 are present in mammals), and 21 amino acid isotypes. (B) Separate portions of each species’ tissues were flash frozen to permit direct RNA sequencing, and treated with formaldehyde to cross link protein-DNA contacts to allow chromatin immunoprecipitation reactions. (C) Primary tissues were isolated from six mammals varying in evolutionary distance to mouse, from 12 million (rat) to over 180 million years (opossum). (D) Typical pol III-bound tRNA loci on murine chromosome 11. Pol III binding shown as a beige enrichment track (top), the input DNA as a black track (middle), and total RNA as blue track (bottom), indicating the pol III bound regions are transcriptionally active. The y-axis of each track specifies read density. The mammalian conservation track, obtained from UCSC, displays the degree of placental mammal basewise conservation (20 species) and sequence constraint. Beneath is the genome annotation in this region obtained for repeats, non-coding RNAs, and tRNAs (UCSC genome browser).
Figure 2
tRNA isoacceptors and isotypes are differentially bound by pol III in mouse liver
(A) Pol III binding varies widely among isoacceptor families (red), which collapse in number as well as binding variability in the 21 amino acid isotypes (grey). Areas are shaded according to number of pol III read counts (dendrogram shows values white: low to red/black: high). Hatched boxes represent isoacceptors that are absent in mouse, or that do not encode for an amino acid. (B-D) Pol III binding between replicates of mouse liver compared at (B) tRNA gene loci (blue), (C) anticodon isoacceptors (red), and (D) amino acid isotypes (black). Spearman’s rank correlation coefficients (ρ) are shown.
RNA polymerase II (pol II), which is responsible for protein-coding mRNA synthesis, has been well-characterized to bind in vivo to many regions not actively transcribed due to stalled polymerase activity[19,20], alternate transcription start sites[21], and regulated alternative splicing[22]. However, sequencing of transcripts in mouse liver indicated that every tRNA gene bound by pol III is expressed (Figure 1D). Conversely, very few RNA transcripts aligned uniquely to predicted tRNA loci not bound by pol III (Figure S2), confirming prior reports that pol III binding can be used as a proxy for tRNA transcription[8-12].
Pol III occupies tRNA genes in anticodon- and isotype-specific distributions
Mammals share a set of tRNAs carrying 41-55 anticodons[15]. In mouse liver, we identified 223 distinct tRNA genes bound by pol III. Almost all of the 223 mouse tRNA genes experimentally determined to be bound by pol III were computationally predicted previously by Lowe et al. [14] (98%) or Coughlin et al.[23] (94%) (Table S1). About 70% of the transcribed tRNA genes are located in clusters, indicating that the majority of tRNA gene transcription is concentrated in distinct chromosomal domains (Figure S3)[24].The mouse tRNA genes we identify as pol III bound in primary liver tissue represent 47 of the 62 possible isoacceptor families that encode the 20 standard amino acids, plus selenocysteine (Figure 1A, Figure 2A). One of these 47 isoacceptor families is found only in rodents [tRNAGly(ACC)], and therefore, the remaining 46, which are shared among all interrogated mammals, are predominantly used in our analyses.Redundancies in the translational code are reflected in the distribution of pol III binding among the different isoacceptor families. For instance, methionine (Met) has only a single anticodon tRNAMet(CAT), which is highly occupied by pol III. In contrast, the five different and partially redundant arginine (Arg) anticodons tRNAArg(ACG), (CCG), (CCT), (TCG), and (TCT) each show intermediate pol III occupancy. Collapsed to the amino acid level, however, the collective amount of pol III at all tRNA genes for arginine is substantially higher than the pol III occupancy of methionine tRNA genes (Figure 2A).The binding of pol III among these 223 distinct mouse tRNA genes varied by up to two orders of magnitude (Figure 2B). Analysing pol III binding by isoacceptor family showed only one order of magnitude difference. For instance, the highest occupied isoacceptor families, tRNAMet(CAT) and tRNAAsp(GTC), show over 4000 reads versus just over 300 reads at the lowest, tRNAGly(ACC) (Figure 2C). Although the amount of pol III at each isoacceptor family largely correlated with the number of copies of mouse tRNA genes in the family, there were many specific tRNA genes for the same isoacceptor that showed considerable variation (Figure S4).These observations support prior reports of active regulation of tRNA transcription in a site-specific manner[8-12]. The amino acid isotypes had a similar distribution of total pol III enrichment as their component isoacceptors (Figure 2D). Pol III occupancy at tRNA genes, isoacceptor anticodon families, and amino acid isotypes classes was highly correlated between different biological replicates (Figure 2B-D).
Tissue-independent expression of tRNA isotypes
There are large differences in pol III binding between different human cultured cell types[8,11] and between different human tissues in tRNA expression[25]. We asked whether tRNA expression was similarly divergent in phenotypically diverse primary tissues by determining pol III occupancy in mouse muscle, testes, and liver. Our data confirmed prior reports that different cellular types have differences in pol III occupancy on a gene-by-gene basis (Figure 3, top right, blue circles). The collections of tRNA genes active in different mouse tissues overlap almost entirely (Figure S5); the observed differences between these transcriptionally diverse tissues within one species appear instead to be due to differential pol III binding over the same set of tRNA genes.
Figure 3
Amino acid isotypes are bound by pol III in a tissue-independent manner
Pol III occupancy at tRNA genes was determined in mouse liver, muscle, and testes. The intersection of the row-column for each tissue combination in the upper-right triangle shows the correlation for all pol III bound tRNA gene loci (blue) and isoacceptor (red) and in the lower-left triangle of the binding of pol III to isotypes (black). The Spearman’s rank correlation coefficients (ρ) are reported correspondingly by colour in the lower right of each inter-tissue panel.
When pol III binding is considered by isoacceptor family (Figure 3, top right, red circles) or by amino acid isotype classes (Figure 3, bottom left, black circles), the relative proportion of tRNAs bound by pol III in muscle, testes, and liver is highly correlated (Spearman’s rank correlation coefficient ρ=0.90-0.96 on the isoacceptor and ρ=0.97-0.98 on the isotype level).In bacteria and yeast, relative levels of tRNA isoacceptors correlate with codon usage in genes encoding highly translated proteins, and can themselves influence protein coding gene expression[26]. However, the role of codon bias in mammals remains debated, with different studies providing contradictory results in inter-tissue codon biases[27,28]. Our data show that pol III binding to classes of tRNA genes is surprisingly stable, even among transcriptionally divergent mouse tissues, despite substantial tissue-specific differences in pol III occupancy at individual tRNA genes. This observation provides support for the functional importance of selective codon usage in mammals[16]. We also note more differential use of certain isotypes in testis relative to muscle or liver, perhaps reflecting optimisation in reproductive tissues for tissue-specific gene products.
Pol III binds to hundreds of tRNA genes in six mammalian species
To explore the evolutionary dynamics of tRNA gene transcription, we experimentally determined pol III occupancy in livers of four additional eutherian and one marsupial mammal. We compared our mouse pol III occupancy data to rat (diverged from mouse 12 million years [MY] ago), human (80 MY), macaque (80 MY), dog (85 MY), and the non-eutherian opossum (180 MY) (Figure 4A). We chose liver, which is primarily composed of a single cell type, hepatocytes[29] as a representative vertebrate tissue for interspecies comparisons, because of its conserved tissue structure[30], function, and gene expression[1].
Figure 4
The tRNA genes bound by pol III diverge in genomic location and functional usage among mammals
(A) Two clusters of multiple tRNA genes flanking the Trim41 and Trim7 genes are bound by pol III in multiple species (shown as genome tracks). The numbers on the left of each track specify read density for pol III ChIP experiments. The syntenic position of each tRNA gene is traced between species with a dashed line. A blue dashed line represents conserved binding in all six mammals and a yellow line indicates species-specific innovation. The cluster upstream of the Trim41 gene shows four tRNA gene loci (two valines, a lysine, and an alanine) with conserved pol III binding. The lysine gene locus does not exist in the rodents, and two leucines are only present in opossum. An additional cluster 3′ downstream of Trim7 demonstrates two tRNA loci (threonine and proline) bound by pol III in all six species. The valine tRNA gene only evolved in the eutherians, and the leucine tRNA gene is not bound in dog. (B) A 5-way Venn diagram intersects the tRNA genes bound by pol III for each possible combination of eutherian mammals used in this study. The total number of species-specific tRNA genes (white) is subdivided into tRNA genes present (inner black numbers) and absent (outer black numbers in dashed subdivision) in Ensembl’s 16 amniote alignment. 24 tRNA genes are bound in all 6 species. Area colourings are shaded according to number of tRNA genes (white: low to blue: high). (C) tRNA loci bound by pol III from panel (B) were sorted by the number of species in which they were bound, and compared to the level of pol III occupancy in each of the study species; demonstrating that higher degree of tRNA gene conservation correlates with higher pol III occupancy (Methods).
In addition to the 223 tRNA genes bound by pol III in mouse, we identified 282 in rat, 224 in human, 233 in macaque, 135 in dog, and 216 in opossum (Tables S2-7). The number of genes corresponding with each isotype is roughly similar across the six species (Table S8). In every species, a substantial fraction (50-70%) of the pol III bound tRNA loci were found in clusters (Figure S3). The higher number of tRNA genes in rat and lower number in dog could be explained by: repeat-driven expansion of tRNA genes, segmental duplications, and genome assembly errors[31,32]. Regardless, our data captured a set of 46 isoacceptors found in every mammal. Rodents and opossum each have one additional isoacceptor, tRNAGly(ACC) and tRNALeu(GAG), respectively, totalling 48 for all profiled mammals. A similar number of pol III occupied tRNA loci were found using phenotypically and transcriptionally distinct human cell lines[8,10-12]. Between 61% and 92% of our 224 human tRNA genes overlapped with the tRNA loci identified as bound in other recent studies (Table S1).We noted a good correlation between the number of tRNA genes and overall pol III occupancy among different species, both by isotype and anticodon, similar to that seen in mouse tissues (Table S9). However, there are significant differences between the occupancy levels of specific tRNA genes, indicating an additional level of per-locus regulation beyond the number of genes.Publicly available databases predict the existence of 500 to 1000 possible tRNA genes in the genomes that we experimentally profiled for pol III binding[1]. The hundreds of predicted tRNA genes not occupied by pol III in primary liver tissue must therefore represent a combination of (i) genes actively transcribed in tissues or conditions not profiled in our study and (ii) inactive tRNA pseudogenes. Precedent for the first scenario comes from comparisons between human cell lines. For example, 26% of tRNA loci were occupied by pol III in either HeLa cells or CD4+ T cells, but not in both[8]. Examples of the latter became apparent when we compared pol III occupancy in livers from different mammals.
The location of Pol III bound tRNAs diverges among mammals
We asked whether the genomic location of pol III-associated tRNA genes is conserved throughout mammalian evolution (Figure 4). tRNA gene synteny is broadly maintained in 12 Drosophila genomes spanning about 40 MY of lineage divergence[33]. However, our data showed that the specific set of pol III-bound tRNA genes varied considerably between species.For example, on mouse chromosome 14, a set of four clustered tRNA genes (tRNAThr, tRNAPro, tRNAVal, tRNALeu) is encoded downstream of the murineTrim7 locus, which has conserved synteny in mammals (Figure 4A). Two loci (tRNAThr and tRNAPro) are pol III bound in all interrogated species, and all four loci are pol III-bound in the rodents and primates. Lineage specific loss of pol III occupancy can also be observed in this example, as the tRNALeu gene in dog does not show detectable pol III occupancy. The A- and B-box motifs, which are the type II tRNA gene internal pol III binding motifs[34], have degenerated in dog, precluding transcription. The tRNAVal gene in this example is eutherian-specific and either evolved after the eutherian-marsupial divergence or was lost in opossum. Figure S6 is an example of a primate-specific loss in pol III binding of a tRNAArg(ACG) gene. Our data also show lineage-specific gains: in the same figure, a tRNALeu(TAG) gene is bound by pol III in rodents, yet shows no similar evidence of transcription in the other mammals.To determine the genome-wide evolution of actively transcribed tRNA genes in placental mammals, we compared all six species’ liver pol III binding data using the Ensembl 16-amniota vertebrate PECAN alignment, which contains approximately half of each study species’ genome[35,36]. Only, 35 of all tRNA genes bound by pol III (ca. 16%) can be aligned among eutherians (Figure 4B), consistent with previous reports that a relatively small amount of functional sequence is alignable across eutherian genomes[37]. The 35 tRNA genes bound by pol III in all eutherians include at least one anticodon for 18 of the 21 amino acids. Incorporating opposum, 24 tRNA genes are bound by pol III across 180 MY of evolution, and therefore were likely present in the common mammalian ancestor.Disregarding pol III-occupancy on tRNA genes, 55 predicted tRNA genes (11%) are aligned by PECAN, indicating a slight evolutionary constraint on actively transcribed tRNA genes (Figure S7).Orthologous tRNA loci bound by pol III in all six species tend to have high pol III occupancy (Figure 4C). For example, among the many tRNAArg(TCG) genes in mouse, the single tRNAArg(TCG) gene on chromosome 7 (Table S2) with orthologues in the other five species has the strongest pol III occupancy (Table S3-7). An additional example is tRNALeu, which is the most highly expressed isotype, and which also has two distinct gene loci strongly occupied by pol III in eutherian mammals.The number of loci conserved between two species generally reflects evolutionary distance: humans and macaques (23 MY) share 124 loci, humans and mice (80 MY) share 79, while mice and rats (12 MY) share 99 (Figure 4B). The majority of tRNA loci are not alignable to other species within the 16-way alignment; they would be by this definition occupied species-uniquely (Figure 4B, S5 and S8).We then asked whether pairwise (as opposed to 16-way) alignments of the study species’ genomes would substantially increase the observed overlap in pol III binding. Using synteny nets from UCSC[38], we identified the tRNA genes in each species that do not have an ortholog in the syntenic block in a second species. This calculation provides a minimum estimate on the number of tRNA genes that must occur in non-syntenic locations. Mouse and human have the best-annotated genomes, with 93.3% and 93.4% of the genomes, respectively, in synteny blocks; more generally, a typical pairwise synteny map aligns each genome with more than 90% of the other genome[39]. However, 34% of mouse tRNA genes do not align with human homologs, well over twice as many as would be expected were they randomly distributed in the genome. Similarly, we found 83% of mouse tRNA genes were in the same synteny block in rat, which decreases to 49% in dog, and 43% in opossum (Table S10). It is possible that this reflects a modest preference for tRNA genes to be duplicated in evolutionarily active regions of the genome. In summary, a substantial fraction (ca. 14-55%) of mouse tRNA genes cannot be found in a syntenic location in a second species; this result was unaffected by which species was used to anchor the pairwise comparison.We conclude that even by relaxed measures of homology, a substantial fraction of pol III binding at tRNA genes occurs in species-specific location. Our results indicate that, in contrast to the apparent high conservation in drosophila tRNA gene evolution[33], both tRNA gene location as well as tRNA transcription can evolve rapidly along mammalian lineages.
Pol III occupancy of tRNA isotypes is highly conserved across mammals
We asked whether the observed rapid gain and loss in transcription of particular tRNA genes could be compensated for by simultaneous losses or gains in transcription from other tRNA genes within the same anticodon families. To test this, we began by comparing the total pol III occupancy among the 21 specific amino acid isotypes in all six mammalian species. We reasoned that if there were successful compensation, then the total quantity of pol III at the complete collection of tRNA genes coding for a specific amino acid isotype would not vary between species, despite the divergence we could observe for specific gene loci. We found that the distribution of pol III occupancy among the amino acid isotypes is highly conserved among mammals (Figure 5A), showing correlations uniformly around Spearman’s ρ=0.9 between species.
Figure 5
Isotypes are highly conserved in pol III occupancy and codon usage across mammalian evolution, though the isoacceptors they consist of are only moderately well conserved
We experimentally determined the strength of pol III binding using ChIP-seq (A-C) and transcript abundance using mRNA-seq (D-F) in livers of six mammals. (A) Proportional frequency of pol III binding by isotype is similar among all six mammals. (B) Proportional frequency of pol III binding to all possible arginine tRNA isoacceptors demonstrate differences among mammals. (C) Pol III binding to tRNA anticodons diverges in correspondence to evolutionary distance (red circles), but pol III binding to isotypes remains largely conserved out to 180 MY of evolution (hollow circles). (D) The frequency of each amino acid weighted by transcript expression was highly conserved between species. (E) Proportional frequencies of codon usage for arginine indicate that mammalian species have little divergence in their codon biases. (F) Amino acids and to a large extent their encoding triplet codon are equally abundant in transcripts across 180 MY of mammalian evolution. Coloured lines within the radial plots (A, B, D, and E) present the data values of each species (purple: mouse, pink: rat, green: human, orange: macaque, black: dog, and yellow: opossum). Labels around the plot indicate amino acids (A and D), anticodons (B) and triplet codons (E). Labels within the grid of the radial plots (A, B, D, and E) describe percentages.
Similarly, we determined the correlation of pol III occupancy among the 46 common anticodon isoacceptors in mouse compared with the same isoacceptor family from each of the other mammals; Figure 5B shows an example of the six arginine anticodons. Figure S6 shows a similar analysis for three orthologous tRNALys genes. These examples demonstrate that increased pol III occupancy of orthologous tRNA genes can be balanced by decreased pol III occupancy of other tRNA genes within the same anticodon isoacceptor family. In general, we found that the anticodon isoacceptor correlation with mouse declined steadily with evolutionary distance from approximately ρ=0.88 in rat (12 MY) to ρ=0.69 in opossum (180 MY) (Figure 5C).Taken together, in contrast to the rapid gain and loss of pol III activity at specific tRNA genes, the overall expression of tRNA amino acid isotypes has remained constant during 180 MY of evolution.
Transcriptome usage of codon triplets is highly conserved across mammals
Transcription of amino acid isotypes, and to a lesser extent anticodon isoacceptors, is highly conserved in livers of six mammals. We asked whether the usage of codons in the corresponding mRNA transcriptomes was similarly conserved. Prior studies in prokaryotes and simple eukaryotes have found that codon usage can vary considerably among species, although rarely between closely related species[16]. In addition, gene expression is strongly influenced by tRNA availability in non-mammalian systems[40]. Other findings suggest that codon usage biases are correlated to tRNA abundance[41]. To determine whether mammals have species-specific codon biases, we sequenced mRNA from liver of the six mammals used in this work, then counted the occurrences of triplet codons and amino acids necessary for mRNA decoding in each species’ liver transcriptome, weighted by transcript abundance. We found that the frequencies of different encoded amino acids were almost invariant among mammals as diverse as opossum and mouse, with uniformly high correlation values (ρ >0.95) (Figure 5D-F). Codon usage was well conserved across the profiled species (Figure 5E-F, S9), though not as well as amino acid usage. We observed a similar high correlation in codon usage and weighted amino acid frequency in different mouse tissues (Figures S10, S11 and S12).Finally, we asked whether pol III occupancy of tRNA isoacceptor classes correlated with the weighted codon distribution found in the transcriptome of the same tissue (Figure S13). For liver in 6 species, we found reasonably strong correlation (R≈0.7) between the amount of pol III at the 46 common anticodons, and the weighted occurrence of the corresponding codons in mRNAs. This correlation is particularly surprising, given the number of confounding factors in correlating such disparate data, such as the previously discussed redundancies within the genetic code and post-transcriptional regulation of mRNA causing variations in mRNA longevity.In sum, we found remarkable conservation in pol III occupancy and transcription of tRNA genes at the amino acid level, and a corresponding conservation among the usage of amino acids in transcriptomes from matched tissues in diverse mammals.
Discussion
tRNA gene binding and transcription by pol III is absolutely required for cellular function, yet has only recently been analysed experimentally in single species on a genome-wide scale. Our global binding data in primary mouse tissues confirm reports based on cultured cell lines that pol III occupancy of tRNA genes varies considerably between cell types[8,11]. Nevertheless, despite the variable use of each tRNA locus, conserved pol III occupancy becomes apparent on taking into account the redundancies within the genetic code. This analysis shows that within a single species, pol III binding at the isotype level is very similar in tissues as diverse as muscle, liver, and testis. Similar correlations are found across 180 million years of evolution to those found between replicate experiments within a single species. By conducting RNA-seq in all six mammals, we discovered that a similar type of conservation appears to exist within the amino acid usage across mammals. Our data show that pol III binding at the anticodon isoacceptor level and the component triplet codon within transcriptomes must be under strong constraint at the amino acid isotype level, that is, although individual genes and isoacceptor classes vary in expression, the overall quantity of tRNAs for a given isotype is highly conserved. To our knowledge, this type of conservation has not been reported previously.Genomic binding of pol III showed a remarkably rapid rate of divergence in mammals, largely due to genomic rearrangements, co-option, and gene duplication. We identified 24 genes that are in locations of conserved synteny and therefore likely existed in the early mammalian ancestor. Even between closely related species, such as mouse/rat or human/macaque, 33 to 58% of transcribed tRNA genes appear unique to each species. Proportionally at least, this appears to broadly reflect divergence of the tRNA genes themselves. However, the rapid changes found in functional pol III binding to tRNA genes is similar to site-specific transcriptional regulators like Oct4 and CEBPA, where mouse to human comparisons show that only a minority of binding events are conserved in orthologous locations[6,7].Our data provide evidence for regulatory crosstalk among the tRNA genes corresponding with each isotype, allowing collective output to be coordinated at the transcriptional level. It is possible that distinct regulatory sequence motifs are employed[10] or local chromatin state helps direct pol III binding[8,10,11]. Such highly specific trans-acting communication cannot be explained by the established mechanisms of pol III regulation, which have focused on controls that are mediated through changes to TFs shared by all tRNA genes[17]. One possibility is that tRNA genes cluster in space according to isotype, despite their dispersal across chromosomes, allowing output from the amino acid group to be controlled independently of usage of individual group members. Spatial clustering of tRNA genes has been observed in yeast[42]. However, it is unclear how such clustering might overcome chromatin topological constraints or distinguish among isotypes.Many transcription factors bind the genome tens of thousands of times; Schmidt et al. suggest that many of these binding events are functionally neutral or redundant[6]. Even well characterized, directly regulated TF targets can have clustered TF binding. These factors can contribute in varying degrees towards controlling transcription, which greatly complicates our understanding of the relationship between binding location and regulator function[43-46]. In contrast, tRNA expression originates from only a few hundred distinct locations, in genomes as diverse as drosophila[33] and mammals. The limited number of actively transcribed tRNA genes, determined by pol III occupancy, allowed us to analyse the otherwise complex relationship between pol III binding and its functional role in translation. Despite conservation of the overall tRNA transcriptional program, our analysis showed that pol III occupancy and tRNA genes themselves can evolve rapidly. Thus, purifying selective pressure in tRNA expression must be operating at the amino acid isotype level, rather than at individual genetic loci.
Authors: Maya Kasowski; Fabian Grubert; Christopher Heffelfinger; Manoj Hariharan; Akwasi Asabere; Sebastian M Waszak; Lukas Habegger; Joel Rozowsky; Minyi Shi; Alexander E Urban; Mi-Young Hong; Konrad J Karczewski; Wolfgang Huber; Sherman M Weissman; Mark B Gerstein; Jan O Korbel; Michael Snyder Journal: Science Date: 2010-03-18 Impact factor: 47.728
Authors: Debasish Raha; Zhong Wang; Zarmik Moqtaderi; Linfeng Wu; Guoneng Zhong; Mark Gerstein; Kevin Struhl; Michael Snyder Journal: Proc Natl Acad Sci U S A Date: 2010-02-05 Impact factor: 11.205
Authors: Mitchell Guttman; Ido Amit; Manuel Garber; Courtney French; Michael F Lin; David Feldser; Maite Huarte; Or Zuk; Bryce W Carey; John P Cassady; Moran N Cabili; Rudolf Jaenisch; Tarjei S Mikkelsen; Tyler Jacks; Nir Hacohen; Bradley E Bernstein; Manolis Kellis; Aviv Regev; John L Rinn; Eric S Lander Journal: Nature Date: 2009-02-01 Impact factor: 49.962
Authors: Dominic Schmidt; Michael D Wilson; Christiana Spyrou; Gordon D Brown; James Hadfield; Duncan T Odom Journal: Methods Date: 2009-03-09 Impact factor: 3.608
Authors: Jeremy T Lant; Matthew D Berg; Ilka U Heinemann; Christopher J Brandl; Patrick O'Donoghue Journal: J Biol Chem Date: 2019-01-14 Impact factor: 5.157