Literature DB >> 28108659

YAMAT-seq: an efficient method for high-throughput sequencing of mature transfer RNAs.

Megumi Shigematsu1, Shozo Honda1, Phillipe Loher1, Aristeidis G Telonis1, Isidore Rigoutsos1, Yohei Kirino1.   

Abstract

Besides translation, transfer RNAs (tRNAs) play many non-canonical roles in various biological pathways and exhibit highly variable expression profiles. To unravel the emerging complexities of tRNA biology and molecular mechanisms underlying them, an efficient tRNA sequencing method is required. However, the rigid structure of tRNA has been presenting a challenge to the development of such methods. We report the development of Y-shaped Adapter-ligated MAture TRNA sequencing (YAMAT-seq), an efficient and convenient method for high-throughput sequencing of mature tRNAs. YAMAT-seq circumvents the issue of inefficient adapter ligation, a characteristic of conventional RNA sequencing methods for mature tRNAs, by employing the efficient and specific ligation of Y-shaped adapter to mature tRNAs using T4 RNA Ligase 2. Subsequent cDNA amplification and next-generation sequencing successfully yield numerous mature tRNA sequences. YAMAT-seq has high specificity for mature tRNAs and high sensitivity to detect most isoacceptors from minute amount of total RNA. Moreover, YAMAT-seq shows quantitative capability to estimate expression levels of mature tRNAs, and has high reproducibility and broad applicability for various cell lines. YAMAT-seq thus provides high-throughput technique for identifying tRNA profiles and their regulations in various transcriptomes, which could play important regulatory roles in translation and other biological processes.
© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28108659      PMCID: PMC5605243          DOI: 10.1093/nar/gkx005

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Transfer RNAs (tRNAs) are universally expressed, non-coding RNAs measuring 70–90-nucleotides (nt) in length. tRNAs play a central role in translation machinery by converting information from mRNA codons to amino acids. The human nuclear genome encodes more than 600 tRNA genes for cytoplasmic tRNAs (cyto tRNAs) that function in the cytoplasmic translation system; in addition, the human mitochondrial genome encodes 22 mitochondrial tRNAs (mt tRNAs) that act in the mitochondrial translation system (1,2). Recently, the nuclear genome has been suggested to encode numerous cyto tRNA- and mt tRNA-lookalikes (3,4). The multitude of tRNA genes in the genome and the high stability of the expressed tRNAs have placed tRNAs among the most abundant RNA molecules in the cellular transcriptome. Despite the abundance of tRNAs and their well-defined role in translation, demand for quantitative and qualitative analyses of the tRNA repertoire has been growing. Interestingly, the tRNA gene copy number varies among individuals (5), and tRNA abundance varies widely among different cells and tissues (6,7). This heterogeneity and variation have been implicated in the translational regulation of mRNA expression (8), animal development (9–11) and disease (12–15). Besides their role in translation, it has become increasingly apparent that tRNAs play a wide variety of non-canonical functions in many biological pathways such as nutrient sensing and apoptosis regulation (16,17). tRNAs further act as substrates for tRNA-derived short non-coding RNAs, many of which have been identified as functional molecules with significant roles in a wide variety of biological processes such as the small regulatory RNA pathway, stress responses and cell proliferation (17–25). To unravel the emerging complexities of tRNA biology and molecular mechanisms underlying them, an efficient tRNA sequencing method is required. Although microarray-based methods have been proven to be useful for the quantification of many tRNA species (6,26), such methods require a custom-made array for each organism and do not completely evaluate all tRNA species including isodecoders that possess the same anticodon sequences but contain sequence variations in other parts. Because isodecoders can exhibit large functional variations (27), an efficient sequencing-based method capable of identifying the entire tRNA repertoire with all sequence variations is needed. However, the characteristic structure of tRNA presents a challenge to the development of such methods. Specifically, the 5΄-end of a mature tRNA binds tightly via base-pairs to the 3΄-end as an acceptor stem. This rigid terminal structure can prevent adapter access to the mature ends of tRNAs, thus hindering the adapter ligation step included in conventional RNA-seq methods and leading to an inefficient and biased representation of tRNAs in RNA-seq data. Instead of using conventional RNA-seq, recent studies have utilized the sequencing methods specialized for tRNAs. For example, Pang et al. employed a two-step ligation strategy to efficiently ligate adapters to tRNAs (28). A method described by Zhong et al. includes the addition of a poly-A tail to the deacylated 3΄-ends of mature tRNAs, thus facilitating the subsequent RT-PCR amplification of tRNAs (29). Furthermore, a DM-tRNA-seq devised by Zheng et al. included the successful reduction of sequence bias from tRNA post-transcriptional methylations by treating tRNAs with AlkB demethylase, followed by a template-switching reaction of thermostable group II intron reverse transcriptase for adapter attachment to tRNAs (30). However, these methods require relatively abundant starting RNA materials and include several purification steps and enzymatic reactions that are not necessary in conventional RNA-seq. In this light, we report herein the development of Y-shaped Adapter-ligated MAture TRNA sequencing (YAMAT-seq), a convenient, efficient and specific mature tRNA sequencing method. The YAMAT-seq procedure includes a nick-ligation step catalyzed by bacteriophage T4 RNA Ligase 2 (Rnl2) (31), which catalyzes ligation at a 3΄-OH/5΄-P nick in double-stranded RNA or an RNA–DNA hybrid (32–34). This peculiar ligation activity toward double-stranded nucleotides makes Rnl2 an attractive tool for adapter ligation during cDNA preparation (35) and for the detection and quantification of SNPs (36), microRNAs (37), small RNA variants (38) and tRNAs (39). In our YAMAT-seq procedure, Rnl2 specifically ligates a Y-shaped adapter to mature tRNAs at a much greater level of efficiency than that achieved with conventional ligation reactions involving linear adapters and T4 RNA ligase 1 (Rnl1). Subsequent cDNA amplification and next-generation sequencing successfully identify the cyto tRNA and mt tRNA repertoire in the cellular transcriptome. YAMAT-seq requires only a small amount of total RNA extracted using a standard organic method, only a few experimental procedures and a time interval of approximately 2 days before submitting the samples for next-generation sequencing. YAMAT-seq is thus an efficient and convenient high-throughput technique for mature tRNA sequencing that should be applicable to the global identification of tRNA repertories in various cell types and conditions.

MATERIALS AND METHODS

Cell culture

Human breast cancer cell lines were used in this study. The BT-474 cell line was cultured in Dulbecco's modified Eagle's medium (Life Technologies) containing 10% fetal bovine serum (FBS). The SK-BR-3 cell line was cultured in RMPI-1640 medium (Life Technologies) containing 10% FBS. The BT-20 cell line was cultured in MEM medium (Life Technologies) containing 10% FBS. The MCF-7 cell line was cultured in MEM medium containing 10% FBS, 1 mM sodium pyruvate, 1× non-essential amino acids solution (Life Technologies) and 10 μg/ml insulin (Sigma).

Total RNA isolation and deacylation treatment

Total RNA was extracted from cultured cells using TRIsure (Bioline) according to the manufacturer's protocol. Total RNAs were incubated at 37°C for 40 min in 20 mM Tris-HCl (pH 9.0) to remove amino acids from mature tRNAs (deacylation treatment), followed by ethanol precipitation.

Y-shaped adapter design

A DNA adapter containing Illumina 3΄-adapter sequences (Y-3΄-AD) and four different DNA/RNA hybrid adapters containing Illumina 5΄-adapter sequences and a 3΄-terminal A (Y-5΄-AD-A), G (Y-5΄-AD-G), C (Y-5΄-AD-C) or U (Y-5΄-AD-U) were synthesized by Integrated DNA Technologies. The adapter sequences were as follows (capital and small letters designate DNA and RNA, respectively): Y-3΄-AD, 5΄-5phos/GTATCCAGTTGGAATTCTCGGGTGCCAAGG/3ddC-3΄; Y-5΄-AD-A, 5΄-GTTCAGAGTTCTACAGTCCGACGATCACTGGATACTGga-3΄; Y-5΄-AD-G, 5΄-GTTCAGAGTTCTACAGTCCGACGATCACTGGATACTGgg-3΄; Y-5΄-AD-C, 5΄-GTTCAGAGTTCTACAGTCCGACGATCACTGGATACTGgc-3΄; and Y-5΄-AD-U, 5΄-GTTCAGAGTTCTACAGTCCGACGATCACTGGATACTGgu-3΄. The Y-shaped adapters comprising Y-3΄-AD/Y-5΄-AD-A, Y-3΄-AD/Y-5΄-AD-G, Y-3΄-AD/Y-5΄-AD-C and Y-3΄-AD/Y-5΄-AD-U are expected to hybridize mature tRNAs with discriminator bases of U, C, G and A, respectively.

Annealing and ligation of Y-shaped adapters to mature tRNAs

Four different Y-5΄-AD adapters (10 pmol each) and 40 pmol of the Y-3΄-AD adapter were incubated with 1 μg of deacylated total RNA in a 9-μl reaction volume at 90°C for 2 min. After adding 1 μl of 10× annealing buffer containing 50 mM Tris-HCl (pH 8.0), 5 mM ethylenediaminetetraacetic acid and 100 mM MgCl2, the 10-μl mixture was subjected to annealing at 37°C for 15 min. To ligate the annealed adapter to mature tRNAs, 10 μl of 1× reaction buffer containing 1 unit of Rnl2 (New England Biolabs) was added to the mixture. The entire mixture (20 μl) was incubated at 37°C for 1 h, followed by overnight incubation at 4°C.

cDNA amplification and Illumina sequencing

cDNA amplification was performed using the TruSeq Small RNA Sample Preparation Kit (Illumina) according to the manufacturer's protocol. Briefly, 6 μl of ligated RNA and 1 μl of RTP primer were incubated at 70°C for 2 min and then placed on ice. Reverse transcription was subsequently performed by adding 5.5 μl of a RT reaction mixture comprising 2 μl of 5× First Strand Buffer, 0.5 μl of 12.5 mM each dNTP, 1 μl of 100 mM dithiothreitol, 1 μl of RNase inhibitor and 1 μl of SuperScript III Reverse Transcriptase (Life Technologies), followed by incubation at 55°C for 60 min. The resultant cDNAs were amplified by polymerase chain reaction (PCR) (11 cycles) using a PCR enzyme mix (PML) and primers included in the Illumina kit. PCR products were developed using 8% native polyacrylamide gel electrophoresis (PAGE), and mature tRNA amplified regions were gel-purified. To confirm the amplified cDNA sequences, purified cDNAs were cloned using StrataClone Blunt PCR Cloning Kit (Agilent Technologies). The cDNAs were further sequenced (100 nt single-read) on an Illumina HiSeq 2500 system after PhiX library clustering by the Next-Generation Sequencing Core at University of Pennsylvania.

Examination of ligation efficiency to synthetic tRNA

To determine ligation efficiency, a synthetic human cyto tRNAAspGUC (5΄-/5Phos/UCCUCGUUAGUAUAGUGGUGAGUAUCCCCGCCUGUCACGCGGGAGACCGGGGUUCGAUUCCCCGACGGGGAGCCA-3΄) was synthesized by Integrated DNA Technologies. Synthetic tRNA was subjected to either Y-adapter ligation or Illumina adapter ligation (conventional method). For Y-adapter ligation, 2 pmol of synthetic tRNA was incubated with 10 pmol each of Y-5΄-AD-C and Y-3΄-AD and subjected to annealing and Rnl2 ligation as described above. For conventional ligation, 2 pmol of synthetic tRNA was subjected to 5΄- and 3΄- adapter ligation using the TruSeq Small RNA Sample Preparation Kit (Illumina). The respective ligation products (1/100 volume, 20 fmol of tRNA) were subjected to reverse transcription using SuperScript III Reverse Transcriptase (Life Technologies) in a total reaction volume of 20 μl; 0.4 μl of each reaction mixture was further subjected to Real-time PCR using SsoFast™ EvaGreen® Supermix (BioRad) and a StepOne PlusTM Real-time PCR machine (Applied Biosystems). To determine the efficiency of the ligation between the 5΄-adapter and tRNA, the amounts of ligated products were quantified by RT-PCR, in which the region from the 5΄-adapter to nucleotide position (np) 35 of the tRNA (np is according to the nucleotide numbering system of tRNAs (40)) was amplified using the following primers: forward, 5΄-GTTCAGAGTTCTACAGTCCGACGATC-3΄; and RT/reverse, 5΄- ACAGGCGGGGATACTCACCACTA-3΄. To determine the efficiency of the ligation between the 3΄-adapter and tRNA, the following primers were used to amplify the region from np 40 of the tRNA to the 3΄-adapter: forward, 5΄-CGGGAGACCGGGGTTCGATT-3΄; and RT/reverse, 5΄-GCCTTGGCACCCGAGAATTCCA-3΄. PCR products were cloned using a StrataClone cloning kit (Agilent Technologies) to generate standard curves; these were subsequently used to calculate and determine the amounts of ligated products from the respective ligation procedures.

Examination of ligation efficiency to native tRNAs in total RNA fraction

Deacylated BT-474 total RNA (1 μg) was subjected to either Y-adapter ligation or Illumina adapter ligation (conventional method), followed by cDNA synthesis as described in the above cDNA amplification section. Resultant reverse transcription reaction mixture (0.4 μl from total volume of 20 μl) was further subjected to Real-time PCR as described above. To determine the ligation efficiency between the 5΄-adapter and tRNA, the following primers were used to amplify the region from the 5΄-adapter to anticodon arm: universal forward, 5΄-AATGATACGGCGACCACCGAGATCTACACGTTCAGAGTTCTACAGTCCGA-3΄; RT/reverse for tRNAAspGUC, 5΄-CGGGGATACTCACCACTATACTAACGAGGA-3΄; RT/reverse for tRNAGluCUC, 5΄-CGAATCCTAACCACTAGACCAC-3΄; RT/reverse for tRNALysCUU, 5΄-ACTGAGCTAGCCGGGC-3΄; and RT/reverse for tRNAPheGAA, 5΄-CTAACGCTCTCCCAACTGAGCTAT-3΄. To determine the efficiency of the ligation between the 3΄-adapter and tRNA, the following primers were used to amplify the region from anticodon arm to the 3΄-adapter: forward for tRNAAspGUC, 5΄-CGGGAGACCGGGGTTCGATT-3΄; forward for tRNAGluCUC, 5΄-GGGTTCGATTCCCGGTCA-3΄; forward for tRNALysCUU, 5΄-GTTCGAGCCCCACGTT-3΄; forward for tRNAPheGAA, 5΄-AAGGTCCCTGGTTCGATCC-3΄ and universal RT/reverse, 5΄- CAAGCAGAAGACGGCATACGAGATCGTGATGTGACTGGAGTTCCTTGGCACCCGAGAATTCCA-3΄.

Bioinformatics analyses

The numbers of YAMAT-seq raw reads obtained from total RNAs of MCF-7, SK-BR-3 and BT-20 with technical triplicates were shown in Figure 3B and Supplementary Figure S1; these are publicly available at NCBI's Sequence Read Archive (accession no. SRP096584). After quality check, SHRiMP2 (41) was used to map the reads to a set of 632 tRNA-reference genes (listed in gtRNAdb (1)) that comprised 610 nuclear-encoded cyto tRNAs from the GRCh37 assembly and 22 known mt tRNAs from tRNAdb (2). The 610 entries from gtRNAdb included 508 true tRNAs and 102 psuedo-tRNAs. We excluded tRNAs that mapped to contigs that are not part of the major chromosome assembly. We allowed non-unique mappings with a 10% mismatch rate, penalizing each mismatch and gap extension equally. We removed any introns from the reference set and CCA was added to the 3΄-ends of the tRNA-reference genes prior to mapping. To be conservative, we also mapped the reads (minus the CCA) to the full GRCh37 assembly using the same parameters and excluded the read if it mapped equally or better to non-tRNA space when compared with the tRNA-reference gene mapping. We confirmed that almost all reads that mapped to the tRNA reference also mapped to tRNAs during the full genome mapping. Lastly, we only kept reads that were 60–87 nt in length inclusive and ended in CCA. Because 100 nt reads, yielded by Illumina sequencing, contain 3΄-terminal sequences (13 nt) of Y-5΄-AD adapter and thereby 87 nt is the maximum read length for tRNAs, CCA sequences were not found in the reads of some long tRNAs (e.g. cyto tRNASeCUCA with 90 nt length). These long tRNA reads were retained as tRNA reads regardless of a lack of CCA sequences. Statistical analysis was performed using R (42) (https://www.R-project.org/). Heatmaps were built with the heatmap.2 function of the gplots package of R. Dendrograms were constructed and visualized with the amap and dendextend packages (43), respectively, with Euclidean distance as a metric for hierarchical clustering.
Figure 3.

Amplification and sequencing of mature tRNAs using YAMAT-seq. (A) Image of 8% native PAGE of amplified cDNAs resulting from YAMAT-seq of total RNAs from MCF-7, SK-BR-3 and BT-20 cells. The result of one of the triplicate, Replicate 1 (R1), from each cell line is shown. The region designated with a gray line was subjected to gel-purification and Illumina sequencing. (B) Flow diagram of the read numbers filtered by the indicated sequence analyses. The result of R1 from each cell line is shown, whereas the results of R2 and R3 are shown in Supplementary Figure S1. (C) Spearman's rank correlation analysis using tRNA reads of the technical triplicates (R1–R3) from each cell line. The results using the reads of all tRNAs (left), cyto tRNAs (middle) and mt tRNAs (right) are shown. The color bar indicates the strength of Spearman's correlation coefficient.

Northern blot analysis

Northern blot analysis was performed as described previously (44). Briefly, total RNA (500 ng) was resolved by 12% PAGE containing 7 M urea, transferred to Hybond N+ membranes (GE Healthcare) and hybridized to 5΄-end labeled antisense probes (cyto tRNALysCUU: 5΄-GTCTCATGCTCTACCGACT-3΄; cyto tRNAAlaAGC: 5΄-GCGCTCTACCACTGAGCTA-3΄; and mt tRNAValUAC: 5΄-GTGTTAAGCTACACTCTG-3΄). Typhoon 9400 and ImageQuant ver. 5.2 (GE Healthcare) were used for storage phosphor autoradiography visualization and quantification.

RESULTS AND DISCUSSION

Design scheme of YAMAT-seq for high-throughput mature tRNA sequencing

We designed YAMAT-seq to selectively and efficiently sequence mature tRNAs bearing conserved protruding 3΄-ends comprising a 4-nt sequence of the trinucleotides 5΄-CCA-3΄ and a preceding discriminator base (5΄-NCCA-3΄). This method includes the following four steps (Figure 1). First, amino acids at the 3΄-ends of mature aminoacylated tRNAs are removed by incubating total RNA in a high pH buffer (deacylation treatment). Second, a DNA/RNA hybrid Y-shaped adapter (Y-AD) is specifically hybridized and ligated to the deacylated tRNAs via Rnl2 nick ligation. Third, Y-AD-tRNA ligation products are amplified by RT-PCR. Fourth, the resultant cDNAs are gel-purified and subjected to Illumina sequencing.
Figure 1.

Schematic representation of mature tRNA sequencing by YAMAT-seq. Initially, amino acids at the 3΄-ends of mature aminoacylated tRNAs are removed by deacylation treatment. A DNA/RNA hybrid Y-shaped adapter is then specifically hybridized to these deacylated mature tRNAs. The bold line regions of the adapter contain identical sequences to those used for Illumina sequencing. Following hybridization, Rnl2 ligates nicks between the adapter and mature tRNA, and the resultant ligation product is amplified, gel-purified and subjected to Illumina sequencing.

Schematic representation of mature tRNA sequencing by YAMAT-seq. Initially, amino acids at the 3΄-ends of mature aminoacylated tRNAs are removed by deacylation treatment. A DNA/RNA hybrid Y-shaped adapter is then specifically hybridized to these deacylated mature tRNAs. The bold line regions of the adapter contain identical sequences to those used for Illumina sequencing. Following hybridization, Rnl2 ligates nicks between the adapter and mature tRNA, and the resultant ligation product is amplified, gel-purified and subjected to Illumina sequencing. Y-AD comprises two strands containing either Illumina 5΄-adapter sequences (Y-5΄-AD) or 3΄-adapter sequences (Y-3΄-AD) (Figure 1). Y-5΄-AD contains 5΄-OH and 3΄-OH ends and is composed of DNA, except for the two 3΄-terminal RNAs. Y-3΄-AD contains 5΄-P and 3΄-OH ends and its entire nucleotides are DNA. The two united strands form a 4-nt protruding end, 5΄-TGGN-3΄, which recognizes and hybridizes the common protruding 3΄-end of mature tRNAs (5΄-NCCA-3΄). Because the tRNA discriminator base can comprise any four nucleotides, four different Y-5΄-ADs, containing 3΄-terminal A, G, C or U, are mixed and used to capture all mature tRNA species. Hybridization unites the tRNA acceptor stem and Y-AD stem to form long double-stranded nucleotides containing two nicks: ‘Y-5΄-AD-3΄/5΄-tRNA’ and ‘tRNA-3΄/5΄-Y-3΄-AD’. The respective nick structures are ‘RNA-OH-3΄/5΄-P-RNA’ and ‘RNA-OH-3΄/5΄-P-DNA’, and both are efficient substrates for Rnl2 ligation (32–34). No significant ligation efficiency bias is expected between different nucleotides (45). Because the Rnl2 ligation products contain Illumina adapter sequences, they can be amplified by RT-PCR using an Illumina kit. The dependency of RT-PCR amplification on Rnl2 nick ligation allows highly specific amplification and sequencing of mature tRNAs among total RNAs.

Efficient one-step adapter ligation in YAMAT-seq drastically improves mature tRNA amplification efficiency

The conventional RNA-seq method utilizes two-step ligation reactions in which single-stranded 5΄- and 3΄-adapters (5΄- and 3΄-AD) are ligated individually to the respective 5΄- and 3΄-ends of tRNAs (Figure 2A). To examine the efficiency of these ligation reactions toward mature tRNAs, synthetic human cyto tRNAAspGUC was subjected to 3΄-AD ligation followed by 5΄-AD ligation, using the Illumina TruSeq Small RNA Sample Preparation Kit. Real-Time qRT-PCR quantification of the ligated products revealed 5΄- and 3΄-AD ligation efficiencies of 0.15% and 61.3%, respectively (Figure 2B). The extremely inefficient ligation between 5΄-AD and 5΄-end of tRNA might be attributable to the rigid terminal structure of the mature tRNA, in which 5΄-end forms base-pairs with 3΄-end to form an acceptor stem. The relatively high 3΄-AD ligation efficiency was likely attributable to the protruding 3΄-ends of tRNAs that would not be much affected by rigid tRNA structure.
Figure 2.

Adapter-tRNA ligation efficiencies with a conventional method and YAMAT-seq. (A) Schematic representation of the adapter ligation reactions used in the conventional and YAMAT-seq methods. In the conventional method, tRNA was subjected to 3΄-adapter (3΄-AD) and 5΄-adapter (5΄-AD) ligations catalyzed by truncated Rnl2 and Rnl1, respectively, using the Illumina TruSeq Small RNA Sample Preparation Kit. In contrast, in the YAMAT-seq method, a Y-shaped adapter (Y-AD) is ligated to a tRNA by Rnl2. (B) Ligation products of synthetic human cyto tRNAAspGUC and 5΄-AD or 3΄-AD were quantified by real-time qRT-PCR. The amounts of ligation products were determined based on standard curves, and ligation efficiencies were calculated as percentages of ligated tRNA versus input tRNA (set as 100%). Each data set represents the average of three independent experiments with bars showing the SD. (C) Ligation products of the indicated native cyto tRNAs in BT-474 total RNA and 5΄-AD or 3΄-AD were quantified by real-time qRT-PCR. Each data set represents the average Ct values of three independent experiments with bars showing the SD. (D) Image of 8% native polyacrylamide gel electrophoresis (PAGE) of amplified cDNAs resulting from BT-474 total RNA sequencing by conventional and YAMAT-seq procedures. The total adapter lengths of the conventional and YAMAT-seq procedures are 118 and 140 bp, respectively. The YAMAT-seq-derived cDNAs in the designated Bands 1–3 regions were gel-purified and subjected to cloning; the identified sequences are shown in Supplementary Table S1. The cDNAs in regions designated with gray lines are expected to contain tRNA fraction.

Adapter-tRNA ligation efficiencies with a conventional method and YAMAT-seq. (A) Schematic representation of the adapter ligation reactions used in the conventional and YAMAT-seq methods. In the conventional method, tRNA was subjected to 3΄-adapter (3΄-AD) and 5΄-adapter (5΄-AD) ligations catalyzed by truncated Rnl2 and Rnl1, respectively, using the Illumina TruSeq Small RNA Sample Preparation Kit. In contrast, in the YAMAT-seq method, a Y-shaped adapter (Y-AD) is ligated to a tRNA by Rnl2. (B) Ligation products of synthetic human cyto tRNAAspGUC and 5΄-AD or 3΄-AD were quantified by real-time qRT-PCR. The amounts of ligation products were determined based on standard curves, and ligation efficiencies were calculated as percentages of ligated tRNA versus input tRNA (set as 100%). Each data set represents the average of three independent experiments with bars showing the SD. (C) Ligation products of the indicated native cyto tRNAs in BT-474 total RNA and 5΄-AD or 3΄-AD were quantified by real-time qRT-PCR. Each data set represents the average Ct values of three independent experiments with bars showing the SD. (D) Image of 8% native polyacrylamide gel electrophoresis (PAGE) of amplified cDNAs resulting from BT-474 total RNA sequencing by conventional and YAMAT-seq procedures. The total adapter lengths of the conventional and YAMAT-seq procedures are 118 and 140 bp, respectively. The YAMAT-seq-derived cDNAs in the designated Bands 1–3 regions were gel-purified and subjected to cloning; the identified sequences are shown in Supplementary Table S1. The cDNAs in regions designated with gray lines are expected to contain tRNA fraction. The ability of YAMAT-seq to capture the mature tRNA repertoire relies on the efficient and specific ligation of Y-AD to mature tRNAs. Hybridization and Rnl2 ligation of Y-AD to synthetic human cyto tRNAAspGUC yielded ligation efficiencies of 22.8% and 91.8% for Y-5΄-AD and Y-3΄-AD, respectively (Figure 2A and B). The difference in efficiency is attributable to the substrate preference of Rnl2, as demonstrated previously by the finding that Rnl2-catalyzed ligation efficiency is higher at a nick of 5΄-RNA-OH/P-DNA-3΄ with a complementary DNA strand than at a nick of double-stranded RNA (31). Compared with the two-step linear adapter ligation reaction of the conventional method, the one-step Y-AD ligation reaction in YAMAT-seq improved the efficiency of 5΄-end ligation by ∼150 fold. Together with the 1.5-fold enhancement in 3΄-end ligation efficiency, YAMAT-seq increased the efficiency of adapter ligation to mature tRNA by more than 200-fold from that of the conventional method. To further compare ligation efficiencies toward native mature tRNAs, total RNA extracted from BT-474 breast cancer cells was subjected to the conventional and YAMAT-seq methods. Real-time qRT-PCR reactions of the adapter-ligated cyto tRNAAspGUC, tRNAGluCUC, tRNALysCUU and tRNAPheGAA commonly yielded much lower Ct values in YAMAT-seq method than in conventional method, and the differences in Ct values were especially apparent for 5΄-end ligation (Figure 2C). These results confirmed that YAMAT-seq procedure greatly increased the adapter ligation efficiency to native mature tRNAs, especially regarding their recessed 5΄-end, compared to the conventional method. In the conventional RNA-seq method, poor adapter ligation efficiency would be expected to cause the inefficient amplification of cDNAs derived from mature tRNAs. Indeed, despite the abundant existence of mature tRNAs as observed by PAGE (data not shown), Illumina kit-based cDNA amplification from the BT-474 total RNA yielded only faint bands with the length of ∼190 bp (adapter's length: 118 bp) in the region corresponding to the lengths of mature tRNAs (Figure 2D). In contrast, consistent with the improvement in adapter ligation efficiency, YAMAT-seq specifically amplified abundant cDNA bands with lengths of 170–230 bp (via comparison with marker DNAs in native PAGE; Figure 2D) from the same BT-474 total RNA. Given the length of Y-AD, the inserted RNA lengths were estimated to be 30–90 nt, whose range was wider than expected for the specific amplification of mature tRNAs. However, as shown in Supplementary Table S1, several clones obtained from cDNA bands in three major regions (Bands 1–3; Figure 2D) were unanimously derived from mature tRNAs, suggesting that YAMAT-seq efficiently and specifically amplified mature tRNAs; in addition, the band sizes of cDNAs of Y-AD-ligated mature tRNAs may be shifted relative to marker cDNAs, a characteristic that is likely due to the highly-structured sequences of these molecules. Taken together, the rigid mature tRNA structure prevents adapter ligation in conventional RNA-seq methods, thereby causing mature tRNAs to be inefficiently amplified. YAMAT-seq drastically improves mature tRNA amplification efficiency by employing efficient one-step Y-AD ligation scheme.

High specificity, sensitivity and reproducibility of YAMAT-seq

Given the success of mature tRNA amplification from BT-474 cells, we further applied the YAMAT-seq method to total RNAs isolated from three additional breast cancer cell lines, MCF-7, SK-BR-3 and BT-20, and each performed in triplicate. As shown in Figure 4A, amplified cDNAs from the total RNAs of different cell lines exhibited similar band patterns on native PAGE, indicating the feasibility and general versatility of YAMAT-seq for the amplification of mature tRNAs among total RNAs from various cells. Illumina next-generation sequencing of the amplified cDNA bands yielded ∼15–16 million raw reads, high percentage (>96%) of which were actually mapped to tRNA space (Figure 4B and Supplementary Figure S1). This is a significant improvement compared to the conventional RNA-seq method in which the Illumina sequencing reads of BT-474 cDNAs, derived from tRNA region (Figure 2D), showed only 25% mapping rate to tRNA space (data not shown). We further focused on mature tRNA-derived reads defined by the presence of 3΄-terminal CCA sequences and lengths of 60–87 nt, from which approximately 92–95% of total reads were eventually extracted. Whole genome mapping ensured that almost all extracted sequences were actually derived from tRNA space. The high rate of CCA-containing mature tRNA-derived sequences demonstrated the successful execution of selective mature tRNA amplification and sequencing with this YAMAT-seq scheme.
Figure 4.

YAMAT-seq can quantitatively estimate identified mature tRNAs. (A) Cyto tRNALysCUU, cyto tRNAAlaAGC and mt tRNAValUAC were detected in the indicated cells by Northern blots. 5S rRNA was detected in the respective membranes as a loading control. (B) Quantification of Northern blot bands. The band intensities of tRNAs were normalized by those of 5S rRNA. Each data set represents the average amounts of three independent experiments, relative to those in the MCF-7 cells (set as 1), with bars showing the SD. (C) Reads per million of the three tRNAs present in YAMAT-seq libraries from the indicated cells. Each data set represents the average of technical triplicates with bars showing the SD.

Regarding the mapped isoacceptors with different anticodon sequences, all 22 mt tRNAs and 46 of 49 cyto tRNAs (93.9%) were successfully detected from all of the three cell lines (Supplementary Table S2 and S3). The high coverage of the identified tRNA species for all the examined cell lines demonstrated the sensitivity and general versatility of YAMAT-seq for mature tRNA sequencing analyses. Cyto tRNAAsnAUU and tRNATyrAUA were not detected in all of the libraries. In addition, cyto tRNAIleGAU and tRNAIleUAU were either not detected or detected as very low read numbers. These tRNAs may not be expressed or be expressed at extremely low levels. Alternatively, these tRNAs may contain several modifications that strongly interfere with reverse transcription. To examine the reproducibility of YAMAT-seq, nine sequence libraries comprising technical triplicates from the respective three cell lines were subjected to statistical correlation analyses. Scatter plots of the sequence reads from any two libraries in the triplicate unexceptionally showed an excellent correlation with R2 of 0.99 or better (Figure S2). Furthermore, Spearman's rank correlation analysis using all the libraries showed strong correlations between libraries in each technical triplicate, while libraries from different cell lines showed lower correlations (Figure 3C). The consistency among the technical triplicates indicates the high reproducibility of YAMAT-seq for mature tRNA sequencing analyses. Amplification and sequencing of mature tRNAs using YAMAT-seq. (A) Image of 8% native PAGE of amplified cDNAs resulting from YAMAT-seq of total RNAs from MCF-7, SK-BR-3 and BT-20 cells. The result of one of the triplicate, Replicate 1 (R1), from each cell line is shown. The region designated with a gray line was subjected to gel-purification and Illumina sequencing. (B) Flow diagram of the read numbers filtered by the indicated sequence analyses. The result of R1 from each cell line is shown, whereas the results of R2 and R3 are shown in Supplementary Figure S1. (C) Spearman's rank correlation analysis using tRNA reads of the technical triplicates (R1–R3) from each cell line. The results using the reads of all tRNAs (left), cyto tRNAs (middle) and mt tRNAs (right) are shown. The color bar indicates the strength of Spearman's correlation coefficient.

High quantitative ability of YAMAT-seq

To analyze whether the obtained sequence read numbers reflected the actual quantities of expressed mature tRNAs, we selected three tRNAs, cyto tRNALysCUU, cyto tRNAAlaAGC and mt tRNAValUAC, whose read numbers have wide varieties (∼230–620,000 reads per million). The amounts of the three tRNAs among identical amounts of total RNAs from MCF-7, SK-BR-3 and BT-20 cells were examined by Northern blot. As shown in Figure 4A–C, their relative abundances via Northern blots were consistent with those estimated by YAMAT-seq read numbers, indicating the ability of the YAMAT-seq method to provide quantitative information regarding mature tRNA expression among different samples. YAMAT-seq can quantitatively estimate identified mature tRNAs. (A) Cyto tRNALysCUU, cyto tRNAAlaAGC and mt tRNAValUAC were detected in the indicated cells by Northern blots. 5S rRNA was detected in the respective membranes as a loading control. (B) Quantification of Northern blot bands. The band intensities of tRNAs were normalized by those of 5S rRNA. Each data set represents the average amounts of three independent experiments, relative to those in the MCF-7 cells (set as 1), with bars showing the SD. (C) Reads per million of the three tRNAs present in YAMAT-seq libraries from the indicated cells. Each data set represents the average of technical triplicates with bars showing the SD. Because the tRNAs whose genes have high copy numbers in the genome tended to be abundant in YAMAT-seq data (Figure 5A), we reasoned that YAMAT-seq read numbers of each tRNA in the same library could reflect the actual abundance. To examine this, the amounts of cyto tRNALysCUU, cyto tRNAAspGUC, mt tRNAAlaUGC and mt tRNAGluUUC in MCF-7 total RNA were determined based on Northern blot analyses in which standard curves were obtained from corresponding synthetic tRNAs (Supplementary Figure S3A and S3B). Rough correlation between the determined amounts and YAMAT-seq read numbers (Supplementary Figure S3B) suggested that the read numbers contain quantitative information to some extent. However, the correlation did not show perfect linearity, probably because the efficiency of YAMAT-seq procedures, especially at the step of reverse transcription, is dependent on the structure and modification state of each tRNA.
Figure 5.

Mature tRNA repertoire and abundances identified using YAMAT-seq in three cell lines. (A) Scatter plots of cyto tRNAs for their YAMAT-seq reads per million from the indicated cells and copy number in human genome. The average reads per million of technical triplicate were used. (B) Reads per million of mt tRNAs in YAMAT-seq libraries from the indicated cells. Each data set represents the average of technical triplicates with bars showing the SD. tRNAs were aligned in order of the read numbers from MCF-7 cells. (C) Heatmap and dendrogram showing the Euclidean distance of tRNA expression pattern among YAMAT-seq libraries from the indicated cells. tRNAs with <1.0 reads per million (cyto tRNAIleGAU, tRNAIleUAU and tRNATrpCCA) were omitted, and the average reads per million of technical triplicate were used.

Mature tRNA repertoire and abundances identified using YAMAT-seq in three cell lines. (A) Scatter plots of cyto tRNAs for their YAMAT-seq reads per million from the indicated cells and copy number in human genome. The average reads per million of technical triplicate were used. (B) Reads per million of mt tRNAs in YAMAT-seq libraries from the indicated cells. Each data set represents the average of technical triplicates with bars showing the SD. tRNAs were aligned in order of the read numbers from MCF-7 cells. (C) Heatmap and dendrogram showing the Euclidean distance of tRNA expression pattern among YAMAT-seq libraries from the indicated cells. tRNAs with <1.0 reads per million (cyto tRNAIleGAU, tRNAIleUAU and tRNATrpCCA) were omitted, and the average reads per million of technical triplicate were used. Each cell line exhibited a distinctive mature tRNA expression signature, and each mature tRNA species exhibited a wide variety of distinctive expression patterns in different cells (Figure 5A–C). The analyses of Spearman's correlation imply that the expression profiles of mt tRNAs could be more variable among cell lines than those of cyto tRNAs (Figure 3C). The wide variations in mature tRNA expression patterns have been suggested in previous reports (6–8), and the current findings suggest the usefulness of YAMAT-seq for identifying such variations.

Advantages of the YAMAT-seq method

YAMAT-seq circumvents the issue of inefficient adapter ligation, a characteristic of conventional RNA sequencing methods, by employing the efficient ligation of Y-AD to mature tRNAs via Rnl2. In addition to efficiency, YAMAT-seq features the advantages of convenience and specificity over previously-reported tRNA sequencing methods that require several steps involving the gel- or HPLC-purification of tRNA fractions or Poly-A-polymerase reaction (28–30). YAMAT-seq requires only 300 ng (3/10 of ligation mixture with 1 μg total RNA was subjected to reverse transcription) of total RNA extracted by standard organic extraction methods, with no purification/concentration steps to obtain the tRNA fraction. After total RNA is subjected to deacylation, Rnl2 ligation to Y-AD and Illumina RT-PCR are the only procedures required for specific mature tRNA amplification, which yields numerous mature tRNA sequences and a high level of coverage of isoacceptors. Given these advantages, we expect that the YAMAT-seq concept will be widely applicable to the identification of mature tRNA expression profiles in cells or tissues from any organism, as well as valuable clinical samples, provided that 300 ng of total RNA is available. Many post-transcriptional modifications existing in mature tRNAs are known to impair reverse transcription (46), leading to bias in sequencing results. However, recent studies have successfully removed some RT-impairing modifications, such as m1A and m1G, using AlkB demethylase and thus have improved the efficiency and quality of the amplification and sequencing of tRNAs and their fragments (30,47). Such demethylase pretreatment of total RNAs would be an option to reduce bias of the sequencing results in YAMAT-seq.

CONCLUSIONS

We have described the development of YAMAT-seq, an efficient and convenient method for the selective amplification and sequencing of mature tRNAs. We have further demonstrated the high specificity of this method for mature tRNAs, high efficiency and sensitivity that cover most isoacceptors in a small amount of total RNA, high reproducibility that yields consistent results from technical triplicates, quantitative capability to estimate relative expression levels, and broad applicability for different cell lines. As our view of tRNA function is greatly being expanded, YAMAT-seq will provide a much-needed convenient and high-throughput method for the analysis of mature tRNA abundance and heterogeneity, which could play important regulatory roles in translation and other biological processes. Click here for additional data file.
  46 in total

1.  Direct comparison of nick-joining activity of the nucleic acid ligases from bacteriophage T4.

Authors:  Desmond R Bullard; Richard P Bowater
Journal:  Biochem J       Date:  2006-08-15       Impact factor: 3.857

2.  Drosophila RNA polymerase III repressor Maf1 controls body size and developmental timing by modulating tRNAiMet synthesis and systemic insulin signaling.

Authors:  Elizabeth J Rideout; Lynne Marshall; Savraj S Grewal
Journal:  Proc Natl Acad Sci U S A       Date:  2012-01-06       Impact factor: 11.205

3.  Exploring the regulation of tRNA distribution on the genomic scale.

Authors:  Kimberly A Dittmar; Evelyn M Mobley; Agnes Jancso Radek; Tao Pan
Journal:  J Mol Biol       Date:  2004-03-12       Impact factor: 5.469

Review 4.  tRNA fragments in human health and disease.

Authors:  Paul Anderson; Pavel Ivanov
Journal:  FEBS Lett       Date:  2014-09-16       Impact factor: 4.124

5.  Kinetic mechanism of nick sealing by T4 RNA ligase 2 and effects of 3'-OH base mispairs and damaged base lesions.

Authors:  Mathieu Chauleau; Stewart Shuman
Journal:  RNA       Date:  2013-10-24       Impact factor: 4.942

6.  Nuclear and mitochondrial tRNA-lookalikes in the human genome.

Authors:  Aristeidis G Telonis; Phillipe Loher; Yohei Kirino; Isidore Rigoutsos
Journal:  Front Genet       Date:  2014-10-08       Impact factor: 4.599

7.  dendextend: an R package for visualizing, adjusting and comparing trees of hierarchical clustering.

Authors:  Tal Galili
Journal:  Bioinformatics       Date:  2015-07-23       Impact factor: 6.937

8.  ARM-seq: AlkB-facilitated RNA methylation sequencing reveals a complex landscape of modified tRNA fragments.

Authors:  Aaron E Cozen; Erin Quartley; Andrew D Holmes; Eva Hrabeta-Robinson; Eric M Phizicky; Todd M Lowe
Journal:  Nat Methods       Date:  2015-08-03       Impact factor: 28.547

9.  Hints of tRNA-Derived Small RNAs Role in RNA Silencing Mechanisms.

Authors:  Maria Rosa Garcia-Silva; Florencia Cabrera-Cabrera; Maria Catalina Güida; Alfonso Cayota
Journal:  Genes (Basel)       Date:  2012-10-10       Impact factor: 4.096

10.  Efficient and quantitative high-throughput tRNA sequencing.

Authors:  Guanqun Zheng; Yidan Qin; Wesley C Clark; Qing Dai; Chengqi Yi; Chuan He; Alan M Lambowitz; Tao Pan
Journal:  Nat Methods       Date:  2015-07-27       Impact factor: 28.547

View more
  39 in total

Review 1.  Role of tRNAs in Breast Cancer Regulation.

Authors:  Nam Hoon Kwon; Jin Young Lee; Sunghoon Kim
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 2.622

2.  Differential expression of human tRNA genes drives the abundance of tRNA-derived fragments.

Authors:  Adrian Gabriel Torres; Oscar Reina; Camille Stephan-Otto Attolini; Lluís Ribas de Pouplana
Journal:  Proc Natl Acad Sci U S A       Date:  2019-04-08       Impact factor: 11.205

Review 3.  Small non-coding RNAs in human cancer: function, clinical utility, and characterization.

Authors:  Zhao Zhang; Jian Zhang; Lixia Diao; Leng Han
Journal:  Oncogene       Date:  2021-01-15       Impact factor: 9.867

4.  Combining tRNA sequencing methods to characterize plant tRNA expression and post-transcriptional modification.

Authors:  Jessica M Warren; Thalia Salinas-Giegé; Guillaume Hummel; Nicole L Coots; Joshua M Svendsen; Kristen C Brown; Laurence Drouard; Daniel B Sloan
Journal:  RNA Biol       Date:  2020-07-25       Impact factor: 4.652

5.  Accurate characterization of Escherichia coli tRNA modifications with a simple method of deep-sequencing library preparation.

Authors:  Ji Wang; Claire Toffano-Nioche; Florence Lorieux; Daniel Gautheret; Jean Lehmann
Journal:  RNA Biol       Date:  2020-07-26       Impact factor: 4.652

Review 6.  Cross-species RNA-seq for deciphering host-microbe interactions.

Authors:  Alexander J Westermann; Jörg Vogel
Journal:  Nat Rev Genet       Date:  2021-02-17       Impact factor: 53.242

7.  LOTTE-seq (Long hairpin oligonucleotide based tRNA high-throughput sequencing): specific selection of tRNAs with 3'-CCA end for high-throughput sequencing.

Authors:  Lieselotte Erber; Anne Hoffmann; Jörg Fallmann; Heike Betat; Peter F Stadler; Mario Mörl
Journal:  RNA Biol       Date:  2019-09-16       Impact factor: 4.652

8.  Exploration of CCA-added RNAs revealed the expression of mitochondrial non-coding RNAs regulated by CCA-adding enzyme.

Authors:  Kamlesh Pawar; Megumi Shigematsu; Phillipe Loher; Shozo Honda; Isidore Rigoutsos; Yohei Kirino
Journal:  RNA Biol       Date:  2019-09-12       Impact factor: 4.652

Review 9.  tRNA Metabolism and Neurodevelopmental Disorders.

Authors:  Ashleigh E Schaffer; Otis Pinkard; Jeffery M Coller
Journal:  Annu Rev Genomics Hum Genet       Date:  2019-05-13       Impact factor: 8.929

10.  The birth of a bacterial tRNA gene by large-scale, tandem duplication events.

Authors:  Gökçe B Ayan; Hye Jin Park; Jenna Gallie
Journal:  Elife       Date:  2020-10-30       Impact factor: 8.140

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.