| Literature DB >> 34239893 |
Lingyu Guan1, Vincent Lam2, Andrey Grigoriev1.
Abstract
Accumulating evidence has suggested that tRNA-derived fragments (tRFs) could be loaded to Argonaute proteins and function as regulatory small RNAs. However, their mode of action remains largely unknown, and investigations of their binding mechanisms have been limited, revealing little more than microRNA-like seed regions in a handful of tRFs and a few targets. Here, we identified such regions of potential interaction on a larger scale, using in vivo formed hybrids of guides and targets in crosslinked chimeric reads in two orientations. We considered "forward pairs" (with guides located on the 5' ends and targets on the 3' ends of hybrids) and "reverse pairs" (opposite orientation) and compared them as independent sets of biological constructs. We observed intriguing differences between the two chimera orientations, including the paucity of tRNA halves and abundance of polyT-containing targets in forward pairs. We found a total of 197 quality-ranked motifs supported by ∼120,000 tRF-mRNA chimeras, with 103 interacting motifs common in forward and reverse pairs. By analyzing T→C conversions in human and mouse PAR-CLIP datasets, we detected Argonaute crosslinking sites in tRFs, conserved across species. We proposed a novel model connecting the formation of asymmetric pairs in two sets to the potential binding mechanisms of tRFs, involving the identified interaction motifs and crosslinking sites to Argonaute proteins. Our results suggest the way forward for further experimental elucidation of tRF-binding mechanisms.Entities:
Keywords: Argonaute; binding motif; tRNA; tRNA fragments; target genes
Year: 2021 PMID: 34239893 PMCID: PMC8258673 DOI: 10.3389/fmolb.2021.647449
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
Numbers of reads supporting hybrids with specific types of targets in forward and reverse pairs of tRFs.
| Guides | All aligned reads | mRNA | rRNA | miRNA | Other | |
|---|---|---|---|---|---|---|
| Forward pairs | tRF-5p | 37,459/662,433 | 12,282 (5,754) | 20,065 (15,535) | 2,606 (2,297) | 2,506 (1,075) |
| tRF-5i | 25,531/57,921 | 5,190 (2,840) | 19,556 (16,654) | 349 (295) | 436 (184) | |
| tRF-3p | 909,963/1,599,089 | 75,163 (54,562) | 809,273 (794,288) | 6,261 (5,701) | 19,266 (16,271) | |
| tRF-3i | 4,754/45,863 | 1,095 (671) | 2,859 (2,250) | 595 (547) | 205 (109) | |
| tRF-3t | 131,129/281,072 | 10,425 (7,550) | 115,997 (112,457) | 1,903 (1,686) | 2,804 (1,981) | |
| tRF-i | 89,822/2,048,367 | 16,314 (10,100) | 65,365 (53,181) | 4,515 (3,841) | 3,628 (2,237) | |
| Reverse pairs | tRF-5p | 46,059/210,383 | 10,046 (7,531) | 29,460 (26,318) | 4,425 (4,092) | 2,128 (1,823) |
| tRF-5i | 53,354/188,008 | 9,747 (7,874) | 40,780 (39,000) | 1,890 (1,785) | 937 (788) | |
| tRF-3p | 170,215/1,667,846 | 48,859 (39,734) | 101,831 (92,839) | 4,171 (3,740) | 15,354 (13,368) | |
| tRF-3i | 15,090/47,882 | 895 (660) | 13,877 (13,110) | 195 (168) | 123 (80) | |
| tRF-3t | 23,720/222,031 | 5,404 (4,157) | 16,145 (13,701) | 930 (784) | 1,241 (957) | |
| tRF-i | 123,266/1,194,325 | 11,586 (8,115) | 104,395 (94,232) | 4,167 (3,569) | 3,118 (2,312) |
Notation for tRF types.
tRF-5p start in the first 5 nt of tRNA genes and end before the anticodon loop.
tRF-5i start in the first 5 nt of tRNA genes and end in the anticodon loop.
tRF-3p end in the last 5 nt of tRNA genes with CCA additions and start after the anticodon loop.
tRF-3i end in the last 5 nt of tRNA genes with CCA additions and start in the anticodon loop.
tRF-3t end at least 3 nt downstream of tRNA genes and also include fragments typically called tRF-1 (starting after the end of tRNA genes).
tRF-i do not fit into the above categories and start and end in the internal regions of tRNA genes.
In general, a notation for each tRF is provided as , where X stands for the tRNA gene origin: [N] for nuclear, [M] for mitochondrial, or [NM] for cases when nuclear and mitochondrial cannot be distinguished. Start and end correspond to the coordinates on the tRNA gene, including introns, if present in tRNA. CCA is assumed added to tRF-3p, while 3′ trailer is added to tRF-3t.
Numbers before slash indicate the interactions of tRFs and annotated targets (mRNAs, rRNAs, miRNAs, and other ncRNAs). Numbers after slash indicate all CLASH reads containing tRFs.
Numbers in parentheses correspond to the hybrids supported by at least two reads.
Excluding tRNA.
FIGURE 1Comparison of the tRFs identified in forward and reverse pairs. The tRF length distributions are shown for different types of tRFs identified in (A) forward pairs and (B) reverse pairs. (C) Relative frequencies of tRFs common to forward and reverse chimeras. The 10 most abundant tRFs with the highest differential frequencies in the two orientations (fold change >5) are labeled.
FIGURE 2PCA plots of dinucleotide composition. Plots of the first vs. second principal components calculated from dinucleotide frequencies (counting all overlapping dinucleotides) are shown for (A) tRF–mRNA chimeric sequences, (B) tRF sequences, and (C) mRNA target sequences that were identified in forward pairs (blue points) and reverse pairs (red points). Red lines separate the region enriched with polyT runs from the rest of the sequences.
FIGURE 3Analyses of Ago-crosslinking and target binding sites. Histograms show the cumulative bitscores of motifs in (A) forward and (B) reverse pairs relative to the most frequent T→C conversion sites identified for all motif-containing tRFs in human cells (blue) and identical tRFs in mouse cells (red). Bitscores (heights of logo symbols, reflecting the complementary match frequencies between tRFs and target nucleotides) for every position in the motif were added up in respective positions of all motif-containing tRFs relative to their conversion sites as a measure of target binding frequencies in these positions. tRFs are aligned to the most frequent conversion sites and shown from 5′ to 3′. (C) icSHAPE scores of target sites relative to the predicted interacting motifs. Activity scores are averaged for every nucleotide of all unique mRNA targets and separated into single-stranded (red) and double-stranded (blue) regions formed at the motif site. mRNA targets are aligned to the start of motif-binding sites and shown from 3′ to 5'. (D) A model describing how Ago-crosslinking and target binding results may be connected to the observed asymmetries of tRF and target pairs and their motifs. (E) Motifs found for CysGCA tRF in the forward (top) and reverse (bottom) chimeras, see the legend of Supplementary Figure S5 for details.