| Literature DB >> 32160106 |
Zhikang Hu1,2,3, Tao Lyu1,3, Chao Yan1,4, Yupeng Wang1,2, Ning Ye2, Zhengqi Fan3, Xinlei Li3, Jiyuan Li3, Hengfu Yin1,3.
Abstract
Direct single-molecule sequencing of full-length transcripts allows efficient identification of gene isoforms, which is apt to alternative splicing (AS), polyadenylation, and long non-coding RNA analyses. However, the identification of gene isoforms and long non-coding RNAs with novel regulatory functions remains challenging, especially for species without a reference genome. Here, we present a comprehensive analysis of a combined long-read and short-read transcriptome sequencing in Camellia japonica. Through a novel bioinformatic pipeline of reverse-tracing the split-sites, we have uncovered 257,692 AS sites from 61,838 transcripts; and 13,068 AS isoforms have been validated by aligning the short reads. We have identified the tissue-specific AS isoforms along with 6,373 AS events that were found in all tissues. Furthermore, we have analysed the polyadenylation (polyA) patterns of transcripts, and found that the preference for polyA signals was different between the AS and non-AS transcripts. Moreover, we have predicted the phased small interfering RNA (phasiRNA) loci through integrative analyses of transcriptome and small RNA sequencing. We have shown that a newly evolved phasiRNA locus from lipoxygenases generated 12 consecutive 21 bp secondary RNAs, which were responsive to cold and heat stress in Camellia. Our studies of the isoform transcriptome provide insights into gene splicing and functions that may facilitate the mechanistic understanding of plants.Entities:
Keywords: Camellia ; Alternative splicing; lipoxygenase; phased small interfering RNA; polyadenylation; single-molecule sequencing
Mesh:
Substances:
Year: 2020 PMID: 32160106 PMCID: PMC7549672 DOI: 10.1080/15476286.2020.1738703
Source DB: PubMed Journal: RNA Biol ISSN: 1547-6286 Impact factor: 4.652