Literature DB >> 29554332

A Model-Based Approach for Identifying Functional Intergenic Transcribed Regions and Noncoding RNAs.

John P Lloyd1, Zing Tsung-Yeh Tsai2, Rosalie P Sowers3, Nicholas L Panchy4, Shin-Han Shiu1,4,5.   

Abstract

With advances in transcript profiling, the presence of transcriptional activities in intergenic regions has been well established. However, whether intergenic expression reflects transcriptional noise or activity of novel genes remains unclear. We identified intergenic transcribed regions (ITRs) in 15 diverse flowering plant species and found that the amount of intergenic expression correlates with genome size, a pattern that could be expected if intergenic expression is largely nonfunctional. To further assess the functionality of ITRs, we first built machine learning models using Arabidopsis thaliana as a model that accurately distinguish functional sequences (benchmark protein-coding and RNA genes) and likely nonfunctional ones (pseudogenes and unexpressed intergenic regions) by integrating 93 biochemical, evolutionary, and sequence-structure features. Next, by applying the models genome-wide, we found that 4,427 ITRs (38%) and 796 annotated ncRNAs (44%) had features significantly similar to benchmark protein-coding or RNA genes and thus were likely parts of functional genes. Approximately 60% of ITRs and ncRNAs were more similar to nonfunctional sequences and were likely transcriptional noise. The predictive framework established here provides not only a comprehensive look at how functional, genic sequences are distinct from likely nonfunctional ones, but also a new way to differentiate novel genes from genomic regions with noisy transcriptional activities.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29554332     DOI: 10.1093/molbev/msy035

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  5 in total

Review 1.  Plant Noncoding RNAs: Hidden Players in Development and Stress Responses.

Authors:  Yu Yu; Yuchan Zhang; Xuemei Chen; Yueqin Chen
Journal:  Annu Rev Cell Dev Biol       Date:  2019-08-12       Impact factor: 13.827

2.  We simply cannot go on being so vague about 'function'.

Authors:  W Ford Doolittle
Journal:  Genome Biol       Date:  2018-12-18       Impact factor: 13.583

3.  Characterization of novel pollen-expressed transcripts reveals their potential roles in pollen heat stress response in Arabidopsis thaliana.

Authors:  Nicholas Rutley; Laetitia Poidevin; Tirza Doniger; Richard L Tillett; Abhishek Rath; Javier Forment; Gilad Luria; Karen A Schlauch; Alejandro Ferrando; Jeffery F Harper; Gad Miller
Journal:  Plant Reprod       Date:  2021-01-18       Impact factor: 3.767

4.  Genome-Wide ChIP-seq and RNA-seq Analyses of STAT3 Target Genes in TLRs Activated Human Peripheral Blood B Cells.

Authors:  Jing Wu; Ying-Ying Jin; Ruo-Lan Gong; Fan Yang; Xiao-Ya Su; Tong-Xin Chen
Journal:  Front Immunol       Date:  2022-03-08       Impact factor: 7.561

5.  Indole-3-acetic acid has long-term effects on long non-coding RNA gene methylation and growth in Populus tomentosa.

Authors:  Dong Ci; Min Tian; Yuepeng Song; Qingzhang Du; Mingyang Quan; Anran Xuan; Jianyuan Yu; Zixuan Yuan; Deqiang Zhang
Journal:  Mol Genet Genomics       Date:  2019-07-19       Impact factor: 3.291

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.