| Literature DB >> 30224646 |
Jessime M Kirk1,2, Susan O Kim1,3, Kaoru Inoue1,3, Matthew J Smola4,5, David M Lee1,6, Megan D Schertzer1,6, Joshua S Wooten1,6, Allison R Baker1,7, Daniel Sprague1,8, David W Collins9, Christopher R Horning9, Shuo Wang9, Qidi Chen9, Kevin M Weeks4, Peter J Mucha10, J Mauro Calabrese11.
Abstract
The functions of most long non-coding RNAs (lncRNAs) are unknown. In contrast to proteins, lncRNAs with similar functions often lack linear sequence homology; thus, the identification of function in one lncRNA rarely informs the identification of function in others. We developed a sequence comparison method to deconstruct linear sequence relationships in lncRNAs and evaluate similarity based on the abundance of short motifs called k-mers. We found that lncRNAs of related function often had similar k-mer profiles despite lacking linear homology, and that k-mer profiles correlated with protein binding to lncRNAs and with their subcellular localization. Using a novel assay to quantify Xist-like regulatory potential, we directly demonstrated that evolutionarily unrelated lncRNAs can encode similar function through different spatial arrangements of related sequence motifs. K-mer-based classification is a powerful approach to detect recurrent relationships between sequence and function in lncRNAs.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30224646 PMCID: PMC6262761 DOI: 10.1038/s41588-018-0207-8
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330