| Literature DB >> 33214769 |
Girik Malik1, Tanu Agarwal1, Utkarsh Raj1, Vijayaraghava Seshadri Sundararajan1, Obul Reddy Bandapalli1, Prashanth Suravajhala1.
Abstract
Hypothetical Proteins [HP] are the transcripts predicted to be expressed in an organism, but no evidence of it exists in gene banks. On the other hand, long non-coding RNAs [lncRNAs] are the transcripts that might be present in the 5' UTR or intergenic regions of the genes whose lengths are above 200 bases. With the known unknown [KU] regions in the genomes rapidly existing in gene banks, there is a need to understand the role of open reading frames in the context of annotation. In this commentary, we emphasize that HPs could indeed be the predecessors of lncRNAs.Entities:
Keywords: Hypothetical proteins; annotation; aptamers; functional genomics; lncRNA; transcripts
Year: 2020 PMID: 33214769 PMCID: PMC7604745 DOI: 10.2174/1389202921999200611155418
Source DB: PubMed Journal: Curr Genomics ISSN: 1389-2029 Impact factor: 2.236
Fig. (1)The figure showing the difference between a known-known and known unknown protein. Precisely, the characteristic domains such as domains of unknown function (DUF) or ORFs unrelated or KIAA domains are associated with hypothetical proteins, which usually are present in the c terminal region of the protein. We show a classic example of how CAC92745, an HP, could be annotated as a lncRNA, viz. LINC00208. (A higher resolution / colour version of this figure is available in the electronic copy of the article).