| Literature DB >> 30691342 |
Cinta Pegueroles1,2, Susana Iraola-Guzmán1,2, Uciel Chorostecki1,2, Ewa Ksiezopolska1,2, Ester Saus1,2, Toni Gabaldón1,2,3.
Abstract
Long non-coding RNAs (lncRNAs) are a heterogeneous class of genes that do not code for proteins. Since lncRNAs (or a fraction thereof) are expected to be functional, many efforts have been dedicated to catalog lncRNAs in numerous organisms, but our knowledge of lncRNAs in non vertebrate species remains very limited. Here, we annotated lncRNAs using transcriptomic data from the same larval stage of four Caenorhabditis species. The number of annotated lncRNAs in self-fertile nematodes was lower than in out-crossing species. We used a combination of approaches to identify putatively homologous lncRNAs: synteny, sequence conservation, and structural conservation. We classified a total of 1,532 out of 7,635 genes from the four species into families of lncRNAs with conserved synteny and expression at the larval stage, suggesting that a large fraction of the predicted lncRNAs may be species specific. Despite both sequence and local secondary structure seem to be poorly conserved, sequences within families frequently shared BLASTn hits and short sequence motifs, which were more likely to be unpaired in the predicted structures. We provide the first multi-species catalog of lncRNAs in nematodes and identify groups of lncRNAs with conserved synteny and expression, that share exposed motifs.Entities:
Keywords: Lncrna; motifs; secondary structure; synteny
Mesh:
Substances:
Year: 2019 PMID: 30691342 PMCID: PMC6380332 DOI: 10.1080/15476286.2019.1572438
Source DB: PubMed Journal: RNA Biol ISSN: 1547-6286 Impact factor: 4.652
Figure 1.Phylogeny of the Caenorhabditis species included in our study. dS estimates were obtained from Cutter [36].
Figure 2.Box-plots showing length (a), number of exons (b), GC content (c), and expression values (d) for the annotated protein-coding and lncRNA genes in all studied species. In Figure 2(d) we discarded genes with log2(TPM)<0.
Figure 3.Venn diagram showing the number of lncRNA families (in bold) and genes for the blast (a), secondary structure (b) and syntenic classification (c). C. elegans in blue, C. briggsae in dark pink, C. remanei in orange and C. brenneri in light pink.