| Literature DB >> 34723326 |
Marco Di Marsico1, Andreu Paytuvi Gallart2, Walter Sanseverino2, Riccardo Aiese Cigliano2.
Abstract
The Green Non-Coding Database (GreeNC) is one of the reference databases for the study of plant long non-coding RNAs (lncRNAs). Here we present our most recent update where 16 species have been updated, while 78 species have been added, resulting in the annotation of more than 495 000 lncRNAs. Moreover, sequence clustering was applied providing information about sequence conservation and gene families. The current version of the database is available at: http://greenc.sequentiabiotech.com/wiki2/Main_Page.Entities:
Mesh:
Substances:
Year: 2022 PMID: 34723326 PMCID: PMC8728176 DOI: 10.1093/nar/gkab1014
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Overview of the in-house developed computational pipeline for lncRNA annotation, which consists of script 1 (A) and script 2 (B).
Figure 2.A snapshot of a Cucumis melo entry from the GreeNC database. (A) Header, to navigate through the website and access to the tools and the pages of the species; (B) table of gene information reporting genomic coordinates, genome version, the source of the genome assembly and if the gene encodes at least one coding transcript; (C) table of transcript features reporting the kind of lncRNA (low-/high-confidence), if it is a precursor of miRNAs, length, orthologous group, sequence and links to get the Open Reading Frame (ORF), the Coding Potential, the folding energy and the GC content; (D) an optional table that provides links to other databases, when applicable, and giving information about the version of the database and the e-value of the match; (E) table of transcripts belonging to the same orthogroup reporting the kind of lncRNA, length, folding energies (AMFE, MFEI), GC content (F) a schematic representation of the gene and transcript models.