Yi Zhao, Hui Li, Shuangsang Fang, Yue Kang, Wei Wu, Yajing Hao, Ziyang Li, Dechao Bu, Ninghui Sun, Michael Q Zhang, Runsheng Chen.
Abstract
NONCODE (http://www.bioinfo.org/noncode/) is an interactive database that aims to present the most complete collection and annotation of non-coding RNAs, especially long non-coding RNAs (lncRNAs). The recent drop in the cost of RNA sequencing has produced an explosion of newly identified data. Revolutionary third-generation sequencing methods have also contributed to more accurate annotations, and accumulating experimental data provide more comprehensive knowledge of lncRNA functions. In this update, NONCODE has added six new species, bringing the total to 16 species. The number of lncRNAs in NONCODE has increased from 210,831 to 527,336. For human and mouse, the lncRNA numbers are 167,150 and 130,558, respectively. NONCODE 2016 has also introduced three important new features: (i) conservation annotation; (ii) the relationships between lncRNAs and diseases; and (iii) an interface to choose high-quality datasets through predicted scores, literature support and long-read sequencing method support. NONCODE is also accessible through http://www.noncode.org/.
Year: 2015 PMID: 26586799 PMCID: PMC4702886 DOI: 10.1093/nar/gkv1252
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Transcript and gene statistics for NONCODE
| Species | Number of lncRNA transcripts | Number of lncRNA genes |
|---|---|---|
| human | 167,150 | 101,700 |
| mouse | 130,558 | 86,935 |
| cow | 23,599 | 18,189 |
| rat | 29,070 | 25,114 |
| chimpanzee | 18,604 | 13,224 |
| gorilla | 20,785 | 17,140 |
| orangutan | 15,601 | 13,432 |
| rhesus macaque | 9,325 | 6,125 |
| opossum | 21,014 | 14,135 |
| platypus | 11,518 | 9,394 |
| chicken | 13,085 | 9,688 |
| zebrafish | 5,000 | 3,635 |
| fruitfly | 54,818 | 13,890 |
| | 3,269 | 2,746 |
| yeast | 60 | 56 |
| Arabidopsis | 3,853 | 2,477 |
| Total | 527,336 | 337,880 |
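
One way to read the table is as an average number of annotated transcripts per lncRNA gene. The short Python sketch below computes that ratio for a few species, using counts copied from the table. The `counts` dictionary and the `transcripts_per_gene` helper are illustrative names only and are not part of NONCODE or any of its interfaces.

```python
# Transcript and gene counts copied from the table above (a subset of species).
# The structure and names here are illustrative assumptions, not a NONCODE API.
counts = {
    "human": (167_150, 101_700),
    "mouse": (130_558, 86_935),
    "zebrafish": (5_000, 3_635),
    "yeast": (60, 56),
}


def transcripts_per_gene(transcripts: int, genes: int) -> float:
    """Average number of annotated lncRNA transcripts per lncRNA gene."""
    if genes <= 0:
        raise ValueError("gene count must be positive")
    return transcripts / genes


# Compute the ratio for every species in the subset.
ratios = {
    species: transcripts_per_gene(transcripts, genes)
    for species, (transcripts, genes) in counts.items()
}

# Print species from the highest ratio to the lowest.
for species, ratio in sorted(ratios.items(), key=lambda item: item[1], reverse=True):
    print(f"{species:10s} {ratio:.2f}")
```

A higher ratio only means that more transcript isoforms are annotated per gene in this release. It may reflect sequencing depth and annotation effort for a species as much as biology.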
Figure 1. Disease-related data acquisition pipeline.
Figure 2. Conservation annotation for NONHSAG200087.