| Literature DB >> 26590260 |
Chih-Hung Chou1, Nai-Wen Chang2, Sirjana Shrestha3, Sheng-Da Hsu1, Yu-Ling Lin4, Wei-Hsiang Lee5, Chi-Dung Yang6, Hsiao-Chin Hong1, Ting-Yen Wei7, Siang-Jyun Tu3, Tzi-Ren Tsai3, Shu-Yi Ho3, Ting-Yan Jian8, Hsin-Yi Wu8, Pin-Rong Chen8, Nai-Chieh Lin1, Hsin-Tzu Huang9, Tzu-Ling Yang1, Chung-Yuan Pai8, Chun-San Tai10, Wen-Liang Chen11, Chia-Yen Huang12, Chun-Chi Liu13, Shun-Long Weng14, Kuang-Wen Liao15, Wen-Lian Hsu16, Hsien-Da Huang17.
Abstract
MicroRNAs (miRNAs) are small non-coding RNAs of approximately 22 nucleotides, which negatively regulate the gene expression at the post-transcriptional level. This study describes an update of the miRTarBase (http://miRTarBase.mbc.nctu.edu.tw/) that provides information about experimentally validated miRNA-target interactions (MTIs). The latest update of the miRTarBase expanded it to identify systematically Argonaute-miRNA-RNA interactions from 138 crosslinking and immunoprecipitation sequencing (CLIP-seq) data sets that were generated by 21 independent studies. The database contains 4966 articles, 7439 strongly validated MTIs (using reporter assays or western blots) and 348 007 MTIs from CLIP-seq. The number of MTIs in the miRTarBase has increased around 7-fold since the 2014 miRTarBase update. The miRNA and gene expression profiles from The Cancer Genome Atlas (TCGA) are integrated to provide an effective overview of this exponential growth in the miRNA experimental data. These improvements make the miRTarBase one of the more comprehensively annotated, experimentally validated miRNA-target interactions databases and motivate additional miRNA research efforts.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26590260 PMCID: PMC4702890 DOI: 10.1093/nar/gkv1258
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Advances and improvements provided by miRTarBase 6.0
| Features | miRTarBase 4.5 | miRTarBase 6.0 |
|---|---|---|
| Release date | 2013/11/01 | 2015/09/15 |
| Curated articles | 2636 | 4966 |
| miRNAs | 1232 | 3786 |
| Target genes | 17 520 | 22 563 |
| Curated miRNA-target interactions | 51 460 | 366 181 |
| miRNA and target gene expression profiles | GEO | GEO, TCGA (571 data sets, 19 cancer type) |
| Text-mining technique to prescreen literature | Keyword search | NLP |
| Abstract annotation data for download | None | Yes |
| MTIs were validated by experimental technology | Reporter assay, western blot, northern blot, qRT-PCR, microarray, pSILAC, NGS (HITS-CLIP, Degradome-seq and CLASH) | Reporter assay, western blot, northern blot, qRT-PCR, microarray, pSILAC, 3′ LIFE, TRAP, NGS (HITS-CLIP, Degradome-seq, CLASH, PAR-CLIP, iPAR-CLIP). |
| Graphical visualization | miRNA, secondary structure, known and novel miRNA target sites, functional and nonfunctional MTIs, experimental conditions, miRNA-target network, upgrade reminder service, error report system, user feedback service, miRNA-target expression profile from GEO. | miRNA, secondary structure, known and novel miRNA target sites, functional and non-functional MTIs, experimental conditions, miRNA-target network, upgrade reminder service, user feedback service, miRNA-target expression profile from GEO and TCGA, updated error report system, enhanced search system, word cloud of miRNA-disease information, miRNA-Target site viewer using CLIP-seq data. |
Figure 1.Snapshot of major improvements provided by miRTarBase interface. (A) New miRTarBase home page and three new major interfaces, including miRNA disease word cloud (B), miRNA-target site viewer based on CLIP-seq data (C) and miRNA and gene expression profiles obtained using TCGA data set (D).
Number of miRNA-target interactions with different validation methods in miRTarBase 4.5 and miRTarBase 6.0
| Features | miRTarBase 4.5 | miRTarBase 6.0 |
|---|---|---|
| MTIs Supported by strong experimental evidences | ||
| Number of MTIs validated by ‘Reporter assay’ | 4109 | 6694 |
| Number of MTIs validated by ‘Western blot’ | 2405 | 4580 |
| Number of MTIs validated by ‘qPCR’ | 2512 | 4645 |
| Number of MTIs validated by ‘Reporter assay and western blot’ | 1915 | 3854 |
| Number of MTIs validated by ‘Reporter assay or western blot’ | 4563 | 7439 |
| MTIs Supported by limited experimental evidences | ||
| Number of MTIs validated by ‘Microarray’ | 12 547 | 13 587 |
| Number of MTIs validated by ‘NGS’ | 31 907 | 348 007 |
Figure 2.Natural language processing (NLP) techniques for finding MTI articles. (i) Articles collected from PubMed; (ii) preprocessing articles in the following steps; split sentences, tokenization, stemming and stopword removal; (iii) Named Entity Recognition (NER) based on principle-based approach (PBA), Conditional Random Field (CRF) and dictionary-based approach; (iv) following construction and evaluation of model, MTI extraction protocol was developed using PBA; (v) all curated articles were manually evaluated by biological domain experts.