| Literature DB >> 27924033 |
Ke-Ren Zhou1, Shun Liu1, Wen-Ju Sun1, Ling-Ling Zheng1, Hui Zhou1, Jian-Hua Yang2, Liang-Hu Qu3.
Abstract
The abnormal transcriptional regulation of non-coding RNAs (ncRNAs) and protein-coding genes (PCGs) is contributed to various biological processes and linked with human diseases, but the underlying mechanisms remain elusive. In this study, we developed ChIPBase v2.0 (http://rna.sysu.edu.cn/chipbase/) to explore the transcriptional regulatory networks of ncRNAs and PCGs. ChIPBase v2.0 has been expanded with ∼10 200 curated ChIP-seq datasets, which represent about 20 times expansion when comparing to the previous released version. We identified thousands of binding motif matrices and their binding sites from ChIP-seq data of DNA-binding proteins and predicted millions of transcriptional regulatory relationships between transcription factors (TFs) and genes. We constructed 'Regulator' module to predict hundreds of TFs and histone modifications that were involved in or affected transcription of ncRNAs and PCGs. Moreover, we built a web-based tool, Co-Expression, to explore the co-expression patterns between DNA-binding proteins and various types of genes by integrating the gene expression profiles of ∼10 000 tumor samples and ∼9100 normal tissues and cell lines. ChIPBase also provides a ChIP-Function tool and a genome browser to predict functions of diverse genes and visualize various ChIP-seq data. This study will greatly expand our understanding of the transcriptional regulations of ncRNAs and PCGs.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27924033 PMCID: PMC5210649 DOI: 10.1093/nar/gkw965
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.System overview of ChIPBase v2.0 core framework. All results generated by ChIPBase v2.0 are deposited in MySQL relational databases and displayed in the visual browser and web page.
The library statistics of ChIP-seq datasets in ChIPBase v2.0
| Species | Total library | TF library | TCF library | CRF library | Other library | Histone library |
|---|---|---|---|---|---|---|
| human | 5803 | 2498 | 433 | 192 | 214 | 2466 |
| mouse | 2500 | 1036 | 209 | 72 | 89 | 1094 |
| worm | 852 | 428 | 67 | 18 | 310 | 15 |
| fruitfly | 838 | 186 | 82 | 54 | 183 | 347 |
| 54 | 51 | / | / | / | 3 | |
| yeast | 52 | 52 | / | / | / | / |
| rat | 44 | 15 | 2 | 5 | 1 | 21 |
| zebrafish | 32 | 10 | / | / | / | 22 |
| 30 | 14 | / | / | / | 16 | |
| chicken | 11 | 10 | / | / | / | 1 |
Library statistics indicating the numbers of sample library (ChIP-seq, ChIP-exo and MNChIP-seq), including TFs, TCFs, CRFs, other DNA-binding protein (other) and histone modifications (histone) in 10 species.
The statistics of trans-acting factors in ChIPBase v2.0
| Species | Total | TF | TCF | CRF | Other | Histone |
|---|---|---|---|---|---|---|
| human | 711 | 480 | 51 | 43 | 89 | 48 |
| mouse | 302 | 189 | 22 | 18 | 42 | 31 |
| worm | 151 | 68 | 6 | 6 | 67 | 4 |
| fruitfly | 162 | 60 | 6 | 8 | 56 | 32 |
| 29 | 26 | / | / | / | 3 | |
| yeast | 15 | 15 | / | / | / | / |
| rat | 20 | 7 | 1 | 3 | 1 | 8 |
| zebrafish | 11 | 7 | / | / | / | 4 |
| 8 | 4 | / | / | / | 4 | |
| chicken | 5 | 4 | / | / | / | 1 |
This statistics indicating the numbers of TFs, TCFs, CRFs, other DNA-binding protein (other) and histone modifications (histone) in 10 species.
The statistics of RNA-seq and miRNA-seq expression data used in ChIPBase v2.0
| Species | Project name | Diseases or studies | Samples |
|---|---|---|---|
| human | TCGA Pan-Cancer (PANCAN, RNA-seq) | 32 | 10 359 |
| human | TCGA Pan-Cancer (PANCAN, miRNA-seq) | 32 | 9966 |
| human | Genotype-Tissue Expression (GTEx) project | 31 | 7834 |
| human | RNA-seq from the CCLE | 20 | 780 |
| human | RIKEN FANTOM5 project (human) | 2 | 76 |
| human | 32 different tissues of human | 1 | 32 |
| mouse | RIKEN FANTOM5 project (mouse tissue) | 10 | 156 |
| mouse | RIKEN FANTOM5 project (mouse cell lines) | 1 | 35 |
| mouse | RNA-seq of mouse DBA/2J x C57BL/6J tissues | 1 | 6 |
| mouse | Individual-Th single cell RNA-Seq | 1 | 91 |
| rat | Strand-specific RNA-seq of nine rat tissues | 3 | 27 |
| worm | Developmental Stages, modENCODE | 1 | 17 |
| chicken | Strand-specific RNA-seq of nine chicken tissues | 1 | 9 |
| chicken | RNA-seq of poly-A enriched total RNA of tissue samples from chicken | 1 | 5 |
| RNA-seq of poly-A enriched total RNA of tissue samples from frog | 1 | 5 | |
| RNA-seq during Arabidopsis meristem development from day 7 to 16 after germination | 1 | 10 | |
| RNA-seq of coding RNA of Arabidopsis seedlings from 19 natural accessions | 1 | 19 | |
| Transcriptomes for hybrids (F1s) between 18 | 1 | 9 |
This table contained the data sources and sample numbers of RNA-seq and miRNA-seq across seven species.
Figure 2.The co-expression patterns of HIF1A-miR194-2/192 and TGFB1-miR194-2/192 in stomach adenocarcinoma and esophageal carcinoma respectively. The dots in blue represent tumor samples, while the ones in purple represent normal samples. (A) The co-expression pattern of HIF1A-miR194-2 in 447 samples. (B) The co-expression pattern of HIF1A-miR192 in 447 samples. (C) The co-expression pattern of TGFB1-miR194-2 in 194 samples. (D) The co-expression pattern of TGFB1-miR192 in 194 samples.
Figure 3.The binding patterns of YY1, CTCF, PORL2A, H3K4me3 and H3K4me1 across FIRRE gene in ChIPBase genome browser. The experiment density tracks of YY1, CTCF, PORL2A, H3K4me3 and H3K4me1 have the similar peak distribution across the FIRRE gene.