| Literature DB >> 34383882 |
Alberto Zenere1, Olof Rundquist2, Mika Gustafsson2, Claudio Altafini1.
Abstract
MOTIVATION: The simultaneous availability of ATAC-seq and RNA-seq experiments allows to obtain a more in-depth knowledge on the regulatory mechanisms occurring in gene regulatory networks (GRNs). In this paper, we highlight and analyze two novel aspects that leverage on the possibility of pairing RNA-seq and ATAC-seq data. Namely we investigate the causality of the relationships between transcription factors (TFs), chromatin and target genes and the internal consistency between the two omics, here measured in terms of structural balance in the sample correlations along elementary length-3 cycles.Entities:
Year: 2021 PMID: 34383882 PMCID: PMC8696094 DOI: 10.1093/bioinformatics/btab577
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Workflow of this article. (a) Schematic depiction of two key events that lead to gene expression: (1) the chromatin region around the promoter is loose and accessible to TFs binding, and (2) available TFs bind to specific DNA sequences in the promoter region of the gene. (b) Possible regulatory motifs. Which event precedes the other is still under investigation, thus several causal three-node regulatory motifs can be associated to represent the regulatory interactions between TFs, chromatin regions and target genes. (c) Balanced and unbalanced cycles corresponding to the undirected graph of (i.e. chromatin → TF → gene). Plus and minus signs denote positive and negative correlation values between the corresponding nodes
List of paired RNA-seq and ATAC-seq datasets used in this study
| Index | Cell type | Availability and reference |
|---|---|---|
| A | Human Th1 | E-MTAB-7775, E-MTAB-10444, ( |
| B | Human Th1 | E-MTAB-10423, E-MTAB-10444 |
| C | Human DC | GSE125817 ( |
| D | Human DC | GSE125918 ( |
Overview of the datasets. (Upper) We report the total number of regulatory motifs (and the percentage of balanced ones) present in the TF-peak-target map. (Middle) Next, we test if each regulatory motif is characterized by a statistically large balance ratio (see Section 3.2 for details on how the statistical test was built); fold change indicates the ratio between the value observed in the data and the mean of the null distribution. (Lower) Lastly, we report the number of regulatory motifs that pass the conditional independence test described in Supplementary Materials and Methods and how many of them belong to an unbalanced cycle.
| Number of regulatory motifs in the data (of which balanced) | ||||
|---|---|---|---|---|
| Dataset | Chains |
|
| |
| A | 408088 (71%) | 9134221 (72%) | 7736 (77%) | |
| B | 367308 (67%) | 8227783 (67%) | 7100 (72%) | |
| C | 309675 (63%) | 10686804 (65%) | 3456 (65%) | |
| D | 255324 (70%) | 9004018 (68%) | 3419 (74%) | |
|
| ||||
| Enrichment of balanced regulatory motifs: | ||||
|
| ||||
| Chains |
|
| ||
|
| ||||
| A |
|
|
| |
| B |
|
|
| |
| C | not significant | not significant |
| |
| D |
|
|
| |
|
| ||||
| Number of selected regulatory motifs (of which unbalanced) | ||||
|
| ||||
|
|
|
|
| |
|
| ||||
| A | 21138 (4) | 19272 (3) | 419330 (32) | 298 (0) |
| B | 26573 (13) | 12627 (12) | 290724 (191) | 184 (0) |
| C | 37856 (440) | 23427 (422) | 808439 (13855) | 187 (2) |
| D | 38435 (324) | 15154 (309) | 519882 (11838) | 202 (3) |
Note: Since and correspond to the same undirect graph we use the more general term ‘Chains’ to denote (A, T, G) triplets.
Contingency table between the number of selected T → A → G and A → T → G regulatory motifs in dataset A
| T →A →G | |||
|---|---|---|---|
| Selected | Non-selected | ||
| A →T →G | Selected | 302 | 20 840 |
| Non-selected | 18 973 | 367 973 | |
Note: See Supplementary Results for the contingency tables of datasets B, C and D.
Fig. 2.For each dataset, we gather all the regulatory motifs in Figure 1, then for each regulatory motif we calculate minimum, geometric mean and maximum of its three correlations. Blue denotes the distributions obtained in the balanced cycles, red the unbalanced. In the table below we summarize the mean of each scalar measure, computed separately in the balanced and unbalanced case
Fig. 3.(a) regulatory motif and corresponding distribution of , divided in balanced (blue) and unbalanced (red) cycles. The balanced and unbalanced distributions are normalized with respect to their total count independently. (b) Similar analysis for the regulatory motif and the corresponding distribution of
To test if there exists a relationship between which regulatory motifs are balanced (resp. selected) in dataset A and B we performed a hypergeometric test that compares the ratio of balanced (resp. selected) regulatory motifs in dataset A with the same quantity but when we restrict only to regulatory motifs that are also balanced (resp. selected) in dataset B
| Relationship between ‘balanced in A’ and ‘balanced in B’ ( | Relationship between ‘selected in A’ and ‘selected in B’ ( | |
|---|---|---|
|
|
| not significant |
|
|
|
|
|
|
|
|
|
|
| 0.04, 1.64 |
Note: FC indicates the fold change of the latter with respect to the former quantity.