| Literature DB >> 35413815 |
Abstract
BACKGROUND: Topologically associating domains (TADs) are locally highly-interacting genome regions, which also play a critical role in regulating gene expression in the cell. TADs have been first identified while investigating the 3D genome structure over High-throughput Chromosome Conformation Capture (Hi-C) interaction dataset. Substantial degree of efforts have been devoted to develop techniques for inferring TADs from Hi-C interaction dataset. Many TAD-calling methods have been developed which differ in their criteria and assumptions in TAD inference. Correspondingly, TADs inferred via these callers vary in terms of both similarities and biological features they are enriched in. RESULT: We have carried out a systematic comparison of 27 TAD-calling methods over mammals. We use Micro-C, a recent high-resolution variant of Hi-C, to compare TADs at a very high resolution, and classify the methods into 3 categories: feature-based methods, Clustering methods, Graph-partitioning methods. We have evaluated TAD boundaries, gaps between adjacent TADs, and quality of TADs across various criteria. We also found particularly CTCF and Cohesin proteins to be effective in formation of TADs with corner dots. We have also assessed the callers performance on simulated datasets since a gold standard for TADs is missing. TAD sizes and numbers change remarkably between TAD callers and dataset resolutions, indicating that TADs are hierarchically-organized domains, instead of disjoint regions. A core subset of feature-based TAD callers regularly perform the best while inferring reproducible domains, which are also enriched for TAD related biological properties.Entities:
Keywords: Chromatin organization; Hi-C; Micro-C; TAD callers; Topologically associating domains
Mesh:
Substances:
Year: 2022 PMID: 35413815 PMCID: PMC9006547 DOI: 10.1186/s12859-022-04674-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The comparison of the evaluated TAD caller features
| TAD Caller | Runtime complexity | Hierarchical TADs | Gaps Allowed | Parameters | Input Details | Implementation Language |
|---|---|---|---|---|---|---|
| 3DNetMod [ | ✓ | ✓ | 18 | Sparse 3-column matrix | Python | |
| Armatus [ | ✓ | ✓ | 1 | C++ | ||
| Arrowhead [ | ✓ | ✓ | 1 | .hic file | Java | |
| CaTCH [ | NA | ✓ | ✓ | 1 | Sparse 4-column matrix | R |
| Constrained HAC [ | ✗ | ✗ | 1 | R | ||
| CHDF [ | NA | ✗ | ✗ | 1 | C++ | |
| chromoR [ | NA | ✗ | ✗ | 2 | R | |
| ClusterTAD [ | NA | ✗ | ✓ | 1 | MATLAB | |
| deDoc [ | ✓ | ✓ | 0 | Sparse 3-column matrix | Java | |
| DI [ | NA | ✗ | ✓ | 3 | MATLAB, Perl | |
| EAST [ | ✗ | ✓ | 3 | Python | ||
| GMAP [ | NA | ✓ | ✓ | 10 | R, C++ | |
| GRiNCH [ | ✗ | ✓ | 3 | C++ | ||
| HiCExplorer [ | NA | ✗ | ✓ | 4 | h5 file | Python |
| HiCKey [ | ✓ | ✓ | 3 | C++ | ||
| HiCseg [ | ✗ | ✗ | 3 | R, C | ||
| HiTAD [ | NA | ✗ | ✓ | 1 | .cool file | Python |
| IC-Finder [ | NA | ✗ | ✓ | 2 | MATLAB | |
| InsulationScore [ | NA | ✗ | ✗ | 5 | Perl | |
| Matryoshka [ | ✓ | ✓ | 1 | C++ | ||
| MrTADFinder [ | ✗ | ✓ | 1 | Sparse 3-column matrix | Julia | |
| MSTD [ | NA | ✗ | ✓ | 1 | Python | |
| OnTAD [ | ✓ | ✓ | 5 | C++ | ||
| PSYCHIC [ | NA | ✓ | ✗ | 1 | MATLAB, Python,Perl | |
| Spectral [ | NA | ✗ | ✓ | 2 | MATLAB | |
| SpectralTAD [ | ✓ | ✓ | 11 | R | ||
| TADBD [ | NA | ✗ | ✓ | 2 | R | |
| TADbit [ | NA | ✗ | ✗ | 1 | Python, C | |
| TADpole [ | NA | ✓ | ✓ | 3 | R | |
| TADtree [ | ✓ | ✓ | 6 | Python | ||
| TopDom [ | NA | ✗ | ✓ | 1 | R |
The parameters below are as follows: Hi-C/Micro-C interaction matrix size is referred by n. TADtree’s S parameter refers to the maximum size of inferred TADs. OnTAD’s d parameter refers to the maximum size of inferred TADs, whereas m refers to the expected count of possible boundaries. Matryoshka’s t parameter defines the number of resolutions to be inferred, and l refers to interval frequency while clustering the inferred t resolutions. HiCseg’s K parameter defines the maximum number of diagonal TAD partitions. h parameter in Constrained HAC is the bandwidth. k in GRiNCH is the rank of low-dimensional matrices. Lastly, Armatus’ t parameter defines the number of resolutions to be inferred
Fig. 1TAD inference performance via TAD callers across a number of interaction matrix resolutions (5kb, 10kb) using HiCNorm normalized Micro-C interaction dataset over human embryonic stem cell line. A Across multiple resolutions, the number of TADs inferred via callers in all chromosomes. B The mean TAD size inferred by considered TAD callers across different resolutions over all chromosomes. C The comparison of topologically associating domain boundaries inferred by a single caller with the remaining callers for a number of resolutions over all chromosomes in terms of Jaccard Index similarity metric
Fig. 2Identification of TADs with and without corner dot (extrusion loop) via evaluated TAD callers over normalized mouse ESCs Micro-C dataset at a lower 1kb resolution. A The fraction of TADs with a corner-dot by each caller in all chromosomes. B The mean TAD size with and without corner dot, identified by each caller in all chromosomes
Fig. 3TAD inference performance via TAD callers across a number of sequencing depths at 5kb resolution using HiCNorm normalized Micro-C interaction dataset over human embryonic stem cell line. A Across multiple ratio of downsampled sequenced reads, the number of TADs inferred by each caller in all chromosomes. B The mean TAD size inferred by considered TAD callers across different ratio of downsampled sequenced reads over all chromosomes. C The comparison of TAD boundaries inferred by one TAD caller with the others across a number of sequencing depths over all chromosomes in terms of Jaccard Index similarity metric. We do not report CHDF results as CHDF has inferred each bin in the interaction matrix as an independent TAD after downsampling the dataset
Fig. 4Annotated TADs and heatmaps inferred by TAD callers over the first chromosome region 1900–2000 at 5kb resolution over HiCNorm normalized human embryonic stem cell’s first experiment sample
Fig. 5Regulatory elements enrichment and depletion per 1kb near topologically associating domain boundaries with a 1000kb window, identified by different TAD callers at 5kb resolution over HiCNorm normalized human embryonic stem cell’s first experiment sample. A CTCF for enrichment and B H3K9me3 for depletion. The top rows show the topmost performing 5 callers, whereas the bottom rows show the worst perfoming 5 callers
The mean peak near inferred TAD boundaries (2kb) for 10 functional elements over normalized interaction dataset from human embryonic stem cells at 5kb resolution
| H3K9me3 | H3K4me3 | H3K36me3 | CTCF | HK genes | TSS | SMC3 | SINE | RAD21 | Polymerase II | Rank | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Armatus | 0.253 | 0.333 | 0.337 | 0.367 | 0.085 | 0.646 | 0.155 | 4.542 | 0.144 | 0.338 | 16 |
| Arrowhead | 0.333 | 0.401 | 0.419 | 0.458 | 0.108 | 0.715 | 0.211 | 5.366 | 0.194 | 0.418 | 28 |
| CaTCH | 0.230 | 0.371 | 0.399 | 0.101 | 0.650 | 0.141 | 3.701 | 0.126 | 0.372 | 14 | |
| Constrained HAC | 0.451 | 0.515 | 0.376 | 0.111 | 0.782 | 5.199 | 0.503 | 33 | |||
| CHDF | 0.280 | 0.339 | 0.352 | 0.387 | 0.084 | 0.652 | 0.181 | 4.482 | 0.169 | 0.344 | 17 |
| ClusterTAD | 0.321 | 0.471 | 0.505 | 0.362 | 0.118 | 0.832 | 0.203 | 5.148 | 0.187 | 0.490 | 31 |
| deDoc | 0.345 | 0.401 | 0.412 | 0.416 | 0.098 | 0.717 | 0.229 | 5.036 | 0.213 | 0.406 | 27 |
| DI | 0.375 | 0.383 | 0.238 | 0.216 | 0.512 | ||||||
| EAST | 0.258 | 0.362 | 0.388 | 0.452 | 0.094 | 0.694 | 0.163 | 4.618 | 0.153 | 0.309 | 17 |
| GMAP | 0.287 | 0.353 | 0.411 | 0.413 | 0.101 | 0.778 | 0.174 | 5.283 | 0.160 | 0.420 | 23 |
| GRiNCH | 0.291 | 0.357 | 0.415 | 0.414 | 0.100 | 0.775 | 0.176 | 5.285 | 0.158 | 0.423 | 23 |
| HiCExplorer | 0.393 | 0.466 | 0.461 | 0.417 | 0.111 | 0.774 | 0.258 | 5.372 | 0.236 | 0.463 | 33 |
| HiCKey | 0.332 | 0.475 | 0.501 | 0.344 | 0.123 | 0.772 | 0.197 | 5.157 | 0.181 | 0.490 | 31 |
| HiCseg | 0.330 | 0.472 | 0.504 | 0.347 | 0.125 | 0.770 | 0.201 | 5.153 | 0.178 | 0.489 | 31 |
| IC-Finder | 0.343 | 0.460 | 0.458 | 0.430 | 0.112 | 0.790 | 0.218 | 5.359 | 0.201 | 0.446 | 28 |
| InsulationScore | 0.275 | 0.365 | 0.376 | 0.377 | 0.101 | 0.751 | 0.175 | 5.429 | 0.162 | 0.366 | 21 |
| Matryoshka | 0.274 | 0.388 | 0.390 | 0.365 | 0.098 | 0.737 | 0.173 | 4.926 | 0.158 | 0.370 | 20 |
| MrTADFinder | 0.215 | 0.248 | 0.269 | 0.429 | 0.067 | 0.506 | 0.126 | 4.354 | 0.116 | 0.297 | 10 |
| MSTD | 0.289 | 0.341 | 0.358 | 0.381 | 0.089 | 0.656 | 0.181 | 4.959 | 0.159 | 0.396 | 20 |
| OnTAD | 0.377 | 0.485 | 0.479 | 0.420 | 0.123 | 0.826 | 0.250 | 5.622 | 0.231 | 0.455 | 35 |
| Spectral | 0.191 | 0.247 | 0.247 | 0.419 | 0.067 | 0.474 | 0.118 | 4.078 | 0.109 | 0.246 | 9 |
| SpectralTAD | 0.311 | 0.384 | 0.411 | 0.426 | 0.095 | 0.676 | 0.208 | 4.840 | 0.189 | 0.388 | 24 |
| TADBD | 0.294 | 0.375 | 0.387 | 0.456 | 0.101 | 0.724 | 0.182 | 5.503 | 0.172 | 0.352 | 21 |
| TADbit | 0.361 | 0.520 | 0.558 | 0.410 | 0.133 | 0.888 | 0.224 | 5.647 | 0.201 | 34 | |
| TADpole | 0.287 | 0.392 | 0.390 | 0.454 | 0.105 | 0.725 | 0.179 | 5.146 | 0.165 | 0.369 | 21 |
| TADtree | 0.284 | 0.396 | 0.392 | 0.453 | 0.101 | 0.722 | 0.177 | 5.145 | 0.167 | 0.367 | 21 |
| TopDom | 0.382 | 0.467 | 0.458 | 0.387 | 0.110 | 0.785 | 0.252 | 5.316 | 0.234 | 0.448 | 32 |
Bold numbers show the leading result for every column. The final column is sum of each caller’s ranking in terms of the whole set of functional regulatory elements except H3K9me3
The inferred TAD boundaries boundary tagged ratio for 10 functional elements over normalized interaction dataset from human embryonic stem cell line at 5kb resolution
| H3K9me3 | H3K4me3 | H3K36me3 | CTCF | HK genes | TSS | SMC3 | SINE | RAD21 | Polymerase II | Rank | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Armatus | 0.695 | 0.311 | 0.351 | 0.630 | 0.156 | 0.860 | 0.570 | 0.998 | 0.530 | 0.501 | 18 |
| Arrowhead | 0.910 | 0.442 | 0.512 | 0.747 | 0.229 | 0.911 | 0.832 | 0.999 | 0.796 | 0.647 | 32 |
| CaTCH | 0.484 | 0.303 | 0.341 | 0.166 | 0.488 | 0.448 | 0.569 | 0.429 | 0.417 | 9 | |
| Constrained HAC | 0.910 | 0.469 | 0.550 | 0.737 | 0.248 | 0.887 | 0.856 | 0.987 | 0.826 | 0.672 | 34 |
| CHDF | 0.733 | 0.338 | 0.383 | 0.647 | 0.161 | 0.849 | 0.624 | 0.978 | 0.590 | 0.513 | 20 |
| ClusterTAD | 0.779 | 0.471 | 0.533 | 0.689 | 0.243 | 0.807 | 0.712 | 0.981 | 0.691 | 0.679 | 29 |
| deDoc | 0.858 | 0.435 | 0.499 | 0.724 | 0.218 | 0.876 | 0.788 | 0.996 | 0.760 | 0.636 | 32 |
| DI | 0.920 | 0.530 | 0.747 | 0.909 | 0.858 | 0.996 | 0.828 | ||||
| EAST | 0.821 | 0.404 | 0.476 | 0.707 | 0.200 | 0.880 | 0.728 | 0.992 | 0.694 | 0.610 | 27 |
| GMAP | 0.848 | 0.416 | 0.482 | 0.699 | 0.214 | 0.883 | 0.740 | 0.996 | 0.704 | 0.628 | 30 |
| GRiNCH | 0.849 | 0.419 | 0.485 | 0.702 | 0.216 | 0.884 | 0.745 | 0.998 | 0.705 | 0.626 | 30 |
| HiCExplorer | 0.490 | 0.559 | 0.770 | 0.257 | 0.906 | 0.997 | 0.698 | ||||
| HiCKey | 0.492 | 0.561 | 0.768 | 0.259 | 0.908 | 0.997 | 0.699 | ||||
| HiCseg | 0.872 | 0.505 | 0.576 | 0.706 | 0.278 | 0.870 | 0.804 | 0.988 | 0.771 | 0.705 | |
| IC-Finder | 0.890 | 0.476 | 0.535 | 0.739 | 0.239 | 0.902 | 0.819 | 0.999 | 0.789 | 0.669 | 35 |
| InsulationScore | 0.828 | 0.430 | 0.485 | 0.681 | 0.208 | 0.881 | 0.719 | 0.971 | 0.678 | 0.640 | 29 |
| Matryoshka | 0.740 | 0.358 | 0.403 | 0.639 | 0.185 | 0.873 | 0.628 | 0.997 | 0.590 | 0.544 | 22 |
| MrTADFinder | 0.702 | 0.294 | 0.348 | 0.704 | 0.130 | 0.857 | 0.567 | 0.998 | 0.529 | 0.508 | 18 |
| MSTD | 0.853 | 0.380 | 0.450 | 0.689 | 0.194 | 0.872 | 0.766 | 0.991 | 0.726 | 0.609 | 27 |
| OnTAD | 0.926 | 0.507 | 0.577 | 0.758 | 0.263 | 0.910 | 0.867 | 0.998 | 0.841 | 0.712 | |
| Spectral | 0.665 | 0.305 | 0.361 | 0.649 | 0.131 | 0.786 | 0.548 | 0.920 | 0.514 | 0.477 | 15 |
| SpectralTAD | 0.834 | 0.399 | 0.464 | 0.703 | 0.200 | 0.881 | 0.754 | 0.998 | 0.721 | 0.601 | 27 |
| TADBD | 0.864 | 0.436 | 0.499 | 0.712 | 0.216 | 0.901 | 0.770 | 0.737 | 0.630 | 30 | |
| TADbit | 0.912 | 0.596 | 0.763 | 0.276 | 0.844 | 0.998 | 0.814 | 0.731 | |||
| TADpole | 0.807 | 0.421 | 0.471 | 0.717 | 0.205 | 0.885 | 0.704 | 0.998 | 0.673 | 0.604 | 28 |
| TADtree | 0.805 | 0.419 | 0.470 | 0.716 | 0.204 | 0.883 | 0.703 | 0.998 | 0.671 | 0.600 | 28 |
| TopDom | 0.897 | 0.471 | 0.534 | 0.730 | 0.242 | 0.881 | 0.841 | 0.996 | 0.817 | 0.674 | 34 |
Bold numbers show the leading result for every column. The final column is sum of each caller’s ranking in terms of the whole set of functional regulatory elements except H3K9me3
Fold-change () between bilateral regions (500kb) and TAD boundary segments (1kb) for 10 regulatory functional elements over normalized interaction dataset of human embryonic stem cell line at 5kb resolution
| H3K9me3 | H3K4me3 | H3K36me3 | CTCF | HK genes | TSS | SMC3 | SINE | RAD21 | Polymerase II | Rank | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Armatus | 0.318 | 0.307 | 0.285 | − 0.269 | 0.324 | 0.191 | 0.389 | 0.038 | 0.383 | 0.359 | 17 |
| Arrowhead | 0.609 | 0.391 | 0.372 | − 0.356 | 0.473 | 0.220 | 0.695 | 0.646 | 0.557 | 27 | |
| CaTCH | 0.549 | 0.568 | 0.127 | 0.471 | 31 | ||||||
| Constrained HAC | 0.527 | 0.663 | − 0.409 | 0.502 | 0.358 | 0.132 | 0.785 | 33 | |||
| CHDF | 0.481 | 0.363 | 0.352 | − 0.226 | 0.309 | 0.265 | 0.606 | 0.036 | 0.613 | 0.412 | 23 |
| ClusterTAD | 0.515 | 0.452 | 0.495 | − 0.474 | 0.478 | 0.299 | 0.598 | 0.032 | 0.543 | 0.577 | 25 |
| deDoc | 0.608 | 0.314 | 0.293 | − 0.284 | 0.243 | 0.183 | 0.773 | 0.035 | 0.745 | 0.420 | 20 |
| DI | 0.644 | 0.630 | 0.616 | − 0.438 | 0.654 | 0.439 | 0.714 | 0.146 | 0.646 | 0.721 | 33 |
| EAST | 0.292 | 0.109 | 0.233 | − 0.077 | 0.260 | 0.110 | 0.372 | 0.039 | 0.365 | 0.062 | 15 |
| GMAP | 0.432 | 0.291 | 0.395 | − 0.351 | 0.462 | 0.292 | 0.433 | 0.159 | 0.381 | 0.568 | 24 |
| GRiNCH | 0.434 | 0.294 | 0.396 | − 0.352 | 0.464 | 0.295 | 0.436 | 0.160 | 0.382 | 0.569 | 24 |
| HiCExplorer | 0.696 | 0.452 | 0.372 | − 0.365 | 0.332 | 0.226 | 0.852 | 0.082 | 0.804 | 0.533 | 27 |
| HiCKey | 0.697 | 0.454 | 0.373 | − 0.369 | 0.335 | 0.228 | 0.850 | 0.081 | 0.802 | 0.531 | 27 |
| HiCseg | 0.669 | 0.670 | 0.699 | − 0.405 | 0.753 | 0.417 | 0.689 | 0.175 | 0.595 | 0.810 | |
| IC-Finder | 0.532 | 0.505 | 0.424 | − 0.286 | 0.408 | 0.307 | 0.635 | 0.096 | 0.597 | 0.521 | 26 |
| InsulationScore | 0.274 | 0.254 | 0.201 | − 0.329 | 0.394 | 0.281 | 0.339 | 0.178 | 0.282 | 0.328 | 20 |
| Matryoshka | 0.365 | 0.384 | 0.347 | − 0.295 | 0.394 | 0.276 | 0.460 | 0.075 | 0.430 | 0.410 | 21 |
| MrTADFinder | 0.123 | − 0.021 | 0.004 | − 0.136 | 0.047 | − 0.042 | 0.121 | 0.004 | 0.098 | 0.162 | 9 |
| MSTD | 0.491 | 0.195 | 0.197 | − 0.489 | 0.243 | 0.115 | 0.568 | 0.111 | 0.463 | 0.499 | 22 |
| OnTAD | 0.603 | 0.428 | 0.374 | − 0.343 | 0.418 | 0.273 | 0.717 | 0.117 | 0.689 | 0.459 | 26 |
| Spectral | 0.041 | 0.069 | − 0.064 | 0.010 | 0.108 | − 0.045 | 0.052 | − 0.004 | 0.045 | 0.046 | 9 |
| SpectralTAD | 0.544 | 0.425 | 0.410 | − 0.230 | 0.361 | 0.221 | 0.669 | 0.057 | 0.634 | 0.491 | 24 |
| TADBD | 0.213 | 0.151 | 0.139 | − 0.124 | 0.191 | 0.122 | 0.248 | 0.117 | 0.246 | 0.163 | 13 |
| TADbit | 0.598 | 0.591 | 0.642 | − 0.377 | 0.632 | 0.424 | 0.657 | 0.165 | 0.572 | 0.747 | 33 |
| TADpole | 0.325 | 0.327 | 0.254 | − 0.121 | 0.298 | 0.199 | 0.387 | 0.079 | 0.380 | 0.295 | 18 |
| TADtree | 0.323 | 0.325 | 0.256 | − 0.124 | 0.297 | 0.196 | 0.385 | 0.077 | 0.381 | 0.293 | 18 |
| TopDom | 0.673 | 0.445 | 0.353 | − 0.382 | 0.311 | 0.237 | 0.830 | 0.062 | 0.793 | 0.505 | 27 |
The final column is sum of each caller’s ranking in terms of the whole set of functional regulatory elements except H3K9me3. Bold numbers show the leading result for every column
Analysis for the enrichment of four regulatory elements (from the left: CTCF, Housekeeping genes, RAD21, TSSs) using the KR-normalized Micro-C dataset () from mouse ESCs at the 1kb resolution
| TAD Callers | Functional regulatory elements | ||||
|---|---|---|---|---|---|
| CTCF | HK genes | RAD21 | TSS | Rank | |
| Armatus | 0.334/0.802/0.615/1/0.145/0.772 | 0.138/0.261/0.54/1/0.19/0.27 | 0.267/0.756/0.622/1/0.161/0.742 | 0.517/0.704/0.543/1/0.089/0.836 | 14/13/10/ |
| CaTCH | 0.261/0.57/0.818/1/0.033/0.684 | 0.142/0.237/ | 0.199/0.549/0.681/1/0.039/0.676 | 0.472/0.508/ | 10/5/14/16/4/13 |
| Constrained HAC | 0.375/0.9/ | 0.136/0.282/0.937/1/0.089/0.469 | 0.299/0.875/ | 0.518/0.693/0.864/1/0.017/0.634 | 15/14/ |
| ClusterTAD | 0.351/0.809/0.773/1/0.024/0.473 | 0.167/0.31/0.954/1/0.083/0.419 | 0.274/0.775/0.681/1/0.028/0.469 | 0.587/0.729/0.781/1/0.014/0.511 | |
| DI | 0.375/0.917/0.889/1/0.066/0.807 | 0.175/0.329/1.014/1/0.16/0.481 | 0.292/0.885/0.798/1/0.077/0.795 | ||
| HiCExplorer | 0.377/ | 0.133/0.271/0.646/1/0.139/0.467 | 0.311/ | 0.525/0.708/0.693/1/0.039/0.881 | 15/15/13/ |
| HiCseg | 0.34/0.889/0.838/1/0.045/0.775 | 0.256/0.85/0.726/1/0.053/0.762 | 0.598/ | 15/ | |
| IC-Finder | 0.338/0.885/0.747/1/0.097/0.869 | 0.128/0.255/0.598/1/0.152/0.399 | 0.275/0.853/0.737/1/0.111/0.85 | 0.491/0.693/0.626/1/0.052/0.898 | 14/14/11/ |
| Matryoshka | 0.331/0.857/0.591/1/0.16/0.748 | 0.14/0.276/0.506/1/ | 0.266/0.819/0.571/1/0.18/0.72 | 0.509/0.728/0.509/1/0.095/0.804 | 13/15/10/ |
| MrTADFinder | 0.183/0.592/0.179/1/ | 0.081/0.135/0.237/0.286/0.097/0.114 | 0.143/0.532/0.134/0.857/0.23/0.428 | 0.325/0.492/0.26/0.857/ | 4/4/4/12/13/4 |
| OnTAD | 0.36/0.925/0.758/1/0.07/0.875 | 0.146/0.294/0.643/1/0.146/0.496 | 0.291/0.894/0.732/1/0.082/ | 0.487/0.734/0.539/1/0.037/0.897 | 14/15/10/ |
| SpectralTAD | 0.294/0.804/0.743/1/0.093/ | 0.111/0.213/0.587/1/0.131/0.384 | 0.237/0.767/0.714/1/0.106/0.856 | 0.422/0.615/0.608/1/0.049/ | 10/10/11/ |
| TADbit | 0.369/0.902/0.872/1/0.062/0.825 | 0.17/0.314/0.951/1/0.149/0.495 | 0.287/0.869/0.788/1/0.073/0.812 | 0.587/0.758/0.859/1/0.035/0.848 | |
| TADtree | 0.28/0.842/0.412/1/0.22/0.72 | 0.117/0.242/0.348/1/0.196/0.189 | 0.227/0.802/0.4/1/ | 0.416/0.681/0.316/1/0.128/0.795 | 10/13/7/ |
| TopDom | 0.141/0.28/0.605/1/0.163/0.417 | 0.541/0.73/0.683/1/0.051/0.901 | 15/15/13/ | ||
For each regulatory element, values from left to right represent (1) Mean peak near inferred TAD boundaries, (2) Inferred TAD boundaries boundary tagged ratio, (3) Fold change () between bilateral regions and TAD boundary regions, (4) p value () of mean peak near inferred TAD boundaries in bilateral regions, (5) JI between functional regulatory elements and TAD boundaries using the number of TAD boundaries as numerator, and (6) JI between functional regulatory elements and TAD boundaries using number of regulatory elements as numerator, respectively. The final column is sum of each caller’s ranking in terms of the whole set of functional regulatory elements. Bold numbers show the leading result for every column
The mean peak near inferred TAD boundaries (2kb) for six functional elements for both with and without corner dot using the normalized Micro-C dataset over mouse ESCs at a lower 1kb resolution
| H3K9me3 | H3K4me3 | H3K36me3 | RAD21 | SMC3 | CTCF | H3K9me3 | H3K4me3 | H3K36me3 | RAD21 | SMC3 | CTCF | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| With corner dot (extrusion loop) | Without corner dot | |||||||||||
| Armatus | 0.451 | 0.342 | 0.353 | 0.531 | 0.535 | 0.565 | 0.221 | 0.112 | 0.123 | 0.301 | 0.305 | 0.335 |
| Arrowhead | 0.531 | 0.392 | 0.409 | 0.599 | 0.617 | 0.301 | 0.162 | 0.179 | 0.369 | 0.387 | 0.426 | |
| CaTCH | 0.428 | 0.324 | 0.339 | 0.569 | 0.597 | 0.436 | 0.198 | 0.094 | 0.109 | 0.339 | 0.367 | |
| Constrained HAC | 0.595 | 0.468 | 0.649 | 0.712 | 0.574 | 0.365 | 0.419 | 0.482 | 0.344 | |||
| CHDF | 0.367 | 0.379 | 0.537 | 0.551 | 0.585 | 0.248 | 0.137 | 0.149 | 0.307 | 0.321 | 0.355 | |
| ClusterTAD | 0.519 | 0.385 | 0.401 | 0.669 | 0.703 | 0.561 | 0.289 | 0.155 | 0.171 | 0.439 | 0.473 | 0.331 |
| deDoc | 0.543 | 0.411 | 0.427 | 0.599 | 0.612 | 0.614 | 0.313 | 0.181 | 0.197 | 0.369 | 0.382 | 0.384 |
| DI | 0.573 | 0.414 | 0.436 | 0.581 | 0.343 | 0.184 | 0.206 | 0.536 | 0.351 | |||
| EAST | 0.456 | 0.351 | 0.361 | 0.562 | 0.586 | 0.652 | 0.226 | 0.121 | 0.131 | 0.332 | 0.356 | 0.422 |
| GMAP | 0.485 | 0.358 | 0.372 | 0.551 | 0.609 | 0.611 | 0.255 | 0.128 | 0.142 | 0.321 | 0.379 | 0.381 |
| GRiNCH | 0.495 | 0.362 | 0.375 | 0.557 | 0.614 | 0.615 | 0.252 | 0.129 | 0.144 | 0.324 | 0.381 | 0.382 |
| HiCExplorer | 0.571 | 0.434 | 0.664 | 0.659 | 0.615 | 0.361 | 0.204 | 0.236 | 0.434 | 0.429 | 0.385 | |
| HiCKey | 0.531 | 0.377 | 0.399 | 0.675 | 0.705 | 0.548 | 0.301 | 0.147 | 0.171 | 0.445 | 0.475 | 0.317 |
| HiCseg | 0.528 | 0.376 | 0.398 | 0.672 | 0.702 | 0.545 | 0.298 | 0.146 | 0.168 | 0.442 | 0.472 | 0.315 |
| IC-Finder | 0.539 | 0.398 | 0.416 | 0.658 | 0.656 | 0.628 | 0.309 | 0.168 | 0.186 | 0.428 | 0.426 | 0.398 |
| InsulationScore | 0.473 | 0.363 | 0.374 | 0.564 | 0.574 | 0.575 | 0.243 | 0.133 | 0.144 | 0.334 | 0.344 | 0.345 |
| Matryoshka | 0.472 | 0.356 | 0.371 | 0.586 | 0.588 | 0.563 | 0.242 | 0.126 | 0.141 | 0.356 | 0.358 | 0.333 |
| MrTADFinder | 0.413 | 0.314 | 0.324 | 0.446 | 0.467 | 0.627 | 0.183 | 0.084 | 0.095 | 0.236 | 0.237 | 0.397 |
| MSTD | 0.487 | 0.357 | 0.378 | 0.539 | 0.556 | 0.578 | 0.257 | 0.127 | 0.158 | 0.309 | 0.336 | 0.348 |
| OnTAD | 0.575 | 0.429 | 0.448 | 0.683 | 0.677 | 0.618 | 0.345 | 0.199 | 0.218 | 0.453 | 0.447 | 0.388 |
| Spectral | 0.389 | 0.307 | 0.316 | 0.445 | 0.447 | 0.617 | 0.159 | 0.077 | 0.086 | 0.215 | 0.217 | 0.387 |
| SpectralTAD | 0.509 | 0.387 | 0.406 | 0.582 | 0.609 | 0.624 | 0.279 | 0.157 | 0.176 | 0.352 | 0.379 | 0.394 |
| TADBD | 0.492 | 0.372 | 0.384 | 0.573 | 0.585 | 0.654 | 0.262 | 0.142 | 0.154 | 0.344 | 0.355 | 0.424 |
| TADbit | 0.559 | 0.398 | 0.422 | 0.718 | 0.755 | 0.608 | 0.329 | 0.168 | 0.192 | 0.488 | 0.378 | |
| TADpole | 0.483 | 0.367 | 0.376 | 0.596 | 0.594 | 0.653 | 0.351 | 0.137 | 0.144 | 0.365 | 0.360 | 0.419 |
| TADtree | 0.482 | 0.365 | 0.375 | 0.594 | 0.592 | 0.651 | 0.135 | 0.145 | 0.364 | 0.362 | 0.421 | |
| TopDom | 0.485 | 0.432 | 0.457 | 0.665 | 0.656 | 0.585 | 0.255 | 0.202 | 0.227 | 0.435 | 0.426 | 0.355 |
Bold numbers show the leading result for every column
Fig. 6The ratio of Hi-C/Micro-C signal variance over the inferrred TADs as evaluated via TAD among a loci pair across Mb distance. Multiple TAD callers are evaluated on Micro-C interaction dataset normalized by HiCNorm at 5kb resolution over embryonic stem cell line’s first sample. The top row shows the topmost performing 5 callers, whereas the bottom row shows the worst perfoming 5 callers
The ratio of signal variance over the inferrred TADs as evaluated via TAD among a loci pair across M distance
| With corner dot (extrusion loop) | Without corner dot | |||||
|---|---|---|---|---|---|---|
| Average | Max | Min | Average | Max | Min | |
| Armatus | 0.641 | 0.709 | 0.524 | 0.653 | 0.721 | 0.535 |
| Arrowhead | 0.654 | 0.740 | 0.531 | 0.666 | 0.751 | 0.533 |
| CaTCH | 0.699 | 0.759 | 0.571 | 0.712 | 0.773 | 0.585 |
| Constrained HAC | 0.698 | 0.768 | 0.568 | 0.711 | 0.779 | 0.579 |
| CHDF | 0.654 | 0.726 | 0.544 | 0.667 | 0.739 | 0.556 |
| ClusterTAD | 0.674 | 0.741 | 0.555 | 0.687 | 0.755 | 0.567 |
| deDoc | 0.695 | 0.761 | 0.573 | 0.707 | 0.774 | 0.585 |
| DI | 0.690 | 0.754 | 0.568 | 0.701 | 0.765 | 0.582 |
| EAST | 0.714 | 0.583 | 0.725 | 0.595 | ||
| GMAP | 0.645 | 0.712 | 0.527 | 0.657 | 0.725 | 0.540 |
| GRiNCH | 0.648 | 0.715 | 0.531 | 0.659 | 0.728 | 0.545 |
| HiCExplorer | 0.697 | 0.766 | 0.573 | 0.709 | 0.777 | 0.586 |
| HiCKey | 0.658 | 0.735 | 0.549 | 0.667 | 0.745 | 0.559 |
| HiCseg | 0.655 | 0.733 | 0.544 | 0.665 | 0.744 | 0.556 |
| IC-Finder | 0.685 | 0.752 | 0.564 | 0.696 | 0.761 | 0.575 |
| InsulationScore | 0.665 | 0.729 | 0.549 | 0.678 | 0.741 | 0.561 |
| Matryoshka | 0.692 | 0.761 | 0.565 | 0.703 | 0.774 | 0.578 |
| MrTADFinder | 0.631 | 0.685 | 0.535 | 0.642 | 0.695 | 0.545 |
| MSTD | 0.671 | 0.736 | 0.553 | 0.683 | 0.748 | 0.566 |
| OnTAD | 0.799 | 0.745 | 0.812 | |||
| Spectral | 0.638 | 0.711 | 0.519 | 0.651 | 0.722 | 0.531 |
| SpectralTAD | 0.645 | 0.708 | 0.533 | 0.656 | 0.721 | 0.545 |
| TADBD | 0.686 | 0.753 | 0.565 | 0.696 | 0.763 | 0.576 |
| TADbit | 0.724 | 0.793 | 0.594 | 0.736 | 0.805 | 0.605 |
| TADpole | 0.670 | 0.741 | 0.559 | 0.685 | 0.751 | 0.571 |
| TADtree | 0.672 | 0.738 | 0.558 | 0.682 | 0.748 | 0.569 |
| TopDom | 0.654 | 0.717 | 0.541 | 0.667 | 0.729 | 0.553 |
Multiple TAD callers are evaluated on the interaction dataset normalized by HiCNorm at 5kb resolution over human embryonic stem cell line. Bold numbers show the leading result for every column
Fig. 7The evaluation of TAD boundaries statistically among the considered TAD callers between replicates over normalized human embryonic stem cell line Micro-C dataset at 5kb resolution, having the same sequencing depths. A TAD boundaries reproducibility between technical and biological replicates, as evaluated via Jaccard Index. B TAD count distribution relative to various reproducibility levels for individual TAD callers (Levels of reproducibility are between 8 and 1 from top to bottom). C The mean TAD interaction frequency across various TAD reproducibility levels between samples. D TAD sizes across various TAD reproducibility levels between samples
Fig. 8Different TAD callers performance using simulated dataset with various noise levels. A The total number (top), TPR (middle), and FDR (bottom) of TADs from non-nested TADs simulated interaction dataset (dashed lines indicate labelled TADs of the simulated data). B The total number (top), TPR (middle), and FDR (bottom) of TADs from the nested TADs simulated data
TAD boundaries’ True Positive Rate and False Positive Rate called from the in silico dataset
| True Positive Rate (TPR) | False Positive Rate (FPR) | |||||
|---|---|---|---|---|---|---|
| Average | Max | Min | Average | Max | Min | |
| Armatus | 0.708 | 0.917 | 0.462 | 0.192 | 0.438 | 0.041 |
| Arrowhead | 0.218 | 0.35 | 0.141 | 0.586 | 0.703 | 0.444 |
| CaTCH | 0.194 | 0.471 | 0.044 | 0.052 | 0.375 | |
| CHAC | 0.355 | 0.5 | 0.233 | 0.047 | 0.179 | |
| CHDF | 0.81 | 0.381 | 0.517 | 0.237 | ||
| ClusterTAD | 0.354 | 0.453 | 0.244 | 0.153 | 0.375 | |
| deDoc | 0.497 | 0.632 | 0.372 | 0.28 | 0.46 | 0.068 |
| DI | 0.113 | 0.198 | 0.041 | 0.864 | 0.95 | 0.754 |
| EAST | 0.26 | 0.343 | 0.014 | 0.539 | 0.636 | 0.417 |
| GMAP | 0.255 | 0.412 | 0.136 | 0.425 | 0.6 | 0.182 |
| GRiNCH | 0.341 | 0.504 | 0.217 | 0.327 | 0.562 | 0.113 |
| HiCExplorer | 0.877 | 0.949 | 0.709 | 0.002 | 0.036 | |
| HiCKey | 0.316 | 0.429 | 0.215 | 0.018 | 0.089 | 0.012 |
| HiCseg | 0.275 | 0.406 | 0.181 | 0.006 | 0.071 | |
| IC-Finder | 0.946 | 0.077 | 0.158 | 0.025 | ||
| InsulationScore | 0.09 | 0.203 | 0.024 | 0.636 | 0.875 | 0.333 |
| Matryoshka | 0.706 | 0.875 | 0.5 | 0.332 | 0.606 | 0.127 |
| MrTADFinder | 0.752 | 0.895 | 0.557 | 0.337 | 0.594 | 0.107 |
| MSTD | 0.264 | 0.383 | 0.174 | 0.053 | 0.211 | |
| OnTAD | 0.429 | 0.487 | 0.367 | 0.014 | 0.094 | |
| Spectral | 0.182 | 0.425 | 0.058 | 0.785 | 0.887 | 0.571 |
| SpectralTAD | 0.869 | 0.956 | 0.684 | 0.213 | 0.367 | 0.089 |
| TADBD | 0.482 | 0.5 | 0.38 | |||
| TADbit | 0.452 | 0.571 | 0.304 | 0.478 | 0.653 | 0.313 |
| TADpole | 0.431 | 0.522 | 0.345 | 0.216 | 0.521 | 0.128 |
| TADtree | 0.961 | 0.785 | 0.001 | 0.016 | ||
| TopDom | 0.708 | 0.917 | 0.462 | 0.192 | 0.438 | 0.041 |
Bold numbers show the leading result for every column
Running time of different TAD callers at 5kb resolution for chromosome 6 on human embryonic stem cell line, tested over a 2.9 GHz 8-Core Intel i9 cpu having 128 GB memory
| Methods | Key Parameters | Running time (s) |
|---|---|---|
| Armatus | g = 0.5 | 90.76 |
| Arrowhead | Normalization = KR | 92.93 |
| CaTCH | Defaults | 44.19 |
| Constrained HAC | h = 342 | 26.48 |
| CHDF | length = 3422; number = 3422; size = 342 | 952.38 |
| ClusterTAD | Max_TADsize = 800000 | 7608.69 |
| deDoc | Defaults | 264.85 |
| DI | window = 2500000; min = 2; prob = 0.99 | 294.77 |
| EAST | Defaults | 11.28 |
| GMAP | Defaults | 62.46 |
| GRiNCH | Defaults | 22.57 |
| HiCExplorer | minDepth = 3*50000; maxDepth = 6*50000; step = 50000 | 22.94 |
| HiCKey | Defaults | 42.13 |
| HiCseg | change_max = 342; distrib = ‘G’; | 146.32 |
| IC-Finder | Defaults | 64.49 |
| InsulationScore | is = 2500000; ids = 1000000; im = mean; bmoe = 3; nt = 0.1 | 474.39 |
| Matryoshka | g = 0.5 | 123.95 |
| MrTADFinder | res = 1.5 | 625.73 |
| MSTD | MDHD = 10; window = 10 | 16.48 |
| OnTAD | Defaults | 4.78 |
| Spectral | Threshold = 0.8; region size = 1 | 61.69 |
| SpectralTAD | levels = 2 | 32.15 |
| TADBD | Defaults | 19.19 |
| TADbit | Defaults | 20910.81 |
| TADpole | Defaults | 211.63 |
| TADtree | S = 50; q = 12; p = 3; M = 10; N = 1025; gamma = 500 | 32732.71 |
| TopDom | w = 5 | 28.43 |