| Literature DB >> 23282014 |
Christina Schweikert1, Stuart Brown, Zuojian Tang, Phillip R Smith, D Frank Hsu.
Abstract
BACKGROUND: Due to the recent rapid development in ChIP-seq technologies, which uses high-throughput next-generation DNA sequencing to identify the targets of Chromatin Immunoprecipitation, there is an increasing amount of sequencing data being generated that provides us with greater opportunity to analyze genome-wide protein-DNA interactions. In particular, we are interested in evaluating and enhancing computational and statistical techniques for locating protein binding sites. Many peak detection systems have been developed; in this study, we utilize the following six: CisGenome, MACS, PeakSeq, QuEST, SISSRs, and TRLocator.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23282014 PMCID: PMC3535708 DOI: 10.1186/1471-2164-13-S8-S12
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1ChIP-seq experiment and analytic workflow.
Figure 2Regions of individual methods upstream of the ARRDC4 gene.
Figure 3Example of the intersection and union between PeakSeq and QuEST regions.
Figure 4ChIP-seq tags from an immunoprecipitation with antibody for H3K4 and an IGG control. The graph shows the total number of tag start positions mapped to each basepair within 1000 bp flanking all annotated RefSeq TSS. Tags mapped to the forward strand are shown in blue and the reverse strand in green. The graph also shows a very clear nucleosome depleted region located exactly at the TSS.
Average precision for single methods.
| Method | Average precision; rank | Number |
|---|---|---|
| F = SISSRs | 0.8212; 6 | 20715 |
| E = CisGenome | 0.8277; 5 | 21190 |
| D = QuEST | 0.8281; 4 | 21514 |
| C = PeakSeq | 0.8634; 3 | 20000 |
| B = MACS | 0.9023; 2 | 19918 |
| A = TRLocator | 0.9217; 1 | 19673 |
Average precision for the intersection (*) of two methods.
| x * y | Average precision | Number | |
|---|---|---|---|
| Score combination | Rank combination | ||
| C * F = PeakSeq * SISSRs | 0.887166 | 0.887675 | 13293 |
| E * F = CisGenome * SISSRs | 0.900260 | 0.885596 | 12841 |
| C * E = PeakSeq * CisGenome | 0.902211 | 0.892652 | 12662 |
| C * D = PeakSeq * QuEST | 0.910056 | 0.920872 | 11865 |
| D * F = QuEST * SISSRs | 0.911046 | 0.908774 | 10789 |
| D * E = QuEST * CisGenome | 0.914799 | 0.917028 | 14452 |
| B * D = MACS * QuEST | 0.938479 | 0.937476 | 14528 |
| B * C = MACS * PeakSeq | 0.941655 | 0.948495 | 12095 |
| B * F = MACS * SISSRs | 0.942113 | 0.950036 | 11003 |
| B * E = MACS * CisGenome | 0.949365 | 0.948955 | 14244 |
| A * B = TRLocator * MACS | 0.951392 | 0.950802 | 16921 |
| A * D = TRLocator * QuEST | 0.951877 | 0.950939 | 13270 |
| A * C = TRLocator * PeakSeq | 0.952759 | 0.956961 | 11573 |
| A * F = TRLocator * SISSRs | 0.959214 | 0.960584 | 10463 |
| A * E = TRLocator * CisGenome | 0.959687 | 0.959111 | 13155 |
Average precision for the union (+) of two methods.
| x + y | Average precision | Number | |
|---|---|---|---|
| Score combination | Rank combination | ||
| E + F = CisGenome + SISSRs | 0.8114 | 0.7997 | 26371 |
| D + E = QuEST + CisGenome | 0.8190 | 0.8158 | 26457 |
| D + F = QuEST + SISSRs | 0.8204 | 0.8038 | 25574 |
| C + D = PeakSeq + QuEST | 0.8526 | 0.8475 | 22191 |
| C + F = PeakSeq + SISSRs | 0.8545 | 0.8559 | 22876 |
| C + E = PeakSeq + CisGenome | 0.8610 | 0.8522 | 24415 |
| B + F = MACS + SISSRs | 0.8880 | 0.8950 | 20767 |
| B + D = MACS + QuEST | 0.8883 | 0.8876 | 21242 |
| B + E = MACS + CisGenome | 0.8983 | 0.8977 | 20768 |
| B + C = MACS + PeakSeq | 0.8983 | 0.9033 | 19895 |
| A + F = TRLocator + SISSRs | 0.9126 | 0.9030 | 19673 |
| A + B = TRLocator + MACS | 0.9168 | 0.9158 | 20279 |
| A + D = TRLocator + QuEST | 0.9168 | 0.9071 | 20117 |
| A + E = TRLocator + CisGenome | 0.9193 | 0.9178 | 19720 |
| A + C = TRLocator + PeakSeq | 0.9199 | 0.9177 | 19281 |
Coverage for single methods.
| Method | Coverage; rank | Number |
|---|---|---|
| F = SISSRs | 9322; 6 | 20715 |
| E = MACS | 11804; 5 | 19918 |
| D = TRLocator | 11850; 4 | 19673 |
| C = CisGenome | 14010; 3 | 21190 |
| B = QuEST | 14440; 2 | 21514 |
| A = PeakSeq | 15611; 1 | 20000 |
Coverage for the intersection (*) of two methods.
| x * y | Coverage | Number | |
|---|---|---|---|
| Score combination | Rank combination | ||
| B * F = QuEST * SISSRs | 10016 | 10016 | 10789 |
| C * F = CisGenome * SISSRs | 11211 | 11211 | 12841 |
| A * B = PeakSeq * QuEST | 11920 | 11920 | 11865 |
| A * C = PeakSeq * CisGenome | 12010 | 12010 | 12662 |
| E * F = MACS * SISSRs | 12351 | 12351 | 11003 |
| B * C = QuEST * CisGenome | 12459 | 12459 | 14452 |
| D * F = TRLocator * SISSRs | 12662 | 12662 | 10463 |
| A * F = PeakSeq * SISSRs | 12921 | 12921 | 13293 |
| A * E = PeakSeq * MACS | 13717 | 13717 | 12095 |
| A * D = PeakSeq * TRLocator | 13939 | 13939 | 11573 |
| B * E = QuEST * MACS | 14700 | 14700 | 14528 |
| B * D = QuEST * TRLocator | 14725 | 14725 | 13270 |
| C * E = CisGenome * MACS | 14947 | 14947 | 14244 |
| C * D = CisGenome * TRLocator | 15075 | 15075 | 13155 |
| D * E = TRLocator * MACS | 17725 | 17725 | 16921 |
Coverage for the union (+) of two methods.
| x + y | Coverage | Number | |
|---|---|---|---|
| Score combination | Rank combination | ||
| A + E = PeakSeq + MACS | 18964 | 18964 | 19895 |
| A + D = PeakSeq + TRLocator | 19433 | 19433 | 19281 |
| C + E = CisGenome + MACS | 19458 | 19458 | 20768 |
| E + F = MACS + SISSRs | 19459 | 19459 | 20767 |
| A + B = PeakSeq + QuEST | 19520 | 19520 | 22191 |
| D + F = TRLocator + SISSRs | 19742 | 19742 | 19673 |
| B + E = QuEST + MACS | 19760 | 19760 | 21242 |
| C + D = CisGenome + TRLocator | 19767 | 19767 | 19720 |
| B + D = QuEST + TRLocator | 20014 | 20014 | 20117 |
| D + E = TRLocator + MACS | 20127 | 20127 | 20279 |
| B + F = QuEST + SISSRs | 21003 | 21003 | 25574 |
| A + F = PeakSeq + SISSRs | 21032 | 21032 | 22876 |
| B + C = QuEST + CisGenome | 21165 | 21165 | 26457 |
| C + F = CisGenome + SISSRs | 21360 | 21360 | 26371 |
| A + C = PeakSeq + CisGenome | 21738 | 21738 | 24415 |
Figure 5Average precision for intersection of two methods.
Figure 6Average precision for union of two methods.
Figure 7Coverage for intersection of two methods.
Figure 8Coverage for union of two methods.