| Literature DB >> 33659905 |
Stephen Kissane1, Vignesh Dhandapani1, Luisa Orsini1.
Abstract
The assay for transposase accessible chromatin (ATAC-seq) is a method for mapping genome-wide chromatin accessibility. Coupled with high-throughput sequencing, it enables integrative epigenomics analyses. ATAC-seq requires direct access to cell nuclei, a major challenge in non-model species such as small invertebrates, whose soft tissue is surrounded by a protective exoskeleton. Here, we present modifications of the ATAC-seq protocol for applications in small crustaceans, extending applications to non-model species. For complete information on the use and execution of this protocol, please refer to Buenrostro et al. (2013).Entities:
Keywords: Bioinformatics; Genomics; High-throughput screening; Sequencing
Mesh:
Substances:
Year: 2021 PMID: 33659905 PMCID: PMC7896190 DOI: 10.1016/j.xpro.2021.100341
Source DB: PubMed Journal: STAR Protoc ISSN: 2666-1667
ATAC primers
| Primer | Sequence F primer 5′-3′ | Sequence R primer 5′-3′ |
|---|---|---|
| ATAC-1 | CAAGCAGAAGACGGCATACGAGATGACACAGTGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACGACACAGTTCGTCGGCAGCGTCAGATGTG |
| ATAC-2 | CAAGCAGAAGACGGCATACGAGATGCATAACGGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACGCATAACGTCGTCGGCAGCGTCAGATGTG |
| ATAC-3 | CAAGCAGAAGACGGCATACGAGATACAGAGGTGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACACAGAGGTTCGTCGGCAGCGTCAGATGTG |
| ATAC-4 | CAAGCAGAAGACGGCATACGAGATCCACTAAGGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACCCACTAAGTCGTCGGCAGCGTCAGATGTG |
| ATAC-5 | CAAGCAGAAGACGGCATACGAGATTGTTCCGTGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACTGTTCCGTTCGTCGGCAGCGTCAGATGTG |
| ATAC-6 | CAAGCAGAAGACGGCATACGAGATGATACCTGGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACGATACCTGTCGTCGGCAGCGTCAGATGTG |
| ATAC-7 | CAAGCAGAAGACGGCATACGAGATAGCCGTAAGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACAGCCGTAATCGTCGGCAGCGTCAGATGTG |
| ATAC-8 | CAAGCAGAAGACGGCATACGAGATCTCCTGAAGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACCTCCTGAATCGTCGGCAGCGTCAGATGTG |
| ATAC-9 | CAAGCAGAAGACGGCATACGAGATACGAATCCGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACACGAATCCTCGTCGGCAGCGTCAGATGTG |
| ATAC-10 | CAAGCAGAAGACGGCATACGAGATAATGGTCGGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACAATGGTCGTCGTCGGCAGCGTCAGATGTG |
| ATAC-11 | CAAGCAGAAGACGGCATACGAGATCGCTACATGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACCGCTACATTCGTCGGCAGCGTCAGATGTG |
| ATAC-12 | CAAGCAGAAGACGGCATACGAGATCCTAAGTCGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACCCTAAGTCTCGTCGGCAGCGTCAGATGTG |
| ATAC-13 | CAAGCAGAAGACGGCATACGAGATTTGCTTGGGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACTTGCTTGGTCGTCGGCAGCGTCAGATGTG |
| ATAC-14 | CAAGCAGAAGACGGCATACGAGATCCTGTCAAGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACCCTGTCAATCGTCGGCAGCGTCAGATGTG |
| ATAC-15 | CAAGCAGAAGACGGCATACGAGATAGCCTATCGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACAGCCTATCTCGTCGGCAGCGTCAGATGTG |
| ATAC-16 | CAAGCAGAAGACGGCATACGAGATTGATCACGGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACTGATCACGTCGTCGGCAGCGTCAGATGTG |
| ATAC-17 | CAAGCAGAAGACGGCATACGAGATCCACATTGGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACCCACATTGTCGTCGGCAGCGTCAGATGTG |
| ATAC-18 | CAAGCAGAAGACGGCATACGAGATTCGAGAGTGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACTCGAGAGTTCGTCGGCAGCGTCAGATGTG |
| ATAC-19 | CAAGCAGAAGACGGCATACGAGATGGTCGTATGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACGGTCGTATTCGTCGGCAGCGTCAGATGTG |
| ATAC-20 | CAAGCAGAAGACGGCATACGAGATACAGGCATGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACACAGGCATTCGTCGGCAGCGTCAGATGTG |
| ATAC-21 | CAAGCAGAAGACGGCATACGAGATGTGATCCAGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACGTGATCCATCGTCGGCAGCGTCAGATGTG |
| ATAC-22 | CAAGCAGAAGACGGCATACGAGATTTCGTACGGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACTTCGTACGTCGTCGGCAGCGTCAGATGTG |
| ATAC-23 | CAAGCAGAAGACGGCATACGAGATATGACAGGGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACATGACAGGTCGTCGGCAGCGTCAGATGTG |
| ATAC-24 | CAAGCAGAAGACGGCATACGAGATCGACCTAAGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACCGACCTAATCGTCGGCAGCGTCAGATGTG |
| ATAC-25 | CAAGCAGAAGACGGCATACGAGATTATGGCACGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACTATGGCACTCGTCGGCAGCGTCAGATGTG |
| ATAC-26 | CAAGCAGAAGACGGCATACGAGATATAACGCCGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACATAACGCCTCGTCGGCAGCGTCAGATGTG |
| ATAC-27 | CAAGCAGAAGACGGCATACGAGATGTAGTACCGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACGTAGTACCTCGTCGGCAGCGTCAGATGTG |
| ATAC-28 | CAAGCAGAAGACGGCATACGAGATCGCGTATTGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACCGCGTATTTCGTCGGCAGCGTCAGATGTG |
| ATAC-29 | CAAGCAGAAGACGGCATACGAGATATCCACGAGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACATCCACGATCGTCGGCAGCGTCAGATGTG |
| ATAC-30 | CAAGCAGAAGACGGCATACGAGATTAACGTCGGTCTCGTGGGCTCGGAGATGT | AATGATACGGCGACCACCGAGATCTACACTAACGTCGTCGTCGGCAGCGTCAGATGTG |
List of forward (F) and reverse (R) primer sequences used in the sequencing of ATAC-seq data on an Illumina platform. Primers used at 25μM final concentration are shown in 5′-3′ orientation.
Figure 1Library amplification optimization via qPCR
Fluorescence plot used in point 5 of the ATAC protocol to optimize DNA library amplification steps and prevent over or underamplification, which can causes biases in the DNA fragment representation within libraries. To determine the optimal number of PCR cycles, we plot the qPCR fluorescent signal, and identify the value corresponding to 1/3 of the plateau fluorescent value (red line). The number of PCR cycles to achieve an optimal amplification is the number of cycles that intersects the threshold (red line). If this value falls between two cycles, the lower cycle should be preferred. The vertical red line is the number of cycles that intersects the threshold.
Figure 2Optimizing DNA input for ATAC-seq
A titration experiment was completed to identify the optimal input material (tissue) to generate a robust ATAC-seq library. The results of this titration experiment are shown as tape station traces of ATAC-seq libraries obtained from different starting material: 7, 15, 30, and 60 last instar Daphnia female juveniles, from a single genotype. These different starting materials were tested with three tissue homogenization techniques: manual homogenization with a pestle, automated tissue homogenizer (Geno/Grinder) and a combination of automated tissue homogenizer (Geno/Grinder) and DNA sharing using a pico-sonicator (Bioruptor). The scale on the y axis may differ. The scale on the x axis shows size in base pairs (bp).
Figure 3Optimizing transposase dilution in ATAC-seq libraries
Population epigenomics is largely unexplored due to the prohibitive costs of the transposase enzyme. Here we show the outcome of a titration experiment for different dilution of the transposase enzyme suggested by the manufacturer. ATAC-seq tape station traces are shown for libraries obtained with different dilutions of transposase: 100%, 50%, 25%, and 12% of the manufacturer suggested concentration. (A–D) ATAC libraries obtained with an over/night (O/N) incubation of the transposition reaction preceded by an automated tissue homogenization step; (E–H) libraries obtained with an over/night (O/N) incubation of the transposition reaction preceded by an automated tissue homogenization step combined with pico-sonication (DNA shearing); A1 and C1: ATAC libraries obtained for 100% and 25% dilutions of the transposase with an incubation of the transposition reaction of 30 min.
Figure 4Normalized ATAC-seq traces for four scaffolds on the Daphnia magna genome
Normalized ATAC-seq traces shown for 4 scaffolds on the Daphnia magna genome (v2.4) obtained from the ATAC-seq libraries in Figure 3. These traces are obtained from low coverage libraries and compared to a high coverage library obtained from the same Daphnia genotype R, used as reference.
Sequence metrics
| Sample name | No. of raw paired end reads | No. of QC paired end reads | Insert size median | Coverage mean | Mapped reads % | No. of reads mapped to the mitochondria | No. of peaks |
|---|---|---|---|---|---|---|---|
| LRV13_3R1_C | 18,635,973 x 2 | 16,135,365 x 2 | 151 | 24,5574 | 91.33% | 17,974 | 61,285 |
| LRV13_3-R2C | 29,340,610 x 2 | 25,005,090 x 2 | 153 | 37,6782 | 91.84% | 19,888 | 65,281 |
| LRV13_3R1_I | 16,668,003 x 2 | 12,773,098 x 2 | 131 | 16,3577 | 89.55% | 24,572 | 59,735 |
| LRV13_3R2_I | 13,709,892 x 2 | 10,100,948 x 2 | 123 | 12,4835 | 88.91% | 21,810 | 60,661 |
| LRV2_1R1_C | 12,140,926 x 2 | 10,137,006 x 2 | 197 | 16,1945 | 91.36% | 3,280 | 28,993 |
| LRV2_1R2_C | 14,302,926 x 2 | 11,238,267 x 2 | 168 | 15,8192 | 90.47% | 7,696 | 43,685 |
| LRV2_1R1_I | 12,388,306 x 2 | 7,082,046 x 2 | 111 | 7,4666 | 83.23% | 21,434 | 61,512 |
| LRV2_1R2_I | 13,813,597 x 2 | 9,790,744 x 2 | 129 | 11,1734 | 83.20% | 21,059 | 58,803 |
| LRV3_5_15R1_C | 18,483,584 x 2 | 13,824,157 x 2 | 155 | 18,9622 | 93.69% | 8,435 | 51,683 |
| LRV3_5_15R2_C | 25,080,326 x 2 | 20,644,243 x 2 | 174 | 30,8094 | 93.78% | 7,970 | 48,057 |
| LRV3_5_15R1_I | 14,942,965 x 2 | 10,405,452 x 2 | 129 | 13,1903 | 92.63% | 20,251 | 55,355 |
| LRV3_5_15R2_I | 13,019,186 x 2 | 8,165,030 x 2 | 129 | 10,0822 | 92.68% | 17,601 | 53,166 |
QC metrics for NGS data generated from the high coverage samples (LRV2_1; LRV3.5_15 and LRV13_3) obtained from an exposure experiment in which clonal replicates were either exposed to the insecticide Carabaryl (8μg/L) or maintained in non-stress (control) conditions. C = control; I = insecticide carbaryl; R1 = technical replicate 1; R2 = technical replicate 2.
Figure 5Normalized ATAC-seq traces of three Daphnia genotypes exposed to the insecticide carbaryl
Normalized ATAC traces are shown for three genotypes of D. magna (LRV2_1; LRV3.5_15 and LRV13_3) for a representative scaffold of ca. 400 kb (Scaffold 2244). For each genotype a clonal replicate exposed to 8 μg/L of the insecticide carbaryl (I) and a non-exposed clonal replicate (C) are shown. These traces are obtained from high coverage libraries (40 million reads).
Figure 6Chromatin accessibility patterns in Daphnia magna
(A and B) (A) Principal component analysis of genome-wide accessibility variation at three D. magna genotypes (LRV2_1; LRV3.5_15 and LRV13_3). Clonal replicates of the three genotypes exposed to the insecticide carbaryl (I; 8 μg/L) as well as non-exposed clonal replicates (C) are shown; (B) Venn diagram showing unique ATAC normalized traces identified in D. magna genotypes exposed to the insecticide carbaryl (I), as well as traces shared between treated (I) and untreated (C) genotypes.
Figure 7Heatmaps of transcription start sites (TSS)
Distribution of genome-wide DNA motifs shared among three Daphnia magna genotypes exposed to the insecticide carbaryl (8 μg/L) (LRV3.5_15I, LRV2_1I and LRV13_3I) and maintained in non-stressful conditions (LRV3.5_15C, LRV2_1C, and LRV13_3C). Peak traces across samples were scaled using deeptools and visualized with plotHeatmap. TSS were scaled to include 1 kb up and downstream of the TSS. Similarity decreases from red to blue.
Known motifs enrichment results
| Motif name | Consensus | Log p value | q value (Benjamini) | # of target sequences with motif (of 10,325) | % of target sequences with motif | # of background sequences with motif (of 32,685) | % of background sequences with motif |
|---|---|---|---|---|---|---|---|
| NFIL3(bZIP)/HepG2-NFIL3-ChIP-Seq(Encode)/Homer | VTTACGTAAYNNNNN | −7.14E+02 | 0 | 2,868 | 27.77% | 4,448.9 | 13.61% |
| Foxh1(Forkhead)/hESC-FOXH1-ChIP-Seq(GSE29422)/Homer | NNTGTGGATTSS | −1.28E+03 | 0 | 3,435 | 33.27% | 4,471.3 | 13.68% |
| AT2G20400(G2like)/colamp-AT2G20400-DAP-Seq(GSE60143)/Homer | DNVGAATATTCBNHN | −6.69E+02 | 0 | 1,164 | 11.27% | 1,050.5 | 3.21% |
| AT1G76880(Trihelix)/col-AT1G76880-DAP-Seq(GSE60143)/Homer | ACGGTAAAAW | −6.13E+02 | 0 | 2,177 | 21.08% | 3,130.6 | 9.58% |
| At2g03500(G2like)/col-At2g03500-DAP-Seq(GSE60143)/Homer | WWAGAATATTCT | −5.68E+02 | 0 | 1,315 | 12.73% | 1,457.9 | 4.46% |
| At5g29000(G2like)/col-At5g29000-DAP-Seq(GSE60143)/Homer | RGAATATTCYHH | −5.35E+02 | 0 | 1370 | 13.27% | 1,618.3 | 4.95% |
| JGL(C2H2)/col-JGL-DAP-Seq(GSE60143)/Homer | ACYTTCAGTT | −4.94E+02 | 0 | 3,956 | 38.31% | 7,971.9 | 24.39% |
| Foxa3(Forkhead)/Liver-Foxa3-ChIP-Seq(GSE77670)/Homer | BSNTGTTTACWYWGN | −3.54E+02 | 0 | 1,701 | 16.47% | 2,746.3 | 8.40% |
| FOXP1(Forkhead)/H9-FOXP1-ChIP-Seq(GSE31006)/Homer | NYYTGTTTACHN | −3.12E+02 | 0 | 2,146 | 20.78% | 3,964.4 | 12.13% |
| At3g04030(G2like)/col-At3g04030-DAP-Seq(GSE60143)/Homer | DRGAATCT | −2.84E+02 | 0 | 1,903 | 18.43% | 3,465.8 | 10.60% |
| At1g25550(G2like)/colamp-At1g25550-DAP-Seq(GSE60143)/Homer | NAGATTCY | −2.79E+02 | 0 | 1,921 | 18.60% | 3,530.5 | 10.80% |
| FOXK2(Forkhead)/U2OS-FOXK2-ChIP-Seq(E-MTAB-2204)/Homer | SCHTGTTTACAT | −2.58E+02 | 0 | 2,290 | 22.18% | 4,558.8 | 13.95% |
| FUS3(ABI3VP1)/col-FUS3-DAP-Seq(GSE60143)/Homer | DNNWTNTGCATGKNN | −2.36E+02 | 0 | 1,323 | 12.81% | 2,240.2 | 6.85% |
| At1g68670(G2like)/colamp-At1g68670-DAP-Seq(GSE60143)/Homer | WNWWHNRAAGATTCT | −2.30E+02 | 0 | 1,419 | 13.74% | 2,489.4 | 7.62% |
| Foxo1(Forkhead)/RAW-Foxo1-ChIP-Seq(Fan_et_al.)/Homer | CTGTTTAC | −2.13E+02 | 0 | 3,355 | 32.49% | 7,721.5 | 23.63% |
| GTL1(Trihelix)/colamp-GTL1-DAP-Seq(GSE60143)/Homer | WWTTTACCKY | −1.89E+02 | 0 | 3,023 | 29.28% | 6,935.8 | 21.22% |
| Foxo3(Forkhead)/U2OS-Foxo3-ChIP-Seq(E-MTAB-2701)/Homer | DGTAAACA | −1.86E+02 | 0 | 2,570 | 24.89% | 5,689.8 | 17.41% |
| FoxD3(forkhead)/ZebrafishEmbryo-Foxd3.biotin-ChIP-seq(GSE106676)/Homer | TGTTTAYTTAGC | −1.81E+02 | 0 | 2,115 | 20.48% | 4,483.8 | 13.72% |
| OCT:OCT-short(POU,Homeobox)/NPC-OCT6-ChIP-Seq(GSE43916)/Homer | ATGCATWATGCATRW | −1.65E+02 | 0 | 2,091 | 20.25% | 4,511 | 13.80% |
| At5g04390(C2H2)/col200-At5g04390-DAP-Seq(GSE60143)/Homer | AGTGANDN | −1.55E+02 | 0 | 6473 | 62.69% | 17,722.3 | 54.23% |
| Foxa2(Forkhead)/Liver-Foxa2-ChIP-Seq(GSE25694)/Homer | CYTGTTTACWYW | −1.48E+02 | 0 | 2,418 | 23.42% | 5,514.6 | 16.87% |
| At3g24120(G2like)/col-At3g24120-DAP-Seq(GSE60143)/Homer | NWWAGMATMW | −1.21E+02 | 0 | 5,036 | 48.77% | 13,498.4 | 41.30% |
| AT5G60130(ABI3VP1)/col-AT5G60130-DAP-Seq(GSE60143)/Homer | WTTYTAAGVAAA | −1.20E+02 | 0 | 3,896 | 37.73% | 10,024.7 | 30.67% |
| At2g01060(G2like)/colamp-At2g01060-DAP-Seq(GSE60143)/Homer | AGATKCBNWW | −1.17E+02 | 0 | 5,572 | 53.96% | 15,213.2 | 46.55% |
| RBFox2(?)/Heart-RBFox2-CLIP-Seq(GSE57926)/Homer | TGCATGCA | −1.10E+02 | 0 | 2,618 | 25.35% | 6,366.5 | 19.48% |
| AT5G47660(Trihelix)/colamp-AT5G47660-DAP-Seq(GSE60143)/Homer | AWTTTTACCG | −1.10E+02 | 0 | 3,348 | 32.42% | 8,493.7 | 25.99% |
| Fox:Ebox(Forkhead,bHLH)/Panc1-Foxa2-ChIP-Seq(GSE47459)/Homer | NNNVCTGWGYAAACASN | −9.71E+01 | 0 | 1,766 | 17.10% | 4,066.8 | 12.44% |
| STZ(C2H2)/colamp-STZ-DAP-Seq(GSE60143)/Homer | HNBTCACT | −9.54E+01 | 0 | 6,447 | 62.43% | 18,252.9 | 55.85% |
| At3g12730(G2like)/colamp-At3g12730-DAP-Seq(GSE60143)/Homer | AAGATTCT | −8.99E+01 | 0 | 2,458 | 23.80% | 6,079.2 | 18.60% |
| AT5G45580(G2like)/colamp-AT5G45580-DAP-Seq(GSE60143)/Homer | ADRGAATCTH | −7.41E+01 | 0 | 3,038 | 29.42% | 7,938.1 | 24.29% |
| GT2(Trihelix)/colamp-GT2-DAP-Seq(GSE60143)/Homer | AMGGTAAAWWWN | −7.10E+01 | 0 | 2,944 | 28.51% | 7694.4 | 23.54% |
| FOXK1(Forkhead)/HEK293-FOXK1-ChIP-Seq(GSE51673)/Homer | NVWTGTTTAC | −6.86E+01 | 0 | 2,808 | 27.19% | 7,319.6 | 22.40% |
| FOXM1(Forkhead)/MCF7-FOXM1-ChIP-Seq(GSE72977)/Homer | TRTTTACTTW | −6.43E+01 | 0 | 2,428 | 23.51% | 6,249.7 | 19.12% |
| AT2G38300(G2like)/col-AT2G38300-DAP-Seq(GSE60143)/Homer | ADRGAATGTT | −6.01E+01 | 0 | 2,429 | 23.52% | 6,299.9 | 19.28% |
| FoxL2(Forkhead)/Ovary-FoxL2-ChIP-Seq(GSE60858)/Homer | WWTRTAAACAVG | −4.43E+01 | 0 | 2,395 | 23.19% | 6,396.6 | 19.57% |
| AP-2gamma(AP2)/MCF7-TFAP2C-ChIP-Seq(GSE21234)/Homer | SCCTSAGGSCAW | −4.38E+01 | 0 | 205 | 1.99% | 322.3 | 0.99% |
| AP-2alpha(AP2)/Hela-AP2alpha-ChIP-Seq(GSE31477)/Homer | ATGCCCTGAGGC | −4.31E+01 | 0 | 150 | 1.45% | 207.7 | 0.64% |
| KAN2(G2like)/colamp-KAN2-DAP-Seq(GSE60143)/Homer | ATATTCTY | −4.10E+01 | 0 | 2,289 | 22.17% | 6,126.4 | 18.75% |
| FOXA1(Forkhead)/MCF7-FOXA1-ChIP-Seq(GSE26831)/Homer | WAAGTAAACA | −2.48E+01 | 0 | 2,438 | 23.61% | 6,834.6 | 20.91% |
| Foxf1(Forkhead)/Lung-Foxf1-ChIP-Seq(GSE77951)/Homer | WWATRTAAACAN | −1.78E+01 | 0 | 2,621 | 25.38% | 7,539.7 | 23.07% |
| KANADI1(Myb)/Seedling-KAN1-ChIP-Seq(GSE48081)/Homer | ARGAATAWWN | −1.77E+01 | 0 | 2,421 | 23.45% | 6,932.4 | 21.21% |
| EBF(EBF)/proBcell-EBF-ChIP-Seq(GSE21978)/Homer | DGTCCCYRGGGA | −1.45E+01 | 0 | 29 | 0.28% | 31.1 | 0.10% |
| EBF2(EBF)/BrownAdipose-EBF2-ChIP-Seq(GSE97114)/Homer | NABTCCCWDGGGAVH | −1.41E+01 | 0 | 178 | 1.72% | 384.3 | 1.18% |
| Dorsal(RHD)/Embryo-dl-ChIP-Seq(GSE65441)/Homer | GGGAAAAMCCCG | −1.39E+01 | 0 | 130 | 1.26% | 262.8 | 0.80% |
| MYB73(MYB)/col-MYB73-DAP-Seq(GSE60143)/Homer | NNNNHAACNGHHDHN | −1.38E+01 | 0 | 4,006 | 38.80% | 11,938.3 | 36.53% |
| E2F4(E2F)/K562-E2F4-ChIP-Seq(GSE31477)/Homer | GGCGGGAAAH | −1.35E+01 | 0 | 421 | 4.08% | 1,055 | 3.23% |
| Tlx?(NR)/NPC-H3K4me1-ChIP-Seq(GSE16256)/Homer | CTGGCAGSCTGCCA | −1.23E+01 | 0.0001 | 117 | 1.13% | 238.7 | 0.73% |
| HDG7(HB)/col-HDG7-DAP-Seq(GSE60143)/Homer | WGCATTTAATGC | −1.20E+01 | 0.0001 | 509 | 4.93% | 1,324.4 | 4.05% |
| AT2G40260(G2like)/colamp-AT2G40260-DAP-Seq(GSE60143)/Homer | WAAAYATTCTTT | −1.19E+01 | 0.0001 | 2,664 | 25.80% | 7,829.9 | 23.96% |
| LBD2(LOBAS2)/colamp-LBD2-DAP-Seq(GSE60143)/Homer | TCCGAWTTTTTCGGN | −1.14E+01 | 0.0002 | 547 | 5.30% | 1,442.8 | 4.41% |
| RKD2(RWPRK)/colamp-RKD2-DAP-Seq(GSE60143)/Homer | GACKTTTCRDCTTCC | −1.07E+01 | 0.0004 | 741 | 7.18% | 2,020.9 | 6.18% |
| ZNF528(Zf)/HEK293-ZNF528.GFP-ChIP-Seq(GSE58341)/Homer | AGAAATGACTTCCCT | −9.87E+00 | 0.001 | 6 | 0.06% | 2.4 | 0.01% |
| ZNF143|STAF(Zf)/CUTLL-ZNF143-ChIP-Seq(GSE29600)/Homer | ATTTCCCAGVAKSCY | −9.78E+00 | 0.0011 | 77 | 0.75% | 151.6 | 0.46% |
| LRF(Zf)/Erythroblasts-ZBTB7A-ChIP-Seq(GSE74977)/Homer | AAGACCCYYN | −8.93E+00 | 0.0025 | 317 | 3.07% | 813 | 2.49% |
| FOXA1(Forkhead)/LNCAP-FOXA1-ChIP-Seq(GSE27824)/Homer | WAAGTAAACA | −7.56E+00 | 0.0095 | 2,801 | 27.13% | 8,400.2 | 25.70% |
| Pax8(Paired,Homeobox)/Thyroid-Pax8-ChIP-Seq(GSE26938)/Homer | GTCATGCHTGRCTGS | −6.85E+00 | 0.019 | 98 | 0.95% | 223.7 | 0.68% |
| THRa(NR)/C17.2-THRa-ChIP-Seq(GSE38347)/Homer | GGTCANYTGAGGWCA | −6.79E+00 | 0.0199 | 106 | 1.03% | 246 | 0.75% |
| EBF1(EBF)/Near-E2A-ChIP-Seq(GSE21512)/Homer | GTCCCCWGGGGA | −6.50E+00 | 0.0261 | 122 | 1.18% | 291.4 | 0.89% |
| LOB(LOBAS2)/col-LOB-DAP-Seq(GSE60143)/Homer | CGCCGKAWWTTHCGS | −6.18E+00 | 0.0353 | 188 | 1.82% | 479.4 | 1.47% |
| At3g60580(C2H2)/col-At3g60580-DAP-Seq(GSE60143)/Homer | WTTYTACT | −6.16E+00 | 0.0354 | 5,763 | 55.81% | 17,780 | 54.40% |
| GEI-11(Myb?)/cElegans-L4-GEI11-ChIP-Seq(modEncode)/Homer | CCGACAYYTYACGGG | −6.02E+00 | 0.0402 | 47 | 0.46% | 95.1 | 0.29% |
ATAC motifs found in the control (non-exposed) and in carabaryl exposed samples. The motif name; the consensus motif sequence; the log p value; the corrected Benjamini q value; the Number of Target Sequences (including their percentage); and the number of Background Sequences with Motifs (including their percentage) are shown.
Tris-HCl solution
| Reagent | Final concentration (mM) | Volume (mL)/weight (g) |
|---|---|---|
| Tris-HCl | 1,000 | 7.88 g |
| Nuclease-free H2O | n/a | 50 mL |
| Total | n/a | 50 mL |
NaCl solution
| Reagent | Final concentration (mM) | Volume (mL)/weight (g) |
|---|---|---|
| NaCl | 5,000 | 14.61 g |
| Nuclease-free H2O | n/a | 50 mL |
| Total | n/a | 50 mL |
MgCl2 solution
| Reagent | Final concentration (mM) | Volume (mL)/weight (g) |
|---|---|---|
| MgCl2 | 300 | 1.43 g |
| Nuclease-free H2O | n/a | 50 mL |
| Total | n/a | 50 mL |
RSB buffer
| Reagent | Final concentration (mM) | Volume (mL) |
|---|---|---|
| Tris-HCl | 1,000 | 0.25 |
| NaCl | 5,000 | 0.05 |
| MgCl2 | 300 | 0.25 |
| Nuclease-free H2O | n/a | 24.45 |
| Total | n/a | 25 |
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Buffer EB | Qiagen | CAT# 19086 |
| Ceramic beads (1.4 mm diameter; 325 g) | Qiagen | CAT# 13113-325 |
| ∗Ethanol absolute, molecular biology grade (200 proof) | Fisher Scientific | CAT# 10517694 |
| High sensitivity D1000 reagents | Agilent | CAT# 5067- 5585 |
| High sensitivity D1000 ScreenTape | Agilent | CAT# 5067- 5584 |
| Magbio HighPrep PCR beads | Auto Q Biosciences | CAT# AQ 60500 |
| Illumina Tagment DNA TDE1 enzyme and buffer kits | Illumina | CAT# 20034198 |
| Liquid Proteinase K | Fisher Scientific | CAT# NC1442588 |
| ∗MasterPure Complete DNA and RNA Purification Kit | Lucigen | CAT# MC85200 |
| ∗MgCl2 | Sigma-Aldrich | CAT# M8266-100G |
| ∗NaCl | Sigma-Aldrich | CAT# S7653-250G |
| NEBNext High-Fidelity 2× PCR Master Mix | New England Biolabs | CAT# M0541L |
| Qubit 1× dsDNA HS Assay Kit | Invitrogen | CAT# Q33231 |
| ∗RNase A | Qiagen | CAT# 19101 |
| Screw cap tubes (2 mL) | VWR International | CAT# 211-0432 |
| iTaq Universal SYBR Green Supermix | Bio-Rad | CAT# 172-5121 |
| Tris-HCl | Sigma-Aldrich | CAT# 93363-500G |
| Tween 20 | Sigma-Aldrich | CAT# P9416-100ML |
| Water DEPC treated | National Diagnostics | EC-625 |
| University of Birmingham | NA | |
| University of Birmingham | NA | |
| University of Birmingham | NA | |
| University of Birmingham | NA | |
| fastqc (V0.11.9) | ( | |
| Trimmomatic (v0.32) | ( | = |
| Galaxy pipeline | ( | |
| multiqc | ( | |
| ataqv | ( | |
| Bowtie2 (v2.2.6) | ( | |
| Samtools | ( | |
| Picard tools | Online resource | |
| Qualimap 2 | ( | |
| deeptools alignmentSieve | ( | |
| MACS2 callpeak | ( | |
| bedtools | ( | |
| Hypergeometric Optimization of Motif EnRichment (HOMER) | ( | |
| Source | Identifier | |
|---|---|---|
| Fine Balance | VWR | LA2541 |
| Centrifuge compatible for Eppendorf tubes (5424R) | Eppendorf | 5404000332 |
| 2020 Geno/Grinder | Spex Sample Prep | n/a |
| NanoDrop 8000 | Labtech | ND-8000-GL |
| Nexus Gradient | Eppendorf | 6331000041 |
| Qubit 3.0 Fluorometer | Invitrogen | Q33216 |
| Tapestation 2200 | Agilent | G2964AA |
| Thermomixer C | Eppendorf | 5382000031 |
| Biomek FxP | Beckman Coulter | A31844 |
| Bioruptor Pico-sonication device | Diagenode | B01060010 |
The ATAC-seq protocol was optimized manually on a small number of samples. After this optimization step, a liquid handling Biomek FxP workstation was used to prepare large sample numbers. Any liquid handling robotic workstation can be used.
| ENCODE standard | Current paper |
|---|---|
| Experiments should have two or more biological replicates or at least two technical replicates | Three biological replicated and two technical replicates per samples |
| Each replicate should have 25 million on-duplicate, non-mitochondrial aligned reads | Reads per replica ranged between 14 and 50 million reads (Average read per replica: 26 million reads) |
| Percentage of mapped reads should be greater than 95%, though values >80% may be acceptable | Percentage of mapped reads ranged between 83.2% and 93.8% |
| Irreproducible Discovery Rate (IDR) should be less than 2 | IDR <1.6 |
| The number of peaks within a replicated peak file should be >150,000, though values >100,000 may be acceptable | >50,000 (corrected for the genome size of |
| The number of peaks within an IDR peak file should be >70,000, though values >50,000 may be acceptable | > 11,700 (corrected for the genome size of |