| Literature DB >> 35251128 |
Na He1,2, Wenjing Wang3, Chao Fang4, Yongjian Tan2, Li Li5,6, Chunhui Hou2.
Abstract
Negative regulatory elements (NREs) down-regulate gene expression by inhibiting the activities of promoters or enhancers. The repressing activity of NREs can be measured globally by massively parallel reporter assays (MPRAs). However, most existing algorithms are designed for the statistical detection of positively enriched signals in MPRA datasets. To identify reduced signals in MPRA experiments, we designed a NRE identification program, fast-NR, by integrating the count and graphic features of sequenced reads to detect NREs using datasets generated by experiments of self-transcribing active regulatory region sequencing (STARR-seq). Fast-NR identified hundreds of silencers in human K562 cells that can be validated by independent methods.Entities:
Keywords: count difference; curve similarity; negative regulatory element; silencer; silencer identification
Year: 2022 PMID: 35251128 PMCID: PMC8896116 DOI: 10.3389/fgene.2022.818344
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Negative regulatory element identification pipeline. BH, Benjamini–Hochberg correction.
FIGURE 2Program performance comparison on simulated datasets. (A) The number of silencers identified by different programs. The fraction of reads retained to simulate silencers is shown under x axis. p value <10−5. (B) The silencer detection power of different programs at different levels of confidence. Detection power is the ratio between number of identified silencers over total number of simulated silencers.
FIGURE 3Program performance comparison on STARR-seq datasets. (A) The number of silencers identified by different programs. p value <10−5. (B) The percentage of silencers identified by CRADLE and Fast-NR at different confidence levels. The number of silencers identified at p < 10−2 is set at 100%. (C) Venn diagrams of silencers identified by Fast-NR and CRADLE in K562 and A549, respectively. p value <10−5. (D) Reads ratio (reporter cDNA/input inserts) distribution for silencers identified only by CRADLE or Fast-NR in K562 and A549, respectively. (E) Exemplary silencers identified only by CRADLE (left) or by Fast-NR (right). (F) Percentages of Fast-NR- and CRADLE-identified silencers (p < 10−5) reported by Doni Jayavelu et al.
FIGURE 4Curve similarity effect on silencer identification. (A) The cosine distance distribution for silencers identified by Fast-NR with similarity considered, similarity not considered, and controls of whole genome regions with 400 bp size and shuffled silencer regions. Distance negatively correlates with similarity. **p < 10−3, ***p < 10−4, Wilcoxon rank sum test. (B) The correlation between silencer strength and curve similarity. The X axis shows the value of −log2 (cDNA reads/insert DNA reads). The Y axis shows the curve similarity index, −log2 (Cosine distance) of silencers calculated by the method of cosine. Blue dots are silencers that pass curve similarity threshold of 0.9, and gray dots are silencers that do not pass curve similarity threshold.