| Literature DB >> 32117432 |
Gyan Prakash Mishra1,2, Arup Ghosh1,2, Atimukta Jha1,3, Sunil Kumar Raghav1,2,3.
Abstract
A large number of genomic regions, such as transcription factor binding sites (TFBSs) captured from next generation sequencing (NGS) data analyses or those available from the public resource database ENCODE, are generally overlapped to answer a variety of biological questions. Though several command-line tools are available to perform such an analysis, there is a notable lack of an integrated webserver application with which to identify genomic region intersections, generate publication-ready plots depicting subsets of the overlapped regions, and perform functional annotation. Thus, there is an ardent need for a comprehensive and user-friendly webserver application that allows the users to either upload multiple datasets or select from the integrated Gene Transcription Regulation Database (GTRD). We thus introduce BedSect (http://imgsb.org/bedsect/.), which not only fulfils the above criteria but also performs intersection analysis along with visualization of the intersection regions as an UpSet and correlation plot using the integrated Shiny application. Moreover, analyses, including functional annotation, gene ontology, and biological pathways enrichment for the identified unique and intersected genomic regions, can also be performed using the integrated GREAT tool. To view the genomic regions in the genome browser, the inbuilt hyperlink for UCSC can redirect the user to visualize the results as custom tracks.Entities:
Keywords: functional annotation; genomic colocalization; genomic region overlap; next generation sequencing (NGS); web server tool
Year: 2020 PMID: 32117432 PMCID: PMC7013082 DOI: 10.3389/fgene.2020.00003
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Flowchart representing the workflow of BedSect tool.
Figure 2(A) Snapshot of the homepage of the web server. (B) Result page showing the analysis results after uploading the BED files. (C) UpSet plot representation of intersection regions. (D) Correlation plot representing similarity based on Jaccard indices (0 to 1). (E) Snapshot of UCSC genome browser showing uploaded user track.
Figure 3(A) Heatmap of correlation of 24 NR based on Jaccard index. (B) UpSet plot representing the number of subsets of intersection between five factors. (C) Genomic regions distribution near TSS. (D) Number of genes annotated to genomic regions. (E) Enriched pathway against MSigDB obtained from the GREAT analysis.
Comparison of BedSect with other publicly available tools.
| Tools | Algorithm/Methods | Generate figures (type) | Intersection regions file as output | Functional annotation | Visualization in genome browser | Platform type or Utility |
|---|---|---|---|---|---|---|
| BEDtools | Set theory on the genome | × | ✓ | × | × | Command line based for genomic regions overlap |
| Pybedtools | Classical Venn | ✓ | ✓ | × | × | Command line based for genomic regions overlap |
| ChippeakAnno( | Classical Venn | ✓ | ✓ | × | × | R package |
| PAVIS ( | Functional annotation | ✓ | × | × | Web server only for functional annotation | |
| PeakAnalyzer( | Functional annotation |
| × | × | Command line based for functional annotation and genomic regions overlap of two genomic region files. | |
| regioneR( | Randomization-based | × | × | × | × | R package |
| BEDOPS( | Suite | × | ✓ | × | × | Command line Based for genomic region intersection |
| Intervene | Classical Venn, Euler, Edwards, Chow-Ruskey, Square, Battle | ✓ | ✓ | × | × | Command line based for genomic regions intersection and visualization. Web server only for visualization |
| BedSect | Integration of multiple tools (BEDtools, | ✓ | ✓ | ✓ | ✓ | Completely Web server based for the intersection, visualization and functional annotation |