Literature DB >> 25220945

nuMap: a web platform for accurate prediction of nucleosome positioning.

Bader A Alharbi1, Thamir H Alshammari2, Nathan L Felton3, Victor B Zhurkin4, Feng Cui5.   

Abstract

Nucleosome positioning is critical for gene expression and of major biological interest. The high cost of experimentally mapping nucleosomal arrangement signifies the need for computational approaches to predict nucleosome positions at high resolution. Here, we present a web-based application to fulfill this need by implementing two models, YR and W/S schemes, for the translational and rotational positioning of nucleosomes, respectively. Our methods are based on sequence-dependent anisotropic bending that dictates how DNA is wrapped around a histone octamer. This application allows users to specify a number of options such as schemes and parameters for threading calculation and provides multiple layout formats. The nuMap is implemented in Java/Perl/MySQL and is freely available for public use at http://numap.rit.edu. The user manual, implementation notes, description of the methodology and examples are available at the site.
Copyright © 2014 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.

Entities:  

Keywords:  Nucleosome rotational positioning; Prediction of nucleosome positioning; Sequence patterns; Sequence-dependent DNA anisotropy; Web server

Mesh:

Substances:

Year:  2014        PMID: 25220945      PMCID: PMC4411418          DOI: 10.1016/j.gpb.2014.08.001

Source DB:  PubMed          Journal:  Genomics Proteomics Bioinformatics        ISSN: 1672-0229            Impact factor:   7.691


Introduction

The basic repeating unit of eukaryotic chromatin is the nucleosome, which contains a 145–147 bp DNA fragment wound around a histone core. The positioning of nucleosomes on a DNA fragment is critical for gene expression [1]. Along with other factors such as DNA methylation and histone modifications, DNA sequence is one of the most important factors guiding nucleosome positioning in vitro and in vivo [2]. Accurate determination of nucleosome positions is extremely important to study gene regulatory mechanisms. Unfortunately, micrococcal nuclease, which is commonly used for mapping nucleosome positions experimentally, exhibits high AT-specificity [3,4] and lacks the resolution to identify exact nucleosome positions, especially for nucleosomes with the GC-rich boundaries [5]. Therefore, prediction of nucleosome positioning on genomic sequence at high resolution is of great biological interest. Nucleosome positioning is usually characterized by two parameters: rotational positioning, which describes the side of the DNA helix that faces the histones, and translational positioning, which determines the nucleosome midpoint (or dyad) with regard to the DNA sequence [6]. We have developed two simple models, the YR scheme [7] and W/S scheme [8], for the prediction of translational and rotational positioning of nucleosomes, respectively. Both methods successfully predict ∼80% of nucleosome positions in vitro with 2-bp precision (see Supplementary materials at http://numap.rit.edu/app/dna/suppMaterial.xhtml). Here, we present a web-based application that implements both models for prediction of nucleosome positioning patterns.

Methods

Both YR and W/S schemes are based on the sequence-dependent DNA anisotropy [9,10], which dictates how DNA is wrapped around a histone octamer. One of the best established sequence patterns consistent with this anisotropy is the periodic occurrence of AT-containing dinucleotides (WW) and GC-containing dinucleotides (SS) in the nucleosomal locations where DNA is bent in the minor and major grooves, respectively [11]. The minor and major groove bending sites are defined by the base-pair step Roll values observed in nucleosome crystal structures, in which a DNA fragment of 147 bp or 146 bp in length is co-crystallized with histones [12]. Based on the Roll values, 14 minor-groove DNA bending sites and 12 major-groove DNA bending sites were selected from the 147-bp or 146-bp nucleosomal DNA template; each site is 4 bp in length (see the Methods section on the server website for details). The W/S scheme implements the aforementioned periodic WW and SS patterns. The W/S score SW/S(n) of the threaded sequence with the center at position n can be calculated aswhere CWW and CSS are the total occurrences of WW and SS dinucleotides at a given minor-groove or major-groove DNA bending site. In addition to the WW and SS motifs, the YR scheme incorporates GC, pyrimidine–purine (YR), YYRR and RYRY motifs to take into account their anisotropic bending into DNA grooves (see Methods at the site for details). The YR score S(n) of the threaded sequence with the center at position n can be calculated aswhere C is the occurrence of a designated motif x at a given site and w is the weight of the motif at this site. A total of 28 DNA sequence patterns are used in the YR scheme [7]. If a motif occurs at the site where DNA is severely bent, its occurrence is given a higher weight than at other sites (see Methods section on the website for details).

Implementation

The nuMap web application takes a DNA sequence as input and returns both numeric scores and corresponding profiles. We use the model-view-controller (MVC) design, in which the communication between the client and the database is mediated by the web application server (Figure 1). The server is implemented using a combination of extensible hypertext markup language pages (XHTML), and JavaServer Faces (JSF) as a rich component-based user interface. Nucleosome positioning scores can be computed by the YR and W/S schemes implemented at the backend of the server as Perl scripts. An open-source reporting engine, JasperReports library, is used with the combination of Java codes for reporting the results in graphical output in multiple formats including PDF, Excel and CSV (see Implementation Notes at the server site for details).
Figure 1

System infrastructure of the nuMap server

The MVC model describes an application in terms of three different design modules, the Model, the View and the Controller. The Model describes how the application represents the data model, including data files and a database management system (e.g., MySQL). The View describes what the user interface looks like and how the information is displayed at the user end (e.g., XHTML). The Controller part represents how the user interface is managed and controlled by the application code. MVC, model-view-controller.

Prediction options and features

Figure 2 shows the general layout for nuMap user interface. The user can select from four different output layouts. The “Single Layout” is used to generate single graph for each of the input sequences, whereas the “Superimpose Layout” overlays more than one graph, if multiple sequences are used as input. The “Average Layout (Asymmetric)” is used to calculate the average scores for multiple sequences, while the “Average Layout (Symmetric)” produces a symmetrical graph by generating the reverse complement of all the input sequences and then calculating the average scores at each base pair position. The setting panel allows the user to choose from two prediction schemes, YR and W/S. By default, a chosen scheme will calculate a score for the first nucleosome starting at the position 1 of the input sequence, and assign that score to the dyad position 74. The user can also give a specific value to the start of the input sequence (if not 1) by changing the value in the “Starting Position” box. The “Weight Set” option is specific to the YR scheme, the user can choose from three different sets used initially to establish this scheme with each set evaluating the occurrence of specific pattern at specific site differently (see Table SIII in [7]).
Figure 2

A screen shot of the nuMap server interface

.

The nuMap provides numerous reporting formats such as PDF (Figure 3), Word and Excel. Moreover, the server offers many features that allow the user to interact with output data in different ways such as zooming in a large graph, which is useful to investigate a certain range of values, and exploring dynamic data features, which show the corresponding score/base pair position when a user places the mouse over any point in the graph (Figure 4).
Figure 3

Example of nuMap output in PDF format

A sample out of the nuMap server is shown. The Information Panel is shown on the top, containing job-related information such as sequence name and user-selected options including prediction scheme, sequence, its length and composition. A profile of the nucleosome positioning score of the ‘601’ sequence, in this case, predicted by the W/S scheme is shown underneath. The physical meaning of the score is straightforward – the lower the DNA deformation energy for particular sequence motifs placed at the sites of DNA distortion, the higher the score. The score is calculated for each 147-bp window and is assigned to the center position of a given window. The actual score at each position is given in a tabular form.

Figure 4

A screen shot of nuMap output of superimposing profiles for two sequences

A screen shot of the nuMap output for two sequences superimposed together is shown. The score value on a particular position can be shown on the screen by mouse. In this case, a mouse hovering over position 94 shows the score, 47, at this position. This example illustrates dynamic data features of the server. The YR (pyrimidine–purine) scheme was used to calculate the scores. The physical meaning of the scores is given in the legend in Figure 3.

Future developments

Other models [13,14] work well in predicting nucleosome occupancy in vivo by introducing a position-independent component, PL, to represent sequences that are generally favored or disfavored regardless of their position within the nucleosome (most notably, poly(dA:dT) tracts, which are strongly disfavored by nucleosomes) [14]. Detailed comparisons in prediction accuracy between these models and ours have been made and will be published separately. The incorporation of this component into the YR or W/S scheme has a potential to improve both the rotational and translational positioning predictions of nucleosomes in vivo. Various models have been developed for nucleosome positioning predictions [15-19] and for gene regulatory analysis [20,21]. Nonetheless, one has to browse through different servers, which often have different formats and representations, making the comparison of the results extremely difficult. We will overcome this obstacle by reprogramming these methods, incorporating them into the nuMap server and presenting the results in the same format. Comparison of all the methods in this way will allow detailed analyses of the strengths and weaknesses of each approach, facilitating our understanding of the biophysical principles of nucleosome positioning.

Authors’ contributions

BAA and THA developed the server, performed the analyses and drafted the paper. NLF set up the server. VBZ and FC contributed to data analysis and paper writing. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.
  21 in total

1.  CMGRN: a web server for constructing multilevel gene regulatory networks using ChIP-seq and gene expression data.

Authors:  Daogang Guan; Jiaofang Shao; Youping Deng; Panwen Wang; Zhongying Zhao; Yan Liang; Junwen Wang; Bin Yan
Journal:  Bioinformatics       Date:  2014-01-02       Impact factor: 6.937

2.  nuScore: a web-interface for nucleosome positioning predictions.

Authors:  Michael Y Tolstorukov; Vidhu Choudhary; Wilma K Olson; Victor B Zhurkin; Peter J Park
Journal:  Bioinformatics       Date:  2008-04-29       Impact factor: 6.937

3.  Sequence periodicities in chicken nucleosome core DNA.

Authors:  S C Satchwell; H R Drew; A A Travers
Journal:  J Mol Biol       Date:  1986-10-20       Impact factor: 5.469

4.  Anisotropic flexibility of DNA and the nucleosomal structure.

Authors:  V B Zhurkin; Y P Lysov; V I Ivanov
Journal:  Nucleic Acids Res       Date:  1979-03       Impact factor: 16.971

5.  High sequence specificity of micrococcal nuclease.

Authors:  C Dingwall; G P Lomonossoff; R A Laskey
Journal:  Nucleic Acids Res       Date:  1981-06-25       Impact factor: 16.971

6.  Sequence specific cleavage of DNA by micrococcal nuclease.

Authors:  W Hörz; W Altenburger
Journal:  Nucleic Acids Res       Date:  1981-06-25       Impact factor: 16.971

7.  Structure-based analysis of DNA sequence patterns guiding nucleosome positioning in vitro.

Authors:  Feng Cui; Victor B Zhurkin
Journal:  J Biomol Struct Dyn       Date:  2010-06

8.  ICM Web: the interactive chromatin modeling web server.

Authors:  Richard C Stolz; Thomas C Bishop
Journal:  Nucleic Acids Res       Date:  2010-06-11       Impact factor: 16.971

9.  PTHGRN: unraveling post-translational hierarchical gene regulatory networks using PPI, ChIP-seq and gene expression data.

Authors:  Daogang Guan; Jiaofang Shao; Zhongying Zhao; Panwen Wang; Jing Qin; Youping Deng; Kenneth R Boheler; Junwen Wang; Bin Yan
Journal:  Nucleic Acids Res       Date:  2014-05-29       Impact factor: 16.971

Review 10.  Determinants of nucleosome positioning.

Authors:  Kevin Struhl; Eran Segal
Journal:  Nat Struct Mol Biol       Date:  2013-03       Impact factor: 15.369

View more
  4 in total

1.  Nucleosome allostery in pioneer transcription factor binding.

Authors:  Cheng Tan; Shoji Takada
Journal:  Proc Natl Acad Sci U S A       Date:  2020-08-10       Impact factor: 11.205

2.  ZCMM: A Novel Method Using Z-Curve Theory- Based and Position Weight Matrix for Predicting Nucleosome Positioning.

Authors:  Ying Cui; Zelong Xu; Jianzhong Li
Journal:  Genes (Basel)       Date:  2019-09-28       Impact factor: 4.096

3.  A deformation energy model reveals sequence-dependent property of nucleosome positioning.

Authors:  Guoqing Liu; Hongyu Zhao; Hu Meng; Yongqiang Xing; Lu Cai
Journal:  Chromosoma       Date:  2021-01-16       Impact factor: 4.316

4.  Temporal gene regulation by p53 is associated with the rotational setting of its binding sites in nucleosomes.

Authors:  Julia M Freewoman; Rajiv Snape; Feng Cui
Journal:  Cell Cycle       Date:  2021-03-25       Impact factor: 4.534

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.