| Literature DB >> 22587738 |
Jan Mrázek1, Tejas Chaudhari, Aryabrata Basu.
Abstract
BACKGROUND: Periodic spacing of short adenine or thymine runs phased with DNA helical period of ~10.5 bp is associated with intrinsic DNA curvature and deformability, which play important roles in DNA-protein interactions and in the organization of chromosomes in both eukaryotes and prokaryotes. Local differences in DNA sequence periodicity have been linked to differences in gene expression in some organisms. Despite the significance of these periodic patterns, there are virtually no publicly accessible tools for their analysis.Entities:
Year: 2011 PMID: 22587738 PMCID: PMC3372288 DOI: 10.1186/2042-5783-1-13
Source DB: PubMed Journal: Microb Inform Exp ISSN: 2042-5783
Figure 1Periodicity plots for the four analyzed genomes. The ordinate displays the normalized intensity Q*(P) of the periodic signal in the spacing of AA and TT dinucleotides for the period shown by the abscissa. The parameters smin and smax were set at 30 and 100 bp, respectively. See text for details. The horizontal lines and shading refer to statistical significance of the peaks in the plot. The dark shaded area corresponds to values below the 50th percentile of the dominant periodic signal in random sequences. The light shaded area refers to values between the 50th and 95th percentiles. Peaks rising above the shaded area can be considered statistically significant. An additional line without shading refers to the 99th percentile. The definition of the MaxQ and PMaxQ indices is demonstrated in the periodicity plot for M. jannaschii.
MaxQ index percentiles in random sequencesa.
| Methodb | MaxQ percentiles for five different spacing rangesc | ||||
|---|---|---|---|---|---|
| 40 bp | 70 bp | 100 bp | 150 bp | 200 bp | |
| AT | 3.07, 2.57, 1.80d | 3.15, 2.71, 1.99 | 3.18, 2.79, 2.10 | 3.26, 2.89, 2.23 | 3.32, 2.96, 2.31 |
| A2T2 | 2.98, 2.52, 1.80 | 3.08, 2.66, 1.98 | 3.17, 2.77, 2.09 | 3.26, 2.90, 2.23 | 3.36, 2.98, 2.32 |
| A3T3 | 2.89, 2.50, 1.80 | 3.05, 2.65, 1.99 | 3.17, 2.77, 2.11 | 3.28, 2.90, 2.24 | 3.35, 2.99, 2.33 |
| A4T4 | 2.90, 2.45, 1.79 | 3.03, 2.64, 1.99 | 3.15, 2.76, 2.11 | 3.27, 2.91, 2.24 | 3.39, 3.01, 2.34 |
| A5T5 | 2.81, 2.42, 1.77 | 2.96, 2.60, 1.96 | 3.11, 2.73, 2.09 | 3.21, 2.88, 2.23 | 3.33, 2.98, 2.33 |
| AT2 | 2.97, 2.50, 1.79 | 3.08, 2.66, 1.98 | 3.16, 2.76, 2.10 | 3.24, 2.88, 2.23 | 3.32, 2.96, 2.31 |
| AT3 | 2.94, 2.48, 1.80 | 3.07, 2.66, 1.98 | 3.15, 2.77, 2.11 | 3.28, 2.90, 2.24 | 3.37, 3.00, 2.33 |
| AT4 | 2.88, 2.47, 1.79 | 3.05, 2.65, 1.99 | 3.17, 2.77, 2.12 | 3.27, 2.91, 2.25 | 3.39, 3.01, 2.35 |
| AT5 | 2.90, 2.45, 1.79 | 3.01, 2.62, 1.99 | 3.14, 2.76, 2.11 | 3.27, 2.92, 2.26 | 3.40, 3.02, 2.35 |
| AT6 | 2.78, 2.40, 1.75 | 2.96, 2.59, 1.94 | 3.08, 2.73, 2.07 | 3.24, 2.88, 2.21 | 3.35, 2.98, 2.31 |
a MaxQ measures the highest periodic signal intensity detected over the range or periods 5-20 bp. The table shows the 99th, 95th, and 50th percentile, for each combination of parameters. See text for details.
b Definition of A-tracts: "AT", single nucleotides A or T; "A2T2" dinucleotides AA or TT; "AT2", dinucleotides AA, AT, or TT; "A3T3" trinucleotides AAA or TTT; "AT3", trinucleotides AAA, AAT, ATT, or TTT; etc.
c The spacing range refers to the difference between parameters smax-smin. The simulations were performed for spacing range values 40, 70, 100, 150, and 200 bp. See text for details.
d The 99th, 95th, and 50th percentiles, respectively, of the MaxQ values in 20,500 random sequences are shown.
Figure 2Periodicity scan of the . a) The main periodicity scan plot. The level of grey signifies the intensity of the periodic signal for the chromosomal location shown on the horizontal axis and the period shown on the vertical axis. The periodicity was evaluated in a 10 kb window was shifted by 5 kb at a time. The white areas correspond to the relative signal intensity Q*(P)≤1.8 whereas black shading indicates signal intensity Q*(P)≥4.0. The level of gray continuously changes from white to black between the values 1.8 and 4.0. b) The fraction of windows with the maximum signal at the period indicated by the abscissa plus or minus 0.2 bp, regardless of the height of the maximum. c) The fraction of windows with the signal intensity for the given period Q*(P)≥2.0 (cyan), ≥2.5 (magenta), ≥3.0 (blue), ≥4.0 (green), and ≥6.0 (red). See text for details. The definitions of indices MaxMax, PMaxMax, Max2, and PMax2 are demonstrated in panels b and c. The indices Max3 and PMax3 are analogous to Max2 and PMax2 but derived from the blue section of the plot.
Figure 3Periodicity scan of the . See legend to Figure 2.
H.influenzae genes located in regions with a strong sequence periodicity.
| Locus tag | Start | End | Strand | Product |
|---|---|---|---|---|
| HI0417 | 439370 | 440050 | + | thiamine-phosphate pyrophosphorylase ThiE |
| HI0418 | 439995 | 441338 | + | transport protein |
| HI0419 | 441507 | 442889 | + | protease |
| HI0420 | 443031 | 443330 | + | hypothetical protein |
| HI0422 | 444029 | 445348 | - | ATP-dependent RNA helicase SrmB |
| HI0423 | 445394 | 446116 | + | hypothetical protein |
| HI0424 | 446149 | 447204 | - | rRNA methylase |
| HI0425 | 447351 | 448718 | + | phosphatidylserine synthase PssA |
| HI0426 | 448763 | 449488 | - | fatty acid metabolism regulator FadR |
| HI0427 | 449613 | 451157 | + | sodium/proton antiporter NhaB |
| HI0736 | 789998 | 791524 | - | sodium-dependent transporter |
| HI0737 | 791772 | 792569 | + | acetohydroxy acid synthase II |
| HI0738 | 792641 | 794479 | + | dihydroxy-acid dehydratase IlvD |
| HI0738.1 | 794559 | 796100 | + | threonine dehydratase IlvA |
| HI0739 | 796139 | 799618 | - | DNA polymerase III subunit alpha DnaE |
| HI0740 | 799857 | 801509 | + | Phosphomannomutase YhxB |
| HI1262 | 1339751 | 1340431 | - | SanA |
| HI1263 | 1340589 | 1341665 | + | homoserine O-acetyltransferase MetX |
| HI1264 | 1341719 | 1344361 | - | DNA gyrase subunit A GyrA |
| HI1265 | 1344944 | 1346707 | - | hypothetical protein |
| HI1266 | 1346844 | 1347230 | - | hypothetical protein |
| HI1268 | 1347455 | 1347634 | + | hypothetical protein |
| HI1269 | 1347628 | 1347744 | + | hypothetical protein |
| HI1272 | 1348468 | 1349259 | + | ABC transporter ATP-binding protein |
| HI1273 | 1349256 | 1350062 | + | hypothetical protein |
Figure 4Periodicity scan of the . See legend to Figure 2.
M. jannaschii genes located in the region with 11 bp periodicity.
| Locus tag | Start | End | Strand | Product | Species with top three |
|---|---|---|---|---|---|
| MJ0782 | 703765 | 705786 | - | transcription initiation factor IIB | |
| MJ0782.1 | 705793 | 706038 | - | H/ACA RNA-protein complex component Gar1 | |
| MJ0783 | 706179 | 706739 | + | hypothetical protein | |
| MJ0784 | 707015 | 708091 | + | H(2)-dependent methylenetetrahydro-methanopterin dehydrogenase | |
| MJ0785 | 708313 | 709440 | + | biotin synthase | |
| MJ0785.1 | 709430 | 709999 | + | hypothetical protein | |
| MJ0786 | 710062 | 710622 | + | hypothetical protein | |
| MJ0787 | 710772 | 712286 | + | hypothetical protein | |
| MJ0788 | 712302 | 712541 | + | hypothetical protein | |
| MJ0789 | 712624 | 712974 | + | hypothetical protein | |
| MJ0790 | 713009 | 713698 | + | NADH dehydrogenase subunit 1 | |
| MJ0791 | 713720 | 715174 | - | argininosuccinate lyase | |
a Excluding the order Methanococcales. Only one hit per species is reported (excluding hits to multiple strains). Hits to eubacterial proteins are labeled by an asterisk and hits to eukaryotes or organelles by a "+". The blastp program implementation at the NCBI web site http://blast.ncbi.nlm.nih.gov with default parameters was used to find the top hits. Fewer than three hits are shown when less than three significant hits were detected.
Figure 5Periodicity scan of the . See legend to Figure 2.
Figure 6Periodicity scan of the . See legend to Figure 2.