Compensation for nucleotide bias in a genome by representation as a discrete channel with noise.

Mark Schreiber, Chris Brown.

Abstract

MOTIVATION: Calculating the information content of motifs in genomes with a highly biased nucleotide composition is likely to overestimate the amount of useful information in the motif. Calculating relative information can compensate for such biases; however, the resulting information content is the amount seen by an observer, not by a macromolecule binding to the motif. The latter is needed to calculate the discriminatory power of the motif and to compare motifs between species.
RESULTS: By treating a biased genome as a discrete channel with noise, in accordance with Shannon Information Theory, we were able to remove both 'Distortion' and 'Noise' from the motif and recover a more instructive biological 'signal'. A Java application, LogoPaint, was developed to remove nucleotide bias distortion and triplet frequency noise from motifs, calculate information content and present the motif as a logo. We demonstrate how this technique can 'unmask' motifs in the translation initiation regions of bacteria that are obscured by strong sequence biases.
AVAILABILITY: LogoPaint is available to all users from the authors as an executable JAR file. Source code is available by arrangement.
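
The contrast the abstract draws between plain information content and relative information can be illustrated with a minimal Python sketch. This is not LogoPaint's channel-based correction, which the abstract does not specify in detail; it only shows the two standard per-position quantities: information content against a uniform background and relative information (Kullback-Leibler divergence) against a biased background. The function names and the AT-rich background frequencies are assumptions made up for illustration.

```python
import math

BASES = "ACGT"


def column_frequencies(sequences, position):
    """Relative frequency of each base at one aligned motif position."""
    counts = {b: 0 for b in BASES}
    for seq in sequences:
        counts[seq[position]] += 1
    total = len(sequences)
    return {b: counts[b] / total for b in BASES}


def information_content(freqs):
    """Bits of information assuming a uniform background: log2(4) - H."""
    entropy = -sum(p * math.log2(p) for p in freqs.values() if p > 0)
    return math.log2(len(BASES)) - entropy


def relative_information(freqs, background):
    """Bits of information relative to a (possibly biased) background."""
    return sum(
        p * math.log2(p / background[b]) for b, p in freqs.items() if p > 0
    )


# Hypothetical AT-rich genome composition (illustrative values only).
background = {"A": 0.4, "C": 0.1, "G": 0.1, "T": 0.4}

# A column that merely mirrors the genome composition carries no signal,
# yet the uniform-background measure still reports a positive value.
print(information_content(background))
print(relative_information(background, background))

# A fully conserved A in a toy alignment scores lower once the bias is considered.
all_a = column_frequencies(["ATG", "ATG", "ATG", "ATG"], 0)
print(information_content(all_a), relative_information(all_a, background))
```

In this sketch a column that simply reflects the genome-wide composition scores above zero under the uniform assumption but exactly zero under the relative measure, which is the kind of overestimate the MOTIVATION paragraph describes.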

Year: 2002   PMID: 12016048   DOI: 10.1093/bioinformatics/18.4.507

Source DB: PubMed   Journal: Bioinformatics   ISSN: 1367-4803   Impact factor: 6.937


  8 in total

1.  PrediSi: prediction of signal peptides and their cleavage positions.

Authors:  Karsten Hiller; Andreas Grote; Maurice Scheer; Richard Münch; Dieter Jahn
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

2.  RhlR expression in Pseudomonas aeruginosa is modulated by the Pseudomonas quinolone signal via PhoB-dependent and -independent pathways.

Authors:  Vanessa Jensen; Dagmar Löns; Caroline Zaoui; Florian Bredenbruch; Andree Meissner; Guido Dieterich; Richard Münch; Susanne Häussler
Journal:  J Bacteriol       Date:  2006-10-06       Impact factor: 3.490

3.  The Fnr regulon of Bacillus subtilis.

Authors:  Heike Reents; Richard Münch; Thorben Dammeyer; Dieter Jahn; Elisabeth Härtig
Journal:  J Bacteriol       Date:  2006-02       Impact factor: 3.490

4.  CodonLogo: a sequence logo-based viewer for codon patterns.

Authors:  Virag Sharma; David P Murphy; Gregory Provan; Pavel V Baranov
Journal:  Bioinformatics       Date:  2012-05-17       Impact factor: 6.937

5.  Recovering motifs from biased genomes: application of signal correction.

Authors:  Samiul Hasan; Mark Schreiber
Journal:  Nucleic Acids Res       Date:  2006-09-20       Impact factor: 16.971

6.  Error correction and diversity analysis of population mixtures determined by NGS.

Authors:  Graham R Wood; Nigel J Burroughs; David J Evans; Eugene V Ryabov
Journal:  PeerJ       Date:  2014-11-13       Impact factor: 2.984

7.  TISs-ST: a web server to evaluate polymorphic translation initiation sites and their reflections on the secretory targets.

Authors:  Renato Vicentini; Marcelo Menossi
Journal:  BMC Bioinformatics       Date:  2007-05-21       Impact factor: 3.169

8.  dagLogo: An R/Bioconductor package for identifying and visualizing differential amino acid group usage in proteomics data.

Authors:  Jianhong Ou; Haibo Liu; Niraj K Nirala; Alexey Stukalov; Usha Acharya; Michael R Green; Lihua Julie Zhu
Journal:  PLoS One       Date:  2020-11-06       Impact factor: 3.240
