Literature DB >> 21128856

Importance sampling of word patterns in DNA and protein sequences.

Hock Peng Chan1, Nancy Ruonan Zhang, Louis H Y Chen.   

Abstract

Monte Carlo methods can provide accurate p-value estimates of word counting test statistics and are easy to implement. They are especially attractive when an asymptotic theory is absent or when either the search sequence or the word pattern is too short for the application of asymptotic formulae. Naive direct Monte Carlo is undesirable for the estimation of small probabilities because the associated rare events of interest are seldom generated. We propose instead efficient importance sampling algorithms that use controlled insertion of the desired word patterns on randomly generated sequences. The implementation is illustrated on word patterns of biological interest: palindromes and inverted repeats, patterns arising from position-specific weight matrices (PSWMs), and co-occurrences of pairs of motifs.

Mesh:

Year:  2010        PMID: 21128856      PMCID: PMC3787731          DOI: 10.1089/cmb.2008.0233

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  11 in total

Review 1.  Probabilistic and statistical properties of words: an overview.

Authors:  G Reinert; S Schbath; M S Waterman
Journal:  J Comput Biol       Date:  2000 Feb-Apr       Impact factor: 1.479

2.  Occurrence probability of structured motifs in random sequences.

Authors:  S Robin; J-J Daudin; H Richard; M-F Sagot; S Schbath
Journal:  J Comput Biol       Date:  2002       Impact factor: 1.479

3.  CisModule: de novo discovery of cis-regulatory modules by hierarchical mixture modeling.

Authors:  Qing Zhou; Wing H Wong
Journal:  Proc Natl Acad Sci U S A       Date:  2004-08-05       Impact factor: 11.205

4.  Determination of local statistical significance of patterns in Markov sequences with application to promoter element identification.

Authors:  Haiyan Huang; Ming-Chih J Kao; Xianghong Zhou; Jun S Liu; Wing H Wong
Journal:  J Comput Biol       Date:  2004       Impact factor: 1.479

Review 5.  Nonrandom clusters of palindromes in herpesvirus genomes.

Authors:  Ming-Ying Leung; Kwok Pui Choi; Aihua Xia; Louis H Y Chen
Journal:  J Comput Biol       Date:  2005-04       Impact factor: 1.479

Review 6.  Statistical significance in biological sequence analysis.

Authors:  Alexander Yu Mitrophanov; Mark Borodovsky
Journal:  Brief Bioinform       Date:  2006-03       Impact factor: 11.622

7.  SCPD: a promoter database of the yeast Saccharomyces cerevisiae.

Authors:  J Zhu; M Q Zhang
Journal:  Bioinformatics       Date:  1999 Jul-Aug       Impact factor: 6.937

8.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.

Authors:  P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher
Journal:  Mol Biol Cell       Date:  1998-12       Impact factor: 4.138

9.  Compound poisson approximation of the number of occurrences of a position frequency matrix (PFM) on both strands.

Authors:  Utz J Pape; Sven Rahmann; Fengzhu Sun; Martin Vingron
Journal:  J Comput Biol       Date:  2008 Jul-Aug       Impact factor: 1.479

10.  Phylogenetically and spatially conserved word pairs associated with gene-expression changes in yeasts.

Authors:  Derek Y Chiang; Alan M Moses; Manolis Kellis; Eric S Lander; Michael B Eisen
Journal:  Genome Biol       Date:  2003-06-26       Impact factor: 13.583

View more
  4 in total

1.  atSNP: transcription factor binding affinity testing for regulatory SNP detection.

Authors:  Chandler Zuo; Sunyoung Shin; Sündüz Keleş
Journal:  Bioinformatics       Date:  2015-06-18       Impact factor: 6.937

2.  Family-based quantitative trait meta-analysis implicates rare noncoding variants in DENND1A in polycystic ovary syndrome.

Authors:  Matthew Dapas; Ryan Sisk; Richard S Legro; Margrit Urbanek; Andrea Dunaif; M Geoffrey Hayes
Journal:  J Clin Endocrinol Metab       Date:  2019-04-30       Impact factor: 5.958

3.  Functional Genetic Variation in the Anti-Müllerian Hormone Pathway in Women With Polycystic Ovary Syndrome.

Authors:  Lidija K Gorsic; Matthew Dapas; Richard S Legro; M Geoffrey Hayes; Margrit Urbanek
Journal:  J Clin Endocrinol Metab       Date:  2019-07-01       Impact factor: 5.958

4.  Genetic variants influence on the placenta regulatory landscape.

Authors:  Fabien Delahaye; Catherine Do; Yu Kong; Remi Ashkar; Martha Salas; Ben Tycko; Ronald Wapner; Francine Hughes
Journal:  PLoS Genet       Date:  2018-11-19       Impact factor: 5.917

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.