Literature DB >> 11414565

Nonoverlapping clusters: approximate distribution and application to molecular biology.

X Su1, S Wallenstein, D Bishop.   

Abstract

An approach is developed for the screening of genomic sequence data to identify gene regulatory regions. This approach is based on deciding if putative transcription factor binding sites are clustered together to a greater extent than one would expect by chance. Given n events occurring on an interval of width L (L base pairs), an r:w cluster is defined as r + 1 consecutive events all contained within a window of length wL. Accurate and easily computable approximations are derived for the distribution of the number of nonoverlapping r:w clusters under the model that the positions of the n events have a uniform distribution. Simulations demonstrate that these approximations have greater accuracy than existing methods. The approximation is applied to detect erythroid-specific regulatory regions in genomic DNA sequences, first in an artificial case where r is specified a priori and then as part of an exploratory approach.

Mesh:

Substances:

Year:  2001        PMID: 11414565     DOI: 10.1111/j.0006-341x.2001.00420.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  2 in total

1.  Homotypic regulatory clusters in Drosophila.

Authors:  Alexander P Lifanov; Vsevolod J Makeev; Anna G Nazina; Dmitri A Papatsenko
Journal:  Genome Res       Date:  2003-04       Impact factor: 9.043

2.  A Fast Implementation of a Scan Statistic for Identifying Chromosomal Patterns of Genome Wide Association Studies.

Authors:  Yan V Sun; Douglas M Jacobsen; Stephen T Turner; Eric Boerwinkle; Sharon L R Kardia
Journal:  Comput Stat Data Anal       Date:  2009-03-15       Impact factor: 1.681

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.