| Literature DB >> 12603053 |
M Koivisto1, M Perola, T Varilo, W Hennah, J Ekelund, M Lukk, L Peltonen, E Ukkonen, H Mannila.
Abstract
We describe a new method for finding haplotype blocks based on the use of the minimum description length principle. We give a rigorous definition of the quality of a segmentation of a genomic region into blocks, and describe a dynamic programming algorithm for finding the optimal segmentation with respect to this measure. We also describe a method for finding the probability of a block boundary for each pair of adjacent markers: this gives a tool for evaluating the significance of each block boundary. We have applied the method to the published data of Daly et al. The results are in relatively good agreement with the published results, but also show clear differences in the predicted block boundaries and their strengths. We also give results on the block structure in population isolates.Mesh:
Substances:
Year: 2003 PMID: 12603053 DOI: 10.1142/9789812776303_0047
Source DB: PubMed Journal: Pac Symp Biocomput ISSN: 2335-6928