Abstract
MOTIVATION: A few algorithms have been developed for splitting the genome into nearly independent blocks of linkage disequilibrium. Due to the complexity of this problem, these algorithms rely on heuristics, which makes them sub-optimal.
Year: 2021 PMID: 34260708 PMCID: PMC8696101 DOI: 10.1093/bioinformatics/btab519
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1. Illustration of subproblems solved by the algorithm using a small LD matrix. The cost of separating the region starting at variant i into exactly k blocks, C(i, k), is broken down into two parts: the error E(i, j), which is the sum of all squared correlations between variants from block (i, j) and variants from all the later blocks, and the cost of separating the rest, starting at j + 1 and using k − 1 blocks, C(j + 1, k − 1). The variant j at which the split occurs is chosen so that the total cost is minimized. The optimal split is highlighted in red here.
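
The recurrence described in the caption, C(i, k) = min over j of [E(i, j) + C(j + 1, k − 1)], can be sketched as a small dynamic program. The following is a minimal illustration written from the caption alone, not the authors' implementation: the function names `split_cost` and `optimal_split` are hypothetical, the base case (a single final block costs 0 because no later variants remain) is an assumption that follows from the definition of E, and the sketch ignores any practical constraints (for instance on block size) and makes no attempt at efficiency.

```python
import numpy as np


def split_cost(r2, i, j):
    """E(i, j): sum of squared correlations between variants i..j
    and all later variants (j+1 .. n-1)."""
    return float(r2[i:j + 1, j + 1:].sum())


def optimal_split(corr, n_blocks):
    """Split variants 0..n-1 into exactly n_blocks contiguous blocks,
    minimizing the sum of squared correlations between different blocks.

    Returns (total_cost, block_ends), where block_ends[b] is the index of
    the last variant in block b.
    """
    r2 = np.asarray(corr, dtype=float) ** 2
    n = r2.shape[0]
    if not 1 <= n_blocks <= n:
        raise ValueError("n_blocks must be between 1 and the number of variants")

    # cost[k, i]: minimal cost of splitting variants i..n-1 into exactly k blocks
    cost = np.full((n_blocks + 1, n + 1), np.inf)
    # choice[k, i]: last variant j of the first block in that optimal split
    choice = np.zeros((n_blocks + 1, n + 1), dtype=int)

    # Assumed base case: one block covering i..n-1 has no later variants, so cost 0.
    cost[1, :n] = 0.0
    choice[1, :n] = n - 1

    for k in range(2, n_blocks + 1):
        for i in range(n - k, -1, -1):        # at least k variants must remain
            for j in range(i, n - k + 1):     # leave at least k-1 variants after j
                c = split_cost(r2, i, j) + cost[k - 1, j + 1]
                if c < cost[k, i]:
                    cost[k, i] = c
                    choice[k, i] = j

    # Backtrack through the stored choices to recover the block boundaries.
    ends, i = [], 0
    for k in range(n_blocks, 0, -1):
        j = int(choice[k, i])
        ends.append(j)
        i = j + 1
    return float(cost[n_blocks, 0]), ends


if __name__ == "__main__":
    # Toy LD matrix: variants 0-2 are correlated, variants 3-4 are correlated,
    # and the two groups are uncorrelated with each other.
    corr = np.eye(5)
    corr[:3, :3] = 0.9
    corr[3:, 3:] = 0.9
    np.fill_diagonal(corr, 1.0)
    total, ends = optimal_split(corr, 2)
    print(total, ends)  # 0.0 [2, 4]
```

On this toy matrix the only zero-cost split into two blocks places the boundary after variant 2, which matches the visible block structure. Recomputing E(i, j) from scratch at every step keeps the code close to the caption; a practical version would precompute these sums.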