| Literature DB >> 18466467 |
Hsuan Jung1, Keyan Zhao, Paul Marjoram.
Abstract
Given the increasing size of modern genetic data sets and, in particular, the move towards genome-wide studies, there is merit in considering analyses that gain computational efficiency by being more heuristic in nature. With this in mind, we present results of cladistic analyses methods on the Genetic Analysis Workshop 15 Problem 3 simulated data (answers known). Our analysis attempts to capture similarities between individuals using a series of trees, and then looks for regions in which mutations on those trees can successfully explain a phenotype of interest. Existing varieties of such algorithms assume haplotypes are known, or have been inferred, an assumption that is often unrealistic for genome-wide data. We therefore present an extension of these methods that can successfully analyze genotype, rather than haplotype, data.Entities:
Year: 2007 PMID: 18466467 PMCID: PMC2367549 DOI: 10.1186/1753-6561-1-s1-s125
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Results of analysis of chromosomes 6, 11, 18, and 3 (left to right). The x-axis represents position along the chromosome (for convenience, markers are plotted as if equally spaced). The y-axis gives the -log p-value for association at each locus on that chromosome. The trait locus position marked with red line.
Summary results across all 100 replicates for four chromosomes
| Analysis | Range of log( | Mean distance from true locus (kb)a |
| Chromosome 6 | [-34.8, -15.3] | 19 |
| Chromosome 11 | [-17.81, -9.75] | 31 |
| Chromosome 18 | [-5.69, -3.47] | 5550 |
| Chromosome 3 | [-3.97, -3.34] | NA |
aDistance between the functional locus and the locus corresponding to the smallest p-value.