| Literature DB >> 19379528 |
Zhipeng Cai1, Hadi Sabaa, Yining Wang, Randy Goebel, Zhiquan Wang, Jiaofen Xu, Paul Stothard, Guohui Lin.
Abstract
BACKGROUND: The "common disease--common variant" hypothesis and genome-wide association studies have achieved numerous successes in the last three years, particularly in genetic mapping in human diseases. Nevertheless, the power of the association study methods are still low, in particular on quantitative traits, and the description of the full allelic spectrum is deemed still far from reach. Given increasing density of single nucleotide polymorphisms available and suggested by the block-like structure of the human genome, a popular and prosperous strategy is to use haplotypes to try to capture the correlation structure of SNPs in regions of little recombination. The key to the success of this strategy is thus the ability to unambiguously determine the haplotype allele sharing status among the members. The association studies based on haplotype sharing status would have significantly reduced degrees of freedom and be able to capture the combined effects of tightly linked causal variants.Entities:
Mesh:
Year: 2009 PMID: 19379528 PMCID: PMC2691739 DOI: 10.1186/1471-2105-10-115
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
10 K breakpoint recovery results
| xPedPhase | Block-Extension | |||||
| Pedigree | Precision | Recall | Precision | Recall | Precision | Recall |
| 2-2 | 0.994 | 0.936 | 0.971 | 0.952 | 0.253 | 1.000 |
| 2–3 | 0.982 | 0.965 | 0.964 | 0.966 | 0.326 | 0.999 |
| 2-3-1 | 0.985 | 0.965 | 0.961 | 0.972 | 0.214 | 0.999 |
| 2-3-2 | 0.989 | 0.962 | 0.955 | 0.972 | 0.151 | 0.995 |
| 2-3-3 | 0.972 | 0.968 | 0.935 | 0.976 | 0.177 | 0.996 |
| 2-3-5 | 0.977 | 0.971 | 0.872 | 0.989 | 0.160 | 0.997 |
| 2-4-3 | 0.984 | 0.969 | 0.924 | 0.978 | 0.203 | 0.999 |
| 2-5-4 | 0.989 | 0.949 | 0.882 | 0.976 | 0.231 | 0.999 |
| 2-5-5 | 0.991 | 0.970 | 0.846 | 0.989 | 0.204 | 0.998 |
| 2-6-5 | 0.986 | 0.956 | 0.867 | 0.984 | 0.212 | 0.999 |
| Average | 0.984 | 0.964 | 0.912 | 0.978 | 0.213 | 0.998 |
Breakpoint recovery results: Average breakpoint recovery precision and recall by i Linker, xPedPhase, and Block-Extension on 10 K genotype datasets.
50 K breakpoint recovery results
| xPedPhase | ||||
| Pedigree | Precision | Recall | Precision | Recall |
| 2-2 | 1.000 | 0.967 | - | - |
| 2–3 | 0.994 | 0.969 | - | - |
| 2-3-1 | 1.000 | 0.971 | 0.977 | 0.978 |
| 2-3-2 | 1.000 | 0.976 | 0.986 | 0.981 |
| 2-3-3 | 0.991 | 0.981 | 0.969 | 0.988 |
| 2-3-5 | 0.993 | 0.973 | 0.950 | 0.987 |
| 2-4-3 | 0.992 | 0.976 | 0.966 | 0.981 |
| 2-5-4 | 0.996 | 0.966 | 0.932 | 0.985 |
| 2-5-5 | 0.996 | 0.965 | 0.937 | 0.982 |
| 2-6-5 | 0.997 | 0.972 | 0.942 | 0.983 |
| Average | 0.996 | 0.972 | 0.957 | 0.983 |
Breakpoint recovery results: Average breakpoint recovery precision and recall by i Linker and xPedPhase on 50 K genotype datasets.
Figure 1Scatter plot of the starting SNP sites of shared regions: simulated vs. discovered by .
Figure 2Scatter plot of the starting SNP sites of shared regions: simulated vs. discovered by xPedPhase on 500 simulated 10 K genotype datasets.
Figure 3Scatter plot of the ending SNP sites of shared regions: simulated vs. discovered by .
Figure 4Scatter plot of the ending SNP sites of shared regions: simulated vs. discovered by xPedPhase on 500 simulated 10 K genotype datasets.
10 K breakpoint recovery results at the presence of missing data
| 0.5% | 1% | 1.5% | 2% | 2.5% | 3% | |||||||
| Pedigree | Precision | Recall | Precision | Recall | Precision | Recall | Precision | Recall | Precision | Recall | Precision | Recall |
| 2-2 | 0.998 | 0.968 | 0.998 | 0.926 | 0.997 | 0.941 | 0.998 | 0.937 | 0.987 | 0.946 | 1.000 | 0.944 |
| 2–3 | 0.983 | 0.969 | 0.997 | 0.968 | 0.989 | 0.970 | 0.998 | 0.967 | 0.993 | 0.967 | 0.989 | 0.948 |
| 2-3-1 | 0.994 | 0.973 | 0.994 | 0.959 | 0.986 | 0.959 | 0.986 | 0.947 | 0.988 | 0.952 | 0.992 | 0.968 |
| 2-3-2 | 0.981 | 0.968 | 0.981 | 0.971 | 0.984 | 0.947 | 0.984 | 0.947 | 0.985 | 0.950 | 0.985 | 0.945 |
| 2-3-3 | 0.989 | 0.974 | 0.967 | 0.967 | 0.990 | 0.962 | 0.962 | 0.962 | 0.969 | 0.950 | 0.983 | 0.958 |
| 2-3-5 | 0.974 | 0.972 | 0.966 | 0.966 | 0.964 | 0.960 | 0.960 | 0.961 | 0.970 | 0.954 | 0.959 | 0.951 |
| 2-4-3 | 0.984 | 0.964 | 0.985 | 0.960 | 0.985 | 0.953 | 0.986 | 0.964 | 0.978 | 0.950 | 0.99 | 0.956 |
| 2-5-4 | 0.980 | 0.952 | 0.984 | 0.953 | 0.979 | 0.950 | 0.984 | 0.940 | 0.983 | 0.939 | 0.980 | 0.932 |
| 2-5-5 | 0.980 | 0.961 | 0.971 | 0.958 | 0.980 | 0.947 | 0.983 | 0.947 | 0.982 | 0.935 | 0.973 | 0.930 |
| 2-6-5 | 0.985 | 0.952 | 0.987 | 0.952 | 0.984 | 0.939 | 0.983 | 0.930 | 0.982 | 0.922 | 0.984 | 0.911 |
| Average | 0.985 | 0.965 | 0.983 | 0.958 | 0.984 | 0.950 | 0.983 | 0.950 | 0.982 | 0.947 | 0.984 | 0.944 |
Breakpoint recovery results: Average breakpoint recovery precision and recall by i Linker at the presence of 0.5–3% missing genotype data.
50 K breakpoint recovery results at the presence of missing data
| 0.5% | 1% | 1.5% | 2% | 2.5% | 3% | |||||||
| Pedigree | Precision | Recall | Precision | Recall | Precision | Recall | Precision | Recall | Precision | Recall | Precision | Recall |
| 2-2 | 0.991 | 0.974 | 1.000 | 0.972 | 1.000 | 0.972 | 1.000 | 0.966 | 0.996 | 0.964 | 1.000 | 0.964 |
| 2–3 | 0.999 | 0.982 | 0.999 | 0.975 | 0.996 | 0.973 | 0.991 | 0.976 | 0.999 | 0.979 | 0.998 | 0.971 |
| 2-3-1 | 0.991 | 0.975 | 0.999 | 0.975 | 0.996 | 0.972 | 0.990 | 0.976 | 0.999 | 0.979 | 0.998 | 0.971 |
| 2-3-2 | 0.992 | 0.972 | 0.990 | 0.977 | 0.995 | 0.973 | 0.998 | 0.960 | 0.994 | 0.980 | 0.995 | 0.972 |
| 2-3-3 | 0.996 | 0.977 | 0.992 | 0.971 | 0.996 | 0.968 | 0.993 | 0.969 | 0.994 | 0.973 | 0.994 | 0.968 |
| 2-3-5 | 0.988 | 0.973 | 0.988 | 0.965 | 0.990 | 0.965 | 0.986 | 0.969 | 0.987 | 0.973 | 0.982 | 0.967 |
| 2-4-3 | 0.996 | 0.981 | 0.994 | 0.974 | 0.997 | 0.979 | 0.995 | 0.969 | 0.991 | 0.969 | 0.991 | 0.969 |
| 2-5-4 | 0.997 | 0.974 | 0.996 | 0.969 | 0.980 | 0.971 | 0.997 | 0.966 | 0.998 | 0.968 | 0.996 | 0.954 |
| 2-5-5 | 0.989 | 0.977 | 0.994 | 0.969 | 0.996 | 0.972 | 0.996 | 0.968 | 0.984 | 0.967 | 0.992 | 0.969 |
| 2-6-5 | 0.998 | 0.976 | 0.995 | 0.966 | 0.997 | 0.966 | 0.978 | 0.962 | 0.982 | 0.953 | 0.992 | 0.953 |
| Average | 0.994 | 0.976 | 0.995 | 0.971 | 0.994 | 0.971 | 0.992 | 0.968 | 0.992 | 0.970 | 0.994 | 0.966 |
Breakpoint recovery results: Average breakpoint recovery precision and recall by i Linker at the presence of 0.5–3% missing genotype data.