| Literature DB >> 19570807 |
Quan Long1, Daniel MacArthur, Zemin Ning, Chris Tyler-Smith.
Abstract
SUMMARY: We present a program to improve haplotype reconstruction by incorporating information from paired-end reads, and demonstrate its utility on simulated data. We find that given a fixed coverage, longer reads (implying fewer of them) are preferable. AVAILABILITY: The executable and user manual can be freely downloaded from ftp://ftp.sanger.ac.uk/pub/zn1/HI.Entities:
Mesh:
Year: 2009 PMID: 19570807 PMCID: PMC2735667 DOI: 10.1093/bioinformatics/btp412
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.If a pair of reads covers two heterozygous positions, the alleles carried (A and G in the example shown) must be on the same chromosome. Therefore we can phase the genotype (A/T, G/C) to (A, G) and (T, C).
Fig. 2.The y-axis shows the percentage of phasing errors eliminated by HI. The x-axis shows the values of the parameter indicated. In each plot, the performance value varied with the parameter of interest; while the other three parameters were marginalized. (The data for insert size longer than 3 kb are only used in the insert size plot.)