Accuracy assessment of diploid consensus sequences.

Jong Hyun Kim, Michael S Waterman, Lei M Li.

Abstract

If the origin of every fragment were known in a genome sequencing project, reconstructing diploid consensus sequences would be straightforward. In practice, the origins are not known. Although methods have been proposed to reconstruct haplotypes from genome sequencing projects, an accuracy assessment is still required to evaluate the confidence of the estimated diploid consensus sequences. In this paper, we define a confidence score for diploid consensus sequences. The score requires the likelihood of an assembly, and we propose an algorithm that calculates this likelihood in time linear in the number of polymorphic sites. The likelihood calculation and confidence score are then used to improve haplotype estimation in two ways. First, low-scoring phases are disconnected. Second, instead of assuming the nominal frequency of 1/2, the haplotype frequency is estimated to reflect the actual contribution of each haplotype. We evaluated the method on simulated data with a polymorphism rate of 1.2 percent, based on Ciona intestinalis. The algorithm was highly accurate: the true positive rate of the haplotype estimation was greater than 97 percent.
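
The abstract does not spell out the likelihood model, so the following is only an illustrative sketch, not the authors' algorithm. It assumes a simple two-component mixture: each fragment is read from one of two haplotypes, alleles at polymorphic sites are coded as '0' or '1', sites are independent, and an allele is misread with a fixed probability. The function names, the `error_rate` parameter, and the EM-style frequency update are all hypothetical choices made for illustration. The sketch shows how an assembly likelihood can be evaluated and how a haplotype frequency other than the nominal 1/2 might be estimated from the fragments.

```python
import math


def fragment_likelihood(fragment, haplotype, error_rate):
    """P(fragment | it was read from `haplotype`), assuming independent sites.

    `fragment` maps a polymorphic-site index to the observed allele ('0'/'1').
    """
    prob = 1.0
    for site, allele in fragment.items():
        prob *= (1.0 - error_rate) if haplotype[site] == allele else error_rate
    return prob


def assembly_log_likelihood(fragments, hap_a, hap_b, freq_a=0.5, error_rate=0.01):
    """Log-likelihood of all fragments when their origins are unknown.

    Each fragment comes from hap_a with probability freq_a, else from hap_b.
    The cost is linear in the total number of sites covered by the fragments.
    """
    total = 0.0
    for fragment in fragments:
        like_a = fragment_likelihood(fragment, hap_a, error_rate)
        like_b = fragment_likelihood(fragment, hap_b, error_rate)
        total += math.log(freq_a * like_a + (1.0 - freq_a) * like_b)
    return total


def estimate_frequency(fragments, hap_a, hap_b, error_rate=0.01, iterations=50):
    """EM-style estimate of the hap_a frequency (an assumed approach)."""
    freq_a = 0.5  # start from the nominal frequency
    for _ in range(iterations):
        responsibilities = []
        for fragment in fragments:
            w_a = freq_a * fragment_likelihood(fragment, hap_a, error_rate)
            w_b = (1.0 - freq_a) * fragment_likelihood(fragment, hap_b, error_rate)
            responsibilities.append(w_a / (w_a + w_b))
        freq_a = sum(responsibilities) / len(responsibilities)
    return freq_a


# Toy data: four polymorphic sites; three fragments match hap_a, one matches hap_b.
hap_a, hap_b = "0101", "1010"
fragments = [
    {0: "0", 1: "1"},
    {1: "1", 2: "0"},
    {2: "0", 3: "1"},
    {0: "1", 1: "0"},
]
freq = estimate_frequency(fragments, hap_a, hap_b)
ll_nominal = assembly_log_likelihood(fragments, hap_a, hap_b, freq_a=0.5)
ll_estimated = assembly_log_likelihood(fragments, hap_a, hap_b, freq_a=freq)
```

On this toy input the estimated frequency moves away from 1/2 toward the share of fragments that match the first haplotype, and the log-likelihood under the estimated frequency is higher than under the nominal one. How the paper's confidence score is derived from the likelihood is not stated in the abstract and is not modeled here.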

Year:  2007        PMID: 17277416     DOI: 10.1109/TCBB.2007.1007

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  3 in total

1.  Diploid genome reconstruction of Ciona intestinalis and comparative analysis with Ciona savignyi.

Authors:  Jong Hyun Kim; Michael S Waterman; Lei M Li
Journal:  Genome Res       Date:  2007-06-13       Impact factor: 9.043

2.  Statistical analysis strategies for association studies involving rare variants. (Review)

Authors:  Vikas Bansal; Ondrej Libiger; Ali Torkamani; Nicholas J Schork
Journal:  Nat Rev Genet       Date:  2010-10-13       Impact factor: 53.242

3.  HapEdit: an accuracy assessment viewer for haplotype assembly using massively parallel DNA-sequencing technologies.

Authors:  Jong Hyun Kim; Woo-Cheol Kim; Lei M Li; Sanghyun Park
Journal:  Nucleic Acids Res       Date:  2011-05-16       Impact factor: 16.971
