Literature DB >> 34403470

SpecHap: a diploid phasing algorithm based on spectral graph theory.

Yonghan Yu1, Lingxi Chen1, Xinyao Miao1, Shuai Cheng Li1.   

Abstract

Haplotype phasing plays an important role in understanding the genetic data of diploid eukaryotic organisms. Different sequencing technologies (such as next-generation sequencing or third-generation sequencing) produce various genetic data that require haplotype assembly. Although multiple diploid haplotype phasing algorithms exist, only a few will work equally well across all sequencing technologies. In this work, we propose SpecHap, a novel haplotype assembly tool that leverages spectral graph theory. On both in silico and whole-genome sequencing datasets, SpecHap consumed less memory and required less CPU time, yet achieved comparable accuracy with state-of-art methods across all the test instances, which comprises sequencing data from next-generation sequencing, linked-reads, high-throughput chromosome conformation capture, PacBio single-molecule real-time, and Oxford Nanopore long-reads. Furthermore, SpecHap successfully phased an individual Ambystoma mexicanum, a species with gigantic diploid genomes, within 6 CPU hours and 945MB peak memory usage, while other tools failed to yield results either due to memory overflow (40GB) or time limit exceeded (5 days). Our results demonstrated that SpecHap is scalable, efficient, and accurate for diploid phasing across many sequencing platforms.
© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2021        PMID: 34403470      PMCID: PMC8565328          DOI: 10.1093/nar/gkab709

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  41 in total

1.  A SPECTRAL GRAPH APPROACH TO DISCOVERING GENETIC ANCESTRY.

Authors:  Ann B Lee; Diana Luca; Kathryn Roeder
Journal:  Ann Appl Stat       Date:  2010       Impact factor: 2.083

2.  HapCUT: an efficient and accurate algorithm for the haplotype assembly problem.

Authors:  Vikas Bansal; Vineet Bafna
Journal:  Bioinformatics       Date:  2008-08-15       Impact factor: 6.937

3.  Allele-specific epigenome maps reveal sequence-dependent stochastic switching at regulatory loci.

Authors:  Vitor Onuchic; Eugene Lurie; Ivenise Carrero; Piotr Pawliczek; Ronak Y Patel; Joel Rozowsky; Timur Galeev; Zhuoyi Huang; Robert C Altshuler; Zhizhuo Zhang; R Alan Harris; Cristian Coarfa; Lillian Ashmore; Jessica W Bertol; Walid D Fakhouri; Fuli Yu; Manolis Kellis; Mark Gerstein; Aleksandar Milosavljevic
Journal:  Science       Date:  2018-08-23       Impact factor: 47.728

4.  Spectral identification of topological domains.

Authors:  Jie Chen; Alfred O Hero; Indika Rajapakse
Journal:  Bioinformatics       Date:  2016-05-05       Impact factor: 6.937

5.  The axolotl genome and the evolution of key tissue formation regulators.

Authors:  Sergej Nowoshilow; Siegfried Schloissnig; Ji-Feng Fei; Andreas Dahl; Andy W C Pang; Martin Pippel; Sylke Winkler; Alex R Hastie; George Young; Juliana G Roscito; Francisco Falcon; Dunja Knapp; Sean Powell; Alfredo Cruz; Han Cao; Bianca Habermann; Michael Hiller; Elly M Tanaka; Eugene W Myers
Journal:  Nature       Date:  2018-01-24       Impact factor: 49.962

6.  PBSIM: PacBio reads simulator--toward accurate genome assembly.

Authors:  Yukiteru Ono; Kiyoshi Asai; Michiaki Hamada
Journal:  Bioinformatics       Date:  2012-11-04       Impact factor: 6.937

7.  Multiple polymorphisms in the TNFAIP3 region are independently associated with systemic lupus erythematosus.

Authors:  Stacy L Musone; Kimberly E Taylor; Timothy T Lu; Joanne Nititham; Ricardo C Ferreira; Ward Ortmann; Nataliya Shifrin; Michelle A Petri; M Ilyas Kamboh; Susan Manzi; Michael F Seldin; Peter K Gregersen; Timothy W Behrens; Averil Ma; Pui-Yan Kwok; Lindsey A Criswell
Journal:  Nat Genet       Date:  2008-09       Impact factor: 38.330

8.  Long-read sequence and assembly of segmental duplications.

Authors:  Mitchell R Vollger; Philip C Dishuck; Melanie Sorensen; AnneMarie E Welch; Vy Dang; Max L Dougherty; Tina A Graves-Lindsay; Richard K Wilson; Mark J P Chaisson; Evan E Eichler
Journal:  Nat Methods       Date:  2018-12-17       Impact factor: 28.547

9.  An integrated map of structural variation in 2,504 human genomes.

Authors:  Peter H Sudmant; Tobias Rausch; Eugene J Gardner; Robert E Handsaker; Alexej Abyzov; John Huddleston; Yan Zhang; Kai Ye; Goo Jun; Markus Hsi-Yang Fritz; Miriam K Konkel; Ankit Malhotra; Adrian M Stütz; Xinghua Shi; Francesco Paolo Casale; Jieming Chen; Fereydoun Hormozdiari; Gargi Dayama; Ken Chen; Maika Malig; Mark J P Chaisson; Klaudia Walter; Sascha Meiers; Seva Kashin; Erik Garrison; Adam Auton; Hugo Y K Lam; Xinmeng Jasmine Mu; Can Alkan; Danny Antaki; Taejeong Bae; Eliza Cerveira; Peter Chines; Zechen Chong; Laura Clarke; Elif Dal; Li Ding; Sarah Emery; Xian Fan; Madhusudan Gujral; Fatma Kahveci; Jeffrey M Kidd; Yu Kong; Eric-Wubbo Lameijer; Shane McCarthy; Paul Flicek; Richard A Gibbs; Gabor Marth; Christopher E Mason; Androniki Menelaou; Donna M Muzny; Bradley J Nelson; Amina Noor; Nicholas F Parrish; Matthew Pendleton; Andrew Quitadamo; Benjamin Raeder; Eric E Schadt; Mallory Romanovitch; Andreas Schlattl; Robert Sebra; Andrey A Shabalin; Andreas Untergasser; Jerilyn A Walker; Min Wang; Fuli Yu; Chengsheng Zhang; Jing Zhang; Xiangqun Zheng-Bradley; Wanding Zhou; Thomas Zichner; Jonathan Sebat; Mark A Batzer; Steven A McCarroll; Ryan E Mills; Mark B Gerstein; Ali Bashir; Oliver Stegle; Scott E Devine; Charles Lee; Evan E Eichler; Jan O Korbel
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

10.  Longshot enables accurate variant calling in diploid genomes from single-molecule long read sequencing.

Authors:  Peter Edge; Vikas Bansal
Journal:  Nat Commun       Date:  2019-10-11       Impact factor: 14.919

View more
  1 in total

1.  Chromosome-Level Haplotype Assembly for Equus asinu.

Authors:  Xinyao Miao; Yonghan Yu; Zicheng Zhao; Yinan Wang; Xiaobo Qian; Yonghui Wang; Shengbin Li; Changfa Wang
Journal:  Front Genet       Date:  2022-05-27       Impact factor: 4.772

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.