Literature DB >> 25461772

Fast and accurate mapping of Complete Genomics reads.

Donghyuk Lee1, Farhad Hormozdiari2, Hongyi Xin1, Faraz Hach3, Onur Mutlu4, Can Alkan5.   

Abstract

Many recent advances in genomics and the expectations of personalized medicine are made possible thanks to power of high throughput sequencing (HTS) in sequencing large collections of human genomes. There are tens of different sequencing technologies currently available, and each HTS platform have different strengths and biases. This diversity both makes it possible to use different technologies to correct for shortcomings; but also requires to develop different algorithms for each platform due to the differences in data types and error models. The first problem to tackle in analyzing HTS data for resequencing applications is the read mapping stage, where many tools have been developed for the most popular HTS methods, but publicly available and open source aligners are still lacking for the Complete Genomics (CG) platform. Unfortunately, Burrows-Wheeler based methods are not practical for CG data due to the gapped nature of the reads generated by this method. Here we provide a sensitive read mapper (sirFAST) for the CG technology based on the seed-and-extend paradigm that can quickly map CG reads to a reference genome. We evaluate the performance and accuracy of sirFAST using both simulated and publicly available real data sets, showing high precision and recall rates.
Copyright © 2014 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Complete Genomics; Gapped reads; High throughput sequencing; Read mapping

Mesh:

Year:  2014        PMID: 25461772      PMCID: PMC4406782          DOI: 10.1016/j.ymeth.2014.10.012

Source DB:  PubMed          Journal:  Methods        ISSN: 1046-2023            Impact factor:   3.608


  24 in total

1.  PatternHunter: faster and more sensitive homology search.

Authors:  Bin Ma; John Tromp; Ming Li
Journal:  Bioinformatics       Date:  2002-03       Impact factor: 6.937

2.  Computational techniques for human genome resequencing using mated gapped reads.

Authors:  Paolo Carnevali; Jonathan Baccash; Aaron L Halpern; Igor Nazarenko; Geoffrey B Nilsen; Krishna P Pant; Jessica C Ebert; Anushka Brownley; Matt Morenzoni; Vitali Karpinchyk; Bruce Martin; Dennis G Ballinger; Radoje Drmanac
Journal:  J Comput Biol       Date:  2011-12-16       Impact factor: 1.479

3.  The complete genome of an individual by massively parallel DNA sequencing.

Authors:  David A Wheeler; Maithreyan Srinivasan; Michael Egholm; Yufeng Shen; Lei Chen; Amy McGuire; Wen He; Yi-Ju Chen; Vinod Makhijani; G Thomas Roth; Xavier Gomes; Karrie Tartaro; Faheem Niazi; Cynthia L Turcotte; Gerard P Irzyk; James R Lupski; Craig Chinault; Xing-zhi Song; Yue Liu; Ye Yuan; Lynne Nazareth; Xiang Qin; Donna M Muzny; Marcel Margulies; George M Weinstock; Richard A Gibbs; Jonathan M Rothberg
Journal:  Nature       Date:  2008-04-17       Impact factor: 49.962

Review 4.  Sequencing technologies - the next generation.

Authors:  Michael L Metzker
Journal:  Nat Rev Genet       Date:  2009-12-08       Impact factor: 53.242

5.  Tools for mapping high-throughput sequencing data.

Authors:  Nuno A Fonseca; Johan Rung; Alvis Brazma; John C Marioni
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

Review 6.  Genotype and SNP calling from next-generation sequencing data.

Authors:  Rasmus Nielsen; Joshua S Paul; Anders Albrechtsen; Yun S Song
Journal:  Nat Rev Genet       Date:  2011-06       Impact factor: 53.242

7.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

8.  Real-time DNA sequencing from single polymerase molecules.

Authors:  John Eid; Adrian Fehr; Jeremy Gray; Khai Luong; John Lyle; Geoff Otto; Paul Peluso; David Rank; Primo Baybayan; Brad Bettman; Arkadiusz Bibillo; Keith Bjornson; Bidhan Chaudhuri; Frederick Christians; Ronald Cicero; Sonya Clark; Ravindra Dalal; Alex Dewinter; John Dixon; Mathieu Foquet; Alfred Gaertner; Paul Hardenbol; Cheryl Heiner; Kevin Hester; David Holden; Gregory Kearns; Xiangxu Kong; Ronald Kuse; Yves Lacroix; Steven Lin; Paul Lundquist; Congcong Ma; Patrick Marks; Mark Maxham; Devon Murphy; Insil Park; Thang Pham; Michael Phillips; Joy Roy; Robert Sebra; Gene Shen; Jon Sorenson; Austin Tomaney; Kevin Travers; Mark Trulson; John Vieceli; Jeffrey Wegener; Dawn Wu; Alicia Yang; Denis Zaccarin; Peter Zhao; Frank Zhong; Jonas Korlach; Stephen Turner
Journal:  Science       Date:  2008-11-20       Impact factor: 47.728

9.  Personalized copy number and segmental duplication maps using next-generation sequencing.

Authors:  Can Alkan; Jeffrey M Kidd; Tomas Marques-Bonet; Gozde Aksay; Francesca Antonacci; Fereydoun Hormozdiari; Jacob O Kitzman; Carl Baker; Maika Malig; Onur Mutlu; S Cenk Sahinalp; Richard A Gibbs; Evan E Eichler
Journal:  Nat Genet       Date:  2009-08-30       Impact factor: 38.330

10.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

View more
  1 in total

Review 1.  The impact of single-cell genomics on the field of mycobacterial infection.

Authors:  Inês Geraldes; Mónica Fernandes; Alexandra G Fraga; Nuno S Osório
Journal:  Front Microbiol       Date:  2022-09-30       Impact factor: 6.064

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.