Consensus generation and variant detection by Celera Assembler.

Gennady Denisov, Brian Walenz, Aaron L Halpern, Jason Miller, Nelson Axelrod, Samuel Levy, Granger Sutton.

Abstract

MOTIVATION: We present an algorithm to identify allelic variation given a Whole Genome Shotgun (WGS) assembly of haploid sequences, and to produce a set of haploid consensus sequences rather than a single consensus sequence. Existing WGS assemblers take a column-by-column approach to consensus generation, and produce a single consensus sequence which can be inconsistent with the underlying haploid alleles, and inconsistent with any of the aligned sequence reads. Our new algorithm uses a dynamic windowing approach. It detects alleles by simultaneously processing the portions of aligned reads spanning a region of sequence variation, assigns reads to their respective alleles, phases adjacent variant alleles and generates a consensus sequence corresponding to each confirmed allele. This algorithm was used to produce the first diploid genome sequence of an individual human. It can also be applied to assemblies of multiple diploid individuals and hybrid assemblies of multiple haploid organisms.
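The contrast between column-by-column consensus and window-based allele detection can be illustrated with a toy sketch. This is not the Celera Assembler implementation: the function names, the "." marker for uncovered positions, the `min_support` threshold, and the example reads are all illustrative assumptions, and the sketch ignores base quality, gaps, and phasing across windows. It only shows how a per-column majority vote can yield a mosaic sequence that matches no read, whereas grouping the reads that span the whole window recovers one consensus per allele.

```python
from collections import Counter

UNCOVERED = "."  # hypothetical marker: the read does not cover this column

# Toy alignment over one 3-column variant window (invented data).
READS = [
    "ACG",  # spans the window, allele 1
    "ACG",  # spans the window, allele 1
    "TCA",  # spans the window, allele 2
    "TCA",  # spans the window, allele 2
    "T..",  # partial read, covers only the first column
    "..G",  # partial read, covers only the last column
]

def column_consensus(rows):
    """Majority base per column; assumes every column is covered at least once."""
    consensus = []
    for column in zip(*rows):
        bases = [b for b in column if b != UNCOVERED]
        consensus.append(Counter(bases).most_common(1)[0][0])
    return "".join(consensus)

def window_alleles(rows, min_support=2):
    """Group reads spanning the whole window; keep alleles with enough support."""
    spanning = [r for r in rows if UNCOVERED not in r]
    counts = Counter(spanning)
    return [(allele, n) for allele, n in counts.most_common() if n >= min_support]

mosaic = column_consensus(READS)
alleles = window_alleles(READS)
print("column-by-column consensus:", mosaic)
print("window-based alleles:", alleles)
print("mosaic matches a spanning read:", mosaic in [a for a, _ in alleles])
```

In this toy input the per-column vote mixes bases from both alleles because the partial reads tip the first and last columns in opposite directions, while the window-based grouping reports each allele with its supporting read count.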
RESULTS: Applied to the individual human genome assembly, the new algorithm detects exactly two confirmed alleles and reports two consensus sequences in 98.98% of the 2,033,311 detected regions of sequence variation. In 33,269 of the 460,373 detected regions larger than 1 bp, it fixes errors in which the original Celera Assembler consensus algorithm constructed a mosaic haploid representation of a diploid locus. Using an optimized procedure calibrated against 1,506,344 known SNPs, it detects 438,814 new heterozygous SNPs with a false positive rate of 12%.
AVAILABILITY: The open source code is available at: http://wgs-assembler.cvs.sourceforge.net/wgs-assembler/

Year:  2008        PMID: 18321888     DOI: 10.1093/bioinformatics/btn074

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


