Literature DB >> 12952883

PCAP: a whole-genome assembly program.

Xiaoqiu Huang1, Jianmin Wang, Srinivas Aluru, Shiaw-Pyng Yang, LaDeana Hillier.   

Abstract

We describe a whole-genome assembly program named PCAP for processing tens of millions of reads. The PCAP program has several features to address efficiency and accuracy issues in assembly. Multiple processors are used to perform most time-consuming computations in assembly. A more sensitive method is used to avoid missing overlaps caused by sequencing errors. Repetitive regions of reads are detected on the basis of many overlaps with other reads, instead of many shorter word matches with other reads. Contaminated end regions of reads are identified and removed. Generation of a consensus sequence for a contig is based on an alignment of reads in the contig, in which both base quality values and coverage information are used to determine every consensus base. The PCAP program was tested on a mouse whole-genome data set of 30 million reads and a human Chromosome 20 data set of 1.7 million reads. The program is freely available for academic use.

Entities:  

Mesh:

Year:  2003        PMID: 12952883      PMCID: PMC403719          DOI: 10.1101/gr.1390403

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  18 in total

1.  CAP3: A DNA sequence assembly program.

Authors:  X Huang; A Madan
Journal:  Genome Res       Date:  1999-09       Impact factor: 9.043

2.  An Eulerian path approach to DNA fragment assembly.

Authors:  P A Pevzner; H Tang; M S Waterman
Journal:  Proc Natl Acad Sci U S A       Date:  2001-08-14       Impact factor: 11.205

3.  RePS: a sequence assembler that masks exact repeats identified from the shotgun data.

Authors:  Jun Wang; Gane Ka-Shu Wong; Peixiang Ni; Yujun Han; Xiangang Huang; Jianguo Zhang; Chen Ye; Yong Zhang; Jianfei Hu; Kunlin Zhang; Xin Xu; Lijuan Cong; Hong Lu; Xide Ren; Xiaoyu Ren; Jun He; Lin Tao; Douglas A Passey; Jian Wang; Huanming Yang; Jun Yu; Songgang Li
Journal:  Genome Res       Date:  2002-05       Impact factor: 9.043

4.  An optimized protocol for analysis of EST sequences.

Authors:  F Liang; I Holt; G Pertea; S Karamycheva; S L Salzberg; J Quackenbush
Journal:  Nucleic Acids Res       Date:  2000-09-15       Impact factor: 16.971

5.  Initial sequencing and comparative analysis of the mouse genome.

Authors:  Robert H Waterston; Kerstin Lindblad-Toh; Ewan Birney; Jane Rogers; Josep F Abril; Pankaj Agarwal; Richa Agarwala; Rachel Ainscough; Marina Alexandersson; Peter An; Stylianos E Antonarakis; John Attwood; Robert Baertsch; Jonathon Bailey; Karen Barlow; Stephan Beck; Eric Berry; Bruce Birren; Toby Bloom; Peer Bork; Marc Botcherby; Nicolas Bray; Michael R Brent; Daniel G Brown; Stephen D Brown; Carol Bult; John Burton; Jonathan Butler; Robert D Campbell; Piero Carninci; Simon Cawley; Francesca Chiaromonte; Asif T Chinwalla; Deanna M Church; Michele Clamp; Christopher Clee; Francis S Collins; Lisa L Cook; Richard R Copley; Alan Coulson; Olivier Couronne; James Cuff; Val Curwen; Tim Cutts; Mark Daly; Robert David; Joy Davies; Kimberly D Delehaunty; Justin Deri; Emmanouil T Dermitzakis; Colin Dewey; Nicholas J Dickens; Mark Diekhans; Sheila Dodge; Inna Dubchak; Diane M Dunn; Sean R Eddy; Laura Elnitski; Richard D Emes; Pallavi Eswara; Eduardo Eyras; Adam Felsenfeld; Ginger A Fewell; Paul Flicek; Karen Foley; Wayne N Frankel; Lucinda A Fulton; Robert S Fulton; Terrence S Furey; Diane Gage; Richard A Gibbs; Gustavo Glusman; Sante Gnerre; Nick Goldman; Leo Goodstadt; Darren Grafham; Tina A Graves; Eric D Green; Simon Gregory; Roderic Guigó; Mark Guyer; Ross C Hardison; David Haussler; Yoshihide Hayashizaki; LaDeana W Hillier; Angela Hinrichs; Wratko Hlavina; Timothy Holzer; Fan Hsu; Axin Hua; Tim Hubbard; Adrienne Hunt; Ian Jackson; David B Jaffe; L Steven Johnson; Matthew Jones; Thomas A Jones; Ann Joy; Michael Kamal; Elinor K Karlsson; Donna Karolchik; Arkadiusz Kasprzyk; Jun Kawai; Evan Keibler; Cristyn Kells; W James Kent; Andrew Kirby; Diana L Kolbe; Ian Korf; Raju S Kucherlapati; Edward J Kulbokas; David Kulp; Tom Landers; J P Leger; Steven Leonard; Ivica Letunic; Rosie Levine; Jia Li; Ming Li; Christine Lloyd; Susan Lucas; Bin Ma; Donna R Maglott; Elaine R Mardis; Lucy Matthews; Evan Mauceli; John H Mayer; Megan McCarthy; W Richard McCombie; Stuart McLaren; Kirsten McLay; John D McPherson; Jim Meldrim; Beverley Meredith; Jill P Mesirov; Webb Miller; Tracie L Miner; Emmanuel Mongin; Kate T Montgomery; Michael Morgan; Richard Mott; James C Mullikin; Donna M Muzny; William E Nash; Joanne O Nelson; Michael N Nhan; Robert Nicol; Zemin Ning; Chad Nusbaum; Michael J O'Connor; Yasushi Okazaki; Karen Oliver; Emma Overton-Larty; Lior Pachter; Genís Parra; Kymberlie H Pepin; Jane Peterson; Pavel Pevzner; Robert Plumb; Craig S Pohl; Alex Poliakov; Tracy C Ponce; Chris P Ponting; Simon Potter; Michael Quail; Alexandre Reymond; Bruce A Roe; Krishna M Roskin; Edward M Rubin; Alistair G Rust; Ralph Santos; Victor Sapojnikov; Brian Schultz; Jörg Schultz; Matthias S Schwartz; Scott Schwartz; Carol Scott; Steven Seaman; Steve Searle; Ted Sharpe; Andrew Sheridan; Ratna Shownkeen; Sarah Sims; Jonathan B Singer; Guy Slater; Arian Smit; Douglas R Smith; Brian Spencer; Arne Stabenau; Nicole Stange-Thomann; Charles Sugnet; Mikita Suyama; Glenn Tesler; Johanna Thompson; David Torrents; Evanne Trevaskis; John Tromp; Catherine Ucla; Abel Ureta-Vidal; Jade P Vinson; Andrew C Von Niederhausern; Claire M Wade; Melanie Wall; Ryan J Weber; Robert B Weiss; Michael C Wendl; Anthony P West; Kris Wetterstrand; Raymond Wheeler; Simon Whelan; Jamey Wierzbowski; David Willey; Sophie Williams; Richard K Wilson; Eitan Winter; Kim C Worley; Dudley Wyman; Shan Yang; Shiaw-Pyng Yang; Evgeny M Zdobnov; Michael C Zody; Eric S Lander
Journal:  Nature       Date:  2002-12-05       Impact factor: 49.962

6.  Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes.

Authors:  Samuel Aparicio; Jarrod Chapman; Elia Stupka; Nik Putnam; Jer-Ming Chia; Paramvir Dehal; Alan Christoffels; Sam Rash; Shawn Hoon; Arian Smit; Maarten D Sollewijn Gelpke; Jared Roach; Tania Oh; Isaac Y Ho; Marie Wong; Chris Detter; Frans Verhoef; Paul Predki; Alice Tay; Susan Lucas; Paul Richardson; Sarah F Smith; Melody S Clark; Yvonne J K Edwards; Norman Doggett; Andrey Zharkikh; Sean V Tavtigian; Dmitry Pruss; Mary Barnstead; Cheryl Evans; Holly Baden; Justin Powell; Gustavo Glusman; Lee Rowen; Leroy Hood; Y H Tan; Greg Elgar; Trevor Hawkins; Byrappa Venkatesh; Daniel Rokhsar; Sydney Brenner
Journal:  Science       Date:  2002-07-25       Impact factor: 47.728

7.  A whole-genome assembly of Drosophila.

Authors:  E W Myers; G G Sutton; A L Delcher; I M Dew; D P Fasulo; M J Flanigan; S A Kravitz; C M Mobarry; K H Reinert; K A Remington; E L Anson; R A Bolanos; H H Chou; C M Jordan; A L Halpern; S Lonardi; E M Beasley; R C Brandon; L Chen; P J Dunn; Z Lai; Y Liang; D R Nusskern; M Zhan; Q Zhang; X Zheng; G M Rubin; M D Adams; J C Venter
Journal:  Science       Date:  2000-03-24       Impact factor: 47.728

8.  ARACHNE: a whole-genome shotgun assembler.

Authors:  Serafim Batzoglou; David B Jaffe; Ken Stanley; Jonathan Butler; Sante Gnerre; Evan Mauceli; Bonnie Berger; Jill P Mesirov; Eric S Lander
Journal:  Genome Res       Date:  2002-01       Impact factor: 9.043

9.  The phusion assembler.

Authors:  James C Mullikin; Zemin Ning
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

10.  Whole-genome sequence assembly for mammalian genomes: Arachne 2.

Authors:  David B Jaffe; Jonathan Butler; Sante Gnerre; Evan Mauceli; Kerstin Lindblad-Toh; Jill P Mesirov; Michael C Zody; Eric S Lander
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

View more
  106 in total

1.  Fosmid-based physical mapping of the Histoplasma capsulatum genome.

Authors:  Vincent Magrini; Wesley C Warren; John Wallis; William E Goldman; Jian Xu; Elaine R Mardis; John D McPherson
Journal:  Genome Res       Date:  2004-08       Impact factor: 9.043

2.  The Atlas genome assembly system.

Authors:  Paul Havlak; Rui Chen; K James Durbin; Amy Egan; Yanru Ren; Xing-Zhi Song; George M Weinstock; Richard A Gibbs
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

3.  Hawkeye and AMOS: visualizing and assessing the quality of genome assemblies.

Authors:  Michael C Schatz; Adam M Phillippy; Daniel D Sommer; Arthur L Delcher; Daniela Puiu; Giuseppe Narzisi; Steven L Salzberg; Mihai Pop
Journal:  Brief Bioinform       Date:  2011-12-23       Impact factor: 11.622

4.  Molecular cloning and characterization of the repetitive DNA sequences that comprise the constitutive heterochromatin of the W chromosomes of medaka fishes.

Authors:  Yusuke Takehana; Kiyoshi Naruse; Yusuke Asada; Yoichi Matsuda; Tadasu Shin-I; Yuji Kohara; Asao Fujiyama; Satoshi Hamaguchi; Mitsuru Sakaizumi
Journal:  Chromosome Res       Date:  2012-01       Impact factor: 5.239

5.  Genome assembly quality: assessment and improvement using the neutral indel model.

Authors:  Stephen Meader; LaDeana W Hillier; Devin Locke; Chris P Ponting; Gerton Lunter
Journal:  Genome Res       Date:  2010-03-19       Impact factor: 9.043

6.  Assembly of large genomes using second-generation sequencing.

Authors:  Michael C Schatz; Arthur L Delcher; Steven L Salzberg
Journal:  Genome Res       Date:  2010-05-27       Impact factor: 9.043

7.  Assembly of polymorphic genomes: algorithms and application to Ciona savignyi.

Authors:  Jade P Vinson; David B Jaffe; Keith O'Neill; Elinor K Karlsson; Nicole Stange-Thomann; Scott Anderson; Jill P Mesirov; Nori Satoh; Yutaka Satou; Chad Nusbaum; Bruce Birren; James E Galagan; Eric S Lander
Journal:  Genome Res       Date:  2005-08       Impact factor: 9.043

8.  Short read fragment assembly of bacterial genomes.

Authors:  Mark J Chaisson; Pavel A Pevzner
Journal:  Genome Res       Date:  2007-12-14       Impact factor: 9.043

9.  The Oxytricha trifallax macronuclear genome: a complex eukaryotic genome with 16,000 tiny chromosomes.

Authors:  Estienne C Swart; John R Bracht; Vincent Magrini; Patrick Minx; Xiao Chen; Yi Zhou; Jaspreet S Khurana; Aaron D Goldman; Mariusz Nowacki; Klaas Schotanus; Seolkyoung Jung; Robert S Fulton; Amy Ly; Sean McGrath; Kevin Haub; Jessica L Wiggins; Donna Storton; John C Matese; Lance Parsons; Wei-Jen Chang; Michael S Bowen; Nicholas A Stover; Thomas A Jones; Sean R Eddy; Glenn A Herrick; Thomas G Doak; Richard K Wilson; Elaine R Mardis; Laura F Landweber
Journal:  PLoS Biol       Date:  2013-01-29       Impact factor: 8.029

10.  Population genomics: whole-genome analysis of polymorphism and divergence in Drosophila simulans.

Authors:  David J Begun; Alisha K Holloway; Kristian Stevens; Ladeana W Hillier; Yu-Ping Poh; Matthew W Hahn; Phillip M Nista; Corbin D Jones; Andrew D Kern; Colin N Dewey; Lior Pachter; Eugene Myers; Charles H Langley
Journal:  PLoS Biol       Date:  2007-11-06       Impact factor: 8.029

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.