
The Phusion assembler.

James C Mullikin, Zemin Ning.

Abstract

The Phusion assembler has assembled the mouse genome from the whole-genome shotgun (WGS) dataset collected by the Mouse Genome Sequencing Consortium, at ~7.5x sequence coverage, producing a high-quality draft assembly 2.6 gigabases in size, with 90% of the bases in 479 scaffolds. Because the mouse genome is large and repeat-rich, the input dataset was designed to include a high proportion of paired-end sequences from size-selected inserts, 2-200 kbp in length, cloned into various host vector templates. Phusion uses sequence data, called reads, and information about reads that share a common template, called read pairs, to drive the assembly of this large genome to highly accurate results. The preassembly stage, which clusters the reads into sensible groups, is a key element of the entire assembler because it permits a simple approach to parallelizing the assembly stage: each cluster can be treated independently of the others. In addition to the application of Phusion to the mouse genome, we also present results from the WGS assembly of Caenorhabditis briggsae, sequenced to about 11x coverage. The C. briggsae assembly was accessioned through EMBL, http://www.ebi.ac.uk/services/index.html, using the series CAAC01000001-CAAC01000578; the Phusion mouse assembly described here, however, was not accessioned. The mouse data were generated by the Mouse Genome Sequencing Consortium. The C. briggsae sequence was generated at The Wellcome Trust Sanger Institute and the Genome Sequencing Center, Washington University School of Medicine.
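
The idea of a preassembly stage that groups reads so each group can be assembled on its own can be illustrated with a small sketch. The abstract does not say how Phusion forms its clusters, so everything below is an assumption made for illustration: reads are linked when they share an exact k-mer, links are merged with a union-find structure, and k-mers seen in too many reads are skipped as probable repeats. The function names, the value of `k`, and the `max_occurrences` threshold are all hypothetical, and reverse complements and read-pair information are ignored for brevity.

```python
from collections import defaultdict


def kmers(seq, k):
    """Yield every substring of length k in seq."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def find(parent, x):
    """Return the root of x in the union-find forest, halving paths on the way."""
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x


def cluster_reads(reads, k=8, max_occurrences=50):
    """Group read indices that are connected through shared k-mers.

    Assumption (not from the paper): a k-mer found in more than
    max_occurrences reads is treated as a repeat and creates no links.
    """
    # Map each k-mer to the set of reads that contain it.
    index = defaultdict(set)
    for i, seq in enumerate(reads):
        for km in kmers(seq, k):
            index[km].add(i)

    # Merge all reads that share a non-repetitive k-mer.
    parent = list(range(len(reads)))
    for members in index.values():
        if len(members) < 2 or len(members) > max_occurrences:
            continue
        ordered = sorted(members)
        root = find(parent, ordered[0])
        for other in ordered[1:]:
            parent[find(parent, other)] = root

    # Collect the reads under each root into one cluster.
    clusters = defaultdict(list)
    for i in range(len(reads)):
        clusters[find(parent, i)].append(i)
    return sorted(clusters.values())


if __name__ == "__main__":
    toy_reads = ["ACGTACGTGGA", "CGTACGTGGAT", "TTTTCCCCAAAG", "TTCCCCAAAGG"]
    for cluster in cluster_reads(toy_reads, k=8):
        print(cluster)
```

Because the clusters returned here do not share reads, each one could be handed to a separate worker process for assembly, which is the property the abstract credits for the simple parallelization of the assembly stage.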

Year:  2003        PMID: 12529309      PMCID: PMC430959          DOI: 10.1101/gr.731003

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


