Literature DB >> 21102452

Limitations of next-generation genome sequence assembly.

Can Alkan1, Saba Sajjadian, Evan E Eichler.   

Abstract

High-throughput sequencing technologies promise to transform the fields of genetics and comparative biology by delivering tens of thousands of genomes in the near future. Although it is feasible to construct de novo genome assemblies in a few months, there has been relatively little attention to what is lost by sole application of short sequence reads. We compared the recent de novo assemblies using the short oligonucleotide analysis package (SOAP), generated from the genomes of a Han Chinese individual and a Yoruban individual, to experimentally validated genomic features. We found that de novo assemblies were 16.2% shorter than the reference genome and that 420.2 megabase pairs of common repeats and 99.1% of validated duplicated sequences were missing from the genome. Consequently, over 2,377 coding exons were completely missing. We conclude that high-quality sequencing approaches must be considered in conjunction with high-throughput sequencing for comparative genomics analyses and studies of genome evolution.

Entities:  

Mesh:

Year:  2010        PMID: 21102452      PMCID: PMC3115693          DOI: 10.1038/nmeth.1527

Source DB:  PubMed          Journal:  Nat Methods        ISSN: 1548-7091            Impact factor:   28.547


  27 in total

1.  Whole-genome disassembly.

Authors:  Phil Green
Journal:  Proc Natl Acad Sci U S A       Date:  2002-03-19       Impact factor: 11.205

2.  De novo fragment assembly with short mate-paired reads: Does the read length matter?

Authors:  Mark J Chaisson; Dumitru Brinza; Pavel A Pevzner
Journal:  Genome Res       Date:  2008-12-03       Impact factor: 9.043

3.  The complete genome of an individual by massively parallel DNA sequencing.

Authors:  David A Wheeler; Maithreyan Srinivasan; Michael Egholm; Yufeng Shen; Lei Chen; Amy McGuire; Wen He; Yi-Ju Chen; Vinod Makhijani; G Thomas Roth; Xavier Gomes; Karrie Tartaro; Faheem Niazi; Cynthia L Turcotte; Gerard P Irzyk; James R Lupski; Craig Chinault; Xing-zhi Song; Yue Liu; Ye Yuan; Lynne Nazareth; Xiang Qin; Donna M Muzny; Marcel Margulies; George M Weinstock; Richard A Gibbs; Jonathan M Rothberg
Journal:  Nature       Date:  2008-04-17       Impact factor: 49.962

4.  Genome 10K: a proposal to obtain whole-genome sequence for 10,000 vertebrate species.

Authors: 
Journal:  J Hered       Date:  2009-11-05       Impact factor: 2.645

5.  Building the sequence map of the human pan-genome.

Authors:  Ruiqiang Li; Yingrui Li; Hancheng Zheng; Ruibang Luo; Hongmei Zhu; Qibin Li; Wubin Qian; Yuanyuan Ren; Geng Tian; Jinxiang Li; Guangyu Zhou; Xuan Zhu; Honglong Wu; Junjie Qin; Xin Jin; Dongfang Li; Hongzhi Cao; Xueda Hu; Hélène Blanche; Howard Cann; Xiuqing Zhang; Songgang Li; Lars Bolund; Karsten Kristiansen; Huanming Yang; Jun Wang; Jian Wang
Journal:  Nat Biotechnol       Date:  2009-12-07       Impact factor: 54.908

6.  A whole-genome assembly of Drosophila.

Authors:  E W Myers; G G Sutton; A L Delcher; I M Dew; D P Fasulo; M J Flanigan; S A Kravitz; C M Mobarry; K H Reinert; K A Remington; E L Anson; R A Bolanos; H H Chou; C M Jordan; A L Halpern; S Lonardi; E M Beasley; R C Brandon; L Chen; P J Dunn; Z Lai; Y Liang; D R Nusskern; M Zhan; Q Zhang; X Zheng; G M Rubin; M D Adams; J C Venter
Journal:  Science       Date:  2000-03-24       Impact factor: 47.728

7.  Finishing the euchromatic sequence of the human genome.

Authors: 
Journal:  Nature       Date:  2004-10-21       Impact factor: 49.962

8.  Shotgun sequence assembly and recent segmental duplications within the human genome.

Authors:  Xinwei She; Zhaoshi Jiang; Royden A Clark; Ge Liu; Ze Cheng; Eray Tuzun; Deanna M Church; Granger Sutton; Aaron L Halpern; Evan E Eichler
Journal:  Nature       Date:  2004-10-21       Impact factor: 49.962

9.  The sequence and de novo assembly of the giant panda genome.

Authors:  Ruiqiang Li; Wei Fan; Geng Tian; Hongmei Zhu; Lin He; Jing Cai; Quanfei Huang; Qingle Cai; Bo Li; Yinqi Bai; Zhihe Zhang; Yaping Zhang; Wen Wang; Jun Li; Fuwen Wei; Heng Li; Min Jian; Jianwen Li; Zhaolei Zhang; Rasmus Nielsen; Dawei Li; Wanjun Gu; Zhentao Yang; Zhaoling Xuan; Oliver A Ryder; Frederick Chi-Ching Leung; Yan Zhou; Jianjun Cao; Xiao Sun; Yonggui Fu; Xiaodong Fang; Xiaosen Guo; Bo Wang; Rong Hou; Fujun Shen; Bo Mu; Peixiang Ni; Runmao Lin; Wubin Qian; Guodong Wang; Chang Yu; Wenhui Nie; Jinhuan Wang; Zhigang Wu; Huiqing Liang; Jiumeng Min; Qi Wu; Shifeng Cheng; Jue Ruan; Mingwei Wang; Zhongbin Shi; Ming Wen; Binghang Liu; Xiaoli Ren; Huisong Zheng; Dong Dong; Kathleen Cook; Gao Shan; Hao Zhang; Carolin Kosiol; Xueying Xie; Zuhong Lu; Hancheng Zheng; Yingrui Li; Cynthia C Steiner; Tommy Tsan-Yuk Lam; Siyuan Lin; Qinghui Zhang; Guoqing Li; Jing Tian; Timing Gong; Hongde Liu; Dejin Zhang; Lin Fang; Chen Ye; Juanbin Zhang; Wenbo Hu; Anlong Xu; Yuanyuan Ren; Guojie Zhang; Michael W Bruford; Qibin Li; Lijia Ma; Yiran Guo; Na An; Yujie Hu; Yang Zheng; Yongyong Shi; Zhiqiang Li; Qing Liu; Yanling Chen; Jing Zhao; Ning Qu; Shancen Zhao; Feng Tian; Xiaoling Wang; Haiyin Wang; Lizhi Xu; Xiao Liu; Tomas Vinar; Yajun Wang; Tak-Wah Lam; Siu-Ming Yiu; Shiping Liu; Hemin Zhang; Desheng Li; Yan Huang; Xia Wang; Guohua Yang; Zhi Jiang; Junyi Wang; Nan Qin; Li Li; Jingxiang Li; Lars Bolund; Karsten Kristiansen; Gane Ka-Shu Wong; Maynard Olson; Xiuqing Zhang; Songgang Li; Huanming Yang; Jian Wang; Jun Wang
Journal:  Nature       Date:  2009-12-13       Impact factor: 49.962

10.  Complete Khoisan and Bantu genomes from southern Africa.

Authors:  Stephan C Schuster; Webb Miller; Aakrosh Ratan; Lynn P Tomsho; Belinda Giardine; Lindsay R Kasson; Robert S Harris; Desiree C Petersen; Fangqing Zhao; Ji Qi; Can Alkan; Jeffrey M Kidd; Yazhou Sun; Daniela I Drautz; Pascal Bouffard; Donna M Muzny; Jeffrey G Reid; Lynne V Nazareth; Qingyu Wang; Richard Burhans; Cathy Riemer; Nicola E Wittekindt; Priya Moorjani; Elizabeth A Tindall; Charles G Danko; Wee Siang Teo; Anne M Buboltz; Zhenhai Zhang; Qianyi Ma; Arno Oosthuysen; Abraham W Steenkamp; Hermann Oostuisen; Philippus Venter; John Gajewski; Yu Zhang; B Franklin Pugh; Kateryna D Makova; Anton Nekrutenko; Elaine R Mardis; Nick Patterson; Tom H Pringle; Francesca Chiaromonte; James C Mullikin; Evan E Eichler; Ross C Hardison; Richard A Gibbs; Timothy T Harkins; Vanessa M Hayes
Journal:  Nature       Date:  2010-02-18       Impact factor: 49.962

View more
  325 in total

1.  Graph accordance of next-generation sequence assemblies.

Authors:  Guohui Yao; Liang Ye; Hongyu Gao; Patrick Minx; Wesley C Warren; George M Weinstock
Journal:  Bioinformatics       Date:  2011-10-23       Impact factor: 6.937

2.  Test driving genome assemblers.

Authors:  Wei Fan; Ruiqiang Li
Journal:  Nat Biotechnol       Date:  2012-04-10       Impact factor: 54.908

3.  The genome of melon (Cucumis melo L.).

Authors:  Jordi Garcia-Mas; Andrej Benjak; Walter Sanseverino; Michael Bourgeois; Gisela Mir; Víctor M González; Elizabeth Hénaff; Francisco Câmara; Luca Cozzuto; Ernesto Lowy; Tyler Alioto; Salvador Capella-Gutiérrez; Jose Blanca; Joaquín Cañizares; Pello Ziarsolo; Daniel Gonzalez-Ibeas; Luis Rodríguez-Moreno; Marcus Droege; Lei Du; Miguel Alvarez-Tejado; Belen Lorente-Galdos; Marta Melé; Luming Yang; Yiqun Weng; Arcadi Navarro; Tomas Marques-Bonet; Miguel A Aranda; Fernando Nuez; Belén Picó; Toni Gabaldón; Guglielmo Roma; Roderic Guigó; Josep M Casacuberta; Pere Arús; Pere Puigdomènech
Journal:  Proc Natl Acad Sci U S A       Date:  2012-07-02       Impact factor: 11.205

4.  Assemblies: the good, the bad, the ugly.

Authors:  Ewan Birney
Journal:  Nat Methods       Date:  2011-01       Impact factor: 28.547

5.  RAD Capture (Rapture): Flexible and Efficient Sequence-Based Genotyping.

Authors:  Omar A Ali; Sean M O'Rourke; Stephen J Amish; Mariah H Meek; Gordon Luikart; Carson Jeffres; Michael R Miller
Journal:  Genetics       Date:  2015-12-29       Impact factor: 4.562

6.  Assessment of human diploid genome assembly with 10x Linked-Reads data.

Authors:  Lu Zhang; Xin Zhou; Ziming Weng; Arend Sidow
Journal:  Gigascience       Date:  2019-11-01       Impact factor: 6.524

Review 7.  Massively parallel sequencing: the new frontier of hematologic genomics.

Authors:  Jill M Johnsen; Deborah A Nickerson; Alex P Reiner
Journal:  Blood       Date:  2013-09-10       Impact factor: 22.113

8.  High-quality draft assemblies of mammalian genomes from massively parallel sequence data.

Authors:  Sante Gnerre; Iain Maccallum; Dariusz Przybylski; Filipe J Ribeiro; Joshua N Burton; Bruce J Walker; Ted Sharpe; Giles Hall; Terrance P Shea; Sean Sykes; Aaron M Berlin; Daniel Aird; Maura Costello; Riza Daza; Louise Williams; Robert Nicol; Andreas Gnirke; Chad Nusbaum; Eric S Lander; David B Jaffe
Journal:  Proc Natl Acad Sci U S A       Date:  2010-12-27       Impact factor: 11.205

Review 9.  Functional primate genomics--leveraging the medical potential.

Authors:  Wolfgang Enard
Journal:  J Mol Med (Berl)       Date:  2012-05-04       Impact factor: 4.599

10.  Genome assembly and haplotyping with Hi-C.

Authors:  Jan O Korbel; Charles Lee
Journal:  Nat Biotechnol       Date:  2013-12       Impact factor: 54.908

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.