Literature DB >> 20724458

De novo assembly of short sequence reads.

Konrad Paszkiewicz1, David J Studholme.   

Abstract

A new generation of sequencing technologies is revolutionizing molecular biology. Illumina's Solexa and Applied Biosystems' SOLiD generate gigabases of nucleotide sequence per week. However, a perceived limitation of these ultra-high-throughput technologies is their short read-lengths. De novo assembly of sequence reads generated by classical Sanger capillary sequencing is a mature field of research. Unfortunately, the existing sequence assembly programs were not effective for short sequence reads generated by Illumina and SOLiD platforms. Early studies suggested that, in principle, sequence reads as short as 20-30 nucleotides could be used to generate useful assemblies of both prokaryotic and eukaryotic genome sequences, albeit containing many gaps. The early feasibility studies and proofs of principle inspired several bioinformatics research groups to implement new algorithms as freely available software tools specifically aimed at assembling reads of 30-50 nucleotides in length. This has led to the generation of several draft genome sequences based exclusively on short sequence Illumina sequence reads, recently culminating in the assembly of the 2.25-Gb genome of the giant panda from Illumina sequence reads with an average length of just 52 nucleotides. As well as reviewing recent developments in the field, we discuss some practical aspects such as data filtering and submission of assembly data to public repositories.

Entities:  

Mesh:

Year:  2010        PMID: 20724458     DOI: 10.1093/bib/bbq020

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  62 in total

Review 1.  Next-generation transcriptome assembly.

Authors:  Jeffrey A Martin; Zhong Wang
Journal:  Nat Rev Genet       Date:  2011-09-07       Impact factor: 53.242

Review 2.  Survey of (Meta)genomic Approaches for Understanding Microbial Community Dynamics.

Authors:  Anukriti Sharma; Rup Lal
Journal:  Indian J Microbiol       Date:  2016-11-11       Impact factor: 2.461

3.  RIFRAF: a frame-resolving consensus algorithm.

Authors:  Kemal Eren; Ben Murrell
Journal:  Bioinformatics       Date:  2018-11-15       Impact factor: 6.937

4.  De novo transcriptome assembly for a non-model species, the blood-sucking bug Triatoma brasiliensis, a vector of Chagas disease.

Authors:  A Marchant; F Mougel; C Almeida; E Jacquin-Joly; J Costa; M Harry
Journal:  Genetica       Date:  2014-09-19       Impact factor: 1.082

5.  Effects of short read quality and quantity on a de novo vertebrate transcriptome assembly.

Authors:  T I Garcia; Y Shen; J Catchen; A Amores; M Schartl; J Postlethwait; R B Walter
Journal:  Comp Biochem Physiol C Toxicol Pharmacol       Date:  2011-06-01       Impact factor: 3.228

6.  Consistency-based detection of potential tumor-specific deletions in matched normal/tumor genomes.

Authors:  Roland Wittler; Cedric Chauve
Journal:  BMC Bioinformatics       Date:  2011-10-05       Impact factor: 3.169

7.  Mining new crystal protein genes from Bacillus thuringiensis on the basis of mixed plasmid-enriched genome sequencing and a computational pipeline.

Authors:  Weixing Ye; Lei Zhu; Yingying Liu; Neil Crickmore; Donghai Peng; Lifang Ruan; Ming Sun
Journal:  Appl Environ Microbiol       Date:  2012-04-27       Impact factor: 4.792

8.  Comparative Transcriptome Analysis between the Fungal Plant Pathogens Sclerotinia sclerotiorum and S. trifoliorum Using RNA Sequencing.

Authors:  Dan Qiu; Liangsheng Xu; George Vandemark; Weidong Chen
Journal:  J Hered       Date:  2015-11-27       Impact factor: 2.645

9.  Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species.

Authors:  Keith R Bradnam; Joseph N Fass; Anton Alexandrov; Paul Baranay; Michael Bechner; Inanç Birol; Sébastien Boisvert; Jarrod A Chapman; Guillaume Chapuis; Rayan Chikhi; Hamidreza Chitsaz; Wen-Chi Chou; Jacques Corbeil; Cristian Del Fabbro; T Roderick Docking; Richard Durbin; Dent Earl; Scott Emrich; Pavel Fedotov; Nuno A Fonseca; Ganeshkumar Ganapathy; Richard A Gibbs; Sante Gnerre; Elénie Godzaridis; Steve Goldstein; Matthias Haimel; Giles Hall; David Haussler; Joseph B Hiatt; Isaac Y Ho; Jason Howard; Martin Hunt; Shaun D Jackman; David B Jaffe; Erich D Jarvis; Huaiyang Jiang; Sergey Kazakov; Paul J Kersey; Jacob O Kitzman; James R Knight; Sergey Koren; Tak-Wah Lam; Dominique Lavenier; François Laviolette; Yingrui Li; Zhenyu Li; Binghang Liu; Yue Liu; Ruibang Luo; Iain Maccallum; Matthew D Macmanes; Nicolas Maillet; Sergey Melnikov; Delphine Naquin; Zemin Ning; Thomas D Otto; Benedict Paten; Octávio S Paulo; Adam M Phillippy; Francisco Pina-Martins; Michael Place; Dariusz Przybylski; Xiang Qin; Carson Qu; Filipe J Ribeiro; Stephen Richards; Daniel S Rokhsar; J Graham Ruby; Simone Scalabrin; Michael C Schatz; David C Schwartz; Alexey Sergushichev; Ted Sharpe; Timothy I Shaw; Jay Shendure; Yujian Shi; Jared T Simpson; Henry Song; Fedor Tsarev; Francesco Vezzi; Riccardo Vicedomini; Bruno M Vieira; Jun Wang; Kim C Worley; Shuangye Yin; Siu-Ming Yiu; Jianying Yuan; Guojie Zhang; Hao Zhang; Shiguo Zhou; Ian F Korf
Journal:  Gigascience       Date:  2013-07-22       Impact factor: 6.524

10.  Implications of Mobile Genetic Elements for Salmonella enterica Single-Nucleotide Polymorphism Subtyping and Source Tracking Investigations.

Authors:  Shaoting Li; Shaokang Zhang; Leen Baert; Balamurugan Jagadeesan; Catherine Ngom-Bru; Taylor Griswold; Lee S Katz; Heather A Carleton; Xiangyu Deng
Journal:  Appl Environ Microbiol       Date:  2019-11-27       Impact factor: 4.792

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.