Literature DB >> 22111509

iAssembler: a package for de novo assembly of Roche-454/Sanger transcriptome sequences.

Yi Zheng1, Liangjun Zhao, Junping Gao, Zhangjun Fei.   

Abstract

BACKGROUND: Expressed Sequence Tags (ESTs) have played significant roles in gene discovery and gene functional analysis, especially for non-model organisms. For organisms with no full genome sequences available, ESTs are normally assembled into longer consensus sequences for further downstream analysis. However current de novo EST assembly programs often generate large number of assembly errors that will negatively affect the downstream analysis. In order to generate more accurate consensus sequences from ESTs, tools are needed to reduce or eliminate errors from de novo assemblies.
RESULTS: We present iAssembler, a pipeline that can assemble large-scale ESTs into consensus sequences with significantly higher accuracy than current existing assemblers. iAssembler employs MIRA and CAP3 assemblers to generate initial assemblies, followed by identifying and correcting two common types of transcriptome assembly errors: 1) ESTs from different transcripts (mainly alternatively spliced transcripts or paralogs) are incorrectly assembled into same contigs; and 2) ESTs from same transcripts fail to be assembled together. iAssembler can be used to assemble ESTs generated using the traditional Sanger method and/or the Roche-454 massive parallel pyrosequencing technology.
CONCLUSION: We compared performances of iAssembler and several other de novo EST assembly programs using both Roche-454 and Sanger EST datasets. It demonstrated that iAssembler generated significantly more accurate consensus sequences than other assembly programs.

Entities:  

Mesh:

Year:  2011        PMID: 22111509      PMCID: PMC3233632          DOI: 10.1186/1471-2105-12-453

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  20 in total

1.  A greedy algorithm for aligning DNA sequences.

Authors:  Z Zhang; S Schwartz; L Wagner; W Miller
Journal:  J Comput Biol       Date:  2000 Feb-Apr       Impact factor: 1.479

2.  TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets.

Authors:  Geo Pertea; Xiaoqiu Huang; Feng Liang; Valentin Antonescu; Razvan Sultana; Svetlana Karamycheva; Yuandan Lee; Joseph White; Foo Cheung; Babak Parvizi; Jennifer Tsai; John Quackenbush
Journal:  Bioinformatics       Date:  2003-03-22       Impact factor: 6.937

3.  DNA sequence quality trimming and vector removal.

Authors:  H H Chou; M H Holmes
Journal:  Bioinformatics       Date:  2001-12       Impact factor: 6.937

4.  Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types.

Authors:  Shaogui Guo; Yi Zheng; Je-Gun Joung; Shiqiang Liu; Zhonghua Zhang; Oswald R Crasta; Bruno W Sobral; Yong Xu; Sanwen Huang; Zhangjun Fei
Journal:  BMC Genomics       Date:  2010-06-17       Impact factor: 3.969

5.  The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic species.

Authors:  J Quackenbush; J Cho; D Lee; F Liang; I Holt; S Karamycheva; B Parvizi; G Pertea; R Sultana; J White
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

6.  De novo sequencing and analysis of the American ginseng root transcriptome using a GS FLX Titanium platform to discover putative genes involved in ginsenoside biosynthesis.

Authors:  Chao Sun; Ying Li; Qiong Wu; Hongmei Luo; Yongzhen Sun; Jingyuan Song; Edmund M K Lui; Shilin Chen
Journal:  BMC Genomics       Date:  2010-04-24       Impact factor: 3.969

7.  Integrative genomics viewer.

Authors:  James T Robinson; Helga Thorvaldsdóttir; Wendy Winckler; Mitchell Guttman; Eric S Lander; Gad Getz; Jill P Mesirov
Journal:  Nat Biotechnol       Date:  2011-01       Impact factor: 54.908

8.  Comparing de novo assemblers for 454 transcriptome data.

Authors:  Sujai Kumar; Mark L Blaxter
Journal:  BMC Genomics       Date:  2010-10-16       Impact factor: 3.969

9.  Comparative 454 pyrosequencing of transcripts from two olive genotypes during fruit development.

Authors:  Fiammetta Alagna; Nunzio D'Agostino; Laura Torchia; Maurizio Servili; Rosa Rao; Marco Pietrella; Giovanni Giuliano; Maria Luisa Chiusano; Luciana Baldoni; Gaetano Perrotta
Journal:  BMC Genomics       Date:  2009-08-26       Impact factor: 3.969

10.  Combining next-generation pyrosequencing with microarray for large scale expression analysis in non-model species.

Authors:  Diana Bellin; Alberto Ferrarini; Antonio Chimento; Olaf Kaiser; Natasha Levenkova; Pascal Bouffard; Massimo Delledonne
Journal:  BMC Genomics       Date:  2009-11-24       Impact factor: 3.969

View more
  76 in total

1.  CBrowse: a SAM/BAM-based contig browser for transcriptome assembly visualization and analysis.

Authors:  Pei Li; Guoli Ji; Min Dong; Emily Schmidt; Douglas Lenox; Liangliang Chen; Qi Liu; Lin Liu; Jie Zhang; Chun Liang
Journal:  Bioinformatics       Date:  2012-07-12       Impact factor: 6.937

2.  Endosymbiotic gene transfer in tertiary plastid-containing dinoflagellates.

Authors:  Fabien Burki; Behzad Imanian; Elisabeth Hehenberger; Yoshihisa Hirakawa; Shinichiro Maruyama; Patrick J Keeling
Journal:  Eukaryot Cell       Date:  2013-12-02

3.  A Zinc Finger Protein Regulates Flowering Time and Abiotic Stress Tolerance in Chrysanthemum by Modulating Gibberellin Biosynthesis.

Authors:  Yingjie Yang; Chao Ma; Yanjie Xu; Qian Wei; Muhammad Imtiaz; Haibo Lan; Shan Gao; Lina Cheng; Meiyan Wang; Zhangjun Fei; Bo Hong; Junping Gao
Journal:  Plant Cell       Date:  2014-05-23       Impact factor: 11.277

4.  Long-Read Annotation: Automated Eukaryotic Genome Annotation Based on Long-Read cDNA Sequencing.

Authors:  David E Cook; Jose Espejo Valle-Inclan; Alice Pajoro; Hanna Rovenich; Bart P H J Thomma; Luigi Faino
Journal:  Plant Physiol       Date:  2018-11-06       Impact factor: 8.340

5.  Evolutionary relatedness does not predict competition and co-occurrence in natural or experimental communities of green algae.

Authors:  Markos A Alexandrou; Bradley J Cardinale; John D Hall; Charles F Delwiche; Keith Fritschie; Anita Narwani; Patrick A Venail; Bastian Bentlage; M Sabrina Pankey; Todd H Oakley
Journal:  Proc Biol Sci       Date:  2015-01-22       Impact factor: 5.349

Review 6.  Quo vadis venomics? A roadmap to neglected venomous invertebrates.

Authors:  Bjoern Marcus von Reumont; Lahcen I Campbell; Ronald A Jenner
Journal:  Toxins (Basel)       Date:  2014-12-19       Impact factor: 4.546

7.  Sequence comparative analysis using networks: software for evaluating de novo transcript assembly from next-generation sequencing.

Authors:  Ian Misner; Cédric Bicep; Philippe Lopez; Sébastien Halary; Eric Bapteste; Christopher E Lane
Journal:  Mol Biol Evol       Date:  2013-05-10       Impact factor: 16.240

8.  Changes in protein expression in the salt marsh mussel Geukensia demissa: evidence for a shift from anaerobic to aerobic metabolism during prolonged aerial exposure.

Authors:  Peter A Fields; Chris Eurich; William L Gao; Bekim Cela
Journal:  J Exp Biol       Date:  2014-02-05       Impact factor: 3.312

9.  Deep venomics reveals the mechanism for expanded peptide diversity in cone snail venom.

Authors:  Sébastien Dutertre; Ai-hua Jin; Quentin Kaas; Alun Jones; Paul F Alewood; Richard J Lewis
Journal:  Mol Cell Proteomics       Date:  2012-11-14       Impact factor: 5.911

10.  Characterization of a New Pink-Fruited Tomato Mutant Results in the Identification of a Null Allele of the SlMYB12 Transcription Factor.

Authors:  Josefina-Patricia Fernandez-Moreno; Oren Tzfadia; Javier Forment; Silvia Presa; Ilana Rogachev; Sagit Meir; Diego Orzaez; Aspah Aharoni; Antonio Granell
Journal:  Plant Physiol       Date:  2016-05-06       Impact factor: 8.340

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.