Literature DB >> 22419786

Fulcrum: condensing redundant reads from high-throughput sequencing studies.

Matthew S Burriesci1, Erik M Lehnert, John R Pringle.   

Abstract

MOTIVATION: Ultra-high-throughput sequencing produces duplicate and near-duplicate reads, which can consume computational resources in downstream applications. A tool that collapses such reads should reduce storage and assembly complications and costs.
RESULTS: We developed Fulcrum to collapse identical and near-identical Illumina and 454 reads (such as those from PCR clones) into single error-corrected sequences; it can process paired-end as well as single-end reads. Fulcrum is customizable and can be deployed on a single machine, a local network or a commercially available MapReduce cluster, and it has been optimized to maximize ease-of-use, cross-platform compatibility and future scalability. Sequence datasets have been collapsed by up to 71%, and the reduced number and improved quality of the resulting sequences allow assemblers to produce longer contigs while using less memory.

Mesh:

Year:  2012        PMID: 22419786      PMCID: PMC3348557          DOI: 10.1093/bioinformatics/bts123

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  12 in total

1.  Correction of sequencing errors in a mixed set of reads.

Authors:  Leena Salmela
Journal:  Bioinformatics       Date:  2010-04-08       Impact factor: 6.937

2.  Assembly of large genomes using second-generation sequencing.

Authors:  Michael C Schatz; Arthur L Delcher; Steven L Salzberg
Journal:  Genome Res       Date:  2010-05-27       Impact factor: 9.043

3.  Galaxy: a platform for interactive large-scale genome analysis.

Authors:  Belinda Giardine; Cathy Riemer; Ross C Hardison; Richard Burhans; Laura Elnitski; Prachi Shah; Yi Zhang; Daniel Blankenberg; Istvan Albert; James Taylor; Webb Miller; W James Kent; Anton Nekrutenko
Journal:  Genome Res       Date:  2005-09-16       Impact factor: 9.043

4.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

5.  Efficient frequency-based de novo short-read clustering for error trimming in next-generation sequencing.

Authors:  Wei Qu; Shin-Ichi Hashimoto; Shinichi Morishita
Journal:  Genome Res       Date:  2009-05-13       Impact factor: 9.043

6.  SHREC: a short-read error correction method.

Authors:  Jan Schröder; Heiko Schröder; Simon J Puglisi; Ranjan Sinha; Bertil Schmidt
Journal:  Bioinformatics       Date:  2009-06-19       Impact factor: 6.937

7.  Targeted sequencing of the human X chromosome exome.

Authors:  Kajari Mondal; Amol Carl Shetty; Viren Patel; David J Cutler; Michael E Zwick
Journal:  Genomics       Date:  2011-04-16       Impact factor: 5.736

8.  The sequence and de novo assembly of the giant panda genome.

Authors:  Ruiqiang Li; Wei Fan; Geng Tian; Hongmei Zhu; Lin He; Jing Cai; Quanfei Huang; Qingle Cai; Bo Li; Yinqi Bai; Zhihe Zhang; Yaping Zhang; Wen Wang; Jun Li; Fuwen Wei; Heng Li; Min Jian; Jianwen Li; Zhaolei Zhang; Rasmus Nielsen; Dawei Li; Wanjun Gu; Zhentao Yang; Zhaoling Xuan; Oliver A Ryder; Frederick Chi-Ching Leung; Yan Zhou; Jianjun Cao; Xiao Sun; Yonggui Fu; Xiaodong Fang; Xiaosen Guo; Bo Wang; Rong Hou; Fujun Shen; Bo Mu; Peixiang Ni; Runmao Lin; Wubin Qian; Guodong Wang; Chang Yu; Wenhui Nie; Jinhuan Wang; Zhigang Wu; Huiqing Liang; Jiumeng Min; Qi Wu; Shifeng Cheng; Jue Ruan; Mingwei Wang; Zhongbin Shi; Ming Wen; Binghang Liu; Xiaoli Ren; Huisong Zheng; Dong Dong; Kathleen Cook; Gao Shan; Hao Zhang; Carolin Kosiol; Xueying Xie; Zuhong Lu; Hancheng Zheng; Yingrui Li; Cynthia C Steiner; Tommy Tsan-Yuk Lam; Siyuan Lin; Qinghui Zhang; Guoqing Li; Jing Tian; Timing Gong; Hongde Liu; Dejin Zhang; Lin Fang; Chen Ye; Juanbin Zhang; Wenbo Hu; Anlong Xu; Yuanyuan Ren; Guojie Zhang; Michael W Bruford; Qibin Li; Lijia Ma; Yiran Guo; Na An; Yujie Hu; Yang Zheng; Yongyong Shi; Zhiqiang Li; Qing Liu; Yanling Chen; Jing Zhao; Ning Qu; Shancen Zhao; Feng Tian; Xiaoling Wang; Haiyin Wang; Lizhi Xu; Xiao Liu; Tomas Vinar; Yajun Wang; Tak-Wah Lam; Siu-Ming Yiu; Shiping Liu; Hemin Zhang; Desheng Li; Yan Huang; Xia Wang; Guohua Yang; Zhi Jiang; Junyi Wang; Nan Qin; Li Li; Jingxiang Li; Lars Bolund; Karsten Kristiansen; Gane Ka-Shu Wong; Maynard Olson; Xiuqing Zhang; Songgang Li; Huanming Yang; Jian Wang; Jun Wang
Journal:  Nature       Date:  2009-12-13       Impact factor: 49.962

9.  Parallel, tag-directed assembly of locally derived short sequence reads.

Authors:  Joseph B Hiatt; Rupali P Patwardhan; Emily H Turner; Choli Lee; Jay Shendure
Journal:  Nat Methods       Date:  2010-01-17       Impact factor: 28.547

10.  Generation and analysis of transcriptomic resources for a model system on the rise: the sea anemone Aiptasia pallida and its dinoflagellate endosymbiont.

Authors:  Shinichi Sunagawa; Emily C Wilson; Michael Thaler; Marc L Smith; Carlo Caruso; John R Pringle; Virginia M Weis; Mónica Medina; Jodi A Schwarz
Journal:  BMC Genomics       Date:  2009-06-05       Impact factor: 3.969

View more
  14 in total

1.  Fast-HBR: Fast hash based duplicate read remover.

Authors:  Sami Altayyar; Abdel Monim Artoli
Journal:  Bioinformation       Date:  2022-01-31

2.  Oculus: faster sequence alignment by streaming read compression.

Authors:  Brendan A Veeneman; Matthew K Iyer; Arul M Chinnaiyan
Journal:  BMC Bioinformatics       Date:  2012-11-13       Impact factor: 3.169

3.  Developing the anemone Aiptasia as a tractable model for cnidarian-dinoflagellate symbiosis: the transcriptome of aposymbiotic A. pallida.

Authors:  Erik M Lehnert; Matthew S Burriesci; John R Pringle
Journal:  BMC Genomics       Date:  2012-06-22       Impact factor: 3.969

4.  G-CNV: A GPU-Based Tool for Preparing Data to Detect CNVs with Read-Depth Methods.

Authors:  Andrea Manconi; Emanuele Manca; Marco Moscatelli; Matteo Gnocchi; Alessandro Orro; Giuliano Armano; Luciano Milanesi
Journal:  Front Bioeng Biotechnol       Date:  2015-03-10

5.  Rapid evolution of chemosensory receptor genes in a pair of sibling species of orchid bees (Apidae: Euglossini).

Authors:  Philipp Brand; Santiago R Ramírez; Florian Leese; J Javier G Quezada-Euan; Ralph Tollrian; Thomas Eltz
Journal:  BMC Evol Biol       Date:  2015-08-28       Impact factor: 3.260

6.  Reference-based compression of short-read sequences using path encoding.

Authors:  Carl Kingsford; Rob Patro
Journal:  Bioinformatics       Date:  2015-02-02       Impact factor: 6.937

7.  Draft Genome Sequence of the Archiascomycetous Yeast Saitoella complicata.

Authors:  Kenta Yamauchi; Shinji Kondo; Makiko Hamamoto; Yurika Takahashi; Yoshitoshi Ogura; Tetsuya Hayashi; Hiromi Nishida
Journal:  Genome Announc       Date:  2015-05-28

8.  Removing duplicate reads using graphics processing units.

Authors:  Andrea Manconi; Marco Moscatelli; Giuliano Armano; Matteo Gnocchi; Alessandro Orro; Luciano Milanesi
Journal:  BMC Bioinformatics       Date:  2016-11-08       Impact factor: 3.169

9.  Effect of method of deduplication on estimation of differential gene expression using RNA-seq.

Authors:  Anna V Klepikova; Artem S Kasianov; Mikhail S Chesnokov; Natalia L Lazarevich; Aleksey A Penin; Maria Logacheva
Journal:  PeerJ       Date:  2017-03-16       Impact factor: 2.984

10.  FastUniq: a fast de novo duplicates removal tool for paired short reads.

Authors:  Haibin Xu; Xiang Luo; Jun Qian; Xiaohui Pang; Jingyuan Song; Guangrui Qian; Jinhui Chen; Shilin Chen
Journal:  PLoS One       Date:  2012-12-20       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.