LSCplus: a fast solution for improving long read accuracy by short read alignment.

Ruifeng Hu, Guibo Sun, Xiaobo Sun.

Abstract

BACKGROUND: The single molecule, real time (SMRT) sequencing technology of Pacific Biosciences enables the acquisition of transcripts from end to end because it produces extraordinarily long reads (>10 kb). This new method of transcriptome sequencing has been applied to several projects on humans and model organisms. However, the raw data from SMRT sequencing are of relatively low quality, with a random error rate of approximately 15%, so error correction using next-generation sequencing (NGS) short reads is typically necessary. Few tools have been designed that apply a hybrid sequencing approach combining NGS and SMRT data, and the most popular existing tool for error correction, LSC, has computing resource requirements that are too intensive for most laboratories and research groups. These shortcomings severely limit the application of SMRT long reads for transcriptome analysis.
RESULTS: Here, we report an improved tool (LSCplus) for error correction, with the LSC program as a reference. LSCplus overcomes LSC's long running time and improves quality. LSCplus requires only 1/3-1/4 of the time, and 1/20-1/25 of the error correction time, that LSC requires.
CONCLUSIONS: LSCplus is freely available at http://www.herbbol.org:8001/lscplus/. Sample calculations are provided illustrating the precision and efficiency of this method regarding error correction and isoform detection.
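
As a rough illustration of the figures quoted in the abstract, the short Python sketch below converts the reported time fractions into speedup factors and estimates how many erroneous bases a raw read would carry at the stated error rate. The function names are hypothetical and are not part of LSC or LSCplus; the 10 kb read length is simply the abstract's lower bound for a long read, used here as an example.

```python
from fractions import Fraction


def speedup(time_fraction: Fraction) -> Fraction:
    """Turn 'fraction of the reference tool's time' into a speedup factor."""
    return 1 / time_fraction


def expected_errors(read_length: int, error_rate: float) -> float:
    """Expected number of erroneous bases in one read (illustrative only)."""
    return read_length * error_rate


# Fractions quoted in the abstract: 1/3-1/4 of the time,
# and 1/20-1/25 of the error correction time.
time_range = (speedup(Fraction(1, 3)), speedup(Fraction(1, 4)))
correction_range = (speedup(Fraction(1, 20)), speedup(Fraction(1, 25)))

# Assumed example: a 10 kb read at the ~15% raw error rate from the abstract.
errors_per_read = expected_errors(10_000, 0.15)

print(f"time speedup: {time_range[0]}x to {time_range[1]}x")
print(f"error correction speedup: {correction_range[0]}x to {correction_range[1]}x")
print(f"expected errors in a 10 kb raw read: about {errors_per_read:.0f}")
```

Read this way, the abstract's fractions correspond to a 3x to 4x speedup in time and a 20x to 25x speedup in the error correction step, while a single 10 kb raw read would be expected to carry on the order of 1,500 erroneous bases before correction.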

Keywords:  Error correction; RNA-seq; SMRT sequencing; Time-consumption

Year:  2016        PMID: 27829364      PMCID: PMC5103424          DOI: 10.1186/s12859-016-1316-y

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  9 in total

1.  Haplotype-resolved genome assembly enables gene discovery in the red palm weevil Rhynchophorus ferrugineus.

Authors:  Guilherme B Dias; Musaad A Altammami; Hamadttu A F El-Shafie; Fahad M Alhoshani; Mohamed B Al-Fageeh; Casey M Bergman; Manee M Manee
Journal:  Sci Rep       Date:  2021-05-11       Impact factor: 4.379

Review 2.  Computational Methods for Mapping, Assembly and Quantification for Coding and Non-coding Transcripts.

Authors:  Isaac A Babarinde; Yuhao Li; Andrew P Hutchins
Journal:  Comput Struct Biotechnol J       Date:  2019-05-07       Impact factor: 7.271

3.  Hardware Acceleration of Genomics Data Analysis: Challenges and Opportunities.

Authors:  Tony Robinson; Jim Harkin; Priyank Shukla
Journal:  Bioinformatics       Date:  2021-05-25       Impact factor: 6.937

4.  Transcriptome analysis identifies putative genes involved in triterpenoid biosynthesis in Platycodon grandiflorus.

Authors:  Hanwen Yu; Mengli Liu; Minzhen Yin; Tingyu Shan; Huasheng Peng; Jutao Wang; Xiangwei Chang; Daiyin Peng; Liangping Zha; Shuangying Gui
Journal:  Planta       Date:  2021-07-21       Impact factor: 4.116

5.  Reconstruction of the full-length transcriptome of cigar tobacco without a reference genome and characterization of anion channel/transporter transcripts.

Authors:  Hui Zhang; Jingjing Jin; Guoyun Xu; Zefeng Li; Niu Zhai; Qingxia Zheng; Hongkun Lv; Pingping Liu; Lifeng Jin; Qiansi Chen; Peijian Cao; Huina Zhou
Journal:  BMC Plant Biol       Date:  2021-06-29       Impact factor: 4.215

6.  A Sequence-Based Novel Approach for Quality Evaluation of Third-Generation Sequencing Reads.

Authors:  Wenjing Zhang; Neng Huang; Jiantao Zheng; Xingyu Liao; Jianxin Wang; Hong-Dong Li
Journal:  Genes (Basel)       Date:  2019-01-14       Impact factor: 4.096

7.  Iso-Seq Allows Genome-Independent Transcriptome Profiling of Grape Berry Development.

Authors:  Andrea Minio; Mélanie Massonnet; Rosa Figueroa-Balderas; Amanda M Vondras; Barbara Blanco-Ulate; Dario Cantu
Journal:  G3 (Bethesda)       Date:  2019-03-07       Impact factor: 3.154

8.  Performance difference of graph-based and alignment-based hybrid error correction methods for error-prone long reads.

Authors:  Anqi Wang; Kin Fai Au
Journal:  Genome Biol       Date:  2020-01-17       Impact factor: 13.583

9.  Illuminating the dark side of the human transcriptome with long read transcript sequencing.

Authors:  Richard I Kuo; Yuanyuan Cheng; Runxuan Zhang; John W S Brown; Jacqueline Smith; Alan L Archibald; David W Burt
Journal:  BMC Genomics       Date:  2020-10-30       Impact factor: 3.969

