Literature DB >> 33648554

2passtools: two-pass alignment using machine-learning-filtered splice junctions increases the accuracy of intron detection in long-read RNA sequencing.

Matthew T Parker1, Katarzyna Knop2, Geoffrey J Barton2, Gordon G Simpson3,4.   

Abstract

Transcription of eukaryotic genomes involves complex alternative processing of RNAs. Sequencing of full-length RNAs using long reads reveals the true complexity of processing. However, the relatively high error rates of long-read sequencing technologies can reduce the accuracy of intron identification. Here we apply alignment metrics and machine-learning-derived sequence information to filter spurious splice junctions from long-read alignments and use the remaining junctions to guide realignment in a two-pass approach. This method, available in the software package 2passtools ( https://github.com/bartongroup/2passtools ), improves the accuracy of spliced alignment and transcriptome assembly for species both with and without existing high-quality annotations.

Entities:  

Keywords:  Gene expression; Long-read sequencing; Machine learning; Nanopore sequencing; RNA-seq; Spliced alignment; Splicing; Transcriptome assembly

Mesh:

Substances:

Year:  2021        PMID: 33648554      PMCID: PMC7919322          DOI: 10.1186/s13059-021-02296-0

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


  41 in total

1.  A handful of intron-containing genes produces the lion's share of yeast mRNA.

Authors:  M Ares; L Grate; M H Pauling
Journal:  RNA       Date:  1999-09       Impact factor: 4.942

2.  Proteogenomic analysis reveals alternative splicing and translation as part of the abscisic acid response in Arabidopsis seedlings.

Authors:  Fu-Yuan Zhu; Mo-Xian Chen; Neng-Hui Ye; Lu Shi; Kai-Long Ma; Jing-Fang Yang; Yun-Ying Cao; Youjun Zhang; Takuya Yoshida; Alisdair R Fernie; Guang-Yi Fan; Bo Wen; Ruo Zhou; Tie-Yuan Liu; Tao Fan; Bei Gao; Di Zhang; Ge-Fei Hao; Shi Xiao; Ying-Gao Liu; Jianhua Zhang
Journal:  Plant J       Date:  2017-06-07       Impact factor: 6.417

3.  Snakemake--a scalable bioinformatics workflow engine.

Authors:  Johannes Köster; Sven Rahmann
Journal:  Bioinformatics       Date:  2012-08-20       Impact factor: 6.937

4.  Highly parallel direct RNA sequencing on an array of nanopores.

Authors:  Daniel R Garalde; Elizabeth A Snell; Daniel Jachimowicz; Botond Sipos; Joseph H Lloyd; Mark Bruce; Nadia Pantic; Tigist Admassu; Phillip James; Anthony Warland; Michael Jordan; Jonah Ciccone; Sabrina Serra; Jemma Keenan; Samuel Martin; Luke McNeill; E Jayne Wallace; Lakmal Jayasinghe; Chris Wright; Javier Blasco; Stephen Young; Denise Brocklebank; Sissel Juul; James Clarke; Andrew J Heron; Daniel J Turner
Journal:  Nat Methods       Date:  2018-01-15       Impact factor: 28.547

5.  StringTie enables improved reconstruction of a transcriptome from RNA-seq reads.

Authors:  Mihaela Pertea; Geo M Pertea; Corina M Antonescu; Tsung-Cheng Chang; Joshua T Mendell; Steven L Salzberg
Journal:  Nat Biotechnol       Date:  2015-02-18       Impact factor: 54.908

6.  Apparent non-canonical trans-splicing is generated by reverse transcriptase in vitro.

Authors:  Jonathan Houseley; David Tollervey
Journal:  PLoS One       Date:  2010-08-18       Impact factor: 3.240

7.  Comprehensive splice-site analysis using comparative genomics.

Authors:  Nihar Sheth; Xavier Roca; Michelle L Hastings; Ted Roeder; Adrian R Krainer; Ravi Sachidanandam
Journal:  Nucleic Acids Res       Date:  2006-08-12       Impact factor: 16.971

8.  Efficient and accurate detection of splice junctions from RNA-seq with Portcullis.

Authors:  Daniel Mapleson; Luca Venturini; Gemy Kaithakottil; David Swarbreck
Journal:  Gigascience       Date:  2018-12-01       Impact factor: 6.524

9.  Alternative isoform regulation in human tissue transcriptomes.

Authors:  Eric T Wang; Rickard Sandberg; Shujun Luo; Irina Khrebtukova; Lu Zhang; Christine Mayr; Stephen F Kingsmore; Gary P Schroth; Christopher B Burge
Journal:  Nature       Date:  2008-11-27       Impact factor: 49.962

10.  Illuminating the dark side of the human transcriptome with long read transcript sequencing.

Authors:  Richard I Kuo; Yuanyuan Cheng; Runxuan Zhang; John W S Brown; Jacqueline Smith; Alan L Archibald; David W Burt
Journal:  BMC Genomics       Date:  2020-10-30       Impact factor: 3.969

View more
  5 in total

1.  Widespread premature transcription termination of Arabidopsis thaliana NLR genes by the spen protein FPA.

Authors:  Matthew T Parker; Katarzyna Knop; Vasiliki Zacharaki; Anna V Sherwood; Daniel Tomé; Xuhong Yu; Pascal Gp Martin; Jim Beynon; Scott D Michaels; Geoffrey J Barton; Gordon G Simpson
Journal:  Elife       Date:  2021-04-27       Impact factor: 8.140

2.  NanoSplicer: Accurate identification of splice junctions using Oxford Nanopore sequencing.

Authors:  Yupei You; Michael B Clark; Heejung Shim
Journal:  Bioinformatics       Date:  2022-05-27       Impact factor: 6.931

3.  Quantitative analysis of the splice variants expressed by the major hepatitis B virus genotypes.

Authors:  Chun Shen Lim; Vitina Sozzi; Margaret Littlejohn; Lilly K W Yuen; Nadia Warner; Brigid Betz-Stablein; Fabio Luciani; Peter A Revill; Chris M Brown
Journal:  Microb Genom       Date:  2021-01

4.  Sequencing of individual barcoded cDNAs using Pacific Biosciences and Oxford Nanopore Technologies reveals platform-specific error patterns.

Authors:  Alla Mikheenko; Andrey D Prjibelski; Anoushka Joglekar; Hagen U Tilgner
Journal:  Genome Res       Date:  2022-03-17       Impact factor: 9.438

5.  Accurate spliced alignment of long RNA sequencing reads.

Authors:  Kristoffer Sahlin; Veli Mäkinen
Journal:  Bioinformatics       Date:  2021-07-24       Impact factor: 6.937

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.