Literature DB >> 35524113

UNAGI: Yeast Transcriptome Reconstruction and Gene Discovery Using Nanopore Sequencing.

Mohamad Al Kadi1, Nicolas Jung2, Daisuke Okuzaki3,4,5.   

Abstract

Computational approaches are the main approaches used in genome annotation. However, accuracy is low. Untranslated regions are not identified, complex isoforms are not predicted correctly and discovery rate of noncoding RNA is low. RNA-seq has revolutionized transcriptome reconstruction over the last decade. However, fragmentation included in cDNA sequencing leads to information loss, requiring transcripts to be assembled and reconstructed, thus affecting the accuracy of reconstructed transcriptome. Recently, long-read sequencing has been introduced with technologies such as Oxford Nanopore sequencing. cDNA is sequenced directly without fragmentation producing long reads that don't need to be assembled keeping the transcript structure intact and increasing the accuracy of transcriptome reconstruction.Here we present a protocol and a pipeline to reconstruct the transcriptome of compact genomes including yeasts. It involves generating full-length cDNA and using Oxford Nanopore ligation-based sequencing kit to sequence multiple samples in the same run. The pipeline (1) strands the generated long reads, (2) corrects the reads by mapping them to the reference genome, (3) identifies transcripts including 5'UTR and 3'UTR, (4) profiles the isoforms, filtering out artifacts resulting from low accuracy in sequencing, and (5) improves accuracy of provided annotations. Using long reads improves the accuracy of transcriptome reconstruction and helps in discovering a significant number of novel RNAs.
© 2022. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.

Entities:  

Keywords:  Full-length cDNA; Long-read sequencing; Nanopore; Pipeline; Transcriptome reconstruction

Mesh:

Substances:

Year:  2022        PMID: 35524113     DOI: 10.1007/978-1-0716-2257-5_6

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  9 in total

1.  BRAKER1: Unsupervised RNA-Seq-Based Genome Annotation with GeneMark-ET and AUGUSTUS.

Authors:  Katharina J Hoff; Simone Lange; Alexandre Lomsadze; Mark Borodovsky; Mario Stanke
Journal:  Bioinformatics       Date:  2015-11-11       Impact factor: 6.937

2.  RNA-Seq improves annotation of protein-coding genes in the cucumber genome.

Authors:  Zhen Li; Zhonghua Zhang; Pengcheng Yan; Sanwen Huang; Zhangjun Fei; Kui Lin
Journal:  BMC Genomics       Date:  2011-11-02       Impact factor: 3.969

3.  StringTie enables improved reconstruction of a transcriptome from RNA-seq reads.

Authors:  Mihaela Pertea; Geo M Pertea; Corina M Antonescu; Tsung-Cheng Chang; Joshua T Mendell; Steven L Salzberg
Journal:  Nat Biotechnol       Date:  2015-02-18       Impact factor: 54.908

4.  Next-generation genome annotation: we still struggle to get it right.

Authors:  Steven L Salzberg
Journal:  Genome Biol       Date:  2019-05-16       Impact factor: 13.583

5.  Improved annotation of the domestic pig genome through integration of Iso-Seq and RNA-seq data.

Authors:  H Beiki; H Liu; J Huang; N Manchanda; D Nonneman; T P L Smith; J M Reecy; C K Tuggle
Journal:  BMC Genomics       Date:  2019-05-07       Impact factor: 3.969

6.  A benchmark study of ab initio gene prediction methods in diverse eukaryotic organisms.

Authors:  Nicolas Scalzitti; Anne Jeannin-Girardon; Pierre Collet; Olivier Poch; Julie D Thompson
Journal:  BMC Genomics       Date:  2020-04-09       Impact factor: 3.969

7.  A pipeline for automated annotation of yeast genome sequences by a conserved-synteny approach.

Authors:  Estelle Proux-Wéra; David Armisén; Kevin P Byrne; Kenneth H Wolfe
Journal:  BMC Bioinformatics       Date:  2012-09-17       Impact factor: 3.169

8.  Hybrid transcriptome sequencing approach improved assembly and gene annotation in Cynara cardunculus (L.).

Authors:  Giuseppe D Puglia; Andrey D Prjibelski; Domenico Vitale; Elena Bushmanova; Karl J Schmid; Salvatore A Raccuia
Journal:  BMC Genomics       Date:  2020-08-21       Impact factor: 3.969

9.  UNAGI: an automated pipeline for nanopore full-length cDNA sequencing uncovers novel transcripts and isoforms in yeast.

Authors:  Mohamad Al Kadi; Nicolas Jung; Shingo Ito; Shoichiro Kameoka; Takashi Hishida; Daisuke Motooka; Shota Nakamura; Tetsuya Iida; Daisuke Okuzaki
Journal:  Funct Integr Genomics       Date:  2020-01-18       Impact factor: 3.410

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.