| Literature DB >> 35013600 |
Francisca Rojas Ringeling1, Shounak Chakraborty1, Caroline Vissers2, Derek Reiman3, Akshay M Patel1, Ki-Heon Lee4, Ari Hong5,6, Chan-Woo Park4, Tim Reska1, Julien Gagneur7,8,9, Hyeshik Chang5,6,10, Maria L Spletter11, Ki-Jun Yoon4, Guo-Li Ming12, Hongjun Song12, Stefan Canzar13.
Abstract
The accuracy of methods for assembling transcripts from short-read RNA sequencing data is limited by the lack of long-range information. Here we introduce Ladder-seq, an approach that separates transcripts according to their lengths before sequencing and uses the additional information to improve the quantification and assembly of transcripts. Using simulated data, we show that a kallisto algorithm extended to process Ladder-seq data quantifies transcripts of complex genes with substantially higher accuracy than conventional kallisto. For reference-based assembly, a tailored scheme based on the StringTie2 algorithm reconstructs a single transcript with 30.8% higher precision than its conventional counterpart and is more than 30% more sensitive for complex genes. For de novo assembly, a similar scheme based on the Trinity algorithm correctly assembles 78% more transcripts than conventional Trinity while improving precision by 78%. In experimental data, Ladder-seq reveals 40% more genes harboring isoform switches compared to conventional RNA sequencing and unveils widespread changes in isoform usage upon m6A depletion by Mettl14 knockout.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35013600 DOI: 10.1038/s41587-021-01136-7
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 68.164