| Literature DB >> 35922509 |
Dafni A Glinos1,2, Garrett Garborcauskas3, Paul Hoffman4, Nava Ehsan5, Lihua Jiang6, Alper Gokden4, Xiaoguang Dai7, François Aguet8, Kathleen L Brown4,9, Kiran Garimella8, Tera Bowers8, Maura Costello8, Kristin Ardlie8, Ruiqi Jian6, Nathan R Tucker10,11, Patrick T Ellinor11, Eoghan D Harrington7, Hua Tang6, Michael Snyder6, Sissel Juul7, Pejman Mohammadi5,12, Daniel G MacArthur13,14,15, Tuuli Lappalainen16,17,18, Beryl B Cummings19.
Abstract
Regulation of transcript structure generates transcript diversity and plays an important role in human disease1-7. The advent of long-read sequencing technologies offers the opportunity to study the role of genetic variation in transcript structure8-16. In this Article, we present a large human long-read RNA-seq dataset using the Oxford Nanopore Technologies platform from 88 samples from Genotype-Tissue Expression (GTEx) tissues and cell lines, complementing the GTEx resource. We identified just over 70,000 novel transcripts for annotated genes, and validated the protein expression of 10% of novel transcripts. We developed a new computational package, LORALS, to analyse the genetic effects of rare and common variants on the transcriptome by allele-specific analysis of long reads. We characterized allele-specific expression and transcript structure events, providing new insights into the specific transcript alterations caused by common and rare genetic variants and highlighting the resolution gained from long-read data. We were able to perturb the transcript structure upon knockdown of PTBP1, an RNA binding protein that mediates splicing, thereby finding genetic regulatory effects that are modified by the cellular environment. Finally, we used this dataset to enhance variant interpretation and study rare variants leading to aberrant splicing patterns.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35922509 DOI: 10.1038/s41586-022-05035-y
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 69.504