Wilson McKerrow1,2, David Fenyö1,2. 1. Institute for Systems Genetics, USA. 2. Department for Biochemistry and Molecular Pharmacology, NYU School of Medicine, New York, NY, USA.
Abstract
MOTIVATION: LINE-1 elements are retrotransposons that are capable of copying their sequence to new genomic loci. LINE-1 derepression is associated with a number of disease states, and has the potential to cause significant cellular damage. Because LINE-1 elements are repetitive, it is difficult to quantify LINE-1 RNA at specific loci and to separate transcripts with protein coding capability from other sources of LINE-1 RNA. RESULTS: We provide a tool, L1EM that uses the expectation maximization algorithm to quantify LINE-1 RNA at each genomic locus, separating transcripts that are capable of generating retrotransposition from those that are not. We show the accuracy of L1EM on simulated data and against long read sequencing from HEK cells. AVAILABILITY AND IMPLEMENTATION: L1EM is written in python. The source code along with the necessary annotations are available at https://github.com/FenyoLab/L1EM and distributed under GPLv3. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: LINE-1 elements are retrotransposons that are capable of copying their sequence to new genomic loci. LINE-1 derepression is associated with a number of disease states, and has the potential to cause significant cellular damage. Because LINE-1 elements are repetitive, it is difficult to quantify LINE-1 RNA at specific loci and to separate transcripts with protein coding capability from other sources of LINE-1 RNA. RESULTS: We provide a tool, L1EM that uses the expectation maximization algorithm to quantify LINE-1 RNA at each genomic locus, separating transcripts that are capable of generating retrotransposition from those that are not. We show the accuracy of L1EM on simulated data and against long read sequencing from HEK cells. AVAILABILITY AND IMPLEMENTATION: L1EM is written in python. The source code along with the necessary annotations are available at https://github.com/FenyoLab/L1EM and distributed under GPLv3. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Wan R Yang; Daniel Ardeljan; Clarissa N Pacyna; Lindsay M Payer; Kathleen H Burns Journal: Nucleic Acids Res Date: 2019-03-18 Impact factor: 16.971
Authors: W Wei; N Gilbert; S L Ooi; J F Lawler; E M Ostertag; H H Kazazian; J D Boeke; J V Moran Journal: Mol Cell Biol Date: 2001-02 Impact factor: 4.272
Authors: Nemanja Rodić; Reema Sharma; Rajni Sharma; John Zampella; Lixin Dai; Martin S Taylor; Ralph H Hruban; Christine A Iacobuzio-Donahue; Anirban Maitra; Michael S Torbenson; Michael Goggins; Ie-Ming Shih; Amy S Duffield; Elizabeth A Montgomery; Edward Gabrielson; George J Netto; Tamara L Lotan; Angelo M De Marzo; William Westra; Zev A Binder; Brent A Orr; Gary L Gallia; Charles G Eberhart; Jef D Boeke; Chris R Harris; Kathleen H Burns Journal: Am J Pathol Date: 2014-03-06 Impact factor: 4.307
Authors: Hiroki Kano; Irene Godoy; Christine Courtney; Melissa R Vetter; George L Gerton; Eric M Ostertag; Haig H Kazazian Journal: Genes Dev Date: 2009-06-01 Impact factor: 11.361
Authors: Eugene J Gardner; Vincent K Lam; Daniel N Harris; Nelson T Chuang; Emma C Scott; W Stephen Pittard; Ryan E Mills; Scott E Devine Journal: Genome Res Date: 2017-08-30 Impact factor: 9.043
Authors: Darren Taylor; Robert Lowe; Claude Philippe; Kevin C L Cheng; Olivia A Grant; Nicolae Radu Zabet; Gael Cristofari; Miguel R Branco Journal: Nucleic Acids Res Date: 2022-04-08 Impact factor: 16.971
Authors: Wilson McKerrow; Xuya Wang; Carlos Mendez-Dorantes; Paolo Mita; Song Cao; Mark Grivainis; Li Ding; John LaCava; Kathleen H Burns; Jef D Boeke; David Fenyö Journal: Proc Natl Acad Sci U S A Date: 2022-02-22 Impact factor: 12.779