Literature DB >> 29444201

Hierarchical analysis of RNA-seq reads improves the accuracy of allele-specific expression.

Narayanan Raghupathy1, Kwangbom Choi1, Matthew J Vincent1, Glen L Beane1, Keith S Sheppard1, Steven C Munger1, Ron Korstanje1, Fernando Pardo-Manual de Villena2, Gary A Churchill1.   

Abstract

Motivation: Allele-specific expression (ASE) refers to the differential abundance of the allelic copies of a transcript. RNA sequencing (RNA-seq) can provide quantitative estimates of ASE for genes with transcribed polymorphisms. When short-read sequences are aligned to a diploid transcriptome, read-mapping ambiguities confound our ability to directly count reads. Multi-mapping reads aligning equally well to multiple genomic locations, isoforms or alleles can comprise the majority (>85%) of reads. Discarding them can result in biases and substantial loss of information. Methods have been developed that use weighted allocation of read counts but these methods treat the different types of multi-reads equivalently. We propose a hierarchical approach to allocation of read counts that first resolves ambiguities among genes, then among isoforms, and lastly between alleles. We have implemented our model in EMASE software (Expectation-Maximization for Allele Specific Expression) to estimate total gene expression, isoform usage and ASE based on this hierarchical allocation.
Results: Methods that align RNA-seq reads to a diploid transcriptome incorporating known genetic variants improve estimates of ASE and total gene expression compared to methods that use reference genome alignments. Weighted allocation methods outperform methods that discard multi-reads. Hierarchical allocation of reads improves estimation of ASE even when data are simulated from a non-hierarchical model. Analysis of RNA-seq data from F1 hybrid mice using EMASE reveals widespread ASE associated with cis-acting polymorphisms and a small number of parent-of-origin effects. Availability and implementation: EMASE software is available at https://github.com/churchill-lab/emase. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2018        PMID: 29444201      PMCID: PMC6022640          DOI: 10.1093/bioinformatics/bty078

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  29 in total

1.  Near-optimal probabilistic RNA-seq quantification.

Authors:  Nicolas L Bray; Harold Pimentel; Páll Melsted; Lior Pachter
Journal:  Nat Biotechnol       Date:  2016-04-04       Impact factor: 54.908

2.  Tools and best practices for data processing in allelic expression analysis.

Authors:  Stephane E Castel; Ami Levy-Moonshine; Pejman Mohammadi; Eric Banks; Tuuli Lappalainen
Journal:  Genome Biol       Date:  2015-09-17       Impact factor: 13.583

3.  Haplotype and isoform specific expression estimation using multi-mapping RNA-seq reads.

Authors:  Ernest Turro; Shu-Yi Su; Ângela Gonçalves; Lachlan J M Coin; Sylvia Richardson; Alex Lewin
Journal:  Genome Biol       Date:  2011-02-10       Impact factor: 13.583

4.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.

Authors:  Bo Li; Colin N Dewey
Journal:  BMC Bioinformatics       Date:  2011-08-04       Impact factor: 3.307

5.  voom: Precision weights unlock linear model analysis tools for RNA-seq read counts.

Authors:  Charity W Law; Yunshun Chen; Wei Shi; Gordon K Smyth
Journal:  Genome Biol       Date:  2014-02-03       Impact factor: 13.583

6.  PRDM9 drives evolutionary erosion of hotspots in Mus musculus through haplotype-specific initiation of meiotic recombination.

Authors:  Christopher L Baker; Shimpei Kajita; Michael Walker; Ruth L Saxl; Narayanan Raghupathy; Kwangbom Choi; Petko M Petkov; Kenneth Paigen
Journal:  PLoS Genet       Date:  2015-01-08       Impact factor: 5.917

7.  Erratum to: A survey of best practices for RNA-seq data analysis.

Authors:  Ana Conesa; Pedro Madrigal; Sonia Tarazona; David Gomez-Cabrero; Alejandra Cervera; Andrew McPherson; Michal Wojciech Szcześniak; Daniel J Gaffney; Laura L Elo; Xuegong Zhang; Ali Mortazavi
Journal:  Genome Biol       Date:  2016-08-26       Impact factor: 13.583

8.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

9.  Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data.

Authors:  Jacob F Degner; John C Marioni; Athma A Pai; Joseph K Pickrell; Everlyne Nkadori; Yoav Gilad; Jonathan K Pritchard
Journal:  Bioinformatics       Date:  2009-10-06       Impact factor: 6.937

10.  Sources of bias in measures of allele-specific expression derived from RNA-sequence data aligned to a single reference genome.

Authors:  Kraig R Stevenson; Joseph D Coolon; Patricia J Wittkopp
Journal:  BMC Genomics       Date:  2013-08-07       Impact factor: 3.969

View more
  30 in total

1.  Statistical Modeling of High Dimensional Counts.

Authors:  Michael I Love
Journal:  Methods Mol Biol       Date:  2021

2.  Mapping the Effects of Genetic Variation on Chromatin State and Gene Expression Reveals Loci That Control Ground State Pluripotency.

Authors:  Daniel A Skelly; Anne Czechanski; Candice Byers; Selcan Aydin; Catrina Spruce; Chris Olivier; Kwangbom Choi; Daniel M Gatti; Narayanan Raghupathy; Gregory R Keele; Alexander Stanton; Matthew Vincent; Stephanie Dion; Ian Greenstein; Matthew Pankratz; Devin K Porter; Whitney Martin; Callan O'Connor; Wenning Qin; Alison H Harrill; Ted Choi; Gary A Churchill; Steven C Munger; Christopher L Baker; Laura G Reinholdt
Journal:  Cell Stem Cell       Date:  2020-08-13       Impact factor: 24.633

3.  High-Resolution Mapping of Multiway Enhancer-Promoter Interactions Regulating Pathogen Detection.

Authors:  Pranitha Vangala; Rachel Murphy; Sofia A Quinodoz; Kyle Gellatly; Patrick McDonel; Mitchell Guttman; Manuel Garber
Journal:  Mol Cell       Date:  2020-09-28       Impact factor: 17.970

Review 4.  Pangenome Graphs.

Authors:  Jordan M Eizenga; Adam M Novak; Jonas A Sibbesen; Simon Heumos; Ali Ghaffaari; Glenn Hickey; Xian Chang; Josiah D Seaman; Robin Rounthwaite; Jana Ebler; Mikko Rautiainen; Shilpa Garg; Benedict Paten; Tobias Marschall; Jouni Sirén; Erik Garrison
Journal:  Annu Rev Genomics Hum Genet       Date:  2020-05-26       Impact factor: 8.929

5.  Genetic Drivers of Pancreatic Islet Function.

Authors:  Mark P Keller; Daniel M Gatti; Kathryn L Schueler; Mary E Rabaglia; Donnie S Stapleton; Petr Simecek; Matthew Vincent; Sadie Allen; Aimee Teo Broman; Rhonda Bacher; Christina Kendziorski; Karl W Broman; Brian S Yandell; Gary A Churchill; Alan D Attie
Journal:  Genetics       Date:  2018-03-22       Impact factor: 4.562

Review 6.  Understanding the Transcriptomic Landscape to Drive New Innovations in Musculoskeletal Regenerative Medicine.

Authors:  Stacey M Thomas; Cheryl L Ackert-Bicknell; Michael J Zuscik; Karin A Payne
Journal:  Curr Osteoporos Rep       Date:  2022-02-14       Impact factor: 5.096

7.  Quantification of mutant-allele expression at isoform level in cancer from RNA-seq data.

Authors:  Wenjiang Deng; Tian Mou; Yudi Pawitan; Trung Nghia Vu
Journal:  NAR Genom Bioinform       Date:  2022-07-13

8.  Nfkbid Overexpression in Nonobese Diabetic Mice Elicits Complete Type 1 Diabetes Resistance in Part Associated with Enhanced Thymic Deletion of Pathogenic CD8 T Cells and Increased Numbers and Activity of Regulatory T Cells.

Authors:  Jennifer R Dwyer; Jeremy J Racine; Harold D Chapman; Anna Quinlan; Maximiliano Presa; Grace A Stafford; Ingo Schmitz; David V Serreze
Journal:  J Immunol       Date:  2022-06-27       Impact factor: 5.426

9.  Identification of a Core Module for Bone Mineral Density through the Integration of a Co-expression Network and GWAS Data.

Authors:  Olivia L Sabik; Gina M Calabrese; Eric Taleghani; Cheryl L Ackert-Bicknell; Charles R Farber
Journal:  Cell Rep       Date:  2020-09-15       Impact factor: 9.423

10.  Feasibility of predicting allele specific expression from DNA sequencing using machine learning.

Authors:  Zhenhua Zhang; Freerk van Dijk; Niek de Klein; Mariëlle E van Gijn; Lude H Franke; Richard J Sinke; Morris A Swertz; K Joeri van der Velde
Journal:  Sci Rep       Date:  2021-05-19       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.