Literature DB >> 24833809

A Bayesian Semi-parametric Approach for the Differential Analysis of Sequence Counts Data.

Michele Guindani1, Nuno Sepúlveda2, Carlos Daniel Paulino3, Peter Müller4.   

Abstract

Data obtained using modern sequencing technologies are often summarized by recording the frequencies of observed sequences. Examples include the analysis of T cell counts in immunological research and studies of gene expression based on counts of RNA fragments. In both cases the items being counted are sequences, of proteins and base pairs, respectively. The resulting sequence-abundance distribution is usually characterized by overdispersion. We propose a Bayesian semi-parametric approach to implement inference for such data. Besides modeling the overdispersion, the approach takes also into account two related sources of bias that are usually associated with sequence counts data: some sequence types may not be recorded during the experiment and the total count may differ from one experiment to another. We illustrate our methodology with two data sets, one regarding the analysis of CD4+ T cell counts in healthy and diabetic mice and another data set concerning the comparison of mRNA fragments recorded in a Serial Analysis of Gene Expression (SAGE) experiment with gastrointestinal tissue of healthy and cancer patients.

Entities:  

Year:  2014        PMID: 24833809      PMCID: PMC4017673          DOI: 10.1111/rssc.12041

Source DB:  PubMed          Journal:  J R Stat Soc Ser C Appl Stat        ISSN: 0035-9254            Impact factor:   1.864


  28 in total

1.  Differential expression in SAGE: accounting for normal between-library variation.

Authors:  Keith A Baggerly; Li Deng; Jeffrey S Morris; C Marcelo Aldaz
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

2.  A Bayesian nonparametric approach for comparing clustering structures in EST libraries.

Authors:  Antonio Lijoi; Ramsés H Mena; Igor Prünster
Journal:  J Comput Biol       Date:  2008-12       Impact factor: 1.479

3.  Non-obese diabetic mice select a low-diversity repertoire of natural regulatory T cells.

Authors:  Cristina Ferreira; Yogesh Singh; Anna L Furmanski; F Susan Wong; Oliver A Garden; Julian Dyson
Journal:  Proc Natl Acad Sci U S A       Date:  2009-04-09       Impact factor: 11.205

4.  Bayesian Modeling of MPSS Data: Gene Expression Analysis of Bovine Salmonella Infection.

Authors:  Soma S Dhavala; Sujay Datta; Bani K Mallick; Raymond J Carroll; Sangeeta Khare; Sara D Lawhon; L Garry Adams
Journal:  J Am Stat Assoc       Date:  2010-09-01       Impact factor: 5.033

5.  baySeq: empirical Bayesian methods for identifying differential expression in sequence count data.

Authors:  Thomas J Hardcastle; Krystyna A Kelly
Journal:  BMC Bioinformatics       Date:  2010-08-10       Impact factor: 3.169

Review 6.  The many important facets of T-cell repertoire diversity.

Authors:  Janko Nikolich-Zugich; Mark K Slifka; Ilhem Messaoudi
Journal:  Nat Rev Immunol       Date:  2004-02       Impact factor: 53.106

7.  Fast Bayesian Inference in Dirichlet Process Mixture Models.

Authors:  Lianming Wang; David B Dunson
Journal:  J Comput Graph Stat       Date:  2011-01-01       Impact factor: 2.302

8.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

9.  Modeling SAGE tag formation and its effects on data interpretation within a Bayesian framework.

Authors:  Michael A Gilchrist; Hong Qin; Russell Zaretzki
Journal:  BMC Bioinformatics       Date:  2007-10-18       Impact factor: 3.169

10.  Modeling Sage data with a truncated gamma-Poisson model.

Authors:  Helene H Thygesen; Aeilko H Zwinderman
Journal:  BMC Bioinformatics       Date:  2006-03-20       Impact factor: 3.169

View more
  4 in total

1.  Bayesian Nonparametric Inference - Why and How.

Authors:  Peter Müller; Riten Mitra
Journal:  Bayesian Anal       Date:  2013       Impact factor: 3.728

2.  Quantification of Inter-Sample Differences in T-Cell Receptor Repertoires Using Sequence-Based Information.

Authors:  Ryo Yokota; Yuki Kaminaga; Tetsuya J Kobayashi
Journal:  Front Immunol       Date:  2017-11-15       Impact factor: 7.561

Review 3.  Machine Learning Approaches to TCR Repertoire Analysis.

Authors:  Yotaro Katayama; Ryo Yokota; Taishin Akiyama; Tetsuya J Kobayashi
Journal:  Front Immunol       Date:  2022-07-15       Impact factor: 8.786

4.  powerTCR: A model-based approach to comparative analysis of the clone size distribution of the T cell receptor repertoire.

Authors:  Hillary Koch; Dmytro Starenki; Sara J Cooper; Richard M Myers; Qunhua Li
Journal:  PLoS Comput Biol       Date:  2018-11-28       Impact factor: 4.475

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.