Literature DB >> 25266226

Hybrid Bayesian-rank integration approach improves the predictive power of genomic dataset aggregation.

Marcus A Badgeley1, Stuart C Sealfon1, Maria D Chikina1.   

Abstract

MOTIVATION: Modern molecular technologies allow the collection of large amounts of high-throughput data on the functional attributes of genes. Often multiple technologies and study designs are used to address the same biological question such as which genes are overexpressed in a specific disease state. Consequently, there is considerable interest in methods that can integrate across datasets to present a unified set of predictions.
RESULTS: An important aspect of data integration is being able to account for the fact that datasets may differ in how accurately they capture the biological signal of interest. While many methods to address this problem exist, they always rely either on dataset internal statistics, which reflect data structure and not necessarily biological relevance, or external gold standards, which may not always be available. We present a new rank aggregation method for data integration that requires neither external standards nor internal statistics but relies on Bayesian reasoning to assess dataset relevance. We demonstrate that our method outperforms established techniques and significantly improves the predictive power of rank-based aggregations. We show that our method, which does not require an external gold standard, provides reliable estimates of dataset relevance and allows the same set of data to be integrated differently depending on the specific signal of interest. AVAILABILITY: The method is implemented in R and is freely available at http://www.pitt.edu/~mchikina/BIRRA/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Substances:

Year:  2014        PMID: 25266226      PMCID: PMC4287939          DOI: 10.1093/bioinformatics/btu518

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  15 in total

1.  PGC-1α, a potential therapeutic target for early intervention in Parkinson's disease.

Authors:  Bin Zheng; Zhixiang Liao; Joseph J Locascio; Kristen A Lesniak; Sarah S Roderick; Marla L Watt; Aron C Eklund; Yanli Zhang-James; Peter D Kim; Michael A Hauser; Edna Grünblatt; Linda B Moran; Silvia A Mandel; Peter Riederer; Renee M Miller; Howard J Federoff; Ullrich Wüllner; Spyridon Papapetropoulos; Moussa B Youdim; Ippolita Cantuti-Castelvetri; Anne B Young; Jeffery M Vance; Richard L Davis; John C Hedreen; Charles H Adler; Thomas G Beach; Manuel B Graeber; Frank A Middleton; Jean-Christophe Rochet; Clemens R Scherzer
Journal:  Sci Transl Med       Date:  2010-10-06       Impact factor: 17.956

2.  Common genetic variants account for differences in gene expression among ethnic groups.

Authors:  Richard S Spielman; Laurel A Bastone; Joshua T Burdick; Michael Morley; Warren J Ewens; Vivian G Cheung
Journal:  Nat Genet       Date:  2007-01-07       Impact factor: 38.330

3.  Nanog binds to Smad1 and blocks bone morphogenetic protein-induced differentiation of embryonic stem cells.

Authors:  Atsushi Suzuki; Ángel Raya; Yasuhiko Kawakami; Masanobu Morita; Takaaki Matsui; Kinichi Nakashima; Fred H Gage; Concepción Rodríguez-Esteban; Juan Carlos Izpisúa Belmonte
Journal:  Proc Natl Acad Sci U S A       Date:  2006-06-26       Impact factor: 11.205

4.  A single gene network accurately predicts phenotypic effects of gene perturbation in Caenorhabditis elegans.

Authors:  Insuk Lee; Ben Lehner; Catriona Crombie; Wendy Wong; Andrew G Fraser; Edward M Marcotte
Journal:  Nat Genet       Date:  2008-01-27       Impact factor: 38.330

5.  Exploring the human genome with functional maps.

Authors:  Curtis Huttenhower; Erin M Haley; Matthew A Hibbs; Vanessa Dumeaux; Daniel R Barrett; Hilary A Coller; Olga G Troyanskaya
Journal:  Genome Res       Date:  2009-02-26       Impact factor: 9.043

6.  Integration of external signaling pathways with the core transcriptional network in embryonic stem cells.

Authors:  Xi Chen; Han Xu; Ping Yuan; Fang Fang; Mikael Huss; Vinsensius B Vega; Eleanor Wong; Yuriy L Orlov; Weiwei Zhang; Jianming Jiang; Yuin-Han Loh; Hock Chuan Yeo; Zhen Xuan Yeo; Vipin Narang; Kunde Ramamoorthy Govindarajan; Bernard Leong; Atif Shahab; Yijun Ruan; Guillaume Bourque; Wing-Kin Sung; Neil D Clarke; Chia-Lin Wei; Huck-Hui Ng
Journal:  Cell       Date:  2008-06-13       Impact factor: 41.582

Review 7.  Comprehensive literature review and statistical considerations for microarray meta-analysis.

Authors:  George C Tseng; Debashis Ghosh; Eleanor Feingold
Journal:  Nucleic Acids Res       Date:  2012-01-19       Impact factor: 16.971

8.  Robust rank aggregation for gene list integration and meta-analysis.

Authors:  Raivo Kolde; Sven Laur; Priit Adler; Jaak Vilo
Journal:  Bioinformatics       Date:  2012-01-12       Impact factor: 6.937

9.  Bayesian integration of networks without gold standards.

Authors:  Jochen Weile; Katherine James; Jennifer Hallinan; Simon J Cockell; Phillip Lord; Anil Wipat; Darren J Wilkinson
Journal:  Bioinformatics       Date:  2012-04-06       Impact factor: 6.937

10.  InSilico DB genomic datasets hub: an efficient starting point for analyzing genome-wide studies in GenePattern, Integrative Genomics Viewer, and R/Bioconductor.

Authors:  Alain Coletta; Colin Molter; Robin Duqué; David Steenhoff; Jonatan Taminau; Virginie de Schaetzen; Stijn Meganck; Cosmin Lazar; David Venet; Vincent Detours; Ann Nowé; Hugues Bersini; David Y Weiss Solís
Journal:  Genome Biol       Date:  2012-11-18       Impact factor: 13.583

View more
  4 in total

1.  A Bayesian latent variable approach to aggregation of partial and top-ranked lists in genomic studies.

Authors:  Xue Li; Pankaj Kumar Choudhary; Swati Biswas; Xinlei Wang
Journal:  Stat Med       Date:  2018-08-09       Impact factor: 2.373

2.  Capturing functional long non-coding RNAs through integrating large-scale causal relations from gene perturbation experiments.

Authors:  Jinyuan Xu; Aiai Shi; Zhilin Long; Liwen Xu; Gaoming Liao; Chunyu Deng; Min Yan; Aiming Xie; Tao Luo; Jian Huang; Yun Xiao; Xia Li
Journal:  EBioMedicine       Date:  2018-09-01       Impact factor: 8.143

3.  Literature optimized integration of gene expression for organ-specific evaluation of toxicogenomics datasets.

Authors:  Katerina Taškova; Jean-Fred Fontaine; Ralf Mrowka; Miguel A Andrade-Navarro
Journal:  PLoS One       Date:  2019-01-14       Impact factor: 3.240

4.  A comparative study of rank aggregation methods for partial and top ranked lists in genomic applications.

Authors:  Xue Li; Xinlei Wang; Guanghua Xiao
Journal:  Brief Bioinform       Date:  2019-01-18       Impact factor: 11.622

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.