Literature DB >> 20702396

Cross-species queries of large gene expression databases.

Hai-Son Le1, Zoltán N Oltvai, Ziv Bar-Joseph.   

Abstract

MOTIVATION: Expression databases, including the Gene Expression Omnibus and ArrayExpress, have experienced significant growth over the past decade and now hold hundreds of thousands of arrays from multiple species. Since most drugs are initially tested on model organisms, the ability to compare expression experiments across species may help identify pathways that are activated in a similar way in humans and other organisms. However, while several methods exist for finding co-expressed genes in the same species as a query gene, looking at co-expression of homologs or arbitrary genes in other species is challenging. Unlike sequence, which is static, expression is dynamic and changes between tissues, conditions and time. Thus, to carry out cross-species analysis using these databases, we need methods that can match experiments in one species with experiments in another species.
RESULTS: To facilitate queries in large databases, we developed a new method for comparing expression experiments from different species. We define a distance metric between the ranking of orthologous genes in the two species. We show how to solve an optimization problem for learning the parameters of this function using a training dataset of known similar expression experiments pairs. The function we learn outperforms previous methods and simpler rank comparison methods that have been used in the past for single species analysis. We used our method to compare millions of array pairs from mouse and human expression experiments. The resulting matches can be used to find functionally related genes, to hypothesize about biological response mechanisms and to highlight conditions and diseases that are activating similar pathways in both species. AVAILABILITY: Supporting methods, results and a Matlab implementation are available from http://sb.cs.cmu.edu/ExpQ/.

Entities:  

Mesh:

Year:  2010        PMID: 20702396      PMCID: PMC2944203          DOI: 10.1093/bioinformatics/btq451

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  18 in total

1.  GEST: a gene expression search tool based on a novel Bayesian similarity metric.

Authors:  L Hunter; R C Taylor; S M Leach; R Simon
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

2.  A gene recommender algorithm to identify coexpressed genes in C. elegans.

Authors:  Art B Owen; Josh Stuart; Kathy Mach; Anne M Villeneuve; Stuart Kim
Journal:  Genome Res       Date:  2003-08       Impact factor: 9.043

3.  A gene-coexpression network for global discovery of conserved genetic modules.

Authors:  Joshua M Stuart; Eran Segal; Daphne Koller; Stuart K Kim
Journal:  Science       Date:  2003-08-21       Impact factor: 47.728

4.  Metagenes and molecular pattern discovery using matrix factorization.

Authors:  Jean-Philippe Brunet; Pablo Tamayo; Todd R Golub; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2004-03-11       Impact factor: 11.205

5.  Co-evolution of transcriptional and post-translational cell-cycle regulation.

Authors:  Lars Juhl Jensen; Thomas Skøt Jensen; Ulrik de Lichtenberg; Søren Brunak; Peer Bork
Journal:  Nature       Date:  2006-09-27       Impact factor: 49.962

Review 6.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

Review 7.  The mighty mouse: genetically engineered mouse models in cancer drug development.

Authors:  Norman E Sharpless; Ronald A Depinho
Journal:  Nat Rev Drug Discov       Date:  2006-08-18       Impact factor: 84.694

8.  A gene atlas of the mouse and human protein-encoding transcriptomes.

Authors:  Andrew I Su; Tim Wiltshire; Serge Batalov; Hilmar Lapp; Keith A Ching; David Block; Jie Zhang; Richard Soden; Mimi Hayakawa; Gabriel Kreiman; Michael P Cooke; John R Walker; John B Hogenesch
Journal:  Proc Natl Acad Sci U S A       Date:  2004-04-09       Impact factor: 11.205

9.  STEM: a tool for the analysis of short time series gene expression data.

Authors:  Jason Ernst; Ziv Bar-Joseph
Journal:  BMC Bioinformatics       Date:  2006-04-05       Impact factor: 3.169

10.  Conservation of core gene expression in vertebrate tissues.

Authors:  Esther T Chan; Gerald T Quon; Gordon Chua; Tomas Babak; Miles Trochesset; Ralph A Zirngibl; Jane Aubin; Michael J H Ratcliffe; Andrew Wilde; Michael Brudno; Quaid D Morris; Timothy R Hughes
Journal:  J Biol       Date:  2009-04-16
View more
  11 in total

1.  TROM: A Testing-Based Method for Finding Transcriptomic Similarity of Biological Samples.

Authors:  Wei Vivian Li; Yiling Chen; Jingyi Jessica Li
Journal:  Stat Biosci       Date:  2016-08-29

2.  ModuleBlast: identifying activated sub-networks within and across species.

Authors:  Guy E Zinman; Shoshana Naiman; Dawn M O'Dee; Nishant Kumar; Gerard J Nau; Haim Y Cohen; Ziv Bar-Joseph
Journal:  Nucleic Acids Res       Date:  2014-11-26       Impact factor: 16.971

3.  Learning a genome-wide score of human-mouse conservation at the functional genomics level.

Authors:  Soo Bin Kwon; Jason Ernst
Journal:  Nat Commun       Date:  2021-05-03       Impact factor: 14.919

4.  Ortho2ExpressMatrix--a web server that interprets cross-species gene expression data by gene family information.

Authors:  Thomas Meinel; Michal R Schweiger; Andreas H Ludewig; Ramu Chenna; Sylvia Krobitsch; Ralf Herwig
Journal:  BMC Genomics       Date:  2011-10-04       Impact factor: 3.969

5.  Matching experiments across species using expression values and textual information.

Authors:  Aaron Wise; Zoltán N Oltvai; Ziv Bar-Joseph
Journal:  Bioinformatics       Date:  2012-06-15       Impact factor: 6.937

6.  Bipartite tight spectral clustering (BiTSC) algorithm for identifying conserved gene co-clusters in two species.

Authors:  Yidan Eden Sun; Heather J Zhou; Jingyi Jessica Li
Journal:  Bioinformatics       Date:  2021-06-09       Impact factor: 6.931

7.  Comparison of Gene Coexpression Profiles and Construction of Conserved Gene Networks to Find Functional Modules.

Authors:  Yasunobu Okamura; Takeshi Obayashi; Kengo Kinoshita
Journal:  PLoS One       Date:  2015-07-06       Impact factor: 3.240

8.  A novel method for cross-species gene expression analysis.

Authors:  Erik Kristiansson; Tobias Österlund; Lina Gunnarsson; Gabriella Arne; D G Joakim Larsson; Olle Nerman
Journal:  BMC Bioinformatics       Date:  2013-02-27       Impact factor: 3.169

9.  Drug similarity search based on combined signatures in gene expression profiles.

Authors:  Kihoon Cha; Min-Sung Kim; Kimin Oh; Hyunjung Shin; Gwan-Su Yi
Journal:  Healthc Inform Res       Date:  2014-01-31

10.  Pathprinting: An integrative approach to understand the functional basis of disease.

Authors:  Gabriel M Altschuler; Oliver Hofmann; Irina Kalatskaya; Rebecca Payne; Shannan J Ho Sui; Uma Saxena; Andrei V Krivtsov; Scott A Armstrong; Tianxi Cai; Lincoln Stein; Winston A Hide
Journal:  Genome Med       Date:  2013-07-26       Impact factor: 11.117

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.