Literature DB >> 16301537

A data integration methodology for systems biology.

Daehee Hwang1, Alistair G Rust, Stephen Ramsey, Jennifer J Smith, Deena M Leslie, Andrea D Weston, Pedro de Atauri, John D Aitchison, Leroy Hood, Andrew F Siegel, Hamid Bolouri.   

Abstract

Different experimental technologies measure different aspects of a system and to differing depth and breadth. High-throughput assays have inherently high false-positive and false-negative rates. Moreover, each technology includes systematic biases of a different nature. These differences make network reconstruction from multiple data sets difficult and error-prone. Additionally, because of the rapid rate of progress in biotechnology, there is usually no curated exemplar data set from which one might estimate data integration parameters. To address these concerns, we have developed data integration methods that can handle multiple data sets differing in statistical power, type, size, and network coverage without requiring a curated training data set. Our methodology is general in purpose and may be applied to integrate data from any existing and future technologies. Here we outline our methods and then demonstrate their performance by applying them to simulated data sets. The results show that these methods select true-positive data elements much more accurately than classical approaches. In an accompanying companion paper, we demonstrate the applicability of our approach to biological data. We have integrated our methodology into a free open source software package named POINTILLIST.

Mesh:

Year:  2005        PMID: 16301537      PMCID: PMC1297682          DOI: 10.1073/pnas.0508647102

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  15 in total

1.  Is there a bias in proteome research?

Authors:  R Mrowka; A Patzak; H Herzel
Journal:  Genome Res       Date:  2001-12       Impact factor: 9.043

2.  Integrated genomic and proteomic analyses of a systematically perturbed metabolic network.

Authors:  T Ideker; V Thorsson; J A Ranish; R Christmas; J Buhler; J K Eng; R Bumgarner; D R Goodlett; R Aebersold; L Hood
Journal:  Science       Date:  2001-05-04       Impact factor: 47.728

3.  Comparative assessment of large-scale data sets of protein-protein interactions.

Authors:  Christian von Mering; Roland Krause; Berend Snel; Michael Cornell; Stephen G Oliver; Stanley Fields; Peer Bork
Journal:  Nature       Date:  2002-05-08       Impact factor: 49.962

4.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search.

Authors:  Andrew Keller; Alexey I Nesvizhskii; Eugene Kolker; Ruedi Aebersold
Journal:  Anal Chem       Date:  2002-10-15       Impact factor: 6.986

5.  A Bayesian networks approach for predicting protein-protein interactions from genomic data.

Authors:  Ronald Jansen; Haiyuan Yu; Dov Greenbaum; Yuval Kluger; Nevan J Krogan; Sambath Chung; Andrew Emili; Michael Snyder; Jack F Greenblatt; Mark Gerstein
Journal:  Science       Date:  2003-10-17       Impact factor: 47.728

6.  Protein interactions: two methods for assessment of the reliability of high throughput observations.

Authors:  Charlotte M Deane; Łukasz Salwiński; Ioannis Xenarios; David Eisenberg
Journal:  Mol Cell Proteomics       Date:  2002-05       Impact factor: 5.911

7.  Teamed up for transcription.

Authors:  Christian von Mering; Peer Bork
Journal:  Nature       Date:  2002-06-20       Impact factor: 49.962

8.  A data integration methodology for systems biology: experimental verification.

Authors:  Daehee Hwang; Jennifer J Smith; Deena M Leslie; Andrea D Weston; Alistair G Rust; Stephen Ramsey; Pedro de Atauri; Andrew F Siegel; Hamid Bolouri; John D Aitchison; Leroy Hood
Journal:  Proc Natl Acad Sci U S A       Date:  2005-11-21       Impact factor: 11.205

9.  Computational discovery of gene modules and regulatory networks.

Authors:  Ziv Bar-Joseph; Georg K Gerber; Tong Ihn Lee; Nicola J Rinaldi; Jane Y Yoo; François Robert; D Benjamin Gordon; Ernest Fraenkel; Tommi S Jaakkola; Richard A Young; David K Gifford
Journal:  Nat Biotechnol       Date:  2003-10-12       Impact factor: 54.908

10.  Transcriptional regulatory networks in Saccharomyces cerevisiae.

Authors:  Tong Ihn Lee; Nicola J Rinaldi; François Robert; Duncan T Odom; Ziv Bar-Joseph; Georg K Gerber; Nancy M Hannett; Christopher T Harbison; Craig M Thompson; Itamar Simon; Julia Zeitlinger; Ezra G Jennings; Heather L Murray; D Benjamin Gordon; Bing Ren; John J Wyrick; Jean-Bosco Tagne; Thomas L Volkert; Ernest Fraenkel; David K Gifford; Richard A Young
Journal:  Science       Date:  2002-10-25       Impact factor: 47.728

View more
  134 in total

Review 1.  Aminoacyl-tRNA synthetases and tumorigenesis: more than housekeeping.

Authors:  Sunghoon Kim; Sungyong You; Daehee Hwang
Journal:  Nat Rev Cancer       Date:  2011-09-23       Impact factor: 60.716

2.  Integration analysis of quantitative proteomics and transcriptomics data identifies potential targets of frizzled-8 protein-related antiproliferative factor in vivo.

Authors:  Wei Yang; Yongsoo Kim; Taek-Kyun Kim; Susan K Keay; Kwang Pyo Kim; Hanno Steen; Michael R Freeman; Daehee Hwang; Jayoung Kim
Journal:  BJU Int       Date:  2012-06-28       Impact factor: 5.588

3.  The serotonin transporter promoter variant (5-HTTLPR), stress, and depression meta-analysis revisited: evidence of genetic moderation.

Authors:  Katja Karg; Margit Burmeister; Kerby Shedden; Srijan Sen
Journal:  Arch Gen Psychiatry       Date:  2011-01-03

Review 4.  Methods for biological data integration: perspectives and challenges.

Authors:  Vladimir Gligorijević; Nataša Pržulj
Journal:  J R Soc Interface       Date:  2015-11-06       Impact factor: 4.118

5.  DNA Methylation Regulates the Differential Expression of CX3CR1 on Human IL-7Rαlow and IL-7Rαhigh Effector Memory CD8+ T Cells with Distinct Migratory Capacities to the Fractalkine.

Authors:  Min Sun Shin; Sungyong You; Youna Kang; Naeun Lee; Seung-Ah Yoo; Kieyoung Park; Ki Soo Kang; Sang Hyun Kim; Subhasis Mohanty; Albert C Shaw; Ruth R Montgomery; Daehee Hwang; Insoo Kang
Journal:  J Immunol       Date:  2015-08-14       Impact factor: 5.422

6.  A data integration methodology for systems biology: experimental verification.

Authors:  Daehee Hwang; Jennifer J Smith; Deena M Leslie; Andrea D Weston; Alistair G Rust; Stephen Ramsey; Pedro de Atauri; Andrew F Siegel; Hamid Bolouri; John D Aitchison; Leroy Hood
Journal:  Proc Natl Acad Sci U S A       Date:  2005-11-21       Impact factor: 11.205

7.  One hundred years of high-throughput Drosophila research.

Authors:  Mathias Beller; Brian Oliver
Journal:  Chromosome Res       Date:  2006       Impact factor: 5.239

8.  A novel biological function for CD44 in axon growth of retinal ganglion cells identified by a bioinformatics approach.

Authors:  Albert Ries; Jeffrey L Goldberg; Barbara Grimpe
Journal:  J Neurochem       Date:  2007-08-30       Impact factor: 5.372

9.  The Arabidopsis NAC transcription factor ANAC096 cooperates with bZIP-type transcription factors in dehydration and osmotic stress responses.

Authors:  Zheng-Yi Xu; Soo Youn Kim; Do Young Hyeon; Dae Heon Kim; Ting Dong; Youngmin Park; Jing Bo Jin; Se-Hwan Joo; Seong-Ki Kim; Jong Chan Hong; Daehee Hwang; Inhwan Hwang
Journal:  Plant Cell       Date:  2013-11-27       Impact factor: 11.277

10.  Time-series integrated "omic" analyses to elucidate short-term stress-induced responses in plant liquid cultures.

Authors:  Bhaskar Dutta; Harin Kanani; John Quackenbush; Maria I Klapa
Journal:  Biotechnol Bioeng       Date:  2009-01-01       Impact factor: 4.530

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.