Literature DB >> 15608052

Clustering of diverse genomic data using information fusion.

Jyotsna Kasturi1, Raj Acharya.   

Abstract

MOTIVATION: Genome sequencing projects and high-through-put technologies like DNA and Protein arrays have resulted in a very large amount of information-rich data. Microarray experimental data are a valuable, but limited source for inferring gene regulation mechanisms on a genomic scale. Additional information such as promoter sequences of genes/DNA binding motifs, gene ontologies, and location data, when combined with gene expression analysis can increase the statistical significance of the finding. This paper introduces a machine learning approach to information fusion for combining heterogeneous genomic data. The algorithm uses an unsupervised joint learning mechanism that identifies clusters of genes using the combined data.
RESULTS: The correlation between gene expression time-series patterns obtained from different experimental conditions and the presence of several distinct and repeated motifs in their upstream sequences is examined here using publicly available yeast cell-cycle data. The results show that the combined learning approach taken here identifies correlated genes effectively. The algorithm provides an automated clustering method, but allows the user to specify apriori the influence of each data type on the final clustering using probabilities. AVAILABILITY: Software code is available by request from the first author. CONTACT: jkasturi@cse.psu.edu.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15608052     DOI: 10.1093/bioinformatics/bti186

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  A nonparametric approach to detect nonlinear correlation in gene expression.

Authors:  Y Ann Chen; Jonas S Almeida; Adam J Richards; Peter Müller; Raymond J Carroll; Baerbel Rohrer
Journal:  J Comput Graph Stat       Date:  2010-09-01       Impact factor: 2.302

Review 2.  Decision fusion in healthcare and medicine: a narrative review.

Authors:  Elham Nazari; Rizwana Biviji; Danial Roshandel; Reza Pour; Mohammad Hasan Shahriari; Amin Mehrabian; Hamed Tabesh
Journal:  Mhealth       Date:  2022-01-20

3.  Simultaneous clustering of gene expression data with clinical chemistry and pathological evaluations reveals phenotypic prototypes.

Authors:  Pierre R Bushel; Russell D Wolfinger; Greg Gibson
Journal:  BMC Syst Biol       Date:  2007-02-23

4.  Improving clustering with metabolic pathway data.

Authors:  Diego H Milone; Georgina Stegmayer; Mariana López; Laura Kamenetzky; Fernando Carrari
Journal:  BMC Bioinformatics       Date:  2014-04-10       Impact factor: 3.169

5.  Identification of cell cycle-related regulatory motifs using a kernel canonical correlation analysis.

Authors:  Je-Keun Rhee; Je-Gun Joung; Jeong-Ho Chang; Zhangjun Fei; Byoung-Tak Zhang
Journal:  BMC Genomics       Date:  2009-12-03       Impact factor: 3.969

Review 6.  Pathogen profiling for disease management and surveillance.

Authors:  Vitali Sintchenko; Jonathan R Iredell; Gwendolyn L Gilbert
Journal:  Nat Rev Microbiol       Date:  2007-05-08       Impact factor: 60.633

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.