Literature DB >> 24334400

Hierarchical clustering of high-throughput expression data based on general dependences.

Tianwei Yu1, Hesen Peng1.   

Abstract

High-throughput expression technologies, including gene expression array and liquid chromatography--mass spectrometry (LC-MS) and so on, measure thousands of features, i.e., genes or metabolites, on a continuous scale. In such data, both linear and nonlinear relations exist between features. Nonlinear relations can reflect critical regulation patterns in the biological system. However, they are not identified and utilized by traditional clustering methods based on linear associations. Clustering based on general dependences, i.e., both linear and nonlinear relations, is hampered by the high dimensionality and high noise level of the data. We developed a sensitive nonparametric measure of general dependence between (groups of) random variables in high dimensions. Based on this dependence measure, we developed a hierarchical clustering method. In simulation studies, the method outperformed correlation- and mutual information (MI)-based hierarchical clustering methods in clustering features with nonlinear dependences. We applied the method to a microarray data set measuring the gene expression in cell-cycle time series to show it generates biologically relevant results. The R code is available at http://userwww.service.emory.edu/~tyu8/GDHC.

Entities:  

Mesh:

Year:  2013        PMID: 24334400      PMCID: PMC3905248          DOI: 10.1109/TCBB.2013.99

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  28 in total

1.  Identifying periodically expressed transcripts in microarray time series data.

Authors:  Sofia Wichert; Konstantinos Fokianos; Korbinian Strimmer
Journal:  Bioinformatics       Date:  2004-01-01       Impact factor: 6.937

2.  Inference of transcriptional regulatory network by two-stage constrained space factor analysis.

Authors:  Tianwei Yu; Ker-Chau Li
Journal:  Bioinformatics       Date:  2005-09-06       Impact factor: 6.937

3.  Study of coordinative gene expression at the biological process level.

Authors:  Tianwei Yu; Wei Sun; Shinsheng Yuan; Ker-Chau Li
Journal:  Bioinformatics       Date:  2005-08-02       Impact factor: 6.937

Review 4.  Mass spectrometry-based metabolomics.

Authors:  Katja Dettmer; Pavel A Aronov; Bruce D Hammock
Journal:  Mass Spectrom Rev       Date:  2007 Jan-Feb       Impact factor: 10.946

5.  Using GOstats to test gene lists for GO term association.

Authors:  S Falcon; R Gentleman
Journal:  Bioinformatics       Date:  2006-11-10       Impact factor: 6.937

6.  A system for enhancing genome-wide coexpression dynamics study.

Authors:  Ker-Chau Li; Ching-Ti Liu; Wei Sun; Shinsheng Yuan; Tianwei Yu
Journal:  Proc Natl Acad Sci U S A       Date:  2004-10-18       Impact factor: 11.205

7.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

8.  NCBI GEO: mining tens of millions of expression profiles--database and tools update.

Authors:  Tanya Barrett; Dennis B Troup; Stephen E Wilhite; Pierre Ledoux; Dmitry Rudnev; Carlos Evangelista; Irene F Kim; Alexandra Soboleva; Maxim Tomashevsky; Ron Edgar
Journal:  Nucleic Acids Res       Date:  2006-11-11       Impact factor: 16.971

9.  ROCS: receiver operating characteristic surface for class-skewed high-throughput data.

Authors:  Tianwei Yu
Journal:  PLoS One       Date:  2012-07-06       Impact factor: 3.240

10.  A forward-backward fragment assembling algorithm for the identification of genomic amplification and deletion breakpoints using high-density single nucleotide polymorphism (SNP) array.

Authors:  Tianwei Yu; Hui Ye; Wei Sun; Ker-Chau Li; Zugen Chen; Sharoni Jacobs; Dione K Bailey; David T Wong; Xiaofeng Zhou
Journal:  BMC Bioinformatics       Date:  2007-05-03       Impact factor: 3.169

View more
  8 in total

1.  A comprehensive survey of regulatory network inference methods using single-cell RNA sequencing data.

Authors:  Hung Nguyen; Duc Tran; Bang Tran; Bahadir Pehlivan; Tin Nguyen
Journal:  Brief Bioinform       Date:  2020-09-16       Impact factor: 11.622

2.  Missing value imputation for LC-MS metabolomics data by incorporating metabolic network and adduct ion relations.

Authors:  Zhuxuan Jin; Jian Kang; Tianwei Yu
Journal:  Bioinformatics       Date:  2018-05-01       Impact factor: 6.937

3.  Estimating Linear and Nonlinear Gene Coexpression Networks by Semiparametric Neighborhood Selection.

Authors:  Juho A J Kontio; Marko J Rinta-Aho; Mikko J Sillanpää
Journal:  Genetics       Date:  2020-05-15       Impact factor: 4.562

4.  Nonlinear variable selection with continuous outcome: a fully nonparametric incremental forward stagewise approach.

Authors:  Tianwei Yu
Journal:  Stat Anal Data Min       Date:  2018-06-19       Impact factor: 1.051

Review 5.  A comprehensive survey of regulatory network inference methods using single cell RNA sequencing data.

Authors:  Hung Nguyen; Duc Tran; Bang Tran; Bahadir Pehlivan; Tin Nguyen
Journal:  Brief Bioinform       Date:  2021-05-20       Impact factor: 11.622

6.  Identifying heterogeneous subtypes of gastric cancer and subtype‑specific subpaths of microRNA‑target pathways.

Authors:  Yuanhang Li; Weijun Bai; Xu Zhang
Journal:  Mol Med Rep       Date:  2017-12-20       Impact factor: 2.952

7.  K-Profiles: A Nonlinear Clustering Method for Pattern Detection in High Dimensional Data.

Authors:  Kai Wang; Qing Zhao; Jianwei Lu; Tianwei Yu
Journal:  Biomed Res Int       Date:  2015-08-03       Impact factor: 3.411

8.  Nonlinear Network Reconstruction from Gene Expression Data Using Marginal Dependencies Measured by DCOL.

Authors:  Haodong Liu; Peng Li; Mengyao Zhu; Xiaofei Wang; Jianwei Lu; Tianwei Yu
Journal:  PLoS One       Date:  2016-07-05       Impact factor: 3.240

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.