Incorporating Nonlinear Relationships in Microarray Missing Value Imputation.

Tianwei Yu, Hesen Peng, Wei Sun.

Abstract

Microarray gene expression data often contain missing values. Accurate estimation of the missing values is important for downstream data analyses that require complete data. Nonlinear relationships between gene expression levels have not been well utilized in missing value imputation. We propose an imputation scheme based on nonlinear dependencies between genes. By simulations based on real microarray data, we show that incorporating nonlinear relationships could improve the accuracy of missing value imputation, both in terms of normalized root-mean-squared error and in terms of the preservation of the list of significant genes in statistical testing. In addition, we study the impact of artificial dependencies introduced by data normalization on the simulation results. Our results suggest that methods relying on global correlation structures may yield overly optimistic simulation results when the data have been subjected to row (gene)-wise mean removal.
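
The simulation-based evaluation mentioned in the abstract can be sketched roughly as follows: hide a random subset of entries in a complete matrix, impute them, and score the imputed values against the hidden truth. The abstract does not give the exact normalization, so this sketch assumes one common form of normalized root-mean-squared error, the RMSE divided by the standard deviation of the true values. The `row_mean_impute` baseline, the synthetic data, and the 5% missing rate are illustrative assumptions, not the authors' imputation scheme or data.

```python
import numpy as np


def nrmse(true_values, imputed_values):
    """RMSE divided by the standard deviation of the true values (assumed definition)."""
    true_values = np.asarray(true_values, dtype=float)
    imputed_values = np.asarray(imputed_values, dtype=float)
    mse = np.mean((imputed_values - true_values) ** 2)
    return float(np.sqrt(mse / np.var(true_values)))


def row_mean_impute(data):
    """Hypothetical baseline: replace each NaN with the mean of that gene's observed values."""
    filled = data.copy()
    row_means = np.nanmean(filled, axis=1)
    missing_rows, missing_cols = np.where(np.isnan(filled))
    filled[missing_rows, missing_cols] = row_means[missing_rows]
    return filled


rng = np.random.default_rng(0)
complete = rng.normal(size=(200, 20))       # synthetic genes x arrays matrix
mask = rng.random(complete.shape) < 0.05    # hide roughly 5% of the entries
observed = complete.copy()
observed[mask] = np.nan

imputed = row_mean_impute(observed)
score = nrmse(complete[mask], imputed[mask])
print(f"NRMSE of the row-mean baseline: {score:.3f}")
```

Any other imputer can be substituted for `row_mean_impute` and compared on the same mask; a lower value means the imputed entries are closer to the hidden ones.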

Year:  2011        PMID: 20733236      PMCID: PMC3624752          DOI: 10.1109/TCBB.2010.73

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  10 in total

1.  Shrinkage regression-based methods for microarray missing value imputation.

Authors:  Hsiuying Wang; Chia-Chun Chiu; Yi-Ching Wu; Wei-Sheng Wu
Journal:  BMC Syst Biol       Date:  2013-12-13

2.  Combining Fourier and lagged k-nearest neighbor imputation for biomedical time series data.

Authors:  Shah Atiqur Rahman; Yuxiao Huang; Jan Claassen; Nathaniel Heintzman; Samantha Kleinberg
Journal:  J Biomed Inform       Date:  2015-10-21       Impact factor: 6.317

3.  Hierarchical clustering of high-throughput expression data based on general dependences.

Authors:  Tianwei Yu; Hesen Peng
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2013 Jul-Aug       Impact factor: 3.710

4.  Missing value imputation for LC-MS metabolomics data by incorporating metabolic network and adduct ion relations.

Authors:  Zhuxuan Jin; Jian Kang; Tianwei Yu
Journal:  Bioinformatics       Date:  2018-05-01       Impact factor: 6.937

5.  Nonlinear variable selection with continuous outcome: a fully nonparametric incremental forward stagewise approach.

Authors:  Tianwei Yu
Journal:  Stat Anal Data Min       Date:  2018-06-19       Impact factor: 1.051

6.  Exploiting identifiability and intergene correlation for improved detection of differential expression.

Authors:  J R Deller; Hayder Radha; J Justin McCormick
Journal:  ISRN Bioinform       Date:  2013-06-03

7.  Nonlinear dependence in the discovery of differentially expressed genes.

Authors:  J R Deller; Hayder Radha; J Justin McCormick; Huiyan Wang
Journal:  ISRN Bioinform       Date:  2012-04-12

8.  Genomic data imputation with variational auto-encoders.

Authors:  Yeping Lina Qiu; Hong Zheng; Olivier Gevaert
Journal:  Gigascience       Date:  2020-08-01       Impact factor: 6.524

9.  Meta-Analysis of Large-Scale Toxicogenomic Data Finds Neuronal Regeneration Related Protein and Cathepsin D to Be Novel Biomarkers of Drug-Induced Toxicity.

Authors:  Hyosil Kim; Ju-Hwa Kim; So Youn Kim; Deokyeon Jo; Ho Jun Park; Jihyun Kim; Sungwon Jung; Hyun Seok Kim; KiYoung Lee
Journal:  PLoS One       Date:  2015-09-03       Impact factor: 3.240

10.  K-Profiles: A Nonlinear Clustering Method for Pattern Detection in High Dimensional Data.

Authors:  Kai Wang; Qing Zhao; Jianwei Lu; Tianwei Yu
Journal:  Biomed Res Int       Date:  2015-08-03       Impact factor: 3.411
