Literature DB >> 19222380

Detecting outlier samples in microarray data.

Albert D Shieh1, Yeung Sam Hung.   

Abstract

In this paper, we address the problem of detecting outlier samples with highly different expression patterns in microarray data. Although outliers are not common, they appear even in widely used benchmark data sets and can negatively affect microarray data analysis. It is important to identify outliers in order to explore underlying experimental or biological problems and remove erroneous data. We propose an outlier detection method based on principal component analysis (PCA) and robust estimation of Mahalanobis distances that is fully automatic. We demonstrate that our outlier detection method identifies biologically significant outliers with high accuracy and that outlier removal improves the prediction accuracy of classifiers. Our outlier detection method is closely related to existing robust PCA methods, so we compare our outlier detection method to a prominent robust PCA method.

Mesh:

Year:  2009        PMID: 19222380     DOI: 10.2202/1544-6115.1426

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  17 in total

Review 1.  Cardiovascular genomics: a biomarker identification pipeline.

Authors:  John H Phan; Chang F Quo; May Dongmei Wang
Journal:  IEEE Trans Inf Technol Biomed       Date:  2012-05-16

2.  PCA leverage: outlier detection for high-dimensional functional magnetic resonance imaging data.

Authors:  Amanda F Mejia; Mary Beth Nebel; Ani Eloyan; Brian Caffo; Martin A Lindquist
Journal:  Biostatistics       Date:  2017-07-01       Impact factor: 5.899

3.  Robust identification of gene-environment interactions for prognosis using a quantile partial correlation approach.

Authors:  Yaqing Xu; Mengyun Wu; Qingzhao Zhang; Shuangge Ma
Journal:  Genomics       Date:  2018-07-17       Impact factor: 5.736

4.  Similar response patterns to topical minoxidil foam 5% in frontal and vertex scalp of men with androgenetic alopecia: a microarray analysis.

Authors:  P Mirmirani; M Consolo; P Oyetakin-White; E Baron; P Leahy; P Karnik
Journal:  Br J Dermatol       Date:  2014-12-10       Impact factor: 9.302

5.  Central nervous system endoplasmic reticulum stress in a murine model of type 2 diabetes.

Authors:  C Sims-Robinson; S Zhao; J Hur; E L Feldman
Journal:  Diabetologia       Date:  2012-05-14       Impact factor: 10.122

6.  Inferring gene regulatory relationships with a high-dimensional robust approach.

Authors:  Yangguang Zang; Qing Zhao; Qingzhao Zhang; Yang Li; Sanguo Zhang; Shuangge Ma
Journal:  Genet Epidemiol       Date:  2017-05-02       Impact factor: 2.344

7.  Thyroid hormone-regulated gene expression in juvenile mouse liver: identification of thyroid response elements using microarray profiling and in silico analyses.

Authors:  Martin A Paquette; Hongyan Dong; Rémi Gagné; Andrew Williams; Morie Malowany; Mike G Wade; Carole L Yauk
Journal:  BMC Genomics       Date:  2011-12-29       Impact factor: 3.969

8.  Gene-environment interaction identification via penalized robust divergence.

Authors:  Mingyang Ren; Sanguo Zhang; Shuangge Ma; Qingzhao Zhang
Journal:  Biom J       Date:  2021-11-01       Impact factor: 1.715

9.  Detection of expression quantitative trait Loci in complex mouse crosses: impact and alleviation of data quality and complex population substructure.

Authors:  Ovidiu D Iancu; Priscila Darakjian; Sunita Kawane; Daniel Bottomly; Robert Hitzemann; Shannon McWeeney
Journal:  Front Genet       Date:  2012-08-27       Impact factor: 4.599

10.  An integrated approach for identifying wrongly labelled samples when performing classification in microarray data.

Authors:  Yuk Yee Leung; Chun Qi Chang; Yeung Sam Hung
Journal:  PLoS One       Date:  2012-10-17       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.