Literature DB >> 20473354

SUCCESSIVE NORMALIZATION OF RECTANGULAR ARRAYS.

Richard A Olshen1, Bala Rajaratnam.   

Abstract

Standard statistical techniques often require transforming data to have mean 0 and standard deviation 1. Typically, this process of "standardization" or "normalization" is applied across subjects when each subject produces a single number. High throughput genomic and financial data often come as rectangular arrays, where each coordinate in one direction concerns subjects, who might have different status (case or control, say); and each coordinate in the other designates "outcome" for a specific feature, for example "gene," "polymorphic site," or some aspect of financial profile. It may happen when analyzing data that arrive as a rectangular array that one requires BOTH the subjects and features to be "on the same footing." Thus, there may be a need to standardize across rows and columns of the rectangular matrix. There arises the question as to how to achieve this double normalization. We propose and investigate the convergence of what seems to us a natural approach to successive normalization, which we learned from colleague Bradley Efron. We also study the implementation of the method on simulated data and also on data that arose from scientific experimentation.

Entities:  

Year:  2010        PMID: 20473354      PMCID: PMC2868388          DOI: 10.1214/09-AOS743

Source DB:  PubMed          Journal:  Ann Stat        ISSN: 0090-5364            Impact factor:   4.028


  1 in total

1.  Network analysis of human in-stent restenosis.

Authors:  Euan A Ashley; Rossella Ferrara; Jennifer Y King; Aditya Vailaya; Allan Kuchinsky; Xuanmin He; Blake Byers; Ulrich Gerckens; Stefan Oblin; Anya Tsalenko; Angela Soito; Joshua M Spin; Raymond Tabibiazar; Andrew J Connolly; John B Simpson; Eberhard Grube; Thomas Quertermous
Journal:  Circulation       Date:  2006-12-04       Impact factor: 29.690

  1 in total
  3 in total

1.  Successive Standardization of Rectangular Arrays.

Authors:  Richard A Olshen; Bala Rajaratnam
Journal:  Algorithms       Date:  2012-02-29

2.  Inference with Transposable Data: Modeling the Effects of Row and Column Correlations.

Authors:  Genevera I Allen; Robert Tibshirani
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2012-03-16       Impact factor: 4.488

3.  Genome-Wide Association between Transcription Factor Expression and Chromatin Accessibility Reveals Regulators of Chromatin Accessibility.

Authors:  David Lamparter; Daniel Marbach; Rico Rueedi; Sven Bergmann; Zoltán Kutalik
Journal:  PLoS Comput Biol       Date:  2017-01-24       Impact factor: 4.475

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.