Literature DB >> 19805443

Statistical challenges of high-dimensional data.

Iain M Johnstone1, D Michael Titterington.   

Abstract

Modern applications of statistical theory and methods can involve extremely large datasets, often with huge numbers of measurements on each of a comparatively small number of experimental units. New methodology and accompanying theory have emerged in response: the goal of this Theme Issue is to illustrate a number of these recent developments. This overview article introduces the difficulties that arise with high-dimensional data in the context of the very familiar linear statistical model: we give a taste of what can nevertheless be achieved when the parameter vector of interest is sparse, that is, contains many zero elements. We describe other ways of identifying low-dimensional subspaces of the data space that contain all useful information. The topic of classification is then reviewed along with the problem of identifying, from within a very large set, the variables that help to classify observations. Brief mention is made of the visualization of high-dimensional data and ways to handle computational problems in Bayesian analysis are described. At appropriate points, reference is made to the other papers in the issue.

Mesh:

Year:  2009        PMID: 19805443      PMCID: PMC2865881          DOI: 10.1098/rsta.2009.0159

Source DB:  PubMed          Journal:  Philos Trans A Math Phys Eng Sci        ISSN: 1364-503X            Impact factor:   4.226


  22 in total

1.  A global geometric framework for nonlinear dimensionality reduction.

Authors:  J B Tenenbaum; V de Silva; J C Langford
Journal:  Science       Date:  2000-12-22       Impact factor: 47.728

2.  Visual data mining.

Authors:  Edward J Wegman
Journal:  Stat Med       Date:  2003-05-15       Impact factor: 2.373

3.  Hessian eigenmaps: locally linear embedding techniques for high-dimensional data.

Authors:  David L Donoho; Carrie Grimes
Journal:  Proc Natl Acad Sci U S A       Date:  2003-04-30       Impact factor: 11.205

4.  Higher criticism thresholding: Optimal feature selection when useful features are rare and weak.

Authors:  David Donoho; Jiashun Jin
Journal:  Proc Natl Acad Sci U S A       Date:  2008-09-24       Impact factor: 11.205

5.  Selective inference in complex research.

Authors:  Yoav Benjamini; Ruth Heller; Daniel Yekutieli
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2009-11-13       Impact factor: 4.226

6.  Statistical inference for exploratory data analysis and model diagnostics.

Authors:  Andreas Buja; Dianne Cook; Heike Hofmann; Michael Lawrence; Eun-Kyung Lee; Deborah F Swayne; Hadley Wickham
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2009-11-13       Impact factor: 4.226

Review 7.  An overview of recent developments in genomics and associated statistical methods.

Authors:  Peter J Bickel; James B Brown; Haiyan Huang; Qunhua Li
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2009-11-13       Impact factor: 4.226

8.  Sufficient dimension reduction and prediction in regression.

Authors:  Kofi P Adragni; R Dennis Cook
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2009-11-13       Impact factor: 4.226

9.  Identifying graph clusters using variational inference and links to covariance parametrization.

Authors:  David Barber
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2009-11-13       Impact factor: 4.226

10.  The revolution in crystallography.

Authors:  W C Hamilton
Journal:  Science       Date:  1970-07-10       Impact factor: 47.728

View more
  46 in total

Review 1.  Methods of integrating data to uncover genotype-phenotype interactions.

Authors:  Marylyn D Ritchie; Emily R Holzinger; Ruowang Li; Sarah A Pendergrass; Dokyoon Kim
Journal:  Nat Rev Genet       Date:  2015-01-13       Impact factor: 53.242

Review 2.  Use of Exposomic Methods Incorporating Sensors in Environmental Epidemiology.

Authors:  Brett T Doherty; Jeremy P Koelmel; Elizabeth Z Lin; Megan E Romano; Krystal J Godri Pollitt
Journal:  Curr Environ Health Rep       Date:  2021-02-10

Review 3.  Systems vaccinology: Enabling rational vaccine design with systems biological approaches.

Authors:  Thomas Hagan; Helder I Nakaya; Shankar Subramaniam; Bali Pulendran
Journal:  Vaccine       Date:  2015-04-06       Impact factor: 3.641

4.  Multivariate Analysis in Metabolomics.

Authors:  Bradley Worley; Robert Powers
Journal:  Curr Metabolomics       Date:  2013

5.  From complex data to biological insight: 'DEKER' feature selection and network inference.

Authors:  Sean M S Hayes; Jeffrey R Sachs; Carolyn R Cho
Journal:  J Pharmacokinet Pharmacodyn       Date:  2021-11-17       Impact factor: 2.745

6.  Dual transcriptomic and epigenomic study of reaction severity in peanut-allergic children.

Authors:  Anh N Do; Corey T Watson; Ariella T Cohain; Robert S Griffin; Alexander Grishin; Robert A Wood; A Wesley Burks; Stacie M Jones; Amy Scurlock; Donald Y M Leung; Hugh A Sampson; Scott H Sicherer; Andrew J Sharp; Eric E Schadt; Supinda Bunyavanich
Journal:  J Allergy Clin Immunol       Date:  2019-12-12       Impact factor: 10.793

7.  Problems with Centrality Measures in Psychopathology Symptom Networks: Why Network Psychometrics Cannot Escape Psychometric Theory.

Authors:  Michael N Hallquist; Aidan G C Wright; Peter C M Molenaar
Journal:  Multivariate Behav Res       Date:  2019-08-12       Impact factor: 5.923

Review 8.  Computational principles and challenges in single-cell data integration.

Authors:  Ricard Argelaguet; Anna S E Cuomo; Oliver Stegle; John C Marioni
Journal:  Nat Biotechnol       Date:  2021-05-03       Impact factor: 54.908

Review 9.  Artificial intelligence applications in inflammatory bowel disease: Emerging technologies and future directions.

Authors:  John Gubatan; Steven Levitte; Akshar Patel; Tatiana Balabanis; Mike T Wei; Sidhartha R Sinha
Journal:  World J Gastroenterol       Date:  2021-05-07       Impact factor: 5.742

10.  Identification of key gene signatures for the overall survival of ovarian cancer.

Authors:  Akash Pawar; Oindrila Roy Chowdhury; Ruby Chauhan; Sanjay Talole; Atanu Bhattacharjee
Journal:  J Ovarian Res       Date:  2022-01-20       Impact factor: 4.234

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.