Literature DB >> 27722040

BIG DATA AND STATISTICS: A STATISTICIAN'S PERSPECTIVE.

David Rossell1.   

Abstract

Big Data brings unprecedented power to address scientific, economic and societal issues, but also amplifies the possibility of certain pitfalls. These include using purely data-driven approaches that disregard understanding the phenomenon under study, aiming at a dynamically moving target, ignoring critical data collection issues, summarizing or preprocessing the data inadequately and mistaking noise for signal. We review some success stories and illustrate how statistical principles can help obtain more reliable information from data. We also touch upon current challenges that require active methodological research, such as strategies for efficient computation, integration of heterogeneous data, extending the underlying theory to increasingly complex questions and, perhaps most importantly, training a new generation of scientists to develop and deploy these strategies.

Entities:  

Keywords:  Big Data; case studies; challenges; pitfalls; statistics

Year:  2015        PMID: 27722040      PMCID: PMC5053772          DOI: 10.7203/metode.83.3590

Source DB:  PubMed          Journal:  Metode Sci Stud J


  7 in total

Review 1.  Adaptive clinical trials in oncology.

Authors:  Donald A Berry
Journal:  Nat Rev Clin Oncol       Date:  2011-11-08       Impact factor: 66.675

2.  Public policy for the poor? A randomised assessment of the Mexican universal health insurance programme.

Authors:  Gary King; Emmanuela Gakidou; Kosuke Imai; Jason Lakin; Ryan T Moore; Clayton Nall; Nirmala Ravishankar; Manett Vargas; Martha María Téllez-Rojo; Juan Eugenio Hernández Avila; Mauricio Hernández Avila; Héctor Hernández Llamas
Journal:  Lancet       Date:  2009-04-07       Impact factor: 79.321

3.  Big data. The parable of Google Flu: traps in big data analysis.

Authors:  David Lazer; Ryan Kennedy; Gary King; Alessandro Vespignani
Journal:  Science       Date:  2014-03-14       Impact factor: 47.728

4.  Scientific method: statistical errors.

Authors:  Regina Nuzzo
Journal:  Nature       Date:  2014-02-13       Impact factor: 49.962

5.  QUANTIFYING ALTERNATIVE SPLICING FROM PAIRED-END RNA-SEQUENCING DATA.

Authors:  David Rossell; Camille Stephan-Otto Attolini; Manuel Kroiss; Almond Stöcker
Journal:  Ann Appl Stat       Date:  2014-03       Impact factor: 2.083

6.  Challenges of Big Data Analysis.

Authors:  Jianqing Fan; Fang Han; Han Liu
Journal:  Natl Sci Rev       Date:  2014-06       Impact factor: 17.275

7.  chroGPS, a global chromatin positioning system for the functional analysis and visualization of the epigenome.

Authors:  Joan Font-Burgada; Oscar Reina; David Rossell; Fernando Azorín
Journal:  Nucleic Acids Res       Date:  2013-11-23       Impact factor: 16.971

  7 in total
  2 in total

Review 1.  Development of Anticancer Peptides Using Artificial Intelligence and Combinational Therapy for Cancer Therapeutics.

Authors:  Ji Su Hwang; Seok Gi Kim; Tae Hwan Shin; Yong Eun Jang; Do Hyeon Kwon; Gwang Lee
Journal:  Pharmaceutics       Date:  2022-05-06       Impact factor: 6.525

Review 2.  Information-Based Medicine in Glioma Patients: A Clinical Perspective.

Authors:  Joeky Tamba Senders; Maya Harary; Brittany Morgan Stopa; Patrick Staples; Marike Lianne Daphne Broekman; Timothy Richard Smith; William Brian Gormley; Omar Arnaout
Journal:  Comput Math Methods Med       Date:  2018-06-13       Impact factor: 2.238

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.