Wyatt P Bensken1, Fredric M Pieracci2, Vanessa P Ho1,3. 1. Department of Population and Quantitative Health Sciences, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA. 2. Department of Surgery, Denver Health Medical Center, Denver, Colorado, USA. 3. Department of Surgery, MetroHealth Medical Center, Cleveland, Ohio, USA.
Abstract
Background: Standardized and concise data presentation forms the base for subsequent analysis and interpretation. This article reviews types of data, data properties and distributions, and both numerical and graphical methods of data presentation. Methods: For the purposes of illustration, the National Inpatient Sample was queried to categorize patients as having either emergency general surgery or non-emergency general surgery admissions. Results: Variables are categorized as either categorical or numerical. Within the former, there are ordinal and or nominal subtypes; within the latter, there are ratio and interval subtypes. Categorical data are typically displayed as number (%). Numerical data must be assessed for normality as normally distributed data behave in certain patterns that allow for specific statistical tests to be used. Several properties exist for numerical data, including measurements of central tendency (mean, median, and mode), as well as standard deviation, range, and interquartile range. The best initial assessment of the distribution of numerical data is graphical with both histograms and box plots. Conclusion: Knowledge of the types, distribution, and properties of data is essential to move forward with hypothesis testing.
Background: Standardized and concise data presentation forms the base for subsequent analysis and interpretation. This article reviews types of data, data properties and distributions, and both numerical and graphical methods of data presentation. Methods: For the purposes of illustration, the National Inpatient Sample was queried to categorize patients as having either emergency general surgery or non-emergency general surgery admissions. Results: Variables are categorized as either categorical or numerical. Within the former, there are ordinal and or nominal subtypes; within the latter, there are ratio and interval subtypes. Categorical data are typically displayed as number (%). Numerical data must be assessed for normality as normally distributed data behave in certain patterns that allow for specific statistical tests to be used. Several properties exist for numerical data, including measurements of central tendency (mean, median, and mode), as well as standard deviation, range, and interquartile range. The best initial assessment of the distribution of numerical data is graphical with both histograms and box plots. Conclusion: Knowledge of the types, distribution, and properties of data is essential to move forward with hypothesis testing.
Entities:
Keywords:
data description; data science; statistics
Authors: Jason W Smith; Jennifer Knight Davis; Catherine C Quatman-Yates; Brittany L Waterman; Scott A Strassels; Jen D Wong; Victor K Heh; Holly E Baselice; Guy N Brock; Brian C Clark; John F P Bridges; Heena P Santry Journal: J Am Geriatr Soc Date: 2019-07-13 Impact factor: 5.562
Authors: Sander Greenland; Stephen J Senn; Kenneth J Rothman; John B Carlin; Charles Poole; Steven N Goodman; Douglas G Altman Journal: Eur J Epidemiol Date: 2016-05-21 Impact factor: 8.082
Authors: Sigfredo Villarin; J Alford Flippin; Wyatt P Bensken; Eric Curfman; Christopher W Towe; Jeffrey A Claridge; Vanessa P Ho Journal: Surg Infect (Larchmt) Date: 2022-08 Impact factor: 1.853