| Literature DB >> 33271009 |
Abstract
Several assumptions such as normality, linear relationship, and homoscedasticity are frequently required in parametric statistical analysis methods. Data collected from the clinical situation or experiments often violate these assumptions. Variable transformation provides an opportunity to make data available for parametric statistical analysis without statistical errors. The purpose of variable transformation to enable parametric statistical analysis and its final goal is a perfect interpretation of the result with transformed variables. Variable transformation usually changes the original characteristics and nature of units of variables. Back-transformation is crucial for the interpretation of the estimated results. This article introduces general concepts about variable transformation, mainly focused on logarithmic transformation. Back-transformation and other important considerations are also described herein.Entities:
Keywords: Back-transformation; Box-Cox transformation; Homoscedasticity; Logarithmic; Normality; Power; Retransformation; Skewed distribution; Transformation
Year: 2020 PMID: 33271009 PMCID: PMC7714623 DOI: 10.4097/kja.20137
Source DB: PubMed Journal: Korean J Anesthesiol ISSN: 2005-6419
Skewness, Characteristics of Distribution, and Recommended Choice of Transformation
| Skewness | > 0 | < 0 |
|---|---|---|
| Nomenclature | Positively skewed distribution | Negatively skewed distribution |
| Skewed right | Skewed left | |
| Characteristics | Long right tail relative to left | Long left tail relative to right |
| Recommended transformation to achieve normality | Square root | Power (square, cubic) |
| Reciprocal | ||
| Logarithmic |
Fig. 1.Quartile-Quartile plot (Q-Q plot) of original data and logarithmically transformed data. (A) Q-Q plot of original data. Upper tail of the plot seems to be going off from the straight line. This means that data has a probability of non-normal distribution. Mean and SD of this data is 20.52 and 4.117. The skewness of this distribution is 0.56, and it is a positively skewed distribution. Shapiro-Wilk normality test statistics = 0.974, P = 0.047. (B) Q-Q plot of natural logarithmic transformed data. Non-normality of data distribution seems to be improved in the part of the upper tail. Mean and SD of transformed data is 3.0 and 0.201. Skewness of transformed data is -0.14, Shapiro-Wilk normality test statistics = 0.988, P = 0.477.
Fig. 2.The shapes of the logarithmic graph. Original values become transformed values through corresponding logarithmic transformations.
Fig. 3.Mean and standard deviation changes according to the added constant before logarithmic transformation. Mean differences and SD of each group becomes smaller as the added constant increases. The original data randomly created with mean = 0, SD =1 (white dot) and mean = 1, SD = 3 (black dot), each of them has 100 cases.