Literature DB >> 24340166

Comments on statistical issues in november 2013.

Yong Gyu Park1.   

Abstract

Entities:  

Year:  2013        PMID: 24340166      PMCID: PMC3856286          DOI: 10.4082/kjfm.2013.34.6.434

Source DB:  PubMed          Journal:  Korean J Fam Med        ISSN: 2005-6443


× No keyword cloud information.
In this section, we explain the actual number of observations used in a multivariate analysis when one or more explanatory variables have missing values, which appeared in the articles titled, "Postmarketing surveillance study of the efficacy and safety of Phentermine in patients with obesity," by Kim et al.1) and "Relationships between dietary habits and allostatic load index in metabolic syndrome patients," by Kim2) published in September 2013.

MISSING VALUES IN MULTIVARIATE ANALYSES

When there are some missing values in one or more variables, most researchers choose one of the following strategies for analysis: 1) delete all observations which have missing values or 2) use all observations regardless of missing values. The purpose of this section is to show how many observations are actually analyzed in multivariate analyses, such as multiple linear regression analysis or multiple logistic regression analysis, when there are different numbers of missing values in each explanatory variable. Let's perform a multiple linear regression using the following hypothetical data (Table 1).
Table 1

Hypothetical data

In this data, explanatory variable x1 has four, x2 has two, and x3 has no missing values, respectively (denoted as a dot), and we will perform three analyzing processes using SPSS, 1) Pearson correlation analysis, 2) multiple linear regression analysis, and 3) stepwise multiple linear regression analysis.

PEARSON CORRELATION ANALYSIS

From the menus choose: Analyze Correlate Bivariate... Select all variables: y, x1, x2, x3 We obtain the following results: (Table 2).
Table 2

Correlation coefficients

X3 has the highest correlation with y, and explanatory variables, and x1, x2, and x3 are analyzed by using only their valid observations, 6, 8, and 10, respectively.

MULTIPLE LINEAR REGRESSION ANALYSIS

From the menus choose: Analyze Regression Linear... Choose dependent variable: y Independent variables: x1, x2, x3 Options: statistics: descriptive statistics We obtain the following results: (Tables 3-5).
Table 3

Descriptive statistics

Table 5

Coefficients

X3 has the highest correlation with y, but all analyses (descriptive statistics, correlation analysis, and multiple regression analysis) are performed using only six observations which have no missing values for all dependent variables.

STEPWISE MULTIPLE LINEAR REGRESSION ANALYSIS

(Menus, variable selection, and options are the same as the above) Variable selection methods: stepwise We obtain the following results: (Tables 6-8).
Table 6

Entered/removed variables

Table 8

Coefficients

Descriptive statistics (the same as above results) Correlation coefficients (the same as above results) From the total degrees of freedom (df = 5) in the analysis of variance table, a stepwise multiple regression analysis is performed using only six observations which have no missing values for all explanatory variables, even though the results show that only one variable, x3, which has no missing values, remains in the final model. As we can see from the above three results, the actual number of observations analyzed in a multivariate analysis is the minimum number of valid observations of all explanatory variables we had intended to include in the analysis, regardless of the variable selection methods.
Table 4

Correlation coefficients

Table 7

Analysis of variance

  2 in total

1.  Relationships between Dietary Habits and Allostatic Load Index in Metabolic Syndrome Patients.

Authors:  Kee Hyuck Lee; Sang Wook Park; Sung Min Ye; So-Yeon Kim; Sun-Young Kim; Jong Soo Han; Sarah Kim; Woo Kyung Bae; Ki Heon Lee; Ju Young Kim
Journal:  Korean J Fam Med       Date:  2013-09-26

2.  Postmarketing surveillance study of the efficacy and safety of phentermine in patients with obesity.

Authors:  Hyun Ok Kim; Jung Ah Lee; Hee Won Suh; Young Sik Kim; Bum Soo Kim; Eun Sook Ahn; Young Jun Roh; Seong Gil Jung; Jin Mok Kim; Moon Kuk Kang; In Soon Ahn; Young Gyu Park
Journal:  Korean J Fam Med       Date:  2013-09-26
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.