| Literature DB >> 27242639 |
Mike W-L Cheung1, Suzanne Jak2.
Abstract
Big data is a field that has traditionally been dominated by disciplines such as computer science and business, where mainly data-driven analyses have been performed. Psychology, a discipline in which a strong emphasis is placed on behavioral theories and empirical research, has the potential to contribute greatly to the big data movement. However, one challenge to psychologists-and probably the most crucial one-is that most researchers may not have the necessary programming and computational skills to analyze big data. In this study we argue that psychologists can also conduct big data research and that, rather than trying to acquire new programming and computational skills, they should focus on their strengths, such as performing psychometric analyses and testing theories using multivariate analyses to explain phenomena. We propose a split/analyze/meta-analyze approach that allows psychologists to easily analyze big data. Two real datasets are used to demonstrate the proposed procedures in R. A new research agenda related to the analysis of big data in psychology is outlined at the end of the study.Entities:
Keywords: R platform; big data; meta-analysis; multilevel model; structural equation modeling
Year: 2016 PMID: 27242639 PMCID: PMC4876837 DOI: 10.3389/fpsyg.2016.00738
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1The split/analyze/meta-analyze (SAM) model.
Comparisons between analysis of raw data, and analysis based on a fixed-effects meta-analysis with random splits.
| Subjective state of health (A009) | 0.4333 | 0.4333 | 0.4333 | 0.4333 | 0.4334 | 0.4330 | 0.4336 |
| Freedom of choice and control (A173) | 0.2313 | 0.2313 | 0.2313 | 0.2313 | 0.2314 | 0.2315 | 0.2322 |
| Financial satisfaction (C006) | 0.4243 | 0.4243 | 0.4243 | 0.4244 | 0.4245 | 0.4257 | 0.4259 |
| Sex (X001) | 0.1708 | 0.1707 | 0.1708 | 0.1708 | 0.1705 | 0.1701 | 0.1698 |
| Age (X003) | 0.0580 | 0.0580 | 0.0580 | 0.0580 | 0.0581 | 0.0579 | 0.0575 |
| Subjective state of health (A009) | 0.0043 | 0.0043 | 0.0043 | 0.0043 | 0.0043 | 0.0043 | 0.0043 |
| Freedom of choice and control (A173) | 0.0015 | 0.0015 | 0.0015 | 0.0015 | 0.0015 | 0.0015 | 0.0015 |
| Financial satisfaction (C006) | 0.0015 | 0.0015 | 0.0015 | 0.0015 | 0.0015 | 0.0014 | 0.0014 |
| Sex (X001) | 0.0070 | 0.0070 | 0.0070 | 0.0070 | 0.0070 | 0.0069 | 0.0069 |
| Age (X003) | 0.0023 | 0.0023 | 0.0023 | 0.0023 | 0.0023 | 0.0022 | 0.0022 |
Dependent variable is life satisfaction (A170).
Figure 2The 95% confidence ellipse on the indirect and direct effects on the WVS data.
Figure 3Scatter plot on the means of the selected variables on the airlines data.
Parameter estimates from the regression model and mixed-effects regression model.
| 0.9011 | 0.9011 | 0.8961 | 0.8961 | |
| 0.0176 | 0.0105 | 0.0180 | 0.0105 | |
| –0.8624 | –0.8623 | –1.2010 | –1.2009 | |
| 0.1048 | 0.0866 | 0.0976 | 0.0773 | |
| 0.0104 | 0.0108 | |||
| 0.0017 | 0.0017 | |||
| –0.0436 | –0.0440 | |||
| 0.0136 | 0.0122 | |||
| 0.0068 | 0.0024 | 0.0071 | 0.0024 | |
| –0.0178 | 0.0006 | –0.0154 | 0.0037 | |
| 0.2413 | 0.1646 | 0.2091 | 0.1313 | |
| 1.0000 | 1.0000 | |||
| 0.9995 | 0.9991 | |||
| 0.6436 | 0.6570 | |||
| 0.3176 | 0.3720 | |||
: regression coefficient of Dep. : regression coefficient of Dist. : regression coefficient of Year in predicting the coefficient of Dep. : regression coefficient of Year in predicting the coefficient of Dist.
Figure 4The 95% confidence ellipse on the regression coefficients on the airlines data.