Literature DB >> 26594331

Combining complexity measures of EEG data: multiplying measures reveal previously hidden information.

Abstract

Many studies have noted significant differences among human electroencephalograph (EEG) results when participants or patients are exposed to different stimuli, undertaking different tasks, or being affected by conditions such as epilepsy or Alzheimer's disease. Such studies often use only one or two measures of complexity and do not regularly justify their choice of measure beyond the fact that it has been used in previous studies. If more measures were added to such studies, however, more complete information might be found about these reported differences. Such information might be useful in confirming the existence or extent of such differences, or in understanding their physiological bases. In this study we analysed publically-available EEG data using a range of complexity measures to determine how well the measures correlated with one another. The complexity measures did not all significantly correlate, suggesting that different measures were measuring unique features of the EEG signals and thus revealing information which other measures were unable to detect. Therefore, the results from this analysis suggests that combinations of complexity measures reveal unique information which is in addition to the information captured by other measures of complexity in EEG data. For this reason, researchers using individual complexity measures for EEG data should consider using combinations of measures to more completely account for any differences they observe and to ensure the robustness of any relationships identified.

Entities: Chemical Disease Species

Keywords: Lemel-Ziv complexity; Weiner entropy; complexity; complexity measure; electroencephalograph; fractal dimension; permutation entropy; sample entropy

Year: 2015 PMID： 26594331 PMCID： PMC4648221 DOI： 10.12688/f1000research.6590.1

Source DB: PubMed Journal: F1000Res ISSN： 2046-1402

Introduction

Electroencephalography (EEG) is a common, relatively non-invasive research and diagnostic tool. Its one-dimensional signals from localised peripheral regions on the head make it attractive for its simplistic fidelity and has allowed high clinical and basic research throughput. When it comes to interpreting EEG data, investigators have a wide range of analytical tools at their disposal ( Dauwels ; Delorme & Makeig, 2004) and in recent years have explored a number of novel relationships between measures of complexity ( Cao & Slobounov, 2011; Dauwels ; Jing ; Sitt ; Susmáková & Krakovská, 2008; Weiss ). Studies which have included complexity measures, however, do not regularly include more than one or two such measures. For example, Dauwels include the Lempel-Ziv (LZ) complexity measure ( Lempel & Ziv, 1976) - an algorithmic-based measure - and regularity measures, but ignore potential chaotic and fractal measures. This is not to suggest that the LZ complexity measure or that regularity measures are meaningless, nor that chaotic and fractal measures are more or less important than other measures of complexity, but that all may be measuring different features. Thus, for a more complete and robust picture of any relationships found for one complexity measure in EEG data, it might be useful for investigators to include other measures in their analyses. This study therefore aims to determine whether different measures of complexity of EEG signals correlate, and (if so) to what degree. To do this, a small battery of complexity measures were computed for publicly-available normative data and subsequently analysed for correlations. If some measures were found not to significantly correlate or correlate fully, this would suggest that these measures are detecting unique information which might otherwise have remained hidden to investigators who were computing only a single complexity measure from their data.

Methods

One thousand, one hundred EEG recordings of 1-second duration from 13 healthy control subjects undergoing an object recognition psychophysics task were obtained from a publicly-available database created by Begleiter (1996) of the Neurodynamics Laboratory, State University of New York Health Center, Brooklyn, United States. The control subjects were selected so as to avoid disease-specific influences. While our sample size was limited by the database, prior studies which used this database reached significance (thus, independent power calculations were not performed). Detailed demographic, subject, recording, and task information can be found in the original study by Zhang . The following complexity measures were calculated in MATLAB for each recording: LZ algorithmic complexity ( Lempel & Ziv, 1976), fractal dimension estimation (FD) ( Higuchi, 1988), permutation entropy (PE) ( Bandt & Pompe, 2002), Wiener entropy (WE) ( Wiener, 1954), and spectral structure variability (SSV) ( Singh, 2011). These measures were chosen on the basis of their broad representation of different conceptions of ‘complexity’, including informational theoretic, chaotic/fractal, and computational informatic approaches; details of how these measures are calculated and what they measure are well-described by their respective original proposers ( Bandt & Pompe, 2002; Higuchi, 1988; Lempel & Ziv, 1976; Singh, 2011; Wiener, 1954) and so will not be repeated here (see Data Availability for code details). Many more measures exist than these, however as the principle aim of this paper was to determine if differences exist at all, any differences detected in this small cross-section of measures would sufficiently illustrate this. Results from the complexity measures were analysed by linear regression and significance (considered as p<0.05) for relationships between pairs of measures was calculated using Pearson product-moment correlation coefficients. For relationships which appeared to have non-linear components when viewing its scatterplot, binomial regression was attempted. Graphs and statistics were generated using MATLAB R2012a (7.14.0.739) and Microsoft Excel 2007.

Results

Of the ten pairs of measures, eight pairs exhibited highly significant (p<0.0001) correlations while two pairs - (i) PE and FD, (ii) WE and LZ - did not significantly correlate ( Table 1 and Table 2). High degrees of spread were noted among all correlations.

Table 1.

Pearson (r) correlation matrix for each pair of complexity measures computed for normative EEG recordings.

	FD	LZ	WE	PE	SSV
FD	-	0.5402	0.4155	-0.0255	0.6517
LZ	0.5402	-	-0.0472	-0.1273	0.5983
WE	0.4155	0.4155	-	0.3469	0.5977
PE	-0.0255	-0.1273	0.3469	-	0.1672
SSV	0.6517	0.5983	0.5977	0.1672	-

Table 2.

Significance (p) of correlations for each pair of complexity measures computed for normative EEG recordings.

	FD	LZ	WE	PE	SSV
FD	-	<0.0001	<0.0001	0.3990	<0.0001
LZ	<0.0001	-	0.1174	<0.0001	<0.0001
WE	<0.0001	0.1174	-	<0.0001	<0.0001
PE	0.3990	<0.0001	<0.0001	-	<0.0001
SSV	<0.0001	<0.0001	<0.0001	<0.0001	-

These relationships were visualised using scatter plots ( Figure 1 and Figure 2) to help determine if any of these relationships may be non-linear. Two such relationships - (i) LZ and FD, (ii) SSV and FD - appeared to follow a binomial trend ( Figure 3), and binomial regression improved these relationships greatly.

Figure 1.

Scatter plots with linear trendlines for pairs of significantly-correlated complexity measures.

Eight pairs of complexity measures of the EEG signals had a significant (p<0.0001) correlation. Although the relationships are significant, high degrees of spread are noticeable and some of the relationships may have non-linear components. EEG = electroencephalogram; LZ = Lempel-Ziv algorithmic complexity; FD = fractal dimension estimate (Higuchi method); PE = permutation entropy; SSV = spectral structure variability; WE = Wiener entropy (also known as spectral flatness).

Figure 2.

Scatter plots with linear trendlines for two pairs of insignificant, uncorrelated complexity measures.

Two pairs of complexity measures of the EEG signals were insignificant and uncorrelated - PE & FD (r=-0.0255, p=0.3990) and WE & LZ (r=-0.0472, p=0.1174). There appears to be no non-linear components or any evidence of a clear relationship between these pairs of measures. EEG = electroencephalogram; LZ = Lempel-Ziv algorithmic complexity; FD = fractal dimension estimate (Higuchi method); PE = permutation entropy; WE = Wiener entropy (also known as spectral flatness).

Figure 3.

Scatter plots with binomial regression lines for potential non-linearly-related pairs of complexity measures.

Scatter plots with linear trendlines for pairs of significantly-correlated complexity measures.

Scatter plots with linear trendlines for two pairs of insignificant, uncorrelated complexity measures.

Scatter plots with binomial regression lines for potential non-linearly-related pairs of complexity measures.

Two pairs of complexity measures of the EEG signals appeared to have noticeable non-linear relationships: (i) LZ and FD; and (ii) SSV and FD. Although these binomial relationships were - like their linear relationships - significant (p<0.0001), the binomial regressions produced less spread and appear to be truer representations of the relationships. EEG = electroencephalogram; LZ = Lempel-Ziv algorithmic complexity; FD = fractal dimension estimate (Higuchi method); SSV = spectral structure variability. The following data are the results from MATLAB functions which calculated complexity measures for each EEG recording. The following data are the results from MATLAB functions which calculated complexity measures for each EEG recording. ID = identification code as per ( Burns & Rajan, 2015). Click here for additional data file.

Discussion and conclusions

Some - but not all - measures of complexity of EEG signals correlate, and to varying degrees of significance, e.g. we found no significant relationship between PE and FD but did find a significant relation between PE and LZ. To the best of our knowledge, this study represents the first report of such complexity measure differences in EEG signals. Of the many complexity measures available to researchers investigating EEG data, overreliance or overconfidence in any single measure therefore seems misplaced. As research groups who have attempted to classify or predict sleep stages or conscious states from EEG data have implicitly noted ( Susmáková & Krakovská, 2008; Sitt ; Weiss ), no individual measure can reliably predict all possibly relevant physiology. Instead, combinations of measures are needed. In the same way, no individual measurement of complexity can reliably predict all possibly relevant complexity. In part, the results from this short study reflect on a more generalised ambiguity of the concept of ‘complexity’. Who is to say, after all, that more is revealed about ‘complexity’ by FD than LZ? It seems that it cannot be said that either elucidate more or less about ‘complexity’, since both ultimately treat it in a different way on even a conceptual basis. This further reiterates the primary finding of the present study: by multiplying measures we can reveal information which was previously hidden or unknown to us. However, there are two caveats to this: (1) not all information may be physiologically or otherwise relevant all of the time (or ever); and (2) different datasets may, due to their differences in nature, show different levels of covariance between complexity measures. It would be interesting for future studies to analyse previously-noted complexity differences - e.g., between patients with and without Alzheimer's disease ( Dauwels ) - to determine if these differences were measuring the same difference. Our results suggests they may not be. And if this is the case, more might be gleaned from the available data if more measures were applied in combination. It could even be possible that there exists entirely separate complexity dimensions, along which patients progress at different rates. Such information could therefore contain even more physiological, clinical, or other significance than previously thought.

Data availability

The data referenced by this article are under copyright with the following copyright statement: Copyright: © 2015 Burns T and Rajan R Data associated with the article are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication). A copy of MATLAB functions used in this study has been uploaded to GitHub and can be accessed here: https://github.com/tfburns/MATLAB-functions-for-complexity-measures-of-one-dimensional-signals. Results from these functions for the EEG data used can be found in the F1000Research repository (see below). F1000Research: Dataset 1. Calculated complexity measures for 1100 EEG recordings. The data are the results from MATLAB functions which calculated complexity measures for each EEG recording, 10.5256/f1000research.6590.d48983 ( Burns & Rajan, 2015). This is a well written report addressing the utility of multiple measures used to evaluate EEG. EEG continues to be a valuable tool for research and clinical work. This paper nicely assesses through statistical analysis whether unique features of EEG can be helpful. The methods and analysis are well formulated. The conclusions are clearly articulated and are supported by research findings. I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. This is an interesting article. The authors computed five complexity measures which were broadly representative, aiming to determine the correlations and differences between measures of complexity used in EEG signals study. Just as the findings of this study, more information can be revealed by multiplying measures instead of single complexity measures. Furthermore, the methods of combinations of complexity measures can be applied in future studies such as Alzheimer’s disease. With the coming of big data era, we need to adopt effective means to mine information hidden in many diseases. Perhaps, this paper gives us some reference. I have read this submission. I believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

9 in total

1. Permutation entropy: a natural complexity measure for time series.

Authors: Christoph Bandt; Bernd Pompe
Journal: Phys Rev Lett Date: 2002-04-11 Impact factor: 9.161

Review 2. Diagnosis of Alzheimer's disease from EEG signals: where are we standing?

Authors: J Dauwels; F Vialatte; A Cichocki
Journal: Curr Alzheimer Res Date: 2010-09 Impact factor: 3.498

3. EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis.

Authors: Arnaud Delorme; Scott Makeig
Journal: J Neurosci Methods Date: 2004-03-15 Impact factor: 2.390

4. Comparison of fractal and power spectral EEG features: effects of topography and sleep stages.

Authors: Béla Weiss; Zsófia Clemens; Róbert Bódizs; Péter Halász
Journal: Brain Res Bull Date: 2010-12-13 Impact factor: 4.077

5. Discrimination ability of individual measures used in sleep stages classification.

Authors: Kristína Susmáková; Anna Krakovská
Journal: Artif Intell Med Date: 2008-09-19 Impact factor: 5.326

6. Event related potentials during object recognition tasks.

Authors: X L Zhang; H Begleiter; B Porjesz; W Wang; A Litke
Journal: Brain Res Bull Date: 1995 Impact factor: 4.077

7. Application of a novel measure of EEG non-stationarity as 'Shannon- entropy of the peak frequency shifting' for detecting residual abnormalities in concussed individuals.

Authors: Cheng Cao; Semyon Slobounov
Journal: Clin Neurophysiol Date: 2011-01-07 Impact factor: 3.708

8. Slowing and Loss of Complexity in Alzheimer's EEG: Two Sides of the Same Coin?

Authors: Justin Dauwels; K Srinivasan; M Ramasubba Reddy; Toshimitsu Musha; François-Benoît Vialatte; Charles Latchoumane; Jaeseung Jeong; Andrzej Cichocki
Journal: Int J Alzheimers Dis Date: 2011-04-13

9. Large scale screening of neural signatures of consciousness in patients in a vegetative or minimally conscious state.

Authors: Jacobo Diego Sitt; Jean-Remi King; Imen El Karoui; Benjamin Rohaut; Frederic Faugeras; Alexandre Gramfort; Laurent Cohen; Mariano Sigman; Stanislas Dehaene; Lionel Naccache
Journal: Brain Date: 2014-06-11 Impact factor: 13.501

9 in total

7 in total

1. Determining states of consciousness in the electroencephalogram based on spectral, complexity, and criticality features.

Authors: Nike Walter; Thilo Hinterberger
Journal: Neurosci Conscious Date: 2022-06-17

2. The successful discrimination of depression from EEG could be attributed to proper feature extraction and not to a particular classification method.

Authors: Milena Čukić; Miodrag Stokić; Slobodan Simić; Dragoljub Pokrajac
Journal: Cogn Neurodyn Date: 2020-03-25 Impact factor: 5.082

3. A Mathematical Approach to Correlating Objective Spectro-Temporal Features of Non-linguistic Sounds With Their Subjective Perceptions in Humans.

Authors: Thomas Burns; Ramesh Rajan
Journal: Front Neurosci Date: 2019-07-31 Impact factor: 4.677

Review 7. Classification of Depression Through Resting-State Electroencephalogram as a Novel Practice in Psychiatry: Review.

Authors: Milena Čukić; Victoria López; Juan Pavón
Journal: J Med Internet Res Date: 2020-11-03 Impact factor: 5.428