Literature DB >> 18369410

Statistical modeling and analysis of laser-evoked potentials of electrocorticogram recordings from awake humans.

Zhe Chen¹, Shinji Ohara, Jianting Cao, François Vialatte, Fred A Lenz, Andrzej Cichocki.

Abstract

This article is devoted to statistical modeling and analysis of electrocorticogram (ECoG) signals induced by painful cutaneous laser stimuli, which were recorded from implanted electrodes in awake humans. Specifically, with statistical tools of factor analysis and independent component analysis, the pain-induced laser-evoked potentials (LEPs) were extracted and investigated under different controlled conditions. With the help of wavelet analysis, quantitative and qualitative analyses were conducted regarding the LEPs' attributes of power, amplitude, and latency, in both averaging and single-trial experiments. Statistical hypothesis tests were also applied in various experimental setups. Experimental results reported herein also confirm previous findings in the neurophysiology literature. In addition, single-trial analysis has also revealed many new observations that might be interesting to the neuroscientists or clinical neurophysiologists. These promising results show convincing validation that advanced signal processing and statistical analysis may open new avenues for future studies of such ECoG or other relevant biomedical recordings.

Entities: Chemical Disease Gene Species

Year: 2007 PMID： 18369410 PMCID： PMC2271124 DOI： 10.1155/2007/10479

Source DB: PubMed Journal: Comput Intell Neurosci

1. INTRODUCTION

Pain is an essential function for the organism to enable immediate awareness of actual or threatening injury for further adopting a self-protective behavior. Roughly speaking, pain is a complex and subjective experience in the brain; it involves sensory, affective, cognitive, and motivational components and is associated with autonomous activity, nocifensive reflexes and reactions. In clinical practice, neurophysiological evaluation of pain in humans has been an important subject of research in the last decade (Bromm and Lorenz [1]). In the literature, there are many approaches for monitoring and measuring the pain-related brain activities, including electroencephalogram (EEG), magnetoencephalogram (MEG), and fMRI. In particular, the electrocorticogram (ECoG) records directly the cortical (electrical) activities from subdural electrode grids that are implanted in the human subjects for collecting information for surgical treatments of medically intractable epilepsy (i.e., patients in the hospital upon approval). As an invasive recording tool, ECoG offers some superior features that are unavailable for EEG or MEG recordings. Specifically, unlike EEG that measures the electrical potentials recorded from the scalp, ECoG directly records the potentials from the cortical surface, thereby having a higher signal-to-noise ratio (SNR) and higher spatial resolution (because of closer electrode spacing). Consequently, activities in beta or gamma bands are better recorded in ECoG due to less spatial summation and phase cancelation (or high-cut filter effect) than in scalp EEG recordings. Since the energy of the infrared laser can be used to produce a brief thermal stimulus applied to the skin such as to selectively activate the skin nociceptor, the recordings of brain responses to short laser pulses (the so-called laser-evoked potentials, or LEPs) have increasingly become a useful method for evaluating the function of central nociceptive pathways. The roles of LEPs for detecting abnormalities in patients have been noted (García-Larrea et al. [2]). Generally, there are two or three major peaks in the pain-evoked LEPs, which may be generated in multiple regions. In the literature, most research efforts focused on two peaks of the LEPs, the so-called N2 and P2, which correspond to the vertex negative-positive complex . The timing when the peak of the LEP appears is referred to the latency of LEPs. Typically, N2 was found around 150–400 milliseconds, and P2 was found around 230–500 milliseconds, depending on the laser pulse duration and intensity, as well as the stimulus site or area (Bromm and Lorenz [1]). The difference in latency is essentially related to the response differences in peripheral conduction distance. Specifically, LEP reflects an integrative cortical response to the painful laser stimuli rather than a simple reaction of the sensory cortex to it; thus, in the healthy subject the amplitude of cortical LEPs correlates with the subjective sensation of pain, rather than with the physical stimulus intensity (García-Larrea et al. [2]). For instance, paying attention to the laser stimulus simultaneously increases the subjective pain sensation and the LEP amplitude, both of which decrease in turn when the subject is distracted from the stimulus (García-Larrea et al. [3]). In addition to the amplitude, the latencies of the LEPs are often important for the neurophysiological evaluation of pain (Bromm and Lorenz [1]). In a later section, we will analyze the amplitudes and latencies of LEP components N2 and P2 in detail. As suggested in the literature, the negative component (N2) seems to be induced mainly by the activation in the bilateral operculoinsular cortices and contralateral primary somatosensory cortex (SI) (e.g., Tarkka and Treede [4], Iannetti [5]), and the positive component (P2) is mainly generated by the cingulate gyrus (e.g., Tarkka and Treede [4], Lenz et al. [6], Iannetti et al. [5]). However, it should also be noted that both N2 and P2 could be recorded and observed at multiple cortical regions simultaneously (e.g., Ohara et al. [7-9]); therefore, although there may be some evidence that one LEP is more related to a particular region than the other, a full understanding of their underlying mechanisms remains unclear. In the previous studies (Ohara et al. [8]) of the ECoG recordings from the awake humans, it was found that attention to painful cutaneous laser stimuli enhances pain-related LEPs in cortical regions receiving nociceptive input, typically at multiple cortical sites (Ohara et al. [9]). Specifically, it was observed that at primary somatosensory (SI), parasylvian (PS), and medial frontal (MF: anterior cingulate and supplementary motor area) cortex areas, the amplitudes of the negative (N2∗) and positive (P2∗ ∗) LEP components were enhanced by attention to (counting stimuli), in comparison with distraction from the stimuli (reading for comprehension). It was suggested therein that attention controls both early (N2∗) and late (P2∗ ∗) pain-related input to SI (and other) cortical regions, while the late positive deflections (that follow the P2∗ ∗ peak) are specifically related to attention. It was also reported in other independent EEG studies (e.g., Legrain et al. [10, 11]) that LEPs can be modulated by selective spatial attention. In [7], Ohara et al. observed that attention to painful stimuli leads to enhanced event-related desynchronization (ERD) in cortical regions receiving input from nociceptors, and the alpha ERD is more widespread and more intense during attention to the laser than distraction from the stimuli. This was also consistent with the observations from other studies using EEG or MEG recordings (Mouraux et al. [12], Ploner et al. [13]). In recent years, many statistical tools, such as principal component analysis (PCA), independence component analysis (ICA), parallel factor analysis (PARAFAC), common spatial subspace decomposition (CSSD), statistical wavelet thresholding (SWT), and Kalman filtering, have been used for analyzing biological or biomedical data, including EEG, MEG, and fMRI (e.g., Lee et al. [14], Cao et al. [15, 16], Makeig et al. [17], Anemüller et al. [18], Miwakeichi et al. [19], Browne and Cutmore [20], Wang [21], Galka [22], Cichocki [23, 24]). The common goal of these mathematical tools is to discover the hidden components underlying the data and extract the markers for characterizing specific events (e.g., event-related potentials). In addition, combing ICA or other statistical tools with advanced time-frequency analysis methods has also been advocated in cognitive neuroscience and neuroimaging (e.g., Makeig et al. [25], Mørup et al. [26]). In this paper, we conduct both quantitative and qualitative analyses of ECoG data induced by pain stimuli controlled by a laser pulse. The investigation is focused on two selected human subjects under several different controlled stimuli conditions: attention, distraction, as well as under different laser intensity levels. Statistical analysis was conducted for both averaging trials and single trials. The averaging-trial study attempts to find out the dominant and common components (especially LEPs) by averaging all trials (of one subject) under the same conditions. In contrast, the single-trial study aims to search for instantaneous brain waves and to analyze the corresponding LEP properties (such as the amplitude and latency). The signal-trial analysis is important because the spontaneous brain activities that are regarded as “noise” are often diminished by averaging. We believe that the results obtained from the single trials, if analyzed appropriately, often offer extra information that is unavailable in the averaging-trial study (e.g., Makeig et al. [25]). To achieve our goal, we select proper processing procedures and mathematical tools, including factor analysis (FA) and ICA, to the experimental recordings. This builds on the assumption that within a short timescale the ECoG recordings are approximated by an instantaneous linear generative model that is corrupted by additive noise. The LEPs of interest and other instantaneous brain activities are assumed to be mutually independent. To blindly separate the sources of interest (i.e., LEPs), we first resort on a dimensionality reduction procedure followed by an efficient and robust ICA estimation method. In addition, with eigenvalue decomposition, an energy ratio threshold is defined to reject nonsignificant components, which are regarded as the interfering noise from the raw ECoG recordings. The values of these two statistical analysis methods have been demonstrated in both averaging and single trials (e.g., Cao et al. [15, 16]). In terms of single-trial analysis, wavelet-based time-frequency analysis is also used to assist the quantitative analysis of Z-score transformed power across different frequency bands. Whilst these statistical methods are not new the contribution of this paper is to integrate these methods with careful computational procedures and present a systematic study of the ECoG recordings for their LEP characterizations, which might offer some insights for the neurophysiological or clinical practice. To our best knowledge, we are in the first position or for the first time, to employ the statistical ICA tools to pain-related ECoG recordings. We describe the computational modeling and analysis in detail and present some interpretations and discussions from our experimental results. On the one hand, we strive to relate the results to the reported neurophysiological observations in the literature; on the other hand, we also pinpoint several interesting findings and observations in our single-trial data analysis.

2. DATA

2.1. Recordings

To obtain the ECoG recordings, special grid electrodes were implanted on the cortical surface of the subjects (i.e., patients for surgical treatment of epilepsy). The grid consisted of platinum-iridium circular electrodes (2.3mm diameter) with a center-to-center distance between electrodes of 1cm (Ad-Tech, Racine, Wis, USA). The LEPs were recorded with the implanted grid electrodes over the SI, PS, and MF regions; see Figure 1 for an illustration. During recordings, the subjects wore goggles and reclined on a bed, quietly wakeful with eyes open. Painful heat stimulation was delivered to the contralateral hand dorsum (contralateral to the grid) by a Thulium YAG laser (Neurotest, Wavelight Inc., Starnberg, Germany). The duration of each pulse was 1 millisecond and the beam diameter was 6mm. Laser energy level was determined to produce a painful sensation of 3-4/10 on a decimal scale (with 0 denoting no pain, and 10 denoting the most intense pain). The ECoG signals were recorded with sampling frequency 1000Hz. The recordings were carried out at the Johns Hopkins Hospital between 1999 and 2003 (Ohara et al. [7-9]). The protocol was reviewed and approved annually by the Institutional Review Board of the Johns Hopkins Hospital and all subjects signed an informed consent for the studies.

Figure 1

The implanted electrodes' layout; the somatosensory cortex that is associated with the sensation of the pain is located in the parietal lobe of the brain. (a) subject 1; (b) subject 2 (where CS and SF correspond to no. 8 and no. 64 channels, resp.); (c), (d) implanted grids imposed on the reconstructed 3D magnetic resonance images of two subjects. Note that the number of implanted grids shown on the 3D images is more than the number of the available channels shown in Table 1; because of the limitation in data acquisition, only a subset of the grids were selected (CS: central sulcus; SF: sylvian fissure; CiS: cingulate sulcus; MCiS: marginal CiS).

2.2. Subjects

For the purpose of presentation clarification and due to space limit, we have chosen two human subjects in the current study. The statistics of the recording setup regarding the selected two subjects are listed in Table 1. Specifically, the first subject was a 21-year old woman with medically intractable seizures since age 10; her neurological examinations and brain magnetic resonance images (MRIs) were normal. Subdural electrode grids were planted over the frontal-central-parasylvian cortex (no. 1–64 channels) and the medial wall of the left hemisphere (no. 65–80 channels). The second subject was a 21-year old man with complex partial seizures since age 4, whose MRI showed a small cavernoma in the right parietal lobe (contralateral to the side of the implantation). The ECoG signals were recorded from the left fronto-parietal lobe (64 channels) and medial frontal lobe (16 channels). All the signals were recorded with reference to one intracranial electrode.

Table 1

Summary of the experimental recordings of two human subjects.

Subject	Condition	Laser intensity	No. of electrodes	No. of runs	No. of trials at each run
1	Attention	720 mJ	89	3	38, 38, 38
1	Distraction	720 mJ	89	3	38, 38, 38

2	Attention	720 mJ	80	2	38, 38
2	Distraction	720 mJ	80	2	38, 38

2	Intensity	480 mJ	80	4	8, 12, 10, 10
2	Intensity	640 mJ	80	4	11, 11, 12, 10
2	Intensity	800 mJ	80	4	10, 10, 10, 10

2.3. Experimental paradigm

There are two types of experimental protocols designed for subjects: attention/distraction , and intensity . In the attention condition, the subject was asked to count the number of painful stimuli and to report both that number and the average pain intensity after each run of laser pulses; in the distraction condition, the subject read a magazine article and answered questions about it after the run. In these two conditions, constant level of laser intensity was used for the subject, and 38 laser pulses were delivered with an interstimulus interval that was randomly varied between 50 and 10 seconds within each run. Additionally, in the intensity experiment, varying levels of laser stimuli were delivered to the subject, and the subject was asked to rate the subjective pain sensation according to the decimal scale.

2.4. Filtering

Upon loading the raw ECoG recordings to the computer, the data were amplified and band-pass filtered at 0.1–300Hz (Astro-Med, Inc., West Warwick, RI, USA). Subsequently, we conducted a simple notch filtering procedure to filter out the AC components of power supply (60Hz).

3. MATHEMATICAL MODELING AND ANALYSIS

3.1. Generative model

The experimental data are assumed to be generated by a probabilistic generative model that is described by two equations as follows: where t denotes the time index. Equation (1) is essentially a factor analysis (FA) model, where ∈ ℝ is the hidden variable called “factor,” the m × n matrix B is called the “loading matrix, ” x ∈ ℝ denotes the observed multi-channel signals measured in the electrodes, ∈ ℝ denotes the constant mean vector that is often assumed to be zero, and ε ∈ ℝ denotes the additive uncorrelated noise that corrupts the measurements. Equation (2) describes a linear mixture model that is related to the blind source separation (BSS) problem of our interest, where ∈ ℝ denotes the independent source signals originated from the brain, A denotes a linear mixing matrix that roughly models the mixing process and the stationary propagation or scattering effect within a short timescale (say, 200 to 600 mesc); and the mixed signals consist of the hidden factor obtained in (1). In the current setting of this paper, we assume m > n = N. No doubt that the generative model described by (1) and (2) is somewhat oversimplified for the ECoG data. However, we believe that the instantaneous linear mixing model is rather reasonable at a short timescale and therefore can be used in the first step. In addition, we assume that matrices and are constant within the a short duration of measurements. Now, the statistical estimation problem is to infer the independent sources given the observed . We will tackle this problem via these two statistical tools as described below. Notably, similar methodology has been applied to MEG or EEG recordings with successes in some other real-life recordings (e.g., Cao et al. [15, 16]).

3.2. Factor analysis

Without loss of generality, we assume that μ = 0, and the factor variables satisfy 𝔼[ ] = 0 and 𝔼[ ] = C , where C is the covariance matrix; and the noise is Gaussian distributed with zero mean and covariance matrix Σ, which we denote by ε ∼ 𝒩(0,Σ). In light of (1), we have If z is Gaussian distributed, then x is also Gaussian distributed. If we further restrict that the factor z is whitened, then C = I (where I denotes the identity matrix); this assumption is reasonable since we can always scale the loading factors B to satisfy the original model equation. Typically, dim⁡(x) > dim⁡(z), therefore FA is also a dimensionality-reduction method. A close examination of our experimental multielectrode recordings indicates that there are strong correlations between adjacent electrodes, which therefore justifies the necessity of dimensionality reduction. From a probabilistic point of view, we can write p(z ) = 𝒩(0,I), then p(x ) = 𝒩(0, BB + Σ). Under the Gaussian assumption of the factor analyzer, the posterior probability p(z | x ) is also Gaussian, with mean and covariance, respectively, defined by Now the goal of FA is to estimate the unknown matrices B and Σ, given the observed data {x }. In the literature, two types of estimation procedures can be employed.

Maximum likelihood estimation

By deriving the log likelihood function (see the appendix) with respect to the unknown variables, we can use iterative optimization procedures, such as the gradient ascent or expectation-maximization (EM) algorithm, to obtain the optimal solution. Upon obtaining the maximum likelihood estimates of B and Σ, we can further calculate the hidden factor z by (4).

Least-squared estimation

Given observed samples {x } , we can calculate the sample covariance matrix (assuming zero mean) and conduct its eigenvalue decomposition (EVD) as follows:where U is the m × m orthogonal matrix that consists of eigenvectors as its column vectors, Λ is a diagonal matrix that consists of the diagonal entries as eigenvalues. Note that when the noise is zero or the noise is negligible and has a diagonal covariance matrix, then FA reduces to PCA as a special case. Upon PCA, we can empirically estimate the noise covariance. Let U denote an m × n matrix that consists of the first n dominant eigenvectors, then we can estimate the noise covariance by and estimate the loading matrix byFinally, the factor variable z is produced by a linear transformation: where . Note that in this case, the dimensionality of z can be determined by PCA with dimensionality reduction, whereas the remaining components are considered to be “significant” in terms of variance or energy contribution.

3.3. Independent component analysis

Upon performing the model reduction using FA, we further aim to apply the blind source separation (BSS) approach, using the tool of ICA (e.g., Cichocki and Amari [27]), to recover the hidden sources in (2). Roughly speaking, ICA is built upon the assumption that the hidden sources in s are mutually independent and subject to an instantaneous linear mixing. There are many ICA/BSS algorithms available in the literature. To our interest, two kinds of batch (i.e., noniterative) ICA/BSS algorithms are considered.

Time-domain method

Specifically, we focus on the BSS algorithms based on generalized EVD of the time-delayed cross-correlation matrices or cumulant statistics, such as the SOBI (second-order blind identification) and JADE (joint approximate diagonalization of eigen-matrices) algorithms. These methods are fast and noniterative (thereby independent of the initial conditions). In our experiments, we have tried and compared the SOBI and JADE algorithms, and found that their results were qualitatively similar. However, JADE is more desirable and preferred since it incorporates higher-order statistics.

Time-frequency method

Specifically, the source separation criterion of this method is conducted in time-frequency domain based on joint diagonalization of the spatial time-frequency distribution (TFD). A representative example is the algorithm described by Févotte and Doncarli [28]. This method is more intuitively appealing (by taking into account of the information in both time and frequency) and has been demonstrated to be robust to noise. Notably, although the hidden factor z is whitened (with zero mean and unit variance), it is still likely that the mixing matrix is ill-conditioned, which thereby makes the estimation of its inverse (or Moore-Penrose pseudoinverse), the demixing matrix W = A −1 (or W = A † ≡ (A A)−1 A ), rather difficult, especially in single-trial experiments. One way to overcome this problem is to conduct a two-stage ICA procedure. The essence of the two-stage ICA is as follows: the role of the first-stage ICA is “rough tuning,” which produces a guess (or poor estimate) of the ill-conditioned mixing matrix; and the final “fine tuning” job is accomplished by the second-stage ICA routine. The trick of such a two-stage ICA often helps to recover the hidden components in many ill-conditioned scenarios if it is not the case, the second-stage ICA simply produce improved or similar results as in the first-stage ICA. The significance of the (uncorrelated or independent) components is determined by their relative energy (or variance). Physiologically, we believe those sources that have relative great energy are more meaningful in terms of repeatability. In practice, selecting the number of principal components is done by EVD followed by a threshold selection. In our experiments, five to eight principal components were typically selected, which account for about 97–99% of the total energy. Specifically, let 9 = diag{λ 1,λ 2,…,λ n} denote the diagonal matrix that contains the nondecreasing eigenvalues λ 1 ≥ λ 2 ≥⋯≥λ n≥ 0 , the number of significant components, k, is chosen according to the following criterion: in which the threshold Th was empirically set as 0.97; the nonnegative eigenvalue indicates the relative significance of specific component in terms of its energy contribution.

3.4. Identification of interested source by deflation

Let y (1) = W (1) z and y = W (2) y (1) denote, respectively, the first- and second-stage ICA unmixing equations, where W (1) and W (2) denote the associated unmixing matrices; then the final unmixed signals, y , can be estimated aswhere W = W (2) W (1) denote the global (combined) unmixing matrix. Notably, each column of W −1 contains the relative strengths of a source component at the individual scalp electrodes, which can be used to identify the interested source component. Given the estimated y = [y 1(t),y 2(t),…,y n(t)], we can also reconstruct the partial hidden factor by projecting the ith component of y , denoted by y (t), backward onto the subspace where [W †] denotes the ith column vector of the matrix W †. Furthermore, we can reconstruct the specific source of interest in the observed data space (i.e., the scalp signals contributed merely to the ith source) By projecting to the original channels' positions (i.e., the 8 × 8 electrode layout), we essentially identify the source(s) of interest. It should be noted that the “source identification” here is only limited to the two-dimensional scalp surface, and does not refer to localization of the three-dimensional spatial position of the “voxel.” In addition, in order to evaluate the relative contribution of every electrode to the extracted independent component (especially for the LEP), we need to consider the joint effect of and W. For this purpose, we may also calculate the weighted estimate of the sensor space as follows: where ⊙ denotes the Hadamard (elementwise) product, w = [w 1,w 2,…,w n] denotes the ith row vector of the matrix W, and is the back-projected sensor space from the ith independent source via (13). As a distinction, we call the reconstructed in the sensory space as “unweighted map” and the reconstructed in the sensory space as “weighted map.” Notably, because of the degeneracy of W, the “weighted map” is subject to the scaling and algebraic sign uncertainties.

3.5. Time-frequency analysis

In addition to analyzing temporal signals, we also resort on time-frequency analysis tools (such as the short-time Fourier transform, or Wigner-Ville distribution, and wavelet transform) to extract more information for quantitative comparisons. Specifically, wavelet analysis is appealing and considered superior to the short-time Fourier transform for nonstationary signals, including EEG (e.g., Mallat et al. [29]; Tallon-Baudry et al. [30], Düzel et al. [31], Mouraux et al. [12], Ohara et al. [7]). Here, we choose the continuous wavelet transform for our purpose because of its adaptive time-frequency analysis via multiscale decomposition. However, because of the uncertainty principle , in order to obtain a good frequency resolution, sufficient time samples are required. In the experiments, we will use the Wigner-Ville distribution for an illustration purpose, while in the quantitative analysis we will use the continuous wavelet transform. For a temporal signal x(t) (i.e., the raw recordings from one electrode channel), the power of its continuous-time wavelet transform is described bywhere ∗ denotes convolution product between the signal and the mother wavelet function, and ψ(t,ω 0) is a complex-valued Morlet mother function:where , and σ is the bandwidth parameter. The width of the Morlet wavelet, defined by 2π σ ω 0, is set to 7 in our study. The central frequency ω 0 ranges from 1 to 60 Hz in steps of 1 Hz. To analyze the specific temporal window of interest, we select a 100-millisecond prestimulus period and a 500-millisecond poststimulus period, with a total window length 600 milliseconds. To compare the power change between the prestimulus and poststimulus periods, we need to introduce some “relative” measures to obtain a baseline for the poststimulus power. This is important because we are not interested in the “absolute” power statistic per se , but interested in the stimulus-induced relative power change. In the literature, the measure of event-related band power change (ERBP) was defined as (e.g., Ohara et al. [7]) where m(ω 0) denotes the median power envelope during the prestimulus period. Alternatively, we can use another measure, which we refer to as “ Z-score transformed poststimulus power,” by using the Z-score transformation (e.g., Browne and Cutmore [20]):where μ(ω 0) and σ(ω 0) are, respectively, the mean and standard deviation of the power in a specific channel band (with center frequency ω 0) during the prestimulus period. The motivation of (18) is to introduce baseline power values across different frequency bands for the poststimulus power statistics, which are used for standardized comparisons. In doing so, the low-amplitude component at high frequency will be highlighted, which also makes the time-frequency atom in the gamma (>32 Hz) band more visible. See Figure 2 for an illustrative example. Note that the Z-score power value can be negative; the positive values indicate the event-related synchronization (ERS), and the negative values indicate the event-related desynchronization (ERD), both between the prestimulus and poststimulus periods. Hence, the Z-score transformation provides a clearer understanding of the time-frequency map (in terms of relative power change).

Figure 2

The original (upper panel, unit μ V 2) versus Z-score transformed (bottom panel, unitless) wavelet scalogram of one selected channel in a single trial (subject 2, attention task, laser intensity 720mJ). The white dash lines indicate the laser stimulus onset. As seen, the ERS and ERD are highlighted more clearly by the Z-score transformation given by (18).

In some cases, the resulted Z-score transformed poststimulus power will be converted to a two-dimensional time-frequency distribution map, denoted by E(t, ω), and further normalized to unity such that ∬ E(t,ω)d ω d t = 1, which we refer to as the normalized power. In doing so, each time-frequency atom can be interpreted by a nonnegative probability in the time-frequency plane.

4. COMPARATIVE EXPERIMENTS FOR AVERAGING TRIALS

We first apply the above described computational procedure and statistical tools for averaging trials, the signal-trial experiments will be treated later in more detail. The experimental results reported in this section will be illustrated for subject 1; two kinds of conditions, counting and reading, are considered.

4.1. Extraction of laser-evoked potentials

First, we aim at extracting LEPs for the averaging-trial experiments. Specifically, according to the laser onset tag, the ECoG recordings (of all channels) were averaged upon the total number of trials at each run. By doing so, the effect of the visual or muscle artifacts may be greatly reduced. However, it is difficult to identify the LEPs from the averaging ECoG waveforms of all channels (see Figure 3). Not only the peaks of the LEPs are less evident, the averaging waveforms still suffer from noise and artifact corruption.

Figure 3

The averaging waveforms (arbitrary scaling) from averaging trials for both subjects in two tasks. As seen, the averaging evoked potentials are not clearly evident in these plots.

To overcome these issues, we then apply the statistical tools (FA and ICA) to further process these trial-averaging signals. In the averaging-trial experiments for subject 1, we selected five independent components for the purpose of extracting LEPs. These five independent components are considered to be “significant” because they contribute mostly to the averaged ECoG data in terms of variance or energy. Due to the averaging/smoothing effect, one-stage ICA procedure (with the JADE algorithm) was found typically sufficient in the experiments The experimental results for the subject 1, in the time domain as well as in the time-frequency domain, are illustrated in Figures 4 and 5. As observed in the figures, we can extract typical peaks around 150 milliseconds and 200 milliseconds, which might correspond to the hypothetic N2 and P2 peaks (or N2∗ and P2∗ ∗) of LEPs, which we also refer to as N150 and P200, (a), 5(a), 13, 15(a), and 21. Please check. respectively; the other components can be viewed as other significant independent spontaneous brain activities. These findings were confirmed in both attention (counting) and distraction (reading) conditions.

Figure 4

Left panels: five estimated significant independent components (ICs) extracted from averaging-trial experiment of the counting task (attention situation) for subject 1. Right panels: the associated time-frequency distribution (TFD) maps.

Figure 5

Left panels: five estimated significant independent components (ICs) extracted from averaging-trial experiment of the reading task (distraction situation) for subject 1. Right panels: the associated time-frequency distribution (TFD) maps.

Next, we conduct the task of LEP source identification. This is done by back-projecting the ith independent component (i.e., the estimated LEP) back to the observed sensor space. Specifically, the power contour maps of N2 (N150) and P2 (P200) under the attention and distraction conditions are illustrated in Figures 6 and 8, respectively. The results are qualitatively close (but not identical) to the previous study (Ohara et al. [7, 8]), in which the LEP peak was found over the interhemispheric (medial) surface. We also plot the combined contributions of the power contour map for N150 and P200, averaged from 120 milliseconds to 240 milliseconds, as shown in Figure 7. As seen in the figure, in the LEP-N2 (i.e., N150), the greatest brain activities happen around the vertex (Cz)—the upper right corner of the 8 × 8 electrode layout (see Figures 1, 6, and 7), these observations were consistent with our early result (Ohara et al. [7, 8]), as well as other independent findings using EEG and fMRI with a similar setup (e.g., see Figure 1 of Iannetti et al. [5]). Similarly, we also obtained the LEPs' mappings for subject 2 (see Figure 9).

Figure 6

Source identification in the averaging trial of subject 1: the back-projected 8 × 8 (first 64 channels) scaled amplitude contour map of the LEP peak at N150 (the 5th independent source at 150 milliseconds, left panel) and P200 (the 4th independent source at 200 milliseconds, right panel) in the counting task (attention condition).

Figure 8

Source identification in the averaging trial of subject 1: the back-projected 8-by-8 (first 64 channels) scaled amplitude contour map of the LEP peak at N150 (the 3rd independent source at 150 milliseconds, left panel) and P200 (the 4th independent source at 200 milliseconds, right panel) in the reading task (distraction condition).

Figure 7

Left panel: the “weighted” map of LEP-N150 (compared to the “unweighted” map the left panel of Figure 5) from (14). Right panel: the back-projected 8-by-8 (first 64 channels) power (i.e., the absolute value of the amplitude) contour map of the two LEPs, N150 and P200, averaged between 120 milliseconds to 240 milliseconds (subject 1, attention condition).

Figure 9

Source identification in the averaging trial of subject 2: the back-projected 8-by-8 (first 64 channels) scaled amplitude contour map of the LEP peak at N2 and P2 in the counting (top 2 panels) and reading (bottom 2 panels) tasks.

4.2. Relative power

We compare the time-frequency distribution (TFD) power between the prestimulus and poststimulus periods. The averaged total power (per channel) and the averaged power (per channel) of specific frequency bands, including theta (4–7.5 Hz), alpha (8–12Hz), beta (12.5–32Hz), and gamma (32–60Hz), are all calculated. In Table 2, we summarize the statistics of two subjects under the attention (counting) and distraction (reading) conditions. The corresponding scatter plots of prestimulus and poststimulus power (of selected frequency bands) of all channels are shown in Figures 10 and 11.

Table 2

The relative power comparisons between prestimulus period (100 milliseconds) and poststimulus period (500 milliseconds) in the averaging-trials. The statistics are averaged over the total number of channels (namely, divided by 89 and 80 for subjects 1 and 2, resp.) and the relative time period. The values are unitless, reflecting the ratio among the normalized energy of the time-frequency map.

	Subject 1	Subject 1	Subject 2	Subject 2
	(counting)	(reading)	(counting)	(reading)
Ave. prestimulus total power	0.0667	0.0796	0.0748	0.0969
Ave. prestimulus θ power	0.0160	0.0265	0.0175	0.0333
Ave. prestimulus α power	0.0048	0.0105	0.0112	0.0102
Ave. prestimulus β power	0.0047	0.0078	0.0136	0.0087
Ave. prestimulus γ power	0.0021	0.0019	0.0026	0.0025

Ave. poststimulus total power	0.1867	0.1841	0.1850	0.1806
Ave. poststimulus θ power	0.0486	0.0617	0.0539	0.0502
Ave. poststimulus α power	0.0074	0.0124	0.0154	0.0160
Ave. poststimulus β power	0.0051	0.0073	0.0134	0.0104
Ave. poststimulus γ power	0.0024	0.0024	0.0033	0.0023

Figure 10

The scatter plots of prestimulus and poststimulus power comparisons in averaging trials for 89 channels (subject 1, left: counting task, right: reading task).

Figure 11

The scatter plots of prestimulus and poststimulus power comparisons in averaging trials for 80 channels (subject 2, left: counting task, right: reading task).

From Table 2, several observations are noteworthy. The power in the poststimulus period is generally greater than that in the prestimulus period, which is obviously evidenced in terms of total power, θ and α power. The θ power increase (or ERS) is relatively more pronounced in the attention condition than in the distraction condition. The β power remains roughly the same level after the laser stimulus, regardless of the undertaken tasks. The γ power is typically small in all conditions, with slightly greater value in the attention condition than in the distraction condition. It is noteworthy that the above observations are consistent with the findings reported in neuroscience and neurophysiology (to name a few, Bromm and Lorenz [1], García-Larrea et al. [2], Ohara et al. [7]). Although the statistics summarized in Table 2 are calculated based on the averaging trials, statistical test (see the next subsection) on single trials also reveals statistical significance.

4.3. Statistical hypothesis testing

In order to evaluate the results of the averaging trials, we conduct some statistical hypothesis tests in order to confirm the “statistical meaning” of the extracted LEPs. This procedure is necessary because the result of the extracted LEPs in the averaging trials does not tell anything in statistical sense about each single trial; namely, we need to be sure if the results we are tempted to interpret are due to random effects from averaging, or due to the consistent causality in all or most of individual single trials. Two popular hypothesis testing methods we consider here are the ANOVA (analysis of variance, or F-test) and Mann-Whitney test (or U-test). In our experiments, we first use the Mann-Whiteny test to calculate the so-called P-value. Second, we also apply a logarithm transformation of the raw samples in attempt to obtain the Gaussianity (i.e., the raw samples are lognormal distributed, as confirmed by the Shapiro-Wilk test), and then apply the ANOVA to calculate the P-values. To conduct the statistical tests, we apply the estimated unmixing matrix W from the averaged trial to each single trial; then we obtain the surrogate “single-trial LEP” for individual single trials, for either LEP-N150 or LEP-P200. For a specific LEP component, we expect that there is a consistent and significant difference between the prestimulus and poststimulus periods in terms of their absolute values. In our case, the statistical test was conducted in the time-frequency domain. For instance, for the LEP-N150 (or LEP-P200), according to its time-frequency map, we empirically choose a window around the maximum power value (i.e., the magnitude) in the time-frequency map, and further conducted the Mann-Whitney test for each extracted LEP component in all single trials, in which the comparison was done in the time-frequency domain. Specifically, we compared the average mean of the power value inside the time-frequency window centered around the maximum point (which in the time domain corresponds to the extracted LEP peak) with that of the prestimulus period (with the same time-frequency window size), both averaged across all frequency bins. Consequently, we may expect that the signal amplitude in the region of interest is significantly greater than that of the baseline. The statistical hypothesis testing results are summarized in Table 3, and the corresponding boxplots are shown in Figure 12. As seen in the table, the P-values of U-test are all smaller than .05, and three of them are much smaller than .01, consequentially, they are statistically significant. For the sake of completeness and sanity check, we also calculated the P-values that are not associated with the LEPs, (i.e., the other independent components extracted from the averaged-trials), we have consistently observed that their P-values are greater than .2 (around .2∼.6); hence, we can conclude that these non-LEP components obtained in the averaged trials are ascribed by the random effect that is not consistent in each single trial.

Table 3

Statistical hypothesis testing statistics of various extracted LEPs in averaging trials for subject 1. The Mann-Whitney U-test was applied to the “absolute value” of the raw samples, and the ANOVA F-test was applied to the logarithm transformation of the absolute value of the raw samples. The N/A implies that the samples are neither normally nor log-normally distributed and therefore cannot be used for ANOVA.

P-value	Counting	Counting	Reading	Reading
P-value	(N150)	(P200)	(N150)	(P200)
U-test	.0029	.0013	7 × 10⁻⁵	.0269
F-test	.0003	N/A	6 × 10⁻⁵	.0183

Figure 12

Boxplots of the absolute value of raw samples, together with their Mann-Whitney test P-values on the counting (left panel) and reading (right panel) tasks (subject 1, laser intensity 720 mJ).

5. QUALITATIVE AND QUANTITATIVE ANALYSES OF SINGLE-TRIAL RECORDINGS

The averaging-trial experiments and statistical tests described above present an informative baseline and guideline for further single-trial experiments. As we mentioned earlier, it is well known that by averaging the ECoG recordings, we might lose some valuable information due to cancelation. For this reason, single-trial experimental findings would be also interesting. Nevertheless, single-trial analysis is more challenging because of the random background activities and artifacts; hence, obtaining consistent yet interpretable results is quite difficult. To succeed, we may require additional care or more sophisticated processing. Table 4 lists the operation comparisons between the averaging and single-trial analyses at each stage of procedure.

Table 4

A comparison of main operations between the averaging and single-trial analyses.

Routine	Averaging trials	Single trials	Purpose
Averaging	Yes	No	Smoothing
FA+PCA	Yes	Yes	Noise and dimensionality reduction
first-stage ICA	Yes	Yes	Extracting independent sources
second-stage ICA	No	Optional	Fine tuning of the sources
Source identification	Yes	Yes	Locating the LEPs of interest
WVD	Optional	Optional	Visualization
Wavelet transform	Yes	Optional	Z-score transform
Statistical test	Optional	Optional	Testing hypothesis

Notably, in contrast to the averaging-trial experiments in which the artifact effects are greatly reduced, strong artifacts may exist in the single-trial experiments. In practice, artifacts (often with low-frequency components) are sometimes observed by visual inspection. In this case, we will be cautioned about using these “bad” channels. A simple solution is to discard them or average with their neighboring channels. Selection of bad channels is often assisted with the reference of averaging trials. For instance, channels with extremely high amplitude and low frequency are generally regarded as eye movement artifacts. Since the FA/ICA statistical methods described above can somehow reduce these effects, hence only those channels with obvious artifacts were removed in the experimental procedure. In the sequel, we will conduct qualitative and quantitative comparisons of single-trial recordings for different measurements listed in Table 1.

5.1. Setup

In single-trial experiments, the number of independent components usually varies from trial to trial (for the purpose of extracting LEPs), and we typically choose the number between 5 and 8. This is because in individual single trials, some small-amplitude but potentially important components at high frequency may play a crucial role, which is also interesting to observe. For the same purpose, we will use the two-stage ICA procedure (JADE algorithm followed by TFD joint diagonalization) described earlier in Section 3. Upon extracting the LEP of interest, we further identify the LEP localization in the sensor space and focus on the analysis on one specific channel (in contrast to the analysis of all channels in the averaging trials). Specifically, we will examine the single-trial recordings under attention and distraction conditions, as well as the statistics of the LEP attributes (latency and amplitude) with varying pain levels (i.e., given different laser intensities).

5.2. Single trials versus averaging trials

In the single-trial experiments, we apply the above-described procedure with the goal of extracting the LEPs under different conditions, and the results obtained in the averaging trials are considered to be the baselines for qualitative comparison. Typically, not all single trials have good quality recordings compared with the averaging trials. Here we show a few successful examples that are capable of identifying the markers of the LEPs. Notably, in our experiments, it was observed that the LEP-N2 can be easily identified, while the LEP-P2 is more difficult to separate. See Figure 13 for two illustrations under different setting conditions.

Figure 13

The extracted independent components (including LEPs) in signal-trials experiments for counting (left) and reading (right) tasks (subject 1, laser intensity 720 mJ).

In order to evaluate the variability between different single trials, we apply the estimated demixing matrix W obtained from averaging trials to all individual single trials, by which we obtain a set of LEP components for N150 and P200 (one pair for each single trial). Furthermore, we may use the available tools of the EEGLAB toolbox (http://www.sccn.ucsd.edu/eeglab; Delorme and Makeig [32], Delorme et al. [33], Makeig et al. [25]) to visualize the event-related spectral perturbation (ERSP) and the intertrial coherence (ITL) for the specific LEP components, as well as the cross-coherence between the independent LEP components. Specifically, the ERSP shows the spectral power change from prestimulus baseline (in dB) relative to the stimulus onset; and the ITL measures the consistency or reproducibility of the phase of stimulus-locked trial activity in the selected independent components. For instance, see Figure 14 for an illustrative example of two LEP components obtained from the attention task (recalling Figure 4). As seen in the figure, the cross-coherence magnitude (from 0 and 1) indicates the degree of synchronization between two independent LEP components, and the cross-coherence phase (from −180 to 180 degree) indicates that the LEP-N150 component is leading ahead of the LEP-P200 component.

Figure 14

Left and middle panels: event-related (log) power spectral perturbation (ERSP, in dB, top row) and inter-trial coherence (ITC, from 0 to 1, bottom row) changes time locked to the LEP components in single trials (subject 1, attention task). Right panel: cross-coherence between LEP N150 and P200, with magnitude plot (from 0 to 1; top row) and phase plot (from −180 to 180 degree; bottom row).

5.3. Attention versus distraction

For subjects 1 and 2, consistent alpha waves were found among many (but not all) single trials in the reading task (i.e., distraction condition); whereas in the counting task (i.e., attention condition), the significant alpha component was not observed in most of single trials. In some reading tasks, no obvious LEP was identified, while the dominant alpha waves can be observed. See Figure 15 for an illustration. In such cases, since there are no clear LEP peaks being observed, it remains an open question that whether this phenomenon is ascribed to “ habituation to the pain ” or “ loss of attention .” The reason that alpha rhythms appear frequently in the reading task might be due to the fact that the subject was in a relatively relaxed mood (especially compared with the counting task).

Figure 15

Left panels: the 5 estimated sources extracted from a single-trial experiment of the reading task (subject 2). The 5th independent source contains typical alpha waves. Right panels: the corresponding time-frequency representation.

In addition, we also measure the coherence of signal-trial ECoG data under different conditions. In Figure 16, the coherency of alpha (8–12Hz) and beta (12.5–32Hz) bands between pairwise channels during the poststimulus period is illustrated. In order to visualize the coherency, putting all connections in one plot will be informative. Specifically, the complete 8-by-8 layout illustrates the first 64 electrodes' positions; at each electrode's position, we also plot aN 8-by-8 contour plot that represents the pairwise coherence between a specific electrode and the other electrodes, in which the specific electrode is marked by a relatively big filled circle. As seen, typically there is strong coherence in the range of neighboring electrodes. Comparing Figure 16(a) with Figure 16(b), and Figure 16(c) with Figure 16(d), we can observe that there is stronger coherence in the alpha and beta bands in the distraction condition than in the attention condition.

Figure 16

Pairwise coherence maps between the first 64 channels (subject 1, laser intensity 720 mJ) averaged over all single trials within a duration of 800 milliseconds in poststimulus period. (a) alpha-range coherence in the counting task; (b) alpha-range coherence in the reading task. (c) beta-range coherence in the counting task; (d) beta-range coherence in the reading task.

5.4. LEP-component power versus laser intensity

For the same human subject in a series of single trials, it is expected that varying the level of stimuli (by changing the laser intensity), the amplitude and latency of the LEPs will consequently vary, so does the power of the LEP components in the time-frequency map. For this purpose of analyzing the power of LEP components at difference frequency bands, we have conducted quantitative and comparative analysis for subject 2 under varying controlled conditions. The power statistics are summarized in Table 5. It should be noted that the power values in Table 5 refer to the “ Z-score transformed” poststimulus power according to (18), and all the values are averaged over the total number of trials in each run. The statistics are calculated for the first 64 channels including the one that has the highest power contribution (no. 14 channel) at each trial. The scatter plots of the Z-score transformed power for theta, alpha, beta, and gamma bands are shown in Figure 17. In the off-diagonal subplots, the scatter plots of cross-band power are shown; whereas the diagonal subplots show the histograms of the power distribution in the relative frequency bands. In each subplot, the correlation coefficient between the power across different frequency bands is also calculated (stored in matrix C), as well as the associated P-values for the student's t-test (stored in matrix P). As seen, with different levels of laser intensities, the Z-score transformed power across different bands is correlated to certain degree: as the laser intensity increases, the degree of correlation at certain frequency bands (e.g., between theta and alpha) tends to decrease. A cut-off correlation coefficient of 0.7 was considered as a sign of significance. Each P-value indicates the probability of testing the hypothesis of no correlation, or the probability of getting a correlation as large as the observed value by random chance, when the true correlation is zero. If P(i, j) is small (say, less than 0.05), then the correlation C(i, j) is statistically significant.

Table 5

Comparative statistics of the relative power of the normalized wavelet scalogram followed by Z-score transformation (for subject 2, no. 14 electrode) in single-trial analysis. The mean and standard deviation (mean ±SD) statistics are calculated by averaging the number of trials in each run.

Run	No. of trials	Intensity	θ power	α power	β power	γ power
1a	8	480 mJ	0.10 ± 0.76	0.24 ± 0.76	0.33 ± 0.70	0.27 ± 0.62
1a	11	640 mJ	0.22 ± 0.83	0.09 ± 0.48	0.03 ± 0.30	0.10 ± 0.49
1a	10	800 mJ	0.57 ± 2.92	0.09 ± 0.72	−0.03 ± 0.39	0.14 ± 0.67

1b	12	480 mJ	0.38 ± 1.63	0.19 ± 0.73	0.25 ± 0.44	0.22 ± 0.40
1b	11	640 mJ	0.58 ± 0.91	0.22 ± 0.57	0.08 ± 0.58	0.05 ± 0.48
1b	10	800 mJ	0.66 ± 1.23	0.59 ± 0.79	0.47 ± 1.39	0.41 ± 0.92

2a	10	480 mJ	0.29 ± 0.61	0.40 ± 0.57	0.53 ± 1.26	0.43 ± 0.66
2a	12	640 mJ	0.76 ± 1.62	0.35 ± 0.93	0.21 ± 0.66	0.11 ± 0.61
2a	10	800 mJ	0.77 ± 2.11	0.24 ± 0.69	0.12 ± 0.33	0.36 ± 0.51

2b	10	480 mJ	−0.16 ± 0.30	−0.04 ± 0.31	0.17 ± 0.28	0.31 ± 0.38
2b	10	640 mJ	0.30 ± 0.93	0.34 ± 0.85	0.28 ± 0.84	0.05 ± 0.44
2b	10	800 mJ	0.64 ± 1.56	0.21 ± 0.59	0.10 ± 0.27	0.20 ± 0.33

Figure 17

The scatter plots (using the MATLAB function “plotmatrix”) of the Z-score transformed power for the theta, alpha, beta , and gamma bands: (a) 480 mJ, (b) 640 mJ, (c) 800 mJ, each based upon 40, 44, and 40 single trials, respectively. At each panel, the diagonal plots show the histograms of Z-score power of the associated frequency bands (from left to right, theta, alpha, beta, and gamma); the off-diagonal plots show the scatter plots of Z-score power across different frequency bands. Matrix C contains the correlation coefficients, and matrix P contains the associated P-values from the student's t-test.

From our data analysis, several observations are noteworthy. Compared to the prestimulus period, the power across different frequency bands in the poststimulus period mostly (or in majority) increases, as evidenced by the positive mean values of the Z-score transformed (relative) power, although their standard deviations are relative large. In one specific run, the general trend is that the Z-score transformed θ power increases as the laser intensity increases; it seems that no general rule can be found for α, β, and γ power among our experiments. In different runs (i.e., 1a, 1b, 2a, 2b), the mean power statistics with the same laser intensity often vary. This is not unreasonable because in each run the conditions of the subject may be different; in addition, the (random) order for presenting the laser stimuli is also different in each run (see Figure 18), their overall effects (say, e.g., between 480 → 640 → 800 and 640 → 800 → 480) would be certainly distinct. Such a “hysteresis” phenomenon is well known in psychology and psychophysics. In an effort to investigate this phenomenon, we take the 800mJ intensity level as an example. According to Figure 18, the total numbers of 480mJ, 640mJ, and 800mJ preceding 800mJ are 10, 16, and 14, respectively. In order to compare their effects on the Z-score power, we calculate the means and standard deviations of different frequency bands under these three different conditions (namely, 480 → 800 , 640 → 800 , 800 → 800 ), and the results are shown in Figure 19. It is interesting to observe from the figure that their Z-score power statistics are quite different especially at the low-frequency (theta and alpha) bands. Generally, the Z-score power are highest for 480 → 800 , followed by 640 → 800 , and then lowest for 800 → 800 this is not surprising considering the sensation habituation effect. Statistical tests show at the theta and alpha bands, the pairwise comparison of Z-scored power among three conditions is statistically significant (ANOVA, P < .01).

Figure 18

The graphical illustration of the laser intensity presentation orders at different runs (1a, 1b, 2a, and 2b). Note that the combined 62 trial sequences of “1a + 1b” and “2a + 2b” are of identical order.

Figure 19

The Z-score power comparison of three different hysteresis effects. (a) theta band, (b) alpha band, (c) beta band, (d) gamma band.

5.5. LEP amplitude and latency versus laser intensity

Consistent with the previous studies (Ohara et al. [7, 8]), the peak amplitudes were measured from the baseline value, which was defined as the averaged value during the prestimulus period. Latencies were measured as the time of the peak amplitude (except for the artifact) for each component; and peak was regarded as significant when the peak amplitude was above the mean ± SD prestimulus level. However, in the previous studies, peak amplitudes and latencies were both measured from reproducible, averaged waveforms; here, we attempt to measure the latencies from single trials, while the amplitude will still be measured from averaging (over the trials at each run) because of its strong randomness; and the standard deviation of the amplitude estimate is calculated based on 4 independent runs among the recordings. In the meantime, we will focus the measurements on the first 64 electrodes (channels) for the primary somatosensory (SI) region, while the analyses for the parasylvian and medial frontal (MF) regions are ignored here. As observed in our experiments (Table 6), the averaged amplitudes of the LEPs (for both N2 and P2) increase as increasing laser intensity, except for one case of P2 under the 800mJ condition; however, the mean statistic is also accompanied with a relatively large standard deviation, which reflects the random variations of measurement and/or subject conditions.

Table 6

Comparative results of the estimated amplitudes and latencies of the LEPs (subject 2, under rating condition) from single and averaging trials. The last row indicate the selected number of single trials (by excluding some bad trials) used to evaluate the latencies.

	N2 (SI region)			P2 (SI region)
Intensity (mJ)	480	640	800	480	640	800

Latency (milliseconds)	260	178	122	300	248	171
Amplitude (μV)	−121 ± 18	−125 ± 23	−150 ± 41	112 ± 31	126 ± 55	98 ± 20
No. of trials	28	36	34	22	26	26

In our single-trial experiments, it was found that the latencies of the LEPs vary from trials to trials, evidenced by a large standard deviations (see Figure 20). In addition, by varying the laser intensity, the LEP-N2 and LEP-P2 also exhibit different attributes in terms of latency and amplitude. The corresponding statistics are summarized in Table 6 and Figure 20. Specifically, several observations are noteworthy.

Figure 20

The error bars (mean ± SD) of the estimated latencies (from single trials) for LEP-N2 (left panel) and LEP-P2 (right panel) with varying laser intensities. The dotted lines indicate the estimated latencies from the averaging trials.

As seen in Table 6, the stronger is the laser intensity, the sooner the LEP appears; namely, the value of the LEP latency is smaller. See Figure 21 for two illustrative results.

Figure 21

The 6 estimated sources extracted from averaged trials (subject 2) with different laser intensities (left panel: 640mJ, right panel: 800mJ).

When the laser intensity is small (e.g., 480mJ), it is quite difficult to extract the LEP (either one or two) with the available ICA technique. This is partly because the LEP is so weak that it is overwhelmed in the background “noise” (brain activities). Indeed, it is even difficult to identify the peaks via visual inspection from the averaging recordings. Generally, the amplitude of the LEP is a reflection of the sensation of the pain. Although it seems difficult to discover quantitative relationship between the intensity of the laser beam and the amplitude/latency of the LEPs, it is qualitatively clear that there exists correlation between them, especially when the intensity difference is large. This phenomenon might serve as a useful evaluation tool in the clinical practice. To evaluate the statistical significance of the LEP peak amplitude and latency, we also conduct statistical tests based on their single-trial measurements. We first conduct a robust linear regression fit (using the MATLAB function “robustfit”) between the laser intensity value (regression variable) and the measure of interest (amplitude or latency of the LEP), and then obtain Pearson's correlation statistic r. Next, we calculate the t statistic as follows: where ℓ denotes the number of regression sample pairs. From the t-statistic, we can further evaluate the statistical significance (i.e., P-value) from the t-table. In our case, we found the linear fit for LEP's latency is significant (r = 0.87, P < .05); however, the linear fit for LEP's amplitude is not significant.

5.6. Subjective sensation versus laser intensity

Finally, we follow the procedure of Ohara et al. [34] to analyze the relationship between the subjective sensation (in terms of pain rating) and the laser intensity. Specifically, the subject was asked to rate the pain level in decimal scale (0 no pain, 10 the most intense pain sensation). The mean and standard deviation statistics are calculated based on all single trials given three different laser intensities, as shown in Figure 22. Generally, it is seen that the average subjective pain sensation increases as the level of the laser intensity increases. Statistical tests show significant sensation differences between different levels of laser intensities (ANOVA, P < .001 between 480 mJ and 800 mJ; P < .05 between 640 mJ and 800 mJ). Moreover, we also evaluate the correlation between subjective sensation and LEP amplitude; however, no significant correlation was observed between the pain sensation rating and LEP amplitude for subject 2. We suspect this is partially due to the large variations among the subjective pain rating, even with the same laser intensity (specifically, the mean ± SD of the pain rating value for laser intensities 480 mJ, 640 mJ, and 800 mJ are 0.15 ± 0.70, 1.05 ± 1.49, and 2.10 ± 2.47, resp.). Although our data here seem to suggest that the subjective pain sensation and the objective LEP attribute observation might not be necessarily correlated, we should also be cautioned that the pain is a very complex sensation and is susceptible to many human factors and experimental conditions. Verification of any claim in this matter require more data and careful analysis.

Figure 22

Pain sensation rating at different levels of laser intensity (left panel, averaged over all 124 single trials for subject 2). This is compared with another subject (right panel, averaged over 124 single trials, data from Ohara et al. [34]).

6. DISCUSSION AND CONCLUSION

In this paper, we have used the statistical tools of FA/ICA for extracting and analyzing the LEPs. To our best knowledge, the statistical analysis and quantitative results reported here are among the new (if not the first) reports that apply sophisticated and systematic statistical analyses to the laser-induced pain data in the literature. In both averaging and single trials, we have demonstrated that the pain-evoked event potentials can be extracted and further analyzed with careful design of statistical procedure, and that the ICA/BSS approaches show a promising role in analyzing the multichannel ECoG data recorded from the awake human subjects. Our results here have also validated our previous findings in the early investigations and the reported neurophysiological observations in the literature. This is encouraging in that it justifies the merits of blind signal processing for neurobiological or physiological data analysis. The next challenge of this line of research is to extract consistently less-dominant (in terms of power) and potentially important pain-related components that are beyond the LEPs from single trials, which will be the subject of future study. We have focused on one particular type of blind signal processing tool (namely, ICA) in this paper. However, we make no claim that the choice is unique or optimal. Indeed, we have been aware of the strengthes and weaknesses of the ICA during the experimental investigations (e.g., Makeig et al. [25]), although other improved ICA models, such as the spatially constrained ICA (Ille et al. [35], Hesse and James [36]) or the temporally constrained ICA (James and Gibson [37]), can be considered. It is also noteworthy to point out several other powerful blind signal processing tools and statistical algorithms, whichmight be valuable for the future investigation: nonnegative matrix factorization (NMF) (e.g., Lee and Seung [38]), which is an approximate matrix factorization method for nonnegative data (e.g., spectra, or time-frequency map). Unlike ICA, the independence assumption is relaxed or unnecessary in NMF, on the other hand, extra constraints (such as the smoothness or sparsity) can be imposed for this statistical model. parallel factor analysis (PARAFAC) (e.g., Bro [39]), which is a well-suited method for analyzing highdimensional tensorial data; PARAFAC can be viewed as a generalization of higher-order FA or highdimensional NMF (if additional nonnegativity constraint is imposed). common spatial subspace decomposition (CSSD) (Wang et al. [21]), which is a spatial filtering method for extracting signal components specific to one condition from multichannel electrode recordings given multiple task conditions. This kind of common spatial pattern algorithm may be used for evaluating the ECoG recordings under different task conditions; however, unlike the ICA method, it is a supervised algorithm that uses labeled data for classification. In addition to the above-mentioned statistical tools, it would be also interesting to investigate the instantaneous brain activities and dynamics (Makeig et al. [25]), which may provide useful information of interactions inside the brain for specific patients with ECoG recordings. Finally, we believe what we reported here is only the first step towards a complete “statistical” understanding of the pain-evoked ECoG data, the substantiation of our observations, claims, and conclusionmade in this article would require more experimental verification of ECoG recordings in the future.

29 in total

1. Learning the parts of objects by non-negative matrix factorization.

Authors: D D Lee; H S Seung
Journal: Nature Date: 1999-10-21 Impact factor: 49.962

2. A multivariate, spatiotemporal analysis of electromagnetic time-frequency data of recognition memory.

Authors: E Düzel; R Habib; B Schott; A Schoenfeld; N Lobaugh; A R McIntosh; M Scholz; H J Heinze
Journal: Neuroimage Date: 2003-02 Impact factor: 6.556

3. Cutaneous painful laser stimuli evoke responses recorded directly from primary somatosensory cortex in awake humans.

Authors: S Ohara; N E Crone; N Weiss; R-D Treede; F A Lenz
Journal: J Neurophysiol Date: 2003-11-05 Impact factor: 2.714

4. Attention to a painful cutaneous laser stimulus modulates electrocorticographic event-related desynchronization in humans.

Authors: S Ohara; N E Crone; N Weiss; F A Lenz
Journal: Clin Neurophysiol Date: 2004-07 Impact factor: 3.708