Literature DB >> 30279589

Multivariate analysis of short time series in terms of ensembles of correlation matrices.

Manan Vyas¹, T Guhr^2,3, T H Seligman^4,5.

Abstract

When dealing with non-stationary systems, for which many time series are available, it is common to divide time in epochs, i.e. smaller time intervals and deal with short time series in the hope to have some form of approximate stationarity on that time scale. We can then study time evolution by looking at properties as a function of the epochs. This leads to singular correlation matrices and thus poor statistics. In the present paper, we propose an ensemble technique to deal with a large set of short time series without any consideration of non-stationarity. Given a singular data matrix, we randomly select subsets of time series and thus create an ensemble of non-singular correlation matrices. As the selection possibilities are binomially large, we will obtain good statistics for eigenvalues of correlation matrices, which are typically not independent. Once we defined the ensemble, we analyze its behavior for constant and block-diagonal correlations and compare numerics with analytic results for the corresponding correlated Wishart ensembles. We discuss differences resulting from spurious correlations due to repetitive use of time-series. The usefulness of this technique should extend beyond the stationary case if, on the time scale of the epochs, we have quasi-stationarity at least for most epochs.

Entities: Chemical Disease Species

Year: 2018 PMID： 30279589 PMCID： PMC6168610 DOI： 10.1038/s41598-018-32891-4

Source DB: PubMed Journal: Sci Rep ISSN： 2045-2322 Impact factor: 4.379

Introduction

Non-equilibrium stationary states (NESS) have attracted large amounts of attention in recent years[1-6] but more recently increasing attention is given to non-stationary situations, as they actually cover a wide range of observational as well as of experimental data. Such data cover diverse fields including astronomy, financial markets and meteorology or chemical engineering, fractures and colloids, as well as numerical results for models of such systems and dynamical systems and many others. Among such systems the ones that have several near stationary states with more or less abrupt transitions are of particular interest. Such systems are wide spread and of relevance. They include bi-stable, and multi-stable systems with smooth transitions as well as systems that might run into catastrophic instability. We can think of both types occurring as first order phase transitions under temperature change depending on conditions. Beyond that, we may hope that non-stationary systems may be quasi-stationary over sufficiently short time periods. Yet abrupt non-stationarities may occur and we may hope to obtain either warnings or at least post-event learning from a correlation analysis of known facts over a short time period before the abrupt events. For the sake of illustration, let us think of a chemical reactor that should produce certain end products in a stationary fashion, but in fact the state is only quasi-stationary. This reactor may have other states that produce less of the desired and more undesirable products and a transition might prove costly. Yet this might get much worse if breaking stationarity may lead to explosions with release of toxic substances, that may in addition cause great cost of lives and health, such as in Bhopal 1984[7]. To use a Wishart model as a model for non-stationarity was first put forward in[8] and used for credit risk analysis in[9,10]. Our interest was triggered by studies of financial markets, where the very attempt to define states of quasi-stationary evolution is relatively new[11]. In this paper, the correlation matrix of short time series was detected to be a good basis to specify the states and clustering techniques were used to identify these states. An attempt to detect conditions under which change may occur was not made and may also be futile in this context, as the clustering technique by definition assigned each correlation matrix to a state and thus borders become unclear. One could use different clustering algorithms to detect larger differences in clusterings, but this might depend very much on the definition of distance we use[12]. Time series analysis is rather straight forward for stationary systems, even if these are out of equilibrium (NESS). It can also be extended to analyze non-stationary states using standard detrending techniques to eliminate log time trends and periodic oscillations. These tasks are more challenging if we have many time series. Nonetheless, it becomes much more difficult to identify state changes[11] or have some early signals of catastrophic events[13]. The most usual way out in complicated non-stationary situations is to assume stationarity over short time intervals. Early attempts in this direction analyze the entire correlation matrix. This matrix is not invariant and thus the ordering, i.e. the labeling of the time series, becomes very relevant. While for financial markets there is some ordering that has long standing merits in other cases (e.g. for two dimensional array of detectors) this is no longer the case. If we detect different properties, say pressure and temperature, it is entirely unclear if we should give preference to the spatial distribution or to the different types of measurements to assigned indices in the matrix. The use of sophisticated data handling tools will not remove the arbitrariness of basis dependence. For these reasons (and probably because of heritage from the physics background of the authors), it seems reasonable to choose invariant quantities (not only under permutations but also under orthogonal or unitary transformations) that correspond to linear changes in the measuring devices. The logical choices are eigenvalues and eigenvectors. Obviously, the eigenvectors are basis dependent but they may provide relevant information if the preferred basis is reasonable. We believe that in some sense eigenvalues do indicate very relevant aspects of dynamics and recently it was shown, that this is also true for the correlation matrices. Using Metropolis dynamics the larger eigenvalues of the correlation matrix of a 2-D Ising model at critical temperature, display a power law, that can be directly derived from the power law of space correlations in this system[14]; it was further shown that such a power law will survive if a sufficiently large random subset of time series is used. Yet long time series are essential to see this effect because the number of large eigenvalues rapidly becomes too small as the correlation matrix becomes more and more singular with shorter time series. We define a short time correlation matrix as the one for which the time horizon T (length of the time series) is much smaller than the number N of time series. This obeys to the idea that we can have information on a complex dynamical system if we measure more properties. By definition, the correlation matrix will have only T-1 (except for T = 1) non-zero eigenvalues. Thus, increasing N but not T will not affect the number of non-zero eigenvalues. It is important to mention that, given a N × T data matrix for N time series of length T with N ≫ T, one can always diagonalize the T × T dimensional correlation matrix of the position series AA/N rather than the N × N correlation matrix of the time series AA/T as they have same non-zero eigenvalues. However, as we obtain only T non-zero eigenvalues, we do not obtain good statistics and the analysis is inconclusive - whether the spectral features are noise based or have correlations. Importantly, the correlation matrix of the position series AA/N is capable of detecting lead-lag and other non-Markovian effects[15-18]. One can also increase both N and T keeping N/T fixed. While this limit is nice for theoretical purposes, financial markets, chemical reactions, neuron processes etc. have time scales which determine T. Thus, we have to either return to the idea of treating the entire correlation matrix or to use some technique to obtain a large enough set of eigenvalues to get good statistics. One option is to use the power map technique. The use of the power map originally introduced for noise reduction[19,20] was suggested in[21] as an appropriate tool to detect correlations if powers very near to identity are used, indeed in[14] this can be explicitly seen. Yet while the power map does detect correlations efficiently, a detailed understanding of the correlations via the additional emerging spectrum is not yet available. Importantly, information as to the nature of the correlation at this point has not been given and the power map is not transparent due to its nonlinearity. The method proposed in the present paper is a more natural one. We shall replace the large but singular (N ≫ T) correlation matrix by an ensemble of smaller non-singular correlation matrices. This is achieved by first defining an ensemble of data matrices, i.e. sets of m(≪N) time series, chosen from a large number of random selections among the given N time series (corresponding to the large but singular correlation matrix). While making selections, we ensure that no time series is repeated in a given data matrix and no two data matrices are the same in the ensemble. There are non-trivial dependencies in the resulting matrices due to random choices. Using this ensemble of the data matrices, we construct the corresponding ensemble of non-singular correlation matrices. In principle, the number of members in the ensemble is given by and thus, the number of non-zero eigenvalues we have can now be increased dramatically. Thus, the method provides higher statistical significance by retrieving information from the given singular data matrix. In this way, we explore the entire position space and thus, obtain the distribution for each of the eigenvalues rather than a single number. While probably we don’t retrieve the full information of eigenvectors we thus reach simultaneously two goals - we obtain smooth distributions and obtain the information about the distribution of each of the eigenvalues. This is particularly relevant for outliers. It implicitly gives information about the outliers among the time series, as these will only appear in some of the subsets. We shall focus on the largest eigenvalues in our examples. Keeping in mind that finally there is no more information available than there is in the original data matrix, we arbitrarily choose the size of the ensemble in order to produce good statistics i.e. we smooth the curves. One can also construct an ensemble of completely independent data matrices in the very large N limit to obtain the ensemble of non-singular correlation matrices and we mention this limit on occasions. It avoids some deviations due to the dependencies but reduces the smoothing. In the next section, we describe in detail the construction of the ensemble. In the following section, we will present basic results obtained from supersymmetric calculations to derive the formula for correlated Wishart ensembles with arbitrary correlations. We treat in some detail the case of constant correlation Wishart matrices including the zero correlation case that provides the unbiased a priori hypothesis to which experimental data can be compared to obtain clarification of the data. In situations where average correlations are important, one can also use correlated matrices as a priori hypothesis. We compare numerics with the result obtained from supersymmetric calculations. Here, we will also discuss the differences between the proposed random choice selections resulting in dependencies and completely independent choice for which plenty of analytical results are known. As block structures are important, at least in econophysics, we analyze the special case of block-wise correlated subsets of time series. We compare the supersymmetric results with numerical calculations where we restrict the block situation to two blocks of different size and block-wise constant correlations. We see that the bulk of the spectra is well-described by the analytics while the outliers will only be approximated as far as their average position is concerned. The shape is different as the analytic result we present is for independent time series with large N, T and a fixed ratio κ = N/T. Finally, we give conclusions and an outlook.

Construction of the Ensemble

For short time series, the correlation matrices will be strongly singular i.e. the number of non-zero eigenvalues will be greatly reduced and the eigenfunctions corresponding to zero eigen sub-space are arbitrary. We will now introduce a method to overcome these shortcomings. The building blocks for the correlation(covariance) matrices are rectangular N × T data matrices A = [Aij], with and . Each row in the data matrix A is a time series of length T, measured at usually equidistant times. It can be obtained from observations or experimental measurements of observables like stock prices, temperature, intensity, astronomical observations and so on. The matrix C = AA/T, with A denoting the transpose of matrix A, is the N × N covariance matrix. Wishart matrices are random matrix models used to describe universal features of covariance matrices[22]. We consider the case for real entries , known in the literature as Wishart orthogonal ensemble (WOE). For WOE, the matrix elements of A are real independent Gaussian variables with fixed mean μ and variance σ2 i.e. . In order to arrive at correlation matrices, one needs to normalize μ = 0 and σ2 = 1. In the context of time series, C may be interpreted as the correlation matrix, calculated over stochastic time series of time horizon T for N statistically independent variables. By construction, C is a real symmetric positive semidefinite matrix. For T < N, C is singular and has exactly (N − T − 1) zero eigenvalues. Note that, stationarity improves when short time series are used. In real applications, one needs to understand the role of correlations and thus, correlated WOE (CWOE) models provide the null hypothesis. CWOE is defined by real-symmetric matrices , with . Here, χ is a real symmetric positive definite non-random N × N matrix that accounts for the correlations in time series (rows) of data matrix and . On ensemble average, . We analyze highly singular correlation matrices (N ≫ T) by constructing ensembles of correlation matrices from a given correlation matrix by randomly selecting short observational time series. By randomly choosing m rows out of N given rows of A() such that m = aT with a being a real number close but smaller than unity, we construct an ensemble of m × T dimensional matrices. While making selections, we ensure that no two rows are same in a given matrix and no two matrices are same in the ensemble. Using these, we obtain an ensemble of m × m non-singular correlation matrices and analyze eigenvalue distribution. If the number N of time series available is large compared to the number of entries T in each time series, the discussion of eigenvalues becomes statistically unsatisfactory. A typical example would be financial time series of increments of 40 consecutive closing prices for a selection of N = 400 shares from some index (say a selection from Standard and Poors 500). In this case we would obtain but 39 non-zero eigenvalues (40 for covariance matrices) from the 400 × 400 correlation matrix, which might be all over the place. We propose to select m time series (experimental, observational or computational) with m < T at random. If we allow all different choices, we would end up with a very large ensemble of correlation matrices (in our example, we might choose m = 36 leading to choices, which is an unpractically large number). So we choose a random subset of a few thousand and get excellent statistics for eigenvalues. Having more members in the ensemble would increase the amount of spurious information, which enters unavoidably if we allow repeated time series in different members of the ensemble. If on the other hand we do not allow repetitions the results would depend very much on the selection we make and statistics would be less adequate. An alternative may be to make an ensemble of ensembles with different but totally independent choices, and calculate averages and variances of specific statistical quantities obtained for the lower level ensemble. We choose not to go this more complicated route. The question arises, how stable and informative the corresponding results are. The purpose of the present paper is to take this simple idea and compare it to cases where analytic results can be derived from well-known results[23,24]. We start by analyzing white noise time series and the resulting correlation matrices known as the Wishart ensemble[22,25] as well as for correlated Wishart ensembles with constant correlations[26]. Here, the level densities are known analytically and the n-point correlation function converges to the universal result[25]. Because the case of constant correlations will mimic real situations only very roughly, we shall study in more detail the situations where subsets of time series are more correlated among each other than with the time series of other subsets. This will be the typical case of market sectors of stock exchanges. To emphasize the characteristics of such a block structure, we shall restrict ourselves in graphical displays to two blocks in this paper. We shall see that clear signatures of the correlations (or lack thereof) can be obtained with very good statistics. This distinguishes the present linear method both from the clustering techniques[11] and the power-map technique[21], which are inherently non-linear. The first is a transparent standard technique but requires considerable previous insight into the problem on hand, while the second turns out to be quite stable but interpretation is an open problem.

Supersymmetry Approach

Time series analysis is an imperative tool to study dynamics of variety of complex systems. Wishart correlation matrices are standard models employed for statistical analysis of ensembles of time series. We provide here a brief sketch of the derivation using standard supersymmetric steps; for further details refer to[23,24,27,28]. In multivariate analysis, it is desirable to derive a “null hypothesis” from a statistical ensemble to understand the measured eigenvalue density of the given correlation matrix. The random matrix ensemble we consider is CWOE with arbitrary correlations that gives the ‘empirical’ (population) correlation matrix C0 upon averaging over the probability density function P(A|C0) (normalized to unity),By construction, , with measure being product of differentials of all independent elements in A. It is important to mention that, in the supersymmetric approach, one assumes T ≥ N to ensure invertibility of C0. In order to be able to derive the ensemble averaged eigenvalue density (one-point function), we may replace C0 by diagonal matrix Λ of its eigenvalues since the domain of A is orthogonally invariant. In terms of resolvent, the ensemble averaged eigenvalue density for correlation matrix AA is defined byIn case of CWOE (also WOE), the eigenvalue density for the correlation matrices is derived using the supersymmetry technique[23,24]. In this approach, the eigenvalue density is written as the derivative of the generating function. The generating function in turn is mapped onto a suitable superspace which leads to drastic reduction in degrees of freedom. Then, the eigenvalue density is derived by introducing eigenvalue coordinates for the supermatrix and integrating over the anti-commuting Grassmann variables. The generating function Z as a function of source variable J is the starting point of this approach,Note that x+ = x + iε and Z(0) = 1. The one-point function is then computed by the derivative,The generalized Hubbard-Stratanovich transformation[29,30] and superbosonization formula[31] have been used to express the generating function as an integral over a suitable superspace. In fact, these are equivalent[32]. The determinant in the denominator of Equation (3) can be expressed as a Gaussian integral over a vector in ordinary commuting variables. Similarly, the determinant in the numerator can be expressed as a Gaussian integral over a vector in anti-communting variables. Combining these expressions, we obtain a Gaussian integral over a rectangular supermatrix which is n × (2|2) dimensional,Here, are Grassmann variables. Using this and , in Equation (3) and performing the Gaussian integral over A, we apply the duality relation between ordinary spaces and superspaces , one can then rewrite the determinant as a superdeterminant. Importantly, the supermatrix is 4 × 4 dimensional and the original matrix is N × N dimensional. This dimensional reduction is the advantage of the supersymmetry technique. The left upper block (boson-boson block) of supermatrix is a Hermitian matrix. We now use the generalized Hubbard-Stratonovich transformation to replace the supermatrix by a supermatrix σ with independent matrix elements. For the required power of superdeterminant in the expression for the generating function, we write a super-Fourier representationThe Fourier transform gives a supersymmetric Ingham-Siegel distribution,Here,are supermatrices of dimension 4 × 4 with real-symmetric 2 × 2 diagonal blocks. The off-diagonal blocks are Grassmann variables with the structure(similarly, for ω). The super-integration measure is , where σ0, σ0 are diagonal and σ0 is the off-diagonal elements of σ0. The measure d[ρ] is defined in a similar fashion. Using these and integrating over the supermatrix , the generating function is a supermatrix integral,Here, the matrix is diagonal. For arbitrary small J, using Equations (7) and (8), we have with a Lagrangian given byand we end up with a scalar polynomial equation resulting from the saddle point equation that can be solved numerically,This is the main analytic result of the paper which we test with numerics for different WOE models in the following section. The one-point function is then given in terms of the complex solution, say , of this saddle point equation,in the limit with fixed ratio κ = N/T. Note that the eigenvalue density is normalized to unity. Equation (10) is valid for CWOE with arbitrary correlations and the structure of the correlation matrix enters via its eigenvalues Λ (1 ≤ i ≤ N). Equation (10) is another version of a classical result[33-36].

Numerical Results

For the random selections, we have two choices: (a) ‘Non-Singular Random Selection Ensemble’ (NSRSE) in which a given time series can appear many times but at most once in the construction of any correlation matrix to avoid singularities. As mentioned above, we will have binomially many choices but the members of the ensemble are not entirely independent. We will usually not have N and T very large, but even so we will find that the behavior of the bulk is not significantly affected although the outliers are. Alternatively, for sufficiently large N and T, we could use a random matrix model ‘Exclusive Random Selection Ensemble’ (ERSE) that constructs an ensemble that excludes any repetition of time series in its construction. We can use this ensemble to calculate the expectation values of the quantities we are interested in and average those over all or a subset of possible selections. In this case, we expect to a large extent coincidence with correlated Wishart ensembles but the procedure is rather complicated and we will thus focus on the first choice namely NSRSE. We now proceed to analyze two special cases. First, we consider the case of constant correlations where we, in addition to the spectrum bulk, have an outlier that should be described. Here, we also consider the case of zero correlations i.e. uncorrelated time series, where we reproduce the Marčenko-Pastur distribution[33,37]. Then, we proceed to the block structure which we illustrate by using two blocks of time series which have constant internal correlation and relatively small correlation between the two blocks. Note that our results for NSRSE need not agree with theory for Wishart matrices because starting with a single representative of this ensemble in the large space, we select the smaller matrices from that space and repetitions of the selection will turn out to be important. We compare the distribution of outliers for NSRSE with ERSE in terms of the first four moments.

Correlated Non-Singular Random Selection Ensemble With Constant Linear Correlations

We consider correlated NSRSE with constant linear correlations defined by ; υ being the correlation coefficient. NSRSE of correlation matrices will then be obtained from the data set of correlated white noise time series by selecting m time series in L samples from the possible selections. The corresponding eigenvalues will be obtained numerically below and compared to the solution of the polynomial equation in Equation (10). The parameters used in the calculations are L = 5000, N = 1000, κ = 10 and a = 0.9. We choose constant linear correlations defined by υ = 0, 0.1, 0.5 and 0.9. For Monte-Carlo simulations, we start with a singular data matrix of dimension 1000 × 100 (κ = 10). One can normalize these 1000 time series in two ways: (1) by rescaling each time series by its respective mean and standard deviation (micro-canonical normalization) and (2) by rescaling all the time series by their average mean and average standard deviation (canonical normalization). Then, by randomly selecting the rows of this data matrix as explained above, we construct a 5000 member ensemble of 90 × 100 (a = 0.9) data matrices (κ = 0.9). Using these, we construct the L = 5000 members of NSRSE and diagonalize these to obtain the eigenvalues. The choice c = 0 corresponds to uncorrelated NSRSE and average eigenvalues for . Using these in Equation (10) results in a quadratic equation which can be solved analytically to obtainHere, and define the spectral support of the eigenvalue density. This describes the distribution of non-zero eigenvalues for WOE in the limit with fixed κ. Hence, in order to be able to compare with the Marčenko-Pastur distribution and numerics, one needs to re-scale the variables as and in Equation (10). We compare numerical NSRSE eigenvalue densities with the analytical result given by Equation (12) in Fig. 1. In Fig. 1(a), we show the numerical histogram for the 1000 eigenvalues of the correlation matrix corresponding to the initial 1000 × 100 data matrix obtained using microcanonical normalization and similarly for canonical normalization in Fig. 1(b). The spectral bounds are in agreement with the solid curve obtained using Equation (12). However, as we have a single copy of correlation matrix, there are a lot of fluctuations in numerics. We do not find any significant differences between microcanonical and canonical normalizations for NSRSE. Then, we apply ensemble technique and eigenvalue histograms for microcanonical and canonical normalizations respectively are shown in Fig. 1(c,d). The agreement with the solid curves obtained using Equation (12) is excellent. Again, we do not observe any significant differences in the microcanonical and canonical normalizations for NSRSE using the ensemble technique.

Figure 1

Density of non-zero eigenvalues of a singular correlation matrix C obtained from a data matrix A of dimension 1000 × 100; κ = 10 with (a) micro-canonical and (c) canonical normalization. Ensemble averaged eigenvalue density for a 5000 member NSRSE of correlation matrices constructed using 0.97T × T (κ = 0.9) dimensional A matrices with (b) micro-canonical and (d) canonical normalization. Numerical results are histograms and solid curves are obtained from Equation (12). The numerical histograms obtained for correlated NSRSE with constant linear correlations defined by , 0.5 and 0.9 are shown respectively in Fig. 2(a–c). The solid histograms correspond to microcanonical normalization and empty histograms correspond to canonical normalization. The solid curves are obtained by numerically solving Equation (10) with for and (a third order polynomial equation). Insets in each of these pictures show the distribution of the outlier Λ. The agreement of the polynomial equation solution in the bulk of the spectrum is excellent except for small deviations in the tails with increasing correlation coefficient υ. Notice the increasing difference between the bulk and the outlier along with shrinking of spectral bounds for the bulk distribution with increasing υ. The histograms for microcanonical and canonical normalizations are similar for the bulk distribution while there are differences in outliers noticeable with increasing υ.

Figure 2

Ensemble averaged eigenvalue density for a 5000 member ensemble of 90 × 90 dimensional correlated NSRSE matrices with constant linear correlations defined by (a) υ = 0.1, (b) υ = 0.5 and (c) υ = 0.9. Here, κ = 10. Numerical results are histograms and solid curves are obtained from Equation (10). The solid histograms correspond to microcanonical normalization and empty histograms correspond to canonical normalization. Insets give the distribution of the outlier. The shape of the farthest peak (outlier) is Gaussian for the numerical histograms whereas it resembles a semicircle for the respective solutions from the polynomial equation. The saddle point approximation must be good where many peaks overlap. It must be worse for individual peaks (outliers). But as seen from Fig. 2, the saddle point approximation reproduces the position of the outliers not too far from reality. However, it cannot reproduce the shape of the peaks. In the saddle point approximation, the bulk of the spectrum is order N correction and if the outlier is far away from the bulk, it is only order 1 correction term. The exact problem is highly complex and one cannot expect to get all the features by a simple polynomial equation. It is now well established that the distribution of the largest eigenvalue separated from the bulk for a correlated covariance matrix converges to a Gaussian distribution[38]. As ERSE should produce results close to Wishart ensembles, we compare the largest eigenvalue distributions for NSRSE and ERSE in Fig. 3. The corresponding moments are given in Table 1. As can be seen from these results, the largest eigenvalue distributions for NSRSE are also Gaussian, however the moments are different. Thus, the repetition of time series in the construction of NSRSE strongly affects the outliers.

Figure 3

Probability distribution of the outliers for NSRSE and ERSE. The largest eigenvalues are normalized with respect to their centroids (μ) and widths (σ) i.e. . The solid histograms correspond to NSRSE and the empty histograms correspond to ERSE. Corresponding first four moments are given in Table 1.

Table 1

Moments for outliers with constant correlations. All values are listed as (NSRSE/ERSE).

Case		Mean	Width	Skewness	Kurtosis
Largest eigenvalue, correlated NSRSE/ERSE	υ = 0.1	(10.69/11.94)	(1.23/0.61)	(0.21/0.05)	(−0.12/0.05)
	υ = 0.5	(45.53/48.90)	(3.18/0.93)	(−0.10/0.01)	(−0.12/0.12)
	υ = 0.9	(80.97/82.19)	(1.17/0.53)	(−0.44/−0.01)	(0.32/0.13)
Largest eigenvalue, block NSRSE/ERSE	υ₁ = 0.1, υ₂ = 0.1	(8.96/10.27)	(1.03/0.53)	(0.25/0.08)	(0.03/−0.03)
	υ₁ = 0.1, υ₂ = 0.5	(36.66/41.07)	(3.09/0.79)	(−0.04/0.03)	(−0.07/−0.04)
	υ₁ = 0.5, υ₂ = 0.1	(10.83/10.85)	(1.05/0.45)	(0.31/0.16)	(0.13/−0.03)
	υ₁ = 0.5, υ₂ = 0.5	(36.71/40.92)	(3.12/0.79)	(−0.04/0.03)	(−0.04/−0.03)
Second largest eigenvalue, block NSRSE/ERSE	υ₁ = 0.1, υ₂ = 0.1	(3.87/3.87)	(0.32/0.28)	(0.47/0.30)	(0.16/−0.02)
	υ₁ = 0.1, υ₂ = 0.1	(3.41/3.21)	(0.41/0.28)	(0.36/0.22)	(0.27/0.01)
	υ₂ = 0.5, υ₂ = 0.1	(8.3/9.71)	(0.85/0.44)	(0.06/−0.04)	(0.04/−0.06)
	υ₁ = 0.5, υ₂ = 0.5	(9.91/9.48)	(1.31/0.42)	(0.29/0.08)	(0.15/0.003)

Block Non-Singular Random Selection Ensemble

As is usual in financial market analysis, one deals with approximate block matrices where each block represents a sector. For instance, energy, utility and technology are a few sectors in stocks. Inspired by this, we consider a simple 2 × 2 model for a two sector NSRSE model. The corresponding data matrix has the structure with A1 and A2 representing data matrix in each sector with respective dimensions N1 × T and N2 × T; . In each sector, we consider constant linear correlations with correlation coefficients υ1 and υ2. For numerics, we construct a L = 5000 member block NSRSE with N = 1000, κ = 10, , and . To generate the ensemble, for each member, random selections of time series from the given A matrix are done depending on the weights (say, these are p1 and p2): and . These are the number of time series randomly chosen from each sector. One can also make the random permutations without any weights. This does not affect the structure of the correlation matrices in NSRSE. Figure 4 shows the structure of ensemble averaged correlation matrices constructed using canonical normalization with (a) ; (b) , ; (c) , and (d) . Here random permutations were carried out without any weights. Thus, the block structure remains intact even without the weighted random permutations. This is obvious as the χ matrix is invariant under permutations.

Figure 4

Ensemble averaged non-singular block correlation matrices constructed using 90 × 100 dimensional data matrices A with constant block correlations: (a) (b) , ; (c) , and (d) .

Ensemble averaged non-singular block correlation matrices constructed using 90 × 100 dimensional data matrices A with constant block correlations: (a) (b) , ; (c) , and (d) . In Fig. 5, we compare the eigenvalue histograms (solid ones corresponding to microcanonical normalization and empty ones corresponding to canonical normalization) of block NSRSE for (a) ; (b) , ; (c) , , (d) with the solid curve obtained using Equation (10) with for , , for , (fifth order polynomial equation). We find good agreement in the bulk distributions with deviations in the tails for larger correlation coefficients υ1 and υ2. Insets show the distributions of the two outliers ( and ). It can be single peaked, overlapping peaks or double peaked as the positions depend on correlation coefficients υ1 and υ2. The choice of normalization generates differences in the distribution of outliers. The saddle point approximation gives the approximate positions of the peaks but not the shape.

Figure 5

Ensemble averaged eigenvalue density for a 5000 member block NSRSE of correlation matrices constructed using 0.9T × T (κ = 0.9) dimensional A matrices with constant block correlations defined by (a) (b) , (c) , and (d) . Numerical results are histograms and solid curves are obtained from Equation (10). The solid histograms correspond to microcanonical normalization and empty histograms correspond to canonical normalization. Insets give the distribution of the outliers. We compare the distributions of the outliers (largest and second largest eigenvalues) for NSRSE with those corresponding to ERSE in Fig. 6. The corresponding moments are given in Table 1. The distributions of outliers separated from the bulk are well approximated by Gaussians for both NSRSE and ERSE while the moments are different. The convergence to Gaussian distribution also depends on the separation of the outliers from the bulk distribution. Thus, the repetition of time series in the construction of block NSRSE strongly affects the outliers.

Figure 6

Probability distribution of the outliers for NSRSE and ERSE. Upper panels [(a–d)] gives the distribution of largest eigenvalue for block correlation matrices constructed using 90 × 100 dimensional data matrices A with constant block correlations: (a) (b) , (c) , and (d) . Similarly, the lower panels [(e–h)] gives the distribution of second largest eigenvalue. In the upper panel, the largest eigenvalues are normalized with respect to their centroids (μ) and widths (σ) i.e. . Similarly, with the corresponding μ and σ in the lower panel. The solid histograms correspond to NSRSE and the empty histograms correspond to ERSE. Corresponding first four moments are given in Table 1.

Conclusions and Outlook

We have presented an entirely new way to treat large numbers of short time series pertaining to the same system and therefore likely to display some correlation. Basically the proposition consists in dividing the entire set of time series in different ways, thus obtaining the Non-Singular Random Selection Ensemble from the data. This allows to obtain a spectral distribution for an ensemble of correlation or covariance matrices and also to get distributions of particular eigenvalues, importantly the largest or the smallest one. Using our technique, we obtain a large set of eigenvalues for a singular data matrix and thus, get information about the bulk eigenvalue distribution along with the outliers, which is otherwise not possible. It also allows analysis of two and three point functions which is impossible for a small set (total number T with T ≪ N) of non-zero eigenvalues for a single data matrix. The same will hold for eigenfunctions. If the matrix elements are not Gaussian distributed, the shapes and moments of distributions of eigenvalues will change with the strength of the correlation coefficient and there will be effects in the tails of the distributions. We expect that our technique will be sensitive to tails of distributions of eigenvalues as the probability of randomly selecting the outliers increases with our technique. Finally we may note that selection of such subsets gives valuable insight into the consequences of having incomplete sets of time series. This is illustrated in Ising model calculations[14] where random selections of subsets of very long time series were used to test the effect of having incomplete measurements, which showed that a power law remaines visible using only 10% of the time series associated with the lattice. This opens a new alley for investigations of systems that are not stationary on longer time scales but quasi-stationary on a short time scale as defined by the length of the epochs we choose. We can then study the temporal evolution of such an ensemble. This in turn may give hints to instabilities emerging in the system which might be sufficiently strong to be used to give an early warning. The next step will be to show how such an ensemble behaves, when at or near a critical transition. At this point we are studying this in financial markets and in two dynamical systems, namely the TASEP[6] and the 2-D Ising model near criticality[21]. The range of potential applications is very wide and in the present paper we have performed the first tests using correlated random matrices as a model where analytic results are available. The case we generically discuss is a set of time series, which are strongly correlated within each of two subsets leading to a block structure in the correlation matrix. This is a toy model for financial markets with its traditional division into market sectors. Preliminary results on financial markets can be viewed in a master thesis[39] and further work in this direction is in progress.

9 in total

1 in total

1. Instability of networks: effects of sampling frequency and extreme fluctuations in financial data.

Authors: Jalshayin Bhachech; Arnab Chakrabarti; Taisei Kaizoji; Anindya S Chakrabarti
Journal: Eur Phys J B Date: 2022-04-25 Impact factor: 1.398

1 in total

Multivariate analysis of short time series in terms of ensembles of correlation matrices.

Introduction

Construction of the Ensemble

Supersymmetry Approach

Numerical Results

Correlated Non-Singular Random Selection Ensemble With Constant Linear Correlations

Block Non-Singular Random Selection Ensemble

Conclusions and Outlook

1. Fourier law in the alternate-mass hard-core potential chain.

2. Eigenvalue density of correlated complex random Wishart matrices.

3. Correlated Wishart ensembles and chaotic time series.

4. Spectral moments of correlated Wishart matrices.

5. Emerging spectra of singular correlation matrices under small power-map deformations.

6. Open XXZ spin chain: nonequilibrium steady state and a strict bound on ballistic transport.

7. Eigenvalue densities of real and complex Wishart correlation matrices.

8. The Bhopal disaster and its aftermath: a review.

9. Rich structure in the correlation matrix spectra in non-equilibrium steady states.

1. Instability of networks: effects of sampling frequency and extreme fluctuations in financial data.