| Literature DB >> 17299590 |
Abstract
Time series of Circulation Weather Type (CWT), including daily averaged wind direction and vorticity, are self-classified by similarity using Kohonen Neural Networks (KNN). It is shown that KNN is able to map by similarity all 7300 five-day CWT sequences during the period of 1975-94, in London, United Kingdom. It gives, as a first result, the most probable wind sequences preceding each one of the 27 CWT Lamb classes in that period. Inversely, as a second result, the observed diffuse correlation between both five-day CWT sequences and the CWT of the 6(th) day, in the long 20-year period, can be generalized to predict the last from the previous CWT sequence in a different test period, like 1995, as both time series are similar. Although the average prediction error is comparable to that obtained by forecasting standard methods, the KNN approach gives complementary results, as they depend only on an objective classification of observed CWT data, without any model assumption. The 27 CWT of the Lamb Catalogue were coded with binary three-dimensional vectors, pointing to faces, edges and vertex of a "wind-cube," so that similar CWT vectors were close.Entities:
Mesh:
Year: 2007 PMID: 17299590 PMCID: PMC1790699 DOI: 10.1371/journal.pone.0000210
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Distribution of the 27 LC wind classes in London, along 50 years.
| A | ANE | AE | ASE | AS | ASW | AW | ANW | AN |
| 3644 | 151 | 165 | 185 | 320 | 530 | 551 | 364 | 240 |
| UND | NE | E | SE | S | SW | W | NW | N |
| 198 | 323 | 326 | 497 | 1010 | 1641 | 1914 | 1076 | 657 |
| C | CNE | CE | CSE | CS | CSW | CW | CNW | CN |
| 2343 | 103 | 110 | 198 | 307 | 451 | 433 | 339 | 187 |
The first column is for null wind, the first and second rows are for anticyclone (A) vorticity, the third and fourth for undefined vorticity and the fifth and sixth for cyclone (C). The rest of acronyms are as conventional, N: North, S: South, E: East, W: West, and UND: total undefined for wind direction and vorticity.
LC wind coincidences (coinc) for two days separated by an interval of interv days, δ(coinc) is the difference between consecutive number of coincidences.
|
| 10 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 |
|
| 1839 | 1864 | 1966 | 2072 | 2138 | 2322 | 2707 | 3059 | 3738 | 5330 |
|
| 25 | 102 | 106 | 66 | 184 | 385 | 352 | 679 | 1592 |
Description of the 27 Lamb Codes and their 3D vector codifications used in this work.
| 0 | A | (1 0 0) | 10 | UND | (0 0 0) | 20 | C | (−1 0 0) |
| 1 | ANE | (1 1 1) | 11 | NE | (0 1 1) | 21 | CNE | (−1 1 1) |
| 2 | AE | (1 0 1) | 12 | E | (0 0 1) | 22 | CE | (−1 0 1) |
| 3 | ASE | (1-1 1) | 13 | SE | (0-1 1) | 23 | CSE | (−1-1 1) |
| 4 | AS | (1-1 0) | 14 | S | (0-1 0) | 24 | CS | (−1-1 0) |
| 5 | ASW | (1-1-1) | 15 | SW | (0-1-1) | 25 | CSW | (−1-1-1) |
| 6 | AW | (1 0-1) | 16 | W | (0 0-1) | 26 | CW | (−1 0-1) |
| 7 | ANW | (1 1-1) | 17 | NW | (0 1-1) | 27 | CNW | (−1 1-1) |
| 8 | AN | (1 1 0) | 18 | N | (0 1 0) | 28 | CN | (−1 1 0) |
The three groups, in columns, show the 9 classes (in rows) for anticyclone circulation (A), undefined circulation, and cyclone circulation (C). In each group the first column is the Lamb Code, the second column is the wind direction and the third column is our 3D vector codification (pointing to the surface of the winds cube shown in Figure 1). The classes of the first row correspond to zero daily average wind and in successive rows, rotating clockwise, are the eight conventional directions of the wind from North-East to North. The Lamb Code 10 is for winds of undetermined vorticity and direction.
Figure 1Winds cube representing the 27 Lamb Classes of CWT, including wind direction and vorticity, codified by 3D binary vectors BLC. The origin (000) represents the total UNDefined wind.
Estimation of the CWT data distribution having into account their BLC codified vector dispersion on 3D space.
| N (days) | <BLC> | <σ> | <dist> | <α> |
| 7300 (random) | 0.00,−0.01,−0.00 | 0.82 | 1.87 | 90.01 |
| 7300 (75_94) | 0.09,−0.09,−0.29 | 0.69 | 1.56 | 86.57 |
| 360 (1995) | 0.12,−0.04,−0.23 | 0.70 | 1.57 | 87.64 |
First row for a randomly generated BLC set of vectors, second and third rows for observed BLC data. In the second column are the components of the average vector
Figure 2The 50×50 Kohonen maps of the unsupervised clustering (left) and of the supervised clustering (right) of the 5BLC vectors during 1992–93. For clarity reason, only the labels of the most populated LC classes A (black), C (grey) and W (white), are shown. Self-aggregations of equal LC6 labels are amplified by supervising. A given LC spreads on some clusters each one corresponding to a cluster of similar 5LC sequences.
Figure 3Neighbourhood matrix of LC6 classes, with rounded values of VN3(i,j)×100, for the non supervised Kohonen map in the period 1975–94, where W has been smoothed three times. The non-zero values are coloured to show the closer classes on the map. In the first column are the aggregation factors for each LC, in the second column the LC6 numeration of Table 3.
Figure 4Neighbourhood matrix of LC6 classes, calculated as that of Figure 3, but for the Kohonen map after supervising the training with the CWT of the 6th days.
The three most probable sequences of five daily winds, preceding each LC wind class.
| A | A | A | A | A | A, | AW | A | A | A | A, | ASW | SW | AW | ANW | AS |
| ANE | A | A | A | A | AE, | A | A | SE | E | A, | W | CNW | C | CNE | NE |
| AE | A | A | A | A | A, | AE | ANE | E | E | NE, | E | E | E | A | NE |
| ASE | A | S | AS | AS | ASE, | ASE | SE | SE | ASE | ASE, | S | A | ASE | SE | E |
| AS | A | A | A | A | A, | A | A | A | A | AS, | C | CS | C | C | W |
| ASW | C | C | C | CSW | CS, | ANW | ANW | ASW | C | CSW, | CSW | C | AW | CSW | SW |
| AW | CN | AW | CW | CW | AW, | A | W | AW | A | ASW, | S | S | CSW | SW | W |
| ANW | W | CW | W | C | NW, | AW | ANW | AW | NW | ANW, | AS | A | A | A | A |
| AN | A | SW | CSW | C | NE, | C | C | CNW | E | CNE, | A | A | A | A | N |
| UND | A | A | A | A | A, | NE | NE | N | UND | C, | CSW | W | C | C | NW |
| NE | A | A | AN | NE | NE, | W | SW | CNW | CNW | NE, | S | C | C | CNE | NE |
| E | E | NE | NE | AE | E, | A | A | A | AE | E, | SE | SE | SE | SE | SE |
| SE | A | S | SE | S | SE, | S | S | S | SE | SE, | SW | SW | ASW | A | S |
| S | S | S | S | CS | S, | AN | AS | S | S | S, | A | A | A | CS | ASW |
| SW | ASW | SW | W | CSW | NW, | W | W | SW | AW | ASW, | CW | W | SW | CSW | SW |
| W | W | CW | W | W | CW, | A | A | A | ANW | NW, | S | SW | W | S | SW |
| NW | SW | CSW | W | W | CW, | N | NW | ANW | NW | NW, | AS | ANE | A | A | NW |
| N | NW | NW | NW | NW | CNW, | NW | N | N | N | CN, | W | NW | W | C | CNW |
| C | C | C | C | C | C, | AW | C | C | CSW | CSE, | S | SW | C | C | CSW |
| CNE | ASW | SW | CSW | C | C, | E | E | UND | ANE | CSE, | C | C | C | C | CE |
| CE | SE | SE | SE | NE | CNE, | SE | SE | SE | E | CE, | CSE | C | C | CE | CE |
| CSE | SE | A | A | S | CSE, | S | CSE | C | CE | SE, | S | SE | SE | SE | CSE |
| CS | C | C | SW | S | CS, | CS | SW | S | C | CSW, | NE | AE | CN | CN | CS |
| CSW | ANW | SW | SW | SW | S, | AE | SE | C | C | C, | ASW | CSW | CS | C | SW |
| CW | CNW | SW | NW | W | W, | C | C | CS | CS | CW, | A | A | SW | SW | C |
| CNW | NE | CE | C | CS | C, | SW | W | CSW | NW | NW, | C | NW | NW | NW | W |
| CN | ANW | ANW | ANW | W | C, | CW | W | CW | W | C, | ANE | ANE | AN | N | NW |
In the first column on the left are the 6th day LC classes, then, the previous five wind sequences separated by commas, from the most (left) to the least probable sequence (right). For each sequence, from left to right the observed average winds in the 5th, 4th, 3rd, 2nd and 1st previous days. These winds have been calculated from the smoothed learned matrix of a KNN, after a supervised training with the 7300 5BLC/BLC6 vector pairs in the period 1975_94.
Distribution of the 27 LC's of table 3, in row A, among different periods, LC = 10 for completely undefined wind is omitted.
| A | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 |
| B | 134 | 3 | 7 | 3 | 8 | 21 | 27 | 17 | 11 | 17 | 15 | 20 | 33 | 78 | 85 | 40 | 24 | 87 | 5 | 4 | 5 | 10 | 23 | 17 | 16 | 8 |
| C1 | 1482 | 53 | 62 | 69 | 118 | 185 | 214 | 147 | 97 | 137 | 123 | 193 | 401 | 672 | 779 | 416 | 282 | 974 | 44 | 35 | 66 | 124 | 182 | 157 | 134 | 80 |
| C2 | 0.28 | 0.89 | 0.61 | 0.86 | 0.67 | 0.73 | 0.65 | 0.79 | 0.71 | 0.69 | 0.55 | 0.54 | 0.52 | 0.44 | 0.39 | 0.52 | 0.53 | 0.31 | 0.84 | 0.69 | 0.84 | 0.71 | 0.75 | 0.66 | 0.77 | 0.68 |
| D1 | 75 | 5 | 6 | 4 | 4 | 8 | 12 | 10 | 2 | 9 | 6 | 12 | 20 | 26 | 31 | 24 | 18 | 46 | 1 | 3 | 4 | 3 | 6 | 14 | 2 | 3 |
| D2 | 0.80 | 1.58 | 1.04 | 1.86 | 1.63 | 1.36 | 1.10 | 1.39 | 1.60 | 1.32 | 0.98 | 1.50 | 1.05 | 1.16 | 0.88 | 1.34 | 1.15 | 1.03 | 1.16 | 1.59 | 2.03 | 0.68 | 1.77 | 1.07 | 1.34 | 1.23 |
| D3 | 0.71 | 1.30 | 0.96 | 1.80 | 1.25 | 1.32 | 1.17 | 1.55 | 1.86 | 1.03 | 1.01 | 1.29 | 0.94 | 1.19 | 0.88 | 1.08 | 1.12 | 0.77 | 1.01 | 1.31 | 1.83 | 1.41 | 1.41 | 1.19 | 1.38 | 1.07 |
In row B the LC distribution in 1992–93. In row C1 the LC distribution in 1975–94, and in C2 the average ‘prediction’ errors for LC6 vectors after a supervised training. In row D1 the LC distribution in 1995, in D2 the average prediction errors for LC6 vectors after supervised (with the 6th day) training of 1975–94, and in D3 after supervised (with the 5th day) training. All errors have been calculated with the smoothed matrices.
Wind (BLC6) prediction from the previous (5BLC) five days data wind by Kohonen Neural Networks.
| sup | sx | coin | δV0 | nw-w | δα | δV1 | nw-w | δα | δV2 | nw-w | δα | < | < |
| |
| 75–94 | 6 | 0 | 7136 | 7180 | 26 | 0.1 | 46 | 0 | 1 | 0 | 0.29 | 0.09 | 0.01 | ||
| 6 | 3 | 4769 | 6365 | 713 | 7 | 861 | 77 | 12 | 0 | 1.22 | 0.48 | 0.37 | |||
| 1995 | 6 | 0 | 62 | 157 | 47 | 33 | 162 | 79 | 50 | 35 | 17 | 45 | 0.98 | 1.24 | 1.30 |
| 6 | 3 | 70 | 168 | 35 | 33 | 155 | 69 | 50 | 31 | 15 | 49 | 1.29 | 1.10 | 1.24 | |
| 1995 | 5 | 0 | 106 | 189 | 37 | 19 | 147 | 70 | 33 | 18 | 4 | 30 | 0.97 | 0.98 | 1.00 |
| 5 | 3 | 92 | 175 | 35 | 21 | 159 | 65 | 33 | 20 | 9 | 28 | 1.27 | 1.01 | 1.06 |
After the row of variable labels, the first and second rows are for the KNN training along 10000 epochs, in the learning period 1975–94 (7300 days), supervised by the winds of the 6th days. The third and fourth rows are for the 6th day wind (real) prediction in 1995 by the above training. The fifth and sixth rows are for the 6th day wind (real) prediction in 1995 by similar training but supervised by the winds of the 5th days. The supervising day is indicated by sup, sx is the times the learned matrix has been smoothed with sm = 2/3, coin is the number of total (wind direction plus vorticity) coincidences between observed and predicted winds. Next columns to the right are, the first set (δV0 nw-w δα) for the number of days, δV0, having the same vorticity for the observed and the predicted winds, the second set when the vorticity difference is 1, δV1, and the third set when the difference is 2, δV2, (A to C or C to A). The other two columns of each set are the number of days changing from no-wind to wind (nw-w), or vice versa, and the average angle between the observed and predicted winds (δα). The last three columns are the average localization error for the 5BLC vectors <δ_loc>, the prediction error for the BLC6 vectors <δ_pred>, and the same error but approaching the matrix components to 1, 0 or −1, to get the closest possible observed vector <δ_pred >1.
Above, the probable situation of an Anticyclone or Cyclone, with respect to a point P receiving their wind. Below, as an example, the probable evolution of the 2D circulation map previous to a day with CWT = AN
| Wind on P coming from | N | NE | E | SE | S | SW | W | NW |
| A situation with respect to P | W | NW | N | NE | E | SE | S | SW |
| C situation with respect to P | E | SE | S | SW | W | NW | N | NE |
| Day before | 5 | 4 | 3 | 2 | 1 | 0 | ||
| Situation of A or C with respect to P | A(on) | A(SE) C(NW) | C(NW) | C(on) | A(NW) C(SE) | A(W) |
Figure 5Scheme of the Kohonen NN. Each input vector 5BLC(k), on the left, activates every point of the 2D Kohonen map KOHM(i,j) by the 3D synapses matrix W(i,j,k). The values of W(i,j,k) are trained to cluster the input vectors in the self classified 2D Kohonen map.