| Literature DB >> 28580940 |
A Sanchez-Gonzalez1, P Micaelli1, C Olivier1, T R Barillot1, M Ilchen2,3, A A Lutman4, A Marinelli4, T Maxwell4, A Achner3, M Agåker5, N Berrah6, C Bostedt4,7, J D Bozek8, J Buck9, P H Bucksbaum2,10, S Carron Montero4,11, B Cooper1, J P Cryan2, M Dong5, R Feifel12, L J Frasinski1, H Fukuzawa13, A Galler3, G Hartmann9,14, N Hartmann4, W Helml4,15, A S Johnson1, A Knie14, A O Lindahl2,12, J Liu3, K Motomura13, M Mucke5, C O'Grady4, J-E Rubensson5, E R Simpson1, R J Squibb12, C Såthe16, K Ueda13, M Vacher17,18, D J Walke1, V Zhaunerchyk12, R N Coffee4, J P Marangos1.
Abstract
Free-electron lasers providing ultra-short high-brightness pulses of X-ray radiation have great potential for a wide impact on science, and are a critical element for unravelling the structural dynamics of matter. To fully harness this potential, we must accurately know the X-ray properties: intensity, spectrum and temporal profile. Owing to the inherent fluctuations in free-electron lasers, this mandates a full characterization of the properties for each and every pulse. While diagnostics of these properties exist, they are often invasive and many cannot operate at a high-repetition rate. Here, we present a technique for circumventing this limitation. Employing a machine learning strategy, we can accurately predict X-ray properties for every shot using only parameters that are easily recorded at high-repetition rate, by training a model on a small set of fully diagnosed pulses. This opens the door to fully realizing the promise of next-generation high-repetition rate X-ray lasers.Entities:
Year: 2017 PMID: 28580940 PMCID: PMC5465316 DOI: 10.1038/ncomms15461
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1Machine learning technique.
Schematic technique based on machine learning to predict complex diagnostics at a high repetition rate using a fraction of fully diagnosed events containing all the information obtained at a much lower repetition rate. Information from fast diagnostics is available for all the events, but information from the complex diagnostics is only available for a small fraction of the events. The set of fully diagnosed events is divided into different subsets: the training set, the validation set and the test set. The training set is used to train a machine learning model on how to predict the information obtained with complex diagnostics using the simple diagnostics as input. The validation set is used to optimize the training process by minimizing the prediction errors on that set. The final prediction error for the optimized model is calculated using data from the test set. Once the final optimized model is trained and tested, it can be used to predict the missing information from the complex diagnostics for the remainder of the events.
Summary of the results.
| Mean error of single-pulse photon energy (eV) | 5.62 | 0.29 (0.28) | 0.30 (0.24) | 0.32 (0.27) | 0.30 (0.29) |
| Shape agreement of single-pulse spectrum | 67% | 88% (88%) | 94% (95%) | 95% (95%) | 97% (97%) |
| Mean error of double-pulse delay (fs) | 6.82 | 2.07 (2.04) | 1.67 (1.58) | 1.67 (1.57) | 1.59 (1.52) |
| Mean error of double-pulse photon energy (eV) | Pulse 1: 1.45 | 0.47 (0.49) | 0.49 (0.48) | 0.50 (0.48) | 0.46 (0.47) |
| Pulse 2: 1.03 | 0.44 (0.44) | 0.41 (0.39) | 0.41 (0.39) | 0.40 (0.40) |
Mean absolute error or agreement of the different prediction examples obtained from each of the four models. The first column shows the mean error from the average of the initial distribution. In the case of shape agreement, this value corresponds to the mean agreement between each of the single-shot spectra and the mean spectrum. The values for each of the models correspond to the predictions on the test set, while the numbers in brackets correspond to the training set.
Figure 2Photon energy prediction for a single pulse.
(a) Two samples of single-shot spectra at two different photon energies measured with the optical spectrometer (light red, light blue) and the corresponding Gaussian fits (thick red, thick blue). (b) Distribution of the measured photon energies for the dataset. Mean error of distribution: 5.6 eV. (c) Measured photon energies compared to the predicted photon energies for the test set using a linear model. Experimental points are shown in blue. The perfect correlation line is included for reference as a black dashed line. Mean error of predictions: 0.29 eV.
Figure 3Spectral shape prediction for a single pulse.
(a) Distribution of agreements between the predicted and the measured spectra for the test set using the four different models. SVR: Support vector regressor. ANN: Artificial neural network. (b–e) Examples of the measured (blue) and the predicted (red) spectra using an ANN to illustrate the accuracy for different agreement values. The four examples have been chosen by picking from the entire test set the events with agreement values closest to 96%, 97%, 98% and 99%, respectively.
Figure 4Pump-probe time delay prediction.
(a,b) Examples of the X-band transverse deflecting cavity (XTCAV) traces used to extract the delay values. The delay values are calculated by finding the lasing part of each electron bunch (black and red vertical dashed lines for the high-energy bunch and low-energy bunch, respectively) and subtracting the values. (c) Distribution of all the delay values for the dataset. Mean error of distribution: 6.8 fs. (d–g) Delay prediction errors for the test set using each of the four models. Experimental points are shown in blue. The perfect correlation lines are included for reference as black dashed lines. Mean error of predictions: 2.07, 1.67, 1.67 and 1.59 fs, respectively. (h) Delay prediction learning curve showing the mean error for the validation set (solid lines), and the training set (dashed lines) for each of the four models as function of the number of samples used for training. ANN: Artificial neural network. SVR: Support vector regressor.
Figure 5Photon energy prediction in a double-pulse mode.
(a) A sample of a double-pulse spectrum measured with the electron time-of-flight spectrometer (light red) and the corresponding double Gaussian fit (thick red). (b,c) Measured photon energies of each of the pulses compared to the predicted photon energies for the test set using an artificial neural network. Experimental points are shown in blue. The perfect correlation lines are included for reference as black dashed lines. Mean error of predictions: 0.46 and 0.40 eV, respectively. Mean error of initial distributions: 1.45 and 1.03 eV, respectively.