Skip to main content

ORIGINAL RESEARCH article

Front. Phys., 22 July 2022
Sec. Interdisciplinary Physics
This article is part of the Research Topic Application of Artificial Intelligence and Machine Learning to Accelerators View all 10 articles

Mixed Diagnostics for Longitudinal Properties of Electron Bunches in a Free-Electron Laser

  • Deutsches Elektronen-Synchrotron DESY, Hamburg, Germany

Longitudinal properties of electron bunches are critical for the performance of a wide range of scientific facilities. In a free-electron laser, for example, the existing diagnostics only provide very limited longitudinal information of the electron bunch during online tuning and optimization. We leverage the power of artificial intelligence to build a neural network model using experimental data, in order to bring the destructive longitudinal phase space (LPS) diagnostics online virtually and improve the existing current profile online diagnostics which uses a coherent transition radiation (CTR) spectrometer. The model can also serve as a digital twin of the real machine on which algorithms can be tested efficiently and effectively. We demonstrate at the FLASH facility that the encoder-decoder model with more than one decoder can make highly accurate predictions of megapixel LPS images and coherent transition radiation spectra concurrently for electron bunches in a bunch train with broad ranges of LPS shapes and peak currents, which are obtained by scanning all the major control knobs for LPS manipulation. Furthermore, we propose a way to significantly improve the CTR spectrometer online measurement by combining the predicted and measured spectra. Our work showcases how to combine virtual and real diagnostics in order to provide heterogeneous and reliable mixed diagnostics for scientific facilities.

1 Introduction

Tuning and optimization of the longitudinal phase space (LPS) of electron bunches are of vital importance for the performance of various scientific facilities such as free-electron lasers (FELs) [1], ultrafast electron diffractions (UEDs) [2], laser-plasma accelerators (LPAs) [3], plasma wakefield accelerators (PWFAs) [4], THz-driven accelerators [5] and so on. The prerequisite of quickly and accurately manipulating the LPS of an electron bunch is being able to measure and monitor it rapidly and reliably. The LPS of an electron bunch is usually measured directly in the time domain by combining a transverse deflecting structure (TDS) and a dipole spectrometer magnet in a dispersive section [6, 7]. An LPS image provides rich and important information such as the shape, the current profile and the slice energy spread of the electron bunch. However, this diagnostic method interferes with delivering photons to user experiments and thus cannot be employed online during machine tuning and optimization. The current profile of an electron bunch can also be reconstructed by measuring the coherent transition radiation (CTR) [8, 9] or the coherent diffraction radiation (CDR) [10] spectrum generated by an electron bunch. Although CTR generation is invasive to an electron bunch, it works as an online diagnostics at facilities such as FLASH, in which a single bunch from a bunch train can be selected for this purpose without interfering with user experiments [8]. On the other hand, physics-based beam dynamic simulation plays an important role in understanding the LPS of an electron bunch. However, high-resolution physics-based simulation is extremely time-consuming [11] and often does not agree with the measurement very well.

In recent years, machine learning has demonstrated to be able to learn relationships inside a complex system and produce accurate and fast predictions [12, 13]. Using artificial neural networks as a tool for electron bunch longitudinal property prediction has garnered more and more attention in recent years [1418]. At the LCLS, early work has demonstrated prediction of LPS images and current profiles at the linac exit using two separated multi-layer perceptrons (MLPs) [14] and a single input parameter. Later, a spectral virtual diagnostics was proposed to improve the prediction accuracy and robustness by using the CDR spectrum of the electron bunch as input [17]. Simulated CDR spectra were used in this study. Indicated by tests with simulated LPS images for the LCLS-II, a neural network is also expected to predict the microbunching structure in the LPS on a shot-to-shot basis using the spectrum as input with a fixed machine set point [17]. The latest work at the European XFEL injector demonstrated that a deep encoder-decoder neural network can achieve extremely high accuracy in predicting megapixel LPS images using up to three RF phases as input. The current profile, energy spectrum and slice energy spread extracted from the predicted LPS image all show very good agreement with the measurement [18]. In addition, an innovative method was demonstrated to efficiently build large models with multiple distinctive working points [18].

In this paper, we experimentally demonstrate training an encoder-decoder neural network model with more than one decoder to predict the LPS image and the CTR spectrum of the electron bunches in a single bunch train concurrently at the end of the FLASH linac. Building a model using the data-driven approach heavily relies on the availability of the data. However, the main focus of a user facility is to deliver electron or photon beams to user experiments and thus cannot allocate a large amount of time exclusively for routine data collection. We demonstrate that only a reasonable amount of data is required to train a performant model around a user working point, with all major control knobs for LPS manipulation included. More importantly, we propose a method which can significantly improve the existing online current profile measurement using a CTR spectrometer by combining predicted and measured CTR spectra. Analogous to mixed reality, we call diagnostics which provide a blend of predicted and measured signals mixed diagnostics. This approach offers the possibility to provide heterogeneous and reliable LPS information in real-time for electron bunches with a broad range of parameters.

The applications of the above encoder-decoder model are summarized in Figure 1. The model can serve as not only virtual and mixed diagnostics for online machine tuning and optimization, but also a digital twin of the actual machine on which machine tuning and optimization algorithms [1924] can be tested efficiently and effectively before applying them in the real world. These virtual experiments can even be performed in advance to find an optimal or near-optimal setup for the real machine. One of the major advantages of testing with neural network models is that it is orders of magnitude faster than real experiments or physics based simulations because the inference time is typically on the order of milliseconds. Compared with neural network models trained with simulated data [2527], models trained with experimental data provide better testing environments by generating predictions which are almost identical to the real-world signals.

FIGURE 1
www.frontiersin.org

FIGURE 1. Applications of an encoder-decoder neural network model with more than one decoder for heterogeneous predictions of electron bunch longitudinal properties.

2 Experimental Setup

FLASH is a soft x-ray free-electron laser (FEL) user facility which is capable of delivering MHz pulse trains to two user experiments, FLASH1 and FLASH2, in parallel with individually selected photon beam characteristics [28, 29]. The layout of the FLASH1 beamline is shown in Figure 2. The longitudinal properties of two electron bunches in a bunch train can be measured concurrently by streaking one of the electron bunches using the S-band TDS [30, 31] and picking up another one for the CRISP spectrometer using a fast kicker magnet [8]. It should be emphasized that the two diagnostics cannot measure the same bunch in a bunch train simultaneously. In this study, the electron bunch picked up by the CRISP spectrometer travels immediately after the one streaked by the TDS. A systematic comparison between these two diagnostics was conducted recently, which shows excellent agreement on the current profile down to the 10 fs level [9]. However, due to the non-optimized optics required by the parallel user experiment at the FLASH2 and a cap on the TDS power, the rms time resolution of the measurement using the TDS is larger than 70 fs in our experiments. Moreover, the CRISP spectrometer has two sets of remotely interchangeable grating sets, which cover different frequency ranges. The low-frequency grating set covers the range from 0.7 to 6.6 THz, and the high-frequency grating set covers the range from 6.9 to 58.8 THz. In order to have an accurate reconstruction of the current profile, the full spectra range from 0.7 to 58.8 THz is required. For a meticulous characterization of the current profiles at a few set points, this can be achieved by recording data using the two grating sets consecutively [9]. However, this is not practical for a large number of machine set points. A detailed discussion about how to address this issue when building a neural network model is presented in Section 4.2.

FIGURE 2
www.frontiersin.org

FIGURE 2. Schematic of the FLASH1 beamline. Components are not to scale. ACC1, ACC23, ACC45 and ACC67 are 1.3 GHz cryomodules which boost the electron energy. ACC39 is a 3.9 GHz cryomodule which linearizes the LPS before the first bunch compression stage. The CTR radiation is generated by deflecting one electron bunch onto an off-axis screen using a fast magnetic kicker and measured by the coherent radiation intensity spectrometer (CRISP).

Up to 6 control knobs, including the phases and amplitudes of ACC1, ACC39, and ACC23, are scanned during data collection around a user working point because of the following considerations. During machine operation, the RF phases and amplitudes of ACC1, ACC39, and ACC23 are commonly used to adjust the LPS of the electron bunch in order to optimize the FEL performance. The LPS is very sensitive to the phase and amplitude changes of these three RF stations because they affect the LPS of an electron bunch upstream the bunch compressors. ACC45 and ACC67 are operated on-crest downstream the 2nd bunch compressor and thus the LPS is very insensitive to the phase changes of these two RF stations. The amplitudes of ACC45 and ACC67 can be used to slightly tweak the beam energy which have nevertheless negligible impact on the LPS shape.

The scanned parameters are sampled uniformly within predefined ranges, which are mainly restricted by the OTR screen size for the TDS. For each combination, the new values are written into the control system via the Python interface of DOOCS [32] and the data readout has a delay of 0.5 s. Since it takes time to collect data which belong to the same bunch train but are sent out from different sources, the actual data collection speed is about 1 Hz.

3 Methods

3.1 Data Wrangling

The data quality is essential to the performance of a neural network model. First of all, the data quality in this study is controlled during data taking. A single data point is recorded for each randomly selected machine set point. It prevents data with the same machine set point from appearing in both the train and test data, which leads to overfitting, as much as possible. Secondly, the data are further normalized and cleaned. The original size of the 12-bit camera image is 1360 × 1024 pixels. After background subtraction, the pixels are normalized by 4095. All the pixel values below 0.01 are set to 0 in order to remove negative pixel values and suppress background noise. Although the model is expected to learn and predict the position of the LPS on the screen [18], the horizontal position of the beam (along the streaking direction) depends on a lot of machine parameters, including the phase and amplitude of the TDS. We noticed that the beam sometimes moved significantly horizontally even when the machine set point remained unchanged. As a result, it is not possible to train a performant model even when only a single control knob (e.g., the phase of ACC23) is scanned. Since the horizontal position of the beam only provides the timing information, which can actually be measured using the bunch arrival monitors [33] installed at different locations of the machine, all the LPS images are cropped to 768 × 1024 and centered horizontally. Moreover, an image will be removed from the dataset if the electron bunch is completely or partially off the screen, or if the electron bunch is very close to the left or right edge.

3.2 Modeling

The detailed structure of the encoder-decoder neural network is shown in Figure 3. A MLP is used to build latent features from the input parameters. Two different decoders then translate the latent features to the LPS image and CTR spectrum (the raw signal measured by the CRISP spectrometer), respectively. Latent features are designed to be a compressed representation of the LPS, just as it can be interpreted by decoders for different longitudinal property diagnostics. It could also facilitate building a larger model efficiently by sharing decoders among multiple encoders which represent different machine working points [18]. The neural network model is implemented and trained using the machine learning framework TensorFlow [34] version 2.4.3. For training, we adopt the weight initialization in [35] and the Adam optimizer [36]. 80% of the data are used for training with a minibatch size of 32, and the rest are used for testing.

FIGURE 3
www.frontiersin.org

FIGURE 3. Diagram of the encoder-decoder neural network. The leftmost blue box represents the input layer. It is followed by three fully-connected layers (encoder) in purple with each layer activated by the Leaky ReLU (Rectified Linear Unit) function. The latent space is depicted in grey. The latent space leads to two decoder branches. One consists of several fully-connected layers with each layer activated by the Leaky ReLU function. The other consists of ten transposed convolutional layers in yellow. Each transposed convolutional layer is followed by a batch normalization layer and activated by the leaky ReLU function except the last one, which is activated by the sigmoid function depicted in green. The kernel sizes of the first and second transposed convolutional layers are 3 × 4 and 3 × 3, respectively, and the kernel sizes of the other eight transposed convolutional layers are all 5 × 5. The total number of trainable parameters is 2,096,681 with 6 input parameters.

The loss function Ltotal for training is given by

Ltotal=LLPS+wLspectrum,(1)

where LLPS and Lspectrum are the loss functions for the LPS image decoder and the spectrum decoder, respectively, and w is the weight which balances the influences of the two decoders. The model is trained with a learning rate of 3 × 10−4 for 400 epochs, and then a learning rate of 1 × 10−4 for another 400 epochs. During the first 400 epochs, both LLPS and Lspectrum are the mean squared error (MSE) and w is set to 1. During the second 400 epochs, LLPS is changed to the multiscale structural similarity index measure (SSIM) [37] with hyperparameters defined in [18] and w is set to 100 empirically in order to make the losses of the two decoders at the same order of magnitude at the end of the training. There are two advantages for using different loss functions for the LPS image decoder in different training phases. First, the multiscale SSIM loss is much more computationally expensive than the MSE loss. In our implementation, the training time for a single batch reduces by about 30% when the MSE loss is used. Therefore, the multiscale SSIM loss is only used to fine-tune the model in order to learn high-frequency features in the LPS images [18]. Second, it is found that sometimes the model does not learn correctly near a corner or an edge for all the images when only the multiscale SSIM loss is employed, as shown in Figure 4A. This can be explained by the gradients of both the loss functions, as shown in Figure 4B. The pixel values near the lower-left corner of the predicted LPS image shown in Figure 4A result in an extremely small gradient of the SSIM loss on the plateau, which prevents them from converging towards 0. However, how those pixels are trapped on the plateau is not clear.

FIGURE 4
www.frontiersin.org

FIGURE 4. (A) Example of a prediction from a model trained using only the multiscale SSIM loss for the LPS image decoder. The pixel values near the lower-left corner of the predicted image are trapped in values around 1. This phenomenon occurs in all the LPS images. (B) SSIM and MSE losses calculated between two unary images as a function of the pixel value difference. The pixel value of the first image is 0 while the pixel value of the second image ranges from 0 to 1.

The performance of our model is reported separately for the LPS image decoder and the spectrum decoder using the single-scale SSIM and MSE as metrics, respectively, over the test dataset. The single-scale SSIM value ranges between 0 and 1 with a value of 1 indicating two images are exactly the same.

4 Results

We record multiple datasets during two different beam times with different machine setups (working points). The beam energy is ∼1124 MeV for the first working point (WP1) and ∼1027 MeV for the second one (WP2). The bunch charge is ∼400 pC for both the working points. As mentioned previously, the beam parameters are not optimized for the LPS measurement due to the parallel FLASH2 user experiment. Consequently, it is found that the LPSs depend on the streaking direction of the TDS in a non-trivial way for WP2 due to transverse-longitudinal correlations [6, 9], which makes it difficult to compare the current profiles measured by the TDS and the CRISP spectrometer. Nonetheless, this does not affect the conclusion of this study because the goal is to achieve an excellent agreement between the prediction and the measurement. Actually, the agreement between the current profiles measured by the TDS and the CRISP spectrometer can be verified using the dataset of WP1.

4.1 Prediction

Five typical prediction results from a dataset of WP1 are shown in Figure 5. The electron bunches have significantly different peak currents. The histograms of the scanned parameters are shown in Figure 6. The performance of the LPS image decoder is 0.9877 ± 0.00227 and the performance of the spectrum decoder is 7.9, ×, 10−5 ± 4.6 × 10−4. As shown in Figures 5A,B, the predicted LPS images all agree with the measured ones excellently. There are also excellent agreements between the measured and predicted spectra, as shown in Figure 5C.

FIGURE 5
www.frontiersin.org

FIGURE 5. Five typical results from the test data of a dataset of WP1. The compression strength increases monotonically from left to right. (A) Measured LPS images. δ denotes the fractional energy deviation. (B) Predicted LPS images. (C) Comparisons between the measured and predicted spectra from the low-frequency grating set of the CRISP spectrometer. (D) Comparisons of the current profiles calculated from the measured and predicted LPS images as well as reconstructed from the measured and predicted spectra.

FIGURE 6
www.frontiersin.org

FIGURE 6. Histograms of the scanned parameters for the dataset shown in Figure 5. The total number of data points is about 3000.

The current profiles calculated from the measured and predicted LPS images as well as reconstructed from the measured and predicted spectra using the combination of analytical (Kramers-Kronig) and iterative phase retrieval methods [9] are shown in Figure 5D. The spectra in the left-most case (weakly compressed) contain only noise. Therefore, no current profiles can be reconstructed from them. In the right-most case, due to considerable energy loss induced by the CSR effect, there are unknown portion of electrons off the screen. Therefore, the peak current calculated from the LPS image is much smaller than that reconstructed from the spectrum. For the other three cases, the current profiles calculated from the LPS images and reconstructed from the spectra agree reasonably well. Due to the lack of phase information in such spectral measurements [9], minor deviations in measured and predicted spectrum can lead to noticeable differences of the reconstructed current profiles. Nonetheless, key features (e.g., peak current) are barely affected by this.

It is noticed that there are conspicuous density modulations in the measured LPSs while the predicted LPSs are rather smooth. The density modulation is indeed induced by the microbunching instability which is seeded by the shot noise of an electron bunch [11, 3840]: an initial small density modulation inside the electron bunch can result in sufficient energy modulation due to the longitudinal space charge force, which in turn causes larger density modulation in a magnetic chicane bunch compressor. Because of the poor longitudinal resolution of the TDS measurement, only very weak density modulation can be observed in the current profile calculated from the measured LPS of the longest bunch in Figure 5. The reconstructed current profiles from the spectra also do not show any evidence of the density modulation due to the lack of high-frequency components in the spectrum. Because such density modulation is not deterministic, it cannot be predicted by the neural network model using only RF phases and amplitudes as input. However, it is worth mentioning that the non-deterministic density modulation does not degrade the performance of the model. It appears that the model tends to predict the averaged LPS in which the density modulation is smoothed out.

In order to demonstrate the scalability of the model, six parameters are scanned for a dataset of WP2. The histograms of the scanned parameters are shown in Figure 7. In total, more than 9,000 data points are taken. However, only about 5200 data points survive the data wrangling. Nonetheless, the performance of the LPS image decoder is 0.9801 ± 0.00978 and the performance of the spectrum decoder is 3.9, ×, 10−4 ± 3.2 × 10−3. An example prediction is shown in Figure 8.

FIGURE 7
www.frontiersin.org

FIGURE 7. Histograms of the scanned parameters for a dataset of WP2. The total number of data points is about 5200. The input data are not uniformly distributed because of two reasons. First, the data are combined from two datasets taken successively. The parameter ranges of the second dataset are smaller than the first one. Second, more than 40% of the data are dropped, as discussed in Section 3.1.

FIGURE 8
www.frontiersin.org

FIGURE 8. An example result for the scan shown in Figure 7. (A) Measured LPS image. (B) Predicted LPS image. (C) Comparison of the current profiles calculated from the measured and predicted LPS images. Because the LPS depends on the streaking direction of the TDS in a non-trivial way due to transverse-longitudinal correlations, the current profile here is simply the projection of the LPS image in the vertical direction. (D) Comparison of the measured and predicted spectra from the low-frequency grating set of the CRISP spectrometer. (E) Comparison of the current profiles reconstructed from the measured and predicted spectra.

4.2 Mixed Diagnostics

As mentioned previously, it is essential to combine the spectra from both grating sets of the CRISP spectrometer in order to achieve an accurate reconstruction of the current profile. At European XFEL, the high beam energy and short bunch length make it possible to extrapolate the result measured by the high-frequency grating set to the low-frequency regime [41]. However, this is not feasible for the typical electron bunches at FLASH, especially when the electron bunches have a broad range of bunch lengths. Although another decoder could be trained to predict the spectrum measured by the high-frequency grating set, there are two concerns with this approach. Firstly, the spectra from the high-frequency grating set generally contain microbunching information, which fluctuates from bunch to bunch and train to train. Secondly, this will double the data collection time, which could be unaffordable for user facilities with very limited beam time for non-user experiments.

In order to improve the reconstructed current profile for real-time measurement, a mixed diagnostics method is proposed. The predicted spectrum for the low-frequency grating set can be combined with the measured spectrum from the high-frequency grating set so as to achieve the full-range spectrum in real time. To this end, we recorded two datasets using the high-frequency and low-frequency grating sets, respectively. Each dataset contains about 3000 data points. Only three control knobs (the phases of ACC1, ACC39 and ACC23) are scanned within the same ranges. A neural network model is trained using the dataset recorded with the low-frequency grating set. The performance of the LPS image decoder is 0.9854 ± 0.00394 and the performance of the spectrum decoder is 1.8 × 10−4 ± 1.3 × 10−3. The trained model is also tested against all the LPS images in the dataset recorded with the high-frequency grating set and the performance of the LPS image decoder is as high as 0.9814 ± 0.00525. It indicates that the machine drift during data recording is negligible. Three typical prediction results with significantly different peak currents are shown in Figure 9. They all show that it is essential to have the high-frequency components to reveal the exact shape and structures of the current profile although the low-frequency components are good enough to estimate the bunch length and the peak current.

FIGURE 9
www.frontiersin.org

FIGURE 9. Three typical results for mixed diagnostics with the CRISP spectrometer. The measured LPS image and the mixed spectrum are shown for each result. The current profiles reconstructed from the mixed spectra are compared with the current profiles reconstructed from only the predicted (virtual) spectra.

5 Conclusion

In summary, we have experimentally demonstrated highly accurate megapixel LPS images and CTR spectra predictions for electron bunches in a bunch train concurrently at the end of the FLASH linac. Up to six major control knobs for LPS manipulation are scanned in order to collect data for electron bunches with broad ranges of LPS shapes and peak currents. The model is capable of providing heterogeneous LPS information and ensures reliable online diagnostics because a single type of diagnostic cannot cover electron bunches with a broad range of parameters. LPS images measured in the time domain are essential for electron bunches longer than a few hundreds of fs while CTR spectra provide more accurate current profiles for strongly compressed electron bunches.

A mixed diagnostics method is proposed to significantly improve the online current profile measurement using the CRISP spectrometer. The predicted CTR spectrum for the low-frequency grating set can be combined with the spectrum measured by the high-frequency grating set, which enables reconstruction of the current profile with a much higher accuracy in real time. Although the current profile reconstructed from the spectrum measured by the low-frequency grating set is good enough to estimate the bunch lengths and peak currents for typical electron bunches at FLASH, the spectrum measured by the high-frequency grating set is indispensable to reveal the exact shape and structures of the current profile.

Data Availability Statement

The raw data supporting the conclusion of this article will be made available by the authors, without undue reservation.

Author Contributions

JZ, MC, and HS designed the study; MC coordinated the experiments at FLASH; JZ, MC, and NL performed the measurements. JZ designed the neural network model, wrote the manuscript as well as the code for data collection and analysis; NL reconstructed the current profiles from the CTR spectra and provided critical insight for interpreting the data.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We would like to thank V. Ayvazyan, J. Branlard, and S. Pfeiffer for advices on RF phase and amplitude scan, M. Hoffmann, J. Kral, J. Roensch-Schulenburg, C. Schmidt, and M. Vogt for assistance as well as useful discussions during the data collection campaigns. We are grateful to the FLASH team for support and to B. Steffen for his thoughtful comments.

References

1. Decking W, Abeghyan S, Abramian P, Abramsky A, Aguirre A, Albrecht C, et al. A Mhz-Repetition-Rate Hard X-ray Free-Electron Laser Driven by a Superconducting Linear Accelerator. Nat Photon (2020) 14:391. doi:10.1038/s41566-020-0607-z

CrossRef Full Text | Google Scholar

2. Qi F, Ma Z, Zhao L, Cheng Y, Jiang W, Lu C, et al. Breaking 50 Femtosecond Resolution Barrier in Mev Ultrafast Electron Diffraction with a Double bend Achromat Compressor. Phys Rev Lett (2020) 124:134803. doi:10.1103/PhysRevLett.124.134803

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Jalas S, Kirchen M, Messner P, Winkler P, Hübner L, Dirkwinkel J, et al. Bayesian Optimization of a Laser-Plasma Accelerator. Phys Rev Lett (2021) 126:104801. doi:10.1103/PhysRevLett.126.104801

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Lindstrøm CA, Garland JM, Schröder S, Boulton L, Boyle G, Chappell J, et al. Energy-spread Preservation and High Efficiency in a Plasma-wakefield Accelerator. Phys Rev Lett (2021) 126:014801. doi:10.1103/PhysRevLett.126.014801

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Tang H, Zhao L, Zhu P, Zou X, Qi J, Cheng Y, et al. Stable and Scalable Multistage Terahertz-Driven Particle Accelerator. Phys Rev Lett (2021) 127:074801. doi:10.1103/PhysRevLett.127.074801

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Emma P, Frisch J, Krejcik P. A Transverse Rf Deflecting Structure for bunch Length and Phase Space Diagnostics (2000).

Google Scholar

7. Akre R, Bentson L, Emma P, Krejcik P. A Transverse Rf Deflecting Structure for bunch Length and Phase Space Diagnostics. Pacs2001 Proc 2001 Part Accelerator Conf (Cat. No.01CH37268) (2001) 33:2353–5. doi:10.1109/PAC.2001.987379

CrossRef Full Text | Google Scholar

8. Wesch S, Schmidt B, Behrens C, Delsim-Hashemi H, Schmüser P. A Multi-Channel Thz and Infrared Spectrometer for Femtosecond Electron bunch Diagnostics by Single-Shot Spectroscopy of Coherent Radiation. Nucl Instr Methods Phys Res Section A: Acc Spectrometers, Detectors Associated Equipment (2011) 665:40–7. doi:10.1016/j.nima.2011.11.037

CrossRef Full Text | Google Scholar

9. Schmidt B, Lockmann NM, Schmüser P, Wesch S. Benchmarking Coherent Radiation Spectroscopy as a Tool for High-Resolution bunch Shape Reconstruction at Free-Electron Lasers. Phys Rev Accel Beams (2020) 23:062801. doi:10.1103/PhysRevAccelBeams.23.062801

CrossRef Full Text | Google Scholar

10. Lockmann NM, Gerth C, Schmidt B, Wesch S. Noninvasive Thz Spectroscopy for bunch Current Profile Reconstructions at Mhz Repetition Rates. Phys Rev Accel Beams (2020) 23:112801. doi:10.1103/PhysRevAccelBeams.23.112801

CrossRef Full Text | Google Scholar

11. Qiang J, Ding Y, Emma P, Huang Z, Ratner D, Raubenheimer TO, et al. Start-to-end Simulation of the Shot-Noise Driven Microbunching Instability experiment at the Linac Coherent Light Source. Phys Rev Accel Beams (2017) 20:054402. doi:10.1103/PhysRevAccelBeams.20.054402

CrossRef Full Text | Google Scholar

12. Anirudh R, Thiagarajan JJ, Bremer P-T, Spears BK. Improved Surrogates in Inertial Confinement Fusion with Manifold and Cycle Consistencies. Proc Natl Acad Sci U.S.A (2020) 117:9741–6. doi:10.1073/pnas.1916634117

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly Accurate Protein Structure Prediction with Alphafold. Nature (2021) 596:583–9. doi:10.1038/s41586-021-03819-2

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Emma C, Edelen A, Hogan MJ, O’Shea B, White G, Yakimenko V. Machine Learning-Based Longitudinal Phase Space Prediction of Particle Accelerators. Phys Rev Accel Beams (2018) 21:112802. doi:10.1103/PhysRevAccelBeams.21.112802

CrossRef Full Text | Google Scholar

15. Edelen A, Neveu N, Emma C, Ratner D, Mayes C. Machine Learning Models for Optimization and Control of X-ray Free Electron Lasers. Vancouver: NeurIPS Machine Learning for the Physical Sciences Workshop (2019).

Google Scholar

16. Emma C, Edelen A, Hanuka A, O’Shea B, Scheinker A. Virtual Diagnostic Suite for Electron Beam Prediction and Control at Facet-Ii. Information (2021) 12:61. doi:10.3390/info12020061

CrossRef Full Text | Google Scholar

17. Hanuka A, Emma C, Maxwell T, Fisher AS, Jacobson B, Hogan MJ, et al. Accurate and Confident Prediction of Electron Beam Longitudinal Properties Using Spectral Virtual Diagnostics. Sci Rep (2021) 11:2945. doi:10.1038/s41598-021-82473-0

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Zhu J, Chen Y, Brinker F, Decking W, Tomin S, Schlarb H. High-fidelity Prediction of Megapixel Longitudinal Phase-Space Images of Electron Beams Using Encoder-Decoder Neural Networks. Phys Rev Appl (2021) 16:024005. doi:10.1103/PhysRevApplied.16.024005

CrossRef Full Text | Google Scholar

19. Scheinker A, Edelen A, Bohler D, Emma C, Lutman A. Demonstration of Model-independent Control of the Longitudinal Phase Space of Electron Beams in the Linac-Coherent Light Source with Femtosecond Resolution. Phys Rev Lett (2018) 121:044801. doi:10.1103/PhysRevLett.121.044801

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Leemann SC, Liu S, Hexemer A, Marcus MA, Melton CN, Nishimura H, et al. Demonstration of Machine Learning-Based Model-independent Stabilization of Source Properties in Synchrotron Light Sources. Phys Rev Lett (2019) 123:194801. doi:10.1103/PhysRevLett.123.194801

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Duris J, Kennedy D, Hanuka A, Shtalenkova J, Edelen A, Baxevanis P, et al. Bayesian Optimization of a Free-Electron Laser. Phys Rev Lett (2020) 124:124801. doi:10.1103/PhysRevLett.124.124801

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Bruchon N, Fenu G, Gaio G, Lonza M, O’Shea FH, Pellegrino FA, et al. Basic Reinforcement Learning Techniques to Control the Intensity of a Seeded Free-Electron Laser. Electronics (2020) 9:781. doi:10.3390/electronics9050781

CrossRef Full Text | Google Scholar

23. Kain V, Hirlander S, Goddard B, Velotti FM, Della Porta GZ, Bruchon N, et al. Sample-efficient Reinforcement Learning for Cern Accelerator Control. Phys Rev Accel Beams (2020) 23:124801. doi:10.1103/PhysRevAccelBeams.23.124801

CrossRef Full Text | Google Scholar

24. Scheinker A, Cropp F, Paiagua S, Filippetto D. An Adaptive Approach to Machine Learning for Compact Particle Accelerators. Sci Rep (2021) 11:19187. doi:10.1038/s41598-021-98785-0

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Edelen A, Neveu N, Frey M, Huber Y, Mayes C, Adelmann A. Machine Learning for Orders of Magnitude Speedup in Multiobjective Optimization of Particle Accelerator Systems. Phys Rev Accel Beams (2020) 23:044601. doi:10.1103/PhysRevAccelBeams.23.044601

CrossRef Full Text | Google Scholar

26. Hanuka A, Huang X, Shtalenkova J, Kennedy D, Edelen A, Zhang Z, et al. Physics Model-Informed Gaussian Process for Online Optimization of Particle Accelerators. Phys Rev Accel Beams (2021) 24:072802. doi:10.1103/PhysRevAccelBeams.24.072802

CrossRef Full Text | Google Scholar

27. Roussel R, Hanuka A, Edelen A. Multiobjective Bayesian Optimization for Online Accelerator Tuning. Phys Rev Accel Beams (2021) 24:062801. doi:10.1103/PhysRevAccelBeams.24.062801

CrossRef Full Text | Google Scholar

28. Ackermann W, Asova G, Ayvazyan V, Azima A, Baboi N, Bähr J, et al. Operation of a Free-Electron Laser from the Extreme Ultraviolet to the Water Window. Nat Photon (2007) 1:336. doi:10.1038/nphoton.2007.76

CrossRef Full Text | Google Scholar

29. Faatz B, Plönjes E, Ackermann S, Agababyan A, Asgekar V, Ayvazyan V, et al. Simultaneous Operation of Two Soft X-ray Free-Electron Lasers Driven by One Linear Accelerator. New J Phys (2016) 18:062002. doi:10.1088/1367-2630/18/6/062002

CrossRef Full Text | Google Scholar

30. Röhrs M, Gerth C, Schlarb H, Schmidt B, Schmüser P. Time-resolved Electron Beam Phase Space Tomography at a Soft X-ray Free-Electron Laser. Phys Rev ST Accel Beams (2009) 12:050704. doi:10.1103/PhysRevSTAB.12.050704

CrossRef Full Text | Google Scholar

31. Behrens C, Gerasimova N, Gerth C, Schmidt B, Schneidmiller EA, Serkez S, et al. Constraints on Photon Pulse Duration from Longitudinal Electron Beam Diagnostics at a Soft X-ray Free-Electron Laser. Phys Rev ST Accel Beams (2012) 15:030707. doi:10.1103/PhysRevSTAB.15.030707

CrossRef Full Text | Google Scholar

32. Hensler GG, Goloborodko S, Grygiel G, Hensler O, Kocharyan V, Rehlich K, et al. Doocs: An Object Oriented Control System as the Integrating Part for the Ttf Linac (1998).

Google Scholar

33. Viti M, Czwalinna M, Dinter H, Gerth C, Przygoda K, Rybaniec R, et al. The bunch Arrival Time Monitor at Flash and European Xfel. Proc ICALEPCS2017 (2017) TUPHA125.

Google Scholar

34. Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, et al. Tensorflow: Large-Scale Machine Learning on Heterogeneous Distributed Systems (2017).

Google Scholar

35. He K, Zhang X, Ren S, Sun J. Delving Deep into Rectifiers: Surpassing Human-Level Performance on Imagenet Classification. In: 2015 IEEE International Conference on Computer Vision (2015). p. 1026–34. doi:10.1109/ICCV.2015.123

CrossRef Full Text | Google Scholar

36. Kingma DP, Ba J. Adam: A Method for Stochastic Optimization (2015).

Google Scholar

37. Wang Z, Bovik AC, Sheikh HR, Simoncelli EP. Image Quality Assessment: From Error Visibility to Structural Similarity. IEEE Trans Image Process (2004) 13:600–12. doi:10.1109/TIP.2003.819861

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Huang Z, Kim K-J. Formulas for Coherent Synchrotron Radiation Microbunching in a bunch Compressor Chicane. Phys Rev ST Accel Beams (2002) 5:074401. doi:10.1103/PhysRevSTAB.5.074401

CrossRef Full Text | Google Scholar

39. Saldin EL, Schneidmiller EA, Yurkov MV. An Analytical Description of Longitudinal Phase Space Distortions in Magnetic bunch Compressors. Nucl Instr Methods Phys Res Section A: Acc Spectrometers, Detectors Associated Equipment (2002) 483:516–20. doi:10.1016/S0168-9002(02)00372-8

CrossRef Full Text | Google Scholar

40. Ratner D, Behrens C, Ding Y, Huang Z, Marinelli A, Maxwell T, et al. Time-resolved Imaging of the Microbunching Instability and Energy Spread at the Linac Coherent Light Source. Phys Rev ST Accel Beams (2015) 18:030704. doi:10.1103/PhysRevSTAB.18.030704

CrossRef Full Text | Google Scholar

41. Lockmann NM. Noninvasive Measurements of Electron Bunch Current Profiles with Few-Femtosecond Resolution at MHz Repetition Rates. Universität Hamburg: Ph.D. thesis (2021).

Google Scholar

Keywords: free-electron laser (FEL), longitudinal phase space, machine learning, encoder-decoder, mixed diagnostics

Citation: Zhu J, Lockmann NM, Czwalinna MK and Schlarb H (2022) Mixed Diagnostics for Longitudinal Properties of Electron Bunches in a Free-Electron Laser. Front. Phys. 10:903559. doi: 10.3389/fphy.2022.903559

Received: 24 March 2022; Accepted: 08 June 2022;
Published: 22 July 2022.

Edited by:

Alexander Scheinker, Los Alamos National Laboratory (DOE), United States

Reviewed by:

Daniel Phillips, Ohio University, United States
Zhi Zeng, Hefei Institutes of Physical Science (CAS), China

Copyright © 2022 Zhu, Lockmann, Czwalinna and Schlarb. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: J. Zhu, jun.zhu@desy.de

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.