研究目的
Applying a Reinforcement Learning model-free approach to the alignment of a seed laser at FERMI, the free-electron laser facility at Elettra Sincrotrone Trieste, to overcome the limitations of classical model-free approaches.
研究成果
The preliminary results show that the Q-learning with linear function approximation is effective for the alignment of a seed laser at FERMI. The approach represents a first step toward a fully automatic procedure, overcoming some limitations of classical model-free approaches.
研究不足
The need for the gradient of the objective function, high sensitivity to hyper-parameters, and the absence of a mechanism for storing and exploiting past experience are intrinsic limitations. The approach also requires a huge dataset and long learning time.
1:Experimental Design and Method Selection:
The study employs Q-learning with linear function approximation for the alignment of a seed laser. The methodology involves training and testing phases on two physical systems: the EOS and the FEL.
2:Sample Selection and Data Sources:
The state x is a 4-dimensional vector providing current voltage values applied to each piezo-motor. The input u defines the variation of the state along allowed directions.
3:List of Experimental Equipment and Materials:
The study involves tip-tilt mirrors driven by piezo-motors, charge-coupled devices (CCDs), and an I0 monitor for intensity measurement.
4:Experimental Procedures and Operational Workflow:
The training phase consists of runs of episodes where the controller learns to align the laser beam. The test phase evaluates the learned policy.
5:Data Analysis Methods:
The reward is shaped to avoid sparse reward, and the Q-learning update rule is applied with a discount factor.
独家科研数据包,助您复现前沿成果,加速创新突破
获取完整内容