I am currently researching reinforcement learning (RL), deep learning, large language models (LLMs), and their cross-applications. Specifically, I am familiar with RL methods, applications, and their algorithm construction; I am also engaged in research on the safety of large models, such as jailbreak methods and applications, and I am aware of LLMs distillation technology and characteristics.