I am a senior BS @HIT and also an incoming MS @HIT, a member of the SCIR LA. My current research interests focus on RL4LLM. I have research experience in Safe RL and Offline RL.
Intern:
WestlakeU (2023.12-2024.9)
Du Xiaoman Financial (2024.1-)
Publication:
(ICML2024, seconed author)Reinformer: Max-Return Sequence Modeling for Offline RL (https://proceedings.mlr.press/v235/zhuang24b.html)
Email: