I am a third year Ph.D. student at Duke University advised by Prof. Pan Xu. My primary research interests focus on reinforcement learning (RL), with a specific emphasis on off-dynamics RL, RL-driven applications in healthcare, and the incorporation of RL techniques into foundation models. I am also exploring the use of RL to improve large language models (LLMs), particularly in enhancing their reasoning capabilities and alignment with human.
Powered by Jekyll and Minimal Light theme.