Publications
Reinforcement Learning
-
Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption, NeurIPS 2025. [Code]
Longxiang He, Li Shen, Junbo Tan, Xueqian Wang.
-
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization, Preprint 2024. [Code]
Longxiang He, Li Shen, Junbo Tan, Xueqian Wang.
-
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning, Preprint 2024. [Code]
Longxiang He, Li Shen, Linrui Zhang, Junbo Tan, Xueqian Wang.
-
FOSP: Fine-tuning Offline Safe Policy through World Models, ICLR 2025.
Chenyang Cao, Yucheng Xin, Silang Wu, Longxiang He, Zichen Yan, Junbo Tan, Xueqian Wang.