Residual Reward Models for Preference-based Reinforcement Learning

Venue
arXiv:2507.00611
Year
2025
Chenyang Cao
Ph.D. Student
Mohamed Nabail
Ph.D. Student
Nicholas Rhinehart
Assistant Professor, PI of LEAF Lab