Residual Reward Models for Preference-based Reinforcement Learning

Venue
arXiv:2507.00611
Year
2025
Mohamed Nabail
Ph.D. Student
Nicholas Rhinehart
Assistant Professor

PI of LEAF Lab