Home
Publications
People
News
Teaching
Join Us
Residual Reward Models for Preference-based Reinforcement Learning
Chenyang Cao
,
Miguel Rogel-García
,
Mohamed Nabail
,
Xueqian Wang
,
Nicholas Rhinehart
.
Residual Reward Models for Preference-based Reinforcement Learning
.
arXiv:2507.00611
, (
arXiv
), 2025.
PDF
Cite
Reward Learning
Machine Learning
Robotics
Reinforcement Learning
Manipulation
Type
Preprint
Venue
arXiv:2507.00611
Year
2025
Chenyang Cao
Ph.D. Student
Mohamed Nabail
Ph.D. Student
Nicholas Rhinehart
Assistant Professor
PI of LEAF Lab
Cite
×