B-Pref: Benchmarking Preference-Based Reinforcement Learning. Kimin Lee; Laura Smith; Anca Dragan; Pieter Abbeel. 35th Conference on Neural Information Processing Systems (NeurIPS), Neural Information Processing Systems Foundation, 2021-12.