•From x.com

Harrison Chase @hwchase17
RT @Vtrivedy10: there’s a great mapping between reward design/RL and writing great evals empathy towards agents failure modes is a good wa…
Sort:

Harrison Chase @hwchase17
RT @Vtrivedy10: there’s a great mapping between reward design/RL and writing great evals empathy towards agents failure modes is a good wa…
Sort: