Hacker News
Reinforcement Learning from Human Feedback
96 points by onurkanbkrc 9 hours ago | 5 comments
https://arxiv.org/abs/2504.12501