From x.com
rryssf_'s profile

Robert Youssef @rryssf_

RT @rryssf_: Meta FAIR just solved the "cold start" problem in LLM training. When a model scores 0/128 on hard math problems, standard RL t…
