BaNEL (Bayesian Negative Evidence Learning) is a novel algorithm that trains generative models using only failed attempts, addressing the challenge of extremely sparse rewards in hard problems like theorem proving and drug discovery. By learning patterns from failures through a separate generative model, BaNEL achieves up to

8m read timeFrom blog.ml.cmu.edu
Post cover image
Table of contents
Tackling Very Hard ProblemsLearning from Negative RewardsLearning a Generative Model of FailuresOnline Recursive UpdateExperiment: Adversarial Attack On Toy Language ModelExperiment: Language Model ReasoningClosing Remarks

Sort: