Recent developments in AI have focused on the use of test-time compute and chain-of-thought (CoT) to improve model performance by emulating human-like thinking processes, which involve both fast and slow thought modes. Techniques like parallel sampling and sequential revision are being explored for enhancing the decoding

18m read timeFrom lilianweng.github.io
Post cover image
Table of contents
Motivation #Thinking in Tokens #Thinking in Continuous Space #Thinking as Latent Variables #Scaling Laws for Thinking Time #What’s for Future #Citation #References #

Sort: