Recent developments in AI have focused on the use of test-time compute and chain-of-thought (CoT) to improve model performance by emulating human-like thinking processes, which involve both fast and slow thought modes. Techniques like parallel sampling and sequential revision are being explored for enhancing the decoding
Table of contents
Motivation #Thinking in Tokens #Thinking in Continuous Space #Thinking as Latent Variables #Scaling Laws for Thinking Time #What’s for Future #Citation #References #Sort: