Thinking models (reasoning models) improve LLM performance on complex tasks by using more compute during response generation. Chain-of-thought prompting demonstrates that generating intermediate reasoning steps leads to better answers. Test-time compute strategies include generating multiple responses and selecting the best one

13m watch time

Sort: