Tencent AI Lab developed ALPHA LLM, a novel framework that integrates MCTS with LLMs for self-improvement without additional data annotations. It significantly enhances LLMs' reasoning capabilities and demonstrated improved performance on the GSM8K and MATH datasets.
Sort: