Liger-Kernel is an open-source library designed to improve GPU efficiency when training large language models (LLMs), offering 20% higher training throughput and 60% lower memory usage with minimal code changes. Since its release in August 2024, it has gained significant traction and is integrated with major training