MaxText is a high-performance, scalable LLM written in pure Python/JAX. It targets TPUs and GPUs for both training and inference, and aims to be a launching-off point for ambitious LLM projects: it achieves high-performance training and scales to tens of thousands of chips. MaxText is heavily inspired by MinGPT/NanoGPT.

7 min read · From github.com
Table of contents

- Overview
- Table of Contents
- Getting Started
- Runtime Performance Results
- Comparison to Alternatives
- Features and Diagnostics
