LLM training in simple, raw C/CUDA.


Hacker News

LLM training in simple, pure C/CUDA. The post gives instructions for downloading and tokenizing datasets, initializing the model from GPT-2 weights, and decoding token IDs back to text.
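
To make the last step concrete, here is a minimal sketch in C of what decoding token IDs back to text amounts to: look up each ID in a vocabulary table and concatenate the pieces. The names `TOY_VOCAB` and `toy_decode` and the four-entry vocabulary are hypothetical and chosen only for illustration. This is not the tokenizer code from the repository, and a real GPT-2 vocabulary holds roughly 50,000 byte-level BPE entries loaded from a file.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical toy vocabulary: index = token id, value = token text.
   Stands in for a real vocabulary that would be loaded from disk. */
static const char *const TOY_VOCAB[] = { "Hello", ",", " world", "!" };
static const size_t TOY_VOCAB_SIZE = sizeof(TOY_VOCAB) / sizeof(TOY_VOCAB[0]);

/* Decode n token ids into out (capacity cap, including the NUL terminator).
   Returns the number of bytes written (excluding the NUL),
   or -1 on an unknown id or if the output buffer is too small. */
static int toy_decode(const int *ids, size_t n, char *out, size_t cap) {
    assert(out != NULL && cap > 0);
    size_t used = 0;
    out[0] = '\0';
    for (size_t i = 0; i < n; i++) {
        /* Reject ids that fall outside the vocabulary table. */
        if (ids[i] < 0 || (size_t)ids[i] >= TOY_VOCAB_SIZE) return -1;
        const char *piece = TOY_VOCAB[ids[i]];
        size_t len = strlen(piece);
        /* Leave room for the terminating NUL. */
        if (used + len + 1 > cap) return -1;
        memcpy(out + used, piece, len);
        used += len;
        out[used] = '\0';
    }
    return (int)used;
}
```

With this toy table, the ID sequence 0, 1, 2, 3 decodes to the string "Hello, world!". Note that the leading space belongs to the " world" token itself, which mirrors how GPT-2 style vocabularies attach whitespace to the following word.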

karpathy/llm.c: LLM training in simple, raw C/CUDA