Explains the fundamental mathematical concepts needed to understand how Large Language Models work, focusing on vectors, matrices, high-dimensional spaces, embeddings, and projections. Covers vocab spaces where logits represent token probabilities, embedding spaces where similar concepts cluster together, and how matrix multiplication enables projections between different dimensional spaces. Demonstrates that neural network layers are essentially matrix multiplications that project between spaces, making LLM inference accessible with high-school level mathematics.
Table of contents
Vectors and high-dimensional spacesVocab spaceEmbeddingsProjections by matrix multiplicationNeural networksWrapping upComments (1):Leave a comment:1 Comment
Sort: