This post discusses bigram language modeling, its implementation, and its comparison to a neural network model. It also provides an overview of the training loop and inference process.

15m read time From pub.towardsai.net
Post cover image
Table of contents
Bigram Language Modeling From ScratchUsing Neural NetworkUsing Entire dataset

Sort: