I'm Sebastian: a machine learning & AI researcher, programmer, and author. As a Staff Research Engineer at Lightning AI, I focus on the intersection of AI research, software development, and large language models (LLMs).

Sebastian Raschka's Blog offers insights, tutorials, and research updates on machine learning, deep learning, and artificial intelligence. Covering topics such as neural networks, data science, and Python programming, it provides resources for students, researchers, and practitioners in the field of AI. Developers can learn about algorithms, research methodologies, and practical applications of machine learning through Raschka's blog posts and publications.

Sebastian Raschka

Learn how to transform pretrained large language models (LLMs) into effective text classifiers, with a focus on spam classification. The post walks through the process of finetuning GPT models, discussing the modification of model outputs, the importance of transformer blocks, and various experiments to optimize model performance. It also announces the release of the author's new book on building GPT-like LLMs from scratch, which takes a deep dive into understanding and constructing LLMs.

Building A GPT-Style LLM Classifier From Scratch

Initializing a model with pretrained weights

Build A Large Language Model From Scratch