This post explores the process of choosing an open source model for an LLM RAG QA Chatbot, and delves into the concept of quantization in language modeling. It discusses different quantization algorithms commonly used in deep learning and the advantages and limitations of each algorithm. The post also explores the options of

6m read timeFrom levelup.gitconnected.com
Post cover image
Table of contents
What is Quantization?Local Model
1 Comment

Sort: