This post explores the process of choosing an open source model for an LLM RAG QA Chatbot, and delves into the concept of quantization in language modeling. It discusses different quantization algorithms commonly used in deep learning and the advantages and limitations of each algorithm. The post also explores the options of
1 Comment
Sort: