Manticore Search 13.2.3 introduces vector quantization that compresses vectors from 32-bit floats to 8-bit or 1-bit representations, reducing RAM usage by 4x to 32x while maintaining search performance. The feature includes asymmetric quantization for better accuracy, oversampling and rescoring options to recover full-precision
14 min read. From manticoresearch.com
Table of contents
What vector quantization is
Enabling VQ
Why oversampling + rescoring matters
Benchmarks
Conclusions