Manticore Search 13.2.3 introduces vector quantization that compresses vectors from 32-bit floats to 8-bit or 1-bit representations, reducing RAM usage by 4x to 32x while maintaining search performance. The feature includes asymmetric quantization for better accuracy, oversampling and rescoring options to recover full-precision

14m read time From manticoresearch.com
Post cover image
Table of contents
What vector quantization isEnabling VQWhy oversampling + rescoring mattersBenchmarksConclusions

Sort: