TurboQuant: Google's quantization method cuts KV cache memory by 6x with no accuracy loss
