From x.com

antirez @antirez

Sometimes people have the feeling that when a model is released it has stellar performance, and then a few weeks later it is somewhat less shiny. Oftentimes it is just human bias. However, when the AI provider swears it is the same model checkpoint, are you sure it didn't turn on some aggressive KV cache quantization?
