KVarN: Native vLLM backend for KV-cache quantization by Huawei

Article URL: https://github.com/huawei-csl/KVarN

Comments URL: https://news.ycombinator.com/item?id=48399974

Points: 56

# Comments: 7