Intel released llm-scaler-vllm beta 0.11.1-b7, adding symmetric 4-bit integer quantization for Qwen3 models, support for PaddleOCR and GLM-4.6v-Flash, and various bug fixes. The Docker-based solution helps deploy GenAI workloads on Intel Battlemage graphics cards, optimized for Arc Pro B60 but compatible with other Arc Graphics hardware with sufficient vRAM.

2m read timeFrom phoronix.com
Post cover image

Sort: