vLLM v0.20.0 ships DeepSeek V4 + PyTorch 2.11 + FlashAttention 4
vLLM v0.20.0 comprises 752 commits from 320 contributors. Highlights: CUDA 13, PyTorch 2.11, Transformers v5, Python 3.14 support, FlashAttention 4 as the default attention backend, and 2-bit KV cache quantization.