CVE-2026-47155
- EPSS 0.14%
- Veröffentlicht 22.06.2026 22:20:10
- Zuletzt bearbeitet 24.06.2026 16:49:17
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.22.0, vLLM's revision pinning controls do not consistently apply to all artifacts loaded for a model. A deployment that supplies --revision or --code-revision can st...
CVE-2026-41523
- EPSS 0.39%
- Veröffentlicht 22.06.2026 22:18:14
- Zuletzt bearbeitet 24.06.2026 16:48:45
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.22.0, an assert-based security check in vLLM's activation function loading allows any unauthenticated attacker to achieve arbitrary code execution on the server by p...
CVE-2026-54232
- EPSS 0.29%
- Veröffentlicht 22.06.2026 22:16:43
- Zuletzt bearbeitet 24.06.2026 16:51:45
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.22.1, the vLLM Dockerfile is vulnerable to a dependency confusion attack through the flashinfer-jit-cache package. The package is installed from a custom index (flas...
CVE-2026-54233
- EPSS 0.25%
- Veröffentlicht 22.06.2026 22:10:45
- Zuletzt bearbeitet 24.06.2026 16:52:33
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.23.1rc0, vLLM's /v1/audio/transcriptions endpoint limits compressed upload size but not decoded PCM output. A 25MB OPUS file expands to ~14.9GB of float32 PCM at dec...
CVE-2026-54236
- EPSS 0.82%
- Veröffentlicht 22.06.2026 22:09:15
- Zuletzt bearbeitet 24.06.2026 16:53:59
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.23.1rc0, the fix for CVE-2026-22778, which introduced a sanitize_message helper that strips object-repr memory addresses from error messages before they reach the cl...
CVE-2026-54235
- EPSS 0.32%
- Veröffentlicht 22.06.2026 21:59:02
- Zuletzt bearbeitet 24.06.2026 16:53:13
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.23.1rc0, ll temperature validation gates use comparison operators (<, >), which silently evaluate to False for NaN and for positive Infinity in Python's IEEE 754 flo...
CVE-2026-48746
- EPSS 0.74%
- Veröffentlicht 22.06.2026 21:57:28
- Zuletzt bearbeitet 24.06.2026 16:49:36
vLLM is an inference and serving engine for large language models (LLMs). From 0.3.0 until 0.22.0, a vulnerability in ASGI web servers and starlette's trust on those web servers enables an authentication bypass of the OpenAI API AuthenticationMiddlew...
CVE-2026-53923
- EPSS 0.32%
- Veröffentlicht 22.06.2026 21:55:42
- Zuletzt bearbeitet 24.06.2026 16:51:00
vLLM is an inference and serving engine for large language models (LLMs). From 0.5.5 until 0.23.1rc0, integer truncation of tensor dimensions in vLLM's GGUF dequantize kernels (csrc/quantization/gguf/gguf_kernel.cu) causes partial tensor processing. ...
CVE-2026-56340
- EPSS 0.29%
- Veröffentlicht 20.06.2026 18:27:10
- Zuletzt bearbeitet 24.06.2026 17:17:30
vLLM versions >= 0.10.2 and < 0.13.0 are missing sparse tensor validation in multimodal embeddings processing. Because PyTorch disables sparse tensor invariant checks by default, an attacker can submit crafted embedding requests with malformed (negat...
CVE-2025-71379
- EPSS 0.23%
- Veröffentlicht 20.06.2026 18:27:09
- Zuletzt bearbeitet 22.06.2026 21:16:19
vLLM versions >= 0.6.3 and < 0.9.0 contain multiple regular expression denial of service (ReDoS) vulnerabilities. Several regex patterns — in vllm/lora/utils.py, the phi4mini tool parser, and the OpenAI-compatible serving chat endpoint — are suscepti...