CVE-2026-47155
- EPSS 0.14%
- Veröffentlicht 22.06.2026 22:20:10
- Zuletzt bearbeitet 24.06.2026 16:49:17
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.22.0, vLLM's revision pinning controls do not consistently apply to all artifacts loaded for a model. A deployment that supplies --revision or --code-revision can st...
CVE-2026-41523
- EPSS 0.39%
- Veröffentlicht 22.06.2026 22:18:14
- Zuletzt bearbeitet 24.06.2026 16:48:45
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.22.0, an assert-based security check in vLLM's activation function loading allows any unauthenticated attacker to achieve arbitrary code execution on the server by p...
CVE-2026-54232
- EPSS 0.29%
- Veröffentlicht 22.06.2026 22:16:43
- Zuletzt bearbeitet 24.06.2026 16:51:45
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.22.1, the vLLM Dockerfile is vulnerable to a dependency confusion attack through the flashinfer-jit-cache package. The package is installed from a custom index (flas...
CVE-2026-54233
- EPSS 0.25%
- Veröffentlicht 22.06.2026 22:10:45
- Zuletzt bearbeitet 24.06.2026 16:52:33
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.23.1rc0, vLLM's /v1/audio/transcriptions endpoint limits compressed upload size but not decoded PCM output. A 25MB OPUS file expands to ~14.9GB of float32 PCM at dec...
CVE-2026-54236
- EPSS 0.82%
- Veröffentlicht 22.06.2026 22:09:15
- Zuletzt bearbeitet 24.06.2026 16:53:59
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.23.1rc0, the fix for CVE-2026-22778, which introduced a sanitize_message helper that strips object-repr memory addresses from error messages before they reach the cl...
CVE-2026-54235
- EPSS 0.32%
- Veröffentlicht 22.06.2026 21:59:02
- Zuletzt bearbeitet 24.06.2026 16:53:13
vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.23.1rc0, ll temperature validation gates use comparison operators (<, >), which silently evaluate to False for NaN and for positive Infinity in Python's IEEE 754 flo...
CVE-2026-48746
- EPSS 0.74%
- Veröffentlicht 22.06.2026 21:57:28
- Zuletzt bearbeitet 24.06.2026 16:49:36
vLLM is an inference and serving engine for large language models (LLMs). From 0.3.0 until 0.22.0, a vulnerability in ASGI web servers and starlette's trust on those web servers enables an authentication bypass of the OpenAI API AuthenticationMiddlew...
CVE-2026-53923
- EPSS 0.32%
- Veröffentlicht 22.06.2026 21:55:42
- Zuletzt bearbeitet 24.06.2026 16:51:00
vLLM is an inference and serving engine for large language models (LLMs). From 0.5.5 until 0.23.1rc0, integer truncation of tensor dimensions in vLLM's GGUF dequantize kernels (csrc/quantization/gguf/gguf_kernel.cu) causes partial tensor processing. ...
CVE-2026-5497
- EPSS 0.42%
- Veröffentlicht 11.06.2026 08:31:18
- Zuletzt bearbeitet 15.06.2026 16:11:21
vLLM versions 0.8.0 and later are vulnerable to an Out-of-Memory (OOM) Denial of Service (DoS) attack due to unbounded frame count processing in the `VideoMediaIO.load_base64()` method. When processing `video/jpeg` data URLs, the method splits the ba...
CVE-2026-4944
- EPSS 0.75%
- Veröffentlicht 28.05.2026 18:04:05
- Zuletzt bearbeitet 29.05.2026 15:39:34
vllm-project/vllm version 0.14.1 contains a vulnerability where the `trust_remote_code=True` parameter is hardcoded in two model implementation files (`vllm/model_executor/models/nemotron_vl.py` and `vllm/model_executor/models/kimi_k25.py`). This byp...