Vllm-project

Vllm

41 Schwachstellen gefunden.

Hinweis: Diese Liste kann unvollständig sein. Daten werden ohne Gewähr im Ursprungsformat bereitgestellt.
  • EPSS 0.14%
  • Veröffentlicht 22.06.2026 22:20:10
  • Zuletzt bearbeitet 24.06.2026 16:49:17

vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.22.0, vLLM's revision pinning controls do not consistently apply to all artifacts loaded for a model. A deployment that supplies --revision or --code-revision can st...

Exploit
  • EPSS 0.39%
  • Veröffentlicht 22.06.2026 22:18:14
  • Zuletzt bearbeitet 24.06.2026 16:48:45

vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.22.0, an assert-based security check in vLLM's activation function loading allows any unauthenticated attacker to achieve arbitrary code execution on the server by p...

Exploit
  • EPSS 0.29%
  • Veröffentlicht 22.06.2026 22:16:43
  • Zuletzt bearbeitet 24.06.2026 16:51:45

vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.22.1, the vLLM Dockerfile is vulnerable to a dependency confusion attack through the flashinfer-jit-cache package. The package is installed from a custom index (flas...

  • EPSS 0.25%
  • Veröffentlicht 22.06.2026 22:10:45
  • Zuletzt bearbeitet 24.06.2026 16:52:33

vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.23.1rc0, vLLM's /v1/audio/transcriptions endpoint limits compressed upload size but not decoded PCM output. A 25MB OPUS file expands to ~14.9GB of float32 PCM at dec...

Exploit
  • EPSS 0.82%
  • Veröffentlicht 22.06.2026 22:09:15
  • Zuletzt bearbeitet 24.06.2026 16:53:59

vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.23.1rc0, the fix for CVE-2026-22778, which introduced a sanitize_message helper that strips object-repr memory addresses from error messages before they reach the cl...

Exploit
  • EPSS 0.32%
  • Veröffentlicht 22.06.2026 21:59:02
  • Zuletzt bearbeitet 24.06.2026 16:53:13

vLLM is an inference and serving engine for large language models (LLMs). Prior to 0.23.1rc0, ll temperature validation gates use comparison operators (<, >), which silently evaluate to False for NaN and for positive Infinity in Python's IEEE 754 flo...

  • EPSS 0.74%
  • Veröffentlicht 22.06.2026 21:57:28
  • Zuletzt bearbeitet 24.06.2026 16:49:36

vLLM is an inference and serving engine for large language models (LLMs). From 0.3.0 until 0.22.0, a vulnerability in ASGI web servers and starlette's trust on those web servers enables an authentication bypass of the OpenAI API AuthenticationMiddlew...

  • EPSS 0.32%
  • Veröffentlicht 22.06.2026 21:55:42
  • Zuletzt bearbeitet 24.06.2026 16:51:00

vLLM is an inference and serving engine for large language models (LLMs). From 0.5.5 until 0.23.1rc0, integer truncation of tensor dimensions in vLLM's GGUF dequantize kernels (csrc/quantization/gguf/gguf_kernel.cu) causes partial tensor processing. ...

Exploit
  • EPSS 0.42%
  • Veröffentlicht 11.06.2026 08:31:18
  • Zuletzt bearbeitet 15.06.2026 16:11:21

vLLM versions 0.8.0 and later are vulnerable to an Out-of-Memory (OOM) Denial of Service (DoS) attack due to unbounded frame count processing in the `VideoMediaIO.load_base64()` method. When processing `video/jpeg` data URLs, the method splits the ba...

  • EPSS 0.75%
  • Veröffentlicht 28.05.2026 18:04:05
  • Zuletzt bearbeitet 29.05.2026 15:39:34

vllm-project/vllm version 0.14.1 contains a vulnerability where the `trust_remote_code=True` parameter is hardcoded in two model implementation files (`vllm/model_executor/models/nemotron_vl.py` and `vllm/model_executor/models/kimi_k25.py`). This byp...