vLLM vllm/vllm-openai https://hub.docker.com/r/vllm/vllm-openai bridge 8000 8000 tcp **Nvidia Driver plugin** (nVidia Support) sh false https://discord.gg/jz7wjKhh6g https://docs.vllm.ai/ Easy, fast, and cheap LLM serving for everyone AI: http://[IP]:[PORT:8000]/ https://i.imgur.com/oQcntuY.png --runtime=nvidia --ipc=host all all /mnt/user/appdata/vllm 8000