The g4 GPU recipes uses CUDA 12.8. When the recommendation is for 13.0+. Path: inference/g4/single-host-serving/vllm Please update the images.
The g4 GPU recipes uses CUDA 12.8. When the recommendation is for 13.0+.
Path: inference/g4/single-host-serving/vllm
Please update the images.