Skip to content

examples/models/qwen3_5_moe: CUDA Engine/Session adapter + OpenAI serving #1532

examples/models/qwen3_5_moe: CUDA Engine/Session adapter + OpenAI serving

examples/models/qwen3_5_moe: CUDA Engine/Session adapter + OpenAI serving #1532

Job Run time
33s
14m 59s
38m 40s
45m 59s
40m 51s
10m 26s
39m 43s
10m 30s
11m 16s
14m 0s
11m 35s
12m 1s
11m 42s
17m 49s
11m 37s
11m 52s
11m 11s
11m 46s
11m 20s
12m 2s
11m 58s
11m 16s
16m 48s
11m 59s
6h 41m 53s