examples/models/gemma4_31b: CUDA Engine/Session adapter + OpenAI serving #1584


Job Run time
26s
40m 8s
13m 52s
1h 12m 1s
46m 24s
9m 47s
9m 7s
38m 29s
11m 15s
12m 15s
15m 23s
10m 56s
17m 2s
10m 41s
11m 0s
10m 21s
10m 26s
11m 34s
10m 14s
11m 30s
10m 35s
10m 30s
11m 51s
11m 33s
Total: 6h 57m 20s