[ET Device Support] CUDA-native Qwen 3.5 MoE inference with device tensor pipeline #1562

Job Run time
30s
9m 10s
14m 34s
38m 30s
39m 6s
9m 46s
34m 40s
36m 31s
11m 15s
15m 45s
11m 37s
10m 58s
11m 43s
11m 22s
10m 21s
12m 1s
12m 32s
12m 38s
16m 28s
17m 13s
10m 22s
11m 3s
10m 15s
10m 45s
Total: 6h 19m 5s