[ET Device Support] CUDA-native Qwen 3.5 MoE inference with device tensor pipeline #1573

Job Run time
32s
50m 24s
13m 16s
39m 1s
9m 24s
41m 30s
55m 32s
9m 48s
12m 8s
10m 4s
10m 38s
10m 37s
10m 53s
17m 51s
10m 41s
10m 59s
10m 17s
11m 6s
10m 36s
11m 6s
11m 9s
10m 21s
10m 37s
15m 19s
6h 43m 49s