[ET Device Support] CUDA-native Qwen 3.5 MoE inference with device tensor pipeline #1562
| Job | Run time |
|---|---|
|  | 30s |
|  | 9m 10s |
|  | 14m 34s |
|  | 38m 30s |
|  | 39m 6s |
|  | 9m 46s |
|  | 34m 40s |
|  | 36m 31s |
|  | 11m 15s |
|  | 15m 45s |
|  | 11m 37s |
|  | 10m 58s |
|  | 11m 43s |
|  | 11m 22s |
|  | 10m 21s |
|  | 12m 1s |
|  | 12m 32s |
|  | 12m 38s |
|  | 16m 28s |
|  | 17m 13s |
|  | 10m 22s |
|  | 11m 3s |
|  | 10m 15s |
|  | 10m 45s |
| Total | 6h 19m 5s |