[ET Device Support] CUDA-native Qwen 3.5 MoE inference with device tensor pipeline #1573
| Job | Run time |
|---|---|
| | 32s |
| | 50m 24s |
| | 13m 16s |
| | 39m 1s |
| | 9m 24s |
| | 41m 30s |
| | 55m 32s |
| | 9m 48s |
| | 12m 8s |
| | 10m 4s |
| | 10m 38s |
| | 10m 37s |
| | 10m 53s |
| | 17m 51s |
| | 10m 41s |
| | 10m 59s |
| | 10m 17s |
| | 11m 6s |
| | 10m 36s |
| | 11m 6s |
| | 11m 9s |
| | 10m 21s |
| | 10m 37s |
| | 15m 19s |
| Total | 6h 43m 49s |