Qwen3.5-MoE CUDA V2 foundation: one model, many isolated sessions #77852


Triggered via pull request June 10, 2026 21:38
Status Cancelled
Total duration 15s

pull.yml

on: pull_request
Get changed files / get-changed-files (3s)
Matrix: test-qnn-testsuite-linux / test-backend-linux
android / build-android (6s)
CI run decision / decide (5s)
test-mcu-cortex-m-backend / linux-job (6s)
unittest / linux / linux-job (6s)
unittest / macos / macos-job (6s)
unittest / windows / windows-job (6s)
unittest-editable / linux / linux-job (7s)
unittest-editable / macos / macos-job (7s)
unittest-editable / windows / windows-job (7s)
unittest-buck / linux / linux-job (7s)
unittest-buck / macos / macos-job (7s)
test-qnn-direct-build-linux / linux-job (6s)
unittest-nxp-neutron / linux-job (6s)
test-samsung-quantmodels-linux / linux-job (6s)
test-samsung-models-linux / linux-job (6s)
test-vulkan-models-linux / linux-job (6s)
test-vulkan-operators-linux / linux-job (6s)
nxp-build-test / linux-job (6s)
test-binary-size-linux / job
test-binary-size-linux-gcc / job
test-build-wasm-linux / job
test-custom-ops-linux / job
test-eval_llama-wikitext-linux / job
test-llama-runner-linux-android / job
test-llama_runner_eager-linux / job
test-lora-linux / job
test-lora-multimethod-linux / job
test-mediatek-models-linux / job
test-moshi-linux / job
test-openvino-linux / job
test-parakeet-xnnpack-linux / job
test-phi-3-mini-runner-linux / job
test-qnn-buck-build-linux / job
test-qnn-delegate-linux / job
test-qnn-passes-linux / job
test-qnn-python-imports-linux / job
test-quantized-aot-lib-linux / job
test-selective-build-linux / job
test-setup-linux-gcc / job
test-voxtral-realtime-xnnpack-linux / job
unittest-buck / windows / job
Matrix: test-arm-backend-no-driver
Matrix: test-arm-cortex-m-size-test
Matrix: test-llama-runner-linux
Matrix: test-llama-runner-qnn-linux
Matrix: test-models-linux-basic
Matrix: test-models-linux
Matrix: test-multimodal-linux
Matrix: test-qnn-models-linux
Matrix: test-qnn-testsuite-linux / test-backend-macos (waiting for pending jobs)
Matrix: test-qnn-wheel-packages-linux
Matrix: test-sqnr-static-llm-qnn-linux
Matrix: test-static-llama-qnn-linux
Matrix: unittest-wasm-bindings
test-minimal-wheel-linux / job
test-qnn-testsuite-linux / package-golden-artifacts
android / run-emulator
Matrix: test-coreml-bc-macos (waiting for pending jobs)

Annotations

134 errors and 1 warning
android / build-android
Canceling since a higher priority waiting request for pull-20117-false-false exists
Get changed files / get-changed-files
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-samsung-quantmodels-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (linear, portable, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-binary-size-linux-gcc / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
unittest-nxp-neutron / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-direct-build-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-mcu-cortex-m-backend / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-vulkan-operators-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-selective-build-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-buck-build-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-custom-ops-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-wheel-packages-linux (3.10) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-build-wasm-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-arm-cortex-m-size-test (bare_metal) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-llama-runner-linux-android / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
nxp-build-test / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-parakeet-xnnpack-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-llama_runner_eager-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-passes-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-setup-linux-gcc / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-quantized-aot-lib-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-samsung-models-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux-basic (mv3, portable, cmake, linux.2xlarge, executorch-ubuntu-22.04-clang12) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-moshi-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-multimodal-linux (gemma3-4b) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-vulkan-models-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-voxtral-realtime-xnnpack-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-static-llama-qnn-linux (stories_260k_bc) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-llama-runner-linux (fp32, xnnpack+custom+qe, linux.2xlarge, executorch-ubuntu-22.04-clang12) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-binary-size-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-mediatek-models-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-arm-backend-no-driver (test_pytest_ops_no_target) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-lora-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-models-linux (mv2) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-sqnr-static-llm-qnn-linux (smollm2_135m) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-static-llama-qnn-linux (stories_110m) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-phi-3-mini-runner-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-llama-runner-qnn-linux (fp32, qnn_16a16w, qnn) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-models-linux (mv3) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-python-imports-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
unittest-wasm-bindings / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-llama-runner-qnn-linux (fp32, qnn_8a8w, qnn) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-openvino-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-wheel-packages-linux (3.11) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-eval_llama-wikitext-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-delegate-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-llama-runner-linux (fp32, xnnpack+custom+qe, linux.arm64.2xlarge, executorch-ubuntu-22.04-gc... / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux-basic (mv3, portable, cmake, linux.arm64.2xlarge, executorch-ubuntu-22.04-gcc11... / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
unittest-wasm-bindings (--enable-etdump) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-lora-multimethod-linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-arm-backend-no-driver (test_pytest_ops_tosa) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (linear, xnnpack-quantization-delegation, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-arm-cortex-m-size-test (zephyr-preset) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (add, portable, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-wheel-packages-linux (3.12) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
unittest-buck / macos / macos-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux-basic (mv3, portable, buck2, linux.2xlarge, executorch-ubuntu-22.04-clang12) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-llama-runner-linux (fp32, xnnpack+custom+quantize_kv, linux.2xlarge, executorch-ubuntu-22.04... / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-testsuite-linux / test-backend-linux (qnn, models) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-models-linux (dl3) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
unittest-buck / linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-arm-backend-no-driver (test_pytest_models_tosa) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (add, xnnpack-quantization-delegation, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
unittest / macos / macos-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-wheel-packages-linux (3.13) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
unittest-editable / linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
unittest-editable / windows / windows-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux-basic (mv3, xnnpack-quantization-delegation, cmake, linux.2xlarge, executorch-u... / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-qnn-testsuite-linux / test-backend-linux (qnn, operators) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-llama-runner-linux (fp32, xnnpack+custom+quantize_kv, linux.arm64.2xlarge, executorch-ubuntu... / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
unittest-editable / macos / macos-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
unittest / windows / windows-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
unittest / linux / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (add_mul, portable, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-llama-runner-linux (fp32, xnnpack+quantize_kv, linux.2xlarge, executorch-ubuntu-22.04-clang12) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux-basic (mv3, xnnpack-quantization-delegation, buck2, linux.2xlarge, executorch-u... / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-arm-backend-no-driver (test_run_tosa) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-llama-runner-linux (fp32, xnnpack+quantize_kv, linux.arm64.2xlarge, executorch-ubuntu-22.04-... / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux-basic (mv3, xnnpack-quantization-delegation, cmake, linux.arm64.2xlarge, execut... / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (add_mul, xnnpack-quantization-delegation, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-llama-runner-linux (bf16, custom, linux.2xlarge, executorch-ubuntu-22.04-clang12) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux-basic (vit, portable, cmake, linux.2xlarge, executorch-ubuntu-22.04-clang12) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
CI run decision / decide
Canceling since a higher priority waiting request for pull-20117-false-false exists
CI run decision / decide
The operation was canceled.
test-models-linux (ic3, portable, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (ic3, xnnpack-quantization-delegation, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux-basic (vit, portable, cmake, linux.arm64.2xlarge, executorch-ubuntu-22.04-gcc11... / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (mv2, portable, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux-basic (vit, xnnpack-quantization-delegation, cmake, linux.2xlarge, executorch-u... / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux-basic (vit, portable, buck2, linux.2xlarge, executorch-ubuntu-22.04-clang12) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (mv2, xnnpack-quantization-delegation, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (resnet18, xnnpack-quantization-delegation, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux-basic (vit, xnnpack-quantization-delegation, cmake, linux.arm64.2xlarge, execut... / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (resnet18, portable, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux-basic (vit, xnnpack-quantization-delegation, buck2, linux.2xlarge, executorch-u... / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (resnet50, portable, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (resnet50, xnnpack-quantization-delegation, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (mobilebert, portable, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (emformer_transcribe, portable, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (mobilebert, xnnpack-quantization-delegation, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (emformer_transcribe, xnnpack-quantization-delegation, linux.2xlarge) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (ic4, portable, linux.4xlarge.memory) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (emformer_join, xnnpack-quantization-delegation, linux.4xlarge.memory) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (ic4, xnnpack-quantization-delegation, linux.4xlarge.memory) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (llama3_2_vision_encoder, portable, linux.4xlarge.memory) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (emformer_join, portable, linux.4xlarge.memory) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (w2l, portable, linux.4xlarge.memory) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
test-models-linux (phi_4_mini, portable, linux.4xlarge.memory) / linux-job
Canceling since a higher priority waiting request for pull-20117-false-false exists
pull (25 annotations with the same message)
Canceling since a higher priority waiting request for pull-20117-false-false exists
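Messages of this form are what GitHub Actions emits when a run is cancelled because a newer run was queued in the same concurrency group. A minimal sketch of a workflow-level setting that would yield a group name shaped like `pull-20117-false-false` follows. The group expression is an assumption inferred from the shape of that name (workflow name, a pull request number or commit SHA, and two boolean flags); it is not copied from pull.yml.

```yaml
# Hypothetical fragment; the real expression in pull.yml may differ.
concurrency:
  # Assumed pattern: <workflow name>-<PR number or SHA>-<bool>-<bool>
  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.sha }}-${{ github.event_name == 'workflow_dispatch' }}-${{ github.event_name == 'schedule' }}
  # Cancel the in-progress run when a newer run enters the same group.
  cancel-in-progress: true
```

With a setting like this, pushing a new commit to the same pull request would cancel every job of the earlier run, which is consistent with the 15s total duration and the per-job cancellation annotations listed here.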
CI run decision / decide
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@v4. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
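The opt-in described in the warning could look like the fragment below in a workflow file. This is a sketch only: the job name `decide`, the runner label, and the choice to set `env` at workflow level are assumptions for illustration, not the contents of pull.yml. Only `actions/checkout@v4` and the `FORCE_JAVASCRIPT_ACTIONS_TO_NODE24` variable come from the warning text.

```yaml
# Hypothetical workflow fragment; not the actual pull.yml.
on: pull_request

env:
  # Opt in to running JavaScript actions on Node.js 24, per the warning above.
  FORCE_JAVASCRIPT_ACTIONS_TO_NODE24: "true"

jobs:
  decide:                    # job name assumed for illustration
    runs-on: ubuntu-latest   # assumed runner label
    steps:
      - uses: actions/checkout@v4
```

Setting the variable at workflow level applies it to every job; it could instead be scoped to a single job or set on the runner, as the warning notes.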