Skip to content

qwen3_5_moe: add CUDA Engine/Session execution path #14059

qwen3_5_moe: add CUDA Engine/Session execution path

qwen3_5_moe: add CUDA Engine/Session execution path #14059

Triggered via push June 16, 2026 17:30
Status Success
Total duration 1h 20m 17s
Artifacts 17

cuda.yml

on: push
Get changed files  /  get-changed-files
2s
Get changed files / get-changed-files
CI run decision  /  decide
34s
CI run decision / decide
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
test-models-cuda  /  linux-job
47m 8s
test-models-cuda / linux-job
unittest-cuda  /  linux-job
52m 48s
unittest-cuda / linux-job
Matrix: test-cuda-pybind
Matrix: test-model-cuda-e2e
check-all-cuda-builds
4s
check-all-cuda-builds
Fit to window
Zoom out
Zoom in

Annotations

128 errors and 40 warnings
export-model-cuda-artifact (facebook, dinov2-small-imagenet1k-1-layer, non-quantized) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (facebook, dinov2-small-imagenet1k-1-layer, non-quantized) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (facebook, dinov2-small-imagenet1k-1-layer, non-quantized) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (Qwen, Qwen3-0.6B, non-quantized) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (Qwen, Qwen3-0.6B, non-quantized) / linux-job
Process completed with exit code 1.
export-model-cuda-artifact (Qwen, Qwen3-0.6B, non-quantized) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (Qwen, Qwen3-0.6B, non-quantized) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (openai, whisper-small, non-quantized) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (openai, whisper-small, non-quantized) / linux-job
Process completed with exit code 1.
export-model-cuda-artifact (openai, whisper-small, non-quantized) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (openai, whisper-small, non-quantized) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (openai, whisper-large-v3-turbo, quantized-int4-weight-only) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (openai, whisper-large-v3-turbo, quantized-int4-weight-only) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (openai, whisper-large-v3-turbo, quantized-int4-weight-only) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (nvidia, parakeet-tdt, non-quantized) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (nvidia, parakeet-tdt, non-quantized) / linux-job
Process completed with exit code 1.
export-model-cuda-artifact (nvidia, parakeet-tdt, non-quantized) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (nvidia, parakeet-tdt, non-quantized) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (nvidia, parakeet-tdt, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (nvidia, parakeet-tdt, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (nvidia, parakeet-tdt, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, quantized-int4-weight-only) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, quantized-int4-weight-only) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, quantized-int4-weight-only) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (google, gemma-3-4b-it, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (google, gemma-3-4b-it, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (google, gemma-3-4b-it, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (google, gemma-3-4b-it, non-quantized) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (google, gemma-3-4b-it, non-quantized) / linux-job
Process completed with exit code 1.
export-model-cuda-artifact (google, gemma-3-4b-it, non-quantized) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (google, gemma-3-4b-it, non-quantized) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (nvidia, diar_streaming_sortformer_4spk-v2, non-quantized) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (nvidia, diar_streaming_sortformer_4spk-v2, non-quantized) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (nvidia, diar_streaming_sortformer_4spk-v2, non-quantized) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (SocialLocalMobile, Qwen3.5-35B-A3B-HQQ-INT4, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (SocialLocalMobile, Qwen3.5-35B-A3B-HQQ-INT4, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (SocialLocalMobile, Qwen3.5-35B-A3B-HQQ-INT4, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
export-model-cuda-artifact (SocialLocalMobile, gemma-4-31B-it-HQQ-INT4, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
export-model-cuda-artifact (SocialLocalMobile, gemma-4-31B-it-HQQ-INT4, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
export-model-cuda-artifact (SocialLocalMobile, gemma-4-31B-it-HQQ-INT4, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (nvidia, parakeet-tdt, non-quantized) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (nvidia, parakeet-tdt, non-quantized) / linux-job
Process completed with exit code 1.
test-model-cuda-e2e (nvidia, parakeet-tdt, non-quantized) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (nvidia, parakeet-tdt, non-quantized) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (nvidia, parakeet-tdt, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (nvidia, parakeet-tdt, quantized-int4-tile-packed) / linux-job
Process completed with exit code 1.
test-model-cuda-e2e (nvidia, parakeet-tdt, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (nvidia, parakeet-tdt, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (facebook, dinov2-small-imagenet1k-1-layer, non-quantized) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (facebook, dinov2-small-imagenet1k-1-layer, non-quantized) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (facebook, dinov2-small-imagenet1k-1-layer, non-quantized) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (nvidia, diar_streaming_sortformer_4spk-v2, non-quantized) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (nvidia, diar_streaming_sortformer_4spk-v2, non-quantized) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (nvidia, diar_streaming_sortformer_4spk-v2, non-quantized) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (google, gemma-3-4b-it, non-quantized) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (google, gemma-3-4b-it, non-quantized) / linux-job
Process completed with exit code 1.
test-model-cuda-e2e (google, gemma-3-4b-it, non-quantized) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (google, gemma-3-4b-it, non-quantized) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (google, gemma-3-4b-it, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (google, gemma-3-4b-it, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (google, gemma-3-4b-it, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-weight-only) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-weight-only) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-weight-only) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
Process completed with exit code 1.
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (SocialLocalMobile, Qwen3.5-35B-A3B-HQQ-INT4, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (SocialLocalMobile, Qwen3.5-35B-A3B-HQQ-INT4, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (SocialLocalMobile, Qwen3.5-35B-A3B-HQQ-INT4, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (SocialLocalMobile, gemma-4-31B-it-HQQ-INT4, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (SocialLocalMobile, gemma-4-31B-it-HQQ-INT4, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (SocialLocalMobile, gemma-4-31B-it-HQQ-INT4, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (openai, whisper-small, non-quantized) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (openai, whisper-small, non-quantized) / linux-job
Process completed with exit code 1.
test-model-cuda-e2e (openai, whisper-small, non-quantized) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (openai, whisper-small, non-quantized) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-weight-only) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-weight-only) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-weight-only) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
test-model-cuda-e2e (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed) / linux-job
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
test-model-cuda-e2e (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed) / linux-job
[OSDC] Step script exited with code 1. This is a script/workflow error, not an infrastructure issue. Check the step logs above for the actual failure.
test-model-cuda-e2e (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed) / linux-job
Not authorized to perform sts:AssumeRoleWithWebIdentity
CI run decision / decide
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@v4. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (facebook, dinov2-small-imagenet1k-1-layer, non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (Qwen, Qwen3-0.6B, non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (openai, whisper-small, non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (Qwen, Qwen3-0.6B, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (openai, whisper-large-v3-turbo, quantized-int4-weight-only) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-executorch-cuda-build-13.0 / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: ./test-infra/.github/actions/setup-ssh, actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (nvidia, parakeet-tdt, non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-executorch-cuda-build-12.6 / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: ./test-infra/.github/actions/setup-ssh, actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (nvidia, parakeet-tdt, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (mistralai, Voxtral-Mini-3B-2507, quantized-int4-weight-only) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (google, gemma-3-4b-it, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (google, gemma-3-4b-it, non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (nvidia, diar_streaming_sortformer_4spk-v2, non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (SocialLocalMobile, Qwen3.5-35B-A3B-HQQ-INT4, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-models-cuda / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: ./test-infra/.github/actions/setup-ssh, actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
export-model-cuda-artifact (SocialLocalMobile, gemma-4-31B-it-HQQ-INT4, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
unittest-cuda / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: ./test-infra/.github/actions/setup-ssh, actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (nvidia, parakeet-tdt, non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (nvidia, parakeet-tdt, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (facebook, dinov2-small-imagenet1k-1-layer, non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (nvidia, diar_streaming_sortformer_4spk-v2, non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (google, gemma-3-4b-it, non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (google, gemma-3-4b-it, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, quantized-int4-weight-only) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (mistralai, Voxtral-Mini-3B-2507, non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (SocialLocalMobile, Qwen3.5-35B-A3B-HQQ-INT4, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (SocialLocalMobile, gemma-4-31B-it-HQQ-INT4, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-cuda-pybind (qwen3-0.6b, Qwen-Qwen3-0.6B-cuda-non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: ./test-infra/.github/actions/setup-ssh, actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-cuda-pybind (qwen3-0.6b, --quantize, Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: ./test-infra/.github/actions/setup-ssh, actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-cuda-pybind (gemma3-4b, --quantize, google-gemma-3-4b-it-cuda-quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: ./test-infra/.github/actions/setup-ssh, actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (openai, whisper-small, non-quantized) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-weight-only) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-model-cuda-e2e (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed) / linux-job
Node.js 20 actions are deprecated. The following actions are running on Node.js 20 and may not work as expected: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093, aws-actions/configure-aws-credentials@67fbcbb121271f7775d2e7715933280b06314838, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. Actions will be forced to run with Node.js 24 by default starting June 16th, 2026. Node.js 20 will be removed from the runner on September 16th, 2026. Please check if updated versions of these actions are available that support Node.js 24. To opt into Node.js 24 now, set the FORCE_JAVASCRIPT_ACTIONS_TO_NODE24=true environment variable on the runner or in your workflow file. Once Node.js 24 becomes the default, you can temporarily opt out by setting ACTIONS_ALLOW_USE_UNSECURE_NODE_VERSION=true. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/

Artifacts

Produced during runtime
Name Size Digest
Qwen-Qwen3-0.6B-cuda-non-quantized
1.1 GB
sha256:487563a49a61dfe31be71fd0b12217a0ea4a5775ef2371c4375602d31a590b8b
Qwen-Qwen3-0.6B-cuda-quantized-int4-tile-packed
559 MB
sha256:acdbec9e99b4b2882bf82981ca6f3f3dc9b4bf3bd59c84a46aba706e8ecb67ac
SocialLocalMobile-Qwen3.5-35B-A3B-HQQ-INT4-cuda-quantized-int4-tile-packed
14.6 GB
sha256:e05d10dc1902d8d5c3ad4bb66e1b3fa35e62dd394b9c99c154c4fd091e566f91
SocialLocalMobile-gemma-4-31B-it-HQQ-INT4-cuda-quantized-int4-tile-packed
18.6 GB
sha256:0de79bd6f35ea0ee059f77822d9f00acfa7f5e8d8bfb7c7907e8239d1a745e34
facebook-dinov2-small-imagenet1k-1-layer-cuda-non-quantized
34.6 MB
sha256:66bdad5b942ef8ad22defaa5b018454a5c446e6d999e2b83d6a82a0283817f60
google-gemma-3-4b-it-cuda-non-quantized
7.22 GB
sha256:ef2270a6c2301a3c8cb3329ed8242b9d60f112c914c3e4e5e28c45ea6ec96e74
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
3.4 GB
sha256:62c894dcc30d252c40b582e0cdb544d8afb74568499539fb25578a9d17cfd4ee
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized
6.82 GB
sha256:be496e20f322c659b8937155684b8df73ad185dae9fe56c4e1050ad749c98fc9
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
2.88 GB
sha256:0d730b0d2fe931ada7a421bd60d412e11dc3a5628aa4465bddae1b3d539e62f4
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
6.2 GB
sha256:07a76832eb1205db00c6eb181928b44ea586e763d9b61690420384c66fa0e112
mistralai-Voxtral-Mini-4B-Realtime-2602-cuda-quantized-int4-tile-packed
2.71 GB
sha256:fe4fa2e39a3aca55aa782d2bf48351de666cf6edb54f186daf6fe5ec5e340cb9
nvidia-diar_streaming_sortformer_4spk-v2-cuda-non-quantized
436 MB
sha256:a6916e2ad9b9168f22883df54e45ef2fa1b161e8b58336e5e88c94c007ab01aa
nvidia-parakeet-tdt-cuda-non-quantized
952 MB
sha256:a2d779043b1a9f31de5036e0956ddee97c724d65b0e04ae13a46707d87433acf
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed
443 MB
sha256:bdc1d13d50722f3b9d69aafc2f8e98d5ff31410371ac0cf432043de2db11dd9c
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
491 MB
sha256:f6d1ae2c72c18f61369cb5202ac0a342167565ae9684b80b39aa140e8d1e6c4d
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
570 MB
sha256:16bd5cb219bc3cbc8fd32f5d424793b6cba0710328a7e822f0e175dc11236f2b
openai-whisper-small-cuda-non-quantized
362 MB
sha256:2b6a69c23e8088f472ec6d91d57f4fc89fc6f10a202005746c50cd659596e1b9