fix(scheduler): compatible with app's stop fast behavior by dkeven · Pull Request #17 · beclab/HAMi

dkeven · 2026-03-18T12:15:40Z

What type of PR is this?

/kind bug

What this PR does / why we need it:

After beclab/Olares#2699, an Application that hami-scheduler reports as unschedulable is stopped immediately by app-service. However, hami-scheduler also reports unschedulable, which makes kube-scheduler retry scheduling, in many retryable cases, such as a node being locked by another pod. In addition, because HAMi's informer is asynchronous, device occupation stats may not be updated immediately, so a pod may only be scheduled on the next retry. Two changes make HAMi compatible with this new logic:

Add a new event reason, InsufficientGPU, dedicated to the case where no available GPU resources can be found for the pod being scheduled, separating it from the other, retryable cases.
When a pod is deleted by HAMi-scheduler itself, update the in-memory device usage immediately rather than relying on the pod informer to update the state, avoiding potential race conditions with the deployment controller.

dkeven added 2 commits March 18, 2026 16:19

fix(scheduler): clear pod usage fast if deleted by ourselves

7d40d07

fix(scheduler): use a dedicated reason for insufficient GPU

b67df49

github-actions bot added the kind/bug label Mar 18, 2026

dkeven merged commit f62448f into feat/nvshare Mar 18, 2026
1 check passed

dkeven deleted the scheduler/fix/appsvc_stopfast_compat branch March 18, 2026 12:31

dkeven mentioned this pull request Mar 18, 2026

fix(hami-scheduler): compatible with app's stop fast behavior beclab/Olares#2712

Merged
