feat(gpu): update gpu plugin version to v2.6.8 #2410
Merged
eball approved these changes Jan 14, 2026
eball added a commit that referenced this pull request Jan 15, 2026
Meow33 pushed a commit to Meow33/Olares that referenced this pull request Feb 27, 2026
eball added a commit that referenced this pull request Mar 3, 2026
* docs/feat/content-add * docs/feat/stirlingpdf-more * docs/feat/stirlingpdf-add-index * docs/feat/stirlingpdf-refine * Update docs/use-cases/stirling-pdf.md Co-authored-by: Meow33 <supermonkey03@163.com> * docs/update/address-comments * Update docs/use-cases/stirling-pdf.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/stirling-pdf.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/stirling-pdf.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * ci: bump version to 1.12.5 (#2405) * docs/update/address-comments * docs/update/address-comment * docs: add CLI docs for user, upgrade, and disk commands (#2383) * docs: add CLI docs for user, upgrade, and disk commands * docs: update based on comments * docs: fix typo * docs: refine formatting and add description for argument * docs: resolve conflicts * feat(cli): sync kubeconfig for the original user invoking sudo (#2406) * fix: files check disk space for upload link and copy (#2407) * user-service: update mtranserverv2 (#2408) fix(user-service): update mtranserverv2 * docs: add PDFMathTranslate tutorial (#2378) * docs/feat/draft * docs/update/more-content * docs/updates/refine * docs/update/fix-build-conflict * docs/update/fix-broken-link * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * 
Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * docs/updates/compress-images * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * docs/update/comments * docs/update/refine * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * docs/update/comment * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * docs/update/fix-link * Update docs/use-cases/pdfmathtranslate.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * docs/update/comment --------- Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * feat(gpu): update gpu plugin version to v2.6.8 (#2410) * daemon: handle missing auth token for WebSocket connections (#2411) * fix: fix english highight missing (#2412) Co-authored-by: ubuntu <you@example.com> * olares-app, login: update version to v1.7.4 (#2413) * kubeblocks: skip check pod spec,status image (#2414) fix: skip check pod spec,status image * docs/update/fixtoc * docs/update/image-size-opt * settings: 
update search origin (#2417) feat: update system frontend version * fix: fix meaningless word highlight (#2418) Co-authored-by: ubuntu <you@example.com> * docs: add lobechat tutorial (#2368) * docs/feat/add-lobechat-tutorial * docs/feat/fix-images * docs/feat/lobechat-fixlink * docs/feat/iterate-content * docs/update/more-content * docs/updaate/refine * docs/feat/lobechat-refine * docs/feat/add-lobechat-index * docs/updates/fix-link * Update docs/use-cases/lobechat.md Co-authored-by: Meow33 <supermonkey03@163.com> * Update docs/use-cases/lobechat.md Co-authored-by: Meow33 <supermonkey03@163.com> * Update docs/use-cases/lobechat.md Co-authored-by: Meow33 <supermonkey03@163.com> * Update docs/use-cases/lobechat.md Co-authored-by: Meow33 <supermonkey03@163.com> * docs/update/address-comments * Apply suggestions from code review Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * docs/update/address-comment * docs/update/conflict * refine edit * docs/updates/image-size-opt * docs/update/resize * Apply suggestions from code review Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * docs/update/add-faq --------- Co-authored-by: Meow33 <supermonkey03@163.com> Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * feat: optimize highlight segment order (#2420) Co-authored-by: ubuntu <you@example.com> * authelia: fix target url parse method (#2421) * feat(olares-app): update new version to v1.7.6 (#2422) fix(share): fixed the error message that appeared after exceeding the upload limit. 
* feat(olares-app): update olares-app version to v1.7.7 (#2423) * hami: revert hami-core latest update (#2424) * tapr: add max retry for delete action (#2426) * tapr: upgrade pod template and image for PGCluster reconciliation (#2213) * tapr: upgrade pod template and image for PGCluster reconciliation * fix(ci): specify working directory in github action for tapr (#2215) --------- Co-authored-by: dkeven <82354774+dkeven@users.noreply.github.com> * tapr: upgrade pod template and image for PGCluster reconciliation * fix(kvrocks): update init container image and pull policy configuration (#2331) * tapr: change kvrocks running as root by default * fix: add max retry for delete action * tapr: update middleware-operator image tag to 0.2.31 --------- Co-authored-by: eball <liuy102@hotmail.com> Co-authored-by: dkeven <82354774+dkeven@users.noreply.github.com> * docs: refactor local access guide (#2419) * docs: refactor local access guide * Apply suggestions from code review Co-authored-by: Meow33 <supermonkey03@163.com> * address comments --------- Co-authored-by: Meow33 <supermonkey03@163.com> * daemon: modify mDNS registration method (#2427) daemon: update zeroconf dependency to v0.2.5 and modify mDNS registration method * docs/update/olares-space-storage-info * feat(cli): collect nginx logs stored temporarily in some containers (#2429) * cli: feat amdgpu install (#2430) * bfl: myapps api add rawAppName (#2432) * fix: myapps api add rawAppName field * update bfl api image tag to v0.4.39 * feat(olares-app): update version to v1.8.2 (#2433) * feat(olares-app): update version to v1.8.2 * feat(olares-app): update version to v1.8.2 * feat(gpu): supports dynamic detection of hot plugged-in GPUs (#2435) * tapr: add clickhouse support (#2437) * feat: add clickhouse support * fix: dependabot alerts * middleware-operator 0.2.32 * daemon: change pcap open timeout to 1 millisecond to prevent close hang (#2439) * appservice: add clickhouse support (#2440) * fix: failed release 
upgrade * fix: helm upgrade do not use atomic param and allow upgrade failed release * feat: add clickhouse support * appservice image tag to 0.4.76 * l4: skip invalid expose port (#2441) fix: skip invalid expose port (#2434) * cli: upgrade l4-bfl-proxy to v0.3.10 (#2442) * docs: add storage expansion via CLI (#2409) * docs: add storage expansion method * docs: add guide to access Olares terminal * Update zh.ts * fix formatting and file directory --------- Co-authored-by: yajing wang <413741312@qq.com> * download-server:add download err category && modify aria2 max concurrent (#2445) download server * appservice: v2 app stop (#2455) * fix: failed release upgrade * fix: helm upgrade do not use atomic param and allow upgrade failed release * feat: add icon filed to nats event * fix: v2 app stop * fix: check k8s request before into installing state * fix: add spec ports * set appservice image tag to 0.4.77 * refactor(cli): unify config of command line options and envs (#2453) * settings: add settings new version and update provider api (#2456) feat: add settings new version and update provider api * feat: search upgrade to v0.1.6 (#2459) Co-authored-by: ubuntu <you@example.com> * bfl: enhance user login background handling with style support (#2464) * fix: myapps api add rawAppName field * update bfl api image tag to v0.4.39 * feat: enhance user login background handling with style support (#2462) * bfl: enhance user login background handling with style support --------- Co-authored-by: hys <hysyeah@gmail.com> * settings, user service: update wallpaper style (#2463) feat: update system frontend and user service version * fix(cli): set node port range in minikube to allow smb service (#2460) * fix(cli): unify config setting for release command (#2465) * authelia: add user regulation for TOTP authentication attempts (#2466) * desktop, settings, files, vault: fix multiple known issues (#2467) feat: update login, system frontend, user service version * fix a link issue * 
ci: bump version to 1.12.6 (#2471) * app-service: add support for selecting GPU types in application installation (#2470) * fix: failed release upgrade * fix: helm upgrade do not use atomic param and allow upgrade failed release * feat: add icon filed to nats event * fix: v2 app stop * fix: check k8s request before into installing state * fix: add spec ports * set appservice image tag to 0.4.77 * feat: add support for selecting GPU types in application installation (#2458) * fix: failed release upgrade * fix: helm upgrade do not use atomic param and allow upgrade failed release * feat: add clickhouse support * appservice image tag to 0.4.76 * feat: add icon filed to nats event * chores: get all node gpu types * feat: add support for selecting GPU types in application installation * feat: enhance GPU type selection logic in application installation * feat: replace hardcoded GPU type with constant for supported GPU selection * feat: update app config methods to include selected GPU type and enhance validation for NVIDIA GPUs * feat: update supported GPU handling to include default options and improve validation logic * feat: update GPU resource handling to unset previous limits before setting new ones * feat: refactor permission parsing to use exported function and update related calls --------- Co-authored-by: hys <hysyeah@gmail.com> * app-service: add support for selecting GPU types in application installation --------- Co-authored-by: hys <hysyeah@gmail.com> * feat: support more scheme update to env crs (#2473) * fix(cli): bind config item to the effective command (#2474) * app-service: feat app uninstall delete data (#2480) * fix: failed release upgrade * fix: helm upgrade do not use atomic param and allow upgrade failed release * fix: v2 app stop * fix: check k8s request before into installing state * fix: add spec ports * feat(appservice): support updating more fields in api & controller (#2472) * fix: app uninstall delete data (#2478) --------- Co-authored-by: 
dkeven <82354774+dkeven@users.noreply.github.com> * fix(cli): do not override upgrade target version by config file (#2483) * market, settings: support optional data deletion and fix bugs. (#2486) * feat: support optional data deletion when uninstalling apps in Market * market: add deleteData switch, add users info * feat: update system frontend version --------- Co-authored-by: aby913 <aby913@163.com> * fix(cli): clear master host config when uninstalling (#2488) * download-server: nats message publish modify (#2489) download * backup: sync systemEnv default remote url (#2492) * fix: get systemenv remove host * backup: sync systemEnv default value * fix: seafile trim commit_id for syncing and change psql ccnet init (#2495) * Modify release-daemon.yaml for arm64 support Add support for arm64 architecture in release daemon workflow * docs: add docs for distributing olares apps (#2484) * docs: add docs for distributing olares apps * docs: update translation * Apply suggestion from @fnalways Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * docs: refine documentation structure * docs: fix punctuations --------- Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * docs: add middleware data access and integration guides (#2444) * docs: add guides to view middleware data * docs: add guide for grafana * docs: add guide for otel and integration guides for other middleware * docs: add guide for elasticsearch * docs: update based on suggestions * Update zh.ts * docs: update content * docs: resolve conflict * docs: add FAQs about activation and login (#2481) * add: FAQs about activation and login * Apply suggestion from @fnalways Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * Apply suggestion from @fnalways Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * address comments --------- Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * fix(cli): seperate dmesg args for dmesg 
logs (#2497) * app-service: handle case for system apps without configuration in permission API (#2499) * fix: failed release upgrade * fix: helm upgrade do not use atomic param and allow upgrade failed release * fix: v2 app stop * fix: check k8s request before into installing state * fix: add spec ports * feat(appservice): support updating more fields in api & controller (#2472) * fix: app uninstall delete data (#2478) * fix: handle case for system apps without configuration in permission API (#2498) * app-service: handle case for system apps without configuration in permission API --------- Co-authored-by: hys <hysyeah@gmail.com> Co-authored-by: dkeven <82354774+dkeven@users.noreply.github.com> * feat(cli): add more lines to default journalctl limit (#2502) * settings, market, files, vault, desktop: fix some ui bugs (#2503) feat: update system frontend version * docs: batch add docs for one (#2457) * add index and faq * add comfyui * add vpn and ssh * add deerflow * add expand storage * fix link * fix meta * refine first boot & spec * refine redeem basic plan * add open webui & fix formatting * batch review * add rest image * update introduction * add zh-cn * add nav * add screenshots * add single-drive setup & update dual-drive setup * add egpu * add steam * fix lint * align en and zh-cn * fix image path * fix lint * docs: add OpenClaw tutorial (#2506) * add OpenClaw tutorial * modify images, refine text for clarity * add to index page, add description * adjust table * remove hidden text * refinements for consistency * update for clarity and concise * appservice: handle case for system applications without configuration in provider list (#2509) * fix: failed release upgrade * fix: helm upgrade do not use atomic param and allow upgrade failed release * fix: v2 app stop * fix: check k8s request before into installing state * fix: add spec ports * feat(appservice): support updating more fields in api & controller (#2472) * fix: app uninstall delete data (#2478) * 
fix: handle case for system apps without configuration in permission API (#2498) * app-service: handle case for system apps without configuration in permission API * fix: handle case for system applications without configuration in provider list (#2507) * fix: update app-service image version to 0.5.2 --------- Co-authored-by: hys <hysyeah@gmail.com> Co-authored-by: dkeven <82354774+dkeven@users.noreply.github.com> * bfl: remove deprecated ingress mode handling from NginxController (#2511) * fix: myapps api add rawAppName field * update bfl api image tag to v0.4.39 * feat: enhance user login background handling with style support (#2462) * bfl: enhance user login background handling with style support * fix: remove deprecated ingress mode handling from NginxController * fix: update ingress image version to v0.3.29 --------- Co-authored-by: hys <hysyeah@gmail.com> * docs: fix sunshine address for .local domain and formatting for olares one docs (#2512) * improve wording for olares one iso image download * update the local address for sunshine * fix formatting for ssh access * olares-app: update version to v1.9.1 (#2515) * docs: update instructions per latest operations (#2517) update step per latest operation * authelia: add auth type param to user regulation (#2518) * olares-app: update version to v1.9.2 (#2520) * olares-app: update version to v1.9.2 * login: update version to v1.9.2 * docs: add skills and plugins management for OpenClaw (#2521) * add skills and plugins management for OpenClaw * resize images * update: add minimum permissions * olares-app: update version to v1.9.3 (#2524) * olares-app: update version to v1.9.3 * olares-app: update version to v1.9.3 * docs: resolve comments on managing apps (#2523) resolve comments on managing apps * docs: add SMB account management to Settings (#2526) add: SMB account manage in Settings * fix(cli): ignore finished pods in readiness check (#2528) * appservice: stop app if it is hami cause unschedule no wait (#2533) 
* fix: failed release upgrade * fix: helm upgrade do not use atomic param and allow upgrade failed release * fix: v2 app stop * fix: check k8s request before into installing state * fix: add spec ports * fix: set amd apu/gpu limit key to amd.com/gpu * fix: stop app if it is hami cause unschedule no wait (#2531) * fix: stop app if it is hami cause unschedule * ingore param from req if size=0 * update appservice image tag to 0.5.3 * bfl: add sync urls to master node (#2540) * fix: myapps api add rawAppName field * update bfl api image tag to v0.4.39 * feat: enhance user login background handling with style support (#2462) * bfl: enhance user login background handling with style support * fix: remove deprecated ingress mode handling from NginxController * fix: update ingress image version to v0.3.29 * feat(bfl): supports switch on/off access from external network (#2513) * fix: bfl add sync urls to master node (#2537) * fix: update bfl-ingress image to v0.3.30 (#2539) update bfl-ingress to v0.3.30 --------- Co-authored-by: hys <hysyeah@gmail.com> Co-authored-by: dkeven <82354774+dkeven@users.noreply.github.com> Co-authored-by: lovehunter9 <39935488+lovehunter9@users.noreply.github.com> * Use the Robot font to match the theme of the rest of the applications * docs: update content related to reference app (#2530) * updates for reference apps * more updates for reference app * l4-bfl-proxy: skip nginx reload if configuration has not changed (#2556) * fix: skip invalid expose port (#2434) * fix: skip nginx reload if configuration has not changed * fix: update L4_PROXY_IMAGE_VERSION to v0.3.11 in bfl_deploy.yaml and Olares.yaml --------- Co-authored-by: hysyeah <hysyeah@gmail.com> * cli: upgrade l4-bfl-proxy to v0.3.11 (#2557) * docs: update custom domain tutorial screenshots and align copy with latest UI (#2559) update screenshots to align with latest UI * docs: fix space nav display, extract use-case/developer sidebars, add note in space docs (#2562) * fix nav display 
issues, extract use case & developer docs * fix wording for space docs note * fix links * address comments * docs: update method for installing drivers on windows (#2564) * update installation method * add image, update zh version * add missing en version * refine link style * feat(olares-app): update olares-app version to v1.9.5 (#2563) * chore: update version from 1.12.6 to 1.12.5 in workflows and scripts * docs: updates for releasing resources and uninstalling shared apps (#2568) * faq on free up resources * update zh version * update UI label in zh version for accuracy * docs: add troubleshooting guide for memory not released after stopping apps (#2565) * add troubleshooting guide for memory not released after stopping apps * add zh-cn version & fix wording * Apply suggestions * Apply suggestions * docs: update the initialization steps for OpenClaw tutorial (#2567) * update initialization steps * remove outdated steps, fix indentation * add image for easy understanding * add images * update initialization, pairing, add upgrade notes * change to use complete command name * refine title to be concise * address comment * docs: update installation method of drivers on windows (#2566) * update installation method * add image, update zh version * add missing en version * refine link style * add one package driver installation for windows * update toc labels * update zh version * naming consistency * address comments * update method to use drive package only * update zh version * update a note * fix tip display * remove redundant word * feat(olares-app): update olares-app version to v1.9.6 (#2573) * docs: add troubleshooting guide for missing apps in Market (#2574) * add troubleshooting guide for missing apps in Market * address comment * fix: conditionally install storage for juicefs (#2579) * docs: revamp the "Advanced" page (previously "Developer") (#2534) * update images and related descriptions * Revamp the Developer resources page * address previous comments * 
Revamp zh version of Developer resources * update Settings index page * app-service: support injecting gpu memory and container selection (#2581) * fix: failed release upgrade * fix: helm upgrade do not use atomic param and allow upgrade failed release * fix: v2 app stop * fix: check k8s request before into installing state * fix: add spec ports * fix: set amd apu/gpu limit key to amd.com/gpu * fix: stop app if it is hami cause unschedule no wait (#2531) * fix: stop app if it is hami cause unschedule * ingore param from req if size=0 * update appservice image tag to 0.5.3 * feat: support injecting gpu memory and container selection (#2580) * refactor: unify GPU resource handling and remove hardcoded values * fix: handle CPU type selection in GPU resource management * feat: enhance GPU resource management with memory limits and chip type handling * feat: update GPU resource patching to support selective container injection * feat: adjust GPU memory format in deployment patching for compatibility * fix: revert unchanged file * Revert "fix: revert unchanged file" This reverts commit 5f48862. 
* fix: revert unchanged file * chore: update app-service image tag to 0.5.4 --------- Co-authored-by: hys <hysyeah@gmail.com> * cli, daemon: enhance DGX Spark support and update GPU type handling (#2496) * feat(gpu): enhance DGX Spark support and update GPU type handling * feat(amdgpu): refactor AMD GPU detection and support for GB10 chip and APU * feat(connector): enhance GB10 chip detection with environment variable support * feat(gpu): enhance DGX Spark support and update GPU type handling * feat(amdgpu): refactor AMD GPU detection and support for GB10 chip and APU * feat(connector): enhance GB10 chip detection with environment variable support * feat: add nvidia device plugin for gb10 * fix(gpu): update pod selector for hami-device-plugin based on GB10 chip detection fix(deploy): bump app-service image version to 0.4.78 * feat: enable CGO for building on ARM architecture and adjust build constraints for Linux * feat: enhance multi-architecture support for ARM64 in release workflow * feat: update multi-arch setup for ARM64 in release workflow * feat: enhance ARM64 multi-architecture support in release workflow * feat: streamline ARM64 cross-compilation setup in release workflow * feat: enhance ARM64 support by adding architecture-specific package installations * feat: update ARM64 package sources in release workflow for improved compatibility * feat: amd device plugin and container toolkit install * refactor: remove GB10 chip type check from GPU info update * feat(gpu): update hami version to v2.6.10-compatible for spark * fix: remove gb10 device plugin checking * fix: update klauspost/cpuid to v2.3.0 * fix: amd gpu check (#2522) * feat: enhance storage device detection with USB serial properties * feat: update hami version to v2.6.11-compatible-arm * feat: add chip type support for AMD and NVIDIA GPUs in node label updates * feat(gpu): supports auto binding GPU to app * feat(gpu): remove chip type handling from GPU label updates * feat(gpu): remove GPU type 
specification from DaemonSet and values.yaml * feat(gpu): remove GB10 device plugin installation and related checks * feat(gpu): update HAMi to v2.6.11 --------- Co-authored-by: dkeven <dkvvven@gmail.com> Co-authored-by: hys <hysyeah@gmail.com> * docs: add factory reset via BIOS and reinstall via USB (#2576) * add factory reset via BIOS and reinstall via USB * refine wording & add screenshots * add zh docs * address comments * docs: add how to check SSH password in vault (#2571) * add how to check SSH password in vault * reuse reset ssh content, improve wording & flow * docs: fix & streamline ssh access (#2584) fix & streamline ssh access * docs: fix model name used in tutorial (#2582) * fix model name used in tutorial * Update docs/use-cases/openclaw.md --------- Co-authored-by: Power-One-2025 <zhengchunhong@bytetrade.io> * docs: update Windows local access steps & tidy wording (#2587) update Windows .local access & tidy wording * fix: add kubeblocks addon chart image to manifest (#2590) * fix(cli): dynamic creation of nvidia runtimeclass (#2591) * ci: change cdn backend storage to cos (#2592) * fix: remove public-read ACL from coscmd upload commands in release workflows * fix: update upload command in release workflows to remove unnecessary 'cp' argument * fix: remove unnecessary 'cp' argument from coscmd upload commands in release workflows * fix: update error handling to check for both 403 and 404 HTTP status codes in upload scripts * Add VERSION environment variable to workflow * fix: coscmd invalid parameters * docs: updated wise and desktop docs (#2586) * docs: updated wise and desktop docs * Refined expressions. * Updated larepass index * refine wording * Updated translation. 
* Update docs/manual/larepass/index.md Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> --------- Co-authored-by: yajing wang <413741312@qq.com> Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> * fix(appservice): avoid race condition between upgrade & applyenv (#2594) * fix(appservice): avoid race condition between upgrade & applyenv (#2593) * chore(appservice): update image version to 0.5.5 --------- Co-authored-by: eball <liuy102@hotmail.com> * daemon: enhance USB device mounting by dynamically setting options based on filesystem type (#2596) fix: enhance USB device mounting by dynamically setting options based on filesystem type * l4-bfl-proxy: fix multi users app custom domain (#2599) * l4-bfl-proxy: fix multi users app custom domain (#2597) * l4-bfl-proxy: fix multi users app custom domain * fix: update error handling to check for both 403 and 404 HTTP status codes in upload scripts --------- Co-authored-by: eball <liuy102@hotmail.com> * system frontend: fix system app launch and display bugs. 
(#2600) * feat: update system frontend version * feat: update system frontend version --------- Co-authored-by: eball <liuy102@hotmail.com> * authelia: fix bug of sub-policy failed if set it to two-factor (#2601) authelia: fix sub-policy failed when the main policy is internal --------- Co-authored-by: Power-One-2025 <zhengchunhong@bytetrade.io> Co-authored-by: Meow33 <supermonkey03@163.com> Co-authored-by: Yajing <110797546+fnalways@users.noreply.github.com> Co-authored-by: dkeven <82354774+dkeven@users.noreply.github.com> Co-authored-by: lovehunter9 <39935488+lovehunter9@users.noreply.github.com> Co-authored-by: yyh <24493052+yongheng2016@users.noreply.github.com> Co-authored-by: salt <bleachzou2@163.com> Co-authored-by: ubuntu <you@example.com> Co-authored-by: wiy <guojianmin@bytetrade.io> Co-authored-by: hysyeah <hysyeah@gmail.com> Co-authored-by: berg <zyh2433219116@gmail.com> Co-authored-by: yajing wang <413741312@qq.com> Co-authored-by: simon <89775922+kaki-admin@users.noreply.github.com> Co-authored-by: aby913 <aby913@163.com> Co-authored-by: Ethan Collins <ETHAN.COLLINS@broncos.nfl.net> Co-authored-by: dkeven <dkvvven@gmail.com> Co-authored-by: Teng <142984611+TShentu@users.noreply.github.com>
Background
Allow GPU devices that were marked unhealthy by critical Xid error events to recover to healthy
Mount the node-level vGPU lock on tmpfs so that it is cleaned up automatically across machine reboots
Clean up orphaned pods that are assigned to GPU devices that no longer exist
Target Version for Merge
1.12.5
Related Issues
1.12.5
PRs Involving Sub-Systems
fix(device-plugin): allow devices to recover to healthy after xid error HAMi#10
fix(device-plugin): mount vgpu lock path in container to tmpfs in host HAMi#11
feat(scheduler): clean up pods allocated with missing GPUs HAMi#12
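The scheduler change in HAMi#12 can be pictured with the following Go sketch, which selects pods whose assigned GPUs are no longer present on the node. It is a simplified illustration under assumptions: `podAssignment` and `orphanedPods` are hypothetical names, and the real scheduler works on Kubernetes objects rather than plain structs.

```go
package main

import (
	"fmt"
	"sort"
)

// podAssignment pairs a pod with the GPU UUIDs allocated to it.
// Hypothetical type for illustration; not taken from HAMi.
type podAssignment struct {
	Name    string
	GPUUIDs []string
}

// orphanedPods returns the sorted names of pods that reference at least
// one GPU UUID missing from the set of currently present devices.
func orphanedPods(pods []podAssignment, present map[string]bool) []string {
	var orphans []string
	for _, p := range pods {
		for _, id := range p.GPUUIDs {
			if !present[id] {
				orphans = append(orphans, p.Name)
				break // one missing device is enough to flag the pod
			}
		}
	}
	sort.Strings(orphans)
	return orphans
}

func main() {
	present := map[string]bool{"GPU-0": true}
	pods := []podAssignment{
		{Name: "default/a", GPUUIDs: []string{"GPU-0"}},
		{Name: "default/b", GPUUIDs: []string{"GPU-1"}},
	}
	fmt.Println(orphanedPods(pods, present)) // [default/b]
}
```

Flagging a pod as soon as one of its devices is missing is a design choice made for this sketch; the actual cleanup criteria are defined in the linked HAMi PR.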
Other information:
none