Bump the pip group across 2 directories with 4 updates by dependabot[bot] · Pull Request #3 · signcl/BELLE

dependabot · 2026-03-19T00:04:43Z

Bumps the pip group with 4 updates in the / directory: transformers, deepspeed, torch and tqdm.
Bumps the pip group with 4 updates in the /train directory: transformers, deepspeed, torch and tqdm.

Updates transformers from 4.28.1 to 4.53.0

Release notes

Sourced from transformers's releases.

Release v4.53.0

Gemma3n

Gemma 3n models are designed for efficient execution on low-resource devices. They are capable of multimodal input, handling text, image, video, and audio input, and generating text outputs, with open weights for pre-trained and instruction-tuned variants. These models were trained with data in over 140 spoken languages.

Gemma 3n models use selective parameter activation technology to reduce resource requirements. This technique allows the models to operate at an effective size of 2B and 4B parameters, which is lower than the total number of parameters they contain. For more information on Gemma 3n's efficient parameter management technology, see the Gemma 3n page.
from transformers import pipeline
import torch
pipe = pipeline(
"image-text-to-text",
torch_dtype=torch.bfloat16,
model="google/gemma-3n-e4b",
device="cuda",
)
output = pipe(
"https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/bee.jpg",
text="<image_soft_token> in this image, there is"
)
print(output)
Dia

Dia is an opensource text-to-speech (TTS) model (1.6B parameters) developed by Nari Labs. It can generate highly realistic dialogue from transcript including nonverbal communications such as laughter and coughing. Furthermore, emotion and tone control is also possible via audio conditioning (voice cloning).

Model Architecture: Dia is an encoder-decoder transformer based on the original transformer architecture. However, some more modern features such as rotational positional embeddings (RoPE) are also included. For its text portion (encoder), a byte tokenizer is utilized while for the audio portion (decoder), a pretrained codec model DAC is used - DAC encodes speech into discrete codebook tokens and decodes them back into audio.

Add Dia model by @buttercrab in #38405

Kyutai Speech-to-Text

Kyutai STT is a speech-to-text model architecture based on the Mimi codec, which encodes audio into discrete tokens in a streaming fashion, and a Moshi-like autoregressive decoder. Kyutai’s lab has released two model checkpoints:

kyutai/stt-1b-en_fr: a 1B-parameter model capable of transcribing both English and French

... (truncated)

Commits

67ddc82 Release: v4.53.0
0a8081b [Modeling] Fix encoder CPU offloading for whisper (#38994)
c63cfd6 Gemma 3n (#39059)
3e5cc12 [tests] remove tests from libraries with deprecated support (flax, tensorflow...
cfff7ca [Whisper] Pipeline: handle long form generation (#35750)
02ecdcf add _keep_in_fp32_modules_strict (#39058)
d973e62 fix condition where torch_dtype auto collides with model_kwargs. (#39054)
44b2316 [qwen2-vl] fix vision attention scaling (#39043)
ae15715 polishing docs: error fixes for clarity (#39042)
3abeaba Create test for #38916 (custom generate from local dir with imports) (#39015)
Additional commits viewable in compare view

Updates deepspeed from 0.9.0 to 0.15.1

Release notes

Sourced from deepspeed's releases.

v0.15.1 Patch release

What's Changed

Update version.txt after 0.15.0 release by @loadams in microsoft/DeepSpeed#6403

Fix Type Mismatch by @jomayeri in microsoft/DeepSpeed#6410

Fix redundant seq data parallel grp argument in Z3/MiCS by @samadejacobs in microsoft/DeepSpeed#5352

add Huawei Ascend NPU setup guide by @xuedinge233 in microsoft/DeepSpeed#6445

Add documentation for launcher without SSH by @dogacancolak-kensho in microsoft/DeepSpeed#6455

Dtype support check for accelerator in UTs by @raza-sikander in microsoft/DeepSpeed#6360

Store/Load CIFAR from local/offline by @raza-sikander in microsoft/DeepSpeed#6390

Add the accelerator setup guide link in Getting Started page by @rogerxfeng8 in microsoft/DeepSpeed#6452

Allow triton==3.0.x for fp_quantizer by @siddartha-RE in microsoft/DeepSpeed#6447

Change GDS to 1 AIO thread by @jomayeri in microsoft/DeepSpeed#6459

[CCL] fix condition issue in ccl.py by @YizhouZ in microsoft/DeepSpeed#6443

Avoid gds build errors on ROCm by @rraminen in microsoft/DeepSpeed#6456

TestLowCpuMemUsage UT get device by device_name by @raza-sikander in microsoft/DeepSpeed#6397

Add workflow to build DS without torch to better test before releases by @loadams in microsoft/DeepSpeed#6450

Fix patch for parameter partitioning in zero.Init() by @tohtana in microsoft/DeepSpeed#6388

Add default value to "checkpoint_folder" in "load_state_dict" of bf16_optimizer by @ljcc0930 in microsoft/DeepSpeed#6446

DeepNVMe tutorial by @tjruwase in microsoft/DeepSpeed#6449

bf16_optimizer: fixes to different grad acc dtype by @nelyahu in microsoft/DeepSpeed#6485

print warning if actual triton cache dir is on NFS, not just for default by @jrandall in microsoft/DeepSpeed#6487

DS_BUILD_OPS should build only compatible ops by @tjruwase in microsoft/DeepSpeed#6489

Safe usage of popen by @tjruwase in microsoft/DeepSpeed#6490

Handle an edge case where CUDA_HOME is not defined on ROCm systems by @amorehead in microsoft/DeepSpeed#6488

New Contributors

@xuedinge233 made their first contribution in microsoft/DeepSpeed#6445

@siddartha-RE made their first contribution in microsoft/DeepSpeed#6447

@ljcc0930 made their first contribution in microsoft/DeepSpeed#6446

@jrandall made their first contribution in microsoft/DeepSpeed#6487

@amorehead made their first contribution in microsoft/DeepSpeed#6488

Full Changelog: deepspeedai/DeepSpeed@v0.15.0...v0.15.1

DeepSpeed v0.15.0

What's Changed

Update version.txt after 0.14.5 release by @loadams in microsoft/DeepSpeed#5982

move pynvml install to setup.py by @Rohan138 in microsoft/DeepSpeed#5840

add moe topk(k>2) gate support by @inkcherry in microsoft/DeepSpeed#5881

Move inf_or_nan_tracker to cpu for cpu offload by @BacharL in microsoft/DeepSpeed#5826

Enable dynamic shapes for pipeline parallel engine inputs by @tohtana in microsoft/DeepSpeed#5481

Add and Remove ZeRO 3 Hooks by @jomayeri in microsoft/DeepSpeed#5658

DeepNVMe GDS by @jomayeri in microsoft/DeepSpeed#5852

Pin transformers version on nv-nightly by @loadams in microsoft/DeepSpeed#6002

DeepSpeed on Window blog by @tjruwase in microsoft/DeepSpeed#6364

Bug Fix 5880 by @jomayeri in microsoft/DeepSpeed#6378

Update linear.py compatible with torch 2.4.0 by @terry-for-github in microsoft/DeepSpeed#5811

GDS Swapping Fix by @jomayeri in microsoft/DeepSpeed#6386

Long sequence parallelism (Ulysses) integration with HuggingFace by @samadejacobs in microsoft/DeepSpeed#5774

reduce cpu host overhead when using moe by @ranzhejiang in microsoft/DeepSpeed#5578

... (truncated)

Commits

10ba3dd Handle an edge case where CUDA_HOME is not defined on ROCm systems (#6488)
662a421 Safe usage of popen (#6490)
ddd3571 Add default value to "checkpoint_folder" in "load_state_dict" of bf16_optimiz...
5d1a30c DS_BUILD_OPS should build only compatible ops (#6489)
ddeb0c1 Fix patch for parameter partitioning in zero.Init() (#6388)
9d17116 print warning if actual triton cache dir is on NFS, not just for default (#6487)
5df12a4 DeepNVMe tutorial (#6449)
cfc6ed3 bf16_optimizer: fixes to different grad acc dtype (#6485)
9b7fc54 Add workflow to build DS without torch to better test before releases (#6450)
89c4d9f TestLowCpuMemUsage UT get device by device_name (#6397)
Additional commits viewable in compare view

Updates torch from 1.13.0 to 2.8.0

Release notes

Sourced from torch's releases.

PyTorch 2.8.0 Release Notes

Highlights

Backwards Incompatible Changes

Deprecations

New Features

Improvements

Bug fixes

Performance

Documentation

Developers

Highlights

... (truncated)

Commits

ba56102 Cherrypick: Add the RunLLM widget to the website (#159592)
c525a02 [dynamo, docs] cherry pick torch.compile programming model docs into 2.8 (#15...
a1cb3cc [Release Only] Remove nvshmem from list of preload libraries (#158925)
c76b235 Move out super large one off foreach_copy test (#158880)
20a0e22 Revert "[Dynamo] Allow inlining into AO quantization modules (#152934)" (#158...
9167ac8 [MPS] Switch Cholesky decomp to column wise (#158237)
5534685 [MPS] Reimplement tri[ul] as Metal shaders (#158867)
d19e08d Cherry pick PR 158746 (#158801)
a6c044a [cherry-pick] Unify torch.tensor and torch.ops.aten.scalar_tensor behavior (#...
620ebd0 [Dynamo] Use proper sources for constructing dataclass defaults (#158689)
Additional commits viewable in compare view

Updates tqdm from 4.65.0 to 4.66.3

Release notes

Sourced from tqdm's releases.

tqdm v4.66.3 stable

cli: eval safety (fixes CVE-2024-34062, GHSA-g7vv-2v7x-gj9p)

tqdm v4.66.2 stable

pandas: add DataFrame.progress_map (#1549)

notebook: fix HTML padding (#1506)

keras: fix resuming training when verbose>=2 (#1508)

fix format_num negative fractions missing leading zero (#1548)

fix Python 3.12 DeprecationWarning on import (#1519)

linting: use f-strings (#1549)

update tests (#1549)

fix pandas warnings

fix asv (airspeed-velocity/asv#1323)

fix macos notebook docstring indentation

CI: bump actions (#1549)

tqdm v4.66.1 stable

fix utils.envwrap types (#1493 <- #1491, #1320 <- #966, #1319)

e.g. cloudwatch & kubernetes workaround: export TQDM_POSITION=-1

drop mentions of unsupported Python versions

tqdm v4.66.0 stable

environment variables to override defaults (TQDM_*) (#1491 <- #1061, #950 <- #614, #1318, #619, #612, #370)

e.g. in CI jobs, export TQDM_MININTERVAL=5 to avoid log spam

add tests & docs for tqdm.utils.envwrap

fix & update CLI completion

fix & update API docs

minor code tidy: replace os.path => pathlib.Path

fix docs image hosting

release with CI bot account again (cli/cli#6680)

tqdm v4.65.2 stable

exclude examples from distributed wheel (#1492)

tqdm v4.65.1 stable

migrate setup.{cfg,py} => pyproject.toml (#1490)

fix asv benchmarks

update docs

fix snap build (#1490)

fix & update tests (#1490)

fix flaky notebook tests

bump pre-commit

bump workflow actions

Commits

4e613f8 Merge pull request from GHSA-g7vv-2v7x-gj9p
b53348c cli: eval safety
cc372d0 bump version, merge pull request #1549 from tqdm/devel
e9f0c05 use PyPI trusted publishing
7323d5b slight makefile clean
5306125 tests: bump pre-commit
4a6fd4f fix datetime.utcfromtimestamp py3.12 warning (#1519)
6f13759 tests: fix macos notebook indentation
3abcd2a tests: fix asv
a4d15c8 tests: fix pandas warnings
Additional commits viewable in compare view

Updates transformers from 4.28.1 to 4.53.0

Release notes

Sourced from transformers's releases.

Release v4.53.0

Gemma3n

Gemma 3n models are designed for efficient execution on low-resource devices. They are capable of multimodal input, handling text, image, video, and audio input, and generating text outputs, with open weights for pre-trained and instruction-tuned variants. These models were trained with data in over 140 spoken languages.

Gemma 3n models use selective parameter activation technology to reduce resource requirements. This technique allows the models to operate at an effective size of 2B and 4B parameters, which is lower than the total number of parameters they contain. For more information on Gemma 3n's efficient parameter management technology, see the Gemma 3n page.
from transformers import pipeline
import torch
pipe = pipeline(
"image-text-to-text",
torch_dtype=torch.bfloat16,
model="google/gemma-3n-e4b",
device="cuda",
)
output = pipe(
"https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/bee.jpg",
text="<image_soft_token> in this image, there is"
)
print(output)
Dia

Dia is an opensource text-to-speech (TTS) model (1.6B parameters) developed by Nari Labs. It can generate highly realistic dialogue from transcript including nonverbal communications such as laughter and coughing. Furthermore, emotion and tone control is also possible via audio conditioning (voice cloning).

Model Architecture: Dia is an encoder-decoder transformer based on the original transformer architecture. However, some more modern features such as rotational positional embeddings (RoPE) are also included. For its text portion (encoder), a byte tokenizer is utilized while for the audio portion (decoder), a pretrained codec model DAC is used - DAC encodes speech into discrete codebook tokens and decodes them back into audio.

Add Dia model by @buttercrab in #38405

Kyutai Speech-to-Text

Kyutai STT is a speech-to-text model architecture based on the Mimi codec, which encodes audio into discrete tokens in a streaming fashion, and a Moshi-like autoregressive decoder. Kyutai’s lab has released two model checkpoints:

kyutai/stt-1b-en_fr: a 1B-parameter model capable of transcribing both English and French

... (truncated)

Commits

67ddc82 Release: v4.53.0
0a8081b [Modeling] Fix encoder CPU offloading for whisper (#38994)
c63cfd6 Gemma 3n (#39059)
3e5cc12 [tests] remove tests from libraries with deprecated support (flax, tensorflow...
cfff7ca [Whisper] Pipeline: handle long form generation (#35750)
02ecdcf add _keep_in_fp32_modules_strict (#39058)
d973e62 fix condition where torch_dtype auto collides with model_kwargs. (#39054)
44b2316 [qwen2-vl] fix vision attention scaling (#39043)
ae15715 polishing docs: error fixes for clarity (#39042)
3abeaba Create test for #38916 (custom generate from local dir with imports) (#39015)
Additional commits viewable in compare view

Updates deepspeed from 0.9.0 to 0.15.1

Release notes

Sourced from deepspeed's releases.

v0.15.1 Patch release

What's Changed

Update version.txt after 0.15.0 release by @loadams in microsoft/DeepSpeed#6403

Fix Type Mismatch by @jomayeri in microsoft/DeepSpeed#6410

Fix redundant seq data parallel grp argument in Z3/MiCS by @samadejacobs in microsoft/DeepSpeed#5352

add Huawei Ascend NPU setup guide by @xuedinge233 in microsoft/DeepSpeed#6445

Add documentation for launcher without SSH by @dogacancolak-kensho in microsoft/DeepSpeed#6455

Dtype support check for accelerator in UTs by @raza-sikander in microsoft/DeepSpeed#6360

Store/Load CIFAR from local/offline by @raza-sikander in microsoft/DeepSpeed#6390

Add the accelerator setup guide link in Getting Started page by @rogerxfeng8 in microsoft/DeepSpeed#6452

Allow triton==3.0.x for fp_quantizer by @siddartha-RE in microsoft/DeepSpeed#6447

Change GDS to 1 AIO thread by @jomayeri in microsoft/DeepSpeed#6459

[CCL] fix condition issue in ccl.py by @YizhouZ in microsoft/DeepSpeed#6443

Avoid gds build errors on ROCm by @rraminen in microsoft/DeepSpeed#6456

TestLowCpuMemUsage UT get device by device_name by @raza-sikander in microsoft/DeepSpeed#6397

Add workflow to build DS without torch to better test before releases by @loadams in microsoft/DeepSpeed#6450

Fix patch for parameter partitioning in zero.Init() by @tohtana in microsoft/DeepSpeed#6388

Add default value to "checkpoint_folder" in "load_state_dict" of bf16_optimizer by @ljcc0930 in microsoft/DeepSpeed#6446

DeepNVMe tutorial by @tjruwase in microsoft/DeepSpeed#6449

bf16_optimizer: fixes to different grad acc dtype by @nelyahu in microsoft/DeepSpeed#6485

print warning if actual triton cache dir is on NFS, not just for default by @jrandall in microsoft/DeepSpeed#6487

DS_BUILD_OPS should build only compatible ops by @tjruwase in microsoft/DeepSpeed#6489

Safe usage of popen by @tjruwase in microsoft/DeepSpeed#6490

Handle an edge case where CUDA_HOME is not defined on ROCm systems by @amorehead in microsoft/DeepSpeed#6488

New Contributors

@xuedinge233 made their first contribution in microsoft/DeepSpeed#6445

@siddartha-RE made their first contribution in microsoft/DeepSpeed#6447

@ljcc0930 made their first contribution in microsoft/DeepSpeed#6446

@jrandall made their first contribution in microsoft/DeepSpeed#6487

@amorehead made their first contribution in microsoft/DeepSpeed#6488

Full Changelog: deepspeedai/DeepSpeed@v0.15.0...v0.15.1

DeepSpeed v0.15.0

What's Changed

Update version.txt after 0.14.5 release by @loadams in microsoft/DeepSpeed#5982

move pynvml install to setup.py by @Rohan138 in microsoft/DeepSpeed#5840

add moe topk(k>2) gate support by @inkcherry in microsoft/DeepSpeed#5881

Move inf_or_nan_tracker to cpu for cpu offload by @BacharL in microsoft/DeepSpeed#5826

Enable dynamic shapes for pipeline parallel engine inputs by @tohtana in microsoft/DeepSpeed#5481

Add and Remove ZeRO 3 Hooks by @jomayeri in microsoft/DeepSpeed#5658

DeepNVMe GDS by @jomayeri in microsoft/DeepSpeed#5852

Pin transformers version on nv-nightly by @loadams in microsoft/DeepSpeed#6002

DeepSpeed on Window blog by @tjruwase in microsoft/DeepSpeed#6364

Bug Fix 5880 by @jomayeri in microsoft/DeepSpeed#6378

Update linear.py compatible with torch 2.4.0 by @terry-for-github in microsoft/DeepSpeed#5811

GDS Swapping Fix by @jomayeri in microsoft/DeepSpeed#6386

Long sequence parallelism (Ulysses) integration with HuggingFace by @samadejacobs in microsoft/DeepSpeed#5774

reduce cpu host overhead when using moe by @ranzhejiang in microsoft/DeepSpeed#5578

... (truncated)

Commits

10ba3dd Handle an edge case where CUDA_HOME is not defined on ROCm systems (#6488)
662a421 Safe usage of popen (#6490)
ddd3571 Add default value to "checkpoint_folder" in "load_state_dict" of bf16_optimiz...
5d1a30c DS_BUILD_OPS should build only compatible ops (#6489)
ddeb0c1 Fix patch for parameter partitioning in zero.Init() (#6388)
9d17116 print warning if actual triton cache dir is on NFS, not just for default (#6487)
5df12a4 DeepNVMe tutorial (#6449)
cfc6ed3 bf16_optimizer: fixes to different grad acc dtype (#6485)
9b7fc54 Add workflow to build DS without torch to better test before releases (#6450)
89c4d9f TestLowCpuMemUsage UT get device by device_name (#6397)
Additional commits viewable in compare view

Updates torch from 1.13.0 to 2.8.0

Release notes

Sourced from torch's releases.

PyTorch 2.8.0 Release Notes

Highlights

Backwards Incompatible Changes

Deprecations

New Features

Improvements

Bug fixes

Performance

Documentation

Developers

Highlights

... (truncated)

Commits

ba56102 Cherrypick: Add the RunLLM widget to the website (#159592)
c525a02 [dynamo, docs] cherry pick torch.compile programming model docs into 2.8 (#15...
a1cb3cc [Release Only] Remove nvshmem from list of preload libraries (#158925)
c76b235 Move out super large one off foreach_copy test (#158880)
20a0e22 Revert "[Dynamo] Allow inlining into AO quantization modules (#152934)" (#158...
9167ac8 [MPS] Switch Cholesky decomp to column wise (#158237)
5534685 [MPS] Reimplement tri[ul] as Metal shaders (#158867)
d19e08d Cherry pick PR 158746 (#158801)
a6c044a [cherry-pick] Unify torch.tensor and torch.ops.aten.scalar_tensor behavior (#...
620ebd0 [Dynamo] Use proper sources for constructing dataclass defaults (#158689)
Additional commits viewable in compare view

Updates tqdm from 4.65.0 to 4.66.3

Release notes

Sourced from tqdm's releases.

tqdm v4.66.3 stable

cli: eval safety (fixes CVE-2024-34062, GHSA-g7vv-2v7x-gj9p)

tqdm v4.66.2 stable

pandas: add DataFrame.progress_map (#1549)

notebook: fix HTML padding (#1506)

keras: fix resuming training when verbose>=2 (#1508)

fix format_num negative fractions missing leading zero (#1548)

fix Python 3.12 DeprecationWarning on import (#1519)

linting: use f-strings (#1549)

update tests (#1549)

fix pandas warnings

fix asv (airspeed-velocity/asv#1323)

fix macos notebook docstring indentation

CI: bump actions (#1549)

tqdm v4.66.1 stable

fix utils.envwrap types (#1493 <- #1491, #1320 <- #966, #1319)

e.g. cloudwatch & kubernetes workaround: export TQDM_POSITION=-1

drop mentions of unsupported Python versions

tqdm v4.66.0 stable

environment variables to override defaults (TQDM_*) (#1491 <- #1061, #950 <- #614, #1318, #619, #612, #370)

e.g. in CI jobs, export TQDM_MININTERVAL=5 to avoid log spam

add tests & docs for tqdm.utils.envwrap

fix & update CLI completion

fix & update API docs

minor code tidy: replace os.path => pathlib.Path

fix docs image hosting

release with CI bot account again (cli/cli#6680)

tqdm v4.65.2 stable

exclude examples from distributed wheel (#1492)

tqdm v4.65.1 stable

migrate setup.{cfg,py} => pyproject.toml (#1490)

fix asv benchmarks

update docs

fix snap build (#1490)

fix & update tests (#1490)

fix flaky notebook tests

bump pre-commit

bump workflow actions

Commits

4e613f8 Merge pull request from GHSA-g7vv-2v7x-gj9p
b53348c cli: eval safety
cc372d0 bump version, merge pull request #1549 from tqdm/devel
e9f0c05 use PyPI trusted publishing
7323d5b slight makefile clean
5306125 tests: bump pre-commit
4a6fd4f fix datetime.utcfromtimestamp py3.12 warning (#1519)
6f13759 tests: fix macos notebook indentation
3abcd2a tests: fix asv
a4d15c8 tests: fix pandas warnings
Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR
@dependabot recreate will recreate this PR, overwriting any edits that have been made to it
@dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
@dependabot ignore <dependency name> major version will close this group update PR and stop Dependabot creating any more for the specific dependency's major version (unless you unignore this specific dependency's major version or upgrade to it yourself)
@dependabot ignore <dependency name> minor version will close this group update PR and stop Dependabot creating any more for the specific dependency's minor version (unless you unignore this specific dependency's minor version or upgrade to it yourself)
@dependabot ignore <dependency name> will close this group update PR and stop Dependabot creating any more for the specific dependency (unless you unignore this specific dependency or upgrade to it yourself)
@dependabot unignore <dependency name> will remove all of the ignore conditions of the specified dependency
@dependabot unignore <dependency name> <ignore condition> will remove the ignore condition of the specified dependency and ignore conditions
You can disable automated security fix PRs for this repo from the Security Alerts page.

Bumps the pip group with 4 updates in the / directory: [transformers](https://github.com/huggingface/transformers), [deepspeed](https://github.com/deepspeedai/DeepSpeed), [torch](https://github.com/pytorch/pytorch) and [tqdm](https://github.com/tqdm/tqdm). Bumps the pip group with 4 updates in the /train directory: [transformers](https://github.com/huggingface/transformers), [deepspeed](https://github.com/deepspeedai/DeepSpeed), [torch](https://github.com/pytorch/pytorch) and [tqdm](https://github.com/tqdm/tqdm). Updates `transformers` from 4.28.1 to 4.53.0 - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](huggingface/transformers@v4.28.1...v4.53.0) Updates `deepspeed` from 0.9.0 to 0.15.1 - [Release notes](https://github.com/deepspeedai/DeepSpeed/releases) - [Commits](deepspeedai/DeepSpeed@v0.9.0...v0.15.1) Updates `torch` from 1.13.0 to 2.8.0 - [Release notes](https://github.com/pytorch/pytorch/releases) - [Changelog](https://github.com/pytorch/pytorch/blob/main/RELEASE.md) - [Commits](pytorch/pytorch@v1.13.0...v2.8.0) Updates `tqdm` from 4.65.0 to 4.66.3 - [Release notes](https://github.com/tqdm/tqdm/releases) - [Commits](tqdm/tqdm@v4.65.0...v4.66.3) Updates `transformers` from 4.28.1 to 4.53.0 - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](huggingface/transformers@v4.28.1...v4.53.0) Updates `deepspeed` from 0.9.0 to 0.15.1 - [Release notes](https://github.com/deepspeedai/DeepSpeed/releases) - [Commits](deepspeedai/DeepSpeed@v0.9.0...v0.15.1) Updates `torch` from 1.13.0 to 2.8.0 - [Release notes](https://github.com/pytorch/pytorch/releases) - [Changelog](https://github.com/pytorch/pytorch/blob/main/RELEASE.md) - [Commits](pytorch/pytorch@v1.13.0...v2.8.0) Updates `tqdm` from 4.65.0 to 4.66.3 - [Release notes](https://github.com/tqdm/tqdm/releases) - [Commits](tqdm/tqdm@v4.65.0...v4.66.3) --- updated-dependencies: - dependency-name: transformers dependency-version: 4.53.0 dependency-type: direct:production dependency-group: pip - dependency-name: deepspeed dependency-version: 0.15.1 dependency-type: direct:production dependency-group: pip - dependency-name: torch dependency-version: 2.8.0 dependency-type: direct:production dependency-group: pip - dependency-name: tqdm dependency-version: 4.66.3 dependency-type: direct:production dependency-group: pip - dependency-name: transformers dependency-version: 4.53.0 dependency-type: direct:production dependency-group: pip - dependency-name: deepspeed dependency-version: 0.15.1 dependency-type: direct:production dependency-group: pip - dependency-name: torch dependency-version: 2.8.0 dependency-type: direct:production dependency-group: pip - dependency-name: tqdm dependency-version: 4.66.3 dependency-type: direct:production dependency-group: pip ... Signed-off-by: dependabot[bot] <support@github.com>

dependabot bot added dependencies Dependencies updates by Dependabot python Pull requests that update python code labels Mar 19, 2026

dependabot bot mentioned this pull request Mar 19, 2026

Bump the pip group across 1 directory with 4 updates #2

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bump the pip group across 2 directories with 4 updates#3

Bump the pip group across 2 directories with 4 updates#3
dependabot[bot] wants to merge 1 commit intomainfrom
dependabot/pip/pip-2d92e43337

dependabot bot commented on behalf of github Mar 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

0 participants

Conversation

dependabot bot commented on behalf of github Mar 19, 2026

Release v4.53.0

Gemma3n

Dia

Kyutai Speech-to-Text

v0.15.1 Patch release

What's Changed

New Contributors

DeepSpeed v0.15.0

What's Changed

PyTorch 2.8.0 Release Notes

Highlights

tqdm v4.66.3 stable

tqdm v4.66.2 stable

tqdm v4.66.1 stable

tqdm v4.66.0 stable

tqdm v4.65.2 stable

tqdm v4.65.1 stable

Release v4.53.0

Gemma3n

Dia

Kyutai Speech-to-Text

v0.15.1 Patch release

What's Changed

New Contributors

DeepSpeed v0.15.0

What's Changed

PyTorch 2.8.0 Release Notes

Highlights

tqdm v4.66.3 stable

tqdm v4.66.2 stable

tqdm v4.66.1 stable

tqdm v4.66.0 stable

tqdm v4.65.2 stable

tqdm v4.65.1 stable

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

0 participants