training issues, bump in the loss value at the start of each epoch

Hi, I am trainng with Adamw optimizer with following parameter for some time, and the loss value shows bump at the start of each epoch, what is the reason for this? 
num_epoch: 36
optimizer: 'adamw'
lr: 0.00002
momentum: 0.9
weight_decay: 0.001
scheduler: 'cosine'
filter_bias_and_bn: true
warmup_epoch: 0
max_grad_norm: 1.0
lr_milestones: []


![Untitled](https://github.com/user-attachments/assets/34b056d4-16e9-4e3f-9038-935709ba944a)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

training issues, bump in the loss value at the start of each epoch #29

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

training issues, bump in the loss value at the start of each epoch #29

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions