Why not using calibrated_grads directly?

hello, I am very interested in your paper. thank you for the implementation. but I have some questions about your code。
in this line:
https://github.com/csyhhu/MetaQuant/blob/3169e0b11e179011b1ffd3bd8ac49fb5656d7442/meta_utils/meta_quantized_module.py#L86
` self.meta_weight = self.weight - \
                                           lr * (self.calibrated_grads \
                                   + (self.weight.grad.data - self.calibrated_grads.data).detach())`
why not using the `self.calibrated_grads` directly? instead, you used the refine gradients: `self.weight.grad`.

furthermore, the weights have been updated in the main function using the refine gradients. 
so i am very confused why using the refine gradients again!


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why not using calibrated_grads directly? #1

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Why not using calibrated_grads directly? #1

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions