Open
Description
Guys, are you planning to open-source the training code and data? :)
- I think it makes sense to update the architecture: add RoPE, RMSNorm, QK-norm, etc.
- Experiment with the loss, including post-training, for example RLVR (rewarding the model for generating correct domains).
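
To make the architecture suggestion concrete, here is a rough, dependency-free sketch of what RMSNorm, RoPE, and QK-norm compute for a single attention head. It is not taken from this repo: the function names, the `eps` and `base` defaults, and the choice to apply the norm before the rotation are all my assumptions, and a real implementation would use batched tensors in the training framework.

```python
import math


def rms_norm(x, weight=None, eps=1e-6):
    """RMSNorm: divide x by its root mean square, then apply an optional gain."""
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    if weight is None:
        weight = [1.0] * len(x)  # hypothetical default: no learned gain
    return [w * v / rms for w, v in zip(weight, x)]


def rope(x, position, base=10000.0):
    """RoPE: rotate each pair (x[i], x[i+1]) by position * base**(-i/d)."""
    d = len(x)
    assert d % 2 == 0, "head dimension must be even"
    out = []
    for i in range(0, d, 2):
        theta = position * base ** (-i / d)
        c, s = math.cos(theta), math.sin(theta)
        out.append(x[i] * c - x[i + 1] * s)
        out.append(x[i] * s + x[i + 1] * c)
    return out


def attention_score(q, k, q_pos, k_pos):
    """QK-norm: normalize q and k, apply RoPE, then take the scaled dot product."""
    q = rope(rms_norm(q), q_pos)
    k = rope(rms_norm(k), k_pos)
    return sum(a * b for a, b in zip(q, k)) / math.sqrt(len(q))
```

With this setup the score depends only on the relative offset `q_pos - k_pos`, and the normalized q/k keep the logits bounded, which is the usual motivation for combining the three.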