Skip to content

repo for my personal llm experiments in using different optimizers, attention mechs, data preprocessing strategies, quantization schemes, and other stuff. Optimized for my nvidia rtx 4080 w 12gb of vram (i am gpu poor) - originally a fork of nanoGPT

License

Notifications You must be signed in to change notification settings

adheep04/llm-training

Repository files navigation

About

repo for my personal llm experiments in using different optimizers, attention mechs, data preprocessing strategies, quantization schemes, and other stuff. Optimized for my nvidia rtx 4080 w 12gb of vram (i am gpu poor) - originally a fork of nanoGPT

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 37

Languages