Implementing training and generation flow for GPT

In this project I implemented a training flow for GPT on a single GPU using Hugging Face Accelerate. I then exported the model to ONNX format for faster inference and deployed it to Hugging Face Spaces. Training logs are available on Weights & Biases (Wandb).

Hugging Face Spaces Link

https://huggingface.co/spaces/prerana1205/GPT-Inference

Speedup in inference

Inference Type	Time Taken for 1000 Tokens
PyTorch Model	83 s
Quantized Model	81 s
ONNX Quantized	56 s
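
To put the numbers above in relative terms, here is a minimal sketch that computes the speedup ratios from the table and shows one way such timings could be collected. The helper names (`speedup`, `time_generation`, `dummy_generate`) are hypothetical and not part of this repo; `dummy_generate` is only a stand-in for a real generation call.

```python
import time


def speedup(baseline_seconds, other_seconds):
    """How many times faster `other` is compared with the baseline."""
    return baseline_seconds / other_seconds


def time_generation(generate_fn, num_tokens=1000):
    """Time a single generation call; return (elapsed_seconds, tokens_per_second)."""
    start = time.perf_counter()
    tokens = generate_fn(num_tokens)
    elapsed = time.perf_counter() - start
    tokens_per_second = len(tokens) / elapsed if elapsed > 0 else float("inf")
    return elapsed, tokens_per_second


def dummy_generate(num_tokens):
    # Hypothetical stand-in for a real model's generation function.
    return [i % 50 for i in range(num_tokens)]


# Ratios computed from the table above (PyTorch baseline: 83 s).
print(f"Quantized:      {speedup(83, 81):.2f}x")
print(f"ONNX quantized: {speedup(83, 56):.2f}x")

elapsed, tps = time_generation(dummy_generate, 1000)
print(f"dummy run: {elapsed:.6f} s ({tps:.0f} tokens/s)")
```

By this measure the ONNX quantized model is roughly 1.48x faster than the PyTorch model, while quantization alone gives about 1.02x.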

Run Training On GPU

Clone the project

    git clone https://github.com/kurchi1205/GPT-Scratch.git

Go to the project directory

    cd GPT-Scratch

Install dependencies

    pip install -r requirements_train.txt

Start the training

    ./train.sh

Exporting the model

    python export.py

This exports the model to ONNX and quantizes it.
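
For readers unfamiliar with quantization, the sketch below illustrates the general idea of storing weights as 8-bit integers plus a scale factor. It is not the code in `export.py`, and the README does not state which quantization scheme is used; symmetric per-tensor int8 is assumed here purely for illustration, and the function names are hypothetical.

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization: w is approximated by scale * q, q in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    # Round to the nearest integer level and clamp to the int8 range.
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float weights from the integer levels."""
    return [scale * v for v in q]


weights = [0.5, -1.27, 0.0, 0.31]  # made-up example values
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)
max_err = max(abs(a - b) for a, b in zip(weights, restored))

print(q)
print(f"scale={scale:.4f}, max reconstruction error={max_err:.6f}")
```

Each weight is reconstructed to within half a quantization step (`scale / 2`), which is the precision traded for a smaller model.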

Testing the model

    python generate.py        # for the PyTorch model
    python generate_onnx.py   # for the ONNX model
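
Both scripts generate text token by token. As a rough illustration of what such an autoregressive loop looks like, here is a minimal sketch. It is not the code in `generate.py`; `toy_logits` is a hypothetical stand-in for the model's forward pass, and the sampling options shown are assumptions.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def generate(next_logits, prompt, max_new_tokens, temperature=1.0, greedy=False, rng=None):
    """Append one token at a time, feeding the growing sequence back to the model."""
    rng = rng or random.Random(0)
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        logits = next_logits(tokens)
        if greedy:
            # Pick the single most likely token.
            next_token = max(range(len(logits)), key=logits.__getitem__)
        else:
            # Sample from the temperature-scaled distribution.
            probs = softmax(logits, temperature)
            next_token = rng.choices(range(len(probs)), weights=probs, k=1)[0]
        tokens.append(next_token)
    return tokens


def toy_logits(tokens, vocab_size=5):
    # Hypothetical stand-in for a model: strongly prefers (last token + 1) mod vocab_size.
    target = (tokens[-1] + 1) % vocab_size
    return [5.0 if i == target else 0.0 for i in range(vocab_size)]


print(generate(toy_logits, [0], 6, greedy=True))  # prints [0, 1, 2, 3, 4, 0, 1]
```

With a real model, `next_logits` would run the network on the current token sequence and return the logits for the last position.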

Acknowledgements

Andrej Karpathy lecture

Optimizations

I have implemented the flash attention flow. There is no CUDA implementation, but the matrix slicing part has been implemented.
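
The matrix slicing idea can be sketched in plain Python as follows. This is an illustrative sketch, not the code in `gpt.py`: it assumes a single head, slices only the keys and values into blocks, and omits the causal mask a GPT model would apply. The function names and `block_size` parameter are hypothetical. The point is that a running maximum and running denominator let the softmax be computed block by block without ever materializing the full score row.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def naive_attention(Q, K, V):
    """Reference: softmax(Q K^T / sqrt(d)) V, computed one full row at a time."""
    d = len(Q[0])
    out = []
    for q in Q:
        scores = [dot(q, k) / math.sqrt(d) for k in K]
        mx = max(scores)
        w = [math.exp(s - mx) for s in scores]
        z = sum(w)
        out.append([sum(wi * v[j] for wi, v in zip(w, V)) / z for j in range(len(V[0]))])
    return out


def tiled_attention(Q, K, V, block_size=2):
    """Same result, but K and V are consumed in slices using an online softmax."""
    d, dv = len(Q[0]), len(V[0])
    out = []
    for q in Q:
        running_max = -math.inf  # largest score seen so far
        denom = 0.0              # running softmax denominator
        acc = [0.0] * dv         # running weighted sum of value rows
        for start in range(0, len(K), block_size):
            k_blk = K[start:start + block_size]
            v_blk = V[start:start + block_size]
            scores = [dot(q, k) / math.sqrt(d) for k in k_blk]
            new_max = max(running_max, max(scores))
            # Rescale earlier partial results to the new maximum (0.0 on the first block).
            correction = math.exp(running_max - new_max)
            weights = [math.exp(s - new_max) for s in scores]
            denom = denom * correction + sum(weights)
            acc = [a * correction + sum(w * v[j] for w, v in zip(weights, v_blk))
                   for j, a in enumerate(acc)]
            running_max = new_max
        out.append([a / denom for a in acc])
    return out


Q = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
K = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, -0.5], [2.0, 0.3]]
V = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0], [9.0, 10.0]]

ref = naive_attention(Q, K, V)
tiled = tiled_attention(Q, K, V, block_size=2)
max_diff = max(abs(a - b) for r, t in zip(ref, tiled) for a, b in zip(r, t))
print(f"max difference vs. naive attention: {max_diff:.2e}")
```

The tiled version matches the naive one up to floating-point rounding. A fused GPU kernel would additionally keep each block in fast on-chip memory, which is the part this sketch does not cover.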
