PyTorch Inference Engine (Gemma3 270M)

A project based on the book Build a Large Language Model (from Scratch) by Sebastian Raschka.

This repository takes one of the notebooks and turns it into a standalone Python codebase written entirely in PyTorch, making it easier to read, reuse, and experiment with.

✨ Goals

Provide a clear and educational implementation of a minimalist LLM.
Remove external dependencies (except PyTorch) to keep the code self-contained and easy to run.
Serve as a foundation for experimenting with new ideas and deepening the understanding of language models.

📂 Project Structure

.
├── config.py
├── forward.py
├── gpa.py
├── model3.py
├── modelload.py
├── modelsize.py
├── rmsnorm.py
├── rope.py
├── tokenizer.py
├── transformer.py
├── main.py
├── README.md
└── requirements.txt

⚙️ Installation

Clone the repository and install the dependencies:

git clone https://github.com/xigh/pytorch-llm.git
cd pytorch-llm
uv venv .venv --seed
source .venv/bin/activate
pip install -r requirements.txt

🚀 Usage

usage: main.py [-h] [--repo-id REPO_ID] [--prompt PROMPT] [--show-debug]
        [--show-logits SHOW_LOGITS] [--verbose] [--compile]

options:
  -h, --help                    show this help message and exit
  --repo-id REPO_ID             ID du modèle à utiliser
  --prompt PROMPT               prompt
  --show-debug                  show debugging logs
  --show-logits k               show {k} logits
  --verbose                     show timings
  --compile                     compile model
  --force-cpu                   force cpu inference

Known working repo ids :

Example to run an inference request:

$ python main.py --prompt "How many legs does a duck have?"
A duck has 4 legs.

(Yes... tiny model thinks a duck has 4 legs 🦆😂 - it’s just a toy demo!)

📖 References

Sebastian Raschka - Build a Large Language Model (From Scratch)
Original repo: rasbt/LLMs-from-scratch

🛠️ Next Steps

TODO

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PyTorch Inference Engine (Gemma3 270M)

✨ Goals

📂 Project Structure

⚙️ Installation

🚀 Usage

Known working repo ids :

📖 References

🛠️ Next Steps

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.py		config.py
forward.py		forward.py
gpa.py		gpa.py
load.py		load.py
main.py		main.py
model3.py		model3.py
modelsize.py		modelsize.py
requirements.txt		requirements.txt
rmsnorm.py		rmsnorm.py
rope.py		rope.py
tokenizer.py		tokenizer.py
transformer.py		transformer.py

License

xigh/pytorch-llm

Folders and files

Latest commit

History

Repository files navigation

PyTorch Inference Engine (Gemma3 270M)

✨ Goals

📂 Project Structure

⚙️ Installation

🚀 Usage

Known working repo ids :

📖 References

🛠️ Next Steps

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages