TinTorch-trainer: Distributed Transformer Training from Scratch

A minimal project exploring data, tensor, and pipeline parallel training for Transformers, inspired by minigpt and picotron.