Aashutoshh01

Follow

💭

I may be slow to respond.

Aashutosh Joshi Aashutoshh01

💭

I may be slow to respond.

Follow

IIT Kharagpur Alum

5 followers · 59 following

Aashutoshh01/README.md

Hey I'm Aashutosh Joshi

Currently undergrad at Indian Institute of Technology, Kharagpur

📌 About Me

🌱 I learn and build across advanced AI/ML systems, transformer architectures, small language models, and complex-valued deep learning.
🤝 I collaborate on open-source AI projects, research prototypes, and high-performance model training pipelines.
🔍 I work on scalable agentic AI systems, distributed training setups, and multimodal model design.
🚀 I build models from scratch, GPT style LLMs, SLMs, ViTs, and multi-agent systems, and design full training and evaluation workflows.
🧠 I research SAR processing, complex-valued neural networks, and efficient adaptation methods such as LoRA and CV-LoRA.
💬 Ask me about transformers, PyTorch, AutoGen, RAG systems, model optimization, quantization, or H100-class GPU training.
⚡ I love creating end-to-end AI systems, from dataset pipelines to inference, deployment, and interactive UIs.

🧠 My Focus Areas

AI/ML Research
Transformer Architectures
Small & Large Language Models (SLMs/LLMs)
Complex-Valued Deep Learning
LLM Pretraining & Fine-tuning
Agentic AI Systems (LangGraph & AutoGen)
Retrieval-Augmented Generation (RAG)
Open-Source Contribution
Model Optimization & Quantization
Vision Models (ViT, CNNs)
Time-Series Modeling
Multi-Agent Automation Systems

📊 GitHub Stats & Trophies

🔗 Connect with Me

💬 Quote

SLM>>>LLM

Pinned Loading

paligemma-pytorch paligemma-pytorch Public

End-to-end reconstruction of PaliGemma in PyTorch, featuring SigLIP ViT, Gemma decoder with RoPE & GQA, KV caching, and autoregressive multimodal inference.

Python 1
ArXivGPT ArXivGPT Public

A 124M-parameter GPT-2 model trained from scratch on ArXiv abstracts, featuring fully custom Transformer components implemented in PyTorch.

Jupyter Notebook 1
VisionSmith VisionSmith Public

A clean, from-scratch Vision Transformer (ViT) implementation in PyTorch, trained on MNIST and based on the “An Image is Worth 16×16 Words” paper. Includes full patch embedding, positional encoding…

Jupyter Notebook 1
wav2vec2-pytorch wav2vec2-pytorch Public

End-to-end PyTorch implementation of Wav2Vec2 with convolutional feature encoding, Transformer contextualization, Gumbel vector quantization, and full pipelines for contrastive pre-training and CTC…

Python 1
ScratchSpeakSLM ScratchSpeakSLM Public

A tiny GPT-style language model built completely from scratch in PyTorch, inspired by TinyStories, demonstrating how small transformer models can learn to generate coherent text.

Jupyter Notebook 1
conformer-pytorch conformer-pytorch Public

End-to-end Conformer ASR in PyTorch featuring Macaron-style blocks, relative multi-head attention, depthwise convolutions, and an LSTM decoder with CTC loss.

Python 1